Jupyterhub: Sharding hub by users with a HubDispatcher

Created on 25 Jul 2017 · 23Comments · Source: jupyterhub/jupyterhub

This is a meta issue to discuss the project to scale JupyterHub up to potentially a way larger number of users. In particular as one given hub can only support a couple thousand users it woudl be nice to coordinate many hubs. Also because a single hub is a single point of failure. @yuvipanda came up with the following (which I'll attempt to describe), and I've look at implementing it.

Instead of having 1 hub, we want many hub each with their users.
Once a user has reach a hub, it should alway get to this hub again. The issue is with authentication.
To know which hub a user should go to, you need to authenticate. To authenticate you need to reach a hub. So what we can do is deploy a fleet of hub, with specific one only responsible only for authentication, and telling the proxy to dispatch that user to a given hub, here is a schema.

                                |
                    +-----------+
                    |           |
                    |           v
                    |   +-------------------------+      Cookie Set
                    |   |                         |
                    |   | Configurable HTTP Proxy +---------+------------+---...
                    |   |       (aka CHP)         |         |            |
                    |   +-------------------------+         |            |
                    |           |                           |            |
                    |           |                           v            v
      Set Cookie    |           | No Cookies         +---------+  +---------+
      and redirect  |           |                    |         |  |         |
                    |           |                    |  Hub A  |  |  Hub B  |
                    |           |                    |         |  |         |
                    |           |                    +---------+  +---------+
                    |           v
                    |   +--------------------------+
                    |   |                          |
                    |   |   Hub Dispatcher         |
                    |   |                          |
                    |   |   - Authenticate         |
                    |   |   - Which Hub For User   |
                    +---+                          |
                        +--------------------------+
                                ^
                                |
                                |
                                v
                        +--------------------------+
                        |                          |
                        |  DataBase or User/Hub    |
                        |                          |
                        +--------------------------+

Note, (see SVGBOB)

The hub dispatcher is kinda like a hub, except it only:

authenticate,
set a cookie telling CHP: "please forward this user to hub X" + set secure cookie that authenticate user with hub X as user U
redirect to original request so the proxy can – that time – forward to the right hub.
The Hub Dispatcher talk to a DB that store which user is on which hub, and also know what is the capacity of each hub. That's read heavy (read everytime a user logs in), but only few writes: only a new user will trigger a write.

We want to minimize the change in current authenticator, so if possible no change at all.

I'm diving through the code, which I haven't touched in a while. Seemed relatively easy at first glance but I'm unsure it actually is.

1) Authenticator can set arbitrary route for login, typically OAuth have a /auth_login and /oath_callback As we don't really know which handler does what we can't blindly rewrite the response to set_cookie.

2) is there any authentication flow where the (final) hub actually
need the credential of the user that is going to connect (e.g decrypt
home dir). In which case just carrying the token in a cookie with
shared secret does not help and we need to have HubDispatcher<->Hubs communication ?

3) it looks like we can make that a pure Fake Authenticator (dynamic subclass and overwrite/extend set_login_cookie). But that feels dirty, do we want to "hack Hub" to serve only as login node, potentially exposing more services (i'm not a huge fan), or start conservatively with an application that "just" expose the authentication flow.

4) Actually don't distinguish login nodes from normal hub, and just have authentication setting a accepted but "not me" value ?

We don't deal with fragmentation yet.
We don't deal with autospawning hub yet.

Thoughts welcome.

architecture enhancement

Source

Carreau

👍1

Most helpful comment

This looks very similar to the structure we use for deploying JupyterHub on Quantopian. I'm giving a talk about this at JupyterCon in August, but here's the rough cliffnotes:

We have an application we call the "hub discovery" service, that we use to persistently map users to jupyterhub instances. When a user goes to quantopian.com/research, our frontend (which is actually a Ruby on Rails app), sends a request to the discovery service asking which hub to route the user to. If the user has already been allocated, then the discovery service just returns the uri for their hub's server (which is running a CHP, and a JupyterHub using a heavily-customized Dockerspawner). If the user hasn't been allocated, then the discovery service chooses the hub server with the smallest number of allocated users. Once the frontend has the uri for the server, it renders an iframe for that server, and from that point on it's just a regular JupyterHub connection. Our hubs are totally ephemeral (we store users' notebooks in PGContents), so we actually just use in-memory sqlite for our hub db. I looked in the early stages of the project at having a single database shared between multiple hubs, but I wasn't sure I could make it work without major changes. Having no shared state between the hubs sidesteps a large class of potential problems.

A couple implementation notes that might be of interest:

We override JupyterHub.initialize in our subclass to make the hub register itself with the discovery service on startup. The uri and credentials for discovery are part of our hub's configuration.
In our initialize, we also spawn a periodic callback to make the hub send heartbeats to discovery. If a hub stops heartbeating for longer than some threshold, discovery will mark the server as unhealthy and stop routing users there. Our hub will also gracefully shut itself down if more than N consecutive heartbeats fail.
Our discovery service doesn't know anything about jupyterhub-specific authentication mechanisms; both our frontend and hubs just authenticate with discovery using http basic auth, and our hubs authenticate users via OAuth provided by our frontend.

ssanderson on 25 Jul 2017

👍4

All 23 comments

This is great!

Some additional notes:

This won't be CHP at the edge, but probably a fleet of nginx's with much simpler cookie based routing. This lets us scale out easily
The big problem with needing to send one user to one hub only is local persistent storage for the user. We can't really move that around easily yet, so we have to send each user to the same hub forever (or an equivalent hub that has their local storage)
One of the big reasons I want us to reuse as much hub code as possible is that I want every authenticator that's compatible with the hub to work here. I don't have too much of a preference if we make it use the hub itself, or if we write a new application that just uses the exact same interface. I would like the individual hubs to not have to know about each other tho.

yuvipanda on 25 Jul 2017

I think we should use nchp instead of chp to avoid the a single point of failure like @yuvipanda said.
I don't look really into the authenticate codes but really interested in the scaling up things.

I thought of the issue before.I think we should use a way to share the redirect path. I prefer a way that each hub can direct the each user instead of one user to one hub.

zsluedem on 25 Jul 2017

@zsluedem because of the state in a given Hub corresponding to a user (it's not all in the database, so things will go awry if multiple Hubs try to manage the same user), it is important that a given user always be routed to the same Hub.

The lightest-possible implementation for me is a small, dedicated application for authentication that does:

login
maintain mapping of username:hub
redirect or proxy to the appropriate hub

And use a special Authenticator in the Hubs that talks to this service, rather than trying to set cookies for the Hubs on the dispatcher. There is an example of this that uses the Apache REMOTE_USER header with shibboleth, which is the sort of pattern I would probably choose:

shibboleth plugin handles authentication
apache sets REMOTE_USER header
Hub Authenticator checks REMOTE_USER header for login and proceeds to set its own cookies, etc.

In particular, I would probably not base the dispatcher on JupyterHub. At least, I would probably not allow them to set the cookies that the Hubs would set. To do that, you need to make sure that the dispatcher and all Hubs are talking to the same database and use the same cookie secret in order for the cookies set by the dispatcher to be transferrable. Instead, a dedicated cookie/token that is understood by the dedicated Authenticator is probably simplest. You could use an Authenticator object to do the login in the dispatcher, but I'm not sure how much that gets you, since tornado and nginx, etc. tend to have their own support for things like Google OAuth already.

is there any authentication flow where the (final) hub actually need the credential of the user that is going to connect

In theory, eventually. But not at the moment, so I would ignore this for now. If you want this, I think the dispatcher does have to use a JupyterHub Authenticator, and can then store the response for the Hub's "AskTheDispatcherAuthenticator" to retrieve later.

minrk on 25 Jul 2017

I made a sketch of a simple tornado oauth application that logs in with Google. It's pretty simple, and probably a good deal simpler than anything that tries to integrate more deeply with JupyterHub.

There would be a corresponding Authenticator that uses the token to identify users and trigger the regular login process.

minrk on 25 Jul 2017

+1 on the cookie being set by hub itself - the dispatcher should only set a meta-cookie that can be authenticated by the dispatcher proxy (which probably would be a thing by itself - not even just nchp). The hubs wouldn't know about the dispatcher directly, and the dispatcher wouldn't know about the hubs directly either. Something based off the REMOTE USER authenticator could work - I don't see it authenticating the authenticity of the User header (but maybe I missed how Shibboleth works?) so any user who can hit the hub directly can pretend to be whoever. Trivially fixable tho.

While I do agree that we can make this simpler by not making it compatible with Hub's Authenticators, I think that'll be a long term maintenence burden. It'll also make scaling from 1 hub to 2 much easier. I think being able to reuse hub authenticators should be a hard requirement...

yuvipanda on 25 Jul 2017

This looks very similar to the structure we use for deploying JupyterHub on Quantopian. I'm giving a talk about this at JupyterCon in August, but here's the rough cliffnotes:

A couple implementation notes that might be of interest:

We override JupyterHub.initialize in our subclass to make the hub register itself with the discovery service on startup. The uri and credentials for discovery are part of our hub's configuration.
In our initialize, we also spawn a periodic callback to make the hub send heartbeats to discovery. If a hub stops heartbeating for longer than some threshold, discovery will mark the server as unhealthy and stop routing users there. Our hub will also gracefully shut itself down if more than N consecutive heartbeats fail.
Our discovery service doesn't know anything about jupyterhub-specific authentication mechanisms; both our frontend and hubs just authenticate with discovery using http basic auth, and our hubs authenticate users via OAuth provided by our frontend.

ssanderson on 25 Jul 2017

👍4

@ssanderson awesome! That sounds perfect. I look forward to hearing more at JupyterCon.

minrk on 25 Jul 2017

That's awesome, @ssanderson!

The one big difference seems to be that your hubs are somewhat interchangeable because of pgcontents, which makes things a lot easier!

yuvipanda on 25 Jul 2017

❤1

The one big difference seems to be that your hubs are somewhat interchangeable because of pgcontents, which makes things a lot easier!

Yup. I think the two important differences are this and the fact that we already had a separate frontend service to act as the top-level proxy for routing users to the right hub. This is nice because it means we don't have to have any client-side code for hub-routing; our rails app just renders an iframe with the right hub's URI embedded in the larger page.

ssanderson on 25 Jul 2017

So I gave a try as a custom App that accept any authenticator it ends up duplicating almost 1/2 the code of JupyterHubApp, so I'm unconvinced it is the right way. It does have a custom proxy-authenticator and no-op-spawner that could work though.

Carreau on 27 Jul 2017

@ssanderson it seems like very practical way to deploy!! Looking forward to the con now

zsluedem on 28 Jul 2017

@minrk

because of the state in a given Hub corresponding to a user (it's not all in the database, so things will go awry if multiple Hubs try to manage the same user), it is important that a given user always be routed to the same Hub.

Is there a way to store all the states in a database so that each of the hub can access?

zsluedem on 28 Jul 2017

I found out we can use some service discoveries like consul or zookeeper as @ssanderson said above and store the state which each hub hold so that a given user can be routed to the different Hub.

zsluedem on 28 Jul 2017

Wow, that's a lot of stuff you had to keep, @Carreau! Let's see what we can decouple out of JupyterHub.

I want to note on timelines - I'm perfectly happy for us to run HubDispatcher experimentally and with fast changing requirements for the short term. I'd want us to get JupyterHub 0.8 out out the door asap first, since it has a lot of really good changes and it's been a while since our last release.

@Carreau would you be at BIDS tomorrow?

yuvipanda on 28 Jul 2017

Wow, that's a lot of stuff you had to keep

Yes, there might be way o remove some, but there is a lot of coupling.
It works, but the dispatcher still need to spawn something and poll for it. I need to subclass User for that (thinking of making that an option)

@Carreau would you be at BIDS tomorrow?

No, I'm in SF. I'm working with Paul on the Jupyter Talk. I'm thinking earlier discussions are right and as a first pass we should do a completely different authenticator and work _toward_ decoupling of 0.9 or 0.10 then have an easy migration path forward.

Carreau on 28 Jul 2017

Yup, that makes sense, @Carreau!

I'll now figure out what kinda authenticator we'll need for our use case, and figure out a small separate service we can write for that.

yuvipanda on 28 Jul 2017

Do we have this on roadmap for a release version? Our notebook platform architecture is using JupyterHub currently but there have been continuous questions about hub being SPOF. I understand that a hub restart does not effect logged in users but just the fact of having a single server deployment is making it difficult for people to accept.
I'll be more then happy to contribute if there is a story around this.

ckbhatt on 6 Sep 2017

Hi @ckbhatt. I'll probably start working on this in a week or two. We ant to deploy this sometime in october.

yuvipanda on 7 Sep 2017

I've spent some time over the last few weeks working on some part of this. https://github.com/berkeley-dsep-infra/data8xhub is the beginnings of the infrastructure.

I've now gotten to an architecture where the user's home directories can actually be shared easily across hubs in a scaleable way. So I can now load balance the hubs, sharding only the home directory locations. Hooray!

So how to load balance the hub? Me and @minrk brainstormed this earlier today, and here is a summary:

We can't use a shared database, since hub currently is limited by total number of users than running users. Each hub will start doing polls of everything and what not, so bad. Each hub must be fully independent, with its own database and CHP
We can just use a simple sticky session with a cookie (that is inspected by an edge proxy) that specifies which hub to route to (hub-a, hub-b, etc)
Upon completion of authentication flow on any hub, we'll check if the user is actually running on any other hub. If they are, we just change the sticky hub-id cookie and redirect them, ensuring they land on the right hub.
For (3) to work, all the hubs must share authentication state. This can be done by: 1. using the same cookieSecret for all of them and 2. Changing the function that creates user tokens from a UUID to something more deterministic (HMAC of another secret + username)?
Also for (3) to work properly, a user must not have a running server in more than one hub. We accomplish this by deleting users whenever their pod stops. This forces them to re-login more frequently unfortunately, but that's perhaps the price to pay?

With this, we should be able to dynamically scale number of hubs up and down.

Adding a new hub:

Bring hub up
Add hub to list maintained by edge proxy

Removing a hub:

Remove hub from list maintained by edge proxy for new requests. People with existing sessions in it continue to be served.
Wait for hub to have 0 users
Bring hub down

Does this accurately capture what we talked about, @minrk?

yuvipanda on 15 Nov 2017

I believe so. Some further details on (3) for the shared auth state: cookie authentication is handled by storing a UUID for each user as user.cookie_id. This is the value that is encrypted and stored in a cookie when a user logs in, and the value used to lookup the user. When a user makes a request with a cookie, the value is decrypted (to cookie_id) and a user looks up the user in the database by cookie_id. If cookie_id were deterministic based on the username (e.g. HMAC(shared secret + username)), then the cookie_id would be the same on all Hubs for a given user.

Consequences of this:

a user cookie set on one Hub is valid on another provided a row for the user is present in both databases
user cookies cannot be revoked server-side without revoking all user cookies

The one piece we missed in the shared cookie scenario is that the cookie_id lookup to work, the corresponding User row must already exist in the database. So instead of just overriding how cookie_id is set, I think you'll also need to override _get_user_cookie to work before the user exists, and perhaps set_login_cookie to write a cookie that has both name and digest, so that your get_user_cookie can check if the name matches the digest without having to do a lookup in the database. That's a bit more complicated and more overriding than we hoped.

minrk on 15 Nov 2017

@minrk if the user is already running on the target hub, the row should exist there right? and in the 'source' hub (where authentication has just finished) the user would exist too (at least temporarily - we can perhaps delete it before doing the redirect).

yuvipanda on 16 Nov 2017

I agree this is starting to feel somewhat fragile from the complexity.

yuvipanda on 16 Nov 2017

if the user is already running on the target hub, the row should exist there right?

Ah, yes. I forgot that the user is only redirected across hubs when they are already running and thus guaranteed to exist on another hub. Then that should be fine and most of my comment is irrelevant.

minrk on 16 Nov 2017

Was this page helpful?

0 / 5 - 0 ratings