[WIP] Rework kernel status #4724

jasongrout · 2018-06-14T13:21:38Z

Related to work in and built on top of #4697.

jasongrout · 2018-06-19T22:09:20Z

Related issue about notifying on kernel status change: #4748

jasongrout · 2018-07-19T04:00:09Z

@minrk and I talked more about this today. Here are some thoughts:

The kernel states are conflating the connection status with the kernel status. Actually, there are three sources for status here: the client (mostly dealing with connection state), the notebook server (dealing with kernel management, like a restarting state), and the kernel itself.

Perhaps we need two states: the connection state and the kernel state. When we are in the disconnected state, the kernel state is unknown.
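A minimal sketch of the two-state idea, assuming hypothetical names (`ConnectionStatus`, `KernelStatus`, `KernelStatusModel`) that are not an existing API. It also assumes that the kernel state is treated as unknown whenever the connection is anything other than connected, which goes slightly beyond what is stated above.

```typescript
// Hypothetical types for illustration only.
type ConnectionStatus = 'connecting' | 'connected' | 'disconnected';
type KernelStatus = 'unknown' | 'starting' | 'idle' | 'busy' | 'restarting';

class KernelStatusModel {
  private _connection: ConnectionStatus = 'disconnected';
  private _kernel: KernelStatus = 'unknown';

  get connectionStatus(): ConnectionStatus {
    return this._connection;
  }

  get kernelStatus(): KernelStatus {
    return this._kernel;
  }

  // Called by the client when the websocket state changes.
  setConnectionStatus(status: ConnectionStatus): void {
    this._connection = status;
    if (status !== 'connected') {
      // Without a live connection we cannot know what the kernel is doing.
      this._kernel = 'unknown';
    }
  }

  // Called when a status report arrives from the server or the kernel.
  setKernelStatus(status: KernelStatus): void {
    // Assumption: status reports are only trusted while connected.
    if (this._connection === 'connected') {
      this._kernel = status;
    }
  }
}
```

Keeping the two values separate means a UI element can show "reconnecting" without pretending to know whether the kernel is busy or idle.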

Another thing: on restart, don't create a new websocket connection to the kernel. Keep the existing connection, and we'll receive the restarting event.

Here are some scenarios:

New Kernel start

1. Start at connection disconnected, state unknown.
2. Request a kernel and initiate a websocket connection: connection -> connecting.
3. Websocket connection set up: connection -> connected.
4. Send a kernel_info_request to cache the kernel info.
5. Receive the kernel info reply and cache the kernel info. Along the way we receive several kernel status messages, such as busy and idle, so that the state ends up on idle.
6. Send the pending messages queued up while the kernel was not connected.
7. Resolve the kernel ready promise, signaling that the kernel information is cached and the kernel is ready for messages.
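The startup sequence above could be sketched roughly as follows. This is an illustration under assumptions, not the implementation in this PR: `KernelStartupSketch`, `Message`, and the `transmit` callback (standing in for the websocket send) are hypothetical names.

```typescript
type ConnectionStatus = 'connecting' | 'connected' | 'disconnected';

interface Message {
  msgType: string;
  content?: Record<string, unknown>;
}

class KernelStartupSketch {
  connectionStatus: ConnectionStatus = 'disconnected'; // step 1
  kernelStatus = 'unknown';
  info: Record<string, unknown> | null = null;
  readonly ready: Promise<void>;
  private _transmit: (msg: Message) => void;
  private _resolveReady: () => void = () => undefined;
  private _pending: Message[] = [];

  // `transmit` is a hypothetical stand-in for the websocket's send function.
  constructor(transmit: (msg: Message) => void) {
    this._transmit = transmit;
    this.ready = new Promise<void>(resolve => {
      this._resolveReady = () => resolve();
    });
  }

  // Step 2: a kernel was requested and the websocket is being opened.
  connect(): void {
    this.connectionStatus = 'connecting';
  }

  // Steps 3 and 4: the socket is open, so ask for the kernel info first.
  onSocketOpen(): void {
    this.connectionStatus = 'connected';
    this._transmit({ msgType: 'kernel_info_request' });
  }

  // Messages sent before the kernel info is cached are queued.
  send(msg: Message): void {
    if (this.info === null) {
      this._pending.push(msg);
    } else {
      this._transmit(msg);
    }
  }

  onMessage(msg: Message): void {
    if (msg.msgType === 'status') {
      this.kernelStatus = String(msg.content?.['execution_state'] ?? 'unknown');
    } else if (msg.msgType === 'kernel_info_reply') {
      this.info = msg.content ?? {}; // step 5: cache the kernel info
      for (const queued of this._pending.splice(0)) {
        this._transmit(queued); // step 6: flush the queue in order
      }
      this._resolveReady(); // step 7
    }
  }
}
```

Connecting to an existing kernel would use the same class; only the step that requests a new kernel from the server is skipped.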

Kernel connection to existing kernel

Exactly like the new kernel start, except that we don't request a kernel start; we just initiate a websocket connection to the URL we already know.

Kernel restart

Don't disconnect the kernel in this entire process.

1. Request a kernel restart from the REST API. Set a restart_requested flag recording that we requested a restart. Don't set the state to restarting yet, since we might have more iopub messages coming through, especially when the notebook flushes the iopub buffer.
2. Wait until the notebook's restarting kernel status comes through. Change the kernel state to restarting. If the restart_requested flag is set, don't notify the user (since the user requested the restart). If the flag is not set, we have an automatic restart, so notify the user somehow.
3. Set kernel.isReady to false, and make a new ready() promise. Start queuing messages here?
4. The kernel starting status might come through, make that a status? Since it's supposed to be an optional status, perhaps we shouldn't make it a recognized value?
5. On kernel restart (e.g., the first message that comes through that uses a different kernel session id?), send a kernel info request.
6. Just as in kernel start, set the kernel to ready and resolve the ready promise.

jasongrout · 2018-07-19T04:12:05Z

A big question: do we want the kernel status / signal to be only some specific values, or do we want to acknowledge kernels can have their own optional status? Do we want the kernel's status to just be a string? Or do we want to ignore status values we don't understand?

blink1073 · 2018-07-19T09:52:59Z

> Set kernel.isReady to false, and make a new ready() promise. Start queuing messages here?

Sounds good

> The kernel starting status might come through, make that a status? Since it's supposed to be an optional status, perhaps we shouldn't make it a recognized value?

It doesn't look like it is optional here. # The kernel will publish state 'starting' exactly once at process startup.

> On kernel restart (e.g., the first message that comes through that uses a different kernel session id?), send a kernel info request.

Sounds good

> Do we want the kernel's status to just be a string? Or do we want to ignore status values we don't understand?

I'd say forward them all since kernel-specific extensions may want that information
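One way to forward every status while still distinguishing the ones the client understands is to keep the status typed as a plain string and pass a flag alongside it. The sketch below assumes hypothetical names (`RECOGNIZED_STATUSES`, `StatusForwarder`), and the list of recognized values is only the set mentioned in this thread.

```typescript
// Statuses mentioned in this discussion; the real list may differ.
const RECOGNIZED_STATUSES: readonly string[] = [
  'unknown',
  'starting',
  'idle',
  'busy',
  'restarting'
];

function isRecognizedStatus(status: string): boolean {
  return RECOGNIZED_STATUSES.indexOf(status) !== -1;
}

type StatusListener = (status: string, recognized: boolean) => void;

class StatusForwarder {
  status = 'unknown';
  private _listeners: StatusListener[] = [];

  connect(listener: StatusListener): void {
    this._listeners.push(listener);
  }

  // Every status is forwarded, including kernel-specific ones,
  // so that extensions can react to values the core does not know.
  handleStatus(status: string): void {
    this.status = status;
    const recognized = isRecognizedStatus(status);
    for (const listener of this._listeners) {
      listener(status, recognized);
    }
  }
}
```

Core UI code can ignore anything with `recognized === false`, while a kernel-specific extension can still observe it.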

jasongrout · 2018-07-19T10:03:38Z

Perhaps also the connected state should just be a boolean flag, rather than also having a 'connecting' state signaling when we are actively trying to connect.

blink1073 · 2018-07-19T10:12:19Z

I think connection state should be: connecting, connected, disconnected.

blink1073 · 2018-07-19T10:12:38Z

We are disconnected if we explicitly disconnect or give up on retries.
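A rough sketch of that three-value connection state, where disconnected is reached only by an explicit disconnect or by running out of retries. `ReconnectTracker` and `maxRetries` are hypothetical names, and the actual reconnection attempt (opening a new websocket) is left out.

```typescript
type ConnectionStatus = 'connecting' | 'connected' | 'disconnected';

class ReconnectTracker {
  status: ConnectionStatus = 'disconnected';
  readonly maxRetries: number;
  private _attempts = 0;

  constructor(maxRetries: number) {
    this.maxRetries = maxRetries;
  }

  // The client starts (or restarts) a connection attempt.
  connect(): void {
    this._attempts = 0;
    this.status = 'connecting';
  }

  // The websocket opened successfully.
  onOpen(): void {
    this._attempts = 0;
    this.status = 'connected';
  }

  // The websocket closed or failed without an explicit disconnect.
  onClose(): void {
    if (this.status === 'disconnected') {
      return; // already given up or explicitly disconnected
    }
    if (this._attempts < this.maxRetries) {
      this._attempts += 1;
      this.status = 'connecting'; // still trying, so show that in the UI
    } else {
      this.status = 'disconnected'; // gave up on retries
    }
  }

  // The user or the application explicitly disconnects.
  disconnect(): void {
    this.status = 'disconnected';
  }
}
```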

jasongrout · 2018-07-19T10:18:54Z

> I think connection state should be: connecting, connected, disconnected.

I guess we want to make sure the user knows through some UI element that we are trying to connect, so I think you're right that it makes sense to make that a specific state.

jasongrout · 2018-07-19T10:20:37Z

> It doesn't look like it is optional here. # The kernel will publish state 'starting' exactly once at process startup.

See the open PR to jupyter_client, based on conversations with @minrk about kernel status: jupyter/jupyter_client#388

ellisonbg · 2018-08-12T03:01:09Z

@jasongrout are you still wanting this to get into Monday's 0.34 release?

jasongrout · 2018-09-15T03:25:23Z

> It doesn't look like it is optional here. # The kernel will publish state 'starting' exactly once at process startup.

By the way, we were also discussing lots of updates to that document: jupyter/jupyter_client#388

jasongrout · 2018-11-29T16:20:00Z

In the kernel restart scenario in #4724 (comment), we should switch steps 2 and 3, so that a user can listen for the 'restarting' status and in that handler, await the kernel ready promise. So the revised steps look like:

Kernel restart

Don't disconnect the kernel in this entire process.

1. Request a kernel restart from the REST API. Set a restart_requested flag recording that we requested a restart. Don't set the state to restarting yet, since we might have more iopub messages coming through, especially when the notebook flushes the iopub buffer.
2. Wait until the notebook's restarting kernel status comes through.
3. Set kernel.isReady to false, and make a new ready() promise. Start queuing messages here?
4. Change the kernel state to restarting. If the restart_requested flag is set, don't notify the user (since the user requested the restart). If the flag is not set, we have an automatic restart, so notify the user somehow.
5. On kernel restart (e.g., the first message that comes through that uses a different kernel session id?), send a kernel info request.
6. Just as in kernel start, set the kernel to ready and resolve the ready promise.
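The point of the reordering is that the new ready promise already exists when status listeners are told about the restart. A minimal sketch, assuming hypothetical names (`RestartingKernelSketch`, `onStatusChanged`) and omitting the REST call and the websocket:

```typescript
type RestartListener = (status: string, userRequested: boolean) => void;

class RestartingKernelSketch {
  status = 'idle';
  isReady = true;
  ready: Promise<void> = Promise.resolve();
  restartRequested = false;
  private _resolveReady: () => void = () => undefined;
  private _listeners: RestartListener[] = [];

  onStatusChanged(listener: RestartListener): void {
    this._listeners.push(listener);
  }

  // Step 1: ask the server to restart; the status is deliberately unchanged.
  requestRestart(): void {
    this.restartRequested = true;
    // The REST request itself is omitted in this sketch.
  }

  // Step 2: the server's 'restarting' status message arrived.
  handleServerRestarting(): void {
    // Step 3: replace the ready promise before anyone is notified.
    this.isReady = false;
    this.ready = new Promise<void>(resolve => {
      this._resolveReady = () => resolve();
    });
    // Step 4: now change the status and notify listeners.
    this.status = 'restarting';
    const userRequested = this.restartRequested;
    this.restartRequested = false;
    for (const listener of this._listeners) {
      listener(this.status, userRequested);
    }
  }

  // Steps 5 and 6: the kernel info reply from the new process arrived.
  handleKernelInfoReply(): void {
    this.status = 'idle';
    this.isReady = true;
    this._resolveReady();
  }
}
```

A listener can then write `kernel.onStatusChanged(async status => { if (status === 'restarting') { await kernel.ready; /* re-run setup */ } })` and be sure it awaits the promise for the new kernel process rather than the old one.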

jasongrout · 2018-11-29T18:16:10Z

Some further simplifications based on conversations with @saulshanabrook, @blink1073:

- Get rid of the public kernel isReady value and ready promise. Instead, if you want to do something on kernel startup, you can just listen for the kernel status to change to restarting (at which point you can send a message and it will be queued).
- We discussed removing the connection status (implying it from an unknown kernel state), but decided to keep it because it really is different information, and it nicely paves the way to an app-wide connection status when we eventually have a single websocket for all server communication.

So now the scenarios look like:

New Kernel start

1. Start at connection disconnected, state unknown.
2. Request a kernel and initiate a websocket connection: connection -> connecting.
3. Websocket connection set up: connection -> connected.
4. Send a kernel_info_request to cache the kernel info.
5. Receive the kernel info reply and cache the kernel info. Along the way we receive several kernel status messages, such as busy and idle, so that the state ends up on idle.
6. Send the pending messages queued up while the kernel was not connected.

Kernel connection to existing kernel

Exactly like the new kernel start, except that we don't request a kernel start; we just initiate a websocket connection to the URL we already know.

Kernel restart

Don't disconnect the kernel in this entire process.

1. Request a kernel restart from the REST API. The kernel object that sends this request is also responsible for adding the kernel id to a static singleton set, kernelsRestarting, perhaps in the Kernel base class. Don't set the state to restarting yet, since we might have more iopub messages coming through, especially when the notebook flushes the iopub buffer.
2. When the notebook's restarting kernel status message comes, change the kernel state to restarting. Clear any pending messages and start storing new pending messages. Anyone can check the kernelsRestarting set to see if this is a kernel restart the user requested, and if it isn't, notify the user (perhaps in each client that is connected to the kernel?) that the kernel has restarted, so state has been cleared.
3. Immediately do steps 4-6 of the new kernel process above (send a kernel info request synchronously so it's the first queued message).
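The restart steps above might look something like this in code. Only the name `kernelsRestarting` comes from the writeup; `KernelSketch`, `transmitted`, and the method names are hypothetical. The sketch also assumes two things the writeup leaves open: the kernel info request bypasses the pending queue so it goes out first, and the kernel id is removed from the set once the kernel info reply arrives.

```typescript
class KernelSketch {
  // Ids of kernels whose restart was requested by a client in this page.
  static readonly kernelsRestarting = new Set<string>();

  readonly id: string;
  status = 'idle';
  // Stand-in for the websocket: records message types in send order.
  readonly transmitted: string[] = [];
  private _pending: string[] = [];

  constructor(id: string) {
    this.id = id;
  }

  // Step 1: request the restart, but leave the status unchanged.
  requestRestart(): void {
    KernelSketch.kernelsRestarting.add(this.id);
    // The REST request itself is omitted in this sketch.
  }

  send(msgType: string): void {
    if (this.status === 'restarting') {
      this._pending.push(msgType); // held until the kernel info is cached
    } else {
      this.transmitted.push(msgType);
    }
  }

  // Steps 2 and 3: returns true if a user requested this restart.
  handleServerRestarting(): boolean {
    this.status = 'restarting';
    this._pending = []; // old pending messages belong to the old process
    this.transmitted.push('kernel_info_request'); // goes out first
    return KernelSketch.kernelsRestarting.has(this.id);
  }

  handleKernelInfoReply(): void {
    KernelSketch.kernelsRestarting.delete(this.id); // assumption: cleanup here
    this.status = 'idle';
    for (const msgType of this._pending.splice(0)) {
      this.transmitted.push(msgType);
    }
  }
}
```

When `handleServerRestarting()` returns false, the restart was automatic, which is the case where the user should be told that kernel state has been cleared.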

Execute code when kernel is reset

Immediately send the kernel message (which will be queued if necessary). Also, connect to the kernel status, and send the kernel message when the status changes to restarting.
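As a small illustration of that pattern, assuming a hypothetical `KernelLike` interface with a `send` method and an `onStatus` subscription (neither is an existing API):

```typescript
// Hypothetical minimal kernel interface for this sketch.
interface KernelLike {
  send(code: string): void;
  onStatus(listener: (status: string) => void): void;
}

// Run `code` now and again after every kernel restart.
function runOnEveryKernelReset(kernel: KernelLike, code: string): void {
  // Send right away; the kernel queues it if it is not ready yet.
  kernel.send(code);
  // Send again each time the kernel reports that it is restarting.
  // The message is queued until the new kernel process is available.
  kernel.onStatus(status => {
    if (status === 'restarting') {
      kernel.send(code);
    }
  });
}
```

Because messages sent during the restarting state are queued, the caller does not need a separate ready promise to know when it is safe to send.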

jasongrout · 2018-11-29T18:17:41Z

@saulshanabrook, @blink1073, can you look at the new writeup at #4724 (comment) ?

jasongrout · 2019-01-23T20:35:46Z

As you can see, I've dug deeper and deeper into how we expose kernel status and other information, including peeling back the session and clientsession objects we have. There were a lot of really confusing dependencies between those concepts.

Here's another thought: how about we get rid of the concept of sessions entirely, at least for most users. They're not extremely useful to us as an abstraction. We would probably still use the concept of sessions in the background to maintain a mapping between kernels and documents, but what is exposed to the user on a document context would be just a kernel, plain and simple. Or perhaps a kernel container of some kind that was very lightweight (i.e., had a kernel attribute, and a kernelChanged signal, or something). Behind the scenes, if you wanted to associate document paths with kernels, you could, and then those would be persisted to the server, but it wouldn't be necessary for things to use sessions.
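To make the lightweight container idea concrete, here is a sketch with a `kernel` attribute and a `kernelChanged` notification. `KernelHandle`, `KernelContainer`, and the callback-based `onKernelChanged` are hypothetical; a real version would presumably use the project's own signal class rather than a plain listener array.

```typescript
// Hypothetical stand-in for a kernel connection object.
interface KernelHandle {
  readonly id: string;
}

interface KernelChangedArgs {
  oldValue: KernelHandle | null;
  newValue: KernelHandle | null;
}

class KernelContainer {
  private _kernel: KernelHandle | null = null;
  private _listeners: Array<(args: KernelChangedArgs) => void> = [];

  get kernel(): KernelHandle | null {
    return this._kernel;
  }

  set kernel(value: KernelHandle | null) {
    if (value === this._kernel) {
      return; // no change, so no notification
    }
    const oldValue = this._kernel;
    this._kernel = value;
    for (const listener of this._listeners) {
      listener({ oldValue, newValue: value });
    }
  }

  // Plays the role of a kernelChanged signal.
  onKernelChanged(listener: (args: KernelChangedArgs) => void): void {
    this._listeners.push(listener);
  }
}
```

Nothing in the container refers to a document path, so associating a path with the kernel can remain an optional, separate concern.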

saulshanabrook · 2019-01-23T22:19:42Z

> Here's another thought: how about we get rid of the concept of sessions entirely, at least for most users.

I am 100% in favor of removing unneeded abstractions! Happy to walk through this with you tomorrow.

jasongrout · 2019-01-23T23:52:34Z

To elaborate more: let's make kernels first-class objects in JLab. Like you should be able to start a kernel with no attached activity, and a kernel should (optionally?) be able to live beyond any document associated with it.


jasongrout · 2019-10-03T06:36:54Z

This is being superseded by #7252

jasongrout added this to the Beta 3 milestone Jun 14, 2018

jasongrout mentioned this pull request Jun 14, 2018

WIP Make kernel message handling async and in order #4697

Merged

2 tasks

afshin mentioned this pull request Jun 26, 2018

Reconcile tree routing handler with workspaces. #4708

Merged

jasongrout modified the milestones: Beta 3, Beta 4 Jun 27, 2018

blink1073 mentioned this pull request Jul 11, 2018

set kernel.isReady to true after KernelInfoReply #4871

Closed

jasongrout force-pushed the kernelstatus branch from d3d2101 to 96fc99b Compare July 19, 2018 04:07

ellisonbg added maintenance pkg:services labels Aug 12, 2018

blink1073 modified the milestones: 0.34, 0.35 Aug 13, 2018

blink1073 modified the milestones: 0.35, 1.0 Sep 5, 2018

blink1073 assigned jasongrout Sep 12, 2018

This was referenced Nov 16, 2018

Should Session.kernelChanged trigger on kernel restart? #5632

Open

Client session status change signal fires twice on kernel shutdown. #5594

Open

jasongrout added 3 commits January 23, 2019 21:31

Make the findById and findByPath functions for kernels and sessions r…

e782389

…eturn undefined if the object was not found. This makes it more consistent with standard Javascript practices, and preserves the errors for true errors.

Convert one more function to async/await.

3985453

WIP initial implementation of a kernel resource.

56f1543

afshin mentioned this pull request Jan 24, 2019

Interrupting of kernel doesn't work (Jupyter Lab) twosigma/beakerx#7928

Open

This was referenced Jan 25, 2019

Shutting down lots of things in the running tab seems to block JLab #5305

Open

if kernel died because of memory, jupyter lab is stuck #4748

Closed

Make extension tutorial explaining how to start and control a kernel #4409

Open

jasongrout added 3 commits January 25, 2019 08:48

Fix a few things in the kernel resource class.

94e0416

Add a set kernel() to the kernel resource.

dea4c48

Add a bunch of TODO items.

0f0569f

jasongrout modified the milestones: 1.0, Future Jan 26, 2019

saulshanabrook mentioned this pull request Jan 28, 2019

WIP: Stop associated kernel when closing widget tab #5914

Closed

This was referenced Feb 8, 2019

No warning/error message when kernal is forced to restart #4273

Closed

Make running extension support third-party sessions #5728

Closed

recamshak mentioned this pull request Feb 19, 2019

Modular running extension #6002

Closed

jasongrout mentioned this pull request Mar 7, 2019

[WIP] more fixes for kernel tests #6073

Closed

jasongrout mentioned this pull request May 15, 2019

Notify the user when a notebook kernel autorestarts #6246

Merged

jasongrout added status:Work in Progress and removed status:Work in Progress labels May 29, 2019

jasongrout added a commit to jasongrout/jupyterlab that referenced this pull request Sep 21, 2019

Update kernel status to have a connection status and a kernel status.

078896a

This simplified logic quite a bit. Most of these ideas came from previous work in jupyterlab#4724

jasongrout mentioned this pull request Sep 21, 2019

Rework kernel and session architecture #7252

Merged

2 tasks

jasongrout added a commit to jasongrout/jupyterlab that referenced this pull request Sep 21, 2019

Update kernel status to have a connection status and a kernel status.

b893d5c

This simplified logic quite a bit. Most of these ideas came from previous work in jupyterlab#4724

jasongrout closed this Oct 3, 2019

lock bot added the status:resolved-locked Closed issues are locked after 30 days inactivity. Please open a new issue for related discussion. label Nov 2, 2019

lock bot locked as resolved and limited conversation to collaborators Nov 2, 2019
