Restart engine on panic #2100

Sytten · 2020-04-05T14:52:26Z

Problem

As I understand it, if the engine panics during a request, the client will be rendered nonoperational. Since I only create one client for my application (shared via the Context in an Apollo server), it would mean that the service would need to be restarted. This is problematic for uptime since it would affect other customers that have queries that work.

Solution

It would be best if the client would restart the engine automatically. Otherwise it would be best to document that we need to create a client or call connect for each request.

Additional context

This is needed for a production deployment since errors will happen and they need to be not catastrophic.

Sytten · 2020-04-05T15:10:57Z

I dig a bit and I have some findings:

For each test, I killed the engine process manually (simulating a panic)
Calling connect on the client for each request did not restart the process
Creating a new client for each request starts a new process

entrptaher · 2020-04-06T07:49:48Z

+1 for this.

I just tested the prisma.connect() for each request and it has a heavy burden of 15-35ms in any given case. Here is a random case for just one request.

connect: 31.921ms
promises:push(user.create): 24.488ms
disconnect: 2.218ms

If I don't connect it myself, then it connects on first request,

promises:push(user.create): 57.864ms
disconnect: 2.666ms

I'm disconnecting purposefully because last night I saw a lot of zombie processes during a little benchmark of prisma2. It's already half slower than the prisma 1 in that case just because we have to avoid the panic.

However, repeated queries are fast (even faster) enough, but the engine panic still remains an issue,

➜  prisma-1 node index.js
createUser: 24.935ms
createUser: 9.794ms
createUser: 8.186ms
createUser: 8.543ms
createUser: 8.417ms
createUser: 8.088ms
createUser: 8.178ms
createUser: 7.846ms
createUser: 8.537ms
createUser: 8.128ms

➜  prisma-2 node index.js
connect: 37.185ms
user.create: 24.196ms
user.create: 3.773ms
user.create: 3.126ms
user.create: 2.653ms
user.create: 2.588ms
user.create: 2.713ms
user.create: 2.811ms
user.create: 3.830ms
user.create: 3.053ms
user.create: 2.302ms
disconnect: 2.423ms

Zombie Process

I posted this over the prisma.slack.com thread,
and I think this might be related, if there are panic, it should be cleaned properly.

mavilein · 2020-04-16T14:43:41Z

@Sytten : If there's a panic during request execution the engine process will not die. The only panics that may kill the process are the ones that happen before the server has finished starting.

Sytten · 2020-04-16T14:50:39Z

@mavilein I was discussing it with @timsuchanek and he explain that, but my whole premise is based on the experience I had on a panic that crashed the rust process. It is probably fixed now, but I still think that it can happen until the Neon binding is implemented. Meaning that if the child dies in the JS code, it should be restarted (if the client wants that).
So the way we are currently planning would be:

In the async intercept in NodeEngine, we know if the process died/exited/crashed
We have to bubble that to the PrismaClient and remove the connectPromise
The client can then call connect on each request in a middleware to restart the engine
Requests will wait for the engine to restart

wSedlacek · 2020-05-05T05:01:23Z

I am running into an issue with limited memory with a large database and exposing queries pubically. The result is the query engine dieing when memory runs out which is to be expected then all future request failing due to the query engine being dead. While I should limit request so they don't end up this large of someone does find a request that kills the engine it would be nice if that engine didn't require a manual restart of the application.

kindywu · 2020-05-20T07:02:00Z

engine crash and never come back

findMany(
{
skip: -1
}
)

timsuchanek · 2020-05-29T14:31:34Z

Thanks a lot for reporting 🙏
This issue is fixed in the latest alpha version of @prisma/cli.
You can try it out with npm i -g @prisma/cli@alpha.

In case it’s not fixed for you - please let us know and we’ll reopen this issue!

entrptaher · 2020-05-29T15:05:03Z

Oh very interesting. Gonna test out.

Sytten · 2020-05-29T16:01:38Z

Will test it too this weekend.

schickling added bug/1-unconfirmed Bug should have enough information for reproduction, but confirmation has not happened yet. process/candidate labels Apr 5, 2020

entrptaher mentioned this issue Apr 6, 2020

Fails inside node cluster - Address already in use prisma/prisma-client-js#632

Closed

pantharshit00 added kind/improvement An improvement to existing feature and code. and removed bug/1-unconfirmed Bug should have enough information for reproduction, but confirmation has not happened yet. labels Apr 8, 2020

divyenduz added this to the Beta 3 milestone Apr 9, 2020

janpio assigned schickling and timsuchanek Apr 9, 2020

divyenduz added tech/typescript Issue for tech TypeScript. and removed process/candidate labels Apr 9, 2020

Sytten mentioned this issue Apr 17, 2020

Emit an error event when the database connection is interrupted #2218

Closed

divyenduz added the process/next-milestone label Apr 21, 2020

janpio unassigned schickling Apr 21, 2020

janpio modified the milestones: Beta 3, Beta 4 Apr 21, 2020

janpio removed the process/next-milestone label Apr 21, 2020

divyenduz added process/next-milestone and removed process/next-milestone labels Apr 30, 2020

divyenduz modified the milestones: Beta 4, Beta 5 May 4, 2020

Sytten mentioned this issue May 5, 2020

Prisma's query engine does not restart after crashing prisma/prisma-client-js#678

Closed

squirly mentioned this issue May 5, 2020

After Prisma Engine panic all future requests fail #2366

Closed

janpio modified the milestones: Beta 5, Beta 6 May 12, 2020

Sytten mentioned this issue May 18, 2020

Query Engine does not come back if manually killed #2507

Closed

janpio modified the milestones: Beta 6, Beta 7 May 26, 2020

timsuchanek closed this as completed May 29, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restart engine on panic #2100

Restart engine on panic #2100

Sytten commented Apr 5, 2020

Sytten commented Apr 5, 2020

entrptaher commented Apr 6, 2020 •

edited

mavilein commented Apr 16, 2020

Sytten commented Apr 16, 2020

wSedlacek commented May 5, 2020

kindywu commented May 20, 2020 •

edited

timsuchanek commented May 29, 2020

entrptaher commented May 29, 2020

Sytten commented May 29, 2020

Restart engine on panic #2100

Restart engine on panic #2100

Comments

Sytten commented Apr 5, 2020

Problem

Solution

Additional context

Sytten commented Apr 5, 2020

entrptaher commented Apr 6, 2020 • edited

Zombie Process

mavilein commented Apr 16, 2020

Sytten commented Apr 16, 2020

wSedlacek commented May 5, 2020

kindywu commented May 20, 2020 • edited

timsuchanek commented May 29, 2020

entrptaher commented May 29, 2020

Sytten commented May 29, 2020

entrptaher commented Apr 6, 2020 •

edited

kindywu commented May 20, 2020 •

edited