
Support for Async Django #1394

Open · wants to merge 38 commits into main
Conversation

@jaw9c (Collaborator) commented Apr 2, 2023

Feedback requested.

This PR adds support for Django async views. The general strategy follows the same principle as the base graphene library's async support. It implements the following:

  • Django-based resolvers now check whether they are being executed in an async context and, if so, wrap any ORM queries in sync_to_async.
  • Resolvers now follow the same principles as the base graphene library to handle async and sync resolution agnostically.
  • A new async view (credit to this PR)
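The first bullet can be sketched with the standard library alone. This is an illustrative stand-in, not the PR's actual code: `sync_to_async` here is a simplified assumption modeled on asgiref's decorator of the same name, and `fetch_pets` stands in for a blocking ORM query.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor

# Simplified stand-in for asgiref.sync.sync_to_async (the real one also
# handles thread sensitivity): run blocking work in a worker thread.
_pool = ThreadPoolExecutor(max_workers=4)

def sync_to_async(fn):
    async def wrapper(*args, **kwargs):
        loop = asyncio.get_running_loop()
        return await loop.run_in_executor(_pool, lambda: fn(*args, **kwargs))
    return wrapper

def in_async_context() -> bool:
    # The check described above: are we under a running event loop?
    try:
        asyncio.get_running_loop()
        return True
    except RuntimeError:
        return False

def fetch_pets():
    # Stands in for a blocking ORM query such as Pet.objects.all()
    return ["cat", "dog"]

def resolve_pets(root, info=None):
    # A sync caller gets the value directly; an async caller gets an
    # awaitable that runs the blocking query off the event loop.
    if in_async_context():
        return sync_to_async(fetch_pets)()
    return fetch_pets()

print(resolve_pets(None))            # sync path

async def demo():
    return await resolve_pets(None)  # async path

print(asyncio.run(demo()))
```

Both paths return the same data; the only difference is whether the blocking query is dispatched to a worker thread.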

The current progress is as follows:

  • Get the library working for all custom fields in graphene-django when running under async
  • Duplicate the existing tests and edit them to have an async version

Other thoughts:

  • It's annoying to wrap all resolvers in sync_to_async when they perform ORM operations. It would be nice to detect this error and raise a warning on existing codebases.
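Django already raises SynchronousOnlyOperation when the ORM is touched from an event loop, so one hedged sketch of the "detect and warn" idea is a wrapper that catches it and points at the fix. The exception class below is a local stand-in for `django.core.exceptions.SynchronousOnlyOperation`, and `warn_on_sync_orm` is a hypothetical helper, not part of this PR:

```python
import warnings

class SynchronousOnlyOperation(Exception):
    """Stand-in for django.core.exceptions.SynchronousOnlyOperation,
    which the ORM raises when a blocking query runs under an event loop."""

def warn_on_sync_orm(resolver):
    # Hypothetical helper: turn the cryptic ORM error into an actionable
    # warning that names the resolver and suggests sync_to_async.
    def wrapped(*args, **kwargs):
        try:
            return resolver(*args, **kwargs)
        except SynchronousOnlyOperation:
            warnings.warn(
                f"Resolver {resolver.__name__!r} ran ORM code in an async "
                "context; wrap it in asgiref.sync.sync_to_async.",
                RuntimeWarning,
            )
            raise
    return wrapped

@warn_on_sync_orm
def resolve_books(root, info=None):
    # Simulate an unwrapped ORM call hitting the async context
    raise SynchronousOnlyOperation("You cannot call this from an async context")
```

The original error still propagates, so nothing is silently swallowed; the warning just makes the migration path obvious.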

@jaw9c (Collaborator, Author) commented May 5, 2023

Progress update: all the field resolvers now support both being executed in an async context and wrapping Django sync code into a sync context. I've made the assumption that resolving any DjangoXType/Field fields needs to happen in a sync context, which I think is pretty safe. I've started running through all the tests and adding an extra assertion that running in an async executor produces the same result as the sync one.

Currently, people switching to the async view will need to wrap all their top-level resolvers (those directly on the Query class passed into the schema) in @sync_to_async if they run sync code. If someone has an idea of how to prevent this, that would be great, but I can't seem to wrap my head around it!
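The wrapping requirement above might look like the following from a user's side. This is a stdlib sketch under stated assumptions: `sync_to_async` is a simplified stand-in for asgiref's decorator, and the list literal stands in for blocking ORM work such as `Book.objects.all()`.

```python
import asyncio

# Simplified stand-in for asgiref.sync.sync_to_async (assumption: the
# real decorator behaves similarly for this illustration).
def sync_to_async(fn):
    async def inner(*args, **kwargs):
        loop = asyncio.get_running_loop()
        return await loop.run_in_executor(None, lambda: fn(*args, **kwargs))
    return inner

@sync_to_async
def resolve_books(root=None, info=None):
    # Blocking ORM work, e.g. [b.title for b in Book.objects.all()],
    # is safe here because it runs off the event loop.
    return ["Dune", "Emma"]

async def execute():
    return await resolve_books()

print(asyncio.run(execute()))
```

Every top-level resolver that touches the ORM would need this decorator; nested Django fields are handled by the library itself under this PR.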

Next steps are to run through all the test files and add the helper.

It would be great if people could test this on their production projects (specifically when running under ASGI), as I'm interested in how swapping in and out of the sync context affects execution performance.

@firaskafri (Collaborator) commented:


This is great news! I'll discuss with my team trying this in production once it's ready!

@firaskafri (Collaborator) commented:

@jaw9c any updates?

@pfcodes commented Dec 21, 2023

Really need this!

@dima-kov commented Jan 7, 2024

Hey mates, I'm ready to donate some bucks via buymeacoffee if you get this PR live!

@pfcodes commented Jan 8, 2024


I'm down to throw some $ too. Maintainers, please consider enabling sponsorship for this project if it means things can move more quickly. A lot of people depend on this.

@alexandrubese commented Jan 9, 2024

Migrating to v3 without making sure dataloaders and everything else work means people need to adopt new libraries (see @superlevure's comments above).

@dima-kov commented Jan 9, 2024

@alexandrubese so what's the way?

@alexandrubese commented Jan 9, 2024

@dima-kov
Use an older version of graphene-django where things work?

Which might not be ideal, because you're basically adding legacy code that you'll need to update once they make v3 work (although this has been discussed for almost a year, and I don't know when it will work or when they will fix dataloaders).

I don't understand the intricacies of the v3 update; maybe they had to do it, and that's why it was forced?

Or use a different library

PS: I don't know why the Django team didn't work on adding proper GraphQL support.

Spring, .Net MVC and many other “web frameworks” did it.

@dima-kov commented Feb 1, 2024

Regarding the comment from @superlevure:

I agree with releasing AsyncGraphQLView.

We are almost there; the only remaining tasks are performance benchmarks and documentation updates. This means most of the work is already done and only a small fraction remains.

@jaw9c, it seems you might be short on time for this. I'm willing to contribute efforts at this point.

@superlevure, to be transparent, I haven't conducted performance testing before, but I'm attempting it now. Any suggestions on how to approach this appropriately would be highly appreciated.

charn added a commit to City-of-Helsinki/open-city-profile that referenced this pull request Feb 2, 2024
Data loaders that exist are not fully compatible with new versions of
graphene and graphene-django. DjangoConnectionField doesn't seem to handle
loaders correctly and instead returns errors like:

"Cannot return null for non-nullable field EmailNodeConnection.edges."

So for now, data loaders will be disabled for this field type.

Use graphql-sync-dataloaders to make other types of fields work with
data loaders.

Some GitHub issues for reference:
- graphql-python/graphene-django#1394
- graphql-python/graphene-django#1263
- graphql-python/graphene-django#1425

Refs: HP-2082
@dima-kov commented Feb 2, 2024

Benchmarks:
16.547 seconds vs 83.194 seconds for I/O-bound queries: 5x faster, async wins.

Details are here: https://github.com/dima-kov/django-graphene-benchmarks?tab=readme-ov-file#tldr

Async:

Concurrency Level:      10
Time taken for tests:   16.547 seconds
Complete requests:      1000

vs

Sync

Concurrency Level:      10
Time taken for tests:   83.194 seconds
Complete requests:      1000

@superlevure are we fine now? What should be the next steps to make this public?

@dima-kov left a comment

🔥

@kamilglod commented:
@dima-kov shouldn't you run the sync benchmark with 10 workers instead of 4? I know measuring sync vs async is hard, but with 10 workers we would have a consistent number of requests handled at the same time by the server, so we should get an actual comparison. I guess we would get similar results, which is fine, as the biggest benefit of async is using fewer resources.

@dima-kov commented Feb 3, 2024

Hmm, that would eat lots of memory. From the docs:

Gunicorn should only need 4-12 worker processes to handle hundreds or thousands of requests per second. Gunicorn relies on the operating system to provide all of the load balancing when handling requests. Generally we recommend (2 x $num_cores) + 1 as the number of workers to start off with.
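That rule of thumb is just arithmetic; a tiny helper makes it explicit (`recommended_workers` is a hypothetical name, not a Gunicorn API):

```python
import os

def recommended_workers(num_cores=None):
    # Gunicorn docs' rule of thumb quoted above: (2 x $num_cores) + 1
    if num_cores is None:
        num_cores = os.cpu_count() or 1
    return 2 * num_cores + 1

print(recommended_workers(4))  # 9, matching the 9 workers tried on the 4-core machine below
```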

I'm running on a 4-core machine, so trying gunicorn with 9 workers I got only a slightly better result:

Concurrency Level:      10
Time taken for tests:   80.785 seconds
Complete requests:      1000
Failed requests:        0

and memory usage was 9 processes at ~80 MB each, ≈720 MB total. The bottleneck here might be the sqlite3 database.


vs async:

Concurrency Level:      10
Time taken for tests:   17.586 seconds
Complete requests:      1000
Failed requests:        0

and memory usage was 1 process at ~60 MB.

@kamilglod commented:
The SQLite docs say that concurrent writes are locked but reads should be fine, so I don't think the problem is SQLite itself.

Maybe it's because of differences between the sync and async resolvers? In async you're fetching the related octopus_type; in sync you're not.
https://github.com/dima-kov/django-graphene-benchmarks/blob/main/project/api/schema/queries.py#L43-L51
https://github.com/dima-kov/django-graphene-benchmarks/blob/main/project/api/schema_async/queries.py#L48-L64

@dima-kov commented Feb 7, 2024

Oh, shame on me. Here is the fixed comparison:

Comparison

                  Sync      Async     Sync      Async
Requests          1000      1000      1000      1000
Concurrency       100       100       100       100
Processes         9         1         4         1
Threads per proc  1         100       1         100
Memory            ~720 MB   ~80 MB    ~320 MB   ~80 MB
Time              23.384 s  13.411 s  24.465 s  13.670 s

@dima-kov commented Feb 7, 2024

Now it can be asserted that releasing the async version will yield roughly a twofold speedup for I/O-bound endpoints.

Moreover, take a look at this fairer (same resources) comparison:

                     Sync      Async
Requests             1000      1000
Concurrency          100       100
Processes            9         9
Threads per process  1         1-30
Memory               ~720 MB   ~720 MB
Time                 23.384 s  4.719 s

That's a 5x speedup with identical resource utilization, excluding threads!
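The claimed factor checks out arithmetically (numbers copied from the table above):

```python
# Quick arithmetic behind the claims above: 1000 requests per run
sync_time, async_time = 23.384, 4.719   # seconds, from the table

speedup = sync_time / async_time
sync_rps = 1000 / sync_time
async_rps = 1000 / async_time

print(round(speedup, 2))                       # ~4.96, i.e. the "5x" quoted
print(round(sync_rps, 1), round(async_rps, 1)) # throughput in requests/second
```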

@dima-kov commented Feb 8, 2024

Guys, we really need this (async resolvers). Please tell us how we can help make it public; I don't want to start using it without being sure it will be merged into main.

Or at least let us know whether you are going to release this and what's still missing: 1, 2, 3...

@superlevure (Collaborator) commented:

Thanks for the benchmarks, I'll review the PR again tomorrow. Note that I don't have merge rights on this repo; we'll need a review from one of @firaskafri, @sjdemartini, @kiendang (or others).

@firaskafri (Collaborator) commented Feb 9, 2024

@superlevure I think it is good to go as soon as we clarify the docs. What do you think @kiendang @jaw9c @sjdemartini?

@superlevure (Collaborator) commented:

@dima-kov I had a look at your benchmarks and I have a few remarks.

First, it looks like you're comparing the sync and async resolvers on the same branch of graphene-django (this PR's branch). It would be fairer to compare the sync and async versions of this branch against the sync version of graphene-django's main branch, since this PR also affects sync resolvers, and one point of the benchmarks is to make sure no performance penalty is introduced for existing code.

Second, I noticed the sync resolver is returning 500 objects while the async version is only returning 10 objects.

I took the liberty of pushing a PR to your repo that covers those points, as well as setting up a Docker-based environment with Postgres as the DB, to be closer to a real-life use case.

I obtain the following results:

# Sync version [main]
Concurrency Level:      100
Time taken for tests:   45.517 seconds
Complete requests:      1000
Failed requests:        0
Non-2xx responses:      1000
Total transferred:      60232000 bytes
Total body sent:        238000
HTML transferred:       59963000 bytes
Requests per second:    21.97 [#/sec] (mean)
Time per request:       4551.739 [ms] (mean)
Time per request:       45.517 [ms] (mean, across all concurrent requests)

# Sync version [this branch]
Concurrency Level:      100
Time taken for tests:   203.300 seconds
Complete requests:      1000
Failed requests:        0
Total transferred:      45805000 bytes
Total body sent:        244000
HTML transferred:       45382000 bytes
Requests per second:    4.92 [#/sec] (mean)
Time per request:       20330.019 [ms] (mean)
Time per request:       203.300 [ms] (mean, across all concurrent requests)

# Async version [this branch]
# didn't run till the end, see below
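As a sanity check on the listings above, ab's derived metrics follow directly from the raw totals. A small calculation (`ab_derived` is a hypothetical helper, not part of ApacheBench) reproduces the numbers in the sync [main] run:

```python
def ab_derived(total_seconds, requests, concurrency):
    # Recompute ApacheBench's derived metrics from its raw totals:
    rps = requests / total_seconds                                   # "Requests per second (mean)"
    per_request_ms = total_seconds * concurrency * 1000 / requests   # "Time per request (mean)"
    across_all_ms = total_seconds * 1000 / requests                  # "(mean, across all concurrent requests)"
    return round(rps, 2), round(per_request_ms, 1), round(across_all_ms, 3)

# Sync [main] run above: 45.517 s total, 1000 requests, concurrency 100
print(ab_derived(45.517, 1000, 100))  # (21.97, 4551.7, 45.517)
```

These match the ab output to rounding, so the derived figures in both listings are internally consistent.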

Data tends to show a huge performance hit for the sync version of this PR. Concerning the async version, I run into the following DB error, which makes all requests fail until I restart the DB:

2024-02-11 16:31:39.310 UTC [4579] FATAL:  sorry, too many clients already

I suspect the async version is leaving DB connections open somewhere (the Postgres max_connections setting is 100, which is the concurrency level used in the benchmark).

I'll continue to play with the PR a bit tonight to make sure there's nothing wrong with my setup, but I'm curious if others can reproduce similar results.

@dima-kov commented:

@superlevure thank you for looking into it! I missed that part.

@dima-kov commented Feb 11, 2024

Regarding the results you got: hmm, that's really strange. Both sync and async (100 concurrent requests, 1k requests in total) showed me roughly 20 s; 100+ s is unbelievable to me. Maybe the difference comes from the dockerization.

I'm playing with the main version now. Main:

Concurrency Level:      100
Time taken for tests:   23.706 seconds
Complete requests:      1000
Failed requests:        0

@dolgidmi commented Mar 4, 2024


Django currently doesn't support persistent connections for ASGI https://code.djangoproject.com/ticket/33497

@rw88 commented May 24, 2024


For those who use any external connection pooler like pgbouncer or AWS RDS Connection Pooling, this will not be a problem.
