Help prevent DOS attacks on graphql servers #2549

bbakerman · 2021-09-15T08:30:08Z

PR for limiting parsing tokens

bbakerman · 2021-09-15T09:28:47Z

src/main/java/graphql/parser/GraphqlAntlrToLanguage.java

+    public ParserOptions getParserOptions() {
+        return parserOptions;
+    }
+


@andimarek - we will want to be careful here with our Nadel hacks.

jord1e

Saw this

src/main/java/graphql/parser/Parser.java

Co-authored-by: Jordie <30464310+jord1e@users.noreply.github.com>

Motivation: Parser CPU and memory usage is linear to the number of tokens in a document however in extreme cases it becomes quadratic due to memory exhaustion. On my mashine it happens on queries with 2k tokens. For example: ``` { a a <repeat 2k times> a } ``` It takes 741ms on my machine. But if we create document of the same size but smaller number of tokens it would be a lot faster. Example: ``` { a(arg: "a <repeat 2k times> a" } ``` Now it takes only 17ms to process, which is 43 time faster. That mean if we limit document size we should make this limit small since it take only two bytes to create a token, e.g. ` a`. But that will hart legit documents that have long tokens in them (comments, describtions, strings, long names, etc.). That's why this PR adds a mechanism to limit number of token in parsed document. Also exact same mechanism implemented in graphql-java, see: graphql-java/graphql-java#2549 I also tried alternative approach of counting nodes and it gives slightly better approximation of how many resources would be consumed. However comparing to the tokens, AST nodes is implementation detail of graphql-js so it's imposible to replicate in other implementation (e.g. to count this number on a client).

* parser: limit maximum number of tokens Motivation: Parser CPU and memory usage is linear to the number of tokens in a document however in extreme cases it becomes quadratic due to memory exhaustion. On my mashine it happens on queries with 2k tokens. For example: ``` { a a <repeat 2k times> a } ``` It takes 741ms on my machine. But if we create document of the same size but smaller number of tokens it would be a lot faster. Example: ``` { a(arg: "a <repeat 2k times> a" } ``` Now it takes only 17ms to process, which is 43 time faster. That mean if we limit document size we should make this limit small since it take only two bytes to create a token, e.g. ` a`. But that will hart legit documents that have long tokens in them (comments, describtions, strings, long names, etc.). That's why this PR adds a mechanism to limit number of token in parsed document. Also exact same mechanism implemented in graphql-java, see: graphql-java/graphql-java#2549 I also tried alternative approach of counting nodes and it gives slightly better approximation of how many resources would be consumed. However comparing to the tokens, AST nodes is implementation detail of graphql-js so it's imposible to replicate in other implementation (e.g. to count this number on a client). * Apply suggestions from code review Co-authored-by: Yaacov Rydzinski <yaacovCR@gmail.com> Co-authored-by: Yaacov Rydzinski <yaacovCR@gmail.com>

Backport of graphql#3684 Motivation: Parser CPU and memory usage is linear to the number of tokens in a document however in extreme cases it becomes quadratic due to memory exhaustion. On my mashine it happens on queries with 2k tokens. For example: ``` { a a <repeat 2k times> a } ``` It takes 741ms on my machine. But if we create document of the same size but smaller number of tokens it would be a lot faster. Example: ``` { a(arg: "a <repeat 2k times> a" } ``` Now it takes only 17ms to process, which is 43 time faster. That mean if we limit document size we should make this limit small since it take only two bytes to create a token, e.g. ` a`. But that will hart legit documents that have long tokens in them (comments, describtions, strings, long names, etc.). That's why this PR adds a mechanism to limit number of token in parsed document. Also exact same mechanism implemented in graphql-java, see: graphql-java/graphql-java#2549 I also tried alternative approach of counting nodes and it gives slightly better approximation of how many resources would be consumed. However comparing to the tokens, AST nodes is implementation detail of graphql-js so it's imposible to replicate in other implementation (e.g. to count this number on a client). * Apply suggestions from code review Co-authored-by: Yaacov Rydzinski <yaacovCR@gmail.com> Co-authored-by: Yaacov Rydzinski <yaacovCR@gmail.com>

This adds a maximum number of tokens to be parse in queries by default

f331ee9

bbakerman added this to the 17.3 milestone Sep 15, 2021

Fixed test

b6878db

bbakerman commented Sep 15, 2021

View reviewed changes

jord1e reviewed Sep 16, 2021

View reviewed changes

src/main/java/graphql/parser/Parser.java Outdated Show resolved Hide resolved

Update src/main/java/graphql/parser/Parser.java

ddecadc

Co-authored-by: Jordie <30464310+jord1e@users.noreply.github.com>

andimarek approved these changes Sep 18, 2021

View reviewed changes

bbakerman merged commit 7f27a04 into master Sep 18, 2021

act1on3 mentioned this pull request Jul 18, 2022

Denial of Service via Directive overloading #2888

Closed

IvanGoncharov mentioned this pull request Jul 28, 2022

parser: limit maximum number of tokens graphql/graphql-js#3684

Merged

IvanGoncharov mentioned this pull request Aug 16, 2022

parser: limit maximum number of tokens graphql/graphql-js#3702

Merged

lrlna mentioned this pull request Oct 21, 2022

apply recursion limit api to all tokens apollographql/apollo-rs#327

Closed

stellanor mentioned this pull request Nov 30, 2022

Limiting token count in parse phase to prevent DoS absinthe-graphql/absinthe#1210

Merged

oryan-block mentioned this pull request Aug 18, 2023

graphqls file with at least 15000 tokens - What is config to fix graphql-java-kickstart/graphql-java-tools#759

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Help prevent DOS attacks on graphql servers #2549

Help prevent DOS attacks on graphql servers #2549

bbakerman commented Sep 15, 2021 •

edited

bbakerman Sep 15, 2021

jord1e left a comment

Help prevent DOS attacks on graphql servers #2549

Help prevent DOS attacks on graphql servers #2549

Conversation

bbakerman commented Sep 15, 2021 • edited

bbakerman Sep 15, 2021

Choose a reason for hiding this comment

jord1e left a comment

Choose a reason for hiding this comment

bbakerman commented Sep 15, 2021 •

edited