Experimental: use Normalized fields instead of lazy collect field #2358

andimarek · 2021-05-23T06:15:34Z

This change leverages the normalized query structure to collect the merged subfields instead of calling collectFields on demand.

This aims to improve the performance for especially large and deep queries with a lot of overlapping fields: the theory is that a one time build Normalized query is faster than the lazy/on demand collectFields code.

If this really improves the performance needs to be verified.

This potentially also enables caching of a normalized query which means the merging of overlapping fields could be cached.
One aspect to consider is that a NQ is currently not independent of the variables: it needs all variables present to be build.

andimarek · 2021-05-23T06:22:40Z

@dfa1 hey Davide .... It would be really great if you could try out this experimental version and let us know if it improved your performance. Thanks a lot

dfa1 · 2021-05-23T07:18:12Z

@andimarek of course! It looks very interesting... :-)

Would be possible to publish this branch in maven central?

andimarek · 2021-05-23T11:32:46Z

@dfa1 It is published as 230521-nf-execution

dfa1 · 2021-05-23T13:51:27Z

@andimarek there is a breaking change in a @PublicApi class:

  symbol:   method getObjectType()
  location: variable field of type graphql.schema.SelectedField

Not sure if this replacement is good enough for production usage:

field.getObjectTypes().get(0)

dfa1 · 2021-05-23T14:33:26Z

another breaking change is:

symbol:   method getFieldDefinition()
location: variable field of type graphql.schema.SelectedField

andimarek · 2021-05-23T20:39:31Z

@dfa1 yes, this PR relies on two other PRs: #2338 and #2325. The latter contains some breaking changes.

dfa1 · 2021-05-24T20:40:12Z

@andimarek using yourkit with cpu tracing profilter. I cannot see anymore the "collectField" hotspot. This is really nice result! 🥇

However, I can't push this version up to the integration env since I'm not sure about my fixes for the breaking changes in SelectedField. The PR you mentioned doesn't change SelectedField at all... please advise :)

andimarek · 2021-05-24T21:02:35Z

hi @dfa1 .... I am sorry, the PR with the breaking changes is this: #2345

SelectedField now can reference multiple object types and not just one: depending on your schema and use case you might be ok to just use the first one, but it really depends.

bbakerman · 2021-06-15T04:41:05Z

src/main/java/graphql/execution/ExecutionStrategy.java

+            MergedField newMergedField = normalizedQueryTree.getNormalizedFieldToMergedField().get(child);
+            subFieldsMap.put(child.getResultKey(), newMergedField);
+        }
+        MergedSelectionSet subFields = MergedSelectionSet.newMergedSelectionSet().subFields(subFieldsMap).build();


This feels like a bit of code that could be in NormalizedQueryTree

MergedSelectionSet subFields = normalizedQueryTree.getSelectionSet(normalisedField, resolvedObjectType);

dfa1 · 2023-05-29T18:04:50Z

@andimarek what is the result of this experiment?

timward60 · 2023-06-29T20:08:34Z

This potentially also enables caching of a normalized query which means the merging of overlapping fields could be cached.
One aspect to consider is that a NQ is currently not independent of the variables: it needs all variables present to be build.

Super interested to understand the feasibility and any thoughts on how it could be achieved (especially what would need to be break out the query runtime variables).

Our project relies on DataFetchingSelectionSets with some fairly large/deep queries. We are seeing a large portion of time spent calculating this, so if we can cache this it would be a large win.

github-actions · 2023-12-26T00:15:00Z

Hello, this pull request has been inactive for 60 days, so we're marking it as stale. If you would like to continue working on this pull request, please make an update within the next 30 days, or we'll close the pull request.

leverage NF to collection fields

1872182

skip one test

264fec3

andimarek changed the base branch from normalized-input to master May 24, 2021 20:59

bbakerman reviewed Jun 15, 2021

View reviewed changes

andimarek added the Not to be merged spikes or other stuff that should never or not yet to be merged label Jul 18, 2021

github-actions bot added the Stale label Dec 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experimental: use Normalized fields instead of lazy collect field #2358

Experimental: use Normalized fields instead of lazy collect field #2358

andimarek commented May 23, 2021 •

edited

andimarek commented May 23, 2021

dfa1 commented May 23, 2021

andimarek commented May 23, 2021

dfa1 commented May 23, 2021

dfa1 commented May 23, 2021

andimarek commented May 23, 2021

dfa1 commented May 24, 2021

andimarek commented May 24, 2021

bbakerman Jun 15, 2021

dfa1 commented May 29, 2023

timward60 commented Jun 29, 2023 •

edited

github-actions bot commented Dec 26, 2023

Experimental: use Normalized fields instead of lazy collect field #2358

Are you sure you want to change the base?

Experimental: use Normalized fields instead of lazy collect field #2358

Conversation

andimarek commented May 23, 2021 • edited

andimarek commented May 23, 2021

dfa1 commented May 23, 2021

andimarek commented May 23, 2021

dfa1 commented May 23, 2021

dfa1 commented May 23, 2021

andimarek commented May 23, 2021

dfa1 commented May 24, 2021

andimarek commented May 24, 2021

bbakerman Jun 15, 2021

Choose a reason for hiding this comment

dfa1 commented May 29, 2023

timward60 commented Jun 29, 2023 • edited

github-actions bot commented Dec 26, 2023

andimarek commented May 23, 2021 •

edited

timward60 commented Jun 29, 2023 •

edited