Add stripIgnoredTokens feature to remove insignificant whitespace #1628

rybon · 2018-12-24T17:09:36Z

Solves #1523.

facebook-github-bot · 2018-12-24T17:09:44Z

Thank you for your pull request and welcome to our community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. In order for us to review and merge your code, please sign up at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need the corporate CLA signed.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

facebook-github-bot · 2018-12-24T18:09:09Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Facebook open source project. Thanks!

rybon · 2018-12-28T19:47:48Z

@IvanGoncharov can you review this one? Thanks.

jgcmarins

nice

IvanGoncharov · 2019-01-02T13:00:00Z

@rybon Thanks for PR 👍 and Happy New Year 🎄

Thanks a lot for writting bunch of test cases.
Here is a few desing question to move this PR forward:

Proposed functionality doesn't work with AST so I don't see any benefits in having it as part of print. Moreover it's hard to use with printSchema:

const schema = buildClientSchema(introspection);
let sdl = printSchema(schema);
const ast = parse(sdl);
sdl = print(ast, { condense: true });

I think we should make 'condense' a standalone function inside utils.

RegExp are good for PoC but it's a big maintainance problem in the future.
Plus the are huge number of edge cases, e.g. simplest one is:

{
  compile(jsCode: "function () { console.log('Valid JS code inside GraphQL string'); }")
}

You just can't handle such use cases with RegExp and should use lexer to correctly implement this functionality:

graphql-js/src/language/__tests__/lexer-test.js

Lines 726 to 740 in fc3e2e3

    
               const lexer = createLexer( 
        
                 new Source(`{ 
        
                 #comment 
        
                 field 
        
               }`), 
        
               ); 
        
               const startToken = lexer.token; 
        
               let endToken; 
        
               do { 
        
                 endToken = lexer.advance(); 
        
                 // Lexer advances over ignored comment tokens to make writing parsers 
        
                 // easier, but will include them in the linked list result. 
        
                 expect(endToken.kind).not.to.equal('Comment'); 
        
               } while (endToken.kind !== '<EOF>');

Last issue for me is the name, I understand why you can't call it minify (names of variables, fragments and operations stay unchanged) but I personally think it's imposible to figure out what condense do just from its name.
Since you just remove ignored tokens how about renaming it to removeIgnoredTokens?

@rybon What do you think?

rybon · 2019-01-02T14:59:29Z

@IvanGoncharov Happy New Year to you too :-). Thanks for the review.

I've put this feature in the printer module as that is the module that 'prints' an AST to a string. But if that isn't the right place to put it and we should move it to utils instead, fine by me. I don't really have an opinion on that. I'd rather have something in place in this library than nothing.

I agree that RegExp is kinda hacky and not an ideal solution. There are indeed a number of edge cases that are very hard to catch with this approach. And if the GraphQL spec expands, one would need to expand the RegExp as well. I don't have any experience with ASTs and lexers, so I don't know how to implement this feature with that approach instead. But I'm willing to try it out. Would you (or anyone else) be willing to help me get started on this?

I'm not really attached to the name condense. Naming things is hard. I was trying to come up with a name that is descriptive of the feature (https://en.wiktionary.org/wiki/condense) and minify didn't quite feel right. But another name is fine by me.

IvanGoncharov · 2019-01-02T19:37:05Z

@rybon Sorry, I accidentally screw this PR trying to add commit on top of it.

I don't have any experience with ASTs and lexers, so I don't know how to implement this feature with that approach instead. But I'm willing to try it out. Would you (or anyone else) be willing to help me get started on this?

I implemented it here: be79c4e
You just need to document and test it.
I can't reopen this PR :(
Please open a new one based on removeIgnoredTokens and make necessary changes.

IvanGoncharov · 2019-01-02T19:55:33Z

@rybon I tested it based on your example:

const {
  stripIgnoredTokens
} = require('./dist/utilities/stripignoredtokens');

const result = stripIgnoredTokens(`
query SomeQuery($foo: String!, $bar: String) {
  someField(foo: $foo, bar: $bar) {
    a
    b {
      c
      d
    }
  }
} 
`);

console.log(result);
// query SomeQuery($foo:String!$bar:String){someField(foo:$foo bar:$bar){a b{c d}}}

rybon · 2019-01-02T20:02:06Z

Looks good.

I'll revert the changes in printer and move the tests to utilities.

IvanGoncharov · 2019-01-02T20:10:27Z

@rybon Great 👍
Please also add tests for SDL.
Some introspection results are huge and storing them as condense SDL is valid use case.

rybon · 2019-01-02T21:09:20Z

@IvanGoncharov done. I've added tests for the Query document and SDL document.

langpavel · 2019-01-17T12:15:18Z

I like this idea, really.

But code coverage should be increased — especially new features should be deeply tested.

BTW can this be implemented for AST too? — like print is now?

I always have AST (processed, validated, etc) when I wish write it down for next tooling.

rybon · 2019-01-24T15:18:39Z

@langpavel OK fair enough. But keep in mind that I'm not very familiar with this codebase, so I would need some help to get that done. Could you help me out?

langpavel · 2019-01-24T16:00:48Z

@rybon I think that all you need is altered copy of src/language/printer.js

rybon · 2019-01-24T16:10:49Z

That is where the implementation was originally done (see earlier commits). What needs to be done in that copy?

langpavel · 2019-01-24T16:51:32Z

@rybon Ok, I see first commit.
Point is that you used regex replaces, why? All the source code is baked here from AST, so if you rewrite this functions (join, block, indent) and printDocASTReducer, you will have everythig for free.

BTW I like latest tiny implementation. But having only AST version make more sense to me.
@rybon you should cover code by tests, it's curious that you decreased code coverage.. 🤔

@IvanGoncharov What do you think?

rybon · 2019-01-24T17:31:25Z

The RegExp implementation was done because I wasn't aware of an alternative approach at the time. I didn't decrease coverage on purpose, but I see it decreased by 0.007%. Are we striving for 100% coverage in this project? If so, how do we achieve that (have every single line covered)?

OK, I can rewrite these functions, but they would have to be parameterized to deal with this use case. What should the implementation be in that case? I have no experience with ASTs.

langpavel · 2019-01-24T17:36:32Z

@rybon let's wait for @IvanGoncharov.

OK, I can rewrite these functions, but they would have to be parameterized to deal with this use case. What should the implementation be in that case? I have no experience with ASTs.

Better way is create new printDocASTReducer, say printDenseASTReducer which will define custom serializers.

You can learn about AST here: astexplorer.net

IvanGoncharov · 2019-01-24T18:31:50Z

@rybon Situation is like this I agree with the current implementation.
It contains some minor hack: testing token for value to distinguish between punctuation tokens and all other but it can be a fixed latter.

The real problem here is that graphql-js has a pretty high standard for tests.
Not only in coverage (ATM 98% and we slowly moving to 100%) but in testing functionality.
It's very easy to fully cover this function since it short and doesn't contain a lot of if statements.

The idea here is that it simply not enough to use tests for print they are designed for a totally different case in mind.
You can't just pass the output of print through stripIgnoredTokens and test result since stripIgnoredTokens is the standalone function and should be tested as such.
For example print never return tabs so how do you test that tabs are removed.
Or how we test for Unicode BOM which is also ignored token.

I'm fully supporting this functionality but currently I need to finish a couple of other long standing issues with this lib. I can promise that we will include it in upcoming 14.2.0. But I need some time to finish other projects that I already started.

rybon · 2019-01-24T19:26:58Z

OK understood. I was testing it from the perspective of an end user, in this case an app developer who just wants to cut down on the size of GraphQL strings in an app bundle or over the wire. In that case the implementation details (the call stack that gets invoked) don't matter that much, only the end result does.
I can understand and respect that a library maintainer may have a different perspective, where testing implementation details (every function and line in the call stack) and 100% coverage matter more. That wasn't apparent to me when I opened this PR.

langpavel · 2019-01-24T20:48:35Z

@rybon What is not covered:

First is public API, second expected behavior. None of both is implementation detail test.
NOTE: Testing implementation details is useless and really bad and unwanted practice. This is not the case

codecov-io · 2019-01-28T22:21:13Z

Codecov Report

❗ No coverage uploaded for pull request base (master@59d9f17). Click here to learn what that means.
The diff coverage is 99.23%.

@@           Coverage Diff            @@
##             master   #1628   +/-   ##
========================================
  Coverage          ?   98.6%           
========================================
  Files             ?     216           
  Lines             ?   13368           
  Branches          ?    1971           
========================================
  Hits              ?   13181           
  Misses            ?     187           
  Partials          ?       0

Impacted Files	Coverage Δ
src/index.js	`100% <ø> (ø)`
src/__fixtures__/index.js	`100% <100%> (ø)`
src/utilities/index.js	`100% <100%> (ø)`
src/utilities/__tests__/stripIgnoredTokens-test.js	`100% <100%> (ø)`
src/utilities/stripIgnoredTokens.js	`95.45% <95.45%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 59d9f17...b579351. Read the comment docs.

rybon · 2019-01-28T23:11:04Z

@langpavel done. Only line I can't seem to get covered is the continue; statement, could you help me with that?

rybon · 2019-02-22T10:49:39Z

@IvanGoncharov anything else that needs to be done for this PR to land?

IvanGoncharov · 2019-02-25T17:50:00Z

@rybon Sorry for the delay.
Lexer implementation that I added previously was just a proof of concept and not really future-proof.
Also based on feedback #1523 I think we should support striping of indentation inside block string.
Plus even though current test do 100% percent coverage it is still based on print tests some of don't make sense for this functionality and missing many other checks.

I totally understand that adding this feature taking too much time so I stoped development of all other features until I merge this one. I'm working on it in my free time but I expect it will take a couple days at most.

P.S. I need to rebase some commit and also add new ones so to prevent stupid situation with github (like what I did with Lexer commit) I will open separate PR.

@rybon

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

rybon · 2019-03-26T13:27:31Z

Just saw your PR (#1802). Great work! Thank you for your efforts, I really appreciate it.

@rybon

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

@rybon

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

@rybon

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

@rybon

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

@rybon

Heavily based on work done by @rybon in #1628. Solves #1523.

added condense feature to printer

75ddbc8

rybon changed the title ~~Added condense feature to printer to remove non-significant whitespace~~ Add condense feature to printer to remove non-significant whitespace Dec 24, 2018

docs

33e77eb

facebook-github-bot added the CLA Signed label Dec 24, 2018

rybon added 2 commits December 24, 2018 19:12

reorder, more docs

f2223d0

cleanup of test

30d99e3

rybon mentioned this pull request Dec 24, 2018

Add ast toString returning query string apollographql/graphql-tag#214

Open

1 task

jgcmarins reviewed Jan 2, 2019

View reviewed changes

IvanGoncharov closed this Jan 2, 2019

This comment has been minimized.

Sign in to view

IvanGoncharov reopened this Jan 2, 2019

Implementation based on Lexer

1046cf1

This comment has been minimized.

Sign in to view

rybon added 4 commits January 2, 2019 21:12

revert

5c65a60

added tests for stripIgnoredTokens

1657963

cleanup

9424811

added sdl tests

54db6e4

cleanup

2211ec5

rybon changed the title ~~Add condense feature to printer to remove non-significant whitespace~~ Add stripIgnoredTokens feature to remove non-significant whitespace Jan 2, 2019

IvanGoncharov added this to the v14.2.0 milestone Jan 16, 2019

test for Source

3d309ca

rybon added 2 commits January 28, 2019 23:30

test error condition

0f333c9

strip comment test

b579351

IvanGoncharov mentioned this pull request Jan 31, 2019

Add minify option to printSchema #685

Closed

glasser mentioned this pull request Feb 12, 2019

Feature request: strip whitespace from GraphQL queries / fragments / mutations / subscriptions #1523

Closed

glasser mentioned this pull request Mar 13, 2019

Full query response cache plugin apollographql/apollo-server#2437

Merged

IvanGoncharov added a commit to IvanGoncharov/graphql-js that referenced this pull request Mar 26, 2019

Add stripIgnoredCharacters function

7c07691

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

IvanGoncharov mentioned this pull request Mar 26, 2019

Add stripIgnoredCharacters utility function #1802

Merged

rybon closed this Mar 26, 2019

IvanGoncharov added a commit to IvanGoncharov/graphql-js that referenced this pull request Mar 26, 2019

Add stripIgnoredCharacters utility function

a9862a3

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

IvanGoncharov added a commit to IvanGoncharov/graphql-js that referenced this pull request Mar 26, 2019

Add stripIgnoredCharacters utility function

18c66cc

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

IvanGoncharov added a commit to IvanGoncharov/graphql-js that referenced this pull request Mar 26, 2019

Add stripIgnoredCharacters utility function

15441c9

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

IvanGoncharov added a commit to IvanGoncharov/graphql-js that referenced this pull request Mar 26, 2019

Add stripIgnoredCharacters utility function

328b065

Heavily based on work done by @rybon in graphql#1628. Solves graphql#1523.

IvanGoncharov added a commit that referenced this pull request Apr 3, 2019

Add stripIgnoredCharacters utility function

081db43

Heavily based on work done by @rybon in #1628. Solves #1523.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add stripIgnoredTokens feature to remove insignificant whitespace #1628

Add stripIgnoredTokens feature to remove insignificant whitespace #1628

rybon commented Dec 24, 2018 •

edited

facebook-github-bot commented Dec 24, 2018

facebook-github-bot commented Dec 24, 2018

rybon commented Dec 28, 2018

jgcmarins left a comment •

edited

IvanGoncharov commented Jan 2, 2019

rybon commented Jan 2, 2019

IvanGoncharov commented Jan 2, 2019 •

edited

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

IvanGoncharov commented Jan 2, 2019

rybon commented Jan 2, 2019

IvanGoncharov commented Jan 2, 2019

rybon commented Jan 2, 2019

langpavel commented Jan 17, 2019 •

edited

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019 •

edited

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019 •

edited

IvanGoncharov commented Jan 24, 2019 •

edited

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019 •

edited

codecov-io commented Jan 28, 2019 •

edited

rybon commented Jan 28, 2019

rybon commented Feb 22, 2019

IvanGoncharov commented Feb 25, 2019

rybon commented Mar 26, 2019 •

edited

Add stripIgnoredTokens feature to remove insignificant whitespace #1628

Add stripIgnoredTokens feature to remove insignificant whitespace #1628

Conversation

rybon commented Dec 24, 2018 • edited

facebook-github-bot commented Dec 24, 2018

facebook-github-bot commented Dec 24, 2018

rybon commented Dec 28, 2018

jgcmarins left a comment • edited

Choose a reason for hiding this comment

IvanGoncharov commented Jan 2, 2019

rybon commented Jan 2, 2019

IvanGoncharov commented Jan 2, 2019 • edited

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

IvanGoncharov commented Jan 2, 2019

rybon commented Jan 2, 2019

IvanGoncharov commented Jan 2, 2019

rybon commented Jan 2, 2019

langpavel commented Jan 17, 2019 • edited

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019 • edited

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019 • edited

IvanGoncharov commented Jan 24, 2019 • edited

rybon commented Jan 24, 2019

langpavel commented Jan 24, 2019 • edited

codecov-io commented Jan 28, 2019 • edited

Codecov Report

rybon commented Jan 28, 2019

rybon commented Feb 22, 2019

IvanGoncharov commented Feb 25, 2019

rybon commented Mar 26, 2019 • edited

rybon commented Dec 24, 2018 •

edited

jgcmarins left a comment •

edited

IvanGoncharov commented Jan 2, 2019 •

edited

langpavel commented Jan 17, 2019 •

edited

langpavel commented Jan 24, 2019 •

edited

langpavel commented Jan 24, 2019 •

edited

IvanGoncharov commented Jan 24, 2019 •

edited

langpavel commented Jan 24, 2019 •

edited

codecov-io commented Jan 28, 2019 •

edited

rybon commented Mar 26, 2019 •

edited