validate the length of names #1078

pjfanning · 2023-08-08T14:39:49Z

so far just a POC that only works with the UTF8StreamJsonParser and possibly NonBlockingUtf8JsonParsers
start with 50,000 char limit on names (as a default, that can be adjusted by users)
if this approach is ok, the other JsonParser implementations can be given the equivalent checks
- I have added some checks to the ReaderBasedJsonParser but we need more checks (while the name is being streamed)
see Add configurable limit for the maximum length of Object property names to parse before failing (default max: 50,000 chars) #1047

pjfanning · 2023-08-21T21:13:57Z

@cowtowncoder are these changes on the right track?

src/main/java/com/fasterxml/jackson/core/StreamReadConstraints.java

cowtowncoder · 2023-08-22T21:46:52Z

src/main/java/com/fasterxml/jackson/core/json/ReaderBasedJsonParser.java

                    _inputPtr = ptr+1; // to skip the quote
-                    return _symbols.findSymbol(_inputBuffer, start, ptr - start, hash);
+                    final int len = ptr - start;
+                    _streamReadConstraints.validateNameLength(len);


Ideally we should only (have to) check length when adding new name in CharsToNameCanonicalizer (and byte-based counterpart); not for every time we decode a name.

But to do that would need to pass constraints... that might be doable in makeChild() method. Maybe adding new variant that takes JsonFactory, instead of flags.

src/main/java/com/fasterxml/jackson/core/json/UTF8StreamJsonParser.java

cowtowncoder · 2023-08-22T21:51:13Z

@pjfanning yes, this is pretty much along the lines I was thinking. The main concern would be trying to avoid checks for char-based names on every decoding -- seems like it'd make sense to move check into CharsToNameCanonicalizer but that requires some plumbing to pass StreamReadConstraints.

For byte/quad-based decoding I think checks on array expansion make sense; just need to pass byte-size, not quad size.

pjfanning · 2023-08-23T12:54:38Z

@pjfanning yes, this is pretty much along the lines I was thinking. The main concern would be trying to avoid checks for char-based names on every decoding -- seems like it'd make sense to move check into CharsToNameCanonicalizer but that requires some plumbing to pass StreamReadConstraints.

For byte/quad-based decoding I think checks on array expansion make sense; just need to pass byte-size, not quad size.

I created #1086

cowtowncoder · 2023-08-25T03:04:24Z

Should have asked this before but... any chance to get StreamReadConstraints additions in a separate PR first, to get merged to master -- and then the rest separately?
I think the first set of changes follows a pattern and is fine as-is so I can merge this, leaving the rest easier to review (and merge too)

EDIT: I'll merge StreamReadConstraints manually, easy enough to pick.

src/main/java/com/fasterxml/jackson/core/json/async/NonBlockingUtf8JsonParserBase.java

cowtowncoder · 2023-08-26T03:01:27Z

@pjfanning Actually, I think this is good -- I can make minor changes after merge as I see fit. So if you think this is ready, please change to regular PR from draft?

pjfanning · 2023-08-27T19:39:49Z

@cowtowncoder should I move some of the UTF8StreamJsonParser parser checks to addName? The code there is a bit different because the ByteQuadsCanonicalizer doesn't validate the name length.

cowtowncoder · 2023-08-28T05:15:04Z

src/main/java/com/fasterxml/jackson/core/json/UTF8StreamJsonParser.java

            }
            quads[qlen++] = currQuad;
        }
+        _streamReadConstraints.validateNameLength(qlen << 2);


Yes, this could be instead moved to inside addName() which is only called if findName() does not find already canonicalized name.

cowtowncoder

LGTM, will merge tomorrow (merging to master/3.0 will take some time)

cowtowncoder · 2023-08-28T05:18:02Z

@cowtowncoder should I move some of the UTF8StreamJsonParser parser checks to addName? The code there is a bit different because the ByteQuadsCanonicalizer doesn't validate the name length.

Yes, I think moving those couple of cases to only validate in addName() and not earlier would make sense.
Theoretically could refactor ByteQuadsCanonicalizer to take in validation similar to char-based one, but that's more work and not necessarily better.

pjfanning · 2023-08-28T18:17:02Z

@cowtowncoder should I move some of the UTF8StreamJsonParser parser checks to addName? The code there is a bit different because the ByteQuadsCanonicalizer doesn't validate the name length.

Yes, I think moving those couple of cases to only validate in addName() and not earlier would make sense. Theoretically could refactor ByteQuadsCanonicalizer to take in validation similar to char-based one, but that's more work and not necessarily better.

I moved the check to addName().

cowtowncoder · 2023-08-29T04:50:37Z

Thank you again @pjfanning ! This time merging to master went VERY smoothly, somehow. Good job!
I made some minor tweaks to naming but essentially implementation looks straight-forward.
Over time should add similar verification to Smile, CBOR, YAML, XML, Properties. But for now great to have it for JSON.

pjfanning marked this pull request as draft August 8, 2023 14:40

cowtowncoder reviewed Aug 22, 2023

View reviewed changes

src/main/java/com/fasterxml/jackson/core/StreamReadConstraints.java Outdated Show resolved Hide resolved

cowtowncoder reviewed Aug 22, 2023

View reviewed changes

src/main/java/com/fasterxml/jackson/core/json/UTF8StreamJsonParser.java Outdated Show resolved Hide resolved

pjfanning force-pushed the name-len branch from 620e6e9 to 50e1c44 Compare August 23, 2023 12:39

pjfanning mentioned this pull request Aug 23, 2023

track JsonFactory in CharsToNameCanonicalizer #1086

Closed

cowtowncoder mentioned this pull request Aug 24, 2023

Refactor construction and use of CharsToNameCanonicalizer #1088

Merged

pjfanning force-pushed the name-len branch from 50e1c44 to 3923c37 Compare August 24, 2023 07:11

cowtowncoder added a commit that referenced this pull request Aug 26, 2023

Merge part of #1078 (StreamReadConstraints changes)

d40e548

cowtowncoder reviewed Aug 26, 2023

View reviewed changes

src/main/java/com/fasterxml/jackson/core/json/async/NonBlockingUtf8JsonParserBase.java Outdated Show resolved Hide resolved

cowtowncoder mentioned this pull request Aug 26, 2023

Remove BufferRecyclers.SYSTEM_PROPERTY_TRACK_REUSABLE_BUFFERS functionality from 3.0 #1090

Closed

pjfanning added 8 commits August 27, 2023 20:19

Create LargeNameReadTest.java

faaf3a1

wip

6842677

add test

58209c3

Update NonBlockingUtf8JsonParserBase.java

73a3623

reader check

0cb69a2

non blocking test

3dc7287

change name check

3f0c5bc

move some reader checks

d16b08c

pjfanning force-pushed the name-len branch from c5a902d to d16b08c Compare August 27, 2023 19:20

pjfanning marked this pull request as ready for review August 27, 2023 19:22

pjfanning changed the title ~~[DRAFT] validate the length of names~~ validate the length of names Aug 27, 2023

pjfanning added 2 commits August 27, 2023 20:26

remove some checks

f5bc2b4

move name len check (due to review comment)

7788d49

cowtowncoder reviewed Aug 28, 2023

View reviewed changes

cowtowncoder approved these changes Aug 28, 2023

View reviewed changes

Update UTF8StreamJsonParser.java

1d11c4a

cowtowncoder approved these changes Aug 29, 2023

View reviewed changes

cowtowncoder merged commit bc8433b into FasterXML:2.16 Aug 29, 2023
5 checks passed

pjfanning deleted the name-len branch August 29, 2023 07:54

cowtowncoder mentioned this pull request Aug 30, 2023

Add configurable limit for the maximum length of Object property names to parse before failing (default max: 50,000 chars) #1047

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

validate the length of names #1078

validate the length of names #1078

pjfanning commented Aug 8, 2023 •

edited

pjfanning commented Aug 21, 2023

cowtowncoder Aug 22, 2023

cowtowncoder commented Aug 22, 2023

pjfanning commented Aug 23, 2023

cowtowncoder commented Aug 25, 2023 •

edited

cowtowncoder commented Aug 26, 2023

pjfanning commented Aug 27, 2023

cowtowncoder Aug 28, 2023

cowtowncoder left a comment

cowtowncoder commented Aug 28, 2023

pjfanning commented Aug 28, 2023

cowtowncoder commented Aug 29, 2023

validate the length of names #1078

validate the length of names #1078

Conversation

pjfanning commented Aug 8, 2023 • edited

pjfanning commented Aug 21, 2023

cowtowncoder Aug 22, 2023

Choose a reason for hiding this comment

cowtowncoder commented Aug 22, 2023

pjfanning commented Aug 23, 2023

cowtowncoder commented Aug 25, 2023 • edited

cowtowncoder commented Aug 26, 2023

pjfanning commented Aug 27, 2023

cowtowncoder Aug 28, 2023

Choose a reason for hiding this comment

cowtowncoder left a comment

Choose a reason for hiding this comment

cowtowncoder commented Aug 28, 2023

pjfanning commented Aug 28, 2023

cowtowncoder commented Aug 29, 2023

pjfanning commented Aug 8, 2023 •

edited

cowtowncoder commented Aug 25, 2023 •

edited