fix: LIKE expression with invalid string literals returns a parse error instead of panicking #1046

Cali0707 · 2024-05-01T14:52:12Z

Fixes #931

When visiting the nodes of the AST while parsing the LIKE expression, it looks like we assumed that we would have a valid string literal. However, it is possible that there is no valid string literal, in which case we have to handle that and return an error rather than continuing to parse (and subsequently panicking)

…or instead of panic Signed-off-by: Calum Murray <cmurray@redhat.com>

Cali0707 · 2024-05-01T14:52:30Z

cc @duglin @pierDipi @lionelvillard

pierDipi

LGTM

For reference, here's the spec for like-operation [ref]

like-operation ::= expression not-operator? like-operator string-literal

string-literal ::= ( "'" ( [^'] | "\'" )* "'" ) | ( '"' ( [^"] | '\"' )* '"')

duglin · 2024-05-03T14:06:28Z

sql/v2/test/tck/like_expression.yaml

@@ -115,4 +115,11 @@ tests:
    result: false
  - name: With type coercion from bool (4)
    expression: "FALSE LIKE 'fal%'"
-    result: true 
+    result: true


Just wondering, in cases where the first operand is a string, does the spec mandate that we do a string insensitive compare? I see where it talks about doing so for ceSQL keywords, but not values. Did I miss it? Should we add testcases?

actually, I'm not sure if this is covered in the spec. Perhaps we can open an issue in the spec repo and clarify what should happen?

I imagine if we had a testcase FALSE LIKE 'FAL%', the test would fail - but maybe the spec should clarify whether or not that should actually happen

@duglin thinking about this some more, what is happening here is that the FALSE value is coerced to a string first, and then the string comparison is made.

Looking at the spec, I think there are two parts we need to clarify (I'll open a PR in a bit):

That the LIKE expression casts the left operand to a string before evaluating the comparison

That boolean values cast to lower case strings "true" and "false"

I was actually more wondering whether: "foo" = "FOO" is meant to return true or false

Oh, I think that would return false. My understanding is that the case insensitivity is only for the CESQL keywords, not for the actual values

From a "spec complete" perspective, is that what we want? Or is it expected that everyone will use a lower() type of built-in function if they need it? Which is fine, I just wanted to double check

Personally, I think having case sensitive values is easier to understand then the other way around. Normally in SQL I would expect that a string comparison on values would only be true if they match exactly, and would use something like lower() or upper() if I wanted a case insenstive comparison. @pierDipi not sure if you have any other ideas here

I've been using mySQL for my xRegistry impl and found out that it does case insensitive compares by default. Go figure! I expected the opposite, like you :-) See: https://makingdatameaningful.com/is-sql-case-sensitive/

I'm ok either way the spec decides to go, but I think it should state its decision explicitly so it's clear. Today, it's kind of implied by the presence of the lower() and upper() funcs.

Ah I've only ever used postgres which is case sensitive by default! I agree, let's clarify in the spec that string comparison is case sensitive instead of just implying it with the LOWER() and UPPER() functions.

duglin · 2024-05-03T14:09:13Z

Aside from my minor question, LGTM

sql/v2/test/tck/like_expression.yaml

Cali0707 · 2024-05-14T16:08:41Z

cc @embano1

duglin · 2024-05-14T23:16:12Z

do we have a testcase for this?

Cali0707 · 2024-05-15T14:11:48Z

do we have a testcase for this?

@duglin yes, the new tck test in this PR runs and passes with this change, but prior to the change it would panic

fix: LIKE expression with invalid string literals returns a parse err…

d22deae

…or instead of panic Signed-off-by: Calum Murray <cmurray@redhat.com>

Cali0707 requested a review from a team as a code owner May 1, 2024 14:52

pierDipi approved these changes May 2, 2024

View reviewed changes

duglin reviewed May 3, 2024

View reviewed changes

pierDipi reviewed May 9, 2024

View reviewed changes

sql/v2/test/tck/like_expression.yaml Show resolved Hide resolved

This was referenced May 9, 2024

test: verify that invalid string literals in LIKE expression are a parse error cloudevents/spec#1280

Merged

clarify type casting in CESQL spec cloudevents/spec#1281

Open

Merge branch 'main' into fix-like-expression-panic-on-parse

ff47a1c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: LIKE expression with invalid string literals returns a parse error instead of panicking #1046

fix: LIKE expression with invalid string literals returns a parse error instead of panicking #1046

Cali0707 commented May 1, 2024

Cali0707 commented May 1, 2024

pierDipi left a comment •

edited

duglin May 3, 2024

Cali0707 May 3, 2024

Cali0707 May 3, 2024

Cali0707 May 9, 2024

duglin May 14, 2024

Cali0707 May 15, 2024

duglin May 15, 2024

Cali0707 May 15, 2024

duglin May 15, 2024

Cali0707 May 15, 2024

duglin commented May 3, 2024

Cali0707 commented May 14, 2024

duglin commented May 14, 2024

Cali0707 commented May 15, 2024

fix: LIKE expression with invalid string literals returns a parse error instead of panicking #1046

Are you sure you want to change the base?

fix: LIKE expression with invalid string literals returns a parse error instead of panicking #1046

Conversation

Cali0707 commented May 1, 2024

Cali0707 commented May 1, 2024

pierDipi left a comment • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

duglin commented May 3, 2024

Cali0707 commented May 14, 2024

duglin commented May 14, 2024

Cali0707 commented May 15, 2024

pierDipi left a comment •

edited