enh(haskell) BinaryLiterals, NumericUnderscores, HexFloatLiterals #3150

martijnbastiaan · 2021-04-19T08:01:01Z

Changes

Added support for the following Haskell language extensions:

It slightly over-approximates. For example, it would highlight 10_. If that's an issue, I'll put more effort in the PR. IMO this isn't really a problem though, as it would be a syntax error anyway.

Edit: sorry for all the force pushes, should be done now :)

Checklist

Added markup tests
Updated the changelog at CHANGES.md

joshgoebel

Looks pretty good, just needs a few changes!

joshgoebel · 2021-04-19T12:09:59Z

src/languages/haskell.js

+      }),
+
+      hljs.inherit(hljs.C_NUMBER_MODE, {
+        begin: '(-?)(\\b0[xX][a-fA-F0-9_]+|(\\b(\\d|_)+(\\.(\\d|_)*)?|\\.(\\d|_)+)([eE][-+]?(\\d|_)+)?)'


Generally we do not match - since it could often by unary or binary and we do not try and figure out such distinctions. (very hard to do also without look-behind)

Also please add additional tests for some of the e variants, etc.

Are there two variants here? (hex and regular?) It seems so. Please split them out into variants for readability and maintenance rather than using a more complex single regex. See many other grammars for examples.

Ah right, I just copied it from C_NUMBER_MODE. I'll make a few tests though :-)

Well, if it's exactly the same it of course shouldn't need to be copied at all... but if it's now specific to this language then we should clean it up into variants for future use - which is just the general policy on such things.

I'd suggest we could perhaps clean up C_NUMBER_MODE as well but too many grammars are likely dependent on it being exactly how it is now - but that doesn't mean we can't have nicer and easy to read matches in the individual grammars. :)

I'll add such thoughts to the v12 list. v11 is already big enough.

Ah let me clarify a bit: I copied it and then added |_ to optionally match the underscores.

I'm a bit fuzzy on what you want me to do; should I just make a clean version for Haskell numbers?

See https://github.com/highlightjs/highlight.js/blob/main/src/languages/swift.js#L121 for an example.

When we start adding custom numeric rules to grammars then we always do it as nicely as possible (for future readability and maintenance). That means separate variants for hex, binary, etc... (as you see in Swift) not just one large regex with multiple rules "hidden" inside.

Excellent. I'll get to it tonight. Thanks for the quick responses :)

martijnbastiaan · 2021-04-19T19:19:36Z

@joshgoebel I've rewritten Haskell's number parsing taking inspiration from Swift's implementation. I've included a whole bunch of test cases taken from the GHC documentation. As a happy coincidence, the highlighter now also supports the HexFloatLiterals language extension :). Let me know if there's anything more I can do.

joshgoebel · 2021-04-19T19:23:24Z

Awesome. This looks MUCH much closer to mergeable. :-)

Not sure what all the valid/invalid is trying to accomplish since we randomly highlight some invalids? Is this sample code just copied and pasted from GHC? If so, ok, but we should add a comment saying so with a link to the source to explain what's going on and that the invalid can mostly be ignored - and that all the valid should be highlighted.

If it's not then I'm not a direct copy/paste I'm sure what valid having any invalids serves?

martijnbastiaan · 2021-04-19T19:27:27Z

Yeah, you're right, leaving the invalids in is a bit silly. Removed!

src/languages/haskell.js

joshgoebel · 2021-04-19T19:47:07Z

Please remove the -? for consistency with every other grammar. We have no way to know if that - is unary or an operator, do we. I assume Haskell has the - operator for subtraction?

IE:

5-3

The - is NOT part of the number 3, it's an operator, correct?

martijnbastiaan · 2021-04-19T19:56:37Z

Good point!

The - is NOT part of the number 3, it's an operator, correct?

That's absolutely correct. I've updated the code to reflec this. I've also added a comment to the negative number tests, in case someone else stumbles upon it in the future. Could you check whether the comment is accurate?

src/languages/haskell.js

joshgoebel

Looks great other than my last question!

Restructured Haskell's number parsing, taking a cue from Swift's implementation. As a happy coincidence, this patch adds support for Haskell's `HexFloatLiterals` language extension.

joshgoebel · 2021-04-20T11:28:55Z

@martijnbastiaan Thanks for all the effort!

martijnbastiaan · 2021-04-20T11:59:16Z

@joshgoebel Thanks for all the patience! This is what makes me love OSS ❤️

martijnbastiaan force-pushed the haskell-numerical-underscores branch from f7bb34a to 4d1afc7 Compare April 19, 2021 08:01

enh(haskell) Add support for BinaryLiterals

e2151e3

martijnbastiaan force-pushed the haskell-numerical-underscores branch 2 times, most recently from cb8d5fe to b3f881e Compare April 19, 2021 08:05

martijnbastiaan changed the title ~~Add support for Haskell's NumericalUnderscores~~ Add support for Haskell's BinaryLiterals/NumericalUnderscores Apr 19, 2021

enh(haskell) Add support for NumericUnderscores

c55ad76

martijnbastiaan force-pushed the haskell-numerical-underscores branch from b3f881e to c55ad76 Compare April 19, 2021 08:21

martijnbastiaan changed the title ~~Add support for Haskell's BinaryLiterals/NumericalUnderscores~~ Add support for Haskell's BinaryLiterals/NumericUnderscores Apr 19, 2021

joshgoebel requested changes Apr 19, 2021

View reviewed changes

martijnbastiaan force-pushed the haskell-numerical-underscores branch 3 times, most recently from 249ba7f to c2b2c2b Compare April 19, 2021 19:18

martijnbastiaan force-pushed the haskell-numerical-underscores branch from c2b2c2b to 9afd66d Compare April 19, 2021 19:27

martijnbastiaan force-pushed the haskell-numerical-underscores branch from 9afd66d to 133b0aa Compare April 19, 2021 19:28

joshgoebel reviewed Apr 19, 2021

View reviewed changes

src/languages/haskell.js Outdated Show resolved Hide resolved

joshgoebel reviewed Apr 19, 2021

View reviewed changes

src/languages/haskell.js Outdated Show resolved Hide resolved

joshgoebel reviewed Apr 19, 2021

View reviewed changes

src/languages/haskell.js Outdated Show resolved Hide resolved

martijnbastiaan force-pushed the haskell-numerical-underscores branch from 133b0aa to afa912b Compare April 19, 2021 19:39

martijnbastiaan force-pushed the haskell-numerical-underscores branch from afa912b to 1a2a838 Compare April 19, 2021 19:55

joshgoebel reviewed Apr 19, 2021

View reviewed changes

src/languages/haskell.js Outdated Show resolved Hide resolved

joshgoebel approved these changes Apr 19, 2021

View reviewed changes

enh(haskell) Restucture Haskell number parsing

29479c1

Restructured Haskell's number parsing, taking a cue from Swift's implementation. As a happy coincidence, this patch adds support for Haskell's `HexFloatLiterals` language extension.

martijnbastiaan force-pushed the haskell-numerical-underscores branch from 1a2a838 to 29479c1 Compare April 20, 2021 07:10

Update CHANGES.md

6277cf4

joshgoebel changed the title ~~Add support for Haskell's BinaryLiterals/NumericUnderscores~~ enh(haskell) BinaryLiterals & NumericUnderscores Apr 20, 2021

joshgoebel changed the title ~~enh(haskell) BinaryLiterals & NumericUnderscores~~ enh(haskell) BinaryLiterals, NumericUnderscores, HexFloatLiterals Apr 20, 2021

Merge branch 'main' into haskell-numerical-underscores

5fbd326

joshgoebel approved these changes Apr 20, 2021

View reviewed changes

joshgoebel merged commit b6e5b1a into highlightjs:main Apr 20, 2021

martijnbastiaan deleted the haskell-numerical-underscores branch April 20, 2021 11:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

enh(haskell) BinaryLiterals, NumericUnderscores, HexFloatLiterals #3150

enh(haskell) BinaryLiterals, NumericUnderscores, HexFloatLiterals #3150

martijnbastiaan commented Apr 19, 2021 •

edited

joshgoebel left a comment

joshgoebel Apr 19, 2021

joshgoebel Apr 19, 2021

martijnbastiaan Apr 19, 2021

joshgoebel Apr 19, 2021

joshgoebel Apr 19, 2021

martijnbastiaan Apr 19, 2021

joshgoebel Apr 19, 2021

martijnbastiaan Apr 19, 2021

martijnbastiaan commented Apr 19, 2021

joshgoebel commented Apr 19, 2021

martijnbastiaan commented Apr 19, 2021

joshgoebel commented Apr 19, 2021

martijnbastiaan commented Apr 19, 2021

joshgoebel left a comment

joshgoebel commented Apr 20, 2021

martijnbastiaan commented Apr 20, 2021

enh(haskell) BinaryLiterals, NumericUnderscores, HexFloatLiterals #3150

enh(haskell) BinaryLiterals, NumericUnderscores, HexFloatLiterals #3150

Conversation

martijnbastiaan commented Apr 19, 2021 • edited

Changes

Checklist

joshgoebel left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

martijnbastiaan commented Apr 19, 2021

joshgoebel commented Apr 19, 2021

martijnbastiaan commented Apr 19, 2021

joshgoebel commented Apr 19, 2021

martijnbastiaan commented Apr 19, 2021

joshgoebel left a comment

Choose a reason for hiding this comment

joshgoebel commented Apr 20, 2021

martijnbastiaan commented Apr 20, 2021

martijnbastiaan commented Apr 19, 2021 •

edited