Improved Long-Double Number Policy #2674

ctasada · 2024-05-09T12:09:01Z

Purpose

Fixes a performance issue while using the ToNumberPolicy.LONG_OR_DOUBLE with Double values

Description

The Parsing of a Double value was always executing a Long.parseLong(value), which generated a NumberFormatException.

Identifying that a Number is a Double or a Long can be easily achieve (in a naive way) looking for the decimal separator.

This simple change avoids the extra NumberFormatException

A simple JUnit test, parsing a Long or a Double 10K times shows the next values:

Double (old parsing): ~42 ms
Double (new parsing): ~6 ms
Long (old parsing): ~7 ms
Long (new parsing): ~7 ms

As we can see, the parsing for Long values stays the same (±1ms), while the parsing for Double is dramatically improved.

Reducing the number of exceptions also has a positive side effect in memory consumption.

Checklist

New code follows the Google Java Style Guide
This is automatically checked by mvn verify, but can also be checked on its own using mvn spotless:check.
Style violations can be fixed using mvn spotless:apply; this can be done in a separate commit to verify that it did not cause undesired changes.
If necessary, new public API validates arguments, for example rejects null
New public API has Javadoc
- Javadoc uses @since $next-version$
  ( $next-version$ is a special placeholder which is automatically replaced during release)
If necessary, new unit tests have been added
- Assertions in unit tests use Truth, see existing tests
- No JUnit 3 features are used (such as extending class TestCase)
- If this pull request fixes a bug, a new test was added for a situation which failed previously and is now fixed
mvn clean verify javadoc:jar passes without errors

google-cla · 2024-05-09T12:09:05Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

The Parsing of a Double value was always executing a `Long.parseLong(value)`, which generated a `NumberFormatException`. Identifying that a Number is a Double or a Long can be easily achieve (in a naive way) looking for the decimal separator. This simple change avoids the extra `NumberFormatException` A simple JUnit test, parsing a `Long` or a `Double` 10K times shows the next values: * Double (old parsing): ~42 ms * Double (new parsing): ~6 ms * Long (old parsing): ~7 ms * Long (new parsing): ~7 ms As we can see, the parsing for `Long` values stays the same (±1ms), while the parsing for `Double` is dramatically improved. Reducing the number of exceptions also has a positive side effect in memory consumption.

Marcono1234

Thanks for this suggested improvement! What you are saying sounds reasonable to me, but I have not measured it yet myself.

@eamonnmcmanus, what do you think?

Marcono1234 · 2024-05-12T21:27:52Z

gson/src/main/java/com/google/gson/ToNumberPolicy.java

-      try {
-        return Long.parseLong(value);
-      } catch (NumberFormatException longE) {
+      if (value.contains(".")) {


Could also use indexOf('.') >= 0 here, which might be more efficient due to using a char. But I am not sure if that is really more efficient and if that is worth it.

@Marcono1234 the difference between contains(".") and indexOf('.') >= 0 is very small, and nearly negligible. But you're right, indexOf is slightly faster. I just pushed the tweak.

The usage of `indexOf(char)` is slightly faster

eamonnmcmanus

Thanks! The old code uses exceptions for control flow which is indeed bad, especially for performance. With this change we will usually not have to do that.

gson/src/main/java/com/google/gson/ToNumberPolicy.java

ctasada force-pushed the ctasada/improve-long-double-number-policy branch from 98cff1c to 8d4d95b Compare May 9, 2024 12:54

ctasada force-pushed the ctasada/improve-long-double-number-policy branch from 8d4d95b to e175533 Compare May 9, 2024 13:00

Marcono1234 reviewed May 12, 2024

View reviewed changes

Replace contains(".") by indexOf('.') >= 0

aa2db76

The usage of `indexOf(char)` is slightly faster

ctasada force-pushed the ctasada/improve-long-double-number-policy branch from 8dcfdd0 to aa2db76 Compare May 13, 2024 15:45

eamonnmcmanus requested changes May 13, 2024

View reviewed changes

gson/src/main/java/com/google/gson/ToNumberPolicy.java Show resolved Hide resolved

gson/src/main/java/com/google/gson/ToNumberPolicy.java Outdated Show resolved Hide resolved

Rename exception variables

cf87bf0

eamonnmcmanus approved these changes May 18, 2024

View reviewed changes

eamonnmcmanus merged commit 454a491 into google:main May 18, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved Long-Double Number Policy #2674

Improved Long-Double Number Policy #2674

ctasada commented May 9, 2024 •

edited

google-cla bot commented May 9, 2024

Marcono1234 left a comment

Marcono1234 May 12, 2024

ctasada May 13, 2024

eamonnmcmanus left a comment

Improved Long-Double Number Policy #2674

Improved Long-Double Number Policy #2674

Conversation

ctasada commented May 9, 2024 • edited

Purpose

Description

Checklist

google-cla bot commented May 9, 2024

Marcono1234 left a comment

Choose a reason for hiding this comment

Marcono1234 May 12, 2024

Choose a reason for hiding this comment

ctasada May 13, 2024

Choose a reason for hiding this comment

eamonnmcmanus left a comment

Choose a reason for hiding this comment

ctasada commented May 9, 2024 •

edited