seq: use augmenting addition to update value #2615

jfinkels · 2021-08-29T12:06:29Z

This pull request fixes a floating-point precision issue with the update in the main loop of seq. Before this commit, the behavior was

$ seq 0.1 -0.1 -0.2
0.1
0.0
-0.1

which was incorrect: -0.2 should be printed as well. This was happening
because the update in the main loop was essentially

value = start + (i * increment);

which caused value to become -0.20000000000000004. That exceeds the lower bound of -0.2, so that number was not printed. By changing the update to

value += increment;

we eliminate this issue.

tertsdiepraam · 2021-08-29T14:10:45Z

Very interesting! This does indeed mitigate the issue, but it's not quite solved yet. I just tried:

seq 0 0.1 10000.1

and GNU includes 10000.1 while we (including this PR) don't.

I checked out the FreeBSD implementation and they use the multiplication method with an additional check for the last value:

/*
* Did we miss the last value of the range in the loop above?
*
* We might have, so check if the printable version of the last
* computed value ('cur') and desired 'last' value are equal.  If they
* are equal after formatting truncation, but 'cur' and
* 'last_shown_value' are not equal, it means the exit condition of the
* loop held true due to a rounding error and we still need to print
* 'last'.
*/
asprintf(&cur_print, fmt, cur);
asprintf(&last_print, fmt, last);
if (strcmp(cur_print, last_print) == 0 && cur != last_shown_value) {
    fputs(last_print, stdout);
    fputs(sep, stdout);
}

I would imagine that using the multiplication method would also minimize drifting of the values, although I'm not sure how big of a problem that is.

miDeb · 2021-08-29T14:51:27Z

AFAIK gnu uses long doubles (80 bit floats on x86), which are not available in rust.

jfinkels · 2021-08-29T14:55:31Z

So maybe the FreeBSD approach can offer a bit more protection, but we may not be completely safe against this type of incompatibility.

blyxxyz · 2021-08-29T16:11:42Z

Arbitrary-precision decimal arithmetic might be ideal for correctness, because all the input is decimal to begin with. Maybe with this crate.

That would make it trickier to support the --format option for printf-style float formatting, but numbers could be converted to floats in that case.

src/uu/seq/src/seq.rs

jfinkels · 2021-09-02T01:24:25Z

I have updated this pull request to use the bigdecimal crate. However, I've introduced a regression in the test for NaN arguments. Suggestions welcome, see my line comment.

jfinkels · 2021-09-09T02:31:55Z

I just updated this pull request with support for the NaN error message, but @blyxxyz pointed out that there are some further use cases (namely, inf and -inf arguments) that will see a regression with the code as it is today. I'll add more test cases to cover those situations and I'll ensure that the changes I propose don't reduce the functionality of seq.

jfinkels · 2021-09-10T02:17:50Z

Pull request #2647 adds two missing unit test cases that witness our support of seq 0 inf and seq -inf 0. I'll include those changes in this branch in order to ensure that this pull request does not cause a regression.

Change from using `print!()` to using `stdout.write_all()` in order to allow the main function to handle broken pipe errors gracefully.

Create the new `number.rs` module to contain the `Number` enumeration and the trait implementation for parsing a `Number` from a string.

Fix a floating-point precision issue with the update in the main loop of `seq`. Before this commit, the behavior was $ seq 0.1 -0.1 -0.2 0.1 0.0 -0.1 which was incorrect: -0.2 should be printed as well. This was happening because the update in the main loop was essentially value = start + (i * increment); which caused `value` to become `-0.20000000000000004`. That exceeds the lower bound of -0.2, so that number was not printed. By using the `bigdecimal` crate to represent floats of (practically) arbitrary precision, we eliminate the issue.

jfinkels · 2021-09-11T01:07:36Z

Hmm... I'm not sure how to do this without having a BigDecimal implementation that also supports positive and negative infinity.

blyxxyz

I think changing RangeBigDecimal into (BigDecimal, BigDecimal, Option<BigDecimal>) could work. With something like this logic:

If the start or increment is inf or -inf, abort. GNU seq supports it but it's not really useful for anything.
If the end is inf and the increment is positive or the end is -inf and the increment is negative set the end to None and just loop endlessly.
If the end is -inf and the increment is negative or the end is inf and the increment is positive, stop immediately without any output.

src/uu/seq/src/seq.rs

jfinkels · 2021-09-14T01:42:04Z

I created separate pull requests for two of the suggested changes and made all of the suggested changes except:

I think changing RangeBigDecimal into (BigDecimal, BigDecimal, Option<BigDecimal>) could work. With something like this logic: [...]

I'll try that next. Thanks.

jfinkels · 2021-09-25T01:54:15Z

I'm going to approach this from a different angle. I'll open a new pull request when I have something. Thanks for the feedback on this.

jfinkels commented Sep 2, 2021

View reviewed changes

src/uu/seq/src/seq.rs Outdated Show resolved Hide resolved

jfinkels force-pushed the seq-float-precision-bounds branch from 66c0fee to b79e93b Compare September 9, 2021 02:22

jfinkels added 3 commits September 10, 2021 20:00

seq: use stdout.write_all() instead of print!()

96b8616

Change from using `print!()` to using `stdout.write_all()` in order to allow the main function to handle broken pipe errors gracefully.

seq: move number parsing to its own module

278dfd4

Create the new `number.rs` module to contain the `Number` enumeration and the trait implementation for parsing a `Number` from a string.

jfinkels force-pushed the seq-float-precision-bounds branch from b79e93b to 02366e6 Compare September 11, 2021 01:07

blyxxyz suggested changes Sep 11, 2021

View reviewed changes

src/uu/seq/src/seq.rs Outdated Show resolved Hide resolved

src/uu/seq/src/seq.rs Outdated Show resolved Hide resolved

src/uu/seq/src/seq.rs Show resolved Hide resolved

src/uu/seq/src/seq.rs Show resolved Hide resolved

src/uu/seq/src/seq.rs Outdated Show resolved Hide resolved

fixup! seq: use bigdecimal for floating point precision

cad3218

jfinkels mentioned this pull request Sep 15, 2021

seq: remove --terminator/-t option #2677

Closed

jfinkels closed this Sep 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

seq: use augmenting addition to update value #2615

seq: use augmenting addition to update value #2615

jfinkels commented Aug 29, 2021

tertsdiepraam commented Aug 29, 2021 •

edited

miDeb commented Aug 29, 2021

jfinkels commented Aug 29, 2021

blyxxyz commented Aug 29, 2021

jfinkels commented Sep 2, 2021

jfinkels commented Sep 9, 2021

jfinkels commented Sep 10, 2021

jfinkels commented Sep 11, 2021

blyxxyz left a comment

jfinkels commented Sep 14, 2021

jfinkels commented Sep 25, 2021

seq: use augmenting addition to update value #2615

seq: use augmenting addition to update value #2615

Conversation

jfinkels commented Aug 29, 2021

tertsdiepraam commented Aug 29, 2021 • edited

miDeb commented Aug 29, 2021

jfinkels commented Aug 29, 2021

blyxxyz commented Aug 29, 2021

jfinkels commented Sep 2, 2021

jfinkels commented Sep 9, 2021

jfinkels commented Sep 10, 2021

jfinkels commented Sep 11, 2021

blyxxyz left a comment

Choose a reason for hiding this comment

jfinkels commented Sep 14, 2021

jfinkels commented Sep 25, 2021

tertsdiepraam commented Aug 29, 2021 •

edited