
ENH: special.rel_entr: avoid premature overflow #20710

Closed
mdhaber opened this issue May 14, 2024 · 4 comments · Fixed by #20816
Labels: enhancement (A new feature or improvement), scipy.special

mdhaber commented May 14, 2024

Describe your issue.

Looks like there's an opportunity for improvement in scipy.special.rel_entr. Presumably it takes the ratio x/y before taking the logarithm because that is faster than taking the difference of two logs, but the ratio can overflow even when the correct result is finite. Perhaps a compromise is to check for overflow and, only in that case, take the time to compute the two logs separately.

This came up in gh-20673 as a Hypothesis counterexample rather than a real use case, which is why I consider it an enhancement request.

Reproducing Code Example

import numpy as np
from scipy import special

x = np.asarray([2., 3., 4.])
y = np.asarray(np.finfo(np.float64).tiny)  # smallest normal float64
special.rel_entr(x, y)       # array([1418.17913143, 2128.48509246,           inf])
x * (np.log(x) - np.log(y))  # array([1418.17913143, 2128.48509246, 2839.13085157])

Error message

-

SciPy/NumPy/Python version and system information

main
mdhaber added the enhancement and scipy.special labels on May 14, 2024

fancidev commented May 25, 2024

It is clearly desirable to avoid premature overflow.

If x and y are greater than one, taking the log and then subtracting might reduce the accuracy, as the domain of logarithm is “larger” than its image. On the other hand, accuracy is not compromised if the arguments are less than one, which may be typical if rel_entr is applied to two probability values.

@nickodell

@fancidev

If x and y are greater than one, taking the log and then subtracting might reduce the accuracy, as the domain of logarithm is “larger” than its image. On the other hand, accuracy is not compromised if the arguments are less than one, which may be typical if rel_entr is applied to two probability values.

I checked this idea experimentally, and in general the division approach has better accuracy than the logarithm approach: with three exceptions, the x * log(x / y) approach has equal or better accuracy than the x * (log(x) - log(y)) approach.

The three exceptions:

  1. Overflow. If x / y == inf, then log subtraction is more accurate. (This is the lower-right white area of the graph below - here error is infinite for the ratio approach.)
  2. Underflow. If x / y == 0, then log subtraction is more accurate. (This is the upper-left white area of the graph below - here error is infinite for the ratio approach.)
  3. Subnormal numbers. If x / y is a subnormal number, then log subtraction is more accurate. (This is the diagonal band of yellow squares above the main diagonal. This has finite error, but is still quite inaccurate.)

Other than these three exceptions, the division approach is more accurate.
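The overflow and underflow cases are easy to reproduce directly (a small standalone demo of my own, not taken from the notebook; float64 throughout):

```python
import numpy as np

tiny = np.finfo(np.float64).tiny  # smallest normal float64, ~2.225e-308

# Case 1: overflow -- x / y rounds to inf, so x * log(x / y) is inf
x, y = 4.0, tiny
assert np.isinf(x / y)
print(x * (np.log(x) - np.log(y)))  # finite, ~2839.13

# Case 2: underflow -- x / y rounds to exactly 0, so x * log(x / y) is -inf
x, y = tiny, 1e20
assert x / y == 0.0
print(x * (np.log(x) - np.log(y)))  # finite (a small negative number)
```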

Here's a comparison of the error of the two methods:

[Graph: per-cell ULP error of the ratio method, x * log(x / y)]

[Graph: per-cell ULP error of the log-difference method, x * (log(x) - log(y))]
(Note: Errors have been clipped at 95% to make the graphs readable. Some cells have thousands of ULP of error. See full notebook for details.)

With that in mind, here is the logic I would propose:

# assume all of the corner cases have already been handled
ratio = x / y
if ratio == np.inf or ratio < np.finfo(np.float64).tiny:
    return x * (np.log(x) - np.log(y))
else:
    return x * np.log(ratio)

The ratio == inf check handles case 1. The ratio < np.finfo(np.float64).tiny check handles cases 2 and 3.

If you look only at the area where x < 1 and y < 1, which is the bottom-left quadrant of the graph, then the x * log(x / y) approach still has equal or better accuracy than the x * (log(x) - log(y)) approach.
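For reference, a vectorized version of this branch logic might look like the following (my own sketch, not code from the notebook; `rel_entr_np` is a hypothetical name, and x > 0, y > 0 is assumed so the usual corner cases are handled elsewhere):

```python
import numpy as np

def rel_entr_np(x, y):
    # Hypothetical vectorized helper (my naming); assumes x > 0 and y > 0,
    # i.e. the usual corner cases of rel_entr are handled elsewhere.
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    tiny = np.finfo(np.float64).tiny
    with np.errstate(over='ignore', divide='ignore'):
        ratio = x / y
        direct = x * np.log(ratio)               # fast path
        fallback = x * (np.log(x) - np.log(y))   # safe path
    bad = np.isinf(ratio) | (ratio < tiny)       # over/underflow or subnormal ratio
    return np.where(bad, fallback, direct)

x = np.array([2., 3., 4.])
y = np.finfo(np.float64).tiny
print(rel_entr_np(x, y))  # all finite, matching x * (np.log(x) - np.log(y))
```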

Notebook source

@fancidev

Very interesting analysis @nickodell! I suspect the "clipped" values with large error could be related to x and y being close.

To illustrate this, I tried a contrived example:

import numpy as np
from mpmath import mp
mp.dps = 1000
x = 1234
y = np.nextafter(x, 100)
print(f'{np.log(y/x)=}')
print(f'{np.log(y) - np.log(x)=}')
print(f'{float(mp.log(y)-mp.log(x))=}')
print(f'{float(mp.log(mp.mpf(y)/x))=}')
print(f'{np.log1p((y-x)/x)=}')

Output:

np.log(y/x)=-2.2204460492503136e-16
np.log(y) - np.log(x)=-8.881784197001252e-16
float(mp.log(y)-mp.log(x))=-1.842574355293615e-16
float(mp.log(mp.mpf(y)/x))=-1.842574355293615e-16
np.log1p((y-x)/x)=-1.842574355293615e-16

So for those two numbers that are really close, log(y/x) is off by 20%, log(y)-log(x) is off by 380%, and log1p gives the correct answer.

The contrived example shows that if we insist on the accuracy from a numerical perspective, there's a fair bit of work to do.

@nickodell

That's a really smart idea. Here's another graph of function error for the log1p method. (I used np.log1p((x - y) / y) rather than np.log1p((y-x)/x), though.)

[Graph: per-cell ULP error of the log1p method, x * log1p((x - y) / y)]

So I could piece together a more accurate rel_entr() from three sources: log1p for the diagonal, where x and y are very close; the log difference where there is under/overflow; and the ratio everywhere else.
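A rough scalar sketch of that piecing-together (my own illustration, not the eventual PR's code; `rel_entr_sketch` is a hypothetical name, the 0.5–2 window for the log1p branch is an arbitrary illustrative threshold, and x > 0, y > 0 is assumed):

```python
import numpy as np

def rel_entr_sketch(x, y):
    # Hypothetical helper: assumes x > 0 and y > 0 (corner cases handled elsewhere).
    ratio = x / y
    if np.isinf(ratio) or ratio < np.finfo(np.float64).tiny:
        # over/underflow, or a subnormal ratio: fall back to the log difference
        return x * (np.log(x) - np.log(y))
    if 0.5 < ratio < 2.0:
        # x and y are close: log1p avoids the cancellation seen in log(ratio)
        return x * np.log1p((x - y) / y)
    return x * np.log(ratio)
```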
