Why default to summary rather than histogram? #460

sp1rs · 2022-08-25T15:37:34Z

What is the reason behind converting the metric to a summary rather than a histogram by default?

SuperQ · 2022-08-25T16:05:30Z

Probably historical. A number of Prometheus things from the very early days defaulted to Summary.

glightfoot · 2022-08-25T16:14:47Z

Hey, that's a good question. Histograms in prometheus have a few main disadvantages that prevent them from being useful for statsd by default. The first and biggest downside is that histograms require some knowledge of what's being measured and the expected distribution in order to set decent bucket boundaries. Imagine you have timings that are expected to measure around a few milliseconds, and another set of timings that cluster around a few seconds. With a generic histogram using the default buckets, neither of these sets of timings would produce accurate data in default histograms. However, if you know these distributions, you can create buckets that will allow you to get meaningful percentiles.

Second, histograms have a higher cardinality than summaries, especially if you try to measure something with a wide distribution of values. Given we don't know what kind of timings people will send, in order to have meaningful histograms by default in the statsd exporter, we'd need a very wide set of buckets. This causes more load on prometheus.

Finally, summaries are accurate and produce meaningful data out of the box for any timing*, regardless of the distribution, since they directly calculate percentiles. Histograms use a linear estimation between bucket boundaries to get a percentile value, which inherently has error baked in that some people don't necessarily consider. This may change once prometheus supports sparse histograms, which significantly improve on these limitations.

Assuming there are frequent enough timings being sent to be able to sample them.

TL;DR Summaries are cheaper and more accurate for unknown distributions than histograms, which currently require some knowledge of the expected distribution.

SuperQ · 2022-08-25T18:04:31Z

The big down side of Summaries is that they can't be aggregated. If you have more than one statsd_exporter receiving data from the same app(s). The data will be essentially useless.

matthiasr · 2022-08-27T06:42:51Z

I thought about changing the default in the past but never tackled that.

With Histograms v2 in the works, I would rather not change the default now – they will alleviate a lot of the "must pick buckets" pain, and if we can make one breaking change rather than multiple all the better.

pedro-stanaka · 2024-02-19T21:58:44Z

Now that native histograms are more stable I would +1 here to make this default in the next major release. I have been using it as default and can just recommend the level of detail you get is impressive.

matthiasr · 2024-03-03T21:31:20Z

They're "more stable" but still experimental 😅 We still need a text format (prometheus/proposals#32), and it's behind a feature flag in Prometheus itself. Let's wait until it is really stable 😉

matthiasr added enhancement question labels Aug 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why default to summary rather than histogram? #460

Why default to summary rather than histogram? #460

sp1rs commented Aug 25, 2022 •

edited

SuperQ commented Aug 25, 2022

glightfoot commented Aug 25, 2022

SuperQ commented Aug 25, 2022

matthiasr commented Aug 27, 2022

pedro-stanaka commented Feb 19, 2024

matthiasr commented Mar 3, 2024

Why default to summary rather than histogram? #460

Why default to summary rather than histogram? #460

Comments

sp1rs commented Aug 25, 2022 • edited

SuperQ commented Aug 25, 2022

glightfoot commented Aug 25, 2022

SuperQ commented Aug 25, 2022

matthiasr commented Aug 27, 2022

pedro-stanaka commented Feb 19, 2024

matthiasr commented Mar 3, 2024

sp1rs commented Aug 25, 2022 •

edited