Receiver: cache matchers for series calls #7353

Open · wants to merge 2 commits into main from store-proxy-cache-matchers
Conversation

@pedro-stanaka (Contributor) commented May 13, 2024

Summary

We have tried caching matchers before with a time-based expiration cache; this time we are trying an LRU cache.

We saw some of our receivers busy compiling regexes, with high CPU usage similar to the profile of the benchmark I added here:

(screenshot: CPU profile showing time spent compiling regexes)

Benchmark results

Result on store-proxy-cache-matchers:

```
BenchmarkProxySeriesRegex-11    1545795    768.7 ns/op    1144 B/op    19 allocs/op
BenchmarkProxySeriesRegex-11    1548040    769.4 ns/op    1144 B/op    19 allocs/op
BenchmarkProxySeriesRegex-11    1545019    778.3 ns/op    1144 B/op    19 allocs/op
BenchmarkProxySeriesRegex-11    1539387    771.1 ns/op    1144 B/op    19 allocs/op
```

Result on main:

```
BenchmarkProxySeriesRegex-11     130292    8803 ns/op    10288 B/op    78 allocs/op
BenchmarkProxySeriesRegex-11     124045    8533 ns/op    10288 B/op    78 allocs/op
BenchmarkProxySeriesRegex-11     125092    8712 ns/op    10288 B/op    78 allocs/op
BenchmarkProxySeriesRegex-11     120110    8676 ns/op    10288 B/op    78 allocs/op
```

The results indicate that the "store-proxy-cache-matchers" branch considerably outperforms the "main" branch in all observed aspects of the BenchmarkProxySeriesRegex function: it is roughly 10 times faster in execution time, uses about 9 times less memory, and makes about 4 times fewer allocations per operation. These improvements suggest significant optimizations in the regex handling in the "store-proxy-cache-matchers" branch compared to the "main" branch.
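For reference, a hedged sketch of what a benchmark along these lines might look like; the matcher value and setup are illustrative, not the PR's exact `BenchmarkProxySeriesRegex`, which exercises the proxy store's Series path:

```go
func BenchmarkProxySeriesRegex(b *testing.B) {
	// A regex matcher forces a regexp compile inside MatchersToPromMatchers on
	// every call unless the compiled matcher is cached.
	ms := []storepb.LabelMatcher{
		{Type: storepb.LabelMatcher_RE, Name: "pod", Value: "thanos-receive-.*"},
	}

	b.ReportAllocs()
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		if _, err := storepb.MatchersToPromMatchers(ms...); err != nil {
			b.Fatal(err)
		}
	}
}
```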

Changes

  • Added a matcher cache for `MatchersToPromMatchers` and a new version of the function which uses it (a hedged sketch follows below).
  • The main change is in the `matchesExternalLabels` function, which now receives a cache instance.
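A minimal sketch of the idea, assuming an LRU from hashicorp/golang-lru keyed by a string derived from the protobuf matcher. The names `MatchersCache`, `NewItemFunc`, and `GetOrSet` mirror the PR, but the constructor signature, the key encoding, and the body are illustrative, and the PR's metrics wiring is omitted:

```go
package storecache

import (
	lru "github.com/hashicorp/golang-lru/v2"
	"github.com/prometheus/prometheus/model/labels"

	"github.com/thanos-io/thanos/pkg/store/storepb"
)

// NewItemFunc builds the compiled matcher on a cache miss.
type NewItemFunc func() (*labels.Matcher, error)

type MatchersCache struct {
	cache *lru.Cache[string, *labels.Matcher]
}

func NewMatchersCache(size int) (*MatchersCache, error) {
	c, err := lru.New[string, *labels.Matcher](size)
	if err != nil {
		return nil, err
	}
	return &MatchersCache{cache: c}, nil
}

// key encodes type, name, and value so equal matchers hit the same entry.
func key(m storepb.LabelMatcher) string {
	return m.Type.String() + "\x00" + m.Name + "\x00" + m.Value
}

// GetOrSet returns the cached compiled matcher, or builds and stores it on a
// miss, so regexes are compiled once instead of on every Series call.
func (c *MatchersCache) GetOrSet(m storepb.LabelMatcher, newItem NewItemFunc) (*labels.Matcher, error) {
	k := key(m)
	if cached, ok := c.cache.Get(k); ok {
		return cached, nil
	}
	item, err := newItem()
	if err != nil {
		return nil, err
	}
	c.cache.Add(k, item)
	return item, nil
}
```

The PR itself also counts requests and hits via Prometheus counters (see `c.metrics.requestsTotal` in the diff further down); that wiring is left out here for brevity.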

Verification

  • I have added tests for the change, as well as new benchmarks.

@pedro-stanaka marked this pull request as ready for review May 13, 2024 08:13
@pedro-stanaka marked this pull request as draft May 13, 2024 08:14
@pedro-stanaka force-pushed the store-proxy-cache-matchers branch 2 times, most recently from 0528b9c to a58508d, May 13, 2024 09:29
@pedro-stanaka changed the title from "Receivers|Store: cache matchers for series calls" to "Receiver: cache matchers for series calls" May 13, 2024
@pedro-stanaka force-pushed the store-proxy-cache-matchers branch 2 times, most recently from 23c786a to 34e4852, May 13, 2024 12:02

Commits (all signed off by Pedro Tanaka <pedro.tanaka@shopify.com>):

  • adding matcher cache and refactor matchers (co-authored by Andre Branchizio <andre.branchizio@shopify.com>)
  • Using the cache in proxy and tsdb stores (only receiver)
  • fixing problem with deep equality
  • adding some docs
  • Adding benchmark
  • undo unnecessary changes
  • Adjusting metric names
  • adding changelog
  • wiring changes to the receiver
  • Fixing linting
@pedro-stanaka marked this pull request as ready for review May 14, 2024 13:05
@GiedriusS (Member) left a comment


> The results indicate that the "store-proxy-cache-matchers" branch considerably outperforms the "main" branch in all observed aspects of the BenchmarkProxySeriesRegex function: it is roughly 10 times faster in execution time, uses about 9 times less memory, and makes about 4 times fewer allocations per operation. These improvements suggest significant optimizations in the regex handling in the "store-proxy-cache-matchers" branch compared to the "main" branch.

Was this AI generated? 😄


```go
func (c *MatchersCache) GetOrSet(key LabelMatcher, newItem NewItemFunc) (*labels.Matcher, error) {
	c.metrics.requestsTotal.Inc()
	if item, ok := c.cache.Get(key); ok {
```

I suggest using singleflight here to reduce allocations even more
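A minimal sketch of how that could look, extending the string-keyed cache sketched in the summary above and using `golang.org/x/sync/singleflight`; illustrative, not the PR's code:

```go
import "golang.org/x/sync/singleflight"

type MatchersCache struct {
	cache *lru.Cache[string, *labels.Matcher]
	sf    singleflight.Group
}

func (c *MatchersCache) GetOrSet(m storepb.LabelMatcher, newItem NewItemFunc) (*labels.Matcher, error) {
	k := key(m)
	if cached, ok := c.cache.Get(k); ok {
		return cached, nil
	}
	// Concurrent misses for the same key share a single newItem call,
	// so only one goroutine compiles and allocates the matcher.
	v, err, _ := c.sf.Do(k, func() (interface{}, error) {
		item, err := newItem()
		if err != nil {
			return nil, err
		}
		c.cache.Add(k, item)
		return item, nil
	})
	if err != nil {
		return nil, err
	}
	return v.(*labels.Matcher), nil
}
```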

```diff
@@ -973,6 +986,8 @@ func (rc *receiveConfig) registerFlag(cmd extkingpin.FlagClause) {
 		"about order.").
 		Default("false").Hidden().BoolVar(&rc.allowOutOfOrderUpload)
 
+	cmd.Flag("matcher-cache-size", "The size of the cache used for matching against external labels. Using 0 disables caching.").Default("0").IntVar(&rc.matcherCacheSize)
```

Should we add this to other components as well like Thanos Store?
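If so, the wiring would presumably mirror the receiver flag above; a hedged sketch for the store component, where the `sc.matcherCacheSize` field is hypothetical:

```go
cmd.Flag("matcher-cache-size", "The size of the cache used for matching against external labels. Using 0 disables caching.").
	Default("0").IntVar(&sc.matcherCacheSize)
```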

```go
		tms []*labels.Matcher
		err error
	)
	if cache != nil {
```

Maybe we could put *storepb.MatchersCache behind an interface to avoid this if cache != nil { ... } else { ... } everywhere?
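A minimal sketch of that suggestion; the interface and no-op type names are illustrative and assume the LRU-backed type is adapted to satisfy the same interface:

```go
// MatchersCache hides whether compiled matchers are actually cached.
type MatchersCache interface {
	GetOrSet(m storepb.LabelMatcher, newItem NewItemFunc) (*labels.Matcher, error)
}

// NoopMatchersCache always rebuilds the matcher, so callers can drop the
// `if cache != nil { ... } else { ... }` branches and always go through the interface.
type NoopMatchersCache struct{}

func (NoopMatchersCache) GetOrSet(_ storepb.LabelMatcher, newItem NewItemFunc) (*labels.Matcher, error) {
	return newItem()
}
```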

```go
	}
}

func NewMatchersCache(opts ...MatcherCacheOption) (*MatchersCache, error) {
```

Maybe we can just use pkg/cache/inmemory.go? It's another LRU implementation that already exists in the tree.

Labels: none yet · Projects: none yet · Linked issues: none yet · 2 participants