A68: Random subsetting with rendezvous hashing LB policy #423

s-matyukevich · 2024-04-11T20:15:53Z

Replaces #383

Related to #430 which describes an LB policy that could be used in combination with random subsetting to correct the resulting server-side load imbalance.

A68-random-subsetting.md

atollena · 2024-04-15T12:54:35Z

A68-random-subsetting.md

+* When the lb policy is initialized it also creates a random 32-byte long `salt` string. 
+* After every resolver update the policy picks a new subset. It does this by implementing `rendezvous hashing` algorithm:
+  * Concatenate `salt` to each address in the list.
+  * For every resulting entity compute [MurmurHash3](https://en.wikipedia.org/wiki/MurmurHash) hash, which produces 128-byte output.


There is no dependency on murmur from grpc, at least in Go, as of today. You can use xxhash which is depended upon by ring_hash.

updated to use xxhash, This changes the algorithms slightly as we can use random pre-generated seed instead of concatenating salt to each address. The new version is even simpler.

atollena · 2024-04-15T13:03:58Z

A68-random-subsetting.md

+
+* The policy receives a single configuration parameter: `subset_size`, which must be configured by the user.
+* When the lb policy is initialized it also creates a random 32-byte long `salt` string. 
+* After every resolver update the policy picks a new subset. It does this by implementing `rendezvous hashing` algorithm:


I think it would help to define the algorithm in pseudo code. See https://github.com/grpc/proposal/blob/master/A55-xds-stateful-session-affinity.md#lb-policy-for-stateful-session-affinity or https://github.com/grpc/proposal/blob/master/A42-xds-ring-hash-lb-policy.md#aggregated-connectivity-state for examples of gRFCs that do this.

A68-random-subsetting.md

atollena · 2024-04-15T13:09:42Z

A68-random-subsetting.md

+
+### Handling Parent/Resolver Updates
+
+When the resolver updates the list of addresses, or the LB config changes, Random subsetting LB will run the subsetting algorithm, described above, to filter the endpoint list. Then it will create a new resolver state with the filtered list of the addresses and pass it to the child LB. Attributes and service config from the old resolver state will be copied to the new one. 


I think you should replace addresses with endpoints to take A61 into consideration.

atollena · 2024-04-15T13:11:18Z

A68-random-subsetting.md

+
+## Proposal
+
+Introduce a new LB policy, `random_subsetting`. This policy selects a subset of addresses and passes them to the child LB policy. It maintains 2 important properties:


I think you need to replace addresses with endpoints to account for https://github.com/grpc/proposal/blob/master/A61-IPv4-IPv6-dualstack-backends.md, where each endpoint may have multiple addresses.

A68-random-subsetting.md

atollena · 2024-04-15T13:17:57Z

A68-random-subsetting.md

+* The policy receives a single configuration parameter: `subset_size`, which must be configured by the user.
+* When the lb policy is initialized it also creates a random 32-byte long `salt` string. 
+* After every resolver update the policy picks a new subset. It does this by implementing `rendezvous hashing` algorithm:
+  * Concatenate `salt` to each address in the list.


You'll have to decide which address, in case the endpoint has more than one (I think you can use the first address?).

Updated the doc to use the first address.

atollena · 2024-04-15T13:20:02Z

A68-random-subsetting.md

+As described in [gRFC A52](https://github.com/grpc/proposal/blob/master/A52-xds-custom-lb-policies.md), gRPC has an LB policy registry, which maintains a list of converters. Every converter translates xDS LB policy to the corresponding service config. In order to allow using the Random subsetting LB policy via xDS, the only thing that needs to be done is providing a corresponding converter function. The function implementation will be trivial as the fields in the xDS LB policy will match exactly the fields in the service config.
+
+## Rationale
+### Alternatives Considered: Deterministic subsetting


You should probably discuss the trade offs of doing this kind of subsetting in the control plane, since it was discussed in the original proposal.

Yeah, but I posted a link to the tl;dr; of the discussion, so you think this is not enough?

I'm thinking of the option of doing random subsetting in the control plane by sending different EDS responses (with different subsets) to each dataplane, or the equivalent with other resolvers. It is simple to implement with xDS and works for Envoy and gRPC. IIRC the main argument for not going that route is the need to have an xDS infrastructure (this is a big barrier for our orgs, and probably others), and existing limitations of https://github.com/envoyproxy/go-control-plane.

This was discussed in https://github.com/grpc/proposal/pull/383/files#r1308024474.

In order to understand this proposal, I think users will need to understand the trade off of doing it as a balancer in each data plane rather than directly in service discovery.

Co-authored-by: Antoine Tollenaere <atollena@gmail.com>

… into random-subsetting

s-matyukevich · 2024-05-06T16:52:35Z

Bump on this. It has been almost a month since the proposal was submitted and no one from gRPC maintainers commented on it yet. cc @markdroth and @ejona86 since you both reviewed previous version of the proposal and have full context.

s-matyukevich added 7 commits April 10, 2024 15:41

Random subsetting with rendezvous hashing LB policy

97c8e73

Random subsetting with rendezvous hashing LB policy

1058d36

rename folder

ea7db4b

more images

4c2ca73

More images

c1b1b6e

review suggestion

8fd9fff

review comments

d58f609

s-matyukevich mentioned this pull request Apr 11, 2024

A68: Deterministic Subsetting LB policy #383

Closed

s-matyukevich changed the title ~~Random subsetting with rendezvous hashing LB policy~~ A68: Random subsetting with rendezvous hashing LB policy Apr 11, 2024

Add discussion link

5ce1497

atollena reviewed Apr 15, 2024

View reviewed changes

s-matyukevich and others added 6 commits April 15, 2024 08:02

Update A68-random-subsetting.md

802fde1

Co-authored-by: Antoine Tollenaere <atollena@gmail.com>

Update A68-random-subsetting.md

31cdd68

Co-authored-by: Antoine Tollenaere <atollena@gmail.com>

Update A68-random-subsetting.md

bcd293c

Co-authored-by: Antoine Tollenaere <atollena@gmail.com>

replace addresses with endpoints

5c859ca

Merge branch 'random-subsetting' of github.com:s-matyukevich/proposal…

b710a6f

… into random-subsetting

add pseudocode

8796292

s-matyukevich mentioned this pull request Apr 26, 2024

PID LB policy #430

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A68: Random subsetting with rendezvous hashing LB policy #423

A68: Random subsetting with rendezvous hashing LB policy #423

s-matyukevich commented Apr 11, 2024 •

edited

atollena Apr 15, 2024

s-matyukevich Apr 15, 2024

atollena Apr 15, 2024

s-matyukevich Apr 15, 2024

atollena Apr 15, 2024

s-matyukevich Apr 15, 2024

atollena Apr 15, 2024

s-matyukevich Apr 15, 2024

atollena Apr 15, 2024

s-matyukevich Apr 15, 2024

atollena Apr 15, 2024

s-matyukevich Apr 15, 2024

atollena Apr 15, 2024

s-matyukevich commented May 6, 2024


		### Handling Parent/Resolver Updates

		When the resolver updates the list of addresses, or the LB config changes, Random subsetting LB will run the subsetting algorithm, described above, to filter the endpoint list. Then it will create a new resolver state with the filtered list of the addresses and pass it to the child LB. Attributes and service config from the old resolver state will be copied to the new one.


		## Proposal

		Introduce a new LB policy, `random_subsetting`. This policy selects a subset of addresses and passes them to the child LB policy. It maintains 2 important properties:

A68: Random subsetting with rendezvous hashing LB policy #423

Are you sure you want to change the base?

A68: Random subsetting with rendezvous hashing LB policy #423

Conversation

s-matyukevich commented Apr 11, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

s-matyukevich commented May 6, 2024

s-matyukevich commented Apr 11, 2024 •

edited