matching: introduce consistent hashing matcher #14875

snowp · 2021-01-30T13:07:33Z

This introduces a new matcher that matches on an input value by computing
a hash of it and matching if hash % (configured modulo) is greater than a
configured threshold. This makes it possible to define match criteria that
apply to a fixed percentage of input values in a way that is consistent
between independent Envoy instances (i.e. it does not rely on a random
input).

Risk Level: Low, new extension
Testing: UTs
Docs Changes: Inline proto docs
Release Notes: n/a
Platform Specific Features: n/a
Fixes #14782
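
For readers skimming the thread, the rule in the description can be sketched in a few lines. This is a minimal illustration, not the code in the PR: the class name ConsistentHashMatcher is hypothetical, and 64-bit FNV-1a stands in for HashUtil::xxHash64 only because it is deterministic and short enough to inline here.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <string_view>

// Stand-in for HashUtil::xxHash64: 64-bit FNV-1a. This is NOT the hash Envoy
// uses; it is only a deterministic placeholder for the sketch.
inline uint64_t fnv1a64(std::string_view data) {
  uint64_t hash = 0xcbf29ce484222325ULL; // FNV offset basis
  for (unsigned char c : data) {
    hash ^= c;
    hash *= 0x100000001b3ULL; // FNV prime
  }
  return hash;
}

// Hypothetical matcher mirroring the rule in the description:
// match when hash(input) % modulo > threshold.
class ConsistentHashMatcher {
public:
  ConsistentHashMatcher(uint32_t threshold, uint32_t modulo)
      : threshold_(threshold), modulo_(modulo) {
    assert(modulo_ > 0); // a zero modulo would divide by zero
  }

  // A missing input never matches.
  bool match(std::optional<std::string_view> input) const {
    if (!input.has_value()) {
      return false;
    }
    return fnv1a64(*input) % modulo_ > threshold_;
  }

private:
  const uint32_t threshold_;
  const uint32_t modulo_;
};
```

Because the decision depends only on the input bytes and the two configured numbers, two independently constructed matchers with the same configuration give the same answer for the same input, which is the property the description is after.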


Signed-off-by: Snow Pettersen <snowp@lyft.com>

repokitteh-read-only · 2021-01-30T13:07:39Z

CC @envoyproxy/api-shepherds: Your approval is needed for changes made to api/envoy/.
API shepherd assignee is @htuch
CC @envoyproxy/api-watchers: FYI only for changes made to api/envoy/.

🐱

Caused by: #14875 was opened by snowp.


snowp · 2021-01-30T13:07:55Z

@donyu

snowp · 2021-01-30T13:17:02Z

I'll split the changes to the core matching logic into a separate PR (I want to make similar changes to the other factories) and leave this one as just the extension change, hence the draft.

htuch

Looks like a useful addition. How come this requirement never came up in RouteConfiguration matchers?

htuch · 2021-01-31T01:09:03Z

api/envoy/extensions/matching/input_matchers/consistent_hashing/v3/consistent_hashing.proto

+message ConsistentHashing {
+  // The threshold the resulting hash must be over in order for this matcher to evaluate to true.
+  // This value must be below the configured modulo value.
+  uint32 threshold = 1 [(validate.rules).uint32 = {gt: 0}];


Could this be zero to mean 100%?

snowp · 2021-02-17

Yeah, good point. I'll remove the validation.

htuch · 2021-01-31T01:11:56Z

source/extensions/matching/input_matchers/consistent_hashing/matcher.h

+    }
+
+    // Otherwise, match if (hash(input) % modulo) > threshold.
+    return HashUtil::xxHash64(*input) % modulo_ > threshold_;


One interesting implication of the API guarantee that results are consistent fleet-wide: if we ever switch away from xxHash64, or xxhash itself makes a breaking change to its output, the guarantee will be violated during rollouts. FWIW we have this problem today with our affinity load balancers. Not sure how you'd prefer to handle it (maybe a warning in the API documentation is enough?), but it's worth considering.

snowp · 2021-02-17

I'll add a warning to the docs. I'm not sure what else we could do here besides implementing our own hash that we promise never to change (or never updating xxhash?).
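
One way such a change could at least be detected (this is an assumption for illustration, not something proposed in the thread) is to pin known hash outputs in a test, so that an altered hash fails the build instead of silently shifting match decisions. The sketch below uses 64-bit FNV-1a as a stand-in, since its published test vectors are easy to state; the same idea would apply to whichever hash is actually used.

```cpp
#include <cstdint>
#include <string_view>

// Stand-in hash (64-bit FNV-1a), NOT the xxHash64 used by the PR.
constexpr uint64_t fnv1a64(std::string_view data) {
  uint64_t hash = 0xcbf29ce484222325ULL; // FNV offset basis
  for (unsigned char c : data) {
    hash ^= c;
    hash *= 0x100000001b3ULL; // FNV prime
  }
  return hash;
}

// Golden values: compilation fails if the hash output ever changes.
static_assert(fnv1a64("") == 0xcbf29ce484222325ULL);
static_assert(fnv1a64("a") == 0xaf63dc4c8601ec8cULL);
```

This does not keep a fleet consistent during a rollout; it only turns an accidental change into a visible one.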

htuch · 2021-02-19

Can we model 100% match here given the >? Should it be >=?

htuch · 2021-01-31T01:12:52Z

test/extensions/matching/input_matchers/consistent_hashing/matcher_test.cc

+TEST(MatcherTest, EmptyValue) {
+  Matcher matcher(10, 100);
+
+  ASSERT_FALSE(matcher.match(absl::nullopt));


Nit: these can all be EXPECT.


donyu · 2021-02-01T16:17:30Z

CODEOWNERS

@@ -162,3 +162,5 @@ extensions/filters/http/oauth2 @rgs1 @derekargueta @snowp
 /*/extensions/filters/http/kill_request @qqustc @htuch
 # Rate limit expression descriptor
 /*/extensions/rate_limit_descriptors/expr @kyessenov @lizan
+# hash input matcher
+/*/extensions/matching/input_matchers/consistent_hashing @snowp @donyu



sschepens · 2021-02-18T11:37:46Z

Looks like a useful addition. How come this requirement never came up in RouteConfiguration matchers?

@htuch We're actually in need of this kind of matching in RouteConfiguration and are using Lua to get it done.

sschepens · 2021-02-18T15:12:24Z

@snowp @htuch Do you think it would be possible to also add a configurable "seed"? This is something we also need, because without a seed a given value would match consistently across every service that uses such a rule.

We're using this kind of hashing for "sticky" deployments based on a unique device identifier header. Without a seed, once a header value falls within the threshold, that user is enabled simultaneously on all sticky deployments on the platform.


snowp · 2021-02-18T16:15:57Z

@sschepens Sure, I was going to add that in later as we'll need this as well. Would simply passing through a uint64_t to xxhash be sufficient for your use case?


sschepens · 2021-02-18T17:39:44Z

@snowp Yep, I think that would be alright; any string seed could be converted to a uint64 by the management server.
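
To make the seed discussion concrete, here is a rough sketch of how a per-rule seed decorrelates match decisions. It is illustrative only: seededHash, matches and countDisagreements are hypothetical names, the hash is FNV-1a with a MurmurHash3-style finalizer rather than xxHash64, and the comparison uses >= (the form the PR adopts in its later "greater or equal" commit).

```cpp
#include <cstdint>
#include <string>
#include <string_view>

// Stand-in seeded hash: FNV-1a with the seed folded into the initial state,
// followed by the MurmurHash3 64-bit finalizer so seed bits reach every
// output bit. NOT the seeded xxHash64 discussed in the thread.
inline uint64_t seededHash(std::string_view data, uint64_t seed) {
  uint64_t hash = 0xcbf29ce484222325ULL ^ seed;
  for (unsigned char c : data) {
    hash ^= c;
    hash *= 0x100000001b3ULL;
  }
  hash ^= hash >> 33;
  hash *= 0xff51afd7ed558ccdULL;
  hash ^= hash >> 33;
  hash *= 0xc4ceb9fe1a85ec53ULL;
  hash ^= hash >> 33;
  return hash;
}

// Match when seededHash(input, seed) % modulo >= threshold.
inline bool matches(std::string_view input, uint32_t threshold, uint32_t modulo,
                    uint64_t seed) {
  return seededHash(input, seed) % modulo >= threshold;
}

// Counts how many of the inputs "device-0" .. "device-(n-1)" get a different
// decision under seedA than under seedB, with the same threshold and modulo.
inline int countDisagreements(int n, uint32_t threshold, uint32_t modulo,
                              uint64_t seedA, uint64_t seedB) {
  int differing = 0;
  for (int i = 0; i < n; ++i) {
    const std::string input = "device-" + std::to_string(i);
    if (matches(input, threshold, modulo, seedA) !=
        matches(input, threshold, modulo, seedB)) {
      ++differing;
    }
  }
  return differing;
}
```

With equal seeds every device gets the same decision in both rules, which is the "enabled on all sticky deployments at once" behavior described above; giving each service its own seed makes the sets of matching devices differ.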

sschepens · 2021-02-18T17:41:03Z

@snowp Another question: will this be usable from RouteConfiguration, or is this part of something else?


htuch

Looks great. Just one question.

htuch · 2021-02-19T03:05:10Z

source/extensions/matching/input_matchers/consistent_hashing/matcher.h

+    }
+
+    // Otherwise, match if (hash(input) % modulo) > threshold.
+    return HashUtil::xxHash64(*input) % modulo_ > threshold_;


Can we model 100% match here given the >? Should it be >=?
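
The difference between the two comparisons is easy to check by counting which residues of hash % modulo pass. The helper below is a hypothetical illustration (matchingResidues is not a function in the PR):

```cpp
#include <cstdint>

// Counts the residues r in [0, modulo) that pass the comparison against
// threshold. inclusive = false models `r > threshold`; true models `r >= threshold`.
inline uint32_t matchingResidues(uint32_t threshold, uint32_t modulo, bool inclusive) {
  uint32_t count = 0;
  for (uint32_t r = 0; r < modulo; ++r) {
    if (inclusive ? r >= threshold : r > threshold) {
      ++count;
    }
  }
  return count;
}
```

With modulo 100 and threshold 0, the strict comparison passes 99 of the 100 residues (residue 0 never matches), so a 100% match cannot be expressed. With >=, threshold 0 passes all 100 residues and threshold 100 passes none.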

htuch · 2021-02-19T03:05:43Z

test/extensions/matching/input_matchers/consistent_hashing/config_test.cc

+  auto message = Config::Utility::translateAnyToFactoryConfig(
+      config.typed_config(), ProtobufMessage::getStrictValidationVisitor(), factory);
+  auto matcher = factory.createInputMatcher(*message, context);
+  ASSERT_NE(nullptr, matcher);


Nit: EXPECT for a final assertion (even if fatal).


snowp · 2021-03-02T15:45:32Z

/retest

repokitteh-read-only · 2021-03-02T15:45:36Z

Retrying Azure Pipelines:
Retried failed jobs in: envoy-presubmit

🐱

Caused by: a #14875 (comment) was created by @snowp.


snowp · 2021-03-02T18:37:50Z

@htuch ptal

htuch

LGTM modulo nits.

htuch · 2021-03-03T02:45:58Z

source/extensions/matching/input_matchers/consistent_hashing/matcher.h

+      return false;
+    }
+
+    // Otherwise, match if (hash(input) % modulo) > threshold.


htuch · 2021-03-03T02:46:09Z

api/envoy/extensions/matching/input_matchers/consistent_hashing/v3/consistent_hashing.proto

+
+// The consistent hashing matchers computes a consistent hash from the input and matches if the resulting hash
+// is within the configured threshold.
+// More specifically, this matcher evaluates to true if hash(input) % modulo > threshold.



snowp · 2021-03-08T14:36:33Z

@htuch ptal

htuch

LGTM, thanks!

htuch · 2021-03-08T20:23:14Z

/lgtm api

Snow Pettersen added 5 commits January 29, 2021 20:37

spelling

b875edf

Signed-off-by: Snow Pettersen <snowp@lyft.com>

spelling

180ef64

Signed-off-by: Snow Pettersen <snowp@lyft.com>

codeowners, renames, better comments

7fb2b5f

Signed-off-by: Snow Pettersen <snowp@lyft.com>

spelling

4a91bb5

Signed-off-by: Snow Pettersen <snowp@lyft.com>

repokitteh-read-only bot added the api label Jan 30, 2021

repokitteh-read-only bot assigned htuch Jan 30, 2021

htuch reviewed Jan 31, 2021


donyu reviewed Feb 1, 2021


htuch added the waiting label Feb 7, 2021

Snow Pettersen added 3 commits February 17, 2021 19:57

Merge remote-tracking branch 'envoy/main' into consistent-matcher

ab01c61

Signed-off-by: Snow Pettersen <snowp@lyft.com>

cleanup after merge

5b8eb7b

Signed-off-by: Snow Pettersen <snowp@lyft.com>

feedback

6e1f7a1

Signed-off-by: Snow Pettersen <snowp@lyft.com>

repokitteh-read-only bot removed the waiting label Feb 17, 2021

Snow Pettersen added 3 commits February 17, 2021 21:52

fix docs

09bef8e

Signed-off-by: Snow Pettersen <snowp@lyft.com>

spelling

1103ce1

Signed-off-by: Snow Pettersen <snowp@lyft.com>

spelling

00ec10b

Signed-off-by: Snow Pettersen <snowp@lyft.com>

fix category

d328029

Signed-off-by: Snow Pettersen <snowp@lyft.com>

snowp marked this pull request as ready for review February 18, 2021 16:22

fix test

caa6629

Signed-off-by: Snow Pettersen <snowp@lyft.com>

add dep

81a21eb

Signed-off-by: Snow Pettersen <snowp@lyft.com>

htuch reviewed Feb 19, 2021


sschepens mentioned this pull request Feb 19, 2021

traffic splitting with session affinity #8167

Closed

Snow Pettersen added 4 commits February 19, 2021 17:00

greater or equal

558f469

Signed-off-by: Snow Pettersen <snowp@lyft.com>

assert -> expect

aa4dd9b

Signed-off-by: Snow Pettersen <snowp@lyft.com>

Merge remote-tracking branch 'envoy/main' into consistent-matcher

1ebce21

Signed-off-by: Snow Pettersen <snowp@lyft.com>

add seed

61a13bd

Signed-off-by: Snow Pettersen <snowp@lyft.com>

htuch reviewed Mar 3, 2021


docs update

9794836

Signed-off-by: Snow Pettersen <snowp@lyft.com>

htuch approved these changes Mar 8, 2021


repokitteh-read-only bot removed the api label Mar 8, 2021

htuch merged commit 7fe3d35 into envoyproxy:main Mar 8, 2021

jmendesky mentioned this pull request May 6, 2021

Session affinity doesn't seem to apply to weighted subsets - Envoy support required istio/istio#9764

Closed

itsmunim mentioned this pull request Jun 9, 2021

Session stickiness does not work with weighted canary distribution istio/istio#33343

Closed


