balancer/weightedroundrobin: Add recording point for endpoint weight not yet usable and add metrics tests #7466

zasweq · 2024-08-01T01:59:02Z

This PR adds unit test assertions for Weighted Round Robin metrics and an e2e test that checks the exact OpenTelemetry metrics atoms expected.

It also fixes the "startup" case for WeightedSubConns, when a WeightedSubConn has not received a load report yet. Previously, this case recorded an endpoint weight stale metric; I added logic so that it records an endpoint weight not yet usable metric instead.

RELEASE NOTES: N/A

codecov · 2024-08-01T02:20:08Z

Codecov Report

Attention: Patch coverage is 80.00000% with 12 lines in your changes missing coverage. Please review.

Project coverage is 81.53%. Comparing base (3eb0145) to head (761b729).
Report is 15 commits behind head on master.

Files	Patch %	Lines
internal/testutils/stats/test_metrics_recorder.go	73.68%	9 Missing and 1 partial ⚠️
internal/testutils/xds/e2e/setup/setup.go	81.81%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #7466      +/-   ##
==========================================
- Coverage   81.57%   81.53%   -0.05%     
==========================================
  Files         354      358       +4     
  Lines       27076    27278     +202     
==========================================
+ Hits        22088    22240     +152     
- Misses       3798     3824      +26     
- Partials     1190     1214      +24

Files	Coverage Δ
balancer/weightedroundrobin/balancer.go	`80.51% <100.00%> (+0.29%)`	⬆️
balancer/weightedroundrobin/scheduler.go	`100.00% <100.00%> (ø)`
internal/testutils/xds/e2e/setup/setup.go	`81.81% <81.81%> (ø)`
internal/testutils/stats/test_metrics_recorder.go	`75.20% <73.68%> (+2.58%)`	⬆️

... and 30 files with indirect coverage changes


dfawley

Sending comments discussed offline

dfawley · 2024-08-01T22:27:23Z

stats/opentelemetry/e2e_test.go

+	// load report, giving that SubConn a weight which will eventually expire.
+	// Two backends needed as for only one backend, WRR does not recompute the
+	// scheduler.
+	for i := 0; i < 100; i++ {


Use a map to guarantee you're hitting both backends. Or make RPCs simultaneously with checking metrics.

Decided to just set weights on both of them, and do the latter. Even if I fork a goroutine here, I don't think there's a guarantee it doesn't just hit backend 2 over and over again 100 times, even if I have a metrics emission (it will emit the endpoint weight metric even if the endpoint doesn't have a weight).
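For reference, the map-based suggestion could look roughly like the sketch below. It is not what the PR ended up doing; `callUntilAllSeen` and the `call` callback are hypothetical, with `call` standing in for "make one RPC and report which backend answered".

```go
package main

import "fmt"

// callUntilAllSeen invokes call repeatedly until every backend in want has
// answered at least once, or until maxAttempts calls have been made.
func callUntilAllSeen(call func() (string, error), want []string, maxAttempts int) error {
	seen := make(map[string]bool, len(want))
	for i := 0; i < maxAttempts; i++ {
		backend, err := call()
		if err != nil {
			return fmt.Errorf("call %d failed: %w", i, err)
		}
		seen[backend] = true

		// Stop as soon as every wanted backend has been observed.
		all := true
		for _, w := range want {
			if !seen[w] {
				all = false
				break
			}
		}
		if all {
			return nil
		}
	}
	return fmt.Errorf("saw backends %v after %d attempts, want all of %v", seen, maxAttempts, want)
}
```

Unlike a fixed loop of 100 RPCs, this fails loudly when one backend is never picked instead of silently proceeding.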

dfawley · 2024-08-01T22:28:38Z

stats/opentelemetry/e2e_test.go

+	rm := &metricdata.ResourceMetrics{}
+	reader.Collect(ctx, rm)
+	gotMetrics := map[string]metricdata.Metrics{}
+	for _, sm := range rm.ScopeMetrics {
+		for _, m := range sm.Metrics {
+			gotMetrics[m.Name] = m
+		}
+	}


Refactor into a function
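A sketch of the requested refactor. To stay self-contained it uses local stand-in types (`metric`, `scopeMetrics`, `resourceMetrics`) that only mirror the shape of the OpenTelemetry `metricdata` types in the snippet above; `metricsByName` is a hypothetical helper name, and the `reader.Collect` call would stay at the call site.

```go
package main

// Stand-ins for the OpenTelemetry metricdata types used in the test.
// Only the fields needed for this sketch are modeled.
type metric struct {
	Name  string
	Value float64
}

type scopeMetrics struct {
	Metrics []metric
}

type resourceMetrics struct {
	ScopeMetrics []scopeMetrics
}

// metricsByName flattens all scopes into a map keyed by metric name.
// If two scopes emit the same name, the later one wins, matching the
// behavior of the inline loop it replaces.
func metricsByName(rm *resourceMetrics) map[string]metric {
	got := make(map[string]metric)
	for _, sm := range rm.ScopeMetrics {
		for _, m := range sm.Metrics {
			got[m.Name] = m
		}
	}
	return got
}
```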

dfawley · 2024-08-05T21:50:11Z

internal/testutils/stats/test_metrics_recorder.go

+// ClearMetrics clears the metrics data stores of the test metrics recorder by
+// setting all the data to 0.


Why set everything to zero instead of r.data = make(...)?

That is what I do in my constructor, but it seems like it doesn't matter because the writes in RecordSomething() will write a new data type. Should I get rid of that logic in constructor too and here?

Oh I see.. Yes, I would just create an empty map unless there's a need to do otherwise (and then...question whether that need is truly necessary). The zero value as the default should make everything "just work".

Yeah fair. Switched to just making an empty map, and letting new map write take care of it. Made no sense to configure with metrics name list I guess.
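The outcome of this thread can be sketched as follows. `testRecorder` is a hypothetical, much reduced stand-in for the test metrics recorder, assuming it stores the latest value per metric name in a map.

```go
package main

import "sync"

// testRecorder is a simplified stand-in for a test metrics recorder.
type testRecorder struct {
	mu   sync.Mutex
	data map[string]float64
}

// newTestRecorder needs no list of metric names: entries are created on
// first write.
func newTestRecorder() *testRecorder {
	return &testRecorder{data: make(map[string]float64)}
}

// Record stores the latest value for a metric name.
func (r *testRecorder) Record(name string, v float64) {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.data[name] = v
}

// Metric returns the last recorded value and whether one was recorded.
func (r *testRecorder) Metric(name string) (float64, bool) {
	r.mu.Lock()
	defer r.mu.Unlock()
	v, ok := r.data[name]
	return v, ok
}

// ClearMetrics drops all recorded data by replacing the map, rather than
// walking it and zeroing each entry.
func (r *testRecorder) ClearMetrics() {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.data = make(map[string]float64)
}
```

Replacing the map also lets a test tell "never recorded" apart from "recorded as 0", which zeroing every entry cannot do.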


dfawley · 2024-08-09T16:03:10Z

balancer/weightedroundrobin/metrics_test.go

+			wsc := &weightedSubConn{
+				metricsRecorder: tmr,
+				weightVal:       3,
+			}
+			if test.lastUpdatedSet {
+				wsc.lastUpdated = time.Now()
+			}
+			if test.nonEmptySet {
+				wsc.nonEmptySince = time.Now()
+			}


Consider instead having lastUpdated and nonEmptySince as time.Time fields in the test case struct, and just setting them to either time.Time{} or time.Now(). Then there's less logic, which should make things easier to read & understand in the future.

Ah right sounds good. Switched.

zasweq requested a review from dfawley August 1, 2024 01:59

zasweq assigned dfawley Aug 1, 2024

zasweq added the Type: Testing label Aug 1, 2024

zasweq added this to the 1.66 Release milestone Aug 1, 2024

zasweq force-pushed the add-tests-wrr-metrics branch 2 times, most recently from 270a26d to 37e724c Compare August 1, 2024 02:29

Add recording point for endpoint weight not yet usable and add metrics tests

f8aed9e

zasweq force-pushed the add-tests-wrr-metrics branch from 37e724c to f8aed9e Compare August 1, 2024 03:36

dfawley reviewed Aug 2, 2024

dfawley assigned zasweq and unassigned dfawley Aug 2, 2024

Responded to some comments

9a1fb9b

zasweq assigned dfawley and unassigned zasweq Aug 2, 2024

zasweq force-pushed the add-tests-wrr-metrics branch from ac5b190 to f50574a Compare August 3, 2024 01:44

Add unit tests for WRR Metrics

6ff2e00

zasweq force-pushed the add-tests-wrr-metrics branch from f50574a to 6ff2e00 Compare August 3, 2024 02:40

dfawley reviewed Aug 5, 2024

dfawley assigned zasweq and unassigned dfawley Aug 5, 2024

zasweq force-pushed the add-tests-wrr-metrics branch from 5e8a314 to aa9ea9f Compare August 7, 2024 01:47

zasweq assigned dfawley and unassigned zasweq Aug 7, 2024

Responded to Doug's comments

761b729

zasweq force-pushed the add-tests-wrr-metrics branch from aa9ea9f to 761b729 Compare August 7, 2024 02:06

dfawley approved these changes Aug 9, 2024

dfawley assigned zasweq and unassigned dfawley Aug 9, 2024

Responded to Doug's comments

2ceb6e6

zasweq merged commit 54b48f7 into grpc:master Aug 10, 2024
10 of 11 checks passed

infovivek2020 pushed a commit to infovivek2020/grpc-go that referenced this pull request Aug 18, 2024

balancer/weightedroundrobin: Add recording point for endpoint weight not yet usable and add metrics tests (grpc#7466)

548ba56

tbg mentioned this pull request Nov 27, 2024

DEPS: upgrade grpc to v1.68.0 cockroachdb/cockroach#136278

Closed

16 tasks

github-actions bot locked as resolved and limited conversation to collaborators Feb 6, 2025
