[Core] Stale ray_cluster_<state>_nodes metrics #50735
Labels
bug: Something that is supposed to be working, but isn't
core: Issues that should be addressed in Ray Core
good-first-issue: Great starter issue for someone just starting to contribute to Ray
observability: Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling
P1: Issue that should be fixed within a few weeks
What happened + What you expected to happen
I am observing stale values for the ray_cluster_active_nodes and ray_cluster_pending_nodes metrics.
Example:
Dashboard shows this (accurate):
However, http://10.212.14.97:8080/metrics shows this (inaccurate):
Versions / Dependencies
Ray version 2.41.0
KubeRay version 1.2.2
Reproduction script
I've reproduced this multiple times in the context of KubeRay:
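One way to check for the discrepancy, as a minimal sketch rather than the reporter's own script: parse the Prometheus text-format output that a /metrics endpoint returns and sum the samples for ray_cluster_active_nodes and ray_cluster_pending_nodes, so the totals can be compared with what the dashboard shows. The function name sum_node_metrics, the SAMPLE payload, and the node_type label values in it are hypothetical and exist only for illustration; they are not output from a real cluster.

```python
import re
from collections import defaultdict

# Matches one Prometheus text-format sample line:
# metric name, optional {labels}, then the value.
_SAMPLE_RE = re.compile(
    r'^(?P<name>[a-zA-Z_:][a-zA-Z0-9_:]*)'
    r'(?:\{(?P<labels>.*)\})?'
    r'\s+(?P<value>\S+)'
)

def sum_node_metrics(
    metrics_text,
    names=("ray_cluster_active_nodes", "ray_cluster_pending_nodes"),
):
    """Sum all samples of the given metric names across their label sets."""
    totals = defaultdict(float)
    for line in metrics_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = _SAMPLE_RE.match(line)
        if match and match.group("name") in names:
            totals[match.group("name")] += float(match.group("value"))
    return dict(totals)

# Hypothetical payload, for illustration only (not real cluster output).
SAMPLE = """\
# TYPE ray_cluster_active_nodes gauge
ray_cluster_active_nodes{node_type="head-group"} 1.0
ray_cluster_active_nodes{node_type="worker-group"} 2.0
# TYPE ray_cluster_pending_nodes gauge
ray_cluster_pending_nodes{node_type="worker-group"} 0.0
"""

if __name__ == "__main__":
    print(sum_node_metrics(SAMPLE))
```

Against a live cluster, the text could be fetched from the metrics URL (for example with urllib.request.urlopen(url).read().decode()) and passed to the same function. Totals that stay unchanged after nodes are added or removed would point to the stale values described above.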
Issue Severity
Low: It annoys or frustrates me.