in_kubernetes_events: Inefficient Defaults Lead To Kube API Spamming/Resource Drain & extra processing required in fluent-bit #8315

ryanohnemus · 2023-12-21T13:45:28Z

Bug Report

Describe the bug
The current in_kubernetes_events plugin is polling the kubeapi every 500ms by default (unless you specifically update the interval_(sec|nsec) config options. It is retrieving the same data over and over and using improper resourceVersion semantics (see bug #8314) to detect resource (event) changes.

resourceVersion / resourceVersionMatch are unset during the call to /api/v1/events which requires a quorum of kube api servers before it's response.

https://kubernetes.io/docs/reference/using-api/api-concepts/#semantics-for-get-and-list
Unless you have strong consistency requirements, using resourceVersionMatch=NotOlderThan and a known resourceVersion is preferable since it can achieve better performance and scalability of your cluster than leaving resourceVersion and resourceVersionMatch unset, which requires quorum read to be served.

This combination in large clusters, especially when combined with another default (requesting events for all namespaces when not limiting to a single namespace with kube_namespace), can lead to several Mbs of data being constantly polled from the kube api servers every 500ms.

To Reproduce

Steps to reproduce the problem:

Use the following input with defaults:

[INPUT]
    name          kubernetes_events
    tag           k8s_events

set debug logging (FLB_LOG_LEVEL=debug) and you can see we are consistently polling the same data and skipping over resourceVersion information we already have. If you attach this to a debugger (or just rebuild with extra flb_plg_debug lines within this do:

fluent-bit/plugins/in_kubernetes_events/kubernetes_events.c

Line 709 in bc28e78

do {

)

Expected behavior
Running a list against the k8s cluster should only be done at startup or after our last resourceVersion is considered too far out of date by k8s (it will return a 410 when requesting too old of a version). Then we should follow efficient-detection-of-changes which uses the resourceVersion of the EventList (not the individual events) to create a chunked stream of updates.

Your Environment

Version used: 2.2.0

The text was updated successfully, but these errors were encountered:

github-actions · 2024-04-07T01:50:54Z

This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 5 days. Maintainers can add the exempt-stale label.

github-actions · 2024-04-13T01:43:41Z

This issue was closed because it has been stalled for 5 days with no activity.

ryanohnemus added the status: waiting-for-triage label Dec 21, 2023

ryanohnemus mentioned this issue Dec 21, 2023

http_client: Add Ability to Process Http Chunked Stream #8316

Merged

7 tasks

ryanohnemus mentioned this issue Jan 4, 2024

in_kubernetes_events: Efficiently stream kubernetes events via watch #8351

Merged

6 tasks

github-actions bot added the Stale label Apr 7, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Apr 13, 2024

edsiper closed this as completed in #8351 Jun 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

in_kubernetes_events: Inefficient Defaults Lead To Kube API Spamming/Resource Drain & extra processing required in fluent-bit #8315

in_kubernetes_events: Inefficient Defaults Lead To Kube API Spamming/Resource Drain & extra processing required in fluent-bit #8315

ryanohnemus commented Dec 21, 2023

github-actions bot commented Apr 7, 2024

github-actions bot commented Apr 13, 2024

in_kubernetes_events: Inefficient Defaults Lead To Kube API Spamming/Resource Drain & extra processing required in fluent-bit #8315

in_kubernetes_events: Inefficient Defaults Lead To Kube API Spamming/Resource Drain & extra processing required in fluent-bit #8315

Comments

ryanohnemus commented Dec 21, 2023

Bug Report

github-actions bot commented Apr 7, 2024

github-actions bot commented Apr 13, 2024