add ignore_rpc_timeout option to allow suppressing rpc timeout errors #5137

Open
wants to merge 1 commit into
base: master

gluckzhang

Problem

We currently run the watchtower to monitor validators on both mainnet and testnet. Although we have configured the instance with a higher unhealthy threshold, told it to ignore bad gateway errors, allowed a longer RPC timeout, and made it check status less frequently (e.g., --unhealthy-threshold 2 --ignore-http-bad-gateway --rpc-timeout 60 --interval 65), we still receive many "operation timed out" alerts. These errors have more to do with the availability of the RPC endpoints, so for now we would like to suppress them.

Summary of Changes

This PR adds a new optional CLI option, --ignore_rpc_timeout, that lets users suppress RPC timeout errors. The default value of ignore_rpc_timeout is false, so merging this PR does not change the default behavior of the watchtower. Users decide for themselves whether to ignore RPC timeouts.
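
To illustrate the intended behavior, here is a minimal sketch of how an opt-in flag like this could gate alerts. It is not the code in this PR: `CheckError`, `Config`, and `should_alert` are hypothetical names made up for the example, and the real watchtower may classify errors differently.

```rust
use std::time::Duration;

/// Hypothetical error categories a monitoring loop might observe.
/// These are illustrative only, not the watchtower's actual types.
#[derive(Debug, Clone, PartialEq)]
enum CheckError {
    /// The RPC request did not complete within the configured timeout.
    RpcTimeout(Duration),
    /// The RPC endpoint answered with HTTP 502.
    BadGateway,
    /// Any other failure, carried as a message.
    Other(String),
}

/// Hypothetical settings mirroring the two "ignore" options discussed above.
/// `Default` yields `false` for both fields, so nothing is suppressed
/// unless the user opts in.
#[derive(Debug, Clone, Copy, Default)]
struct Config {
    ignore_http_bad_gateway: bool,
    ignore_rpc_timeout: bool,
}

/// Returns `true` when a failure should be reported as an alert,
/// and `false` when the user has chosen to suppress that kind of error.
fn should_alert(config: &Config, err: &CheckError) -> bool {
    match err {
        CheckError::RpcTimeout(_) => !config.ignore_rpc_timeout,
        CheckError::BadGateway => !config.ignore_http_bad_gateway,
        CheckError::Other(_) => true,
    }
}
```

With `Config::default()`, a timeout still produces an alert, which matches the claim that default behavior is unchanged. Setting only `ignore_rpc_timeout` to `true` silences timeouts while every other error kind keeps alerting.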

Signed-off-by: Long Zhang <gluckzhang@gmail.com>
mergify bot requested a review from a team on March 4, 2025 08:41