You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We use Gitlab’s retry:when: rule with runner_system_failure to retry jobs when they fail due to an infrastructure problem, and we also have metrics and alerts for system failures that depend on Gitlab's classifcation. So far, using the anka-cloud-gitlab-executor, all failures (including system failures) return ERROR: Job failed: exit status 1 which registers in Gitlab as a script error. When using the anka-gitlab-runner instead, these errors were classified correctly in Gitlab.
For example, a recent job ended with this output:
2024/08/19 20:22:18 instance 16210410-9555-42d7-42a8-3a6c6740be8c is in state "Scheduling"
2024/08/19 20:22:21 instance 16210410-9555-42d7-42a8-3a6c6740be8c is in state "Error"
2024/08/19 20:22:21 error: failed to wait for instance "16210410-9555-42d7-42a8-3a6c6740be8c" to be scheduled: instance 16210410-9555-42d7-42a8-3a6c6740be8c is in an unexpected state: Error
ERROR: Job failed: exit status 1
When we query the Gitlab API for this job, we would expect the field "failure_reason" to have the value "runner_system_failure", but instead it comes back as "script_failure".
In our research of the problem, we came across these docs about how to send exit codes from a custom executor to Gitlab such that it appears as a system failure.
The text was updated successfully, but these errors were encountered:
We use Gitlab’s
retry:when:
rule withrunner_system_failure
to retry jobs when they fail due to an infrastructure problem, and we also have metrics and alerts for system failures that depend on Gitlab's classifcation. So far, using theanka-cloud-gitlab-executor
, all failures (including system failures) returnERROR: Job failed: exit status 1
which registers in Gitlab as a script error. When using theanka-gitlab-runner
instead, these errors were classified correctly in Gitlab.For example, a recent job ended with this output:
When we query the Gitlab API for this job, we would expect the field
"failure_reason"
to have the value"runner_system_failure"
, but instead it comes back as"script_failure"
.In our research of the problem, we came across these docs about how to send exit codes from a custom executor to Gitlab such that it appears as a system failure.
The text was updated successfully, but these errors were encountered: