Don't handle success callbacks after timeout (C* <= 2.1) #97

Merged

@Bj0rnen (Collaborator) commented May 9, 2017

This is an important fix that we merged into Spotify's production Reaper at the beginning of the year (spotify/cassandra-reaper#148). The "old repair API" (C* 1.2, 2.0, maybe 2.1 too) inexplicably returns SESSION_SUCCESS when a repair session is aborted. Aborting the session is exactly what we do with timed-out segments, so this led to a race condition: a segment that takes too long can be flagged as "done" and never get repaired.
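To illustrate the idea, here is a minimal sketch of ignoring a success callback that arrives after a timeout. It is not the code from this pull request: the class `SegmentOutcomeSketch`, its `State` enum, and both method names are hypothetical, and the sketch assumes the timeout handler records the timeout before it aborts the repair session. A single atomic state transition decides which of the two events wins.

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical illustration only; not Reaper's actual classes or API.
public class SegmentOutcomeSketch {

    public enum State { RUNNING, DONE, TIMED_OUT }

    private final AtomicReference<State> state =
            new AtomicReference<>(State.RUNNING);

    // Assumed to be called by the timeout handler BEFORE it aborts the session,
    // so a SESSION_SUCCESS triggered by the abort always arrives afterwards.
    // Returns true if the segment was still running and is now timed out.
    public boolean markTimedOut() {
        return state.compareAndSet(State.RUNNING, State.TIMED_OUT);
    }

    // Assumed to be called when the repair API reports SESSION_SUCCESS.
    // Returns true only if the segment was still running; a success that
    // arrives after the timeout is ignored and leaves the state unchanged.
    public boolean handleSessionSuccess() {
        return state.compareAndSet(State.RUNNING, State.DONE);
    }

    public State currentState() {
        return state.get();
    }
}
```

With this shape, a segment that timed out stays in `TIMED_OUT` even if the aborted session later reports success, so the caller can schedule it for another attempt instead of treating it as repaired.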

@adejanovski merged commit b15e743 into thelastpickle:master May 9, 2017

@adejanovski (Contributor) commented:

I never ran into this one, so it may be handled a bit differently in 2.0 (or I never had a "spotted" timeout in 2.1).

Thanks!

@Bj0rnen deleted the spotify/NoSuccessAfterTimeout branch May 9, 2017 13:37