Refactor JOB_PROCESS_WHERE_QUERY to get an optimized query plan #869
+8
−12
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request is a simple "refactoring" of the job-pulling query performed by agenda, that yieldsbetter results when there are lots of "expired" jobs in database. As far as I can see, there's no functional impact.
Context:
I recently ended up in a situation with a lot of "expired" jobs in database, i.e. jobs with
nextRunAt
< current date.In that situation, I saw mongod CPU consumption climbing pretty high, and after checking mongo logs, I saw that the regular "job pulling" request performed by agenda scanned a lot of index keys.
I managed to reproduce this on my computer. I first created 100k "expired Jobs":
Then checked the explain plan of the job pulling request done by agenda:
So 10k index keys were examined :(
One of my colleague played with the query, and found out that with a simple "refactoring" of the request, we got a much better query plan:
Only 100~ keys examined!