
Fix crash with old resultset.json #824

Merged
merged 3 commits into master from fix-crash-against-old-resultset-json
Jan 18, 2020

Conversation


@PragTob PragTob commented Jan 15, 2020

ResultMerger#store_result:

      clear_resultset
      new_set = resultset
      command_name, data = result.to_hash.first

it might throw out the result, it might not 😱

Hence I adjusted the command name to be unique (for now) so that we're always confronted with the old bad data.

It's expected that this fails now, as the fix is still a TODO atm. I just wanted to put it up here so that I don't have to keep updating #820 all the time.

  • write basic feature
  • make feature runnable on all machines (adjust paths in saved json)
  • add another scenario that also updates the time stamp to not get sorted out through the merge timeout
  • actually fix it
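For context, the format difference at the root of this crash looks roughly like this (a minimal sketch: file names and timestamps are made up; the extra "lines" nesting is the change discussed later in this thread):

```ruby
# Old .resultset.json shape: coverage arrays stored directly per file
old_format = {
  "RSpec" => {
    "coverage" => { "lib/foo.rb" => [1, 0, nil] },
    "timestamp" => 1_579_000_000
  }
}

# New shape: per-file data nested under a "lines" key
new_format = {
  "RSpec" => {
    "coverage" => { "lib/foo.rb" => { "lines" => [1, 0, nil] } },
    "timestamp" => 1_579_000_000
  }
}

# Code expecting the new shape gets an Array where it expects a Hash
puts old_format.dig("RSpec", "coverage", "lib/foo.rb").class # Array
puts new_format.dig("RSpec", "coverage", "lib/foo.rb").class # Hash
```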

@PragTob PragTob force-pushed the fix-crash-against-old-resultset-json branch 3 times, most recently from 542090d to 8a3c80c Compare January 16, 2020 12:22
Ensure a different name so merging doesn't just throw it out

ResultMerger#store_result:

          clear_resultset
          new_set = resultset
          command_name, data = result.to_hash.first

it might throw out the result, it might not 😱
* make sure that the timestamp being current is also tested
* adjust file paths so they actually end up trying to get merged
  and not being ignored
@PragTob PragTob force-pushed the fix-crash-against-old-resultset-json branch from 8a3c80c to 23c8b94 Compare January 17, 2020 20:35
We do a light check on every result if it looks like it conforms
to our file format. If not, we ignore it and throw it out.

Initial plan was to just catch an error and return a known good
result, but sadly since we don't know what order the results come
in that doesn't really work (as the first could be the "bad" result).
Might be able to change this if we always knew we started with
a known good result.

Checking feels kinda clunky, but I honestly didn't know of
a better way right now.
@PragTob PragTob changed the title from "[WIP] Reproduction of crash with old resultset.json" to "Fix crash with old resultset.json" on Jan 17, 2020

def matches_current_format?(result)
# I so wish I could already use pattern matching
key, data = result.first
Collaborator Author

this whole checking (and the other checks) feels kinda clunky. I didn't know of a better way to handle it right now though :(
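The check discussed here could be sketched like this (an assumption-based stand-in for illustration, not the exact code from the diff):

```ruby
# Returns true if the result hash looks like the current format, i.e.
# every file's coverage data is nested in a hash (e.g. under "lines")
# rather than being a bare array of per-line hit counts.
def matches_current_format?(result)
  _command_name, data = result.first
  return false unless data.is_a?(Hash) && data["coverage"].is_a?(Hash)

  data["coverage"].values.all? { |file_data| file_data.is_a?(Hash) }
end

matches_current_format?("RSpec" => { "coverage" => { "a.rb" => { "lines" => [1, 0] } } }) # => true
matches_current_format?("RSpec" => { "coverage" => { "a.rb" => [1, 0] } })                # => false
```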

@PragTob
Collaborator Author

PragTob commented Jan 17, 2020

@deivid-rodriguez this is ready-ish from my side, but the checking feels pretty clunky. If you manage to get to it, I'd appreciate a look (even if I'd merge it before because I wanna get the release out :) )

@PragTob
Collaborator Author

PragTob commented Jan 18, 2020

Merging to build upon it and eventually release this evening. If someone has a better idea on how to do this, that'd still be mightily appreciated :)

@PragTob PragTob merged commit db804e2 into master Jan 18, 2020
@PragTob PragTob deleted the fix-crash-against-old-resultset-json branch January 18, 2020 14:54
@deivid-rodriguez
Collaborator

I'm sorry @PragTob I couldn't find time to look at this.

@PragTob
Collaborator Author

PragTob commented Jan 18, 2020

@deivid-rodriguez no worries, I also didn't give it a lot of time due to wanting to release today :) If you still find anything here, I'm happy about a comment and I'll fix it up later :D

@@ -1,5 +1,8 @@
# frozen_string_literal: true

# While you're here, handy tip for debugging: try to attach @announce-output
# @announce-command to the scenarios/features to see what's going on
Collaborator

😍

@deivid-rodriguez
Collaborator

I think it would be better if we "auto-healed" the results without giving any warnings. That said, it doesn't seem like a big deal, but with a warning the user might be confused and think she's doing something wrong when she isn't.

@deivid-rodriguez
Collaborator

Also, is the issue you mention in the PR body about results being discarded what this PR is fixing?

@PragTob
Collaborator Author

PragTob commented Jan 19, 2020

@deivid-rodriguez thanks for checking in!

Yeah, figured that / knew I knew it from somewhere. Wanted to revisit #687.

As for auto-healing, I don't think it's needed. The resultset is from an old run anyhow, so we wouldn't want to merge it. Ignoring it is fine; I wanted to make clear in the message that the problem should go away, but maybe I can formulate it better?

Also, if we change the format again, auto-healing becomes more and more involved for (imo) no benefit. :)

@deivid-rodriguez
Collaborator

I read the code a bit, and it seems to me that the only change in the format is a new nesting level with ":lines" or ":branches" keys. The way I would've approached it is to add a ":lines" nesting to old reports and then merge them normally. I don't see a reason why we shouldn't merge them if they're within the "time window" for merging the user has set.
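The "auto-heal" alternative described above could look roughly like this (a sketch; the name `upgrade_resultset` and the exact structure are assumptions for illustration, not code from this PR):

```ruby
# Hypothetical upgrade step: wrap each file's bare coverage array in
# the new { "lines" => ... } nesting, leaving everything else untouched,
# so old-format results could be merged normally afterwards.
def upgrade_resultset(resultset)
  resultset.transform_values do |data|
    upgraded = data["coverage"].transform_values do |file_data|
      file_data.is_a?(Array) ? { "lines" => file_data } : file_data
    end
    data.merge("coverage" => upgraded)
  end
end

old = { "RSpec" => { "coverage" => { "a.rb" => [1, nil, 0] }, "timestamp" => 0 } }
upgrade_resultset(old)
# => { "RSpec" => { "coverage" => { "a.rb" => { "lines" => [1, nil, 0] } }, "timestamp" => 0 } }
```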

@PragTob
Collaborator Author

PragTob commented Jan 20, 2020

@deivid-rodriguez imo the time window is only there because we have no good mechanism of saying when a new test run was started (because after rspec is finished cucumber might run and we only ever start new processes). However, in practice what we want is to only ever merge reports from the same test run. If the .resultset.json was done with an old version of SimpleCov then I'm 100% sure that this was an older test run and I wouldn't want to merge that.

So, I don't think that that's the right behaviour.

You are right that, to achieve it right now, that's the only thing we'd have to do. However, if we changed the format again (which we well might), that'd add different "fixes" which could grow more complex. Plus I really don't believe we should merge them, as outlined above. If we really could make sure that we're the first/last relevant test process, I'd even be in favor of deleting the cache file, alas I can't think of how we could determine that.

@deivid-rodriguez
Collaborator

> @deivid-rodriguez imo the time window is only there because we have no good mechanism of saying when a new test run was started (because after rspec is finished cucumber might run and we only ever start new processes). However, in practice what we want is to only ever merge reports from the same test run. If the .resultset.json was done with an old version of SimpleCov then I'm 100% sure that this was an older test run and I wouldn't want to merge that.

That's not how I understood auto-merging. The way I understood it is: consider current coverage results accurate during the specified time window, and merge them with subsequent reports during that time window instead of discarding them. I find this useful, for example, for development workflows where you only run single tests or parts of a test suite, because they refer to the part of the program I'm changing and running the whole thing is too heavy. With auto-merging, I have continuous feedback about how my progress affects global coverage results. Without it, I lose all the other information.

@PragTob
Collaborator Author

PragTob commented Jan 20, 2020

@deivid-rodriguez that's an interesting view of it that I hadn't considered before 👌

I see merging just as a vehicle to be able to merge results of multiple test suites (rspec, test unit, cucumber...) and parallel test execution.

With your use case, might it not be counterproductive? I.e. if you make coverage worse you won't notice, because it merges with the old results, so the result you get would be more like the total of what you covered across all your test runs.

Moreover, I think it might lead to other problems: if you also changed the code so that lines move around, different lines may be marked as covered just because their position in the file changed.

@deivid-rodriguez
Collaborator

> With your use case, might it not be counterproductive? I.e. if you make coverage worse you won't notice, because it merges with the old results, so the result you get would be more like the total of what you covered across all your test runs.

The lines covered vs. total lines will change accordingly even if the percentage is similar/the same. Anyways, this is how it currently works, so I assume people are tweaking the available options (use_merging or merge_timeout), or the available formatters, to fit their needs/tastes.

> Moreover, I think it might lead to other problems: if you also changed the code so that lines move around, different lines may be marked as covered just because their position in the file changed.

SimpleCov shows warnings when this happens, which is good. Probably a good time for refreshing the whole thing :)

Anyways, I'm happy with the way you did it too, I just wanted to comment with the way I would've done it. The warnings were a bit annoying in my test suite after the upgrade because I have many, many different test suites, which all try to merge with global results and print the warning. But nothing that rm -rf coverage can't fix :)
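For reference, the two settings named above (use_merging and merge_timeout) are SimpleCov configuration options; a typical test helper might tweak them like this (the values here are examples only):

```ruby
# spec_helper.rb / test_helper.rb
require "simplecov"

SimpleCov.start do
  use_merging true     # merge results from multiple suites/processes
  merge_timeout 3600   # only merge results younger than this many seconds
end
```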

@deivid-rodriguez
Collaborator

Thanks for all your recent work by the way, it's awesome!

@PragTob
Collaborator Author

PragTob commented Jan 20, 2020

Nah, thanks for the feedback! 👌

You're right that's how it currently works. The warning should only show up once though, right? 🤔

@deivid-rodriguez
Collaborator

Right, if you discard the invalid resultset the first time, there should be no warnings after that. Mmmm, maybe there's a bug somewhere 🤷‍♂️, I'll see if I can reproduce the multi-warning situation.

@PragTob
Collaborator Author

PragTob commented Jan 20, 2020

Yeah that'd immensely suck if I messed that up... let me check with our own scenario real quick...

F*ck... I just reproduced it, we still always seem to write it back into the cache file 😢
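A rough sketch of the follow-up fix implied here: filter out non-conforming entries before writing the resultset back, so the bad data is not persisted again and re-warned about (both helper names are hypothetical, for illustration only):

```ruby
# Hypothetical light format check, as discussed earlier in the thread.
def looks_like_current_format?(data)
  data.is_a?(Hash) && data["coverage"].is_a?(Hash) &&
    data["coverage"].values.all? { |file_data| file_data.is_a?(Hash) }
end

# Keep only entries that pass the check before writing .resultset.json,
# so an old-format entry is dropped from the cache file for good.
def persistable_resultset(resultset)
  resultset.select { |_command_name, data| looks_like_current_format?(data) }
end

mixed = {
  "old-run" => { "coverage" => { "a.rb" => [1, 0] } },
  "new-run" => { "coverage" => { "a.rb" => { "lines" => [1, 0] } } }
}
persistable_resultset(mixed).keys # => ["new-run"]
```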

@PragTob
Collaborator Author

PragTob commented Jan 20, 2020

@deivid-rodriguez welp... I'll write a feature, work on it, and release beta3 I guess. It only affects people whose old resultset is recent enough to fall within their merge timeout, but yeah...
