Unable to start/stop drivers concurrently on Server #8525

titusfortner · 2020-07-15T00:17:49Z

💥 Regression Report

Getting errors when starting more than one driver session at a time from different Threads. ~~Extra browsers are started, so I'm not sure if it failing on starting the 3rd one or if it breaks when trying to quit the session.~~ The server appears to lose track of one session after another has been started.

Last working Selenium version

3.141.59

Stopped working in version:

4.x

To Reproduce

This is the broken spec:
https://github.com/SeleniumHQ/selenium/blob/dd7090cab54beb2be2541a08746ddaeb8783c496/rb/spec/integration/selenium/webdriver/spec_support/shared_examples/concurrent_driver.rb

./go //rb:remote-chrome-test

It's the same failure/stack track fro both firefox and chrome - https://travis-ci.org/github/titusfortner/selenium/jobs/708131143#L718

Expected behavior

More than one browser starts and then is correctly closed

Actual behavior

Here's the log of requests for sessions and the server not recognizing the sessions:
https://gist.github.com/titusfortner/a4076e1f3dc8ef4115acc7a84fde9bce

It includes messages like:
invalid session id and NoSuchSessionException

Environment

OS: All
Browser: All
Language Bindings version: Ruby trunk
Selenium Grid version (if applicable): trunk

The text was updated successfully, but these errors were encountered:

AutomatedTester · 2020-07-15T08:42:00Z

I think that you're going to need to help us investigate it more.

Bazel runs tests in parallel by default so wondering if there is something else going on here.

titusfortner · 2020-07-15T15:09:58Z

I should have at least included logs when I filed this, so I added a gist above.

The test is starting the sessions from different threads.

Note that it is Chromedriver complaining about the session id, so it's like the server is mixing up which Session ID is associated with which driver.

Interestingly, when I use the most recently created session first, that one seems to work properly, but then the next two fail with the same invalid session id: https://gist.github.com/titusfortner/d8896b8fbd1d3ca2ceb9d55411af8aa9

barancev · 2020-07-16T08:00:14Z

How to reproduce the issue? I can't find rb_server_toggle branch on github and I can't see it merged to trunk.

titusfortner · 2020-07-16T17:42:05Z

Yes, it got merged in; It can be replicated with: ./go //rb:remote-chrome-test
We've guarded and are tracking this bug fo it in trunk, so you can see the results here: https://travis-ci.com/github/SeleniumHQ/selenium/jobs/361722656#L732 and here: https://travis-ci.com/github/SeleniumHQ/selenium/jobs/361722657#L729

Let me know if there is more info I can provide.

shs96c · 2020-07-21T20:52:47Z

This should be fixed by a3e0daf. @titusfortner can you please confirm, and reopen the issue if there's still a problem?

titusfortner · 2020-07-22T20:35:09Z

@shs96c
Haven't been able to reproduce the problem with firefox, and of course this doesn't work with Safari
But Chrome, both Linux & Mac are having regular issues with this.
From Travis: https://travis-ci.com/github/SeleniumHQ/selenium/jobs/363687203#L741

The 2 errors I'm consistently getting locally (the first looks more helpful for diagnosing):
https://gist.github.com/titusfortner/5d290b31c8015b3ef96c5761f7510cb5

AutomatedTester · 2020-08-24T21:17:44Z

Having a quick look at this, I think

From Travis: https://travis-ci.com/github/SeleniumHQ/selenium/jobs/363687203#L741

is unrelated.

I have also tried to run tests concurrently with a server and can't replicate this issue with python bindings.

I am going to remove the Se4 tag for now until we can get down to the nitty gritty of the issue (which I will leave to you @titusfortner )

titusfortner · 2020-08-25T15:58:47Z

@AutomatedTester It's reproducible in our test suite, and still an issue:
https://travis-ci.com/github/SeleniumHQ/selenium/jobs/376455420#L744
https://travis-ci.com/github/SeleniumHQ/selenium/jobs/376455421#L717

bjuric · 2020-09-12T06:10:22Z

I ran into the exact same issue when I tried upgrading Selenium from 3.141.59 to the latest 4.x (4.0.0-alpha-6) on my gwen-web project. Multiple parallel sessions and switching between windows is not working in 4.x. Happy to help test this again if you have discovered the cause and have a fix.

titusfortner · 2020-09-12T19:54:59Z

@bjuric what language are you using?

bjuric · 2020-09-12T22:24:09Z

I'm using the Java selenium impl.

…

On Sun, 13 Sep 2020, 5:55 am Titus, ***@***.***> wrote: @bjuric <https://github.com/bjuric> what language are you using? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#8525 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAKOPCXO4WFOU2F5JS2BQHDSFPG2DANCNFSM4O2AK6EA> .

titusfortner · 2020-09-12T22:37:19Z

@bjuric do you have reproducible code, so we have more to go off of than just my Ruby code?

bjuric · 2020-09-13T09:49:28Z

@titusfortner Here is a gist which replicates the issue with the java impl: https://gist.github.com/bjuric/ccf7dbb5546d9c323c29d2b8de6ff6ff

If you remove the fluent wait line, the parallel execution works correctly. Likely an issue with fluent wait when there are multiple web driver instances running at the same time.

bjuric · 2020-09-18T05:17:39Z

@titusfortner Turns out my issue was to do with fluent wait not using my custom thread pool (service executor). I've raised a PR here to fix that: #8713

shs96c · 2020-09-18T16:53:14Z

The java code appears to be working as intended....

bjuric · 2020-09-19T06:13:12Z

Interesting, I was able reproduce the problem using Java 8 (oracle and openjdk) on two machines; a mac with 4 threads and a custom PC with 12 threads. On the former it failed immediately, but on the latter it worked the first time, but then failed for all subsequent runs. Applying the #8713 PR fix above and passing the executor to fluent wait, fixed all instances of the same tests and passed consistently on every launch and ran faster too.

bjuric · 2020-09-23T14:34:50Z

One way I've found to work around this (that feels a little dirty tbh) is to set the following system property to force the CompletableFuture in FluentWait to run on the calling thread.

java.util.concurrent.ForkJoinPool.common.parallelism=0

The asynchronous nature of waits etc now in Selenium 4 come at a cost to concurrency it would seem.

titusfortner · 2021-01-13T21:51:06Z

@shs96c this issue is what is causing the Ruby tests to fail. Since it is crashing the browser and breaking the rest of the test execution, I'm going to completely block this test from running. (just making a note of it on this issue).

To reiterate, I can't reproduce this on my Mac, and I don't have a local linux environment handy to check it against, but it was an issue on Travis & now GitHub. This same code works just fine with the 3.141.59 server, just not with the alpha.

titusfortner · 2021-02-05T06:59:38Z

So, I uncommented the guard in #9147 so we can look at what is going on right now. I can't reproduce it on my mac, and I don't have a Linux machine handy to investigate.

https://github.com/SeleniumHQ/selenium/runs/1836774444?check_suite_focus=true#step:8:263

It's properly starting the 1st session, getting title, closing it. The other 2 are requested but the code never hears back, and it throws a net read timeout error:
https://github.com/SeleniumHQ/selenium/runs/1836774444?check_suite_focus=true#step:8:603

Firefox has same behavior: https://github.com/SeleniumHQ/selenium/runs/1836774400?check_suite_focus=true

…he number of processors detected

titusfortner · 2021-02-11T03:48:48Z

This issue had to do with the fact that CI tools only have 2 processors and new grid only allows one browser per processor. The way the tests were starting browsers concurrently required all 3 sessions to start before any of them could be used.

I fixed it by hard coding only 2 sessions for use on CI tools, which isn't ideal but works.
fwiw, I don't think this is ideal behavior, so might need to open another issue to address root cause.

titusfortner added the C-server label Jul 15, 2020

ghost added the needs-triaging label Jul 15, 2020

AutomatedTester removed the needs-triaging label Jul 15, 2020

shs96c self-assigned this Jul 21, 2020

shs96c added this to the 4.0 milestone Jul 21, 2020

shs96c closed this as completed Jul 21, 2020

titusfortner reopened this Jul 22, 2020

AutomatedTester removed this from the 4.0 milestone Aug 24, 2020

bjuric mentioned this issue Oct 16, 2020

Add withExecutor to FluentWait to support concurrent drivers in custom thread pools #8713

Closed

8 tasks

diemol added C-grid and removed C-server labels Feb 4, 2021

titusfortner mentioned this issue Feb 5, 2021

Fix 8525 by limiting concurrent test on CI #9147

Merged

titusfortner added a commit to titusfortner/selenium that referenced this issue Feb 11, 2021

[rb] fix SeleniumHQ#8525 by starting grid with 4 sessions

eaa9923

titusfortner added a commit to titusfortner/selenium that referenced this issue Feb 11, 2021

[rb] fix SeleniumHQ#8525 by limiting concurrent sessions in test to t…

7ce3f0b

…he number of processors detected

titusfortner added a commit to titusfortner/selenium that referenced this issue Feb 11, 2021

[rb] fix SeleniumHQ#8525 by limiting concurrent sessions in test to t…

fd88246

…he number of processors detected

titusfortner added a commit to titusfortner/selenium that referenced this issue Feb 11, 2021

[rb] fix SeleniumHQ#8525 by hardcoding ci runs to only 2 sessions

0dbad8b

titusfortner added a commit to titusfortner/selenium that referenced this issue Feb 11, 2021

[rb] fix SeleniumHQ#8525 by hardcoding ci runs to only 2 sessions

e728d63

titusfortner added a commit to titusfortner/selenium that referenced this issue Feb 11, 2021

[rb] fix SeleniumHQ#8525 by starting grid with 4 sessions

0f2e177

titusfortner closed this as completed in 5e3439d Feb 11, 2021

github-actions bot locked and limited conversation to collaborators Sep 5, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to start/stop drivers concurrently on Server #8525

Unable to start/stop drivers concurrently on Server #8525

titusfortner commented Jul 15, 2020 •

edited

Loading

AutomatedTester commented Jul 15, 2020

titusfortner commented Jul 15, 2020

barancev commented Jul 16, 2020

titusfortner commented Jul 16, 2020

shs96c commented Jul 21, 2020

titusfortner commented Jul 22, 2020

AutomatedTester commented Aug 24, 2020

titusfortner commented Aug 25, 2020

bjuric commented Sep 12, 2020

titusfortner commented Sep 12, 2020

bjuric commented Sep 12, 2020 via email

titusfortner commented Sep 12, 2020

bjuric commented Sep 13, 2020

bjuric commented Sep 18, 2020

shs96c commented Sep 18, 2020

bjuric commented Sep 19, 2020 •

edited

Loading

bjuric commented Sep 23, 2020 •

edited

Loading

titusfortner commented Jan 13, 2021

titusfortner commented Feb 5, 2021

titusfortner commented Feb 11, 2021

Unable to start/stop drivers concurrently on Server #8525

Unable to start/stop drivers concurrently on Server #8525

Comments

titusfortner commented Jul 15, 2020 • edited Loading

💥 Regression Report

Last working Selenium version

Stopped working in version:

To Reproduce

Expected behavior

Actual behavior

Environment

AutomatedTester commented Jul 15, 2020

titusfortner commented Jul 15, 2020

barancev commented Jul 16, 2020

titusfortner commented Jul 16, 2020

shs96c commented Jul 21, 2020

titusfortner commented Jul 22, 2020

AutomatedTester commented Aug 24, 2020

titusfortner commented Aug 25, 2020

bjuric commented Sep 12, 2020

titusfortner commented Sep 12, 2020

bjuric commented Sep 12, 2020 via email

titusfortner commented Sep 12, 2020

bjuric commented Sep 13, 2020

bjuric commented Sep 18, 2020

shs96c commented Sep 18, 2020

bjuric commented Sep 19, 2020 • edited Loading

bjuric commented Sep 23, 2020 • edited Loading

titusfortner commented Jan 13, 2021

titusfortner commented Feb 5, 2021

titusfortner commented Feb 11, 2021

titusfortner commented Jul 15, 2020 •

edited

Loading

bjuric commented Sep 19, 2020 •

edited

Loading

bjuric commented Sep 23, 2020 •

edited

Loading