[Profiler] Avoid Recursive ThreadStoreLock in Profiling Thread Enumerator #110548

mdh1418 · 2024-12-09T20:18:01Z

Profiling Enumerators look to acquire the ThreadStoreLock.
In release config, re-acquiring the ThreadStoreLock and releasing it in ProfilerThreadEnum::Init will cause problems if the callback invoking EnumThread has logic that depends on the ThreadStoreLock being held.
For example, invoking EnumThread within RuntimeSuspendFinished will violate the expectation that the ThreadStoreLock is held until RestartEE is called, demonstrated in #110062

This PR aims to avoid recursively acquiring the ThreadStoreLock by expanding the known scnearios where the profiling thread enumerator shouldn't acquire the ThreadStoreLock.

jkotas · 2024-12-09T20:28:24Z

Do we need a test for this?

src/coreclr/vm/profilingenumerators.cpp

Profiling Enumerators look to acquire the ThreadStoreLock. In release config, re-acquiring the ThreadStoreLock and releasing it in ProfilerThreadEnum::Init will cause problems if the callback invoking EnumThread has logic that depends on the ThreadStoreLock being held. To avoid recursively acquiring the ThreadStoreLock, expand the condition when the profiling thread enumerator shouldn't acquire the ThreadStoreLock.

There was a potential race condition when setting the flag before suspending and resetting the flag after restarting. For example, if the thread restarting runtime is preempted right after resuming runtime, the flag could remain unset by the time another thread looks to suspend runtime, which would see that the flag as set.

davmason

LGTM

mdh1418 · 2024-12-11T21:22:33Z

Given that this test should run quickly when it passes, and its expected failure case is when it hangs/deadlocks, is there a straightforward way to set a quick timeout for this test instead of hitting CI's timeouts? And is there a way to hit such a timeout in a local run via corerun? @davmason @jkotas

davmason · 2024-12-11T21:26:03Z

The problem with timeouts is you can't check them in because we run a lot of tests in various modes (GCStress, JITStress, etc) that can make them take 10+ minutes. For this kind of thing what I have done in the past is make the timeout short locally and run it in a loop on your dev machine to build confidence, but leave the timeout unchanged in the tree.

mdh1418 · 2024-12-12T19:53:10Z

/backport to release/9.0-staging

github-actions · 2024-12-12T19:53:22Z

Started backporting to release/9.0-staging: https://github.com/dotnet/runtime/actions/runs/12303767691

…ator (dotnet#110548) * [Profiler] Avoid Recursive ThreadStoreLock Profiling Enumerators look to acquire the ThreadStoreLock. In release config, re-acquiring the ThreadStoreLock and releasing it in ProfilerThreadEnum::Init will cause problems if the callback invoking EnumThread has logic that depends on the ThreadStoreLock being held. To avoid recursively acquiring the ThreadStoreLock, expand the condition when the profiling thread enumerator shouldn't acquire the ThreadStoreLock. * [Profiler] Change order to set fProfilerRequestedRuntimeSuspend There was a potential race condition when setting the flag before suspending and resetting the flag after restarting. For example, if the thread restarting runtime is preempted right after resuming runtime, the flag could remain unset by the time another thread looks to suspend runtime, which would see that the flag as set. * [Profiler][Tests] Add unit test for EnumThreads during suspension * [Profiler][Tests] Fixup EnumThreads test

mdh1418 requested review from noahfalk, jkotas, VSadov and davmason December 9, 2024 20:18

dotnet-issue-labeler bot added the area-Diagnostics-coreclr label Dec 9, 2024

dotnet-policy-service bot assigned mdh1418 Dec 9, 2024

mdh1418 changed the title ~~[Profiler] Avoid Recursive ThreadStoreLock~~ [Profiler] Avoid Recursive ThreadStoreLock in Profiling Thread Enumerator Dec 9, 2024

mdh1418 marked this pull request as ready for review December 9, 2024 20:18

VSadov reviewed Dec 9, 2024

View reviewed changes

src/coreclr/vm/profilingenumerators.cpp Outdated Show resolved Hide resolved

VSadov reviewed Dec 9, 2024

View reviewed changes

src/coreclr/vm/profilingenumerators.cpp Outdated Show resolved Hide resolved

mdh1418 added 3 commits December 10, 2024 12:49

[Profiler][Tests] Add unit test for EnumThreads during suspension

d56d8bb

mdh1418 force-pushed the profiler_thread_enum_avoid_recursive_thread_store_lock branch from cd6f6f5 to d56d8bb Compare December 10, 2024 21:06

davmason approved these changes Dec 10, 2024

View reviewed changes

build-analysis bot mentioned this pull request Dec 11, 2024

Test failures in JSImportGenerator.Unit.Tests.Compiles.ValidateGeneratedSourceOutput #110575

Closed

[Profiler][Tests] Fixup EnumThreads test

fca15ed

mdh1418 merged commit a390024 into dotnet:main Dec 11, 2024
96 checks passed

mdh1418 deleted the profiler_thread_enum_avoid_recursive_thread_store_lock branch December 11, 2024 21:35

github-actions bot mentioned this pull request Dec 12, 2024

[release/9.0-staging] [Profiler] Avoid Recursive ThreadStoreLock in Profiling Thread Enumerator #110665

Merged

4 tasks

github-actions bot locked and limited conversation to collaborators Jan 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Profiler] Avoid Recursive ThreadStoreLock in Profiling Thread Enumerator #110548

[Profiler] Avoid Recursive ThreadStoreLock in Profiling Thread Enumerator #110548

mdh1418 commented Dec 9, 2024

jkotas commented Dec 9, 2024

davmason left a comment

mdh1418 commented Dec 11, 2024

davmason commented Dec 11, 2024

mdh1418 commented Dec 12, 2024

github-actions bot commented Dec 12, 2024

[Profiler] Avoid Recursive ThreadStoreLock in Profiling Thread Enumerator #110548

[Profiler] Avoid Recursive ThreadStoreLock in Profiling Thread Enumerator #110548

Conversation

mdh1418 commented Dec 9, 2024

jkotas commented Dec 9, 2024

davmason left a comment

Choose a reason for hiding this comment

mdh1418 commented Dec 11, 2024

davmason commented Dec 11, 2024

mdh1418 commented Dec 12, 2024

github-actions bot commented Dec 12, 2024