When a performance problem is encountered on Linux, these instructions can be used to gather detailed information about what was happening on the machine at the time of the performance problem.
#Required Tools#
- perfcollect: Bash script that automates data collection.
- Available at http://aka.ms/perfcollect.
- PerfView: Windows-based performance tool that can also analyze trace files collected with Perfcollect.
- Available at http://aka.ms/perfview.
#Preparing Your Machine# Follow these steps to prepare your machine to collect a performance trace.
-
Download Perfcollect.
curl -OL http://aka.ms/perfcollect
-
Make the script executable.
chmod +x perfcollect
-
Install tracing prerequisites - these are the actual tracing libraries. For details on prerequisites, see below.
sudo ./perfcollect install
#Collecting a Trace#
-
Have two shell windows available - one for controlling tracing, referred to as [Trace], and one for running the application, referred to as [App].
-
[App] Setup the application shell - this enables tracing configuration inside of CoreCLR.
export COMPlus_PerfMapEnabled=1 export COMPlus_EnableEventLog=1
-
[Trace] Start collection.
sudo ./perfcollect collect sampleTrace
Expected Output:
Collection started. Press CTRL+C to stop.
-
[App] Run the app - let it run as long as you need to in order to capture the performance problem. Generally, you don't need very long. As an example, for a CPU investigation, 5-10 seconds of the high CPU situation is usually enough.
dotnet run
-
[Trace] Stop collection - hit CTRL+C.
^C ...STOPPED. Starting post-processing. This may take some time. Generating native image symbol files ...SKIPPED Saving native symbols ...FINISHED Exporting perf.data file ...FINISHED Compressing trace files ...FINISHED Cleaning up artifacts ...FINISHED Trace saved to sampleTrace.trace.zip
The compressed trace file is now stored in the current working directory.
#Viewing a Trace# Traces are best viewed using PerfView on Windows. Note that we're currently looking into porting the analysis pieces of PerfView to Linux so that the entire investigation can occur on Linux.
##Open the Trace File##
-
Copy the trace.zip file from Linux to a Windows machine.
-
Download PerfView from http://aka.ms/perfview.
-
Run PerfView.exe
PerfView.exe <path to trace.zip file>
##Select a View## PerfView will display the list of views that are supported based on the data contained in the trace file.
- For CPU investigations, choose CPU stacks.
- For very detailed GC information, choose GCStats.
- For per-process/module/method JIT information, choose JITStats.
- If there is not a view for the information you need, you can try looking for the events in the raw events view. Choose Events.
For more details on how to interpret views in PerfView, see help links in the view itself, or from the main window in PerfView choose Help->Users Guide.
#Extra Information# This information is not strictly required to collect and analyze traces, but is provided for those who are interested.
##Prerequisites## Perfcollect will alert users to any prerequisites that are not installed and offer to install them. Prerequisites can be installed automatically by running:
sudo ./perfcollect install
The current prerequisites are:
- perf: Also known as perf_event, the Linux Performance Events sub-system and companion user-mode collection/viewer application. perf is part of the Linux kernel source, but is not usually installed by default.
- LTTng: Stands for "Linux Tracing Toolkit Next Generation", and is used to capture event data emitted at runtime by CoreCLR. This data is then used to analyze the behavior of various runtime components such as the GC, JIT and thread pool.