
[NativeAOT] Cache location of unwind sections #82994

Merged: 4 commits into dotnet:main on Mar 7, 2023

Conversation

filipnavara
Member

@filipnavara filipnavara commented Mar 5, 2023

Abstract

In issue #77568, exception handling performance was tested in various scenarios. For Linux AOT, a bottleneck was identified in the findUnwindSections method. Specifically, in the multi-threaded scenario there is a significant performance penalty due to the use of the dl_iterate_phdr API, which internally takes a lock.

A simple observation is that nearly all the frames we try to unwind belong to compiled managed code, which always uses the same unwind table. We can cache the value upfront and avoid all the lookups entirely.

Another side-effect of this is that it also helps the code paths that do thread hijacking during GC, and it potentially avoids some locks in those code paths.

Implementation

The implementation moves the FindProcInfo and VirtualUnwind methods into UnixCodeManager, where the UnwindInfoSections value is cached.

The llvm-libunwind API offers two ways to inject cached information about the unwind sections. It can be done through a custom AddressSpace class implementation, which has the benefit that the high-level C++ API can be reused by switching a single template parameter. Alternatively, the low-level C++ API can be used directly, with the information passed to it. Since the unwinding code already used the low-level API in most cases, I opted for that route.

Testing

Test code

using System.Diagnostics;
using System.Runtime.CompilerServices;

internal class Program
{
    static int exceptionsHandled = 0;

    [MethodImpl(MethodImplOptions.NoInlining)]
    private static void CallMe(int i)
    {
        if (i == 0)
        {
            throw new NotImplementedException();
        }

        CallMe(i - 1);
    }

    [MethodImpl(MethodImplOptions.NoInlining)]
    private static void CatchMe()
    {
        try
        {
            CallMe(100);
        }
        catch (NotImplementedException)
        {
            Interlocked.Increment(ref exceptionsHandled);
        }
    }

    private static void ThreadEntrypoint()
    {
        while (true)
        {
            CatchMe();
        }
    }

    private static void Main(string[] args)
    {
        int savedExceptionsHandled = 0;
        for (int i = 0; i < 10; i++)
        {
            new Thread(ThreadEntrypoint).Start();
        }
        Thread.Sleep(5000);
        savedExceptionsHandled = exceptionsHandled;
        Console.WriteLine($"Exceptions per second: {savedExceptionsHandled / 5}");
        Environment.Exit(0);
    }
}

The test code was injected into an empty application created with dotnet new console and then compiled with dotnet publish -p:PublishAot=true -r linux-x64 -c Debug.

My test configuration is a Ryzen 7950X machine running Ubuntu 22.04.2 LTS in the Windows Subsystem for Linux. The baseline is .NET 8 Preview 1, where I get ~19,500 exceptions per second. With this PR I get around 145,000 exceptions per second, more than 7 times the baseline throughput.

I also briefly tested on a MacBook Air M1 in the osx-arm64 configuration. Throughput with this PR is about 1.78 times the .NET 8 Preview 1 baseline.

@ghost ghost added the community-contribution Indicates that the PR has been added by a community member label Mar 5, 2023
@ghost

ghost commented Mar 5, 2023

Tagging subscribers to this area: @agocke, @MichalStrehovsky, @jkotas
See info in area-owners.md if you want to be subscribed.

Issue Details

TBD: Just testing build on different configurations...

Author: filipnavara
Assignees: -
Labels:

community-contribution, area-NativeAOT-coreclr

Milestone: -

@filipnavara filipnavara marked this pull request as ready for review March 5, 2023 14:59
@janvorli
Member

janvorli commented Mar 6, 2023

@filipnavara the result is awesome. I have run the tests I used in my analysis with this change. Originally, Linux NativeAOT was clearly not scaling at all; now the multi-threaded performance is only about 10% worse than the single-threaded one.

Member

@janvorli janvorli left a comment


LGTM, thank you!

@VSadov
Member

VSadov commented Mar 6, 2023

This can improve GC root reporting too as that performs stack walks and in server GC case does it on multiple threads.

@VSadov
Member

VSadov commented Mar 6, 2023

/azp run runtime-extra-platforms

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

Member

@VSadov VSadov left a comment


Very nice! Thanks!!

@VSadov VSadov merged commit 013ca67 into dotnet:main Mar 7, 2023
@filipnavara filipnavara deleted the cache_unwind_sections branch March 7, 2023 06:37
@marek-safar marek-safar changed the title [NativeAOT] Experiment: Cache location of unwind sections [NativeAOT] Cache location of unwind sections Mar 9, 2023
@ghost ghost locked as resolved and limited conversation to collaborators Apr 8, 2023