JIT: Add 3-opt implementation for improving upon RPO-based layout #103450

amanasifkhalid · 2024-06-13T22:32:28Z

No description provided.

dotnet-policy-service · 2024-06-13T22:33:00Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

AndyAyersMS · 2024-06-15T01:10:16Z

src/coreclr/jit/fgopt.cpp

+        improvedLayout              = false;
+        BasicBlock* const exitBlock = blockVector[blockCount - 1];
+
+        for (unsigned i = 1; i < (blockCount - 1); i++)


I think the root of the TP cost is here -- we want to avoid having to search for possible cut points.

One approach is to just pick randomly, but I think we can do better for now. Roughly speaking in the pass above we should find all blocks that are either not just before their optimal successor and/or not just after their optimal successor.

We can rank these by the difference in the current vs optimal score. Then greedily pick the worst, that gives the first cut point. For the second cut point you can pick the best pred for the first cut point's current next block, or the best succ for the current pred of the first cut point's ideal successor. That is, if we have

S ~~~~ 1|2 ~~~ 3|4 ~~~ 5|6 ~~~ E 1's ideal succ is 4 reordering is S ~~~~ 1|4 ~~~ 5|2 ~~~ 3|6 ~~~ E So we either try and find a 5 which is the ideal pred of 2, or a 6 which is the ideal succ of 3. Failing that we might pick some other block that is not currently followed by its ideal succ.

So one idea is to keep 3 values for each block: its min score, current score, and best score (lower is better). Order the blocks by current-min. Pick of the best as the first split, and then see if any of the next few provide a good second split.

Likely though this ends up needing a priority queue or similar as once we accept an arrangement we need to update some of the costings...

dotnet-policy-service · 2024-07-15T03:17:47Z

Draft Pull Request was automatically closed for 30 days of inactivity. Please let us know if you'd like to reopen it.

amanasifkhalid · 2024-09-30T18:12:42Z

@AndyAyersMS thanks for bearing with me on this. I've implemented your suggestion of building and maintaining a priority queue of cut points, and this seems to be sufficiently cheap. Diffs show plenty of variance in asmdiffs across platforms, though this looks like a net PerfScore win. To contain the number of iterations, we currently consider each edge at most once; we probably don't want to limit the search space too much, though these limitations had pretty small diffs locally, so it seems like the current approach is fixing the most obvious instances of subpar layout.

I haven't implemented this for methods with EH yet, though I'm thinking of leaving the cutpoint search as-is, and then after reordering blocks, we can make EH regions contiguous by "bubbling up" the next EH block we see to its predecessor. This fixup can break up fallthrough from EH exits into non-EH blocks, but it will maintain the relative ordering such that the exit jump is forward; for now, breaking up such fallthrough seems necessary. With this approach, we can get rid of the EH fixup logic in earlier ordering passes (RPO layout, fgMoveColdBlocks, etc) and win back some TP -- but I wanted to evaluate that separately from this PR.

I think this PR is in good shape, so I thought I'd ping you now in case you want to take a look, though I don't plan to push to merge this until we get the LSRA changes where we want them.

amanasifkhalid added 5 commits June 13, 2024 14:05

Implement k-opt for non-EH methods

5813e1a

Enable for methods with EH

3f4e749

Merge branch 'main' into k-opt-layout

da291af

Add comments

699714b

Style

0849f76

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jun 13, 2024

dotnet-policy-service bot assigned amanasifkhalid Jun 13, 2024

amanasifkhalid added 4 commits June 13, 2024 18:33

Merge branch 'main' into k-opt-layout

e223927

Only one iteration for now; try to reduce TP cost

044b332

Remove initial layout cost calculation

f0e7f6b

Conditionalize EH checks

41efb9b

amanasifkhalid mentioned this pull request Jun 14, 2024

Widespread perf regressions due to RPO layout #102763

Open

AndyAyersMS reviewed Jun 15, 2024

View reviewed changes

dotnet-policy-service bot closed this Jul 15, 2024

github-actions bot locked and limited conversation to collaborators Aug 14, 2024

amanasifkhalid added 8 commits September 18, 2024 10:15

Merge from main

5b7a85e

Add priority queue impl

94c2272

wip

0081d9b

Fix lambda capture

4700e65

Merge branch 'main' into k-opt-layout

cabacf9

Consider forward conditional jumps

ebb7e6a

Remove debug print

4eb4471

Consider backward jumps; find more initial candidates

0175b18

amanasifkhalid reopened this Sep 25, 2024

amanasifkhalid added 4 commits September 25, 2024 16:02

Revert irrelevant changes

bbc28df

Missed a few

2e507be

Add JitDump check

9ed6452

Add more candidate edges when reordering

40fc6bc

amanasifkhalid added 3 commits September 25, 2024 21:29

Merge branch 'main' into k-opt-layout

0b8e830

Don't add duplicate edges to cutPoints

a3b7392

Consider each candidate edge at most once

d468a8a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: Add 3-opt implementation for improving upon RPO-based layout #103450

JIT: Add 3-opt implementation for improving upon RPO-based layout #103450

amanasifkhalid commented Jun 13, 2024

dotnet-policy-service bot commented Jun 13, 2024

AndyAyersMS Jun 15, 2024

dotnet-policy-service bot commented Jul 15, 2024

amanasifkhalid commented Sep 30, 2024

JIT: Add 3-opt implementation for improving upon RPO-based layout #103450

Are you sure you want to change the base?

JIT: Add 3-opt implementation for improving upon RPO-based layout #103450

Conversation

amanasifkhalid commented Jun 13, 2024

dotnet-policy-service bot commented Jun 13, 2024

AndyAyersMS Jun 15, 2024

Choose a reason for hiding this comment

dotnet-policy-service bot commented Jul 15, 2024

amanasifkhalid commented Sep 30, 2024