JIT: use root compiler instance for sufficient PGO observation #115119

AndyAyersMS · 2025-04-28T16:11:38Z

During inlining, we evaluate some aspects of an inlinee's viability while importing its direct caller. If that caller is not the inline root, we may make inconsistent observations of the overall state of PGO. So for PGO observations, always consult the root compiler.

For example, the root R may have decided to inline a small method A that did not have PGO (say because of minimal profiling or lack of PGO for always inlined R2R methods), and that method calls another method B; we want to evaluate the viability of B using the PGO state of R, not of A.
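The R / A / B scenario above can be modeled with a small toy. This is only a sketch under stated assumptions: `ToyCompiler`, `inlineRoot`, and the two `callSiteHasProfileWeights*` helpers are hypothetical names standing in for the JIT's `Compiler`, `impInlineRoot()`, and the `CALLSITE_HAS_PROFILE_WEIGHTS` observation. The real JIT may well keep a direct pointer to the root rather than walking a parent chain.

```cpp
#include <cassert>

// Hypothetical stand-in for the JIT's per-method Compiler instance.
struct ToyCompiler
{
    const char*  name;
    ToyCompiler* inlineParent;     // nullptr for the root method being jitted
    bool         hasSufficientPgo; // models fgHaveSufficientProfileWeights()

    // Models impInlineRoot(): walk up the inline chain to the root method.
    ToyCompiler* inlineRoot()
    {
        ToyCompiler* c = this;
        while (c->inlineParent != nullptr)
        {
            c = c->inlineParent;
        }
        return c;
    }
};

// Old behavior: the observation reflects the candidate's direct caller.
bool callSiteHasProfileWeightsOld(ToyCompiler& directCaller)
{
    return directCaller.hasSufficientPgo;
}

// New behavior: the observation reflects the inline root.
bool callSiteHasProfileWeightsNew(ToyCompiler& directCaller)
{
    ToyCompiler* root = directCaller.inlineRoot();
    assert(root != nullptr);
    return root->hasSufficientPgo;
}
```

With R having sufficient PGO and A (inlined into R) having none, evaluating B at a call site in A yields `false` under the old helper and `true` under the new one, which is the inconsistency described above.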

Copilot

Pull Request Overview

This PR refactors inlining behavior to ensure that PGO observations are based on the root compiler instance. Key changes include:

Logging updates in inlinepolicy.cpp to reflect the PGO state of the root compiler.
Modification in importercalls.cpp to use impInlineRoot()->fgHaveSufficientProfileWeights() for consistency.
Additional debug messages in fginline.cpp to report the PGO data state.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
src/coreclr/jit/inlinepolicy.cpp	Added and refined JITDUMP logging for inline candidate evaluation.
src/coreclr/jit/importercalls.cpp	Updated to reference the root compiler PGO state.
src/coreclr/jit/fginline.cpp	Introduced extra debug logging for PGO status in inliner.

Comments suppressed due to low confidence (3)

src/coreclr/jit/inlinepolicy.cpp:1375

[nitpick] Consider including additional context—such as the inline candidate identifier—in this debug log for clearer traceability of inlining decisions.

JITDUMP("Callee has trusted profile\n");

src/coreclr/jit/inlinepolicy.cpp:1410

[nitpick] Consider appending the inline candidate’s identifier to this debug message to improve the clarity and usefulness when diagnosing inlining failures.

JITDUMP("Callee IL size %u exceeds maxCodeSize %u\n", m_CodeSize, maxCodeSize);

src/coreclr/jit/fginline.cpp:790

[nitpick] Ensure that these verbose debug logs are appropriately gated for production builds to avoid unintended performance impacts.

JITDUMP("INLINER: pgo source is %s; pgo data is %sconsistent; %strusted; %ssufficient\n", compGetPgoSourceName(), fgPgoConsistent ? "" : "not ", fgHaveTrustedProfileWeights() ? "" : "not ", fgHaveSufficientProfileWeights() ? "" : "not ");

AndyAyersMS · 2025-04-28T16:12:17Z

@dotnet/jit-contrib PTAL

Likely will be difficult to see the impact via SPMI/PMI.

AndyAyersMS · 2025-04-28T16:12:43Z

src/coreclr/jit/importercalls.cpp

@@ -9256,7 +9256,7 @@ void Compiler::impCheckCanInline(GenTreeCall*           call,
        // Profile data allows us to avoid early "too many IL bytes" outs.
        //
        inlineResult->NoteBool(InlineObservation::CALLSITE_HAS_PROFILE_WEIGHTS,
-                               compiler->fgHaveSufficientProfileWeights());
+                               compiler->impInlineRoot()->fgHaveSufficientProfileWeights());


This is the actual change; the rest just dumps more state.

AndyAyersMS · 2025-04-28T19:13:32Z

Lots of missed contexts (2% ish). I will collect a bespoke ASP.NET SPMI to get a clearer picture.

AndyAyersMS · 2025-04-29T00:50:36Z

> Lots of missed contexts (2% ish). I will collect a bespoke ASP.NET SPMI to get a clearer picture.

Locally this seems to be too much (excluding ~800 contexts that don't replay with base)

[17:46:35] Asm diffs found
[17:46:35] Total instructions executed by base: 201022764491
[17:46:35] Total instructions executed by diff: 218769331734
[17:46:35] Total instructions executed delta: 17746567243 (8.83% of base)

[17:42:53] Total bytes of base: 66211175
[17:42:53] Total bytes of diff: 68489412
[17:42:53] Total bytes of delta: 2278237 (3.44% of base)
[17:42:53]
[17:42:53] Total PerfScore of base: 66735541.16814355
[17:42:53] Total PerfScore of diff: 66848347.7220094
[17:42:53] Total PerfScore of delta: 112806.55386584997 (0.17% of base)
[17:42:53]
[17:42:53] Relative PerfScore Geomean: 0.5682%
[17:42:53] Relative PerfScore Geomean (Diffs): 39.2229%
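For reading the summary above: the "delta (% of base)" figures are consistent with (diff - base) / base. A minimal sketch to reproduce them, where `percentOfBase` is a hypothetical helper name and not part of SuperPMI:

```cpp
#include <cstdint>

// Hypothetical helper: relative change of diff versus base, in percent.
// The counts in the log are well below 2^53, so the conversion to double is exact.
double percentOfBase(std::uint64_t base, std::uint64_t diff)
{
    const double b = static_cast<double>(base);
    const double d = static_cast<double>(diff);
    return 100.0 * (d - b) / b;
}
```

Feeding in the instruction counts (201022764491 and 218769331734) gives roughly 8.83, and the byte totals (66211175 and 68489412) give roughly 3.44, matching the log.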

However most of the big diffs are from OSR methods, and my bespoke SPMI artificially enhances the set of OSR methods.

[17:42:54] Top method regressions (bytes):
[17:42:54]        10314 (1,121.09% of base) : 107161.dasm - Microsoft.EntityFrameworkCore.Metadata.Conventions.Internal.ConventionDispatcher+ImmediateConventionScope:OnEntityTypeAdded(Microsoft.EntityFrameworkCore.Metadata.Builders.IConventionEntityTypeBuilder):Microsoft.EntityFrameworkCore.Metadata.Builders.IConventionEntityTypeBuilder:this (Tier1-OSR)
[17:42:54]        10242 (1,113.26% of base) : 58257.dasm - Microsoft.EntityFrameworkCore.Metadata.Conventions.Internal.ConventionDispatcher+ImmediateConventionScope:OnEntityTypeAdded(Microsoft.EntityFrameworkCore.Metadata.Builders.IConventionEntityTypeBuilder):Microsoft.EntityFrameworkCore.Metadata.Builders.IConventionEntityTypeBuilder:this (Tier1-OSR)
[17:42:54]         8914 (633.10% of base) : 171459.dasm - Markdig.Parsers.ParserList`2[System.__Canon,System.__Canon]:.ctor(System.Collections.Generic.IEnumerable`1[System.__Canon]):this (Tier1-OSR)
[17:42:54]         8781 (624.98% of base) : 171463.dasm - Markdig.Parsers.ParserList`2[System.__Canon,System.__Canon]:.ctor(System.Collections.Generic.IEnumerable`1[System.__Canon]):this (Tier1-OSR)
[17:42:54]         8218 (2,257.69% of base) : 107077.dasm - System.Diagnostics.Metrics.MeterListener:Start():this (Tier1-OSR)
[17:42:54]         8216 (2,257.14% of base) : 122787.dasm - System.Diagnostics.Metrics.MeterListener:Start():this (Tier1-OSR)
[17:42:54]         8215 (2,256.87% of base) : 110339.dasm - System.Diagnostics.Metrics.MeterListener:Start():this (Tier1-OSR)
[17:42:54]         8215 (2,256.87% of base) : 90999.dasm - System.Diagnostics.Metrics.MeterListener:Start():this (Tier1-OSR)

Going to try restricting this to be non-OSR and see what that looks like

AndyAyersMS · 2025-04-29T01:30:41Z

Still seems quite impactful with OSR using the old behavior -- this is with a local release baseline to rule out possible compiler version issues

[18:26:56] Total instructions executed by base: 201039018573
[18:26:56] Total instructions executed by diff: 211365390699
[18:26:56] Total instructions executed delta: 10326372126 (5.14% of base)

[17:57:53] Total bytes of base: 66211175
[17:57:53] Total bytes of diff: 67616223
[17:57:53] Total bytes of delta: 1405048 (2.12% of base)
[17:57:53]
[17:57:53] Total PerfScore of base: 66735541.16814355
[17:57:53] Total PerfScore of diff: 66586403.12200942
[17:57:53] Total PerfScore of delta: -149138.04613412917 (-0.22% of base)
[17:57:53]
[17:57:53] Relative PerfScore Geomean: 0.5572%
[17:57:53] Relative PerfScore Geomean (Diffs): 48.4549%

Next step might be to introduce an intermediate position for the case where the root compiler has sufficient PGO but the calling compiler doesn't. The main impact on the heuristic here is the max IL size the inliner will consider: with sufficient PGO this is 1024, without it's 128. So maybe in this mixed mode we choose some value in between.
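A rough sketch of what such a mixed mode could look like. `chooseMaxILSize` and `mixedLimit` are hypothetical and not taken from the JIT sources; only the 1024 and 128 limits come from the comment above. The in-between value is deliberately left as a parameter because the comment does not pick one, and the handling of "caller has PGO, root does not" simply keeps the caller-based answer as an assumption.

```cpp
#include <cassert>

// Hypothetical helper, not the JIT's actual code.
// Returns the max IL size the inliner would consider for a candidate.
unsigned chooseMaxILSize(bool rootHasSufficientPgo, bool callerHasSufficientPgo, unsigned mixedLimit)
{
    const unsigned withPgo    = 1024; // limit quoted for sufficient PGO
    const unsigned withoutPgo = 128;  // limit quoted for insufficient PGO

    // The mixed-mode limit is assumed to lie between the two known limits.
    assert(mixedLimit >= withoutPgo && mixedLimit <= withPgo);

    if (callerHasSufficientPgo)
    {
        return withPgo; // assumption: same as the caller-based behavior
    }
    if (rootHasSufficientPgo)
    {
        return mixedLimit; // mixed mode: root has PGO, direct caller does not
    }
    return withoutPgo;
}
```

For instance, with a placeholder `mixedLimit` of 512, a candidate called from a PGO-less inlinee under a PGO-rich root would be allowed up to 512 instead of 128 or 1024.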

AndyAyersMS · 2025-04-29T20:08:46Z

@EgorBo FYI -- seems quite costly still.

EgorBo · 2025-05-02T11:53:12Z

I think the change makes total sense. I guess we can just revert it if the dotnet/performance results don't make it look worth it.

AndyAyersMS · 2025-05-16T18:22:22Z

@EgorBo this latest may be a bit more palatable, if we like what it does we can always crank it up.

Locally on a bespoke asp.net, it was +0.5% code size, +1.5% TP, though there are some context misses I still need to correct for.

(looks like the misses are only there for disasm .. we must be making queries there we don't make normally?). So TP data is likely accurate, and code size is an underestimate.

AndyAyersMS · 2025-05-17T18:10:46Z

@EgorBo take another look when you can

AndyAyersMS · 2025-05-27T22:29:03Z

Related Regressions:

[Perf] Windows/arm64: 1 Regression on 5/19/2025 6:24:41 PM +00:00 perf-autofiling-issues#55949
[Perf] Windows/x64: 37 Regressions on 5/19/2025 4:32:47 PM +00:00 perf-autofiling-issues#56199
[Perf] Windows/x64: 25 Regressions on 5/19/2025 4:32:47 PM +00:00 perf-autofiling-issues#56161
[Perf] Linux/x64: 22 Regressions on 5/19/2025 4:32:47 PM +00:00 perf-autofiling-issues#56208
[Perf] Linux/x64: 20 Regressions on 5/19/2025 4:32:47 PM +00:00 perf-autofiling-issues#56153
[Perf] Linux/x64: 1 Regression on 5/19/2025 12:29:04 PM +00:00 perf-autofiling-issues#56152

Improvements:

Copilot AI review requested due to automatic review settings April 28, 2025 16:11

ghost added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Apr 28, 2025

dotnet-policy-service bot assigned AndyAyersMS Apr 28, 2025

Copilot AI reviewed Apr 28, 2025

AndyAyersMS commented Apr 28, 2025

build-analysis bot mentioned this pull request Apr 28, 2025

SmtpClientSendMailTest_SendAsync.MultipleRecipients_Failure_All test failure #115070

Closed

filipnavara mentioned this pull request Apr 28, 2025

Abolish PSPSym from ABI #114630

Merged

EgorBo approved these changes May 2, 2025

AndyAyersMS added 2 commits May 16, 2025 07:15

Merge branch 'main' into PgoMystery

4cbfda4

try boosting some but not as much

3fc63f9

EgorBo approved these changes May 19, 2025

AndyAyersMS merged commit 34f1db4 into dotnet:main May 19, 2025
108 checks passed

LoopedBard3 mentioned this pull request May 22, 2025

[Perf] Linux/arm64: 19 Regressions on 5/19/2025 6:24:41 PM +00:00 #115904

Open

AndyAyersMS mentioned this pull request May 27, 2025

[Perf] Windows/x64: 7 Improvements on 5/19/2025 8:58:33 PM +00:00 dotnet/perf-autofiling-issues#56249

Closed

LoopedBard3 mentioned this pull request Jun 5, 2025

[Perf] Windows/x64: 1 Regression on 3/20/2025 10:20:08 AM +00:00 (Improved + Closed) dotnet/perf-autofiling-issues#52338

Closed

AndyAyersMS mentioned this pull request Jun 6, 2025

[Perf] Regressions from inliner policy change #114996

Open

AndyAyersMS mentioned this pull request Jun 17, 2025

JIT: De-abstraction in .NET 10 #108913

Open

This was referenced Jun 18, 2025

[Perf] Windows/x64: 38 Regressions on 1/21/2025 8:48:11 PM +00:00 #111912

Closed

[Perf] Windows/x64: BenchStone Regressions on 1/29/2025 7:05:56 PM +00:00 #112136

Closed

github-actions bot locked and limited conversation to collaborators Jun 27, 2025
