[BOLT][test] Fix callcont-fallthru.s after #129481 #135867

aaupov · 2025-04-15T22:21:03Z

Only set --synthetic nm flag in link_fdata if requested explicitly.

Test Plan: bin/llvm-lit -a tools/bolt/test/X86/callcont-fallthru.s

Created using spr 1.3.4

llvmbot · 2025-04-15T22:21:34Z

@llvm/pr-subscribers-bolt

Author: Amir Ayupov (aaupov)

Changes

Force the use of llvm-nm for PREAGGPLT check.

Test Plan: bin/llvm-lit -a tools/bolt/test/X86/callcont-fallthru.s

Full diff: https://github.com/llvm/llvm-project/pull/135867.diff

1 Files Affected:

(modified) bolt/test/X86/callcont-fallthru.s (+1-1)

diff --git a/bolt/test/X86/callcont-fallthru.s b/bolt/test/X86/callcont-fallthru.s
index ee72d8f62e032..6b5caa08d3128 100644
--- a/bolt/test/X86/callcont-fallthru.s
+++ b/bolt/test/X86/callcont-fallthru.s
@@ -9,7 +9,7 @@
 # RUN: link_fdata %s %t %t.pa3 PREAGG3
 # RUN: link_fdata %s %t %t.pat PREAGGT1
 # RUN: link_fdata %s %t %t.pat2 PREAGGT2
-# RUN: link_fdata %s %t %t.patplt PREAGGPLT
+# RUN: link_fdata %s %t %t.patplt PREAGGPLT --nmtool llvm-nm
 
 ## Check normal case: fallthrough is not LP or secondary entry.
 # RUN: llvm-strip --strip-unneeded %t -o %t.strip

Created using spr 1.3.4

paschalis-mpeis

Hey Amir,

Thanks for the PR. Unfortunately, it is still failing. The trick below doesn't seem to work on my buildbot machine:

Link against a DSO to ensure PLT entries.

So doing:

nm --synthetic callcont-fallthru.s.tmp

won't list a puts@plt symbol, which is what causes an link_fdata.py assertion:

AssertionError: ERROR: symbol puts@plt is not defined in binary

On my dev AArch64 instance --synthetic does the trick. BTW run lines 4 and 6 appear identical when inspected (-###)

aaupov · 2025-04-16T20:55:03Z

@MaskRay – can you please advise how to force a PLT entry if linking with a DSO hack doesn't work?

MaskRay · 2025-04-17T02:48:15Z

Hey Amir,

Thanks for the PR. Unfortunately, it is still failing. The trick below doesn't seem to work on my buildbot machine:

Link against a DSO to ensure PLT entries.

So doing:
nm --synthetic callcont-fallthru.s.tmp
won't list a puts@plt symbol, which is what causes an link_fdata.py assertion:

AssertionError: ERROR: symbol puts@plt is not defined in binary

On my dev AArch64 instance --synthetic does the trick. BTW run lines 4 and 6 appear identical when inspected (-###)

You need a libc.so that defines puts, and then creates an executable that references puts and links against libc.so. Then the executable will have a PLT entry, and you do not need the --unresolved-symbols=ignore-all hack.

paschalis-mpeis · 2025-04-22T11:52:34Z

Thanks a lot both! In case there's some delay in resolving this edge case, may I suggest temporarily disabling this test on AArch64 until a more consistent workaround is in place?

paschalis-mpeis · 2025-04-25T13:25:33Z

Hey folks, any updates on this?

I spent some time experimenting with @MaskRay's suggestion. I used a mock libc shared object that had a puts symbol.
Indeed there won't be unresolved symbols now, however, still GNU nm doesn't show a PLT entry when using --synthetic .

yota9 · 2025-04-29T19:50:51Z

Maybe I'm missing something but why use /dev/null library hack in the first place here? There is stub.c available with puts symbol already, just use it to compile so and link against it, plt entry should appear normally.

paschalis-mpeis · 2025-04-29T20:37:52Z

Hey @yota9, thanks for the input. I tried something similar.
Even when I use stub.c and link it with:

-# RUN: %clang %cflags -fpic -shared -xc /dev/null -o %t.so
-## Link against a DSO to ensure PLT entries.
+# RUN: %clang %cflags %p/../Inputs/stub.c -fPIC -shared -o %t.so

then running GNU nm:

nm %t --synthetic

would emit only

                 U puts

which link_fdata rejects. On some other machines though, GNU nm emits:

                 U puts
0000000000001234 T puts@plt

which works well. In both cases it was the same nm driver version.
TMU this inconsistency was reported on x86 machines too.

I might've missed something on my end. I briefly discussed this with Amir (see discord) as I'm trying to unblock our AArch64 buildbot. We figured it's fine to disable this test on AArch64 until the issue gets resolved. Could you mind taking a look at #137831, and consider accepting it?

yota9 · 2025-04-30T07:03:48Z

@paschalis-mpeis Could you please check if the binaries are identical and it is indeed nm problem? E.g. with objdump, is plt entry is there? Maybe there is problem related to the plt section type, e.g. one of the binaries has .plt.sec or .plt.got section and there is some kind of but in nm that not lists symbols from these sections. Then we can use custom linker script with .plt section only.
My next suggestion would be just using llvm-nm here. Pass llvm-nm with --nmtool arg to link_fdata.py, since the lit.cfg.py has it in the list of mandatory tools for bolt testing, so we won't have environment dependencies here. Maybe even add nmtool as an link_fdata_cmd arg in lit.cfg.py , so all tests would use it by default...

paschalis-mpeis · 2025-04-30T09:20:57Z

Hey @yota9, thanks for the suggestions!

Indeed, the PLT entries exist in both binaries. For example running:

build/bin/llvm-objdump -d -j .plt build/tools/bolt/test/X86/Output/callcont-fallthru.s.tmp

shows:

build/tools/bolt/test/X86/Output/callcont-fallthru.s.tmp:       file format elf64-x86-64

Disassembly of section .plt:
0000000000001430 <.plt>:
    1430: ff 35 f2 20 00 00             pushq   0x20f2(%rip)            # 0x3528 <puts+0x3528>
    1436: ff 25 f4 20 00 00             jmpq    *0x20f4(%rip)           # 0x3530 <puts+0x3530>
    143c: 0f 1f 40 00                   nopl    (%rax)

0000000000001440 <puts@plt>:
    1440: ff 25 f2 20 00 00             jmpq    *0x20f2(%rip)           # 0x3538 <puts+0x3538>
    1446: 68 00 00 00 00                pushq   $0x0
    144b: e9 e0 ff ff ff                jmp     0x1430 <.plt>

I noticed some code differences in the binaries but I haven't looked deeper into it.

It looks like it's differences in GNU nm though:

On my AArch64 dev-machine, nm --synthetic lists puts@plt, but when I copy that same binary over to our upcoming AArch64 buildbot, it's missing.

Conversely, nm --synthetic on the buildbot does not list puts@plt, but when if I copy that binary to the dev-machine it does appear.

I too agree that relying on GNU is not ideal. Essentially using any binary tool that does not come from the built LLVM revision. However, llvm-nm does not seem support --synthetic.

BTW, thanks for all the help! I'm focused on AArch64, so while I may be involved to some extent with this, I'll let Amir drive the fix. That's why I'm looking for a code owner to get #137831 stamped. :)
(also cc'ing: @aaupov, @maksfb)

aaupov · 2025-04-30T11:47:57Z

Thanks for tracking it down, looks like it's an issue with GNU nm. However llvm-nm has no functionality equivalent to nm --synthetic which prints the address of the PLT entry, and the test relies on that.

Let me try to decouple this test from GNU nm.

paschalis-mpeis · 2025-04-30T12:34:19Z

Great. A quick way to use an llvm tool could be:

llvm-objdump -d -j .plt %t | grep @plt

This produces output similar to what nm --synthetic produces (when it works):

0000000000001430 <puts@plt>:

You'll need ofc to tweak link_fdata to properly parse symbol+address:

llvm-project/bolt/test/link_fdata.py

Lines 96 to 99 in fb8d61d

    
           symval, _, symname = symline.split(maxsplit=2) 
        
           if symname in symbols and args.no_redefine: 
        
               continue 
        
           symbols[symname] = symval

Not sure of any cleaner approach? (@yota9, @MaskRay)

yota9 · 2025-05-02T09:42:02Z

I've decided to add synthetic option to llvm-nm here #138232 . Unfortunately it would take some time, as main maintainer won't be able to review it soon, so probably for now we might just mark the test as XFAIL until then.. Not forgetting to replace nm with llvm-nm

paschalis-mpeis · 2025-05-02T10:02:47Z

That is perfect and the way we should go forward with this – thanks @yota9.

The problem is that the test is flaky: it passes on most systems but fails on a few.
UsingXFAIL would make my AArch64 buildbot happy but it will cause failures (Unexpectedly Passed) on other AArch64 machines I've tested . 🤷‍♂️

That's why I propose restricting this to X86 for now, as a way to unblock us in the meantime:

[BOLT][test] Disable callcont-fallthru.s on AArch64 #137831

llvm-project/bolt/test/X86/callcont-fallthru.s

Lines 3 to 5 in 42d76a3


	# REQUIRES: x86_64-linux

yota9 · 2025-05-02T10:07:55Z

@paschalis-mpeis Indeed, you're right. Let's wait about @aaupov decision then, it LGTM

Compatible with GNU nm --syntethic option is used to show special symbols created by the linker. Current implementation is limited to show plt entries in the form of symbol@plt and plt entry address. Currently it would be used for BOLT testing purposes (llvm#135867) in order to eliminate external GNU nm dependency.

yota9 · 2025-05-02T15:27:59Z

@paschalis-mpeis I realised that if would change the nm to llvm-nm that we can just mark test as xfail, as it would fail until the patch above would be submitted. This way we would guarantee to have proper changes it test.

paschalis-mpeis · 2025-05-02T15:52:31Z

Yeap, good idea. I could add XFAIL and modify runline like:

# RUN: link_fdata %s %t %t.patplt PREAGGPLT --synthetic --nmtool=llvm-nm

The differences would be :

with REQUIRES we won't cross-run this x86 lit test on AArch64 (as I do currently in #137831)
with XFAIL + llvm-nm the test would be expected to fail on both architectures. But once your work is merged, it would unexpectedly pass, which would break the test and prompt us to update it

I'm happy to proceed with this as well.

yota9 · 2025-05-02T15:54:16Z

Yeah, that's right. Although maybe we need to replace nm to llvm-nm in link_fdata to be default... Up to you and @aaupov to decide..

paschalis-mpeis · 2025-05-02T16:02:15Z

Yes, and that'd actually be better so we don't depend on whatever host GNU nm the machine has.
Based on this, I'd say @aaupov intends to make this change too. I think he's away – let's see what he says once back.

aaupov · 2025-05-14T19:27:18Z

This approach doesn't solve the problem in case nm is symlinked to llvm-nm which doesn't have the flag. Abandon in favor of #139953.

[𝘀𝗽𝗿] initial version

b6b5e29

Created using spr 1.3.4

aaupov requested review from maksfb, rafaelauler, ayermolo, dcci and yota9 as code owners April 15, 2025 22:21

llvmbot added the BOLT label Apr 15, 2025

aaupov mentioned this pull request Apr 15, 2025

[BOLT] Accept PLT fall-throughs as valid traces #129481

Merged

provide synthetic opt

fb8d61d

Created using spr 1.3.4

aaupov requested a review from paschalis-mpeis April 15, 2025 22:45

paschalis-mpeis reviewed Apr 16, 2025

View reviewed changes

paschalis-mpeis mentioned this pull request Apr 29, 2025

[BOLT][test] Disable callcont-fallthru.s on AArch64 #137831

Closed

yota9 mentioned this pull request May 2, 2025

[llvm-nm] Introduce synthetic flag #138232

Open

MaskRay approved these changes May 3, 2025

View reviewed changes

aaupov closed this May 14, 2025

[BOLT][test] Fix callcont-fallthru.s after #129481 #135867

[BOLT][test] Fix callcont-fallthru.s after #129481 #135867

Uh oh!

Conversation

aaupov commented Apr 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Apr 15, 2025

Uh oh!

paschalis-mpeis left a comment

Choose a reason for hiding this comment

Uh oh!

aaupov commented Apr 16, 2025

Uh oh!

MaskRay commented Apr 17, 2025

Uh oh!

paschalis-mpeis commented Apr 22, 2025

Uh oh!

paschalis-mpeis commented Apr 25, 2025

Uh oh!

yota9 commented Apr 29, 2025

Uh oh!

paschalis-mpeis commented Apr 29, 2025

Uh oh!

yota9 commented Apr 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paschalis-mpeis commented Apr 30, 2025

Uh oh!

aaupov commented Apr 30, 2025

Uh oh!

paschalis-mpeis commented Apr 30, 2025

Uh oh!

yota9 commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paschalis-mpeis commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yota9 commented May 2, 2025

Uh oh!

yota9 commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paschalis-mpeis commented May 2, 2025

Uh oh!

yota9 commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paschalis-mpeis commented May 2, 2025

Uh oh!

aaupov commented May 14, 2025

Uh oh!

Uh oh!

aaupov commented Apr 15, 2025 •

edited

Loading

yota9 commented Apr 30, 2025 •

edited

Loading

yota9 commented May 2, 2025 •

edited

Loading

paschalis-mpeis commented May 2, 2025 •

edited

Loading

yota9 commented May 2, 2025 •

edited

Loading

yota9 commented May 2, 2025 •

edited

Loading