[libclc] Support the generic address space #137183

frasercrmck · 2025-04-24T14:26:09Z

This commit provides definitions of builtins with the generic address space.

It is assumed that all current libclc targets can support the generic address space.

One concept to consider is the difference between supporting the generic address space from the user's perspective and the requirement for libclc as a compiler implementation detail to define separate generic address space builtins. In practice a target (like NVPTX) might notionally support the generic address space, but it's mapped to the same LLVM target address space as another address space (often the private one).

In such cases libclc must be careful not to define both private and generic overloads of the same builtin. We track these two concepts separately, and make the assumption that if the generic address space does clash with another, it's with the private one. We track the concepts separately because there are some builtins such as atomics that are defined for the generic address space but not the private address space.
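To make the two concepts concrete, here is a minimal stand-alone sketch in plain C++. It is not libclc code: the macros `GENERIC_AS_SUPPORTED` and `GENERIC_AS_DISTINCT`, the tag types, and the function names are all hypothetical, and ordinary structs stand in for OpenCL address-space-qualified pointers. It only shows why an ordinary builtin needs both conditions before a generic overload is added, while an atomic-style builtin needs only the first.

```cpp
#include <string>

// Hypothetical configuration flags (not libclc's real macro names).
#define GENERIC_AS_SUPPORTED 1 // the user can use generic pointers
#define GENERIC_AS_DISTINCT 0  // 0: generic shares private's target address space

// Tag types stand in for address-space-qualified pointer types.
struct private_ptr { int *p; };
#if GENERIC_AS_DISTINCT
struct generic_ptr { int *p; };
#else
using generic_ptr = private_ptr; // same type, so a second overload would clash
#endif

// Ordinary builtin: the private overload always exists.
std::string ordinary_builtin(private_ptr) { return "private"; }
#if GENERIC_AS_SUPPORTED && GENERIC_AS_DISTINCT
// Only add the generic overload when it is a genuinely different symbol.
std::string ordinary_builtin(generic_ptr) { return "generic"; }
#endif

// Atomic-style builtin: defined for generic but never for private, so it is
// guarded by "supported" alone.
#if GENERIC_AS_SUPPORTED
std::string atomic_style_builtin(generic_ptr) { return "generic"; }
#endif
```

With `GENERIC_AS_DISTINCT` set to 0, a call with a generic pointer resolves to the private overload of the ordinary builtin, and the atomic-style builtin is still available.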

frasercrmck · 2025-04-24T14:26:21Z

CC @wenju-he

arsenm

Even as a compiler implementation detail, libclc should not need to consider the address space mapping (unless maybe you're directly using IR)

There is a clang bug if there is different mangling. The itanium mangling should be coming from the source type / original address space, not whatever IR address space value that happens to map to

frasercrmck · 2025-04-24T16:14:06Z

> There is a clang bug if there is different mangling. The itanium mangling should be coming from the source type / original address space, not whatever IR address space value that happens to map to

Yeah, that would be nice but this is what's happening, I'm afraid.

It is actually supported in the Itanium mangler:

    if (Context.getASTContext().addressSpaceMapManglingFor(AS)) {
      //  <target-addrspace> ::= "AS" <address-space-number>
      unsigned TargetAS = Context.getASTContext().getTargetAddressSpace(AS);
      if (TargetAS != 0 ||
          Context.getASTContext().getTargetAddressSpace(LangAS::Default) != 0)
        ASString = "AS" + llvm::utostr(TargetAS);
    } else {
      switch (AS) {
      default: llvm_unreachable("Not a language specific address space");
      //  <OpenCL-addrspace> ::= "CL" [ "global" | "local" | "constant" |
      //                                "private"| "generic" | "device" |
      //                                "host" ]
      case LangAS::opencl_global:
        ASString = "CLglobal";
        break;

It's just that targets we care about in libclc unconditionally enable that address space map mangling for all address spaces, such as AMDGPU and NVPTX.

I'm not sure I would want to change this behaviour at this point. At least not for the purposes of enabling generic address space support in libclc. There will be a bunch of downstream toolchains that rely on the current mangling scheme.
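As an illustration of the collision described above, the following is a simplified stand-alone model of the mangler branch quoted in this comment. It is not clang's code: `TargetModel`, `mangleAS`, and the reduced `LangAS` enum are hypothetical, and the address space numbers are assumptions taken from the values mentioned later in this thread (private = 0 and generic = 0 for an NVPTX-like target, private = 5 and generic = 0 for an AMDGPU-like target).

```cpp
#include <map>
#include <string>

// Reduced stand-in for clang's LangAS enum (hypothetical).
enum class LangAS { Default, Private, Generic };

struct TargetModel {
  std::map<LangAS, unsigned> targetAS; // language AS -> target AS number
  bool mapMangling;                    // models addressSpaceMapManglingFor()
};

// Returns the address-space part of a mangled name in this simplified model.
std::string mangleAS(const TargetModel &t, LangAS as) {
  if (t.mapMangling) {
    unsigned n = t.targetAS.at(as);
    if (n != 0 || t.targetAS.at(LangAS::Default) != 0)
      return "AS" + std::to_string(n);
    return ""; // target address space 0 gets no qualifier
  }
  switch (as) {
  case LangAS::Private: return "CLprivate";
  case LangAS::Generic: return "CLgeneric";
  default: return ""; // the real mangler treats this as unreachable
  }
}
```

In this model, an NVPTX-like map with map mangling enabled gives private and generic the same (empty) qualifier, so two overloads differing only in that address space would mangle identically. An AMDGPU-like map yields `AS5` for private and no qualifier for generic, so the names stay distinct. With map mangling disabled, the language-level names `CLprivate` and `CLgeneric` are used and never collide.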

arsenm · 2025-04-24T19:01:46Z

> It is actually supported in the Itanium mangler:

I don't remember this part of the hack. There was a recent fix to always use the correct mapping values for AMDGPU when generic address space is enabled (which should be the only mapping, still need to do something about the setAddressSpaceMap hack).

arsenm · 2025-04-24T19:02:37Z

libclc/CMakeLists.txt

+    # FIXME: Shouldn't clang automatically enable this extension based on the
+    # target?


Yes, the extension should be reported as available or not by the target macros

We should enable __opencl_c_generic_address_space for AMDGPU and NVPTX in the setSupportedOpenCLOpts API rather than in this CMakeLists file, right?

yes (although for amdgpu it needs to skip the ancient targets without flat addressing)

See the first PR for AMDGPU support: #137636.

I'll do a separate one for NVPTX. It looks like SPIRV (and X86) enable all by default. That should cover all libclc targets.

libclc/clc/include/clc/math/unary_decl_with_int_ptr.inc

arsenm · 2025-04-24T19:05:13Z

libclc/clc/include/clc/clcfunc.h

+#ifdef __CLC_DISTINCT_GENERIC_ADDRSPACE__
+#define _CLC_DISTINCT_GENERIC_AS_SUPPORTED 1
+#else
+#define _CLC_DISTINCT_GENERIC_AS_SUPPORTED 0
+#endif
+#else


These macro names are too general for the implementation. I don't think this works for anything other than the 0-is-private-and-generic case.

What if you defined a qualifier macro with the value, and check if they are equal

> These macro names are too general for the implementation. I don't think this works for anything other than the 0-is-private-and-generic case.

Yes the assumption is very much currently that it's either fully distinct or 0-is-both. I didn't know how much effort to put into making it fully flexible given our list of targets is fairly static.

I'd be open to making it more flexible. I don't think there's anything technically stopping a target having two or more of constant, local and global mangle to the same target address space, for example. Do we want something in libclc that can take care of all possibilities, or just the generic space with another?

> What if you defined a qualifier macro with the value, and check if they are equal

Could you expand on this, sorry?

I mean something like

    #define __libclc_generic_addrspace_val 0
    #define __libclc_private_addrspace_val 5
    #if __libclc_private_addrspace_val == __libclc_generic_addrspace_val
    // ...
    #endif

Even if not this, the current name is misleading. It's not about generic being supported, if anything it's more like private isn't real
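A slightly fuller sketch of that suggestion, written as stand-alone C++ so it can be compiled outside libclc. Everything here is an assumption for illustration: the `LIBCLC_*` macro names, the `ptr_in` template, and the function `which` are hypothetical, and the values 0 and 5 are just the ones from the snippet above. A template parameterised on the address space number stands in for an address-space-qualified pointer, so equal numbers produce the same type and a second overload would be a redefinition.

```cpp
// Hypothetical per-target values; in practice clang owns the real mapping.
#define LIBCLC_GENERIC_ADDRSPACE_VAL 0
#define LIBCLC_PRIVATE_ADDRSPACE_VAL 5

// Derive a flag whose name states the actual assumption being tested.
#if LIBCLC_PRIVATE_ADDRSPACE_VAL == LIBCLC_GENERIC_ADDRSPACE_VAL
#define LIBCLC_GENERIC_IS_PRIVATE 1
#else
#define LIBCLC_GENERIC_IS_PRIVATE 0
#endif

// Stand-in for a pointer qualified with target address space AS.
template <int AS> struct ptr_in { int *p; };
using private_ptr = ptr_in<LIBCLC_PRIVATE_ADDRSPACE_VAL>;
using generic_ptr = ptr_in<LIBCLC_GENERIC_ADDRSPACE_VAL>;

// The private overload always exists.
inline int which(private_ptr) { return LIBCLC_PRIVATE_ADDRSPACE_VAL; }
#if !LIBCLC_GENERIC_IS_PRIVATE
// The generic overload is only emitted when it is a different type.
inline int which(generic_ptr) { return LIBCLC_GENERIC_ADDRSPACE_VAL; }
#endif
```

If both values were set to 0, the guard would drop the second overload and the code would still compile. Without the guard it would fail at build time with a redefinition error, which is the failure mode the comparison is meant to avoid.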

Yes, something like that. Thanks for clarifying, I was half wondering if you meant adding some new macro definition to clang itself.

I am a bit hesitant to encode actual address space values in libclc as it's ultimately up to clang (and may differ depending on the subtarget/CPU), but either way unless clang gives us more information we have to encode some kind of assumption in libclc. Having some kind of introspection available about the address space mappings would be nice but overkill.

It looks like every target uses private = 0 except for AMDGPU, which has private = 5. For generic we have mostly 0 too, except for DirectX and SPIR (which I think influences SPIRV?), which use 4. It's doable using some preprocessor logic, or some defs passed in from CMake as build options, but I'd worry a bit about it getting out of sync with the actual compiler.

I think the macro could definitely be better worded to include the assumption of "private". Something along the lines of GENERIC_IS_(NOT_)PRIVATE or whatever. Ultimately this only applies to NVPTX so there's also the possibility of using the NVPTX target rather than trying to come up with something general. At least if anything goes wrong it would fail at build time.

This is ultimately a clang bug. The two functions are different symbols. addressSpaceMapManglingFor shouldn't be reporting true in the colliding cases


frasercrmck added the libclc libclc OpenCL library label Apr 24, 2025

frasercrmck requested a review from arsenm April 24, 2025 14:26

arsenm reviewed Apr 24, 2025


frasercrmck added 2 commits April 29, 2025 17:34

Update decl guards

694166d

frasercrmck force-pushed the libclc-generic-addrspace branch from b437f11 to 694166d Compare April 29, 2025 16:41


