History log of /llvm-project/clang/lib/Basic/Cuda.cpp (Results 1 – 25 of 65)
Revision (<<< Hide revision tags) (Show revision tags >>>) Date Author Comments
Revision tags: llvmorg-21-init
# 97c3a990 22-Jan-2025 Sergey Kozub <skozub@nvidia.com>

Remove incorrect CUDA defines (#123898)

Remove CUDA_127 and CUDA_129 defines incorrectly added in
https://github.com/llvm/llvm-project/pull/123398


# 616979eb 21-Jan-2025 Sergey Kozub <skozub@nvidia.com>

[NVPTX] Add support for PTX 8.6 and CUDA 12.6 (12.8) (#123398)

Add CUDA versions 12.7, 12.8, 12.9 which support PTX8.6+ (enables using Blackwell-specific instructions).


Revision tags: llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4
# a6fc489b 18-Nov-2024 Matt Arsenault <Matthew.Arsenault@amd.com>

AMDGPU: Add gfx950 subtarget definitions (#116307)

Mostly a stub, but adds some baseline tests and
tests for removed instructions.


# de0fd64b 13-Nov-2024 Shilei Tian <i@tianshilei.me>

[AMDGPU] Introduce a new generic target `gfx9-4-generic` (#115190)

This patch introduces a new generic target, `gfx9-4-generic`. Since it doesn’t support FP8 and XF32-related instructions, the patch

[AMDGPU] Introduce a new generic target `gfx9-4-generic` (#115190)

This patch introduces a new generic target, `gfx9-4-generic`. Since it doesn’t support FP8 and XF32-related instructions, the patch includes several code reorganizations to accommodate these changes.

show more ...


Revision tags: llvmorg-19.1.3
# 076aac59 23-Oct-2024 Carl Ritson <carl.ritson@amd.com>

[AMDGPU] Add a new target for gfx1153 (#113138)


Revision tags: llvmorg-19.1.2
# 30a06e80 14-Oct-2024 Artem Belevich <tra@google.com>

[CUDA] Add support for CUDA-12.6 and sm_100 (#112028)

This is a copy of #97402(with minor updates), which is now ready to land.

---------

Co-authored-by: Sergey Kozub <skozub@nvidia.com>


Revision tags: llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init
# ab200864 30-Jun-2024 Jakub Chlanda <jakub@codeplay.com>

[CUDA][NFC] CudaArch to OffloadArch rename (#97028)

Rename `CudaArch` to `OffloadArch` to better reflect its content and the
use.
Apply a similar rename to helpers handling the enum.


# 9acb533c 25-Jun-2024 Alex Voicu <alexandru.voicu@amd.com>

[clang][Driver] Add HIPAMD Driver support for AMDGCN flavoured SPIR-V (#95061)

This patch augments the HIPAMD driver to allow it to target AMDGCN
flavoured SPIR-V compilation. It's mostly straightf

[clang][Driver] Add HIPAMD Driver support for AMDGCN flavoured SPIR-V (#95061)

This patch augments the HIPAMD driver to allow it to target AMDGCN
flavoured SPIR-V compilation. It's mostly straightforward, as we re-use
some of the existing SPIRV infra, however there are a few notable
additions:

- we introduce an `amdgcnspirv` offload arch, rather than relying on
using `generic` (this is already fairly overloaded) or simply using
`spirv` or `spirv64` (we'll want to use these to denote unflavoured
SPIRV, once we bring up that capability)
- initially it is won't be possible to mix-in SPIR-V and concrete AMDGPU
targets, as it would require some relatively intrusive surgery in the
HIPAMD Toolchain and the Driver to deal with two triples
(`spirv64-amd-amdhsa` and `amdgcn-amd-amdhsa`, respectively)
- in order to retain user provided compiler flags and have them
available at JIT time, we rely on embedding the command line via
`-fembed-bitcode=marker`, which the bitcode writer had previously not
implemented for SPIRV; we only allow it conditionally for AMDGCN
flavoured SPIRV, and it is handled correctly by the Translator (it ends
up as a string literal)

Once the SPIRV BE is no longer experimental we'll switch to using that
rather than the translator. There's some additional work that'll come
via a separate PR around correctly piping through AMDGCN's
implementation of `printf`, for now we merely handle its flags
correctly.

show more ...


Revision tags: llvmorg-18.1.8
# 1ca0055f 06-Jun-2024 Shilei Tian <i@tianshilei.me>

[AMDGPU] Add a new target gfx1152 (#94534)


Revision tags: llvmorg-18.1.7
# 8407779b 05-Jun-2024 Andrey Portnoy <aportnoy@nvidia.com>

[CUDA] Mark CUDA-12.5 as supported and introduce ptx 8.5. (#94113)

This PR is based on https://github.com/llvm/llvm-project/pull/91516.


# 2bfa26d3 05-Jun-2024 Konstantin Zhuravlyov <kzhuravl_dev@outlook.com>

AMDGPU: Add missing gfx* generic targets handling in clang (NVPTX, OpenMP runtime) (#94483)


Revision tags: llvmorg-18.1.6
# 2f956a35 08-May-2024 Artem Belevich <tra@google.com>

[CUDA] Mark CUDA-12.4 as supported and introduce ptx 8.4. (#91516)


Revision tags: llvmorg-18.1.5, llvmorg-18.1.4
# 9e7aab95 16-Apr-2024 Joseph Huber <huberjn@outlook.com>

[CUDA] Rename SM_32 to SM_32_ to work around AIX headers (#88779)

Summary:
AIX headers define this, so we need to work around it. In the future
this will be removed but for now we should just rename

[CUDA] Rename SM_32 to SM_32_ to work around AIX headers (#88779)

Summary:
AIX headers define this, so we need to work around it. In the future
this will be removed but for now we should just rename it to avoid these
issues.

show more ...


Revision tags: llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# 631c6e83 11-Dec-2023 Artem Belevich <tra@google.com>

[CUDA] Add support for CUDA-12.3 and sm_90a (#74895)


Revision tags: llvmorg-17.0.6
# cf1e0c0b 23-Nov-2023 Jay Foad <jay.foad@amd.com>

[AMDGPU] Define new targets gfx1200 and gfx1201 (#73133)

Define target names and ELF numbers for new GFX12 targets gfx1200 and
gfx1201. For now they behave identically to GFX11.


Revision tags: llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init
# 92542f2a 17-Jul-2023 Jay Foad <jay.foad@amd.com>

[AMDGPU] Add targets gfx1150 and gfx1151

This is the target definition only. Currently they are treated the same
as GFX 11.0.x.

Differential Revision: https://reviews.llvm.org/D155429


Revision tags: llvmorg-16.0.6
# 0f49116e 02-Jun-2023 Artem Belevich <tra@google.com>

[CUDA] Update Kepler(sm_3*) support info.

sm_30 and sm_32 were removed in cuda-11.0
sm_35 and sm_37 were removed in cuda-12.0

Differential Revision: https://reviews.llvm.org/D152027


Revision tags: llvmorg-16.0.5
# ffb635cb 24-May-2023 Artem Belevich <tra@google.com>

[CUDA] bump supported CUDA version to 12.1/11.8

Differential Revision: https://reviews.llvm.org/D151361


Revision tags: llvmorg-16.0.4
# 9d057279 10-May-2023 Konstantin Zhuravlyov <kzhuravl_dev@outlook.com>

AMDGPU: Add basic gfx942 target

Differential Revision: https://reviews.llvm.org/D149983


# 1fc70210 10-May-2023 Konstantin Zhuravlyov <kzhuravl_dev@outlook.com>

AMDGPU: Add basic gfx941 target

Differential Revision: https://reviews.llvm.org/D149982


Revision tags: llvmorg-16.0.3, llvmorg-16.0.2, llvmorg-16.0.1, llvmorg-16.0.0, llvmorg-16.0.0-rc4, llvmorg-16.0.0-rc3, llvmorg-16.0.0-rc2, llvmorg-16.0.0-rc1, llvmorg-17-init, llvmorg-15.0.7
# 7e937d08 15-Dec-2022 Kazu Hirata <kazu@google.com>

Don't include StringSwitch (NFC)

These files do not use llvm::StringSwitch.


Revision tags: llvmorg-15.0.6, llvmorg-15.0.5, llvmorg-15.0.4, llvmorg-15.0.3, working, llvmorg-15.0.2, llvmorg-15.0.1, llvmorg-15.0.0, llvmorg-15.0.0-rc3
# 9a01cca6 10-Aug-2022 Artem Belevich <tra@google.com>

Add support for CUDA-11.8 and sm_{87,89,90} GPUs.

Differential Revision: https://reviews.llvm.org/D135306


# f3a2cbcf 05-Oct-2022 Artem Belevich <tra@google.com>

Refactored CUDA version housekeeping to use less boilerplate.

Differential Revision: https://reviews.llvm.org/D135328


Revision tags: llvmorg-15.0.0-rc2, llvmorg-15.0.0-rc1, llvmorg-16-init, llvmorg-14.0.6
# 06decd0b 19-Jun-2022 Kazu Hirata <kazu@google.com>

[clang] Use value_or instead of getValueOr (NFC)


Revision tags: llvmorg-14.0.5, llvmorg-14.0.4, llvmorg-14.0.3, llvmorg-14.0.2
# 8bdfc73f 13-Apr-2022 Joe Nash <Joseph.Nash@amd.com>

[AMDGPU][clang] Definition of gfx11 subtarget

Contributors:
Jay Foad <jay.foad@amd.com>
Konstantin Zhuravlyov <kzhuravl_dev@outlook.com>

Patch 2/N for upstreaming of AMDGPU gfx11 architecture

Depe

[AMDGPU][clang] Definition of gfx11 subtarget

Contributors:
Jay Foad <jay.foad@amd.com>
Konstantin Zhuravlyov <kzhuravl_dev@outlook.com>

Patch 2/N for upstreaming of AMDGPU gfx11 architecture

Depends on D124536

Reviewed By: foad, kzhuravl, #amdgpu, arsenm

Differential Revision: https://reviews.llvm.org/D124537

show more ...


123