Revision tags: llvmorg-21-init |
|
#
97c3a990 |
| 22-Jan-2025 |
Sergey Kozub <skozub@nvidia.com> |
Remove incorrect CUDA defines (#123898)
Remove CUDA_127 and CUDA_129 defines incorrectly added in https://github.com/llvm/llvm-project/pull/123398
|
#
616979eb |
| 21-Jan-2025 |
Sergey Kozub <skozub@nvidia.com> |
[NVPTX] Add support for PTX 8.6 and CUDA 12.6 (12.8) (#123398)
Add CUDA versions 12.7, 12.8, 12.9 which support PTX 8.6+ (enables using Blackwell-specific instructions).
|
Revision tags: llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4 |
|
#
a6fc489b |
| 18-Nov-2024 |
Matt Arsenault <Matthew.Arsenault@amd.com> |
AMDGPU: Add gfx950 subtarget definitions (#116307)
Mostly a stub, but adds some baseline tests and tests for removed instructions.
|
#
de0fd64b |
| 13-Nov-2024 |
Shilei Tian <i@tianshilei.me> |
[AMDGPU] Introduce a new generic target `gfx9-4-generic` (#115190)
This patch introduces a new generic target, `gfx9-4-generic`. Since it doesn’t support FP8 and XF32-related instructions, the patch includes several code reorganizations to accommodate these changes.
|
Revision tags: llvmorg-19.1.3 |
|
#
076aac59 |
| 23-Oct-2024 |
Carl Ritson <carl.ritson@amd.com> |
[AMDGPU] Add a new target for gfx1153 (#113138)
|
Revision tags: llvmorg-19.1.2 |
|
#
30a06e80 |
| 14-Oct-2024 |
Artem Belevich <tra@google.com> |
[CUDA] Add support for CUDA-12.6 and sm_100 (#112028)
This is a copy of #97402 (with minor updates), which is now ready to land.
---------
Co-authored-by: Sergey Kozub <skozub@nvidia.com>
|
Revision tags: llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init |
|
#
ab200864 |
| 30-Jun-2024 |
Jakub Chlanda <jakub@codeplay.com> |
[CUDA][NFC] CudaArch to OffloadArch rename (#97028)
Rename `CudaArch` to `OffloadArch` to better reflect its content and
use.
Apply a similar rename to helpers handling the enum.
|
#
9acb533c |
| 25-Jun-2024 |
Alex Voicu <alexandru.voicu@amd.com> |
[clang][Driver] Add HIPAMD Driver support for AMDGCN flavoured SPIR-V (#95061)
This patch augments the HIPAMD driver to allow it to target AMDGCN
flavoured SPIR-V compilation. It's mostly straightforward, as we re-use
some of the existing SPIRV infra, however there are a few notable
additions:
- we introduce an `amdgcnspirv` offload arch, rather than relying on
using `generic` (this is already fairly overloaded) or simply using
`spirv` or `spirv64` (we'll want to use these to denote unflavoured
SPIRV, once we bring up that capability)
- initially it won't be possible to mix-in SPIR-V and concrete AMDGPU
targets, as it would require some relatively intrusive surgery in the
HIPAMD Toolchain and the Driver to deal with two triples
(`spirv64-amd-amdhsa` and `amdgcn-amd-amdhsa`, respectively)
- in order to retain user provided compiler flags and have them
available at JIT time, we rely on embedding the command line via
`-fembed-bitcode=marker`, which the bitcode writer had previously not
implemented for SPIRV; we only allow it conditionally for AMDGCN
flavoured SPIRV, and it is handled correctly by the Translator (it ends
up as a string literal)
Once the SPIRV BE is no longer experimental we'll switch to using that
rather than the translator. There's some additional work that'll come
via a separate PR around correctly piping through AMDGCN's
implementation of `printf`; for now we merely handle its flags
correctly.
|
Revision tags: llvmorg-18.1.8 |
|
#
1ca0055f |
| 06-Jun-2024 |
Shilei Tian <i@tianshilei.me> |
[AMDGPU] Add a new target gfx1152 (#94534)
|
Revision tags: llvmorg-18.1.7 |
|
#
8407779b |
| 05-Jun-2024 |
Andrey Portnoy <aportnoy@nvidia.com> |
[CUDA] Mark CUDA-12.5 as supported and introduce ptx 8.5. (#94113)
This PR is based on https://github.com/llvm/llvm-project/pull/91516.
|
#
2bfa26d3 |
| 05-Jun-2024 |
Konstantin Zhuravlyov <kzhuravl_dev@outlook.com> |
AMDGPU: Add missing gfx* generic targets handling in clang (NVPTX, OpenMP runtime) (#94483)
|
Revision tags: llvmorg-18.1.6 |
|
#
2f956a35 |
| 08-May-2024 |
Artem Belevich <tra@google.com> |
[CUDA] Mark CUDA-12.4 as supported and introduce ptx 8.4. (#91516)
|
Revision tags: llvmorg-18.1.5, llvmorg-18.1.4 |
|
#
9e7aab95 |
| 16-Apr-2024 |
Joseph Huber <huberjn@outlook.com> |
[CUDA] Rename SM_32 to SM_32_ to work around AIX headers (#88779)
Summary: AIX headers define this, so we need to work around it. In the future this will be removed but for now we should just rename it to avoid these issues.
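The collision described in this entry can be illustrated with a minimal sketch. Everything named here is hypothetical: the `#define` stands in for whatever the AIX headers actually provide (its value is made up), and `Arch` and `archName` are illustrative names rather than the real clang declarations.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical stand-in for a platform header that defines SM_32 as a macro.
// The real AIX definition is not shown in this log; the value is made up.
#define SM_32 32

namespace sketch {

// Illustrative enum, not the real clang declaration. An enumerator spelled
// SM_32 would be macro-expanded to `32` here and fail to compile, so the
// enumerator carries a trailing underscore instead.
enum class Arch { Unknown, SM_30, SM_32_, SM_35 };

// In this sketch the printed architecture name keeps its usual spelling;
// only the C++ identifier changes.
inline const char *archName(Arch A) {
  switch (A) {
  case Arch::SM_30:
    return "sm_30";
  case Arch::SM_32_:
    return "sm_32";
  case Arch::SM_35:
    return "sm_35";
  case Arch::Unknown:
    break;
  }
  return "unknown";
}

} // namespace sketch
```

Renaming the enumerator, rather than using `#undef`, leaves the platform macro intact for any other code that relies on it.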
|
Revision tags: llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init |
|
#
631c6e83 |
| 11-Dec-2023 |
Artem Belevich <tra@google.com> |
[CUDA] Add support for CUDA-12.3 and sm_90a (#74895)
|
Revision tags: llvmorg-17.0.6 |
|
#
cf1e0c0b |
| 23-Nov-2023 |
Jay Foad <jay.foad@amd.com> |
[AMDGPU] Define new targets gfx1200 and gfx1201 (#73133)
Define target names and ELF numbers for new GFX12 targets gfx1200 and
gfx1201. For now they behave identically to GFX11.
|
Revision tags: llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init |
|
#
92542f2a |
| 17-Jul-2023 |
Jay Foad <jay.foad@amd.com> |
[AMDGPU] Add targets gfx1150 and gfx1151
This is the target definition only. Currently they are treated the same as GFX 11.0.x.
Differential Revision: https://reviews.llvm.org/D155429
|
Revision tags: llvmorg-16.0.6 |
|
#
0f49116e |
| 02-Jun-2023 |
Artem Belevich <tra@google.com> |
[CUDA] Update Kepler(sm_3*) support info.
sm_30 and sm_32 were removed in cuda-11.0; sm_35 and sm_37 were removed in cuda-12.0.
Differential Revision: https://reviews.llvm.org/D152027
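As a rough sketch of the support info stated in this entry, the removal points can be encoded as a small lookup. The names `KeplerArch`, `removedInCudaMajor`, and `isSupported` are hypothetical and are not the clang implementation, which tracks full version numbers rather than major versions only.

```cpp
namespace kepler_sketch {

// Illustrative subset of Kepler architectures; not the real clang enum.
enum class KeplerArch { Sm30, Sm32, Sm35, Sm37 };

// First CUDA major release that no longer supports the architecture,
// taken from the commit message above: sm_30/sm_32 -> 11, sm_35/sm_37 -> 12.
constexpr int removedInCudaMajor(KeplerArch A) {
  return (A == KeplerArch::Sm30 || A == KeplerArch::Sm32) ? 11 : 12;
}

// An architecture is usable with any CUDA release older than its removal.
constexpr bool isSupported(KeplerArch A, int CudaMajor) {
  return CudaMajor < removedInCudaMajor(A);
}

static_assert(isSupported(KeplerArch::Sm35, 11), "sm_35 is still in CUDA 11.x");
static_assert(!isSupported(KeplerArch::Sm30, 11), "sm_30 is gone in CUDA 11.0");

} // namespace kepler_sketch
```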
|
Revision tags: llvmorg-16.0.5 |
|
#
ffb635cb |
| 24-May-2023 |
Artem Belevich <tra@google.com> |
[CUDA] bump supported CUDA version to 12.1/11.8
Differential Revision: https://reviews.llvm.org/D151361
|
Revision tags: llvmorg-16.0.4 |
|
#
9d057279 |
| 10-May-2023 |
Konstantin Zhuravlyov <kzhuravl_dev@outlook.com> |
AMDGPU: Add basic gfx942 target
Differential Revision: https://reviews.llvm.org/D149983
|
#
1fc70210 |
| 10-May-2023 |
Konstantin Zhuravlyov <kzhuravl_dev@outlook.com> |
AMDGPU: Add basic gfx941 target
Differential Revision: https://reviews.llvm.org/D149982
|
Revision tags: llvmorg-16.0.3, llvmorg-16.0.2, llvmorg-16.0.1, llvmorg-16.0.0, llvmorg-16.0.0-rc4, llvmorg-16.0.0-rc3, llvmorg-16.0.0-rc2, llvmorg-16.0.0-rc1, llvmorg-17-init, llvmorg-15.0.7 |
|
#
7e937d08 |
| 15-Dec-2022 |
Kazu Hirata <kazu@google.com> |
Don't include StringSwitch (NFC)
These files do not use llvm::StringSwitch.
|
Revision tags: llvmorg-15.0.6, llvmorg-15.0.5, llvmorg-15.0.4, llvmorg-15.0.3, working, llvmorg-15.0.2, llvmorg-15.0.1, llvmorg-15.0.0, llvmorg-15.0.0-rc3 |
|
#
9a01cca6 |
| 10-Aug-2022 |
Artem Belevich <tra@google.com> |
Add support for CUDA-11.8 and sm_{87,89,90} GPUs.
Differential Revision: https://reviews.llvm.org/D135306
|
#
f3a2cbcf |
| 05-Oct-2022 |
Artem Belevich <tra@google.com> |
Refactored CUDA version housekeeping to use less boilerplate.
Differential Revision: https://reviews.llvm.org/D135328
|
Revision tags: llvmorg-15.0.0-rc2, llvmorg-15.0.0-rc1, llvmorg-16-init, llvmorg-14.0.6 |
|
#
06decd0b |
| 19-Jun-2022 |
Kazu Hirata <kazu@google.com> |
[clang] Use value_or instead of getValueOr (NFC)
|
Revision tags: llvmorg-14.0.5, llvmorg-14.0.4, llvmorg-14.0.3, llvmorg-14.0.2 |
|
#
8bdfc73f |
| 13-Apr-2022 |
Joe Nash <Joseph.Nash@amd.com> |
[AMDGPU][clang] Definition of gfx11 subtarget
Contributors: Jay Foad <jay.foad@amd.com>, Konstantin Zhuravlyov <kzhuravl_dev@outlook.com>
Patch 2/N for upstreaming of AMDGPU gfx11 architecture
Depends on D124536
Reviewed By: foad, kzhuravl, #amdgpu, arsenm
Differential Revision: https://reviews.llvm.org/D124537
|