amdgpu-simplify-libcall-rootn.ll - OpenGrok history log for /llvm-project/llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-rootn.ll

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4
# 38fffa63	06-Nov-2024	Paul Walker <paul.walker@arm.com>	[LLVM][IR] Use splat syntax when printing Constant[Data]Vector. (#112548)
Revision tags: llvmorg-19.1.3, llvmorg-19.1.2, llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init, llvmorg-18.1.8, llvmorg-18.1.7
# dab1f7c8	21-May-2024	Matt Arsenault <Matthew.Arsenault@amd.com>	AMDGPU: Emit 1/llvm.sqrt(x) instead of rsqrt calls in libcall handling (#92863) With the contract flag we should end up codegening to the rsqrt instruction, or denormal corrected rsqrt sequence pre AMDGPU: Emit 1/llvm.sqrt(x) instead of rsqrt calls in libcall handling (#92863) With the contract flag we should end up codegening to the rsqrt instruction, or denormal corrected rsqrt sequence present in the library. show more ...
# 66b76faf	21-May-2024	Matt Arsenault <Matthew.Arsenault@amd.com>	AMDGPU: Directly emit sqrt intrinsic when folding rootn(x, 2) (#92598) This avoids depending on pre/post link runs. Depends #92595
# 3cb1fe60	20-May-2024	Matt Arsenault <Matthew.Arsenault@amd.com>	AMDGPU: Don't fold rootn(x, 1) to input for strictfp functions (#92595) We need to insert a constrained canonicalize. Depends #92594
# 586ecd75	20-May-2024	Matt Arsenault <Matthew.Arsenault@amd.com>	AMDGPU: Relax vector restriction for rootn libcall folds (#92594) We could try harder for nonsplat vectors but probably not worth the effort.
Revision tags: llvmorg-18.1.6, llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3
# f5296df9	27-Mar-2024	Kevin P. Neal <52762977+kpneal@users.noreply.github.com>	[FPEnv][AMDGPU] Correct AMDGPUSimplifyLibCalls handling of strictfp attribute. (#86705) The AMDGPUSimplifyLibCalls pass was lowering function calls with the strictfp attribute to sequences that inc [FPEnv][AMDGPU] Correct AMDGPUSimplifyLibCalls handling of strictfp attribute. (#86705) The AMDGPUSimplifyLibCalls pass was lowering function calls with the strictfp attribute to sequences that included function calls incorrectly lacking the attribute. This patch corrects that. The pass now also emits the correct constrained fp call instead of normal FP instructions when in a function with the strictfp attribute. Replacing non-constrained calls with constrained calls when required is still on the IRBuilder's TODO list. show more ...
Revision tags: llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# 32f9983c	15-Dec-2023	Jessica Del <50999226+OutOfCache@users.noreply.github.com>	[AMDGPU] - Add address space for strided buffers (#74471) This is an experimental address space for strided buffers. These buffers can have structs as elements and a stride > 1. These pointers al [AMDGPU] - Add address space for strided buffers (#74471) This is an experimental address space for strided buffers. These buffers can have structs as elements and a stride > 1. These pointers allow the indexed access in units of stride, i.e., they point at `buffer[index * stride]`. Thus, we can use the `idxen` modifier for buffer loads. We assign address space 9 to 192-bit buffer pointers which contain a 128-bit descriptor, a 32-bit offset and a 32-bit index. Essentially, they are fat buffer pointers with an additional 32-bit index. show more ...
# 3c86bc0a	02-Dec-2023	Matt Arsenault <Matthew.Arsenault@amd.com>	AMDGPU: Add more tests for rootn libcall handling