History log of /llvm-project/llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-pown.ll (Results 1 – 14 of 14)
Revision  Date  Author  Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4
# 38fffa63 06-Nov-2024 Paul Walker <paul.walker@arm.com>

[LLVM][IR] Use splat syntax when printing Constant[Data]Vector. (#112548)


Revision tags: llvmorg-19.1.3, llvmorg-19.1.2, llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4
# a1058776 21-Aug-2024 Nikita Popov <npopov@redhat.com>

[InstCombine] Remove some of the complexity-based canonicalization (#91185)

The idea behind this canonicalization is that it allows us to handle fewer
patterns, because we know that some will be canonicalized away. This is
indeed very useful, e.g. for knowing that constants are always on the right.

However, this is only useful if the canonicalization is actually
reliable. This is the case for constants, but not for arguments: Moving
these to the right makes it look like the "more complex" expression is
guaranteed to be on the left, but this is not actually the case in
practice. It fails as soon as you replace the argument with another
instruction.

The end result is that it looks like things correctly work in tests,
while they actually don't. We use the "thwart complexity-based
canonicalization" trick to handle this in tests, but it's often a
challenge for new contributors to get this right, and based on the
regressions this PR originally exposed, we clearly don't get this right
in many cases.

For this reason, I think that it's better to remove this complexity
canonicalization. It will make it much easier to write tests for
commuted cases and make sure that they are handled.
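
For context, handling commuted operand orders without relying on complexity-based
canonicalization typically means using the commutative matchers from
llvm/IR/PatternMatch.h. The sketch below is only illustrative (the fold and the
helper name are hypothetical), but m_c_Add is the standard way to match either
operand order:

    #include "llvm/IR/PatternMatch.h"
    using namespace llvm;
    using namespace llvm::PatternMatch;

    // Hypothetical fold: recognize (A * B) + C with the mul on either side.
    // Without complexity-based canonicalization, the mul is no longer
    // guaranteed to end up on the left, so a commutative matcher is used
    // instead of relying on a fixed operand order.
    static bool matchMulAdd(Value *V, Value *&A, Value *&B, Value *&C) {
      return match(V, m_c_Add(m_Mul(m_Value(A), m_Value(B)), m_Value(C)));
    }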


Revision tags: llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init
# bff619f9 01-Jul-2024 Matt Arsenault <Matthew.Arsenault@amd.com>

Revert "AMDGPU: Use real copysign in fast pow (#97152)"

This reverts commit d3e7c4ce7a3d7f08cea02cba8f34c590a349688b.


# d3e7c4ce 01-Jul-2024 Matt Arsenault <Matthew.Arsenault@amd.com>

AMDGPU: Use real copysign in fast pow (#97152)

Previously this would introduce some codegen regressions, but
those have been avoided by simplifying demanded bits on copysign
operations.
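
As a rough illustration of the change in direction (not the actual patch), a
"real copysign" here means emitting the llvm.copysign intrinsic through
IRBuilder rather than hand-rolled sign-bit arithmetic; the helper below is
hypothetical:

    #include "llvm/IR/IRBuilder.h"
    using namespace llvm;

    // Illustrative only: apply a previously computed sign value to the
    // magnitude produced by the pow expansion. CreateCopySign emits a call
    // to the llvm.copysign intrinsic.
    static Value *applySign(IRBuilder<> &B, Value *Mag, Value *Sign) {
      return B.CreateCopySign(Mag, Sign);
    }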


# b932da16 18-Jun-2024 Matt Arsenault <Matthew.Arsenault@amd.com>

AMDGPU: Fix vector handling in pown libcall simplification (#95832)

The isIntegerTy check would not work as you would hope in
the vector case.
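
For reference, Type::isIntegerTy() returns false for a vector of integers such
as <2 x i32>, which is the kind of misfire described above; checking the scalar
type covers both the scalar and the vector case. A minimal sketch (the helper
name is hypothetical, not the patch itself):

    #include "llvm/IR/Type.h"
    using namespace llvm;

    // Illustrative check for pown's exponent type: accept both i32 and
    // <N x i32>. A plain isIntegerTy(32) would reject the vector case,
    // because vector types are not themselves integer types.
    static bool isPownExponentTy(Type *ExpTy) {
      return ExpTy->getScalarType()->isIntegerTy(32);
    }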


Revision tags: llvmorg-18.1.8, llvmorg-18.1.7, llvmorg-18.1.6
# 3aae916f 14-May-2024 Yingwei Zheng <dtcxzyw2333@gmail.com>

Reland "[ValueTracking] Compute knownbits from known fp classes" (#92084)

This patch relands https://github.com/llvm/llvm-project/pull/86409.

I mistakenly thought that `Known.makeNegative()` clea

Reland "[ValueTracking] Compute knownbits from known fp classes" (#92084)

This patch relands https://github.com/llvm/llvm-project/pull/86409.

I mistakenly thought that `Known.makeNegative()` clears the sign bit of
`Known.Zero`. This patch fixes the assertion failure by explicitly
clearing the sign bit.
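
For context, an illustrative sketch of the issue (not the actual fix):
KnownBits::makeNegative() sets the sign bit in Known.One, so if the sign bit of
Known.Zero is already set the two conflict and downstream assertions can fire;
clearing the Known.Zero sign bit first avoids that.

    #include "llvm/Support/KnownBits.h"
    #include <cassert>
    using namespace llvm;

    // Illustrative only: record that a value is known to be negative
    // without creating a Zero/One conflict on the sign bit.
    static void markKnownNegative(KnownBits &Known) {
      Known.Zero.clearSignBit(); // drop any stale "sign bit is zero" fact
      Known.makeNegative();      // sets the sign bit in Known.One
      assert(!Known.hasConflict() && "sign bit must not be both 0 and 1");
    }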


# 2e165a2c 14-May-2024 Martin Storsjö <martin@martin.st>

Revert "[ValueTracking] Compute knownbits from known fp classes (#86409)"

This reverts commit d03a1a6e5838c7c2c0836d71507dfdf7840ade49.

This change caused failed assertions, see
https://github.com/

Revert "[ValueTracking] Compute knownbits from known fp classes (#86409)"

This reverts commit d03a1a6e5838c7c2c0836d71507dfdf7840ade49.

This change caused failed assertions, see
https://github.com/llvm/llvm-project/pull/86409#issuecomment-2109469845
for details.


# d03a1a6e 13-May-2024 Yingwei Zheng <dtcxzyw2333@gmail.com>

[ValueTracking] Compute knownbits from known fp classes (#86409)

This patch calculates knownbits from fp instructions/dominating fcmp
conditions. It will enable more optimizations with signbit idioms.


Revision tags: llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3
# f5296df9 27-Mar-2024 Kevin P. Neal <52762977+kpneal@users.noreply.github.com>

[FPEnv][AMDGPU] Correct AMDGPUSimplifyLibCalls handling of strictfp attribute. (#86705)

The AMDGPUSimplifyLibCalls pass was lowering function calls with the
strictfp attribute to sequences that included function calls incorrectly
lacking the attribute. This patch corrects that.

The pass now also emits the correct constrained fp call instead of
normal FP instructions when in a function with the strictfp attribute.
Replacing non-constrained calls with constrained calls when required
is still on the IRBuilder's TODO list.
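
As a rough sketch of the IRBuilder behavior referred to above (illustrative,
hypothetical helper name): once a builder is put into FP-constrained mode, the
ordinary creation calls emit the llvm.experimental.constrained.* intrinsics
instead of plain FP instructions.

    #include "llvm/IR/IRBuilder.h"
    using namespace llvm;

    // Illustrative only: inside a strictfp function the builder should be
    // switched into constrained mode, so CreateFMul emits
    // llvm.experimental.constrained.fmul rather than a plain fmul.
    static Value *emitStrictMul(IRBuilder<> &B, Value *L, Value *R) {
      B.setIsFPConstrained(true);
      return B.CreateFMul(L, R);
    }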


Revision tags: llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# 32f9983c 15-Dec-2023 Jessica Del <50999226+OutOfCache@users.noreply.github.com>

[AMDGPU] - Add address space for strided buffers (#74471)

This is an experimental address space for strided buffers. These buffers
can have structs as elements and a stride > 1. These pointers allow
indexed access in units of the stride, i.e., they point at
`buffer[index * stride]`. Thus, we can use the `idxen` modifier for
buffer loads.

We assign address space 9 to 192-bit buffer pointers which contain a
128-bit descriptor, a 32-bit offset and a 32-bit index. Essentially,
they are fat buffer pointers with an additional 32-bit index.
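
Conceptually, the 192-bit address space 9 pointer described above can be
pictured with the following layout (a plain C++ illustration of the fields,
not an actual compiler data structure):

    #include <cstdint>

    // Conceptual layout of a 192-bit "strided buffer" pointer (address
    // space 9): a 128-bit buffer resource descriptor plus a 32-bit offset
    // and a 32-bit index. The index addresses elements in units of the
    // stride, i.e. buffer[index * stride].
    struct StridedBufferPointer {
      uint64_t Descriptor[2]; // 128-bit buffer descriptor
      uint32_t Offset;        // 32-bit offset
      uint32_t Index;         // 32-bit index, scaled by the stride
    };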


Revision tags: llvmorg-17.0.6, llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4
# deefda70 26-Aug-2023 Matt Arsenault <Matthew.Arsenault@amd.com>

AMDGPU: Use exp2 and log2 intrinsics directly for f16/f32

These codegen correctly, but f64 doesn't. Using the intrinsics directly
prevents losing fast math flags on the way to the underlying intrinsic.

https://reviews.llvm.org/D158997
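
As a hedged sketch of the kind of expansion involved (illustrative, not the
actual code): for f16/f32, pow-style calls can be lowered to
exp2(y * log2(x)), and passing the original call as the FMF source keeps its
fast math flags on the emitted intrinsics.

    #include "llvm/IR/IRBuilder.h"
    #include "llvm/IR/Intrinsics.h"
    using namespace llvm;

    // Illustrative expansion (sign handling omitted):
    //   pow(X, Y) ~= exp2(Y * log2(X))
    // Fast math flags are copied from FMFSource so they are not lost on
    // the way to the underlying intrinsics.
    static Value *expandPow(IRBuilder<> &B, Value *X, Value *Y,
                            Instruction *FMFSource) {
      Value *Log2X = B.CreateUnaryIntrinsic(Intrinsic::log2, X, FMFSource);
      Value *Mul = B.CreateFMulFMF(Y, Log2X, FMFSource);
      return B.CreateUnaryIntrinsic(Intrinsic::exp2, Mul, FMFSource);
    }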


# f5d8a9b1 25-Aug-2023 Matt Arsenault <Matthew.Arsenault@amd.com>

AMDGPU: Simplify handling of constant vectors in libcalls

Also fixes not handling the partially undef case.

https://reviews.llvm.org/D158905


# afb24cbb 25-Aug-2023 Matt Arsenault <Matthew.Arsenault@amd.com>

AMDGPU: Don't require all flags to expand fast powr

This was requiring all fast math flags, which is practically
useless: it wouldn't even fire with the standard set of OpenCL fast math
flags. This only needs afn, nnan, and ninf.

https://reviews.llvm.org/D158904
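
As an illustrative way to express that requirement (hypothetical helper, not
the actual patch): instead of demanding FastMathFlags::isFast(), the expansion
only needs the approximate-functions, no-NaNs, and no-infs flags.

    #include "llvm/IR/FMF.h"
    using namespace llvm;

    // Illustrative only: the fast powr expansion needs afn + nnan + ninf,
    // not the full set of flags that isFast() would require.
    static bool canExpandFastPowr(FastMathFlags FMF) {
      return FMF.approxFunc() && FMF.noNaNs() && FMF.noInfs();
    }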


Revision tags: llvmorg-17.0.0-rc3
# aa539b12 20-Aug-2023 Matt Arsenault <Matthew.Arsenault@amd.com>

AMDGPU: Add baseline tests for libcall recognition of pow/powr/pown