amdgpu-codegenprepare-idiv.ll - OpenGrok history log for /llvm-project/llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-idiv.ll

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6
# 463e93b9	12-Dec-2024	choikwa <5455710+choikwa@users.noreply.github.com>	Reapply [AMDGPU] prevent shrinking udiv/urem if either operand exceeds signed max (#119325) This reverts commit 254d206ee2a337cb38ba347c896f7c6a14c7f218. +Added a fix in ExpandDivRem24 to disqual Reapply [AMDGPU] prevent shrinking udiv/urem if either operand exceeds signed max (#119325) This reverts commit 254d206ee2a337cb38ba347c896f7c6a14c7f218. +Added a fix in ExpandDivRem24 to disqualify if DivNumBits exceed 24. Original commit & msg: ce6e955ac374f2b86cbbb73b2f32174dffd85f25. Handle signed and unsigned path differently in getDivNumBits. Using computeKnownBits, this rejects shrinking unsigned div/rem if operands exceed signed max since we know NumSignBits will be always 0. show more ...
# 254d206e	09-Dec-2024	Joseph Huber <huberjn@outlook.com>	Revert "Reapply "[AMDGPU] prevent shrinking udiv/urem if either operand is in… (#118928)" This reverts commit 509893b58ff444a6f080946bd368e9bde7668f13. This broke the libc build again https://lab.l Revert "Reapply "[AMDGPU] prevent shrinking udiv/urem if either operand is in… (#118928)" This reverts commit 509893b58ff444a6f080946bd368e9bde7668f13. This broke the libc build again https://lab.llvm.org/buildbot/#/builders/73/builds/9787. show more ...
# 509893b5	07-Dec-2024	choikwa <5455710+choikwa@users.noreply.github.com>	Reapply "[AMDGPU] prevent shrinking udiv/urem if either operand is in… (#118928) … (SignedMax,UnsignedMax] (#116733)" This reverts commit 905e831f8c8341e53e7e3adc57fd20b8e08eb999. Handle signe Reapply "[AMDGPU] prevent shrinking udiv/urem if either operand is in… (#118928) … (SignedMax,UnsignedMax] (#116733)" This reverts commit 905e831f8c8341e53e7e3adc57fd20b8e08eb999. Handle signed and unsigned path differently in getDivNumBits. Using computeKnownBits, this rejects shrinking unsigned div/rem if operands exceed signed max since we know NumSignBits will be always 0. Rebased and re-attempt after first one was reverted due to unrelated failure in LibC (should be fixed by now I'm told). show more ...
Revision tags: llvmorg-19.1.5
# 905e831f	21-Nov-2024	Joseph Huber <huberjn@outlook.com>	Revert "[AMDGPU] prevent shrinking udiv/urem if either operand is in (SignedMax,UnsignedMax] (#116733)" This reverts commit b8e1d4dbea8905e48d51a70bf75cb8fababa4a60. Causes failures on the `libc` t Revert "[AMDGPU] prevent shrinking udiv/urem if either operand is in (SignedMax,UnsignedMax] (#116733)" This reverts commit b8e1d4dbea8905e48d51a70bf75cb8fababa4a60. Causes failures on the `libc` test suite https://lab.llvm.org/buildbot/#/builders/73/builds/8871 show more ...
# b8e1d4db	20-Nov-2024	choikwa <5455710+choikwa@users.noreply.github.com>	[AMDGPU] prevent shrinking udiv/urem if either operand is in (SignedMax,UnsignedMax] (#116733) Do this by using ComputeKnownBits and checking for !isNonNegative and isUnsigned. This rejects shrinki [AMDGPU] prevent shrinking udiv/urem if either operand is in (SignedMax,UnsignedMax] (#116733) Do this by using ComputeKnownBits and checking for !isNonNegative and isUnsigned. This rejects shrinking unsigned div/rem if operands exceed smax_bitwidth since we know NumSignBits will be always 0. show more ...
Revision tags: llvmorg-19.1.4
# 6548b635	09-Nov-2024	Shilei Tian <i@tianshilei.me>	Reapply "[AMDGPU] Still set up the two SGPRs for queue ptr even it is COV5 (#112403)" This reverts commit ca33649abe5fad93c57afef54e43ed9b3249cd86.
# ca33649a	08-Nov-2024	Shilei Tian <i@tianshilei.me>	Revert "[AMDGPU] Still set up the two SGPRs for queue ptr even it is COV5 (#112403)" This reverts commit e215a1e27d84adad2635a52393621eb4fa439dc9 as it broke both hip and openmp buildbots.
# e215a1e2	08-Nov-2024	Shilei Tian <i@tianshilei.me>	[AMDGPU] Still set up the two SGPRs for queue ptr even it is COV5 (#112403)
# 38fffa63	06-Nov-2024	Paul Walker <paul.walker@arm.com>	[LLVM][IR] Use splat syntax when printing Constant[Data]Vector. (#112548)
Revision tags: llvmorg-19.1.3, llvmorg-19.1.2, llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3
# 7389545d	12-Aug-2024	Pierre van Houtryve <pierre.vanhoutryve@amd.com>	Reapply "[AMDGPU] Always lower s/udiv64 by constant to MUL" (#101942) Reland #100723, fixing the ARM issue at the cost of a small loss of optimization in `test/CodeGen/AMDGPU/fshr.ll` Solves #100 Reapply "[AMDGPU] Always lower s/udiv64 by constant to MUL" (#101942) Reland #100723, fixing the ARM issue at the cost of a small loss of optimization in `test/CodeGen/AMDGPU/fshr.ll` Solves #100383 show more ...
Revision tags: llvmorg-19.1.0-rc2
# 0b92e70d	02-Aug-2024	Fangrui Song <i@maskray.me>	Revert "[AMDGPU] Always lower s/udiv64 by constant to MUL (#100723)" This reverts commit 92fbc963a51683d32f70d0c7f3783bb13983f08d. The patch also affected ARM and caused an assertion failure during Revert "[AMDGPU] Always lower s/udiv64 by constant to MUL (#100723)" This reverts commit 92fbc963a51683d32f70d0c7f3783bb13983f08d. The patch also affected ARM and caused an assertion failure during CurDAG->Legalize (https://github.com/llvm/llvm-project/pull/100723#issuecomment-2266154211). show more ...
# 92fbc963	02-Aug-2024	Pierre van Houtryve <pierre.vanhoutryve@amd.com>	[AMDGPU] Always lower s/udiv64 by constant to MUL (#100723) Solves #100383
Revision tags: llvmorg-19.1.0-rc1, llvmorg-20-init
# 229e1185	23-Jul-2024	Christudasan Devadasan <christudasan.devadasan@amd.com>	[AMDGPU] Codegen support for constrained multi-dword sloads (#96163) For targets that support xnack replay feature (gfx8+), the multi-dword scalar loads shouldn't clobber any register that holds the [AMDGPU] Codegen support for constrained multi-dword sloads (#96163) For targets that support xnack replay feature (gfx8+), the multi-dword scalar loads shouldn't clobber any register that holds the src address. The constrained version of the scalar loads have the early clobber flag attached to the dst operand to restrict RA from re-allocating any of the src regs for its dst operand. show more ...
# a1d7da05	23-Jul-2024	Christudasan Devadasan <christudasan.devadasan@amd.com>	[AMDGPU][SILoadStoreOptimizer] Merge constrained sloads (#96162) Consider the constrained multi-dword loads while merging individual loads to a single multi-dword load.
# b1bcb7ca	15-Jul-2024	Matt Arsenault <Matthew.Arsenault@amd.com>	Reapply "AMDGPU: Move attributor into optimization pipeline (#83131)" and follow up commit "clang/AMDGPU: Defeat attribute optimization in attribute test" (#98851) This reverts commit adaff46d087799 Reapply "AMDGPU: Move attributor into optimization pipeline (#83131)" and follow up commit "clang/AMDGPU: Defeat attribute optimization in attribute test" (#98851) This reverts commit adaff46d087799072438dd744b038e6fd50a2d78. Drop the -O3 checks from default-attributes.hip. I don't know why they are different on some bots but reverting this is far too disruptive. show more ...
# adaff46d	15-Jul-2024	dyung <douglas.yung@sony.com>	Revert "AMDGPU: Move attributor into optimization pipeline (#83131)" and follow up commit "clang/AMDGPU: Defeat attribute optimization in attribute test" (#98851) This reverts commits 677cc15e0ff2e0 Revert "AMDGPU: Move attributor into optimization pipeline (#83131)" and follow up commit "clang/AMDGPU: Defeat attribute optimization in attribute test" (#98851) This reverts commits 677cc15e0ff2e0e6aa30538eb187990a6a8f53c0 and 78bc1b64a6dc3fb6191355a5e1b502be8b3668e7. The test CodeGenHIP/default-attributes.hip is failing on multiple bots even after the attempted fix including the following: - https://lab.llvm.org/buildbot/#/builders/3/builds/1473 - https://lab.llvm.org/buildbot/#/builders/65/builds/1380 - https://lab.llvm.org/buildbot/#/builders/161/builds/595 - https://lab.llvm.org/buildbot/#/builders/154/builds/1372 - https://lab.llvm.org/buildbot/#/builders/133/builds/1547 - https://lab.llvm.org/buildbot/#/builders/81/builds/755 - https://lab.llvm.org/buildbot/#/builders/40/builds/570 - https://lab.llvm.org/buildbot/#/builders/13/builds/748 - https://lab.llvm.org/buildbot/#/builders/12/builds/1845 - https://lab.llvm.org/buildbot/#/builders/11/builds/1695 - https://lab.llvm.org/buildbot/#/builders/190/builds/1829 - https://lab.llvm.org/buildbot/#/builders/193/builds/962 - https://lab.llvm.org/buildbot/#/builders/23/builds/991 - https://lab.llvm.org/buildbot/#/builders/144/builds/2256 - https://lab.llvm.org/buildbot/#/builders/46/builds/1614 These bots have been broken for a day, so reverting to get everything back to green. show more ...
# 78bc1b64	14-Jul-2024	Matt Arsenault <Matthew.Arsenault@amd.com>	AMDGPU: Move attributor into optimization pipeline (#83131) Removing it from the codegen pipeline induces a lot of test churn because llc is no longer optimizing out implicit arguments to kernels. AMDGPU: Move attributor into optimization pipeline (#83131) Removing it from the codegen pipeline induces a lot of test churn because llc is no longer optimizing out implicit arguments to kernels. Mostly mechanical, but there are some creative test updates. I preferred to take the changes as-is in tests where the ABI isn't relevant. In cases where it's more relevant, or the optimize out logic was too ingrained in the test, I pre-run the optimization. Some cases manually add attributes to disable inputs. show more ...
Revision tags: llvmorg-18.1.8
# af3ffff3	07-Jun-2024	Simon Pilgrim <llvm-dev@redking.me.uk>	[DAG] Always allow folding XOR patterns to ABS pre-legalization (#94601) Removes residual ARM handling for vXi64 ABS nodes to prevent infinite loops.
Revision tags: llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3
# 0a43ca73	27-Mar-2024	Shilei Tian <i@tianshilei.me>	[AMDGPU] Fix missing `IsExact` flag when expanding vector binary operator (#86712)
Revision tags: llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init, llvmorg-17.0.6, llvmorg-17.0.5
# 51d4ad67	31-Oct-2023	Simon Pilgrim <llvm-dev@redking.me.uk>	[AMDGPU] amdgpu-codegenprepare-idiv.ll - regenerate checks. NFC. Reduces diffs in a future patch
Revision tags: llvmorg-17.0.4
# fe8335ba	30-Oct-2023	Stanislav Mekhanoshin <rampitec@users.noreply.github.com>	[AMDGPU] Select 64-bit imm moves if can be encoded as 32 bit operand (#70395) This allows folding of 64-bit operands if fit into 32-bit. Fixes https://github.com/llvm/llvm-project/issues/67781
# e39f6c18	25-Oct-2023	Alex Richardson <alexrichardson@google.com>	[opt] Infer DataLayout from triple if not specified There are many tests that specify a target triple/CPU flags but no DataLayout which can lead to IR being generated that has unusual behaviour. Thi [opt] Infer DataLayout from triple if not specified There are many tests that specify a target triple/CPU flags but no DataLayout which can lead to IR being generated that has unusual behaviour. This commit attempts to use the default DataLayout based on the relevant flags if there is no explicit override on the command line or in the IR file. One thing that is not currently possible to differentiate from a missing datalayout `target datalayout = ""` in the IR file since the current APIs don't allow detecting this case. If it is considered useful to support this case (instead of passing "-data-layout=" on the command line), I can change IR parsers to track whether they have seen such a directive and change the callback type. Differential Revision: https://reviews.llvm.org/D141060 show more ...
# 40a426fa	19-Oct-2023	Pierre van Houtryve <pierre.vanhoutryve@amd.com>	[AMDGPU] Constant fold FMAD_FTZ (#69443) Solves #68315
Revision tags: llvmorg-17.0.3
# 7b3bbd83	09-Oct-2023	Jay Foad <jay.foad@amd.com>	Revert "[CodeGen] Really renumber slot indexes before register allocation (#67038)" This reverts commit 2501ae58e3bb9a70d279a56d7b3a0ed70a8a852c. Reverted due to various buildbot failures.
# 2501ae58	09-Oct-2023	Jay Foad <jay.foad@amd.com>	[CodeGen] Really renumber slot indexes before register allocation (#67038) PR #66334 tried to renumber slot indexes before register allocation, but the numbering was still affected by list entries [CodeGen] Really renumber slot indexes before register allocation (#67038) PR #66334 tried to renumber slot indexes before register allocation, but the numbering was still affected by list entries for instructions which had been erased. Fix this to make the register allocator's live range length heuristics even less dependent on the history of how instructions have been added to and removed from SlotIndexes's maps. show more ...
12 3 4