RISCVCodeGenPrepare.cpp - OpenGrok history log for /llvm-project/llvm/lib/Target/RISCV/RISCVCodeGenPrepare.cpp

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4
# 82d5dd28	12-Nov-2024	Kazu Hirata <kazu@google.com>	[RISCV] Remove unused includes (NFC) (#115814) Identified with misc-include-cleaner.
Revision tags: llvmorg-19.1.3, llvmorg-19.1.2, llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2
# 87af9ee8	01-Aug-2024	Yeting Kuo <46629943+yetingk@users.noreply.github.com>	[RISCV] Use experimental.vp.splat to splat specific vector length elements. (#101329) Previously, llvm IR is hard to create a scalable vector splat with a specific vector length, so we use riscv.vm [RISCV] Use experimental.vp.splat to splat specific vector length elements. (#101329) Previously, llvm IR is hard to create a scalable vector splat with a specific vector length, so we use riscv.vmv.v.x and riscv.vmv.v.f to do this work. But the two rvv intrinsics needs strict type constraint which can not support fixed vector types and illegal vector types. Using vp.splat could preserve old functionality and also generate more optimized code for vector types and illegal vectors. This patch also fixes crash for getEVT not serving ptr types. show more ...
Revision tags: llvmorg-19.1.0-rc1, llvmorg-20-init
# f0ac8903	17-Jul-2024	Piotr Fusik <p.fusik@samsung.com>	[RISCV][NFC] Fix intrinsic misspelled in a comment (#98998)
# 563ae620	16-Jul-2024	Luke Lau <luke@igalia.com>	[RISCV] Don't expand zero stride vp.strided.load if SEW>XLEN (#98924) A splat of a <n x i64> on RV32 will get lowered as a zero strided load anyway (and won't match any .vx splat patterns), so don' [RISCV] Don't expand zero stride vp.strided.load if SEW>XLEN (#98924) A splat of a <n x i64> on RV32 will get lowered as a zero strided load anyway (and won't match any .vx splat patterns), so don't expand it to a scalar load + splat to avoid writing it to the stack. show more ...
# d5f4f084	15-Jul-2024	Luke Lau <luke@igalia.com>	[RISCV] Always expand zero strided vp.strided.load (#98901) This patch makes zero strided VP loads always be expanded to a scalar load and splat even if +optimized-zero-stride-load is present. E [RISCV] Always expand zero strided vp.strided.load (#98901) This patch makes zero strided VP loads always be expanded to a scalar load and splat even if +optimized-zero-stride-load is present. Expanding it allows more .vx splat patterns to be matched, which is needed to prevent regressions in #98111. If the feature is present, RISCVISelDAGToDAG will combine it back to a zero strided load. The RV32 test diff also shows how need to emit a zero strided load either way after expanding an SEW=64 strided load. We could maybe fix this in a later patch by not doing the expand if SEW>XLEN. show more ...
# 94279ae4	15-Jul-2024	Yeting Kuo <46629943+yetingk@users.noreply.github.com>	[RISCV] Recommit "Expand vp.stride.load to splat of a scalar load." (#98579) This is a recommit of #98140. The old commit should be rebased on #98205 which changes the feature of hardware zero stri [RISCV] Recommit "Expand vp.stride.load to splat of a scalar load." (#98579) This is a recommit of #98140. The old commit should be rebased on #98205 which changes the feature of hardware zero stride optimization. It's a similar patch as a214c521f8763b36dd400b89017f74ad5ae4b6c7 for vp.stride.load. Some targets prefer pattern (vmv.v.x (load)) instead of vlse with zero stride. show more ...
# cea7bad7	11-Jul-2024	Nico Weber <thakis@chromium.org>	Revert "[RISCV] Expand vp.stride.load to splat of a scalar load." (#98422) Reverts llvm/llvm-project#98140 Breaks tests, see comments on the PR.
# cda245a3	11-Jul-2024	Yeting Kuo <46629943+yetingk@users.noreply.github.com>	[RISCV] Expand vp.stride.load to splat of a scalar load. (#98140) It's a similar patch as a214c521f8763b36dd400b89017f74ad5ae4b6c7 for vp.stride.load. Some targets prefer pattern (vmv.v.x (load)) i [RISCV] Expand vp.stride.load to splat of a scalar load. (#98140) It's a similar patch as a214c521f8763b36dd400b89017f74ad5ae4b6c7 for vp.stride.load. Some targets prefer pattern (vmv.v.x (load)) instead of vlse with zero stride. It's IR version of #97798. show more ...
# 9df71d76	28-Jun-2024	Nikita Popov <npopov@redhat.com>	[IR] Add getDataLayout() helpers to Function and GlobalValue (#96919) Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, re [IR] Add getDataLayout() helpers to Function and GlobalValue (#96919) Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, replacing the current `getParent()->getDataLayout()` pattern. show more ...
Revision tags: llvmorg-18.1.8, llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# 93968912	19-Jan-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Don't look for sext in RISCVCodeGenPrepare::visitAnd. We want to know the upper 33 bits of the And Input are zero. SExt only guarantees they are the same. We originally checked for SExt or [RISCV] Don't look for sext in RISCVCodeGenPrepare::visitAnd. We want to know the upper 33 bits of the And Input are zero. SExt only guarantees they are the same. We originally checked for SExt or ZExt when we were using isImpliedByDomCondition because a ZExt may have been changed to SExt before we visited the And. We are no longer using isImpliedByDomCondition so we can only look for zext with the nneg flag. While here, switch to PatternMatch to simplify the code. Fixes #78783 show more ...
# 15b0fabb	18-Jan-2024	Luke Lau <luke@igalia.com>	[RISCV] Vectorize phi for loop carried @llvm.vector.reduce.fadd (#78244) LLVM vector reduction intrinsics return a scalar result, but on RISC-V vector reduction instructions write the result in the [RISCV] Vectorize phi for loop carried @llvm.vector.reduce.fadd (#78244) LLVM vector reduction intrinsics return a scalar result, but on RISC-V vector reduction instructions write the result in the first element of a vector register. So when a reduction in a loop uses a scalar phi, we end up with unnecessary scalar moves: loop: vfmv.s.f v10, fa0 vfredosum.vs v8, v8, v10 vfmv.f.s fa0, v8 This mainly affects ordered fadd reductions, which has a scalar accumulator operand. This tries to vectorize any scalar phis that feed into a fadd reduction in RISCVCodeGenPrepare, converting: loop: %phi = phi <float> [ ..., %entry ], [ %acc, %loop] %acc = call float @llvm.vector.reduce.fadd.nxv4f32(float %phi, <vscale x 2 x float> %vec) ``` to loop: %phi = phi <vscale x 2 x float> [ ..., %entry ], [ %acc.vec, %loop] %phi.scalar = extractelement <vscale x 2 x float> %phi, i64 0 %acc = call float @llvm.vector.reduce.fadd.nxv4f32(float %x, <vscale x 2 x float> %vec) %acc.vec = insertelement <vscale x 2 x float> poison, float %acc.next, i64 0 Which eliminates the scalar -> vector -> scalar crossing during instruction selection. show more ...
Revision tags: llvmorg-17.0.6, llvmorg-17.0.5
# d64d5ea1	13-Nov-2023	Yingwei Zheng <dtcxzyw2333@gmail.com>	[RISCV][CodeGenPrepare] Remove duplicated transform for zext. NFC. (#72053) After #71534 and #72052, the transform `zext -> zext nneg` in `RISCVCodeGenPrepare` is redundant.
Revision tags: llvmorg-17.0.4
# 784a2cd5	30-Oct-2023	Philip Reames <preames@rivosinc.com>	[RISCV] Rewrite RISCVCodeGenPrepare using zext nneg [nfc-ish] (#70739) This stacks on #70725. Once we have lowering for zext nneg, we can rewrite all of the existing RISCVCodeGenPrepare login in te [RISCV] Rewrite RISCVCodeGenPrepare using zext nneg [nfc-ish] (#70739) This stacks on #70725. Once we have lowering for zext nneg, we can rewrite all of the existing RISCVCodeGenPrepare login in terms of zext nneg instead of sext. The change isn't NFC from the perspective of the individual pass, but should be from the perspective of codegen as a whole. As noted in the TODO, one piece can be moved to instcombine, but I'll leave that to a separate commit. show more ...
Revision tags: llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init, llvmorg-16.0.6, llvmorg-16.0.5, llvmorg-16.0.4, llvmorg-16.0.3, llvmorg-16.0.2, llvmorg-16.0.1
# 0f4c9c01	27-Mar-2023	Craig Topper <craig.topper@sifive.com>	[RISCV] Replace RISCV->RISC-V in strings. To be consistent with RISC-V branding guidelines https://riscv.org/about/risc-v-branding-guidelines/ Think we should be using RISC-V where possible. D14644 [RISCV] Replace RISCV->RISC-V in strings. To be consistent with RISC-V branding guidelines https://riscv.org/about/risc-v-branding-guidelines/ Think we should be using RISC-V where possible. D146449 already updated comments. Strings may have more user impact. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D146451 show more ...
# 29463612	27-Mar-2023	Craig Topper <craig.topper@sifive.com>	[RISCV] Replace RISCV -> RISC-V in comments. NFC To be consistent with RISC-V branding guidelines https://riscv.org/about/risc-v-branding-guidelines/ Think we should be using RISC-V where possible. [RISCV] Replace RISCV -> RISC-V in comments. NFC To be consistent with RISC-V branding guidelines https://riscv.org/about/risc-v-branding-guidelines/ Think we should be using RISC-V where possible. More patches will follow. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D146449 show more ...
Revision tags: llvmorg-16.0.0, llvmorg-16.0.0-rc4, llvmorg-16.0.0-rc3, llvmorg-16.0.0-rc2, llvmorg-16.0.0-rc1, llvmorg-17-init, llvmorg-15.0.7, llvmorg-15.0.6, llvmorg-15.0.5, llvmorg-15.0.4, llvmorg-15.0.3, working, llvmorg-15.0.2, llvmorg-15.0.1, llvmorg-15.0.0, llvmorg-15.0.0-rc3
# 37db2833	13-Aug-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] isImpliedByDomCondition returns an Optional<bool> not a bool. We were incorrectly checking that it returned an implicaton result, not that the implication result itself was true.
Revision tags: llvmorg-15.0.0-rc2
# f19497f7	03-Aug-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] Use InstVisitor in RISCVCodeGenPrepare. NFC Makes it easy to add new instructions to look at without dispatching manually.
Revision tags: llvmorg-15.0.0-rc1, llvmorg-16-init
# 1db6d6dc	25-Jul-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] Teach RISCVCodeGenPrepare to optimize (zext (abs(i32 X, i1 1))). (abs(i32 X, i1 1) always produces a positive result. The 'i1 1' means INT_MIN input produces poison. If the result is sign ex [RISCV] Teach RISCVCodeGenPrepare to optimize (zext (abs(i32 X, i1 1))). (abs(i32 X, i1 1) always produces a positive result. The 'i1 1' means INT_MIN input produces poison. If the result is sign extended, InstCombine will convert it to zext. This does not produce ideal code for RISCV. This patch reverses the zext back to sext which can be folded into a subw or negw. Ideally we'd do this in SelectionDAG, but we lose the INT_MIN poison flag when llvm.abs becomes ISD::ABS. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D130412 show more ...
# 8cc48309	17-Jul-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] Teach RISCVCodeGenPrepare to optimize (i64 (and (zext/sext (i32 X), C1))) If X is known positive by a dominating condition, we can fill in ones into the upper bits of C1 if that would allow [RISCV] Teach RISCVCodeGenPrepare to optimize (i64 (and (zext/sext (i32 X), C1))) If X is known positive by a dominating condition, we can fill in ones into the upper bits of C1 if that would allow it to become an simm12 allowing the use of ANDI. This pattern often occurs in unrolled loops where the induction variable has been widened. To get the best benefit from this, I had to move the pass above ConstantHoisting which is in addIRPasses. Otherwise the AND constant is often hoisted away from the AND. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D129888 show more ...
# 73f766ca	17-Jul-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] Remove unnecessary use of IRBuilder from RISCVCodeGenPrepare. We're creating single instruction to replace another instruction. We can insert using the InsertBefore operand of the constructo [RISCV] Remove unnecessary use of IRBuilder from RISCVCodeGenPrepare. We're creating single instruction to replace another instruction. We can insert using the InsertBefore operand of the constructor. Then copy the debug location. show more ...
# 1a8468ba	14-Jul-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] Add a RISCV specific CodeGenPrepare pass. Initial optimization is to convert (i64 (zext (i32 X))) to (i64 (sext (i32 X))) if the dominating condition for the basic block guaranteed the sign [RISCV] Add a RISCV specific CodeGenPrepare pass. Initial optimization is to convert (i64 (zext (i32 X))) to (i64 (sext (i32 X))) if the dominating condition for the basic block guaranteed the sign bit of X is zero. This frequently occurs in loop preheaders where a signed induction variable that can never be negative has been widened. There will be a dominating check that the 32-bit trip count isn't negative or zero. The check here is not restricted to that specific case though. A i32->i64 sext is cheaper than zext on RV64 without the Zba extension. Later optimizations can often remove the sext from the preheader basic block because the dominating block also needs a sext to evaluate the greater than 0 check. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D129732 show more ...