O0-pipeline.ll - OpenGrok history log for /llvm-project/llvm/test/CodeGen/RISCV/O0-pipeline.ll

Revision	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4
# 97982a8c	05-Nov-2024	dlav-sc <daniil.avdeev@syntacore.com>	[RISCV][CFI] add function epilogue cfi information (#110810) This patch adds CFI instructions in the function epilogue.
Before patch:
    addi sp, s0, -32
    ld ra, 24(sp) # 8-byte Folded Reload
    ld s0, 16(sp) # 8-byte Folded Reload
    ld s1, 8(sp) # 8-byte Folded Reload
    addi sp, sp, 32
    ret
After patch:
    addi sp, s0, -32
    .cfi_def_cfa sp, 32
    ld ra, 24(sp) # 8-byte Folded Reload
    ld s0, 16(sp) # 8-byte Folded Reload
    ld s1, 8(sp) # 8-byte Folded Reload
    .cfi_restore ra
    .cfi_restore s0
    .cfi_restore s1
    addi sp, sp, 32
    .cfi_def_cfa_offset 0
    ret
This functionality is already present in `riscv-gcc`, but it’s not in `clang` and this slightly impairs the `lldb` debugging experience, e.g. backtrace.
Revision tags: llvmorg-19.1.3, llvmorg-19.1.2, llvmorg-19.1.1
# 0ee10e94	19-Sep-2024	Alex Bradbury <asb@igalia.com>	[RISCV] Add additional fence for amocas when required by recent ABI change (#101023) A recent atomics ABI change / fix requires that for the "A6C" and "A6S" atomics ABIs (i.e. both of those supported by LLVM currently), an additional fence is inserted for an atomic_compare_exchange with seq_cst failure ordering. <https://github.com/riscv-non-isa/riscv-elf-psabi-doc/pull/445> This isn't trivial to support through the hooks used by AtomicExpandPass because that pass assumes that when fences are inserted, the original atomics ordering information can be removed from the instruction. Rather than try to change and complicate that API, this patch implements the needed fence insertion through a small special-purpose pass.
Revision tags: llvmorg-19.1.0, llvmorg-19.1.0-rc4
# 3d08ade7	29-Aug-2024	Stephen Tozer <stephen.tozer@sony.com>	[ExtendLifetimes] Implement llvm.fake.use to extend variable lifetimes (#86149) This patch is part of a set of patches that add an `-fextend-lifetimes` flag to clang, which extends the lifetimes of local variables and parameters for improved debuggability. In addition to that flag, the patch series adds a pragma to selectively disable `-fextend-lifetimes`, and an `-fextend-this-ptr` flag which functions as `-fextend-lifetimes` for `this` pointers only. All changes and tests in these patches were written by Wolfgang Pieb (@wolfy1961), while Stephen Tozer (@SLTozer) has handled review and merging. The extend lifetimes flag is intended to eventually be set on by `-Og`, as discussed in the RFC here: https://discourse.llvm.org/t/rfc-redefine-og-o1-and-add-a-new-level-of-og/72850 This patch implements a new intrinsic instruction in LLVM, `llvm.fake.use` in IR and `FAKE_USE` in MIR, that takes a single operand and has no effect other than "using" its operand, to ensure that its operand remains live until after the fake use. This patch does not emit fake uses anywhere; the next patch in this sequence causes them to be emitted from the clang frontend, such that for each variable (or `this`) a fake.use operand is inserted at the end of that variable's scope, using that variable's value. This patch covers everything post-frontend, which is largely just the basic plumbing for a new intrinsic/instruction, along with a few steps to preserve the fake uses through optimizations (such as moving them ahead of a tail call or translating them through SROA). Co-authored-by: Stephen Tozer <stephen.tozer@sony.com>
Revision tags: llvmorg-19.1.0-rc3
# e80d8e1b	08-Aug-2024	Yeting Kuo <46629943+yetingk@users.noreply.github.com>	[RISCV] Insert simple landing pad before indirect jumps for Zicfilp. (#91860) This patch is based on https://github.com/llvm/llvm-project/pull/91855. This patch inserts a simple landing pad ([pr]) before indirect jumps. It also makes the option riscv-landing-pad-label influence this feature. [pr]: https://github.com/riscv-non-isa/riscv-elf-psabi-doc/pull/417
# 9fb196b4	06-Aug-2024	Yeting Kuo <46629943+yetingk@users.noreply.github.com>	[RISCV] Insert simple landing pad for taken address labels. (#91855) This patch implements simple landing pad labels ([pr]). When Zicfilp is enabled, this patch inserts `lpad 0` at the beginning of basic blocks that can be reached by indirect jumps. This patch also supports the option riscv-landing-pad-label, which lets users set nonzero fixed labels. Using a nonzero fixed label forces t2 to be set before indirect jumps. It's less portable but stricter than the original implementation. [pr]: https://github.com/riscv-non-isa/riscv-elf-psabi-doc/pull/417
# fa92d51f	06-Aug-2024	Alexis Engelke <engelke@in.tum.de>	[VP] Merge ExpandVP pass into PreISelIntrinsicLowering (#101652) Similar to #97727; avoid an extra pass over the entire IR by performing the lowering as part of the pre-isel-intrinsic-lowering pass.
Revision tags: llvmorg-19.1.0-rc2
# b5fc083d	01-Aug-2024	Alexis Engelke <engelke@in.tum.de>	[CodeGen] Merge lowerConstantIntrinsics into pre-isel lowering (#97727) Currently, the LowerConstantIntrinsics pass does an RPO traversal of every function... only to find that many functions don't have constant intrinsics (is.constant, objectsize). In the CodeGen pipeline, there is already a pre-isel intrinsic lowering pass, which iterates over intrinsic declarations and lowers all users. Call lowerConstantIntrinsics from this pass to avoid the extra iteration over the entire IR and the RPO traversal.
Revision tags: llvmorg-19.1.0-rc1, llvmorg-20-init
# 87560434	17-Jun-2024	Philip Reames <preames@rivosinc.com>	[RISCV] Teach RISCVInsertVSETVLI to work without LiveIntervals (Reapplying with corrected commit message) We recently moved RISCVInsertVSETVLI from before vector register allocation to after vector register allocation. When doing so, we added an unconditional dependency on LiveIntervals - even at O0 where LiveIntervals hadn't previously run. As reported in #93587, this was apparently not safe to do. This change makes LiveIntervals optional, and adjusts all the update code to only run when live intervals is present. The only real tricky part of this change is the abstract state tracking in the dataflow. We need to represent a "register w/unknown definition" state - but only when we don't have LiveIntervals. This adjusts the abstract state definition so that the AVLIsReg state can represent either a register + valno, or a register + unknown definition. With LiveIntervals, we have an exact definition for each AVL use. Without LiveIntervals, we treat the definition of a register AVL as being unknown. The key semantic change is that we now have a state in the lattice for which something is known about the AVL value, but for which two identical lattice elements do not necessarily represent the same AVL value at runtime. Previously, the only case which could result in such an unknown AVL was the fully unknown state (where VTYPE is also fully unknown). This requires a small adjustment to hasSameAVL and lattice state equality to draw this important distinction. The net effect of this patch is that we remove the LiveIntervals dependency at O0, and O0 code quality will regress for cases involving register AVL values. In practice, this means we pessimize code written with intrinsics at O0.
This patch is an alternative to #93796 and #94340. It is very directly inspired by review conversation around them, and thus should be considered coauthored by Luke.
# 1d028151	17-Jun-2024	Philip Reames <preames@rivosinc.com>	Revert "[RISCV] Teach RISCVInsertVSETVLI to work without LiveIntervals (#94686)" This reverts commit 111507ed4ce49bbb8cfbf36a3e143bb25f0f13c0. Accidentally landed with stale commit message, will reply shortly.
# 111507ed	17-Jun-2024	Philip Reames <preames@rivosinc.com>	[RISCV] Teach RISCVInsertVSETVLI to work without LiveIntervals (#94686) Stacked on https://github.com/llvm/llvm-project/pull/94658. We recently moved RISCVInsertVSETVLI from before vector register allocation to after vector register allocation. When doing so, we added an unconditional dependency on LiveIntervals - even at O0 where LiveIntervals hadn't previously run. As reported in #93587, this was apparently not safe to do. This change makes LiveIntervals optional, and adjusts all the update code to only run when live intervals is present. The only real tricky part of this change is the abstract state tracking in the dataflow. We need to represent a "register w/unknown definition" state - but only when we don't have LiveIntervals. This adjusts the abstract state definition so that the AVLIsReg state can represent either a register + valno, or a register + unknown definition. With LiveIntervals, we have an exact definition for each AVL use. Without LiveIntervals, we treat the definition of a register AVL as being unknown. The key semantic change is that we now have a state in the lattice for which something is known about the AVL value, but for which two identical lattice elements do not necessarily represent the same AVL value at runtime. Previously, the only case which could result in such an unknown AVL was the fully unknown state (where VTYPE is also fully unknown). This requires a small adjustment to hasSameAVL and lattice state equality to draw this important distinction. The net effect of this patch is that we remove the LiveIntervals dependency at O0, and O0 code quality will regress for cases involving register AVL values.
This patch is an alternative to https://github.com/llvm/llvm-project/pull/93796 and https://github.com/llvm/llvm-project/pull/94340. It is very directly inspired by review conversation around them, and thus should be considered coauthored by Luke.
Revision tags: llvmorg-18.1.8, llvmorg-18.1.7
# cab81dd0	31-May-2024	Egor Pasko <pasko@chromium.org>	[EntryExitInstrumenter] Move passes out of clang into LLVM default pipelines (#92171) Move EntryExitInstrumenter(PostInlining=true) to as late as possible and EntryExitInstrumenter(PostInlining=false) to an early pre-inlining stage (but skip for ThinLTO post-link). This should fix the issues reported in https://github.com/rust-lang/rust/issues/92109 and https://github.com/llvm/llvm-project/issues/52853. These are caused by https://reviews.llvm.org/D97608.
# 1cff7413	29-May-2024	Luke Lau <luke@igalia.com>	[RISCV] Merge RISCVCoalesceVSETVLI back into RISCVInsertVSETVLI (#92869) We no longer need to separate the passes now that #70549 is landed and this will unblock #89089. It's not strictly NFC because it will move coalescing before register allocation when -riscv-vsetvl-after-rvv-regalloc is disabled. But this makes it closer to the original behaviour.
# 1579e9ca	24-May-2024	Nikita Popov <nikita.ppv@gmail.com>	Revert "Run ObjCContractPass in Default Codegen Pipeline (#92331)" This reverts commit 8cc8e5d6c6ac9bfc888f3449f7e424678deae8c2. This reverts commit dae55c89835347a353619f506ee5c8f8a2c136a7. Causes major compile-time regressions for unoptimized builds.
# 8cc8e5d6	23-May-2024	Nuri Amari <nuri.amari99@gmail.com>	Run ObjCContractPass in Default Codegen Pipeline (#92331) Prior to this patch, when using -fthinlto-index= the ObjCARCContractPass isn't run prior to CodeGen, and instruction selection fails on IR containing arc intrinsics. This patch is motivated by that use case. The pass was previously added in various places codegen is performed. This patch adds the pass to the default codegen pipeline, makes sure it bails immediately if no arc intrinsics are found, and removes the ad hoc scheduling of the pass. Co-authored-by: Nuri Amari <nuriamari@fb.com>
# 675e7bd1	21-May-2024	Piyou Chen <piyou.chen@sifive.com>	[RISCV] Support postRA vsetvl insertion pass (#70549) This patch tries to get rid of the vsetvl implicit vl/vtype def-use chain and improve register allocation quality by moving the vsetvl insertion pass after RVV register allocation. This benefits the following optimizations: 1. unblock the scheduler's constraints by removing the vl/vtype def-use chain 2. Support RVV re-materialization 3. Support partial spill This patch adds a new option `-riscv-vsetvl-after-rvv-regalloc=<1|0>` to control this feature, which is disabled by default.
Revision tags: llvmorg-18.1.6
# 1a58e886	15-May-2024	Luke Lau <luke@igalia.com>	[RISCV] Move RISCVInsertVSETVLI to after phi elimination (#91440) Split off from #70549, this patch moves RISCVInsertVSETVLI to after phi elimination where we exit SSA and need to move to LiveVariables. The motivation for splitting this off is to avoid the large scheduling diffs from moving completely to after regalloc, and instead focus on converting the pass to work on LiveIntervals. The two main changes required are updating VSETVLIInfo to store VNInfos instead of MachineInstrs, which allows us to still check for PHI defs in needVSETVLIPHI, and fixing up the live intervals of any AVL operands after inserting new instructions. On O3 the pass is inserted after the register coalescer, otherwise we end up with a bunch of COPYs around eliminated PHIs that trip up needVSETVLIPHI. Co-authored-by: Piyou Chen <piyou.chen@sifive.com>
# 0ebe48f0	10-May-2024	Luke Lau <luke@igalia.com>	[RISCV] Move RISCVInsertVSETVLI after CSR/VXRM passes (#91701) This further splits off #91440 to inch RISCVInsertVSETVLI closer to post vector regalloc. As noted in #91440, most of the diffs are from moving vsetvli insertion after the vxrm/csr insertion passes, but these are getting conflated with the changes from moving to LiveIntervals. One idea was that we could try and remove some of these diffs by manually moving back the vsetvlis past the vxrm/csr instructions. But this meant having to touch up the LiveIntervals again which seemed to lead to even more diffs. This instead just moves RISCVInsertVSETVLI after RISCVInsertReadWriteCSR and RISCVInsertWriteVXRM so we can isolate those changes.
Revision tags: llvmorg-18.1.5
# af82d01f	24-Apr-2024	Luke Lau <luke@igalia.com>	Reapply "[RISCV] Separate doLocalPostpass into new pass and move to post vector regalloc (#88295)" The original commit was calling shrinkToUses on an interval for a virtual register whose def was erased. This fixes it by calling shrinkToUses first and removing the interval if we erase the old VL def.
# fc13353e	24-Apr-2024	Luke Lau <luke@igalia.com>	Revert "[RISCV] Separate doLocalPostpass into new pass and move to post vector regalloc (#88295)" Seems to cause an address sanitizer failure on one of the buildbots related to live intervals.
# 603ba4c5	24-Apr-2024	Luke Lau <luke@igalia.com>	[RISCV] Separate doLocalPostpass into new pass and move to post vector regalloc (#88295) This patch splits off part of the work to move vsetvli insertion to post regalloc in #70549. The doLocalPostpass operates outside of RISCVInsertVSETVLI's dataflow, so we can move it to its own pass. We can then move it to post vector regalloc which should be a smaller change. A couple of things that are different from #70549: - This manually fixes up the LiveIntervals rather than recomputing it via createAndComputeVirtRegInterval. I'm not sure if there's much of a difference with either. - For the postpass it's sufficient to just check isUndef() in hasUndefinedMergeOp, i.e. we don't need to look up the def in VNInfo. Running on llvm-test-suite and SPEC CPU 2017 there aren't any changes in the number of vsetvlis removed. There are some minor scheduling diffs as well as extra spills and fewer spills in some cases (caused by transient vsetvlis existing between RISCVInsertVSETVLI and RISCVCoalesceVSETVLI when vec regalloc happens), but they are minor and should go away once we finish moving the rest of RISCVInsertVSETVLI. We could also potentially turn off this pass for unoptimised builds.
Revision tags: llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4
# 28233408	26-Feb-2024	Jack Styles <99514724+Stylie777@users.noreply.github.com>	[CodeGen] [ARM] Make RISC-V Init Undef Pass Target Independent and add support for the ARM Architecture. (#77770) When using Greedy Register Allocation, there are times where early-clobber values are ignored, and assigned the same register. This is illegal behaviour for these instructions. To get around this, using Pseudo instructions for early-clobber registers gives them a definition and allows Greedy to assign them to a different register. This then meets the ARM Architecture Reference Manual and matches the defined behaviour. This patch takes the existing RISC-V patch and makes it target independent, then adds support for the ARM Architecture. Doing this will ensure early-clobber constraints are followed when using the ARM Architecture. Making the pass target independent will also open up the possibility that support for other architectures can be added in the future.
Revision tags: llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# d0a39e61	01-Dec-2023	Piyou Chen <piyou.chen@sifive.com>	[RISCV] default enable splitting regalloc between RVV and other (#72950) This patch makes riscv-split-regalloc true by default. It will not affect the codegen result if vector register allocation doesn't exist. If there is vector register allocation, it may affect the non-RVV register LiveInterval's segment/weight, which will make the allocation happen in a different order.
Revision tags: llvmorg-17.0.6, llvmorg-17.0.5
# 014390d9	02-Nov-2023	Craig Topper <craig.topper@sifive.com>	[RISCV] Implement cross basic block VXRM write insertion. (#70382) This adds a new pass to insert VXRM writes for vector instructions, with the goal of avoiding redundant writes. The pass runs 2 dataflow algorithms. The first is a forward dataflow to calculate where a VXRM value is available. The second is a backward dataflow to determine where a VXRM value is anticipated. Finally, we use the results of these two dataflows to insert VXRM writes where a value is anticipated, but not available. The pass does not split critical edges so we aren't always able to eliminate all redundancy. The pass will only insert vxrm writes on paths that always require it.
Revision tags: llvmorg-17.0.4
# 109aa586	26-Oct-2023	Craig Topper <craig.topper@sifive.com>	[RISCV] Add an experimental pseudoinstruction to represent a rematerializable constant materialization sequence. (#69983) Rematerialization during register allocation is currently limited to a single instruction with no inputs. This patch introduces a pseudoinstruction that represents the materialization of a constant. I've started with a sequence of 2 instructions for now, which covers at least the common LUI+ADDI(W) case. This instruction will be expanded into real instructions immediately after register allocation using a new pass. This gives the post-RA scheduler a chance to separate the 2 instructions to improve ILP. I believe this matches the approach used by AArch64. Unfortunately, this loses some CSE opportunities when an LUI value is used by multiple constants with different LSBs. This feature is off by default and a new backend command line option is added to enable it for testing. This avoids the spill and reloads reported in #69586.
Revision tags: llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3
# a63bd7e9	14-Aug-2023	Philip Reames <preames@rivosinc.com>	[RISCV] Use NoReg in place of IMPLICIT_DEF for undefined passthru operands In a recent series of refactorings (described here: https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295), I greatly increased the number of IMPLICIT_DEF operands to our vector instructions. This has turned out to have an unexpected negative impact because MachineCSE does not CSE IMPLICIT_DEFs, and thus does not CSE any instruction with an IMPLICIT_DEF operand. SelectionDAG does CSE the same case, but that only covers the same block case, not the cross block case. This led to the performance regression reported in https://github.com/llvm/llvm-project/issues/64282. This change is a slightly ugly hack to sidestep the issue. Instead of fixing the root cause (lack of CSE for IMPLICIT_DEF) or undoing the operand changes, we leave the extra operand in place, and use NoReg in place of IMPLICIT_DEF. I then convert back to IMPLICIT_DEF just before register allocation so that ProcessImplicitDefs and TwoAddressInstructions can do the normal transforms to Undef tied registers. We may end up backporting this into the 17.x release branch. Given how late in the release cycle this is landing, that's much less likely now, but still a possibility. Differential Revision: https://reviews.llvm.org/D156909