LoopUnrollRuntime.cpp - OpenGrok history log for /llvm-project/llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
# 611ffcf4	14-Jul-2022	Kazu Hirata <kazu@google.com>	[llvm] Use value instead of getValue (NFC)
# 6d5f8143	29-Jun-2022	Florian Hahn <flo@fhahn.com>	[LoopUnrollRuntime] Invalidate SCEV for exit phi in ConnectProlog. ConnectProlog adds new incoming values to exit phi nodes which can change the SCEV for the phi after 20d798bd47ec51. Fix is analog [LoopUnrollRuntime] Invalidate SCEV for exit phi in ConnectProlog. ConnectProlog adds new incoming values to exit phi nodes which can change the SCEV for the phi after 20d798bd47ec51. Fix is analog to cfc741bc0e029. Fixes #56286. show more ...
# 9a35f19e	29-Jun-2022	Florian Hahn <flo@fhahn.com>	[UnrollRuntime] Invalidate SCEVs for modified phis in ConnectEpilog. ConnectEpilog adds new incoming values to exit phi nodes which can change the SCEV for the phi after 20d798bd47ec51. Fix is anal [UnrollRuntime] Invalidate SCEVs for modified phis in ConnectEpilog. ConnectEpilog adds new incoming values to exit phi nodes which can change the SCEV for the phi after 20d798bd47ec51. Fix is analog to cfc741bc0e029. Fixes #56282. show more ...
# a7938c74	26-Jun-2022	Kazu Hirata <kazu@google.com>	[llvm] Don't use Optional::hasValue (NFC) This patch replaces Optional::hasValue with the implicit cast to bool in conditionals only.
# 3b7c3a65	25-Jun-2022	Kazu Hirata <kazu@google.com>	Revert "Don't use Optional::hasValue (NFC)" This reverts commit aa8feeefd3ac6c78ee8f67bf033976fc7d68bc6d.
# aa8feeef	25-Jun-2022	Kazu Hirata <kazu@google.com>	Don't use Optional::hasValue (NFC)
Revision tags: llvmorg-14.0.6, llvmorg-14.0.5
# b8c2781f	09-Jun-2022	Simon Moll <moll@cs.uni-saarland.de>	[NFC] format InstructionSimplify & lowerCaseFunctionNames Clang-format InstructionSimplify and convert all "FunctionName"s to "functionName". This patch does touch a lot of files but gets done with [NFC] format InstructionSimplify & lowerCaseFunctionNames Clang-format InstructionSimplify and convert all "FunctionName"s to "functionName". This patch does touch a lot of files but gets done with the cleanup of InstructionSimplify in one commit. This is the alternative to the less invasive clang-format only patch: D126783 Reviewed By: spatel, rengolin Differential Revision: https://reviews.llvm.org/D126889 show more ...
Revision tags: llvmorg-14.0.4
# 81c648a3	18-May-2022	Nikita Popov <npopov@redhat.com>	[LoopUnroll] Freeze tripcount rather than condition This is a followup to D125754. We introduce two branches, one before the unrolled loop and one before the epilogue (and similar for the prologue c [LoopUnroll] Freeze tripcount rather than condition This is a followup to D125754. We introduce two branches, one before the unrolled loop and one before the epilogue (and similar for the prologue case). The previous patch only froze the condition on the first branch. Rather than independently freezing the second condition, this patch instead freezes TripCount and bases BECount on it. These are the two quantities involved in the conditions, and this ensures that both work on a consistent, non-poisonous trip count. Differential Revision: https://reviews.llvm.org/D125896 show more ...
# 323514de	17-May-2022	Nikita Popov <npopov@redhat.com>	[LoopUnroll] Avoid branch on poison for runtime unroll with multiple exits When performing runtime unrolling with multiple exits, one of the earlier (non-latch) exits may exit the loop on the first [LoopUnroll] Avoid branch on poison for runtime unroll with multiple exits When performing runtime unrolling with multiple exits, one of the earlier (non-latch) exits may exit the loop on the first iteration, such that we never branch on the latch exit condition. As such, we need to freeze the condition of the new branch that is introduced before the loop, as it now executes unconditionally. Differential Revision: https://reviews.llvm.org/D125754 show more ...
Revision tags: llvmorg-14.0.3, llvmorg-14.0.2, llvmorg-14.0.1, llvmorg-14.0.0, llvmorg-14.0.0-rc4, llvmorg-14.0.0-rc3, llvmorg-14.0.0-rc2
# a494ae43	01-Mar-2022	serge-sans-paille <sguelton@redhat.com>	Cleanup includes: TransformsUtils Estimation on the impact on preprocessor output: before: 1065307662 after: 1064800684 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include- Cleanup includes: TransformsUtils Estimation on the impact on preprocessor output: before: 1065307662 after: 1064800684 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120741 show more ...
Revision tags: llvmorg-14.0.0-rc1, llvmorg-15-init, llvmorg-13.0.1, llvmorg-13.0.1-rc3, llvmorg-13.0.1-rc2
# 84b978da	01-Dec-2021	Nikita Popov <nikita.ppv@gmail.com>	[LoopUnrollRuntime] Remove unnecessary pointer BECount check (NFC) BECounts are guaranteed to be integers nowadays.
Revision tags: llvmorg-13.0.1-rc1
# 8f95e915	15-Nov-2021	Philip Reames <listmail@philipreames.com>	[unroll-runtime] Relax two profitability limitations on multi-exit unrolling This change is mostly about getting rid of some "uninteresting" cases in a follow on deeper heuristic change. If anyone [unroll-runtime] Relax two profitability limitations on multi-exit unrolling This change is mostly about getting rid of some "uninteresting" cases in a follow on deeper heuristic change. If anyone sees actually interesting code differences out of this, please let me know. I'm not expecting this to have much impact at all. Case 1 - With the single deoptimize non-latch exit, we can't have two exiting blocks sharing an exit block. We can only hit this with a poorly documented debug flag. Case 2 - Why should we treat epilog cases differently from prolog cases? Or to say it differently, why should starting with a constant control whether a multiple exit loop gets unrolled? Sorry for the lack of tests here. These are both exceedingly narrow cases in practice, and after a while trying, I couldn't come up with a test which did anything "useful" as opposed to simply exercise a random combination of force flags. Note that the legality cases for each are already exercised with force flags. show more ...
# 423da618	15-Nov-2021	Philip Reames <listmail@philipreames.com>	[runtime-unroll] Inline canSafelyUnrollMultiExitLoop [NFC] All of the interesting logic from this routine has been removed, inline the single check into the sole non-assert caller. The assert use h [runtime-unroll] Inline canSafelyUnrollMultiExitLoop [NFC] All of the interesting logic from this routine has been removed, inline the single check into the sole non-assert caller. The assert use has little value with the restructured code and is simply dropped. show more ...
# e99902a8	15-Nov-2021	Philip Reames <listmail@philipreames.com>	[runtime-unroll] Restructure if-clause to improve readability [NFC]
# 37ead201	12-Nov-2021	Philip Reames <listmail@philipreames.com>	[runtime-unroll] Use incrementing IVs instead of decrementing ones This is one of those wonderful "in theory X doesn't matter, but in practice is does" changes. In this particular case, we shift the [runtime-unroll] Use incrementing IVs instead of decrementing ones This is one of those wonderful "in theory X doesn't matter, but in practice is does" changes. In this particular case, we shift the IVs inserted by the runtime unroller to clamp iteration count of the loops* from decrementing to incrementing. Why does this matter? A couple of reasons: * SCEV doesn't have a native subtract node. Instead, all subtracts (A - B) are represented as A + -1 * B and drops any flags invalidated by such. As a result, SCEV is slightly less good at reasoning about edge cases involving decrementing addrecs than incrementing ones. (You can see this in the inferred flags in some of the test cases.) * Other parts of the optimizer produce incrementing IVs, and they're common in idiomatic source language. We do have support for reversing IVs, but in general if we produce one of each, the pair will persist surprisingly far through the optimizer before being coalesced. (You can see this looking at nearby phis in the test cases.) Note that if the hardware prefers decrementing (i.e. zero tested) loops, LSR should convert back immediately before codegen. * Mostly irrelevant detail: The main loop of the prolog case is handled independently and will simple use the original IV with a changed start value. We could in theory use this scheme for all iteration clamping, but that's a larger and more invasive change. show more ...
# 81e9c906	15-Oct-2021	Kazu Hirata <kazu@google.com>	[llvm] Use llvm::is_contained (NFC)
Revision tags: llvmorg-13.0.0, llvmorg-13.0.0-rc4, llvmorg-13.0.0-rc3
# abca4c01	13-Sep-2021	Kazu Hirata <kazu@google.com>	[Utils] Use make_early_inc_range (NFC)
# fa82a3d0	02-Sep-2021	Philip Reames <listmail@philipreames.com>	[runtimeunroll] Support epilogue unrolling with a parent loop This patch adds support for unrolling inner loops using epilogue unrolling. The basic issue is that the original latch exit block of the [runtimeunroll] Support epilogue unrolling with a parent loop This patch adds support for unrolling inner loops using epilogue unrolling. The basic issue is that the original latch exit block of the inner loop could be outside the outer loop. When we clone the inner loop and split the latch exit, the cloned blocks need to be in the outer loop. Differential Revision: https://reviews.llvm.org/D108476 show more ...
# 45c672e2	02-Sep-2021	Philip Reames <listmail@philipreames.com>	[runtimeunroll] Under EXPENSIVE_CHECKS, validate loop info Requested in review comment on D108476
# b604fcb7	31-Aug-2021	Philip Reames <listmail@philipreames.com>	[runtime] Move prolog/epilog block to a post-simplify strategy The runtime unroller will try to produce a non-loop if the unroll count is 2 and thus the prolog/epilog loop would only run at most one [runtime] Move prolog/epilog block to a post-simplify strategy The runtime unroller will try to produce a non-loop if the unroll count is 2 and thus the prolog/epilog loop would only run at most one iteration. The old implementation did this by avoiding loop construction entirely. This patches instead constructs the trivial loop and then explicitly breaks the backedge and simplifies. This does result in some additional code churn when triggered, but a) results in better quality code and b) removes a codepath which didn't work properly for multiple exit epilogs. One oddity that I want to draw to reviewer attention is that this somehow changes revisit order. The new order looks equivalent to me, but I don't understand how creating and erasing an extra loop here creates this effect. Differential Revision: https://reviews.llvm.org/D108521 show more ...
Revision tags: llvmorg-13.0.0-rc2
# d8d84c9d	22-Aug-2021	Philip Reames <listmail@philipreames.com>	[runtimeunroll] Use early return to reduce nesting [nfc]
# 17b9cb18	19-Aug-2021	Philip Reames <listmail@philipreames.com>	[runtimeunroll] Support multiple exits to latch exit w/prolog loop This patch extends the runtime unrolling infrastructure to support unrolling a loop with multiple exiting blocks branching to the s [runtimeunroll] Support multiple exits to latch exit w/prolog loop This patch extends the runtime unrolling infrastructure to support unrolling a loop with multiple exiting blocks branching to the same exit block used by the latch. It intentionally does not include a cost model change to enable this functionality unless appropriate force flags are used. This is the prolog companion to D107381. Since this was LGTMed, a problem with DT updating was reported against that patch. I roled in the analogous fix here as it seemed obvious, and not worth re-review. As an aside, our prolog form leaves a lot of potential value on the floor when there is an invariant load or invariant condition in the loop being runtime unrolled. We should probably consider a "required prolog" heuristic. (Alternatively, maybe we should be peeling these cases more aggressively?) Differential Revision: https://reviews.llvm.org/D108262 show more ...
# 447256f2	19-Aug-2021	Philip Reames <listmail@philipreames.com>	[runtimeunroll] Fix reported DT verification error after 94d0914 In 94d0914, I added support for unrolling of multiple exit loops which have multiple exits reaching the latch. Per reports on the re [runtimeunroll] Fix reported DT verification error after 94d0914 In 94d0914, I added support for unrolling of multiple exit loops which have multiple exits reaching the latch. Per reports on the review post commit, I'd missed updating the domtree for one case. This fix addresses that ommission. There's no new test as this is covered by existing tests with expensive verification turned on. show more ...
# 94d09142	18-Aug-2021	Philip Reames <listmail@philipreames.com>	[runtimeunroll] Support multiple exits to latch exit w/epilogue loop This patch extends the runtime unrolling infrastructure to support unrolling a loop with multiple exiting blocks branching to the [runtimeunroll] Support multiple exits to latch exit w/epilogue loop This patch extends the runtime unrolling infrastructure to support unrolling a loop with multiple exiting blocks branching to the same exit block used by the latch. It intentionally does not include a cost model change to enable this functionality unless appropriate force flags are used. I decided to restrict this to the epilogue case. Given the changes ended up being pretty generic, we may be able to unblock the prolog case too, but I want to do that in a separate change to reduce the amount of code we all have to understand at one time. Differential Revision: https://reviews.llvm.org/D107381 show more ...
# 223835f0	03-Aug-2021	Philip Reames <listmail@philipreames.com>	[runtimeunroll] A bit of style cleanup to simplify a following change [NFC] Use for-range, use the idiomatic pattern for non-loop values, etc..
123 4 5 6 7