StackArrays.cpp - OpenGrok history log for /llvm-project/flang/lib/Optimizer/Transforms/StackArrays.cpp

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7
# e3cd88a7	13-Jan-2025	Slava Zakharin <szakharin@nvidia.com>	[flang] Fixed StackArrays assertion after #121919. (#122550) `findAllocaLoopInsertionPoint()` hit assertion not being able to find the `fir.freemem` because of the `fir.convert`. I think it is bet [flang] Fixed StackArrays assertion after #121919. (#122550) `findAllocaLoopInsertionPoint()` hit assertion not being able to find the `fir.freemem` because of the `fir.convert`. I think it is better to look for `fir.freemem` same way with the look-through walk. show more ...
# 303249c4	08-Jan-2025	Tom Eccles <tom.eccles@arm.com>	[flang][StackArrays] track pointers through fir.convert (#121919) This does add a little computational complexity because now every freemem operation has to be tested for every allocation. This cou [flang][StackArrays] track pointers through fir.convert (#121919) This does add a little computational complexity because now every freemem operation has to be tested for every allocation. This could be improved with some more memoisation but I think it is easier to read this way. Let me know if you would prefer me to change this to pre-compute the normalised addresses each freemem operation is using. Weirdly, this change resulted in a verifier failure for the fir.declare in the previous test case. Maybe it was previously removed as dead code and now it isn't. Anyway I fixed that too. show more ...
# 392651a7	22-Dec-2024	Kazu Hirata <kazu@google.com>	[flang] Migrate away from PointerUnion::{is,get} (NFC) (#120880) Note that PointerUnion::{is,get} have been soft deprecated in PointerUnion.h: // FIXME: Replace the uses of is(), get() and dyn [flang] Migrate away from PointerUnion::{is,get} (NFC) (#120880) Note that PointerUnion::{is,get} have been soft deprecated in PointerUnion.h: // FIXME: Replace the uses of is(), get() and dyn_cast() with // isa<T>, cast<T> and the llvm::dyn_cast<T> I'm not touching PointerUnion::dyn_cast for now because it's a bit complicated; we could blindly migrate it to dyn_cast_if_present, but we should probably use dyn_cast when the operand is known to be non-null. show more ...
# 09dfc571	20-Dec-2024	Jacques Pienaar <jpienaar@google.com>	[mlir] Enable decoupling two kinds of greedy behavior. (#104649) The greedy rewriter is used in many different flows and it has a lot of convenience (work list management, debugging actions, tracin [mlir] Enable decoupling two kinds of greedy behavior. (#104649) The greedy rewriter is used in many different flows and it has a lot of convenience (work list management, debugging actions, tracing, etc). But it combines two kinds of greedy behavior 1) how ops are matched, 2) folding wherever it can. These are independent forms of greedy and leads to inefficiency. E.g., cases where one need to create different phases in lowering and is required to applying patterns in specific order split across different passes. Using the driver one ends up needlessly retrying folding/having multiple rounds of folding attempts, where one final run would have sufficed. Of course folks can locally avoid this behavior by just building their own, but this is also a common requested feature that folks keep on working around locally in suboptimal ways. For downstream users, there should be no behavioral change. Updating from the deprecated should just be a find and replace (e.g., `find ./ -type f -exec sed -i 's\|applyPatternsAndFoldGreedily\|applyPatternsGreedily\|g' {} \;` variety) as the API arguments hasn't changed between the two. show more ...
Revision tags: llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4, llvmorg-19.1.3, llvmorg-19.1.2
# 4b3f251b	11-Oct-2024	donald chen <chenxunyu1993@gmail.com>	[mlir] [dataflow] unify semantics of program point (#110344) The concept of a 'program point' in the original data flow framework is ambiguous. It can refer to either an operation or a block itself [mlir] [dataflow] unify semantics of program point (#110344) The concept of a 'program point' in the original data flow framework is ambiguous. It can refer to either an operation or a block itself. This representation has different interpretations in forward and backward data-flow analysis. In forward data-flow analysis, the program point of an operation represents the state after the operation, while in backward data flow analysis, it represents the state before the operation. When using forward or backward data-flow analysis, it is crucial to carefully handle this distinction to ensure correctness. This patch refactors the definition of program point, unifying the interpretation of program points in both forward and backward data-flow analysis. How to integrate this patch? For dense forward data-flow analysis and other analysis (except dense backward data-flow analysis), the program point corresponding to the original operation can be obtained by `getProgramPointAfter(op)`, and the program point corresponding to the original block can be obtained by `getProgramPointBefore(block)`. For dense backward data-flow analysis, the program point corresponding to the original operation can be obtained by `getProgramPointBefore(op)`, and the program point corresponding to the original block can be obtained by `getProgramPointAfter(block)`. NOTE: If you need to get the lattice of other data-flow analyses in dense backward data-flow analysis, you should still use the dense forward data-flow approach. For example, to get the Executable state of a block in dense backward data-flow analysis and add the dependency of the current operation, you should write: ``getOrCreateFor<Executable>(getProgramPointBefore(op), getProgramPointBefore(block))`` In case above, we use getProgramPointBefore(op) because the analysis we rely on is dense backward data-flow, and we use getProgramPointBefore(block) because the lattice we query is the result of a non-dense backward data flow computation. related dsscussion: https://discourse.llvm.org/t/rfc-unify-the-semantics-of-program-points/80671/8 corresponding PSA: https://discourse.llvm.org/t/psa-program-point-semantics-change/81479 show more ...
Revision tags: llvmorg-19.1.1, llvmorg-19.1.0
# 1e64864c	17-Sep-2024	Tom Eccles <tom.eccles@arm.com>	[flang][StackArrays] run in parallel on different functions (#108842) Since #108562, StackArrays no longer has to create function declarations at the module level to use stacksave/stackrestore LLVM [flang][StackArrays] run in parallel on different functions (#108842) Since #108562, StackArrays no longer has to create function declarations at the module level to use stacksave/stackrestore LLVM intrinsics. This will allow it to run in parallel on multiple functions at the same time. show more ...
# 5aaf384b	16-Sep-2024	Tom Eccles <tom.eccles@arm.com>	[flang][NFC] use llvm.intr.stacksave/restore instead of opaque calls (#108562) The new LLVM stack save/restore intrinsic operations are more convenient than function calls because they do not add f [flang][NFC] use llvm.intr.stacksave/restore instead of opaque calls (#108562) The new LLVM stack save/restore intrinsic operations are more convenient than function calls because they do not add function declarations to the module and therefore do not block the parallelisation of passes. Furthermore they could be much more easily marked with memory effects than function calls if that ever proved useful. This builds on top of #107879. Resolves #108016 show more ...
Revision tags: llvmorg-19.1.0-rc4
# 2051a7bc	23-Aug-2024	jeanPerier <jperier@nvidia.com>	[flang][NFC] turn fir.call is_bind_c into enum for procedure flags (#105691) First patch to fix a BIND(C) ABI issue (https://github.com/llvm/llvm-project/issues/102113). I need to keep track of BI [flang][NFC] turn fir.call is_bind_c into enum for procedure flags (#105691) First patch to fix a BIND(C) ABI issue (https://github.com/llvm/llvm-project/issues/102113). I need to keep track of BIND(C) in more locations (fir.dispatch and func.func operations), and I need to fix a few passes that are dropping the attribute on the floor. Since I expect more procedure attributes that cannot be reflected in mlir::FunctionType will be needed for ABI, optimizations, or debug info, this NFC patch adds a new enum attribute to keep track of procedure attributes in the IR. This patch is not updating lowering to lower more attributes, this will be done in a separate patch to keep the test changes low here. Adding the attribute on fir.dispatch and func.func will also be done in separate patches. show more ...
# 15e915a4	22-Aug-2024	Ivan Butygin <ivan.butygin@gmail.com>	[mlir][dataflow] Propagate errors from `visitOperation` (#105448) Base `DataFlowAnalysis::visit` returns `LogicalResult`, but wrappers's Sparse/Dense/Forward/Backward `visitOperation` doesn't. S [mlir][dataflow] Propagate errors from `visitOperation` (#105448) Base `DataFlowAnalysis::visit` returns `LogicalResult`, but wrappers's Sparse/Dense/Forward/Backward `visitOperation` doesn't. Sometimes it's needed to abort solver early if some unrecoverable condition detected inside analysis. Update `visitOperation` to return `LogicalResult` and propagate it to `solver.initializeAndRun()`. Only `visitOperation` is updated for now, it's possible to update other hooks like `visitNonControlFlowArguments`, bit it's not needed immediately and let's keep this PR small. Hijacked `UnderlyingValueAnalysis` test analysis to test it. show more ...
Revision tags: llvmorg-19.1.0-rc3
# 698b42cc	16-Aug-2024	Kareem Ergawy <kareem.ergawy@amd.com>	[flang][stack-arrays] Collect analysis results for OMP ws loops (#103590) We missed collecting the analysis results for regions terminated with `omp.yield`. This result in missing some opportunitie [flang][stack-arrays] Collect analysis results for OMP ws loops (#103590) We missed collecting the analysis results for regions terminated with `omp.yield`. This result in missing some opportunities for malloc optimizations inside omp regions. show more ...
Revision tags: llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init
# 464d321e	18-Jul-2024	Kareem Ergawy <kareem.ergawy@amd.com>	[flang][stack-arrays] Extend pass to work on declare ops and within omp regions (#98810) Extends the stack-arrays pass to support `fir.declare` ops. Before that, we did not recognize malloc-free pa [flang][stack-arrays] Extend pass to work on declare ops and within omp regions (#98810) Extends the stack-arrays pass to support `fir.declare` ops. Before that, we did not recognize malloc-free pairs for which `fir.declare` is used to declare the allocated entity. This is because the `free` op was invoked on the result of the `fir.declare` op and did not directly use the allocated memory SSA value. This also extends the pass to collect the analysis results within OpenMP regions. show more ...
# db791b27	02-Jul-2024	Ramkumar Ramachandra <ramkumar.ramachandra@codasip.com>	mlir/LogicalResult: move into llvm (#97309) This patch is part of a project to move the Presburger library into LLVM.
Revision tags: llvmorg-18.1.8
# a506279e	14-Jun-2024	Mehdi Amini <joker.eph@gmail.com>	[mlir] Do not merge blocks during canonicalization by default (#95057) This is a heavy process, and it can trigger a massive explosion in adding block arguments. While potentially reducing the code [mlir] Do not merge blocks during canonicalization by default (#95057) This is a heavy process, and it can trigger a massive explosion in adding block arguments. While potentially reducing the code size, the resulting merged blocks with arguments are hiding some of the def-use chain and can even hinder some further analyses/optimizations: a merge block does not have it's own path-sensitive context, instead the context is merged from all the predecessors. Previous behavior can be restored by passing: {test-convergence region-simplify=aggressive} to the canonicalize pass. show more ...
Revision tags: llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5
# fac349a1	28-Apr-2024	Christian Sigg <chsigg@users.noreply.github.com>	Reapply "[mlir] Mark `isa/dyn_cast/cast/...` member functions depreca… (#90406) …ted. (#89998)" (#90250) This partially reverts commit 7aedd7dc754c74a49fe84ed2640e269c25414087. This change rem Reapply "[mlir] Mark `isa/dyn_cast/cast/...` member functions depreca… (#90406) …ted. (#89998)" (#90250) This partially reverts commit 7aedd7dc754c74a49fe84ed2640e269c25414087. This change removes calls to the deprecated member functions. It does not mark the functions deprecated yet and does not disable the deprecation warning in TypeSwitch. This seems to cause problems with MSVC. show more ...
# 7aedd7dc	26-Apr-2024	dyung <douglas.yung@sony.com>	Revert "[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998)" (#90250) This reverts commit 950b7ce0b88318f9099e9a7c9817d224ebdc6337. This change is causing build failures on Revert "[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998)" (#90250) This reverts commit 950b7ce0b88318f9099e9a7c9817d224ebdc6337. This change is causing build failures on a bot https://lab.llvm.org/buildbot/#/builders/216/builds/38157 show more ...
# 950b7ce0	26-Apr-2024	Christian Sigg <chsigg@users.noreply.github.com>	[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998) See https://mlir.llvm.org/deprecation and https://discourse.llvm.org/t/preferred-casting-style-going-forward.
# 46b66dfd	26-Apr-2024	Tom Eccles <tom.eccles@arm.com>	[flang][NFC] use tablegen to create StackArrays constructor (#90038) Stack arrays needs to stay running only on func.func because it needs to know which block terminators can end the function (rath [flang][NFC] use tablegen to create StackArrays constructor (#90038) Stack arrays needs to stay running only on func.func because it needs to know which block terminators can end the function (rather than just branch between unstructured control flow). A similar concept does not exist at the more abstract level of "any top level mlir operation". For example, it currently looks for func::ReturnOp and fir::UnreachableOp as points when execution can end. If this were to be run on omp.declare_reduction, it would also need to understand omp.YieldOp (perhaps only when omp.declare_reduction is the parent). There isn't a generic concept in MLIR for this. show more ...
Revision tags: llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# c50de57f	21-Dec-2023	Kazu Hirata <kazu@google.com>	[flang] Fix a warning This patch fixes: flang/lib/Optimizer/Transforms/StackArrays.cpp:452:7: error: ignoring return value of function declared with 'nodiscard' attribute [-Werror,-Wunused-re [flang] Fix a warning This patch fixes: flang/lib/Optimizer/Transforms/StackArrays.cpp:452:7: error: ignoring return value of function declared with 'nodiscard' attribute [-Werror,-Wunused-result] show more ...
Revision tags: llvmorg-17.0.6, llvmorg-17.0.5
# e2153241	03-Nov-2023	Tom Eccles <tom.eccles@arm.com>	[flang][StackArrays] skip analysis of very large functions (#71047) The stack arrays pass uses data flow analysis to determine whether heap allocations are freed on all paths out of the function. [flang][StackArrays] skip analysis of very large functions (#71047) The stack arrays pass uses data flow analysis to determine whether heap allocations are freed on all paths out of the function. `interp_domain_em_part2` in spec2017 wrf generates over 120k operations, including almost 5k fir.if operations and over 200 fir.do_loop operations, all in the same function. The MLIR data flow analysis framework cannot provide reasonable performance for such cases because there is a combinatorial explosion in the number of control flow paths through the function, all of which must be checked to determine if the heap allocations will be freed. This patch skips the stack arrays pass for ridiculously large functions (defined as having more than 1000 fir.allocmem operations). This threshold is configurable at runtime with a command line argument. With this patch, compiling this file is more than 80% faster. show more ...
Revision tags: llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init
# b2b7efb9	21-Jul-2023	Alex Zinenko <zinenko@google.com>	[mlir] NFC: rename XDataFlowAnalysis to XForwardDataFlowAnalysis This makes naming consisnt with XBackwardDataFlowAnalysis. Reviewed By: Mogball, phisiart Differential Revision: https://reviews.ll [mlir] NFC: rename XDataFlowAnalysis to XForwardDataFlowAnalysis This makes naming consisnt with XBackwardDataFlowAnalysis. Reviewed By: Mogball, phisiart Differential Revision: https://reviews.llvm.org/D155930 show more ...
Revision tags: llvmorg-16.0.6
# 76c3c5bc	05-Jun-2023	Tom Eccles <tom.eccles@arm.com>	[flang] [stack-arrays] fix unused variable warning
Revision tags: llvmorg-16.0.5
# 53cc33b0	01-Jun-2023	Tom Eccles <tom.eccles@arm.com>	[flang] Store KindMapping by value in FirOpBuilder Previously only a constant reference was stored in the FirOpBuilder. However, a lot of code was merged using FirOpBuilder builder{rewriter, getKin [flang] Store KindMapping by value in FirOpBuilder Previously only a constant reference was stored in the FirOpBuilder. However, a lot of code was merged using FirOpBuilder builder{rewriter, getKindMapping(mod)}; This is incorrect because the KindMapping returned will go out of scope as soon as FirOpBuilder's constructor had run. This led to an infinite loop running some tests using HLFIR (because the stack space containing the kind mapping was re-used and corrupted). One solution would have just been to fix the incorrect call sites, however, as a large number of these had already made it past review, I decided to instead change FirOpBuilder to store its own copy of the KindMapping. This is not costly because nearly every time we construct a KindMapping is exclusively to construct a FirOpBuilder. To make this common pattern simpler, I added a new constructor to FirOpBuilder which calls getKindMapping(). Differential Revision: https://reviews.llvm.org/D151881 show more ...
# 775de675	01-Jun-2023	Tom Eccles <tom.eccles@arm.com>	[flang] convert stack arrays allocation to match old type The old fir.allocmem operation returned a !fir.heap<.> type. The new fir.alloca operation returns a !fir.ref<.> type. This patch inserts a f [flang] convert stack arrays allocation to match old type The old fir.allocmem operation returned a !fir.heap<.> type. The new fir.alloca operation returns a !fir.ref<.> type. This patch inserts a fir.convert so that the old type is preserved. This prevents verifier failures when types returned from fir.if statements don't match the expected type. Differential Revision: https://reviews.llvm.org/D151921 show more ...
# 408f4196	17-May-2023	Tom Eccles <tom.eccles@arm.com>	[flang] use greedy mlir driver for stack arrays pass In upstream mlir, the dialect conversion infrastructure is used for lowering from one dialect to another: the passes are of the form XToYPass. Wh [flang] use greedy mlir driver for stack arrays pass In upstream mlir, the dialect conversion infrastructure is used for lowering from one dialect to another: the passes are of the form XToYPass. Whereas, transformations within the same dialect tend to use applyPatternsAndFoldGreedily. In this case, the full complexity of applyPatternsAndFoldGreedily isn't needed so we can get away with the simpler applyOpPatternsAndFold. This change was suggested by @jeanPerier The old differential revision for this patch was https://reviews.llvm.org/D150853 Re-applying here fixing the issue which led to the patch being reverted. The issue was from erasing uses of the allocation operation while still iterating over those uses (leading to a use-after-free). I have added a regression test which catches this bug for -fsanitize=address builds, but it is hard to reliably cause a crash from the use-after-free in normal builds. Differential Revision: https://reviews.llvm.org/D151728 show more ...
# 2dfaec77	24-May-2023	Tom Eccles <tom.eccles@arm.com>	Revert "[flang] use greedy mlir driver for stack arrays pass" This reverts commit 74c2ec50f393bad8b31d0dd0bd8b2ff44d361198. This caused a regression building spec2017 with -Ofast.
12