LTOBackend.cpp - OpenGrok history log for /llvm-project/llvm/lib/LTO/LTOBackend.cpp

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7
# 799955eb	11-Jan-2025	Teresa Johnson <tejohnson@google.com>	[ThinLTO] Skip opt pipeline and summary wrapper pass on empty modules (#120143) Follow up to PR118508, to avoid unnecessary compile time for an empty combind regular LTO module if all modules end u [ThinLTO] Skip opt pipeline and summary wrapper pass on empty modules (#120143) Follow up to PR118508, to avoid unnecessary compile time for an empty combind regular LTO module if all modules end up being ThinLTO only. This required minor changes to a few tests to ensure they weren't empty. show more ...
Revision tags: llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4
# 5c8c90d8	17-Nov-2024	Rahul Joshi <rjoshi@nvidia.com>	[NFC][LLVM] Fix a couple of build warnings (#116490) - Fix `HANDLE_EXTENSION` macro redefinition warning in LTOBackend.cpp - Fix "unnecessary brackets" around rf/df variable definitions warning.
Revision tags: llvmorg-19.1.3, llvmorg-19.1.2
# dc85d526	09-Oct-2024	Kyungwoo Lee <kyulee@meta.com>	[CGData][ThinLTO] Global Outlining with Two-CodeGen Rounds (#90933) This feature is enabled by `-codegen-data-thinlto-two-rounds`, which effectively runs the `-codegen-data-generate` and `-codegen- [CGData][ThinLTO] Global Outlining with Two-CodeGen Rounds (#90933) This feature is enabled by `-codegen-data-thinlto-two-rounds`, which effectively runs the `-codegen-data-generate` and `-codegen-data-use` in two rounds to enable global outlining with ThinLTO. 1. The first round: Run both optimization + codegen with a scratch output. Before running codegen, we serialize the optimized bitcode modules to a temporary path. 2. From the scratch object files, we merge them into the codegen data. 3. The second round: Read the optimized bitcode modules and start the codegen only this time. Using the codegen data, the machine outliner effectively performs the global outlining. Depends on #90934, #110461 and #110463. This is a patch for https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753. show more ...
# c1959813	03-Oct-2024	Kyungwoo Lee <kyulee@meta.com>	[CGData][ThinLTO][NFC] Prep for two-codegen rounds (#90934) This is NFC for https://github.com/llvm/llvm-project/pull/90933. - Create a lambda function, `RunBackends`, to group the backend opera [CGData][ThinLTO][NFC] Prep for two-codegen rounds (#90934) This is NFC for https://github.com/llvm/llvm-project/pull/90933. - Create a lambda function, `RunBackends`, to group the backend operations into a single function. - Explicitly pass the `CodeGenOnly` argument to thinBackend, instead of depending on a configuration value. Depends on https://github.com/llvm/llvm-project/pull/90304. This is a patch for https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753. show more ...
Revision tags: llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4
# 35639079	23-Aug-2024	Kazu Hirata <kazu@google.com>	[LTO] Turn ImportMapTy into a proper class (NFC) (#105748) This patch turns type alias ImportMapTy into a proper class to provide a more intuitive interface like: ImportList.addDefinition(...) [LTO] Turn ImportMapTy into a proper class (NFC) (#105748) This patch turns type alias ImportMapTy into a proper class to provide a more intuitive interface like: ImportList.addDefinition(...) as opposed to: FunctionImporter::addDefinition(ImportList, ...) Also, this patch requires all non-const accesses to go through addDefinition, maybeAddDeclaration, and addGUID while providing const accesses via: const ImportMapTyImpl &getImportMap() const { return ImportMap; } I realize ImportMapTy may not be the best name as a class (maybe OK as a type alias). I am not renaming ImportMapTy in this patch at least because there are 47 mentions of ImportMapTy under llvm/. show more ...
# 3082a381	22-Aug-2024	Kazu Hirata <kazu@google.com>	[LTO] Introduce helper functions to add GUIDs to ImportList (NFC) (#105555) The new helper functions make the intent clearer while hiding implementation details, including how we handle previously [LTO] Introduce helper functions to add GUIDs to ImportList (NFC) (#105555) The new helper functions make the intent clearer while hiding implementation details, including how we handle previously added entries. Note that: - If we are adding a GUID as a GlobalValueSummary::Definition, then we override a previously added GlobalValueSummary::Declaration entry for the same GUID. - If we are adding a GUID as a GlobalValueSummary::Declaration, then a previously added GlobalValueSummary::Definition entry for the same GUID takes precedence, and no change is made. show more ...
Revision tags: llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2
# 26e455ba	29-Jul-2024	macurtis-amd <macurtis@amd.com>	[lld][LTO] Teach LTO to print pipeline passes (#101018) I found this useful while debugging code generation differences between old and new offloading drivers. No functional change (intended).
Revision tags: llvmorg-19.1.0-rc1, llvmorg-20-init
# d29fdfbc	26-Jun-2024	Joel E. Denny <jdenny.ornl@gmail.com>	[LTO] Avoid assert fail on failed pass plugin load (#96691) Without this patch, passing -load-pass-plugin=nonexistent.so to llvm-lto2 produces a backtrace because LTOBackend.cpp does not handle th [LTO] Avoid assert fail on failed pass plugin load (#96691) Without this patch, passing -load-pass-plugin=nonexistent.so to llvm-lto2 produces a backtrace because LTOBackend.cpp does not handle the error correctly: ``` Failed to load passes from 'nonexistant.so'. Request ignored. Expected<T> must be checked before access or destruction. Unchecked Expected<T> contained error: Could not load library 'nonexistant.so': nonexistant.so: cannot open shared object file: No such file or directoryPLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace. ``` Any tool using `lto::Config::PassPlugins` should suffer similarly. Based on the message "Request ignored" and the continue statement, the intention was apparently to continue on failure to load a plugin. However, no one appears to rely on that behavior now given that it crashes instead, and terminating is consistent with opt. show more ...
# 8d9db947	20-Jun-2024	Mingming Liu <mingmingl@google.com>	Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#95482) Make `FunctionsToImportTy` an `unordered_map` rather than `DenseMap`. Cr Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#95482) Make `FunctionsToImportTy` an `unordered_map` rather than `DenseMap`. Credit goes to jvoung@ for the 'DenseMap -> unordered_map' change. This is a reland of https://github.com/llvm/llvm-project/pull/92718 * `DenseMap` allocates space for a large number of key/value pairs and wastes space when the number of elements are small. * While init bucket size is zero [1], it quickly allocates buckets for 64 elements [2] when the number of elements is small (for example, 3 or 4 elements). The programmer manual [3] also mentions it could waste space. * Experiments show `FunctionsToImportTy.size()` is smaller than 4 for multiple binaries with high indexing ram usage. `unordered_map` grows factor is at most 2 in llvm libc [4] for insert operations. With this change, `ComputeCrossModuleImport` ram increase is smaller than 0.5G on a couple of binaries with high indexing ram usage. A wider range of (pre-release) tests pass. [1] https://github.com/llvm/llvm-project/blob/ad79a14c9e5ec4a369eed4adf567c22cc029863f/llvm/include/llvm/ADT/DenseMap.h#L431-L432 [2] https://github.com/llvm/llvm-project/blob/ad79a14c9e5ec4a369eed4adf567c22cc029863f/llvm/include/llvm/ADT/DenseMap.h#L849 [3] https://llvm.org/docs/ProgrammersManual.html#llvm-adt-densemap-h [4] https://github.com/llvm/llvm-project/blob/ad79a14c9e5ec4a369eed4adf567c22cc029863f/libcxx/include/__hash_table#L1525-L1526 Original commit message The goal is to populate `declaration` import status if a new flag `-import-declaration` is on. * For in-process ThinLTO, the `declaration` status is visible to backend `function-import` pass, so `FunctionImporter::importFunctions` should read the import status and be no-op for declaration summaries. Basically, the postlink pipeline is updated to keep its current behavior (import definitions), but not updated to handle `declaration` summaries. Two use cases ([better call-graph sort](https://discourse.llvm.org/t/rfc-for-better-call-graph-sort-build-a-more-complete-call-graph-by-adding-more-indirect-call-edges/74029#support-cross-module-function-declaration-import-5) or [cross-module auto-init](https://github.com/llvm/llvm-project/pull/87597#discussion_r1556067195)) would use this bit differently. * For distributed ThinLTO, the `declaration` status is not serialized to bitcode. As discussed, https://github.com/llvm/llvm-project/pull/87600 will do this. show more ...
Revision tags: llvmorg-18.1.8
# d3342e5b	12-Jun-2024	Abhina Sree <69635948+abhina-sree@users.noreply.github.com>	[SystemZ][z/OS] Continue marking text files with OF_Text (#95111) Text files should be opened with OF_Text to have the correct encoding.
Revision tags: llvmorg-18.1.7
# 707f4de4	05-Jun-2024	Mingming Liu <mingmingl@google.com>	Revert "Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92718) (#94503) This reverts commit e33db249b53fb70dce62db3ebd82d42239bd Revert "Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92718) (#94503) This reverts commit e33db249b53fb70dce62db3ebd82d42239bd1d9d. The change from set to map increases memory usage, and caused indexing OOM in some applications. Need to profile offline to bring the memory usage down. show more ...
# e33db249	20-May-2024	Mingming Liu <mingmingl@google.com>	Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92718) The original PR is reviewed in https://github.com/llvm/llvm-project/pull Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92718) The original PR is reviewed in https://github.com/llvm/llvm-project/pull/88024, and this PR adds one line (https://github.com/llvm/llvm-project/pull/92718/commits/b9f04d199dec4f3c221d981dcb91e55298d0693f) to fix test Limit to one thread for in-process ThinLTO to test `LLVM_DEBUG` log. - This should fix build bot failure like https://lab.llvm.org/buildbot/#/builders/259/builds/4727 and https://lab.llvm.org/buildbot/#/builders/9/builds/43876 - I could repro the failure and see interleaved log messages by using `-thinlto-threads=all` Original Commit Message: The goal is to populate `declaration` import status if a new flag `-import-declaration` is on. * For in-process ThinLTO, the `declaration` status is visible to backend `function-import` pass, so `FunctionImporter::importFunctions` should read the import status and be no-op for declaration summaries. Basically, the postlink pipeline is updated to keep its current behavior (import definitions), but not updated to handle `declaration` summaries. Two use cases ([better call-graph sort](https://discourse.llvm.org/t/rfc-for-better-call-graph-sort-build-a-more-complete-call-graph-by-adding-more-indirect-call-edges/74029#support-cross-module-function-declaration-import-5) or [cross-module auto-init](https://github.com/llvm/llvm-project/pull/87597#discussion_r1556067195)) would use this bit differently. * For distributed ThinLTO, the `declaration` status is not serialized to bitcode. As discussed, https://github.com/llvm/llvm-project/pull/87600 will do this. show more ...
# 32ae9a28	20-May-2024	Kazu Hirata <kazu@google.com>	[llvm] Use SmallString::str (NFC) (#92712)
# 6b0733e3	20-May-2024	Mingming Liu <mingmingl@google.com>	Revert "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92715) Reverts llvm/llvm-project#88024 Build bot failures (https://lab.llvm. Revert "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92715) Reverts llvm/llvm-project#88024 Build bot failures (https://lab.llvm.org/buildbot/#/builders/259/builds/4727 and https://lab.llvm.org/buildbot/#/builders/9/builds/43876) show more ...
# 8de78905	20-May-2024	Mingming Liu <mingmingl@google.com>	[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option (#88024) The goal is to populate `declaration` import status if a new flag`-import-declarat [ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option (#88024) The goal is to populate `declaration` import status if a new flag`-import-declaration` is on. * For in-process ThinLTO, the `declaration` status is visible to backend `function-import` pass, so `FunctionImporter::importFunctions` should read the import status and be no-op for declaration summaries. Basically, the postlink pipeline is updated to keep its current behavior (import definitions), but not updated to handle `declaration` summaries. Two use cases (better call-graph sort and cross-module auto-init) would use this bit differently. * For distributed ThinLTO, the `declaration` status is not serialized to bitcode. As discussed, https://github.com/llvm/llvm-project/pull/87600 will do this. [1] https://discourse.llvm.org/t/rfc-for-better-call-graph-sort-build-a-more-complete-call-graph-by-adding-more-indirect-call-edges/74029#support-cross-module-function-declaration-import-5 [2] https://github.com/llvm/llvm-project/pull/87597#discussion_r1556067195 show more ...
Revision tags: llvmorg-18.1.6, llvmorg-18.1.5
# e86ebe4f	22-Apr-2024	Pierre van Houtryve <pierre.vanhoutryve@amd.com>	[LTO] Allow target-specific module splittting (#83128) Allow targets to implement custom module splitting logic for --lto-partitions, see #89245 https://discourse.llvm.org/t/rfc-lto-target-speci [LTO] Allow target-specific module splittting (#83128) Allow targets to implement custom module splitting logic for --lto-partitions, see #89245 https://discourse.llvm.org/t/rfc-lto-target-specific-module-splittting/77252 show more ...
Revision tags: llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1
# 716042a6	06-Mar-2024	Mehdi Amini <joker.eph@gmail.com>	Rename llvm::ThreadPool -> llvm::DefaultThreadPool (NFC) (#83702) The base class llvm::ThreadPoolInterface will be renamed llvm::ThreadPool in a subsequent commit. This is a breaking change: cli Rename llvm::ThreadPool -> llvm::DefaultThreadPool (NFC) (#83702) The base class llvm::ThreadPoolInterface will be renamed llvm::ThreadPool in a subsequent commit. This is a breaking change: clients who use to create a ThreadPool must now create a DefaultThreadPool instead. show more ...
Revision tags: llvmorg-18.1.0, llvmorg-18.1.0-rc4
# ec24094b	22-Feb-2024	Igor Kudrin <ikudrin@accesssoftek.com>	[LTO] Remove Config.UseDefaultPipeline (#82587) This option is not used. It was added in [D122133](https://reviews.llvm.org/D122133), 5856f30b, with the only usage in `ClangLinkerWrapper.cpp`, whi [LTO] Remove Config.UseDefaultPipeline (#82587) This option is not used. It was added in [D122133](https://reviews.llvm.org/D122133), 5856f30b, with the only usage in `ClangLinkerWrapper.cpp`, which was later updated in a1d57fc2, and then finally removed in [D142650](https://reviews.llvm.org/D142650), 6185246f. show more ...
Revision tags: llvmorg-18.1.0-rc3
# 93cdd1b5	12-Feb-2024	Arthur Eubanks <aeubanks@google.com>	[PGO] Add ability to mark cold functions as optsize/minsize/optnone (#69030) The performance of cold functions shouldn't matter too much, so if we care about binary sizes, add an option to mark col [PGO] Add ability to mark cold functions as optsize/minsize/optnone (#69030) The performance of cold functions shouldn't matter too much, so if we care about binary sizes, add an option to mark cold functions as optsize/minsize for binary size, or optnone for compile times [1]. Clang patch will be in a future patch. This is intended to replace `shouldOptimizeForSize(Function&, ...)`. We've seen multiple cases where calls to this expensive function, if not careful, can blow up compile times. I will clean up users of that function in a followup patch. Initial version: https://reviews.llvm.org/D149800 [1] https://discourse.llvm.org/t/rfc-new-feature-proposal-de-optimizing-cold-functions-using-pgo-info/56388 show more ...
Revision tags: llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init, llvmorg-17.0.6, llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2
# 64573da4	19-Sep-2023	Arthur Eubanks <aeubanks@google.com>	[IR] Add "Large Data Threshold" module metadata (#66797) This allows us to not have to pass -mllvm flags to set the large data threshold for (in-LLD/not-distributed) ThinLTO. Follows https://rev [IR] Add "Large Data Threshold" module metadata (#66797) This allows us to not have to pass -mllvm flags to set the large data threshold for (in-LLD/not-distributed) ThinLTO. Follows https://reviews.llvm.org/D52322, which did the same for the code model. Since the large data threshold is tied to the code model and we disallow mixing different code models, do the same for the large data threshold. show more ...
Revision tags: llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init
# 546ec641	11-Jul-2023	Teresa Johnson <tejohnson@google.com>	Restore "[MemProf] Use new option/pass for profile feedback and matching" This restores commit b4a82b62258c5f650a1cccf5b179933e6bae4867, reverted in 3ab7ef28eebf9019eb3d3c4efd7ebfd160106bb1 because Restore "[MemProf] Use new option/pass for profile feedback and matching" This restores commit b4a82b62258c5f650a1cccf5b179933e6bae4867, reverted in 3ab7ef28eebf9019eb3d3c4efd7ebfd160106bb1 because it was thought to cause a bot failure, which ended up being unrelated to this patch set. Differential Revision: https://reviews.llvm.org/D154856 show more ...
# 3ab7ef28	11-Jul-2023	JP Lehr <JanPatrick.Lehr@amd.com>	Revert "[MemProf] Use new option/pass for profile feedback and matching" This reverts commit b4a82b62258c5f650a1cccf5b179933e6bae4867. Broke AMDGPU OpenMP Offload buildbot
# b4a82b62	10-Jul-2023	Teresa Johnson <tejohnson@google.com>	[MemProf] Use new option/pass for profile feedback and matching Previously the MemProf profile was expected to be in the same profile file as a normal PGO profile, passed via the usual -fprofile-use [MemProf] Use new option/pass for profile feedback and matching Previously the MemProf profile was expected to be in the same profile file as a normal PGO profile, passed via the usual -fprofile-use= option, and was matched in the same pass. To simplify profile preparation, since the raw MemProf profile requires the binary for symbolization and may be simpler to index separately from the raw PGO profile, and also to enable providing a MemProf profile for a SamplePGO build, separate out the MemProf feedback option and matching pass. This patch adds the -fmemory-profile-use=${file} option, and the provided file is passed down to LLVM and ultimately used in a new MemProfUsePass which performs the matching of just the memory profile contents of that file. Note that a single profile file containing both normal PGO and MemProf profile data is still supported, and the relevant profile data is matched by the appropriate matching pass(es) based on which option(s) the profile is provided with (the same profile file can be supplied to both feedback options). Differential Revision: https://reviews.llvm.org/D154856 show more ...
# a1ca3af3	05-Jul-2023	Matthew Voss <matthew.voss@sony.com>	[llvm] A Unified LTO Bitcode Frontend Here's a high level summary of the changes in this patch. For more information on rational, see the RFC. (https://discourse.llvm.org/t/rfc-a-unified-lto-bitcode [llvm] A Unified LTO Bitcode Frontend Here's a high level summary of the changes in this patch. For more information on rational, see the RFC. (https://discourse.llvm.org/t/rfc-a-unified-lto-bitcode-frontend/61774). - Add config parameter to LTO backend, specifying which LTO mode is desired when using unified LTO. - Add unified LTO flag to the summary index for efficiency. Unified LTO modules can be detected without parsing the module. - Make sure that the ModuleID is generated by incorporating more types of symbols. Differential Revision: https://reviews.llvm.org/D123803 show more ...
# 8de9f2b5	26-Jun-2023	Job Noorman <jnoorman@igalia.com>	Move SubtargetFeature.h from MC to TargetParser SubtargetFeature.h is currently part of MC while it doesn't depend on anything in MC. Since some LLVM components might have the need to work with targ Move SubtargetFeature.h from MC to TargetParser SubtargetFeature.h is currently part of MC while it doesn't depend on anything in MC. Since some LLVM components might have the need to work with target features without necessarily needing MC, it might be worthwhile to move SubtargetFeature.h to a different location. This will reduce the dependencies of said components. Note that I choose TargetParser as the destination because that's where Triple lives and SubtargetFeatures feels related to that. This issues came up during a JITLink review (D149522). JITLink would like to avoid a dependency on MC while still needing to store target features. Reviewed By: MaskRay, arsenm Differential Revision: https://reviews.llvm.org/D150549 show more ...
12 3 4 5 6 7 8