TargetLoweringBase.cpp - OpenGrok history log for /llvm-project/llvm/lib/CodeGen/TargetLoweringBase.cpp

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
# d0d05aec	11-Jul-2024	Florian Hahn <flo@fhahn.com>	[Darwin] Fix availability of exp10 for watchOS, tvOS, xROS. (#98542) Update availability information added in 1eb7f055d9a. exp10 is available on iOS >= 7.0 and macOS >= 10.9. On all other platforms [Darwin] Fix availability of exp10 for watchOS, tvOS, xROS. (#98542) Update availability information added in 1eb7f055d9a. exp10 is available on iOS >= 7.0 and macOS >= 10.9. On all other platforms, it is available on any version. Also drop the x86 check, as the availability only depends on the OS version, not the target platform. PR: https://github.com/llvm/llvm-project/pull/98542 show more ...
# 0b58f34c	11-Jul-2024	Farzon Lotfi <1802579+farzonl@users.noreply.github.com>	[X86][CodeGen] Add base trig intrinsic lowerings (#96222) This change is an implementation of https://github.com/llvm/llvm-project/issues/87367's investigation on supporting IEEE math operations a [X86][CodeGen] Add base trig intrinsic lowerings (#96222) This change is an implementation of https://github.com/llvm/llvm-project/issues/87367's investigation on supporting IEEE math operations as intrinsics. Which was discussed in this RFC: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 This change adds constraint intrinsics and some lowering cases for `acos`, `asin`, `atan`, `cosh`, `sinh`, and `tanh`. The only x86 specific change was for f80. https://github.com/llvm/llvm-project/issues/70079 https://github.com/llvm/llvm-project/issues/70080 https://github.com/llvm/llvm-project/issues/70081 https://github.com/llvm/llvm-project/issues/70083 https://github.com/llvm/llvm-project/issues/70084 https://github.com/llvm/llvm-project/issues/95966 The x86 lowering is going to be done in three pr changes with this being the first. A second PR will be put up for Loop Vectorizing and then SLPVectorizer. The constraint intrinsics is also going to be in multiple parts, but just 2. This part covers just the llvm specific changes, part2 will cover clang specifc changes and legalization for backends than have special legalization requirements like aarch64 and wasm. show more ...
# 3f1a7675	11-Jul-2024	Joseph Huber <huberjn@outlook.com>	[LLVM] Factor disabled Libcalls into the initializer (#98421) Summary: These Libcalls represent which functions are available to the backend. If a runtime call is not available, the target sets the [LLVM] Factor disabled Libcalls into the initializer (#98421) Summary: These Libcalls represent which functions are available to the backend. If a runtime call is not available, the target sets the the name to `nullptr`. Currently, this logic is spread around the various targets. This patch pulls all of the locations that disable libcalls into the intializer. This patch is effectively NFC. The motivation behind this patch is that currently the LTO handling uses the list of all runtime calls to determine which functions cannot be internalized and must be extracted from static libraries. We do not want this to happen for libcalls that are not emitted by the backend. A follow-up patch will move out this logic so the LTO pass can know which rtlib calls are actually used by the backend. show more ...
# 3141c11f	04-Jul-2024	Craig Topper <craig.topper@sifive.com>	[SelectionDAG] Remove LegalTypes argument from getShiftAmountTy. NFC (#97757) This argument is no longer used inside the function. Remove it from the interface.
# f4d058fd	04-Jul-2024	Craig Topper <craig.topper@sifive.com>	[SelectionDAG] Ignore LegalTypes parameter in TargetLoweringBase::getShiftAmountTy. (#97645) When this flag was false, `getShiftAmountTy` would return `PointerTy` instead of the target's preferred [SelectionDAG] Ignore LegalTypes parameter in TargetLoweringBase::getShiftAmountTy. (#97645) When this flag was false, `getShiftAmountTy` would return `PointerTy` instead of the target's preferred shift amount type for scalar shifts. This used to be needed when the target's preferred type wasn't large enough to support the shift amount needed for an illegal type. For example, any scalar type larger than i256 on X86 since X86's preferred shift amount type is i8. For a while now, we've had code that uses `MVT::i32` if `LegalTypes` is true, but the target's preferred type is too small. This fixed a repeated cause of crashes where the `LegalTypes` flag wasn't set to false when illegal types could be present. This has made it unnecessary to set the `LegalTypes` flag correctly, and as a result more and more places don't. So I think its time for this flag to go away. This first patch just disconnects the flag. The interface and all callers will be cleaned up in follow up patches. The X86 test change is because we now have the same shift type for both shifts in a (srl (sub C, (shl X, 32), 32) sequence. This makes the shift amounts appear equal in value and type which is needed to enable a combine. show more ...
# f2f18459	21-Jun-2024	Nikita Popov <npopov@redhat.com>	Revert "Intrinsic: introduce minimumnum and maximumnum (#93841)" As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 8988148003 Revert "Intrinsic: introduce minimumnum and maximumnum (#93841)" As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 89881480030f48f83af668175b70a9798edca2fb. This reverts commit 225d8fc8eb24fb797154c1ef6dcbe5ba033142da. show more ...
# 89881480	21-Jun-2024	YunQiang Su <syq@debian.org>	Intrinsic: introduce minimumnum and maximumnum (#93841) Currently, on different platform, the behaivor of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AAr Intrinsic: introduce minimumnum and maximumnum (#93841) Currently, on different platform, the behaivor of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AArch64/PowerPC: follow the IEEE754-2008's minNUM: return qNaN. RISC-V/Hexagon follow the IEEE754-2019's minimumNumber: return NUM. X86: Returns NUM but not same with IEEE754-2019's minimumNumber as +0.0 is not always greater than -0.0. MIPS/LoongArch/Generic: return NUM. LIBCALL: returns qNaN. So, let's introduce llvm.minmumnum/llvm.maximumnum, which always follow IEEE754-2019's minimumNumber/maximumNumber. Half-fix: #93033 show more ...
# 995835fe	17-Jun-2024	Poseydon42 <vvmposeydon@gmail.com>	[SelectionDAG] Add support for the 3-way comparison intrinsics [US]CMP (#91871) This PR adds initial support for the `scmp`/`ucmp` 3-way comparison intrinsics in the SelectionDAG. Some of the expan [SelectionDAG] Add support for the 3-way comparison intrinsics [US]CMP (#91871) This PR adds initial support for the `scmp`/`ucmp` 3-way comparison intrinsics in the SelectionDAG. Some of the expansions/lowerings are not optimal yet. show more ...
Revision tags: llvmorg-18.1.8
# 6355fb45	14-Jun-2024	Farzon Lotfi <1802579+farzonl@users.noreply.github.com>	[CodeGen] Support vectors across all backends (#95518) Add a default f16 type promotion
Revision tags: llvmorg-18.1.7
# 1d874335	05-Jun-2024	Farzon Lotfi <1802579+farzonl@users.noreply.github.com>	[x86] Add tan intrinsic part 4 (#90503) This change is an implementation of #87367's investigation on supporting IEEE math operations as intrinsics. Which was discussed in this RFC: https://disc [x86] Add tan intrinsic part 4 (#90503) This change is an implementation of #87367's investigation on supporting IEEE math operations as intrinsics. Which was discussed in this RFC: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 Much of this change was following how G_FSIN and G_FCOS were used. Changes: - `llvm/docs/GlobalISel/GenericOpcode.rst` - Document the `G_FTAN` opcode - `llvm/docs/LangRef.rst` - Document the tan intrinsic - `llvm/include/llvm/Analysis/VecFuncs.def` - Associate the tan intrinsic as a vector function similar to the tanf libcall. - `llvm/include/llvm/CodeGen/BasicTTIImpl.h` - Map the tan intrinsic to `ISD::FTAN` - `llvm/include/llvm/CodeGen/ISDOpcodes.h` - Define ISD opcodes for `FTAN` and `STRICT_FTAN` - `llvm/include/llvm/IR/Intrinsics.td` - Create the tan intrinsic - `llvm/include/llvm/IR/RuntimeLibcalls.def` - Define tan libcall mappings - `llvm/include/llvm/Target/GenericOpcodes.td` - Define the `G_FTAN` Opcode - `llvm/include/llvm/Support/TargetOpcodes.def` - Create a `G_FTAN` Opcode handler - `llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td` - Map `G_FTAN` to `ftan` - `llvm/include/llvm/Target/TargetSelectionDAG.td` - Define `ftan`, `strict_ftan`, and `any_ftan` and map them to the ISD opcodes for `FTAN` and `STRICT_FTAN` - `llvm/lib/Analysis/VectorUtils.cpp` - Associate the tan intrinsic as a vector intrinsic - `llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp` Map the tan intrinsic to `G_FTAN` Opcode - `llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp` - Add `G_FTAN` to the list of floating point math operations also associate `G_FTAN` with the `TAN_F` runtime lib. - `llvm/lib/CodeGen/GlobalISel/Utils.cpp` - More floating point math operation common behaviors. - llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp - List the function expansion operations for `FTAN` and `STRICT_FTAN`. Also define both opcodes in `PromoteNode`. - `llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp` - More `FTAN` and `STRICT_FTAN` handling in the legalizer - `llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h` - Define `SoftenFloatRes_FTAN` and `ExpandFloatRes_FTAN`. - `llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp` - Define `FTAN` as a legal vector operation. - `llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp` - Define `FTAN` as a legal vector operation. - `llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp` - define tan as an intrinsic that doesn't return NaN. - `llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp` Map `LibFunc_tan`, `LibFunc_tanf`, and `LibFunc_tanl` to `ISD::FTAN`. Map `Intrinsic::tan` to `ISD::FTAN` and add selection dag handling for `Intrinsic::tan`. - `llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp` - Define `ftan` and `strict_ftan` names for the equivalent ISD opcodes. - `llvm/lib/CodeGen/TargetLoweringBase.cpp` -Define a Tan128 libcall and ISD::FTAN as a target lowering action. - `llvm/lib/Target/X86/X86ISelLowering.cpp` - Add x86_64 lowering for tan intrinsic resolves https://github.com/llvm/llvm-project/issues/70082 show more ...
# 05e6bb40	30-May-2024	Roger Ferrer Ibáñez <rofirrim@gmail.com>	[SelectionDAG] Add an ISD::CLEAR_CACHE node to lower llvm.clear_cache (#93795) The current way of lowering `llvm.clear_cache` is a bit unusual. As suggested by Matt Arsenault we are better off usin [SelectionDAG] Add an ISD::CLEAR_CACHE node to lower llvm.clear_cache (#93795) The current way of lowering `llvm.clear_cache` is a bit unusual. As suggested by Matt Arsenault we are better off using an ISD node. This change introduces a new `ISD::CLEAR_CACHE`, registers a new libcall by default named `__clear_cache` and the default legalisation is a libcall. This is preparatory work for a custom lowering of `ISD::CLEAR_CACHE` needed by RISC-V on some platforms. show more ...
# 8aceb7a5	29-May-2024	Craig Topper <craig.topper@sifive.com>	[ValueTypes] Remove MVT::MAX_ALLOWED_VALUETYPE. NFC (#93654) Despite the comment, this isn't used to size bit vectors or tables. That's done by VALUETYPE_SIZE. MAX_ALLOWED_VALUETYPE is only used by [ValueTypes] Remove MVT::MAX_ALLOWED_VALUETYPE. NFC (#93654) Despite the comment, this isn't used to size bit vectors or tables. That's done by VALUETYPE_SIZE. MAX_ALLOWED_VALUETYPE is only used by some static_asserts that compare it to VALUETYPE_SIZE. This patch removes it and most of the static_asserts. I left one where I compared VALUETYPE_SIZE to token which is the first type that isn't part of the VALUETYPE range. This isn't strictly needed, we'd probably catch duplication error from VTEmitter.cpp first. show more ...
# 1eb7f055	20-May-2024	Matt Arsenault <Matthew.Arsenault@amd.com>	CodeGen: Fix libcall names for exp10 on the various darwins (#92520) It's really great that we have the same information duplicated in TargetLibraryInfo and RuntimeLibcalls which both assume everyt CodeGen: Fix libcall names for exp10 on the various darwins (#92520) It's really great that we have the same information duplicated in TargetLibraryInfo and RuntimeLibcalls which both assume everything by default. Should fix issue reported after #92287 show more ...
Revision tags: llvmorg-18.1.6
# b52fa946	09-May-2024	David Sherwood <57997763+david-arm@users.noreply.github.com>	[Analysis] Add cost model for experimental.cttz.elts intrinsic (#90720) In PR #88385 I've added support for auto-vectorisation of some early exit loops, which requires using the experimental.cttz.e [Analysis] Add cost model for experimental.cttz.elts intrinsic (#90720) In PR #88385 I've added support for auto-vectorisation of some early exit loops, which requires using the experimental.cttz.elts to calculate final indices in the early exit block. We need a more accurate cost model for this intrinsic to better reflect the cost of work required in the early exit block. I've tried to accurately represent the expansion code for the intrinsic when the target does not have efficient lowering for it. It's quite tricky to model because you need to first figure out what types will actually be used in the expansion. The type used can have a significant effect on the cost if you end up using illegal vector types. Tests added here: Analysis/CostModel/AArch64/cttz_elts.ll Analysis/CostModel/RISCV/cttz_elts.ll show more ...
# 026a29e8	07-May-2024	Kazu Hirata <kazu@google.com>	[Analysis, CodeGen, DebugInfo] Use StringRef::operator== instead of StringRef::equals (NFC) (#91304) I'm planning to remove StringRef::equals in favor of StringRef::operator==. - StringRef::oper [Analysis, CodeGen, DebugInfo] Use StringRef::operator== instead of StringRef::equals (NFC) (#91304) I'm planning to remove StringRef::equals in favor of StringRef::operator==. - StringRef::operator==/!= outnumber StringRef::equals by a factor of 53 under llvm/ in terms of their usage. - The elimination of StringRef::equals brings StringRef closer to std::string_view, which has operator== but not equals. - S == "foo" is more readable than S.equals("foo"), especially for !Long.Expression.equals("str") vs Long.Expression != "str". show more ...
Revision tags: llvmorg-18.1.5, llvmorg-18.1.4
# 91dd844a	16-Apr-2024	Brandon Wu <brandon.wu@sifive.com>	Recommit [RISCV] RISCV vector calling convention (2/2) (#79096) (#87736) Bug fix: Handle RVV return type in calling convention correctly. Return values are handled in a same way as function argumen Recommit [RISCV] RISCV vector calling convention (2/2) (#79096) (#87736) Bug fix: Handle RVV return type in calling convention correctly. Return values are handled in a same way as function arguments. One thing to mention is that if a type can be broken down into homogeneous vector types, e.g. {<vscale x 4 x i32>, {<vscale x 4 x i32>, <vscale x 4 x i32>}}, it is considered as a vector tuple type and need to be handled by tuple type rule. show more ...
Revision tags: llvmorg-18.1.3
# 94b5c118	28-Mar-2024	Jonas Paulsson <paulson1@linux.ibm.com>	[ISel] Move handling of atomic loads from SystemZ to DAGCombiner (NFC). (#86484) The folding of sign/zero extensions into an atomic load by specifying an extension type is not target specific, and [ISel] Move handling of atomic loads from SystemZ to DAGCombiner (NFC). (#86484) The folding of sign/zero extensions into an atomic load by specifying an extension type is not target specific, and therefore belongs in the DAGCombiner rather than in the SystemZ backend. - Handle atomic loads similarly to regular loads by adding AtomicLoadExtActions with set/get methods. - Move SystemZ extendAtomicLoad() to DagCombiner.cpp. show more ...
# 26464f26	27-Mar-2024	Justin Cady <desk@justincady.com>	[FreeBSD] Mark __stack_chk_guard dso_local except for PPC64 (#86665) Adjust logic of 1cb9f37a17ab to match freebsd/freebsd-src@9a4d48a645a7a. D113443 is the original attempt to bring this FreeBSD p [FreeBSD] Mark __stack_chk_guard dso_local except for PPC64 (#86665) Adjust logic of 1cb9f37a17ab to match freebsd/freebsd-src@9a4d48a645a7a. D113443 is the original attempt to bring this FreeBSD patch to llvm-project, but it never landed. This change is required to build FreeBSD kernel modules with -fstack-protector using a standard LLVM toolchain. The FreeBSD kernel loader does not handle R_X86_64_REX_GOTPCRELX relocations. Fixes #50932. show more ...
# cd768ec9	20-Mar-2024	Graham Hunter <graham.hunter@arm.com>	[AArch64] Support scalable offsets with isLegalAddressingMode (#83255) Allows us to indicate that an addressing mode featuring a vscale-relative immediate offset is supported.
Revision tags: llvmorg-18.1.2
# 63a5dc4a	11-Mar-2024	Jay Foad <jay.foad@amd.com>	[CodeGen] Do not pass MF into MachineRegisterInfo methods. NFC. (#84770) MachineRegisterInfo already knows the MF so there is no need to pass it in as an argument.
Revision tags: llvmorg-18.1.1
# 8300f30a	04-Mar-2024	Shilei Tian <i@tianshilei.me>	[SelectionDAG] Add `STRICT_BF16_TO_FP` and `STRICT_FP_TO_BF16` (#80056) This patch adds the support for `STRICT_BF16_TO_FP` and `STRICT_FP_TO_BF16`.
# 2c5d01c2	04-Mar-2024	Shilei Tian <i@tianshilei.me>	Revert "[SelectionDAG] Add `STRICT_BF16_TO_FP` and `STRICT_FP_TO_BF16` (#80056)" This reverts commit b0c158bd947c360a4652eb0de3a4794f46deb88b. The changes in `compiler-rt` broke tests.
# b0c158bd	04-Mar-2024	Shilei Tian <i@tianshilei.me>	[SelectionDAG] Add `STRICT_BF16_TO_FP` and `STRICT_FP_TO_BF16` (#80056) This patch adds the support for `STRICT_BF16_TO_FP` and `STRICT_FP_TO_BF16`.
Revision tags: llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3
# 21630efb	14-Feb-2024	Pranav Kant <prka@google.com>	[X86][CodeGen] Restrict F128 lowering to GNU environment (#81664) Otherwise it breaks some environment like X64 Android that doesn't have f128 functions available in its libc. Followup to #79611.
# 11fcae69	13-Feb-2024	Joseph Huber <huberjn@outlook.com>	[LLVM] Add `__builtin_readsteadycounter` intrinsic and builtin for realtime clocks (#81331) Summary: This patch adds a new intrinsic and builtin function mirroring the existing `__builtin_readcyclec [LLVM] Add `__builtin_readsteadycounter` intrinsic and builtin for realtime clocks (#81331) Summary: This patch adds a new intrinsic and builtin function mirroring the existing `__builtin_readcyclecounter`. The difference is that this implementation targets a separate counter that some targets have which returns a fixed frequency clock that can be used to determine elapsed time, this is different compared to the cycle counter which often has variable frequency. This patch only adds support for the NVPTX and AMDGPU targets. This is done as a new and separate builtin rather than an argument to `readcyclecounter` to avoid needing to change existing code and to make the separation more explicit. show more ...
123 4 5 6 7 8 9 10 >>...20