float-intrinsics.ll - OpenGrok history log for /llvm-project/llvm/test/CodeGen/RISCV/float-intrinsics.ll

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5
# 1bc9de24	27-Nov-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Add test cases for llvm.tan/asin/acos/atan/atan2/sinh/cosh/tanh. NFC
# dae9cf38	27-Nov-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Move scalar llvm.exp10 tests into half/float/double-intrinsics.ll. NFC Improves coverage for more configurations.
# ed6749a4	25-Nov-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Promote frexp with Zfh. The default expansion tries to create an illegal integer type after legalization.
# 20bd029a	25-Nov-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Promote fldexp with Zfh. (#117396) The default expansion tries to create i16 operations after type legalization. Fixes #117349
Revision tags: llvmorg-19.1.4
# 9122c523	15-Nov-2024	Pengcheng Wang <wangpengcheng.pp@bytedance.com>	[RISCV] Enable bidirectional scheduling and tracking register pressure (#115445) This is based on other targets like PPC/AArch64 and some experiments. This PR will only enable bidirectional schedu [RISCV] Enable bidirectional scheduling and tracking register pressure (#115445) This is based on other targets like PPC/AArch64 and some experiments. This PR will only enable bidirectional scheduling and tracking register pressure. Disclaimer: I haven't tested it on many cores, maybe we should make some options being features. I believe downstreams must have tried this before, so feedbacks are welcome. show more ...
# 97982a8c	05-Nov-2024	dlav-sc <daniil.avdeev@syntacore.com>	[RISCV][CFI] add function epilogue cfi information (#110810) This patch adds CFI instructions in the function epilogue. Before patch: addi sp, s0, -32 ld ra, 24(sp) # 8-byte Folded Reload ld s [RISCV][CFI] add function epilogue cfi information (#110810) This patch adds CFI instructions in the function epilogue. Before patch: addi sp, s0, -32 ld ra, 24(sp) # 8-byte Folded Reload ld s0, 16(sp) # 8-byte Folded Reload ld s1, 8(sp) # 8-byte Folded Reload addi sp, sp, 32 ret After patch: addi sp, s0, -32 .cfi_def_cfa sp, 32 ld ra, 24(sp) # 8-byte Folded Reload ld s0, 16(sp) # 8-byte Folded Reload ld s1, 8(sp) # 8-byte Folded Reload .cfi_restore ra .cfi_restore s0 .cfi_restore s1 addi sp, sp, 32 .cfi_def_cfa_offset 0 ret This functionality is already present in `riscv-gcc`, but it’s not in `clang` and this slightly impairs the `lldb` debugging experience, e.g. backtrace. show more ...
Revision tags: llvmorg-19.1.3, llvmorg-19.1.2
# bc91f3cd	02-Oct-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Add 32 bit GPR sub-register for Zfinx. (#108336) This patches adds a 32 bit register class for use with Zfinx instructions. This makes them more similar to F instructions and allows us to on [RISCV] Add 32 bit GPR sub-register for Zfinx. (#108336) This patches adds a 32 bit register class for use with Zfinx instructions. This makes them more similar to F instructions and allows us to only spill 32 bits. I've added CodeGenOnly instructions for load/store using GPRF32 as that gave better results than insert_subreg/extract_subreg. Function arguments use this new GPRF32 register class for f32 arguments with Zfinx. Eliminating the need to use RISCVISD::FMV* nodes. This is similar to #107446 which adds a 16 bit register class. show more ...
Revision tags: llvmorg-19.1.1, llvmorg-19.1.0
# 49660e55	06-Sep-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Pass f32/f64 directly without a bitcast for Zfinx/Zdinx. (#107464) With Zfinx/Zdinx, f32/f64 are legal types for a GPR, we don't need a bitcast. This avoids turning fneg/fabs into bitwis [RISCV] Pass f32/f64 directly without a bitcast for Zfinx/Zdinx. (#107464) With Zfinx/Zdinx, f32/f64 are legal types for a GPR, we don't need a bitcast. This avoids turning fneg/fabs into bitwise operations purely because of these bitcasts. If the bitwise operations are faster for some reason on a Zfinx CPU, then that seems like it should be done for all fneg/fabs, not just ones near function arguments/returns. I don't have much interest in Zfinx, this just makes the code more similar to what I proposed for Zhinx in #107446. show more ...
Revision tags: llvmorg-19.1.0-rc4
# 776aef1a	01-Sep-2024	Craig Topper <craig.topper@sifive.com>	[RISCV] Correct the rounding mode for llvm.lround.i64.f32 with RV64+Zfinx. We should use RMM instead of DYN.
# 2b84fe6f	21-Aug-2024	YunQiang Su <yunqiang@isrc.iscas.ac.cn>	RISC-V: Add fminimumnum and fmaximumnum support (#104411) Since 2.2, `fmin.s/fmax.s` instructions follow the IEEE754-2019, if F extension is avaiable; and `fmin.d/fmax.d` also follow the IEEE754-20 RISC-V: Add fminimumnum and fmaximumnum support (#104411) Since 2.2, `fmin.s/fmax.s` instructions follow the IEEE754-2019, if F extension is avaiable; and `fmin.d/fmax.d` also follow the IEEE754-2019 if D extension is avaiable. So, let's mark them as Legal. show more ...
Revision tags: llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init
# f2f18459	21-Jun-2024	Nikita Popov <npopov@redhat.com>	Revert "Intrinsic: introduce minimumnum and maximumnum (#93841)" As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 8988148003 Revert "Intrinsic: introduce minimumnum and maximumnum (#93841)" As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 89881480030f48f83af668175b70a9798edca2fb. This reverts commit 225d8fc8eb24fb797154c1ef6dcbe5ba033142da. show more ...
# 89881480	21-Jun-2024	YunQiang Su <syq@debian.org>	Intrinsic: introduce minimumnum and maximumnum (#93841) Currently, on different platform, the behaivor of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AAr Intrinsic: introduce minimumnum and maximumnum (#93841) Currently, on different platform, the behaivor of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AArch64/PowerPC: follow the IEEE754-2008's minNUM: return qNaN. RISC-V/Hexagon follow the IEEE754-2019's minimumNumber: return NUM. X86: Returns NUM but not same with IEEE754-2019's minimumNumber as +0.0 is not always greater than -0.0. MIPS/LoongArch/Generic: return NUM. LIBCALL: returns qNaN. So, let's introduce llvm.minmumnum/llvm.maximumnum, which always follow IEEE754-2019's minimumNumber/maximumNumber. Half-fix: #93033 show more ...
Revision tags: llvmorg-18.1.8, llvmorg-18.1.7, llvmorg-18.1.6, llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init
# eabaee0c	07-Jan-2024	Fangrui Song <i@maskray.me>	[RISCV] Omit "@plt" in assembly output "call foo@plt" (#72467) R_RISCV_CALL/R_RISCV_CALL_PLT distinction is not necessary and R_RISCV_CALL has been deprecated. Since https://reviews.llvm.org/D132530 [RISCV] Omit "@plt" in assembly output "call foo@plt" (#72467) R_RISCV_CALL/R_RISCV_CALL_PLT distinction is not necessary and R_RISCV_CALL has been deprecated. Since https://reviews.llvm.org/D132530 `call foo` assembles to R_RISCV_CALL_PLT. The `@plt` suffix is not useful and can be removed now (matching AArch64 and PowerPC). GNU assembler assembles `call foo` to RISCV_CALL_PLT since 2022-09 (70f35d72ef04cd23771875c1661c9975044a749c). Without this patch, unconditionally changing MO_CALL to MO_PLT could create `jump .L1@plt, a0`, which is invalid in LLVM integrated assembler and GNU assembler. show more ...
Revision tags: llvmorg-17.0.6, llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0, llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init
# a09a19be	29-Jun-2023	Jianjian GUAN <jacquesguan@me.com>	[RISCV] Update computeKnownBitsForTargetNode for FPCLASS. The fclass instruction only set one of the low 10 bits. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D154040
Revision tags: llvmorg-16.0.6, llvmorg-16.0.5, llvmorg-16.0.4, llvmorg-16.0.3
# fe558efe	30-Apr-2023	Shao-Ce SUN <sunshaoce@iscas.ac.cn>	[RISCV][CodeGen] Support Zfinx codegen This patch was split from D122918 . Co-Author: @liaolucy @realqhc Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D148874
# 5b869f3e	26-Apr-2023	LiaoChunyu <chunyu@iscas.ac.cn>	[RISCV] Custom lowering of llvm.is.fpclass This patch supports FCLASS.S, FCLASS.H and FCLASS.D. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D149063
Revision tags: llvmorg-16.0.2
# d7c97e91	12-Apr-2023	Craig Topper <craig.topper@sifive.com>	[RISCV] Support llvm.lround intrinsics with i32 return type on RV64. It seems that flang uses this for "nint" and expects this i32 to work. On the C side we think lround should only work for "long" [RISCV] Support llvm.lround intrinsics with i32 return type on RV64. It seems that flang uses this for "nint" and expects this i32 to work. On the C side we think lround should only work for "long" which is i64 on rv64. It's easy for us to support i32 when we have native FP instructions. I fell back to i64 and truncated the result otherwise. The documentation for lround says it returns an unspecified value if doesn't fit in the integer type. I have no idea what flang is expecting. I really only did the libcall to avoid forking a test. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D147195 show more ...
Revision tags: llvmorg-16.0.1
# 7b0c4184	28-Mar-2023	Craig Topper <craig.topper@sifive.com>	[RISCV] Move compressible registers to the beginning of the FP allocation order. We don't have very many compressible FP instructions, just load and store. These instruction require the FP register [RISCV] Move compressible registers to the beginning of the FP allocation order. We don't have very many compressible FP instructions, just load and store. These instruction require the FP register to be f8-f15. This patch changes the FP allocation order to prioritize f10-f15 first. These are also the FP argument registers. So I allocated them in reverse order starting at f15 to avoid taking the first argument registers. This appears to match gcc allocation order. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D146488 show more ...
Revision tags: llvmorg-16.0.0, llvmorg-16.0.0-rc4, llvmorg-16.0.0-rc3, llvmorg-16.0.0-rc2
# d02b9869	31-Jan-2023	Han-Kuan Chen <hankuan.chen@sifive.com>	[RISCV] Don't use constantpool for floating-point value if the value can be easily constructed by integer sequence and a floating-point move. In addition, this commit does the following combine vfm [RISCV] Don't use constantpool for floating-point value if the value can be easily constructed by integer sequence and a floating-point move. In addition, this commit does the following combine vfmv.v.f + fmv.[dhw].x -> vmv.v.x vfmv.s.f + fmv.[dhw].x -> vmv.s.x vfmerge.vfm + fmv.[dhw].x -> vmerge.vxm Differential Revision: https://reviews.llvm.org/D142953 show more ...
Revision tags: llvmorg-16.0.0-rc1, llvmorg-17-init, llvmorg-15.0.7, llvmorg-15.0.6, llvmorg-15.0.5, llvmorg-15.0.4
# e94dc58d	26-Oct-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] Inline scalar ceil/floor/trunc/rint/round/roundeven. This avoids the call overhead as well as the the save/restore of fflags and the snan handling in the libm function. The save/restore of [RISCV] Inline scalar ceil/floor/trunc/rint/round/roundeven. This avoids the call overhead as well as the the save/restore of fflags and the snan handling in the libm function. The save/restore of fflags and snan handling are needed to be correct for -ftrapping-math. I think we can ignore them in the default environment. The inline sequence will generate an invalid exception for nan and an inexact exception if fractional bits are discarded. I've used a custom inserter to explicitly create the control flow around the float->int->float conversion. We can probably avoid the final fsgnj after the conversion for no signed zeros FMF, but I'll leave that for future work. Note the comparison constant is slightly different than glibc uses. They use 1<<53 for double, I'm using 1<<52. I believe either are valid. Numbers >= 1<<52 can't have any fractional bits. It's ok to do the float->int->float conversion on numbers between 1<<53 and 1<<52 since they will all fit in 64. We only have a problem if the double can't fit in i64 Reviewed By: reames Differential Revision: https://reviews.llvm.org/D136508 show more ...
Revision tags: llvmorg-15.0.3, working, llvmorg-15.0.2, llvmorg-15.0.1, llvmorg-15.0.0, llvmorg-15.0.0-rc3
# 47b1f836	10-Aug-2022	Alex Bradbury <asb@igalia.com>	[RISCV] Implement isUsedByReturnOnly TargetLowering hook in order to tailcall more libcalls Prior to this patch, libcalls inserted by the SelectionDAG legalizer could never be tailcalled. The eligib [RISCV] Implement isUsedByReturnOnly TargetLowering hook in order to tailcall more libcalls Prior to this patch, libcalls inserted by the SelectionDAG legalizer could never be tailcalled. The eligibility of libcalls for tail calling is is partly determined by checking TargetLowering::isInTailCallPosition and comparing the return type of the libcall and the calleer. isInTailCallPosition in turn calls TargetLowering::isUsedByReturnOnly (which always returns false if not implemented by the target). This patch provides a minimal implementation of TargetLowering::isUsedByReturnOnly - enough to support tail calling libcalls on hard float ABIs. Soft-float ABIs are left for a follow on patch. libcall-tail-calls.ll also shows missed opportunities to tail call integer libcalls, but this is due to issues outside of the isUsedByReturnOnly hook. Differential Revision: https://reviews.llvm.org/D131087 show more ...
Revision tags: llvmorg-15.0.0-rc2, llvmorg-15.0.0-rc1, llvmorg-16-init, llvmorg-14.0.6, llvmorg-14.0.5, llvmorg-14.0.4, llvmorg-14.0.3, llvmorg-14.0.2, llvmorg-14.0.1, llvmorg-14.0.0, llvmorg-14.0.0-rc4, llvmorg-14.0.0-rc3, llvmorg-14.0.0-rc2, llvmorg-14.0.0-rc1, llvmorg-15-init, llvmorg-13.0.1, llvmorg-13.0.1-rc3, llvmorg-13.0.1-rc2
# bd653f64	11-Jan-2022	Haocong.Lu <Haocong.Lu@streamcomputing.com>	[RISCV] Use shift for zero extension when Zbb and Zbp are not enabled Now AND is used for zero extension when both Zbb and Zbp are not enabled. It may be better to use shift operation if the trailin [RISCV] Use shift for zero extension when Zbb and Zbp are not enabled Now AND is used for zero extension when both Zbb and Zbp are not enabled. It may be better to use shift operation if the trailing ones mask exceeds simm12. This patch optimzes LUI+ADDI+AND to SLLI+SRLI. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D116720 show more ...
# b271184f	10-Jan-2022	Craig Topper <craig.topper@sifive.com>	[RISCV] Use FP ABI on some of the FP tests to reduce the number of CHECK lines. NFC These tests are interested in the FP instructions being used, not the conversions needed to pass the arguments/ret [RISCV] Use FP ABI on some of the FP tests to reduce the number of CHECK lines. NFC These tests are interested in the FP instructions being used, not the conversions needed to pass the arguments/returns in GPRs. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D116869 show more ...
Revision tags: llvmorg-13.0.1-rc1
# 137d3474	16-Nov-2021	Hsiangkai Wang <kai.wang@sifive.com>	[RISCV] Reverse the order of loading/storing callee-saved registers. Currently, we restore the return address register as the last restoring instruction in the epilog. The next instruction is `ret` [RISCV] Reverse the order of loading/storing callee-saved registers. Currently, we restore the return address register as the last restoring instruction in the epilog. The next instruction is `ret` usually. It is a use of return address register. In some microarchitectures, there is load-to-use data hazard. To avoid the load-to-use data hazard, we could separate the load instruction from its use as far as possible. In this patch, we reverse the order of restoring callee-saved registers to increase the distance of `load ra` and `ret` in the epilog. Differential Revision: https://reviews.llvm.org/D113967 show more ...
# 8909dc5e	13-Nov-2021	Craig Topper <craig.topper@sifive.com>	[RISCV] Fixed duplicate RUN line on float-intrinsics.ll. NFC We had two identical RV64I RUN lines. One should be RV32I.
12