X86FixupBWInsts.cpp - OpenGrok history log for /llvm-project/llvm/lib/Target/X86/X86FixupBWInsts.cpp

Revision (<<< Hide revision tags) (Show revision tags >>>)	Date	Author	Comments
Revision tags: llvmorg-21-init, llvmorg-19.1.7, llvmorg-19.1.6, llvmorg-19.1.5, llvmorg-19.1.4, llvmorg-19.1.3
# 6ab26eab	28-Oct-2024	Ellis Hoag <ellis.sparky.hoag@gmail.com>	Check hasOptSize() in shouldOptimizeForSize() (#112626)
Revision tags: llvmorg-19.1.2, llvmorg-19.1.1, llvmorg-19.1.0, llvmorg-19.1.0-rc4, llvmorg-19.1.0-rc3, llvmorg-19.1.0-rc2, llvmorg-19.1.0-rc1, llvmorg-20-init, llvmorg-18.1.8, llvmorg-18.1.7
# 135ddd80	22-May-2024	Craig Topper <craig.topper@sifive.com>	[X86] Remove unused MachineLoopInfo from X86FixupBWInsts. NFC (#92984)
Revision tags: llvmorg-18.1.6
# 2aaec48d	05-May-2024	Fangrui Song <i@maskray.me>	X86FixupBWInsts: Remove redundant code. NFC
Revision tags: llvmorg-18.1.5, llvmorg-18.1.4, llvmorg-18.1.3, llvmorg-18.1.2, llvmorg-18.1.1, llvmorg-18.1.0, llvmorg-18.1.0-rc4, llvmorg-18.1.0-rc3, llvmorg-18.1.0-rc2, llvmorg-18.1.0-rc1, llvmorg-19-init, llvmorg-17.0.6, llvmorg-17.0.5, llvmorg-17.0.4, llvmorg-17.0.3, llvmorg-17.0.2, llvmorg-17.0.1, llvmorg-17.0.0
# bd6e77c6	08-Sep-2023	Jay Foad <jay.foad@amd.com>	[X86] Convert X86FixupBWInsts from LivePhysRegs to LiveRegUnits. NFCI. (#65592) This gives a geomean 0.50% speed up according to https://llvm-compile-time-tracker.com/
# bd3a33c0	07-Sep-2023	Jay Foad <jay.foad@amd.com>	[X86] Simplify the API of FixupBWInstPass::getSuperRegDestIfDead. NFC. (#65581) It is easy to check the returned register for validity so there is no need to return a separate boolean flag.
Revision tags: llvmorg-17.0.0-rc4, llvmorg-17.0.0-rc3, llvmorg-17.0.0-rc2, llvmorg-17.0.0-rc1, llvmorg-18-init, llvmorg-16.0.6, llvmorg-16.0.5, llvmorg-16.0.4, llvmorg-16.0.3, llvmorg-16.0.2
# caa9d6e2	15-Apr-2023	Akshay Khadse <akshayskhadse@gmail.com>	Fix uninitialized pointer members in Target/X86 Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D148312
Revision tags: llvmorg-16.0.1, llvmorg-16.0.0, llvmorg-16.0.0-rc4
# d73da868	27-Feb-2023	Marco Elver <elver@google.com>	[X86][FixupBWInsts] Fix propagation of !pcsections metadata Use MIMetadata() to propagate both DebugLoc and !pcsections metadata. This fixes several of the non-native sized !pcsections tests in pcs [X86][FixupBWInsts] Fix propagation of !pcsections metadata Use MIMetadata() to propagate both DebugLoc and !pcsections metadata. This fixes several of the non-native sized !pcsections tests in pcsections-atomics.ll. Reviewed By: dvyukov Differential Revision: https://reviews.llvm.org/D144676 show more ...
Revision tags: llvmorg-16.0.0-rc3, llvmorg-16.0.0-rc2
# 011e4abb	29-Jan-2023	Shengchen Kan <shengchen.kan@intel.com>	[X86][MC][bugfix] Report error for mismatched modifier in inline asm and remove function getX86SubSuperRegisterOrZero ``` MCRegister getX86SubSuperRegister(MCRegister Reg, unsigned Size, [X86][MC][bugfix] Report error for mismatched modifier in inline asm and remove function getX86SubSuperRegisterOrZero ``` MCRegister getX86SubSuperRegister(MCRegister Reg, unsigned Size, bool High = false); ``` A strange behavior of the functions `getX86SubSuperRegister*` was introduced by llvm-svn:145579: The returned register may not match the parameters when a 8-bit high register is required. And llvm-svn: 175762 refined the code and dropped the comments, then we knew nothing happened there from the code :-( These two functions are only called with `Size=8` and `High=true` in two places. One is in `X86FixupBWInsts.cpp` for liveness of registers and the other is in `X86AsmPrinter.cpp` for inline asm. For the first one, we provide an alternative in this patch. For the second one, the strange behaviour caused a bug that an erorr was not reported for mismatched modifier. ``` void f() { char x; asm volatile ("mov %%ah, %h0" :"=r"(x)::"%eax", "%ebx", "%ecx", "%edx", "edi", "esi"); } ``` ``` $ gcc -S test.c error: extended registers have no high halves ``` ``` $ clang -S test.c no error ``` so we fix the bug in this patch. `getX86SubSuperRegister` is just a wrapper of `getX86SubSuperRegisterOrZero` with a `assert`. I belive we should remove the latter. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D142834 show more ...
Revision tags: llvmorg-16.0.0-rc1, llvmorg-17-init, llvmorg-15.0.7, llvmorg-15.0.6, llvmorg-15.0.5, llvmorg-15.0.4, llvmorg-15.0.3, working, llvmorg-15.0.2, llvmorg-15.0.1, llvmorg-15.0.0, llvmorg-15.0.0-rc3, llvmorg-15.0.0-rc2, llvmorg-15.0.0-rc1, llvmorg-16-init
# f0dd12ec	20-Jul-2022	Sanjay Patel <spatel@rotateright.com>	[x86] use zero-extending load of a byte outside of loops too (2nd try) The first attempt missed changing test files for tools (update_llc_test_checks.py). Original commit message: This implements [x86] use zero-extending load of a byte outside of loops too (2nd try) The first attempt missed changing test files for tools (update_llc_test_checks.py). Original commit message: This implements the main suggested change from issue #56498. Using the shorter (non-extending) instruction with only -Oz ("minsize") rather than -Os ("optsize") is left as a possible follow-up. As noted in the bug report, the zero-extending load may have shorter latency/better throughput across a wide range of x86 micro-arches, and it avoids a potential false dependency. The cost is an extra instruction byte. This could cause perf ups and downs from secondary effects, but I don't think it is possible to account for those in advance, and that will likely also depend on exact micro-arch. This does bring LLVM x86 codegen more in line with existing gcc codegen, so if problems are exposed they are more likely to occur for both compilers. Differential Revision: https://reviews.llvm.org/D129775 show more ...
# 95401b01	19-Jul-2022	Sanjay Patel <spatel@rotateright.com>	Revert "[x86] use zero-extending load of a byte outside of loops too" This reverts commit 9d1ea1774c51c44ddf0b5065bf600919988d7015. There are tests of update_llc_tests_checks.py that missed being up Revert "[x86] use zero-extending load of a byte outside of loops too" This reverts commit 9d1ea1774c51c44ddf0b5065bf600919988d7015. There are tests of update_llc_tests_checks.py that missed being updated. show more ...
# 9d1ea177	19-Jul-2022	Sanjay Patel <spatel@rotateright.com>	[x86] use zero-extending load of a byte outside of loops too This implements the main suggested change from issue #56498. Using the shorter (non-extending) instruction with only -Oz ("minsize") rath [x86] use zero-extending load of a byte outside of loops too This implements the main suggested change from issue #56498. Using the shorter (non-extending) instruction with only -Oz ("minsize") rather than -Os ("optsize") is left as a possible follow-up. As noted in the bug report, the zero-extending load may have shorter latency/better throughput across a wide range of x86 micro-arches, and it avoids a potential false dependency. The cost is an extra instruction byte. This could cause perf ups and downs from secondary effects, but I don't think it is possible to account for those in advance, and that will likely also depend on exact micro-arch. This does bring LLVM x86 codegen more in line with existing gcc codegen, so if problems are exposed they are more likely to occur for both compilers. Differential Revision: https://reviews.llvm.org/D129775 show more ...
Revision tags: llvmorg-14.0.6, llvmorg-14.0.5, llvmorg-14.0.4, llvmorg-14.0.3, llvmorg-14.0.2, llvmorg-14.0.1, llvmorg-14.0.0, llvmorg-14.0.0-rc4, llvmorg-14.0.0-rc3, llvmorg-14.0.0-rc2, llvmorg-14.0.0-rc1, llvmorg-15-init, llvmorg-13.0.1, llvmorg-13.0.1-rc3, llvmorg-13.0.1-rc2
# bb6447a7	13-Dec-2021	Kazu Hirata <kazu@google.com>	[llvm] Use llvm::reverse (NFC)
Revision tags: llvmorg-13.0.1-rc1, llvmorg-13.0.0, llvmorg-13.0.0-rc4, llvmorg-13.0.0-rc3, llvmorg-13.0.0-rc2, llvmorg-13.0.0-rc1, llvmorg-14-init
# 47c3fe2a	01-Jul-2021	Jeremy Morse <jeremy.morse@sony.com>	[DebugInfo][InstrRef][1/4] Support transformations that widen values Very late in compilation, backends like X86 will perform optimisations like this: $cx = MOV16rm $rax, ... -> $rcx = [DebugInfo][InstrRef][1/4] Support transformations that widen values Very late in compilation, backends like X86 will perform optimisations like this: $cx = MOV16rm $rax, ... -> $rcx = MOV64rm $rax, ... Widening the load from 16 bits to 64 bits. SEeing how the lower 16 bits remain the same, this doesn't affect execution. However, any debug instruction reference to the defined operand now refers to a 64 bit value, nto a 16 bit one, which might be unexpected. Elsewhere in codegen, there's often this pattern: CALL64pcrel32 @foo, implicit-def $rax %0:gr64 = COPY $rax %1:gr32 = COPY %0.sub_32bit Where we want to refer to the definition of $eax by the call, but don't want to refer the copies (they don't define values in the way LiveDebugValues sees it). To solve this, add a subregister field to the existing "substitutions" facility, so that we can describe a field within a larger value definition. I would imagine that this would be used most often when a value is widened, and we need to refer to the original, narrower definition. Differential Revision: https://reviews.llvm.org/D88891 show more ...
Revision tags: llvmorg-12.0.1, llvmorg-12.0.1-rc4, llvmorg-12.0.1-rc3, llvmorg-12.0.1-rc2, llvmorg-12.0.1-rc1, llvmorg-12.0.0, llvmorg-12.0.0-rc5, llvmorg-12.0.0-rc4, llvmorg-12.0.0-rc3, llvmorg-12.0.0-rc2, llvmorg-11.1.0, llvmorg-11.1.0-rc3, llvmorg-12.0.0-rc1, llvmorg-13-init, llvmorg-11.1.0-rc2, llvmorg-11.1.0-rc1, llvmorg-11.0.1, llvmorg-11.0.1-rc2, llvmorg-11.0.1-rc1, llvmorg-11.0.0, llvmorg-11.0.0-rc6, llvmorg-11.0.0-rc5, llvmorg-11.0.0-rc4, llvmorg-11.0.0-rc3
# 890707aa	03-Sep-2020	Simon Pilgrim <llvm-dev@redking.me.uk>	[X86] Avoid llvm-qualified-auto warning by not using auto. NFC. Try to consistently use the actual type name in the file.
Revision tags: llvmorg-11.0.0-rc2, llvmorg-11.0.0-rc1, llvmorg-12-init, llvmorg-10.0.1, llvmorg-10.0.1-rc4, llvmorg-10.0.1-rc3, llvmorg-10.0.1-rc2, llvmorg-10.0.1-rc1, llvmorg-10.0.0, llvmorg-10.0.0-rc6, llvmorg-10.0.0-rc5, llvmorg-10.0.0-rc4, llvmorg-10.0.0-rc3
# c3c20c83	15-Feb-2020	Craig Topper <craig.topper@gmail.com>	[X86] Fix typo in comment. NFC
Revision tags: llvmorg-10.0.0-rc2, llvmorg-10.0.0-rc1, llvmorg-11-init, llvmorg-9.0.1, llvmorg-9.0.1-rc3
# d9ae4939	05-Dec-2019	Hiroshi Yamauchi <yamauchi@google.com>	[PGO][PGSO] Instrument the code gen / target passes. Summary: Split off of D67120. Add the profile guided size optimization instrumentation / queries in the code gen or target passes. This doesn't [PGO][PGSO] Instrument the code gen / target passes. Summary: Split off of D67120. Add the profile guided size optimization instrumentation / queries in the code gen or target passes. This doesn't enable the size optimizations in those passes yet as they are currently disabled in shouldOptimizeForSize (for non-IR pass queries). A second try after reverted D71072. Reviewers: davidxl Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71149 show more ...
# 2eb30faf	06-Dec-2019	Hiroshi Yamauchi <yamauchi@google.com>	Revert "[PGO][PGSO] Instrument the code gen / target passes." This reverts commit 9a0b5e14075a1f42a72eedb66fd4fde7985d37ac. This seems to break buildbots.
# 9a0b5e14	05-Dec-2019	Hiroshi Yamauchi <yamauchi@google.com>	[PGO][PGSO] Instrument the code gen / target passes. Summary: Split off of D67120. Add the profile guided size optimization instrumentation / queries in the code gen or target passes. This doesn't [PGO][PGSO] Instrument the code gen / target passes. Summary: Split off of D67120. Add the profile guided size optimization instrumentation / queries in the code gen or target passes. This doesn't enable the size optimizations in those passes yet as they are currently disabled in shouldOptimizeForSize (for non-IR pass queries). Reviewers: davidxl Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71072 show more ...
Revision tags: llvmorg-9.0.1-rc2, llvmorg-9.0.1-rc1
# eabd405e	06-Nov-2019	Simon Pilgrim <llvm-dev@redking.me.uk>	[X86] Fix uninitialized variable warnings. NFCI.
Revision tags: llvmorg-9.0.0, llvmorg-9.0.0-rc6, llvmorg-9.0.0-rc5, llvmorg-9.0.0-rc4
# 0364d89b	06-Sep-2019	Craig Topper <craig.topper@intel.com>	[X86] Teach FixupBWInsts to turn MOVSX16rr8/MOVZX16rr8/MOVSX16rm8/MOVZX16rm8 into their 32-bit dest equivalents when the upper part of the register is dead. llvm-svn: 371240
# 7739fbc9	06-Sep-2019	Craig Topper <craig.topper@intel.com>	[X86] Fix bad indentation. NFC llvm-svn: 371167
Revision tags: llvmorg-9.0.0-rc3
# a17d1d22	16-Aug-2019	Craig Topper <craig.topper@intel.com>	[X86] Use Register/MCRegister in more places in X86 This was a quick pass through some obvious places. I haven't tried the clang-tidy check. I also replaced the zeroes in getX86SubSuperRegister wit [X86] Use Register/MCRegister in more places in X86 This was a quick pass through some obvious places. I haven't tried the clang-tidy check. I also replaced the zeroes in getX86SubSuperRegister with X86::NoRegister which is the real sentinel name. Differential Revision: https://reviews.llvm.org/D66363 llvm-svn: 369151 show more ...
# 0c476111	15-Aug-2019	Daniel Sanders <daniel_l_sanders@apple.com>	Apply llvm-prefer-register-over-unsigned from clang-tidy to LLVM Summary: This clang-tidy check is looking for unsigned integer variables whose initializer starts with an implicit cast from llvm::Re Apply llvm-prefer-register-over-unsigned from clang-tidy to LLVM Summary: This clang-tidy check is looking for unsigned integer variables whose initializer starts with an implicit cast from llvm::Register and changes the type of the variable to llvm::Register (dropping the llvm:: where possible). Partial reverts in: X86FrameLowering.cpp - Some functions return unsigned and arguably should be MCRegister X86FixupLEAs.cpp - Some functions return unsigned and arguably should be MCRegister X86FrameLowering.cpp - Some functions return unsigned and arguably should be MCRegister HexagonBitSimplify.cpp - Function takes BitTracker::RegisterRef which appears to be unsigned& MachineVerifier.cpp - Ambiguous operator==() given MCRegister and const Register PPCFastISel.cpp - No Register::operator-=() PeepholeOptimizer.cpp - TargetInstrInfo::optimizeLoadInstr() takes an unsigned& MachineTraceMetrics.cpp - MachineTraceMetrics lacks a suitable constructor Manual fixups in: ARMFastISel.cpp - ARMEmitLoad() now takes a Register& instead of unsigned& HexagonSplitDouble.cpp - Ternary operator was ambiguous between unsigned/Register HexagonConstExtenders.cpp - Has a local class named Register, used llvm::Register instead of Register. PPCFastISel.cpp - PPCEmitLoad() now takes a Register& instead of unsigned& Depends on D65919 Reviewers: arsenm, bogner, craig.topper, RKSimon Reviewed By: arsenm Subscribers: RKSimon, craig.topper, lenary, aemerson, wuzish, jholewinski, MatzeB, qcolombet, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, wdng, nhaehnle, sbc100, jgravelle-google, kristof.beyls, hiraditya, aheejin, kbarton, fedor.sergeev, javed.absar, asb, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, tpr, PkmX, jocewei, jsji, Petar.Avramovic, asbirlea, Jim, s.egerton, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65962 llvm-svn: 369041 show more ...
Revision tags: llvmorg-9.0.0-rc2, llvmorg-9.0.0-rc1, llvmorg-10-init, llvmorg-8.0.1, llvmorg-8.0.1-rc4, llvmorg-8.0.1-rc3
# f3356722	13-Jun-2019	Tom Stellard <tstellar@redhat.com>	X86: Clean up pass initialization Summary: - Remove redundant initializations from pass constructors that were already being initialized by LLVMInitializeX86Target(). - Add initialization functio X86: Clean up pass initialization Summary: - Remove redundant initializations from pass constructors that were already being initialized by LLVMInitializeX86Target(). - Add initialization function for the FPS pass. Reviewers: craig.topper Reviewed By: craig.topper Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63218 llvm-svn: 363221 show more ...
Revision tags: llvmorg-8.0.1-rc2, llvmorg-8.0.1-rc1
# 85bd3978	04-Apr-2019	Evandro Menezes <e.menezes@samsung.com>	[IR] Refactor attribute methods in Function class (NFC) Rename the functions that query the optimization kind attributes. Differential revision: https://reviews.llvm.org/D60287 llvm-svn: 357731
12 3