llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	32da2f9245	[DAGCombine] Permit combining of shuffles of equivalent splat BUILD_VECTORs combineShuffleOfScalars is very conservative about shuffled BUILD_VECTORs that can be combined together. This patch adds one additional case - if both BUILD_VECTORs represent splats of the same scalar value but with different UNDEF elements, then we should create a single splat BUILD_VECTOR, sharing only the UNDEF elements defined by the shuffle mask. Differential Revision: https://reviews.llvm.org/D38696 llvm-svn: 316331	2017-10-23 15:48:08 +00:00
Simon Pilgrim	03c8753924	[X86][SSE] Regenerate bitcast-and-setcc tests Avoid the retl/retq changes in an upcoming patch llvm-svn: 316328	2017-10-23 14:47:49 +00:00
Simon Pilgrim	e131cb0bd5	[X86][AVX2] Regenerate AVX2 intrinsics tests on 32 + 64-bit targets llvm-svn: 316326	2017-10-23 14:19:46 +00:00
Simon Pilgrim	c680c4742b	[X86][AVX] Regenerate AVX intrinsics tests on 32 + 64-bit targets llvm-svn: 316325	2017-10-23 14:17:59 +00:00
Simon Pilgrim	eae6e9dbc5	[X86][F16C] Regenerate F16C schedule tests llvm-svn: 316324	2017-10-23 14:15:24 +00:00
Ayman Musa	4b2bd5ff5e	[X86] Add test for opportunity to use bzhi X86 instruction instead of load+and instructions. Transformation uploaded for CR in https://reviews.llvm.org/D34141. llvm-svn: 316320	2017-10-23 10:24:19 +00:00
Marina Yatsina	f9371d821f	Add logic to greedy reg alloc to avoid bad eviction chains This fixes bugzilla 26810 https://bugs.llvm.org/show_bug.cgi?id=26810 This is intended to prevent sequences like: movl %ebp, 8(%esp) # 4-byte Spill movl %ecx, %ebp movl %ebx, %ecx movl %edi, %ebx movl %edx, %edi cltd idivl %esi movl %edi, %edx movl %ebx, %edi movl %ecx, %ebx movl %ebp, %ecx movl 16(%esp), %ebp # 4 - byte Reload Such sequences are created in 2 scenarios: Scenario #1: vreg0 is evicted from physreg0 by vreg1 Evictee vreg0 is intended for region splitting with split candidate physreg0 (the reg vreg0 was evicted from) Region splitting creates a local interval because of interference with the evictor vreg1 (normally region spliiting creates 2 interval, the "by reg" and "by stack" intervals. Local interval created when interference occurs.) one of the split intervals ends up evicting vreg2 from physreg1 Evictee vreg2 is intended for region splitting with split candidate physreg1 one of the split intervals ends up evicting vreg3 from physreg2 etc.. until someone spills Scenario #2 vreg0 is evicted from physreg0 by vreg1 vreg2 is evicted from physreg2 by vreg3 etc Evictee vreg0 is intended for region splitting with split candidate physreg1 Region splitting creates a local interval because of interference with the evictor vreg1 one of the split intervals ends up evicting back original evictor vreg1 from physreg0 (the reg vreg0 was evicted from) Another evictee vreg2 is intended for region splitting with split candidate physreg1 one of the split intervals ends up evicting vreg3 from physreg2 etc.. until someone spills As compile time was a concern, I've added a flag to control weather we do cost calculations for local intervals we expect to be created (it's on by default for X86 target, off for the rest). Differential Revision: https://reviews.llvm.org/D35816 Change-Id: Id9411ff7bbb845463d289ba2ae97737a1ee7cc39 llvm-svn: 316295	2017-10-22 17:59:38 +00:00
Momchil Velikov	d6a4ab3d49	[ARM] Dynamic stack alignment for 16-bit Thumb This patch implements dynamic stack (re-)alignment for 16-bit Thumb. When targeting processors, which support only the 16-bit Thumb instruction set the compiler ignores the alignment attributes of automatic variables and may silently generate incorrect code. Differential revision: https://reviews.llvm.org/D38143 llvm-svn: 316289	2017-10-22 11:56:35 +00:00
Guy Blank	92d5ce3bd4	[X86] Add a pass to convert instruction chains between domains. The pass scans the function to find instruction chains that define registers in the same domain (closures). It then calculates the cost of converting the closure to another domain. If found profitable, the instructions are converted to instructions in the other domain and the register classes are changed accordingly. This commit adds the pass infrastructure and a simple conversion from the GPR domain to the Mask domain. Differential Revision: https://reviews.llvm.org/D37251 Change-Id: Ic2cf1d76598110401168326d411128ae2580a604 llvm-svn: 316288	2017-10-22 11:43:08 +00:00
Aaron Ballman	fc02869c96	Reverting r316270 due to failing build bots. http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules-2/builds/12899 http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/7951 llvm-svn: 316276	2017-10-21 20:38:15 +00:00
Simon Pilgrim	3cb024490a	[X86][SSE] Add extractps/pextrd equivalence to domain tables Differential Revision: https://reviews.llvm.org/D39135 llvm-svn: 316274	2017-10-21 20:19:48 +00:00
Fangrui Song	c7b749bd06	[PPC CodeGen] Fix the bitreverse.i64 intrinsic. Summary: The two 32-bit words were swapped. Subscribers: nemanjai, kbarton Differential Revision: https://reviews.llvm.org/D38705 llvm-svn: 316270	2017-10-21 16:59:40 +00:00
Simon Pilgrim	7025b07828	[X86][SSE] Add missing extractps scheduling test llvm-svn: 316262	2017-10-21 14:35:09 +00:00
Craig Topper	fcf27188d7	[X86] Do not generate __multi3 for mul i128 on X86 Summary: __multi3 is not available on x86 (32-bit). Setting lib call name for MULI_128 to nullptr forces DAGTypeLegalizer::ExpandIntRes_MUL to generate instructions for 128-bit multiply instead of a call to an undefined function. This fixes PR20871 though it may be worth looking at why licm and indvars combine to generate 65-bit multiplies in that test. Patch by Riyaz V Puthiyapurayil Reviewers: craig.topper, schweitz Reviewed By: craig.topper, schweitz Subscribers: RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D38668 llvm-svn: 316254	2017-10-21 02:26:00 +00:00
Krzysztof Parzyszek	9d19c8cac9	[Packetizer] Add function to check for aliasing between instructions llvm-svn: 316243	2017-10-20 22:08:40 +00:00
Krzysztof Parzyszek	022922b31a	[Hexagon] Report error instead of crashing on wrong inline-asm constraints llvm-svn: 316236	2017-10-20 20:24:44 +00:00
Krzysztof Parzyszek	64e5d7d3ae	[Hexagon] Reorganize and update instruction patterns llvm-svn: 316228	2017-10-20 19:33:12 +00:00
Simon Pilgrim	1311ff1340	[X86][SSE] Add missing _mm_extract_ps fast-isel test llvm-svn: 316226	2017-10-20 19:29:01 +00:00
Sanjay Patel	bb94161fb7	[x86] avoid FileCheck assert duplication with retl/retq regex; NFC This was suggested in PR35003: https://bugs.llvm.org/show_bug.cgi?id=35003 32-bit checks may be identical to 64-bit (if we avoid those pesky scalar params!). I'll check in the script change shortly assuming this doesn't anger any bots. llvm-svn: 316223	2017-10-20 18:35:32 +00:00
Dave Lee	f9b72327b0	Make x86 __ehhandler comdat if parent function is Summary: This change comes from using lld for i686-windows-msvc. Before this change, lld emits an error of: error: relocation against symbol in discarded section: .xdata It's possible that this could be addressed in lld, but I think this change is reasonable on its own. At a high level, this is being generated: A (.text comdat) -> B (.text) -> C (.xdata comdat) Where A is a C++ inline function, which references B, an exception handler thunk, which references C, the exception handling info. With this structure, lld will error when applying relocations to B if the C it references has been discarded (some other C has been selected). This change checks if A is comdat, and if so places the exception registration thunk (B) in the comdata group of A (and B). It appears that MSVC makes the __ehhandler function comdat. Is it possible that duplicate thunks are being emitted into the final binary with other linkers, or are they stripping the unused thunks? Reviewers: rnk, majnemer, compnerd, smeenai Reviewed By: rnk, compnerd Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38940 llvm-svn: 316219	2017-10-20 17:04:43 +00:00
Krzysztof Parzyszek	3818aeaeb9	[Hexagon] Allow redefinition with immediates for hw loop conversion Normally, if the registers holding the induction variable's bounds are redefined inside of the loop's body, the loop cannot be converted to a hardware loop. However, if the redefining instruction is actually loading an immediate value into the register, this conversion is both possible and legal (since the immediate itself will be used in the loop setup in the preheader). llvm-svn: 316218	2017-10-20 16:56:33 +00:00
Simon Pilgrim	b6b617b7d8	[X86] Check all CPU target names. We ignore the 32-bit/64-bit triple but I've tried to use i686 triples for CPUs that don't support x86_64 llvm-svn: 316217	2017-10-20 16:55:51 +00:00
Zvi Rackover	e95709d54a	X86 Tests: Add tests for vector permutes with variable indices. NFC. Basic tests which are the equivalent of single-source shufflevector with variable mask. llvm-svn: 316216	2017-10-20 15:32:14 +00:00
Aleksandar Beserminji	143572984d	Revert "[mips] Reordering callseq* nodes to be linear" This reverts commit r314507, because the original patch is causing test failures. llvm-svn: 316215	2017-10-20 14:35:41 +00:00
Eugene Leviant	27b226fb65	[ARM] Use post-RA MI scheduler when +use-misched is set Differential revision: https://reviews.llvm.org/D39100 llvm-svn: 316214	2017-10-20 14:29:17 +00:00
Simon Pilgrim	46b791921f	[X86][AVX512] Regenerate regcall tests. As part of tracking down machine verifier issues (PR27481) llvm-svn: 316213	2017-10-20 14:13:02 +00:00
Dylan McKay	6670e42402	[AVR] Fix the select-mbb-placement-bug.ll llvm-svn: 316205	2017-10-20 04:17:14 +00:00
Nemanja Ivanovic	0026c06e11	Disabling the transformation introduced in r315888 The commit at https://reviews.llvm.org/rL315888 is causing some failures with internal testing. Disabling this code until we can resolve the issues. llvm-svn: 316199	2017-10-20 00:36:46 +00:00
Alex Bradbury	8971842f43	[RISCV] Initial codegen support for ALU operations This adds the minimum necessary to support codegen for simple ALU operations on RV32. Prolog and epilog insertion, support for memory operations etc etc follow in future patches. Leave guessInstructionProperties=1 until https://reviews.llvm.org/D37065 is reviewed and lands. Differential Revision: https://reviews.llvm.org/D29933 llvm-svn: 316188	2017-10-19 21:37:38 +00:00
Simon Pilgrim	e8e2c4c0cf	[X86][AES] Test AES intrinsics on 32/64-bit targets with/without VEX encoding Don't just test on 32-bit llvm-svn: 316176	2017-10-19 19:05:04 +00:00
Krzysztof Parzyszek	e4d0e199bf	[Hexagon] Fix store conversion from rr to io in optimize addressing modes llvm-svn: 316170	2017-10-19 16:59:22 +00:00
Simon Pilgrim	fdd63d1535	[X86] Replace custom scalar integer absolute matching with ISD::ABS lowering. x86 has its own copy of integer absolute pattern matching to combine directly to a SUB+CMOV. This patch removes the x86 combine and adds custom lowering support for ISD::ABS instead, allowing us to use the DAGCombiner version. Additional test cases are already covered by iabs.ll (rL315706 and rL315711). Differential Revision: https://reviews.llvm.org/D38895 llvm-svn: 316162	2017-10-19 15:02:24 +00:00
Simon Pilgrim	d0649f978f	[X86] Add scalar (abs (abs x)) -> (abs x) combine test. Before landing D38895 llvm-svn: 316160	2017-10-19 14:59:26 +00:00
Diana Picus	7bf71008aa	[ARM GlobalISel] Fix liveins in test. NFC llvm-svn: 316155	2017-10-19 09:28:19 +00:00
Diana Picus	a993859335	[ARM GlobalISel] Remove redundant tests These test cases don't really add anything that isn't covered by other tests as well, so we can safely remove them. llvm-svn: 316154	2017-10-19 08:50:28 +00:00
Justin Bogner	876ad287d1	GISel: Canonicalize select tests using update_mir_test_checks This runs `udpate_mir_test_checks --add-vreg-checks` on the tests taht are already more or less in the format that generates, so that there will be less churn in some upcoming changes. llvm-svn: 316139	2017-10-18 23:33:31 +00:00
Justin Bogner	f8dc015bd1	AArch64/GISel: Modernize the localizer test llvm-svn: 316138	2017-10-18 23:26:24 +00:00
Justin Bogner	d45849f703	Canonicalize a large number of mir tests using update_mir_test_checks This converts a large and somewhat arbitrary set of tests to use update_mir_test_checks. I ran the script on all of the tests I expect to need to modify for an upcoming mir syntax change and kept the ones that obviously didn't change the tests in ways that might make it harder to understand. llvm-svn: 316137	2017-10-18 23:18:12 +00:00
Dylan McKay	443695f80a	[AVR] Fix the select_mbb_placement_bug.ll test llvm-svn: 316124	2017-10-18 20:04:57 +00:00
Sumanth Gundapaneni	e1983bcf55	[Hexagon] New HVX target features. This patch lets the llvm tools handle the new HVX target features that are added by frontend (clang). The target-features are of the form "hvx-length64b" for 64 Byte HVX mode, "hvx-length128b" for 128 Byte mode HVX. "hvx-double" is an alias to "hvx-length128b" and is soon will be deprecated. The hvx version target feature is upgated form "+hvx" to "+hvxv{version_number}. Eg: "+hvxv62" For the correct HVX code generation, the user must use the following target features. For 64B mode: "+hvxv62" "+hvx-length64b" For 128B mode: "+hvxv62" "+hvx-length128b" Clang picks a default length if none is specified. If for some reason, no hvx-length is specified to llvm, the compilation will bail out. There is a corresponding clang patch. Differential Revision: https://reviews.llvm.org/D38851 llvm-svn: 316101	2017-10-18 18:07:07 +00:00
Konstantin Zhuravlyov	8d5e9e110c	AMDGPU: Rename MaxFlatWorkgroupSize to MaxFlatWorkGroupSize for consistency Differential Revision: https://reviews.llvm.org/D38957 llvm-svn: 316097	2017-10-18 17:31:09 +00:00
Justin Bogner	2ac32cc9ce	AArch64/GISel: Fix a couple of tests that were testing the wrong thing Fix a couple of tests that were extending the wrong vreg, and regenerate their checks with update_mir_test_checks. This looks like it was a copy-paste or test update error. llvm-svn: 316087	2017-10-18 15:34:33 +00:00
Simon Dardis	03c2c65b2d	[mips] Fix analyzeBranch to handle debug data In the case where there was a conditional branch followed by a unconditional branch with debug instruction separating them, MipsInstrInfo::analyzeBranch would not skip past debug instruction when searching for the second branch which give erroneous results about the control flow of the block. This could lead to the branch folder to merge the non-fall through case into it's predecessor, leaving the conditional branch with a dangling basic block operand. This resolves PR34975. Thanks to Alexander Richardson for reporting the issue! Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D39003 llvm-svn: 316084	2017-10-18 14:35:29 +00:00
Simon Dardis	77bf0fd59c	[mips] Move test to correct directory. NFCI llvm-svn: 316081	2017-10-18 13:59:48 +00:00
Michael Zuckerman	7ba046c784	Adding new test for bug fix 316067 https://bugs.llvm.org/show_bug.cgi?id=34978 This test checks that the x86-interleaved ends without any assertion. Change-Id: I1e970482a4d0404516cbc85517fc091bb21c35a8 llvm-svn: 316080	2017-10-18 13:51:31 +00:00
Hiroshi Inoue	5388e66d3a	[PowerPC] Use helper functions to check sign-/zero-extended value Helper functions to identify sign- and zero-extending machine instruction is introduced in rL315888. This patch makes PPCInstrInfo::optimizeCompareInstr use the helper functions. It simplifies the code and also makes possible more optimizations since the helper can do more analysis than the original check code; I observed about 5000 more compare instructions are eliminated while building LLVM. Also, this patch fixes a bug in helpers on ANDIo instruction handling due to the order of checks. This bug causes a failure in an existing test case for optimizeCompareInstr. Differential Revision: https://reviews.llvm.org/D38988 llvm-svn: 316071	2017-10-18 10:31:19 +00:00
Wei Ding	7ab1f7a421	AMDGPU : Fix an error for the llvm.cttz implementation. Differential Revision: http://reviews.llvm.org/D39014 llvm-svn: 316037	2017-10-17 21:49:52 +00:00
Tim Northover	350a87eaf1	AArch64: account for possible frame index operand in compares. If the address of a local is used in a comparison, AArch64 can fold the address-calculation into the comparison via "adds". Unfortunately, a couple of places (both hit in this one test) are not ready to deal with that yet and just assume the first source operand is a register. llvm-svn: 316035	2017-10-17 21:43:52 +00:00
Simon Pilgrim	7cd4e2c96f	[X86][SSE] Tests packuswb/truncation codegen from PR34773 llvm-svn: 316033	2017-10-17 21:14:53 +00:00
Konstantin Zhuravlyov	7dabe9ced7	AMDGPU: Start generating metadata for MaxFlatWorkGroupSize Differential Revision: https://reviews.llvm.org/D38958 llvm-svn: 316024	2017-10-17 20:03:21 +00:00

1 2 3 4 5 ...

21934 Commits