llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	205f65f62f	[X86][AVX2] Relaxed alignment on nontemporal store tests llvm-svn: 271646	2016-06-03 10:06:59 +00:00
Simon Pilgrim	8ea8940677	[X86][AVX2] Regenerated nontemporal store tests and added tests for all 256-bit vector types llvm-svn: 271645	2016-06-03 09:56:24 +00:00
Simon Pilgrim	e85506b6e0	[X86][XOP] Support for VPERMIL2PD/VPERMIL2PS 2-input shuffle instructions This patch begins adding support for lowering to the XOP VPERMIL2PD/VPERMIL2PS shuffle instructions - adding the X86ISD::VPERMIL2 opcode and cleaning up the usage. The internal llvm intrinsics were assuming the shuffle mask operand was the same type as the float/double input operands (I guess to simplify the intrinsic definitions in X86InstrXOP.td to a single value type). These needed changing to integer types (matching the clang builtin and the AMD intrinsics definitions), an auto upgrade path is added to convert old calls. Mask decoding/target shuffle support will be added in future patches. Differential Revision: http://reviews.llvm.org/D20049 llvm-svn: 271633	2016-06-03 08:06:03 +00:00
Craig Topper	e7ae106147	[AVX512] Ensure EVEX vpshufd, vpshuflw, and vpshufhw have isel priority over the VEX encoded ones. llvm-svn: 271629	2016-06-03 05:31:04 +00:00
Craig Topper	01f53b1773	[AVX512] Fix shuffle comment printing for EVEX encoded PSHUFD, PSHUFHW, and PSHUFLW. llvm-svn: 271628	2016-06-03 05:31:00 +00:00
Simon Pilgrim	ab95b2fe26	[X86][SSE] Added SSE41/AVX2 non-temporal tests Useful for when we add MOVNTDQA support llvm-svn: 271552	2016-06-02 18:01:21 +00:00
Dimitry Andric	6a482a73d6	Only attempt to detect AVG if SSE2 is available Summary: In PR29973 Sanjay Patel reported an assertion failure when a certain loop was optimized, for a target without SSE2 support. It turned out this was because of the AVG pattern detection introduced in rL253952. Prevent the assertion failure by bailing out early in `detectAVGPattern()`, if the target does not support SSE2. Also add a minimized test case. Reviewers: congh, eli.friedman, spatel Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D20905 llvm-svn: 271548	2016-06-02 17:30:49 +00:00
Sanjay Patel	f509d85a6d	[DAG] use getBitcast() to reduce code Although this was intended to be NFC, the test case wiggle shows a change in code scheduling/RA caused by a difference in the SDLoc() generation. Depending on how you look at it, this is the (dis)advantage of exact checking in regression tests. llvm-svn: 271526	2016-06-02 16:01:15 +00:00
Simon Pilgrim	ebdc397c86	[X86][SSE] Added non-temporal load tests for vector types These currently lower to regular loads instead of MOVNTDQA llvm-svn: 271516	2016-06-02 13:51:50 +00:00
Simon Pilgrim	0afd5a4d80	[X86][SSE] Replace (V)CVTTPS2DQ and VCVTTPD2DQ truncating (round to zero) f32/f64 to i32 with generic IR (llvm) This patch removes the llvm intrinsics (V)CVTTPS2DQ and VCVTTPD2DQ truncation (round to zero) conversions and auto-upgrades to FP_TO_SINT calls instead. Note: I looked at updating CVTTPD2DQ as well but this still requires a lot more work to correctly lower. Differential Revision: http://reviews.llvm.org/D20860 llvm-svn: 271510	2016-06-02 10:55:21 +00:00
Craig Topper	ca9c0801e1	[X86] Add AVX 256-bit load and stores to fast isel. I'm not sure why this was missing for so long. This also exposed that we were picking floating point 256-bit VMOVNTPS for some integer types in normal isel for AVX1 even though VMOVNTDQ is available. In practice it doesn't matter due to the execution dependency fix pass, but it required extra isel patterns. Fixing that in a follow up commit. llvm-svn: 271481	2016-06-02 04:19:45 +00:00
Craig Topper	f10fbfa738	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271478	2016-06-02 04:19:36 +00:00
Sanjay Patel	b4a4357ecb	[x86, AVX2] regenerate checks llvm-svn: 271434	2016-06-01 21:32:56 +00:00
Michael Kuperstein	738ae45ce8	[DAG] Improve legalization of INSERT_SUBVECTOR When the index is known to be constant 0, insert directly into the the low half, instead of spilling, performing the insert in-memory, and reloading. Differential Revision: http://reviews.llvm.org/D20763 llvm-svn: 271428	2016-06-01 20:49:35 +00:00
Than McIntosh	4ef761aa35	Better fix for PR27903. Summary: Re-enable lifetime-start-on-first-use for stack coloring, but explicitly disable it for slots with more than one start or end lifetime marker. Bug: 27903 Reviewers: wmi, tejohnson, qcolombet, gbiv Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20739 llvm-svn: 271412	2016-06-01 17:55:10 +00:00
Simon Pilgrim	1cd61b82bd	[X86][SSE] Added non-temporal store tests for all 512-bit vector types llvm-svn: 271393	2016-06-01 13:58:00 +00:00
Simon Pilgrim	288be8bab6	[X86][SSE] Added non-temporal store tests for all 256-bit vector types Also added KNL AVX-512 checks llvm-svn: 271391	2016-06-01 13:20:25 +00:00
Simon Pilgrim	80f5335969	[X86][SSE] Added non-temporal store tests for all 128-bit integer vector types llvm-svn: 271389	2016-06-01 13:05:00 +00:00
Michael Zuckerman	6a894956fc	Adding back-end support to two bit scanning intrinsics Adding LLVM back-end support to two intrinsics dealing with bit scan: _bit_scan_forward and _bit_scan_reverse. Their functionality is as described in Intel intrinsics guide: https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_forward&expand=371,370 https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_reverse&expand=371,370 Commit on behalf of Omer Paparo Bivas Differential Revision: http://reviews.llvm.org/D19915 llvm-svn: 271386	2016-06-01 12:02:37 +00:00
Craig Topper	4f2d5a68d3	Revert r271362 "[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead." Looks like something isn't quite right still. Also forgot to move the test cases to an autoupgrade test. llvm-svn: 271363	2016-06-01 05:57:55 +00:00
Craig Topper	dacd9d2bac	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271362	2016-06-01 05:35:16 +00:00
Kevin B. Smith	ed0b620a65	[X86]: Add a pattern that uses GR16_ABCD rather than GR32_ABCD to avoid falsely marking whole 32 bit register as live. Differential Revision: http://reviews.llvm.org/D20649 llvm-svn: 271341	2016-05-31 22:00:12 +00:00
Simon Pilgrim	e05dc45897	[X86][SSE] Add load-folding patterns for (V)CVTDQ2PD (PR27291) Added patterns for (V)CVTDQ2PD -> 2f64 loading from a 64-bit source. llvm-svn: 271269	2016-05-31 12:04:35 +00:00
Igor Breger	73ee8ba9b0	[AVX512] Fix intrinsic vcvtps2ph lowering. Differential Revision: http://reviews.llvm.org/D20788 llvm-svn: 271255	2016-05-31 08:04:21 +00:00
Igor Breger	52bd1d5fcc	Fix intrinsic vbroadcast{i32\|f32}x2 lowering. Differential Revision: http://reviews.llvm.org/D20780 llvm-svn: 271254	2016-05-31 07:43:39 +00:00
Craig Topper	50f85c22c5	[AVX512] Remove masked store intrinsics. Clang now emits generic masked store intrinsics instead. The intrinsics will be autoupgraded to the same generic masked stores. llvm-svn: 271245	2016-05-31 01:50:02 +00:00
Saleem Abdulrasool	d2f705ddf9	X86: permit using SjLj EH on x86 targets as an option This adds support to the backed to actually support SjLj EH as an exception model. This is NOT the default model, and requires explicitly opting into it from the frontend. GCC supports this model and for MinGW can still be enabled via the `--using-sjlj-exceptions` options. Addresses PR27749! llvm-svn: 271244	2016-05-31 01:48:07 +00:00
Craig Topper	8287fd8abd	[X86] Remove SSE/AVX unaligned store intrinsics as clang no longer uses them. Auto upgrade to native unaligned store instructions. llvm-svn: 271236	2016-05-30 23:15:56 +00:00
Craig Topper	39716f8358	[X86] Use update_llc_test_checks.py to re-generate a test in preparation for an upcoming commit. NFC llvm-svn: 271234	2016-05-30 22:54:14 +00:00
Simon Pilgrim	d788c9d83d	[X86][XOP] Split off auto-upgraded xop intrinsics llvm-svn: 271228	2016-05-30 19:50:56 +00:00
Simon Pilgrim	582d75b0eb	[X86][SSE] Renamed pmovxrm tests These aren't intrinsics anymore - as discussed on D20686 llvm-svn: 271226	2016-05-30 19:14:37 +00:00
Simon Pilgrim	24da61058a	[X86][AVX2] Regenerated AVX2 extension tests llvm-svn: 271224	2016-05-30 18:49:57 +00:00
Simon Pilgrim	d64af65f6d	[X86][SSE] Updated storeu fast-isel tests to match clang builtin tests Since rL271214 the headers have no longer used the storeu intrinsic llvm-svn: 271222	2016-05-30 18:42:51 +00:00
Simon Pilgrim	4ed0e07b23	[X86][SSE2] Updated _mm_store_pd1/_mm_store1_pd fast-isel tests to match D20617 llvm-svn: 271220	2016-05-30 18:18:44 +00:00
Simon Pilgrim	9602d678cb	[X86][SSE] (Reapplied) Replace (V)PMOVSX and (V)PMOVZX integer extension intrinsics with generic IR (llvm) This patch removes the llvm intrinsics VPMOVSX and (V)PMOVZX sign/zero extension intrinsics and auto-upgrades to SEXT/ZEXT calls instead. We already did this for SSE41 PMOVSX sometime ago so much of that implementation can be reused. Reapplied now that the the companion patch (D20684) removes/auto-upgrade the clang intrinsics has been committed. Differential Revision: http://reviews.llvm.org/D20686 llvm-svn: 271131	2016-05-28 18:03:41 +00:00
Sanjay Patel	97c2c108fd	[x86] avoid printing unnecessary sign bits of hex immediates in asm comments (PR20347) It would be better to check the valid/expected size of the immediate operand, but this is generally better than what we print right now. Differential Revision: http://reviews.llvm.org/D20385 llvm-svn: 271114	2016-05-28 14:58:37 +00:00
Ahmed Bougacha	a3dc1ba142	[X86] Try to zero elts when lowering 256-bit shuffle with PSHUFB. Otherwise we fallback to a blend of PSHUFBs later on. Differential Revision: http://reviews.llvm.org/D19661 llvm-svn: 271113	2016-05-28 14:38:04 +00:00
Michael Kuperstein	a75c77b127	[X86] Detect SAD patterns and emit psadbw instructions. This recommits r267649 with a fix for PR27539. Differential Revision: http://reviews.llvm.org/D20598 llvm-svn: 271033	2016-05-27 18:53:22 +00:00
Simon Pilgrim	7e67a22298	[X86][AVX] Removed some remains of old (pre-regeneration) filechecks llvm-svn: 271007	2016-05-27 15:56:19 +00:00
Than McIntosh	4daf7f13b6	Disable lifetime-start-on-first-use analysis. Summary: Turn off lifetime-start-on-first-use enhancement for the moment pending a fix for bug 27903. Bug: 27903 Reviewers: tejohnson, wmi, qcolombet, gbiv Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20731 llvm-svn: 271003	2016-05-27 15:27:51 +00:00
Simon Pilgrim	4642a57fbf	Revert: r270973 - [X86][SSE] Replace (V)PMOVSX and (V)PMOVZX integer extension intrinsics with generic IR (llvm) llvm-svn: 270976	2016-05-27 09:02:25 +00:00
Simon Pilgrim	c013e5737b	[X86][SSE] Replace (V)PMOVSX and (V)PMOVZX integer extension intrinsics with generic IR (llvm) This patch removes the llvm intrinsics VPMOVSX and (V)PMOVZX sign/zero extension intrinsics and auto-upgrades to SEXT/ZEXT calls instead. We already did this for SSE41 PMOVSX sometime ago so much of that implementation can be reused. A companion patch (D20684) removes/auto-upgrade the clang intrinsics. Differential Revision: http://reviews.llvm.org/D20686 llvm-svn: 270973	2016-05-27 08:49:15 +00:00
Mitch Bodart	05aeeb5cf1	[CodeGen] Fix problem with X86 byte registers in CriticalAntiDepBreaker CriticalAntiDepBreaker was not correctly tracking defs of the high X86 byte registers, leading to incorrect use of a busy register to break an antidependence. Fixes pr27681, and its duplicates pr27580, pr27804. Differential Revision: http://reviews.llvm.org/D20456 llvm-svn: 270935	2016-05-26 23:08:52 +00:00
Simon Pilgrim	cf340bd9c1	[X86][SSE] When lowering a 256-bit shuffle as PMOVZX, reduce the input vector to the lower 128-bit subvector. Most often as not this is what it started out as, the extraction is zero-cost on AVX and the PMOVZX/PMOVSX folding logic is based around 128-bit loads. llvm-svn: 270858	2016-05-26 15:40:36 +00:00
Simon Pilgrim	50c37ceb3b	[X86][SSE] Added load_zext_16i8_to_8i32 test Odd issue with input vector not being folded into pmovzx on AVX2+ targets llvm-svn: 270852	2016-05-26 14:45:30 +00:00
Igor Breger	8437bb70fd	[AVX512] Fix intrinsic cmp{sd\|ss} lowering. Differential Revision: http://reviews.llvm.org/D20615 llvm-svn: 270843	2016-05-26 12:42:25 +00:00
Simon Pilgrim	ab3809193c	[X86][F16C] Added F16C fast-isel tests to match clang/test/CodeGen/f16c-builtins.c llvm-svn: 270837	2016-05-26 10:26:56 +00:00
Simon Pilgrim	0e4fdc0842	[X86][AVX2] Added gather fast-isel tests to match clang/test/CodeGen/avx2-builtins.c llvm-svn: 270835	2016-05-26 10:07:05 +00:00
Simon Pilgrim	d6469e3467	[X86][SSE41] Removed pblendw intrinsics tests - they are auto-upgraded Equivalent tests included in sse41-intrinsics-x86-upgrade.ll - the i8/i32 immediate diff doesn't matter anymore llvm-svn: 270767	2016-05-25 21:27:58 +00:00
Simon Pilgrim	fa814259ad	[X86][SSE41] Regenerated intrinsics tests llvm-svn: 270764	2016-05-25 21:21:51 +00:00

1 2 3 4 5 ...

7560 Commits