llvm-project

Commit Graph

Author	SHA1	Message	Date
Uriel Korach	2aa707bdaa	[X86] test/testn intrinsics lowering to IR. llvm part. Remove builtins from llvm and add AutoUpgrade support. Also add fast-isel tests for the TEST and TESTN instructions. Differential Revision: https://reviews.llvm.org/D38736 llvm-svn: 318036	2017-11-13 12:51:18 +00:00
Uriel Korach	5b2b71d909	[X86] test/testn intrinsics lowering to IR. clang side Change Header files of the intrinsics for lowering test and testn intrinsics to IR code. Removed test and testn builtins from clang Differential Revision: https://reviews.llvm.org/D38737 llvm-svn: 318035	2017-11-13 12:50:52 +00:00
Greg Bedwell	99e183cd5a	Move the setting of LLVM_BUILD_MODE to a macro so that we can re-use it in compiler-rt Differential Revision: https://reviews.llvm.org/D38470 llvm-svn: 318034	2017-11-13 12:40:05 +00:00
Momchil Velikov	842aa90192	[ARM] Place jump table as the first operand in additions When generating table jump code for switch statements, place the jump table label as the first operand in the various addition instructions in order to enable addressing mode selectors to better match index computation and possibly fold them into the addressing mode of the table entry load instruction. Differential revision: https://reviews.llvm.org/D39752 llvm-svn: 318033	2017-11-13 11:56:48 +00:00
Simon Dardis	8e2a5bd235	[CodeGenPrepare] Check that erased sunken address are not reused CodeGenPrepare sinks address computations from one basic block to another and attempts to reuse address computations that have already been sunk. If the same address computation appears twice with the first instance as an operand of a load whose result is an operand to a simplifiable select, CodeGenPrepare simplifies the select and recursively erases the now dead instructions. CodeGenPrepare then attempts to use the erased address computation for the second load. Fix this by erasing the cached address value if it has zero uses before looking for the address value in the sunken address map. This partially resolves PR35209. Thanks to Alexander Richardson for reporting the issue! Reviewers: john.brawn Differential Revision: https://reviews.llvm.org/D39841 llvm-svn: 318032	2017-11-13 11:47:21 +00:00
Jina Nahias	aecd4f5f9d	Change // CHECK: shufflevector <8 x double> %0, <8 x double> %{{.*}}, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 8, i32 9, i32 8, i32 9> To // CHECK: shufflevector <8 x double> %{{.*}}, <8 x double> %{{.*}}, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 8, i32 9, i32 8, i32 9> for fixing 318025 commit warning Change-Id: Id48a1fe1f247fe6a0b84e7189f18d2e637678e79 llvm-svn: 318031	2017-11-13 11:41:41 +00:00
Gabor Horvath	5cfada60b4	[analyzer] Document the issue hash debugging facility Differential Revision: https://reviews.llvm.org/D39543 llvm-svn: 318030	2017-11-13 11:13:02 +00:00
Florian Hahn	7114755913	[CodeExtractor] Add missing AllowVarArgs initialization. llvm-svn: 318029	2017-11-13 11:08:47 +00:00
Florian Hahn	0e9dec672d	[PartialInliner] Inline vararg functions that forward varargs. Summary: This patch extends the partial inliner to support inlining parts of vararg functions, if the vararg handling is done in the outlined part. It adds a `ForwardVarArgsTo` argument to InlineFunction. If it is non-null, all varargs passed to the inlined function will be added to all calls to `ForwardVarArgsTo`. The partial inliner takes care to only pass `ForwardVarArgsTo` if the varargs handling is done in the outlined function. It checks that vastart is not part of the function to be inlined. `test/Transforms/CodeExtractor/PartialInlineNoInline.ll` (already part of the repo) checks we do not do partial inlining if vastart is used in a basic block that will be inlined. Reviewers: davide, davidxl, grosser Reviewed By: davide, davidxl, grosser Subscribers: gyiu, grosser, eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D39607 llvm-svn: 318028	2017-11-13 10:35:52 +00:00
Sander de Smalen	070a7ff1ad	Test commit llvm-svn: 318027	2017-11-13 09:57:20 +00:00
Jina Nahias	9a7f9f123c	[x86][AVX512] Lowering shuffle i/f intrinsics to LLVM IR This patch, together with a matching clang patch (https://reviews.llvm.org/D38672), implements the lowering of X86 shuffle i/f intrinsics to IR. Differential Revision: https://reviews.llvm.org/D38671 Change-Id: I1e7d359a74743e995ec356237a85214ce55d3661 llvm-svn: 318026	2017-11-13 09:16:39 +00:00
Jina Nahias	dca979194d	[x86][AVX512] Lowering shuffle i/f intrinsics to LLVM IR This patch, together with a matching llvm patch (https://reviews.llvm.org/D38671), implements the lowering of X86 shuffle i/f intrinsics to IR. Differential Revision: https://reviews.llvm.org/D38672 Change-Id: I9b3c2f2b34323bd9ccb21d0c1832f848b88ec047 llvm-svn: 318025	2017-11-13 09:15:31 +00:00
Gadi Haber	c9f2300652	[X86][SKX] Adding scheduling info of non-intrinsic + commutable SKX opcodes. Updated the scheduling information of the SKX subtarget in the file X86SchedSkylakeServer.td under lib/Target/X86 to: 1. add regular opcodes in addition to the suffixed "_Int" opcodes 2. add the (V)MAXCPD/MAXCPS/MAXCSD/MAXCSS/MINCPD/MINCPS/MINCSD/MINCSS instructions that are equivalent to their counterparts without the 'C' as they are part of a hack to make floating point min/max commutable under fast math. Reviewers: zvi, RKSimon, craig.topper Differential Revision: https://reviews.llvm.org/D39833 Change-Id: Ie13702a5ce1b1a08af91ca637a52b6962881e7d6 llvm-svn: 318024	2017-11-13 08:42:07 +00:00
Craig Topper	1af2adb9f3	[X86] Limit NOPs to 7 bytes when 'slm' is spelled 'silvermont'. We support 2 spellings for silvermont and we should accept both here. llvm-svn: 318023	2017-11-13 08:17:30 +00:00
Craig Topper	75d71540f8	[X86] Use sse_load_f32/f64 to improve load folding of scalar vfscalefss/sd, vrcp14ss/sd, rsqrt14ss/sd instructions. llvm-svn: 318022	2017-11-13 08:07:33 +00:00
Craig Topper	c748455e51	[X86] Regenerate test. NFC llvm-svn: 318021	2017-11-13 08:07:31 +00:00
Matt Arsenault	88efb9ff8e	MI: Print ranges on MMO llvm-svn: 318020	2017-11-13 07:09:20 +00:00
Craig Topper	ca8abedb2a	[X86] Use sse_load_f32/f64 to improve load folding for scalar VFPCLASS intrinsics. llvm-svn: 318019	2017-11-13 06:46:48 +00:00
Craig Topper	bf328f263e	[X86] Add tests for missed opportunities to fold a 128-bit vector load into vfpclassss and vfpclasssd. llvm-svn: 318018	2017-11-13 06:46:46 +00:00
Matt Arsenault	e5e0c742df	AMDGPU: Preserve nuw in shl add ptr combine llvm-svn: 318017	2017-11-13 05:33:35 +00:00
Craig Topper	d4f6094091	[X86] Fix SQRTSS/SQRTSD/RCPSS/RCPSD intrinsics to use sse_load_f32/sse_load_f64 to increase load folding opportunities. llvm-svn: 318016	2017-11-13 05:25:24 +00:00
Craig Topper	24389c6746	[X86] Add tests for full vector loads to fold-load-unops.ll. We should be able to fold a full vector load into a scalar intrinsic. Since it's legal to narrow a load. llvm-svn: 318015	2017-11-13 05:25:23 +00:00
Craig Topper	a95a1fd42d	[X86] Regenerate fold-load-unops.ll and add an avx512f command line. llvm-svn: 318014	2017-11-13 05:25:21 +00:00
Matt Arsenault	fbe9533509	AMDGPU: Fix multi-use shl/add combine This was using a custom function that didn't handle the addressing modes properly for private. Use isLegalAddressingMode to avoid duplicating this. Additionally, skip the combine if there is only one use since the standard combine will handle it. llvm-svn: 318013	2017-11-13 05:11:54 +00:00
Marshall Clow	843ec14af4	Put the status in the wrong column llvm-svn: 318012	2017-11-13 04:15:39 +00:00
Marshall Clow	fbb0a5aa3f	Implement P0550R2: Transformation Trait remove_cvref llvm-svn: 318011	2017-11-13 03:59:22 +00:00
Craig Topper	23493f3777	[X86] Attempt to fix signed and unsigned comparison warning. llvm-svn: 318010	2017-11-13 02:19:13 +00:00
Craig Topper	deee24b83c	[X86] Use sse_load_f32/f64 in patterns for the memory forms of VRNDSCALESS/SD. llvm-svn: 318009	2017-11-13 02:03:01 +00:00
Craig Topper	63157c4784	[X86] Use EVEX encoded VRNDSCALE instructions to implement the legacy round intrinsics. The VRNDSCALE instructions implement a superset of the (V)ROUND instructions. They are equivalent if the upper 4-bits of the immediate are 0. This patch lowers the legacy intrinsics to the VRNDSCALE ISD node and masks the upper bits of the immediate to 0. This allows us to take advantage of the larger register encoding space. We should maybe consider converting VRNDSCALE back to VROUND in the EVEX to VEX pass if the extended registers are not being used. I notice some load folding opportunities being missed for the VRNDSCALESS/SD instructions that I'll try to fix in future patches. llvm-svn: 318008	2017-11-13 02:03:00 +00:00
Craig Topper	0af48f1ad4	[X86] Split VRNDSCALE/VREDUCE/VGETMANT/VRANGE ISD nodes into versions with and without the rounding operand. NFCI I want to reuse the VRNDSCALE node for the legacy SSE rounding intrinsics so that those intrinsics can use EVEX instructions. All of these nodes share tablegen multiclasses so I split them all so that they all remain similar in their implementations. llvm-svn: 318007	2017-11-13 02:02:58 +00:00
Matt Arsenault	90e4f719e1	Fix some misc. -enable-var-scope violations llvm-svn: 318006	2017-11-13 01:47:52 +00:00
Matt Arsenault	e1cd482fda	AMDGPU: Select d16 loads into low component of register llvm-svn: 318005	2017-11-13 00:22:09 +00:00
Matt Arsenault	70b9282015	AMDGPU: Fix -enable-var-scope violations llvm-svn: 318004	2017-11-12 23:53:44 +00:00
Matt Arsenault	cf9b6d8d57	AMDGPU: Fix missing gfx9 atomic inc/dec tests The global instructions weren't tested. Plus there were also some -enable-var-scope violations and broken check prefixes. llvm-svn: 318003	2017-11-12 23:40:12 +00:00
Vitaly Buka	8b9d6be24d	[sanitizer] Simplify stack check in accert.cc Somehow on arm bots stack does not include main. llvm-svn: 318002	2017-11-12 21:15:19 +00:00
Vitaly Buka	1925591925	[sanitizer] Try to see test output on armv7 llvm-svn: 318001	2017-11-12 20:25:14 +00:00
Marshall Clow	199216376a	Two more papers from Albuquerque llvm-svn: 318000	2017-11-12 18:52:16 +00:00
Craig Topper	b42a23ff8f	[X86] Add an X86ISD::RANGES opcode to use for the scalar intrinsics. This fixes a bug where we selected packed instructions for scalar intrinsics. llvm-svn: 317999	2017-11-12 18:51:09 +00:00
Craig Topper	6b53c4a982	[X86] Add test cases and command lines demonstrating how we accidentally select vrangeps/vrangepd from vrangess/vrangesd intrinsics when the rounding mode is CUR_DIRECTION llvm-svn: 317998	2017-11-12 18:51:08 +00:00
Craig Topper	1382932c12	[X86] Remove some no longer needed intrinsic lowering code. llvm-svn: 317997	2017-11-12 18:51:06 +00:00
Marshall Clow	952d249987	Updated C++2a status page with new features/defects approved in Albuquerque. llvm-svn: 317996	2017-11-12 18:48:42 +00:00
Mandeep Singh Grang	d104673257	[llvm] Remove redundant return [NFC] Reviewers: davidxl, olista01, Eugene.Zelenko Reviewed By: Eugene.Zelenko Subscribers: sdardis, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D39917 llvm-svn: 317995	2017-11-12 03:47:50 +00:00
Craig Topper	d3e5781e53	[InstCombine] Teach visitICmpInst to not break integer absolute value idioms Summary: This patch adds an early out to visitICmpInst if we are looking at a compare as part of an integer absolute value idiom. Similar is already done for min/max. In the particular case I observed in a benchmark we had an absolute value of a load from an indexed global. We simplified the compare using foldCmpLoadFromIndexedGlobal into a magic bit vector, a shift, and an and. But the load result was still used for the select and the negate part of the absolute value idiom. So we overcomplicated the code and lost the ability to recognize it as an absolute value. I've chosen a simpler case for the test here. Reviewers: spatel, davide, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39766 llvm-svn: 317994	2017-11-12 02:28:21 +00:00
Dave Lee	bfb990b783	Fix AST matcher documentation typo llvm-svn: 317993	2017-11-11 23:53:27 +00:00
Dave Lee	0934fdc3a5	Add ObjC exception statement AST matchers Summary: Add AST matchers for Objective-C @throw, @try, @catch and @finally. Reviewers: aaron.ballman, malcolm.parsons, alexshap, compnerd Reviewed By: aaron.ballman Subscribers: cfe-commits, klimek Differential Revision: https://reviews.llvm.org/D39940 llvm-svn: 317992	2017-11-11 22:46:15 +00:00
Craig Topper	ac250825c6	[X86] Use vrndscaleps/pd for 128/256 ffloor/ftrunc/fceil/fnearbyint/frint when avx512vl is enabled. This matches what we do for scalar and 512-bit types. llvm-svn: 317991	2017-11-11 21:44:51 +00:00
Craig Topper	ae9ffa1f5a	[X86] Remove avx512-round.ll. The 512-bit rounding tests are now in vec_floor.ll with 128/256 sizes. llvm-svn: 317990	2017-11-11 21:44:50 +00:00
Craig Topper	e44fc7836e	[X86] Add avx512vl command line to vec_floor.ll. Add 512-bit test cases. llvm-svn: 317989	2017-11-11 21:44:49 +00:00
Craig Topper	a9f48803d7	[X86] Add avx512f command line to rounding-ops.ll llvm-svn: 317988	2017-11-11 21:44:48 +00:00
Craig Topper	1a20db2108	[X86] Regenerate rounding-ops.ll with update_llc_test_checks.py llvm-svn: 317987	2017-11-11 21:44:47 +00:00