llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	1516dd70bb	Fix broken test from r325630. llvm-svn: 325634	2018-02-20 22:30:16 +00:00
Lang Hames	919f15a1b4	[PBQP] Fix PR33038 by pruning empty intervals in initializeGraph. Spilling may cause previously non-empty intervals (both for the spilled vreg and others) to become empty. Moving the pruning into initializeGraph catches these cases and fixes PR33038. llvm-svn: 325632	2018-02-20 22:15:09 +00:00
Benjamin Kramer	fd0630665b	[MemoryBuiltins] Check nobuiltin status when identifying calls to free. This is usually not a problem because this code's main purpose is eliminating unused new/delete pairs. We got deletes of nullptr or nobuiltin deletes of builtin new wrong though. llvm-svn: 325630	2018-02-20 22:00:33 +00:00
Sanjay Patel	7365b44b85	[InstCombine] remove unneeded operand swap: NFCI FMul is commutative, so complexity-based canonicalization should always take care of the swap via SimplifyAssociativeOrCommutative(). llvm-svn: 325628	2018-02-20 21:52:46 +00:00
Craig Topper	7fbea20b90	[SelectionDAG] Support known true/false SimplifySetCC cases for comparing against vector splats of constants. This is split off from D42948 and includes just the cases that constant fold to true or false. It also includes some refactoring to keep predicate checks together. This supports things like (setcc uge X, 0) -> true Differential Revision: https://reviews.llvm.org/D43489 llvm-svn: 325627	2018-02-20 21:48:14 +00:00
Sanjay Patel	e29caaa9c5	[PatternMatch] enhance m_SignMask() to ignore undef elements in vectors llvm-svn: 325623	2018-02-20 21:02:40 +00:00
Sanjay Patel	ff7b777bbe	[InstSimplify] add tests for m_SignMask with undef vector elements; NFC llvm-svn: 325622	2018-02-20 20:53:35 +00:00
Evandro Menezes	72f3983633	[AArch64] Refactor instructions using SIMD immediates Get rid of icky goto loops and make the code easier to maintain. Otherwise, NFC. Restore r324903 and fix PR36369. Differentail revision: https://reviews.llvm.org/D43364 llvm-svn: 325621	2018-02-20 20:31:45 +00:00
Teresa Johnson	a344fd3db6	[LTO] Remove unused Path parameter to AddBufferFn Summary: With D43396, no clients use the Path parameter anymore. Depends on D43396. Reviewers: pcc Subscribers: mehdi_amini, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D43400 llvm-svn: 325619	2018-02-20 20:21:53 +00:00
Teresa Johnson	b145cca85e	[ThinLTO/gold] Avoid race with cache pruner by copying to temp files Summary: This will avoid the race condition described in the review for D37993. I believe that the Path parameter to AddBufferFn is no longer utilized. I would prefer to remove that as a follow up clean up patch to reduce the diffs in this patch. Reviewers: pcc Reviewed By: pcc Subscribers: inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D43396 llvm-svn: 325618	2018-02-20 19:51:30 +00:00
Alexey Bataev	42bcec7d38	[LV] Fix test checks, NFC. llvm-svn: 325617	2018-02-20 19:49:25 +00:00
Sjoerd Meijer	4d5c40492a	[ARM] Lower BR_CC for f16 This case wasn't handled yet. Differential Revision: https://reviews.llvm.org/D43508 llvm-svn: 325616	2018-02-20 19:28:05 +00:00
Stanislav Mekhanoshin	a3b6d95db4	[AMDGPU] Removed redundant run lines for fmuladd.f16 test. NFC. llvm-svn: 325615	2018-02-20 19:19:56 +00:00
David Blaikie	0e5506838e	[llvm-objdump] Use unique_ptr to simplify memory ownership Followup to r325099/r325100 to simplify further. llvm-svn: 325612	2018-02-20 18:48:51 +00:00
Simon Pilgrim	75853bf149	[X86][MMX] Regenerate MMX bitcast test llvm-svn: 325611	2018-02-20 18:48:29 +00:00
Simon Pilgrim	2cf3769a7e	[X86][3DNow] Regenerate intrinsics tests llvm-svn: 325609	2018-02-20 18:44:21 +00:00
Sanjay Patel	a604370004	[IRBuilder] fix CreateMaxNum to actually produce maxnum (PR36454) The bug was introduced here: https://reviews.llvm.org/rL296409 ...but the patch doesn't use maxnum and nothing else in trunk has tried since then, so the bug went unnoticed. llvm-svn: 325607	2018-02-20 18:21:43 +00:00
Krzysztof Parzyszek	f9f2005f94	[Hexagon] Handle *Low8 register classes in early if-conversion llvm-svn: 325606	2018-02-20 18:19:17 +00:00
Alexey Bataev	47dfd249f0	[SLP] Fix tests checks, NFC. llvm-svn: 325605	2018-02-20 18:11:50 +00:00
Craig Topper	df0c22fcd3	[X86] Correct SHRUNKBLEND creation to work correctly when there are multiple uses of the condition. SimplifyDemandedBits forces the demanded mask to all 1s if the node has multiple uses, unless the AssumeSingleUse flag is set. So previously we were only really likely to simplify something if the condition had a single use. And on the off chance we did simplify with multiple uses the demanded mask being used was all ones so there was no reason to create a shrunkblend. This patch now checks that the condition is only used by selects first, and then sets the AssumeSingleUse flag for the simplifcation. Then we convert the selects to shrunkblend, and finally replace condition. Differential Revision: https://reviews.llvm.org/D43446 llvm-svn: 325604	2018-02-20 17:58:17 +00:00
Craig Topper	35801fa5ce	[SelectionDAG] Add LegalTypes flag to getShiftAmountTy. Use it to unify and simplify DAGCombiner and simplifySetCC code and fix a bug. DAGCombiner and SimplifySetCC both use getPointerTy for shift amounts pre-legalization. DAGCombiner uses a single helper function to hide this. SimplifySetCC does it in multiple places. This patch adds a defaulted parameter to getShiftAmountTy that can make it return getPointerTy for scalar types. Use this parameter to simplify the SimplifySetCC and DAGCombiner. Additionally, there were two places in SimplifySetCC that were creating shifts using the target's preferred shift amount pre-legalization. If the target uses a narrow type and the type is illegal, this can cause SimplfiySetCC to create a shift with an amount that can't represent all possible shift values for the type. To fix this we should use pointer type there too. Alternatively we could make getScalarShiftAmountTy for each target return a safe value for large types as proposed in D43445. And maybe we should still do that, but fixing the SimplifySetCC code keeps other targets from tripping over this in the future. Fixes PR36250. Differential Revision: https://reviews.llvm.org/D43449 llvm-svn: 325602	2018-02-20 17:41:05 +00:00
Craig Topper	010ae8dcbb	[X86] Promote 16-bit cmovs to 32-bits This allows us to avoid an opsize prefix. And forcing some move immediates to i32 avoids a length changing prefix on those instructions. This mostly replaces the existing combine we had for zext/sext+cmov of constants. I left in a case for sign extending a 32 bit cmov of constants to 64 bits. Differential Revision: https://reviews.llvm.org/D43327 llvm-svn: 325601	2018-02-20 17:41:00 +00:00
Jonas Devlieghere	563c901bac	[dsymutil] Correctly handle DW_TAG_label This patch contains logic for handling DW_TAG_label that's present in darwin's dsymutil implementation, but not yet upstream. Differential revision: https://reviews.llvm.org/D43438 llvm-svn: 325600	2018-02-20 17:34:29 +00:00
Mikhail Maltsev	581a7f0bef	[vim] Recognize more FileCheck comments Summary: Currently vim syntax highlighting recognizes 'CHECK:' as a special comment, but not CHECK-DAG, CHECK-NOT and other CHECKs. This patch adds rules for these comments. Reviewers: chandlerc, compnerd, rogfer01 Reviewed By: rogfer01 Subscribers: rogfer01, llvm-commits Differential Revision: https://reviews.llvm.org/D43289 llvm-svn: 325599	2018-02-20 17:27:44 +00:00
Sanjay Patel	29b98ae337	[InstCombine] remove unneeded dyn_cast to prevent unused variable warning llvm-svn: 325597	2018-02-20 17:14:53 +00:00
Sanjay Patel	b2d978682b	[InstCombine] remove compound fdiv pattern folds These are fdiv-with-constant-divisor, so they already become reciprocal multiplies. The last gap for vector ops should be closed with rL325590. It's possible that we're missing folds for some edge cases with denormal intermediate constants after deleting these, but there are no tests for those patterns, and it would be better to handle denormals more consistently (and less conservatively) as noted in TODO comments. llvm-svn: 325595	2018-02-20 16:52:17 +00:00
Sanjay Patel	90f4c8ec29	[InstCombine] fold fdiv with non-splat divisor to fmul: X/C --> X * (1/C) llvm-svn: 325590	2018-02-20 16:08:15 +00:00
Simon Dardis	d3860e6670	[mips] Correct the definition of cvt.d.w An upcoming patch D41434, changes the ordering of the matcher table for assembly. This patch corrects the definition of the normal MIPS cvt.d.w not to be available in microMIPS. llvm-svn: 325589	2018-02-20 15:55:17 +00:00
Alexey Bataev	0d6aeadc40	[DEBUGINFO] Add support for emission of the inlined strings. Summary: Patch adds an option for emission of inlined strings rather than .debug_str section. Reviewers: echristo, jlebar Subscribers: eraman, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D43390 llvm-svn: 325583	2018-02-20 15:28:08 +00:00
Lei Huang	dfd41552f4	[PowerPC] Reduce stack frame for fastcc functions by only allocating parameter save area when needed Current implementation always allocates the parameter save area conservatively for fastcc functions. There is no reason to allocate the parameter save area if all the parameters can be passed via registers. Differential Revision: https://reviews.llvm.org/D42602 llvm-svn: 325581	2018-02-20 15:09:45 +00:00
Krzysztof Parzyszek	b404fae9e3	[Hexagon] Fix alignment calculation of stack objects in Hexagon bit tracker llvm-svn: 325580	2018-02-20 14:29:43 +00:00
Simon Pilgrim	61ba223704	[X86] Regenerate XOR tests llvm-svn: 325579	2018-02-20 14:08:39 +00:00
Simon Pilgrim	2f29afb439	[VectorLegalizer] Fix uint64_t typo in ExpandUINT_TO_FLOAT (PR36391) ExpandUINT_TO_FLOAT can accept vXi32 or vXi64 inputs, so we need to use a uint64_t shift to generate the 2^(BW/2) constant. No test case unfortunately as no upstream target uses this, but its affecting a downstream target. llvm-svn: 325578	2018-02-20 13:24:24 +00:00
David Green	056476497e	[ARM] Mark -1 as cheap in xor's for thumb1 We can always convert xor %a, -1 into MVN, even in thumb 1 where the -1 would not otherwise be considered a cheap constant. This prevents the -1's from being pulled out into constants and potentially hoisted. Differential Revision: https://reviews.llvm.org/D43451 llvm-svn: 325573	2018-02-20 11:07:35 +00:00
George Rimar	da4f43a4b4	[llvm-mc] - Produce R_X86_64_PLT32 for "call/jmp foo". For instructions like call foo and jmp foo patch changes relocation produced from R_X86_64_PC32 to R_X86_64_PLT32. Relocation can be used as a marker for 32-bit PC-relative branches. Linker will reduce PLT32 relocation to PC32 if function is defined locally. Differential revision: https://reviews.llvm.org/D43383 llvm-svn: 325569	2018-02-20 10:17:57 +00:00
Tim Renouf	8234b4893a	[AMDGPU] stop buffer_store being moved illegally Summary: The machine instruction scheduler was illegally moving a buffer store past a buffer load with the same descriptor and offset. Fixed by marking buffer ops as mayAlias and isAliased. This may be overly conservative, and we may need to revisit. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D43332 Change-Id: Iff3173d9e0653e830474546276ab9d30318b8ef7 llvm-svn: 325567	2018-02-20 10:03:38 +00:00
George Rimar	9712113f8b	[MC] - Don't crash on unclosed frame. llvm-mc can crash when there is cfi_startproc without cfi_end_proc: .text .globl foo foo: .cfi_startproc Testcase shows the issue, patch fixes it. Differential revision: https://reviews.llvm.org/D43456 llvm-svn: 325564	2018-02-20 09:04:13 +00:00
Gadi Haber	2cede2e229	[X86][CET]: Adding full coverage of MC encoding for the CET instructions.<NFC> NFC. Adding MC regressions tests to cover the CET instructions. This patch is part of a larger task to cover MC encoding of all X86 isa sets started in revision: https://reviews.llvm.org/D39952 Reviewers: zvi, craig.topper, RKSimon, AndreiGrischenko, oren_ben_simhon Differential Revision: https://reviews.llvm.org/D41329 Change-Id: I9c133d4ba07508ce8fd738a1230edd586e2c2f1b llvm-svn: 325561	2018-02-20 08:00:31 +00:00
Craig Topper	9256ac1a58	[X86] Add 512-bit unmasked pmulhrsw/pmulhw/pmulhuw intrinsics. Remove and auto upgrade 128/256/512 bit masked pmulhrsw/pmulhw/pmulhuw intrinsics. The 128 and 256 bit versions were already not used by clang. This adds an equivalent unmasked 512 bit version. Then autoupgrades all sizes to use unmasked intrinsics plus select. llvm-svn: 325559	2018-02-20 07:28:14 +00:00
Craig Topper	8e31ef948c	[X86] Remove GCCBuiltin from a bunch of intrinsics that aren't used by clang and should be removed. llvm-svn: 325552	2018-02-20 05:49:22 +00:00
Serge Pavlov	76d8ccee2e	Report fatal error in the case of out of memory This is the second part of recommit of r325224. The previous part was committed in r325426, which deals with C++ memory allocation. Solution for C memory allocation involved functions `llvm::malloc` and similar. This was a fragile solution because it caused ambiguity errors in some cases. In this commit the new functions have names like `llvm::safe_malloc`. The relevant part of original comment is below, updated for new function names. Analysis of fails in the case of out of memory errors can be tricky on Windows. Such error emerges at the point where memory allocation function fails, but manifests itself when null pointer is used. These two points may be distant from each other. Besides, next runs may not exhibit allocation error. In some cases memory is allocated by a call to some of C allocation functions, malloc, calloc and realloc. They are used for interoperability with C code, when allocated object has variable size and when it is necessary to avoid call of constructors. In many calls the result is not checked for null pointer. To simplify checks, new functions are defined in the namespace 'llvm': `safe_malloc`, `safe_calloc` and `safe_realloc`. They behave as corresponding standard functions but produce fatal error if allocation fails. This change replaces the standard functions like 'malloc' in the cases when the result of the allocation function is not checked for null pointer. Finally, there are plain C code, that uses malloc and similar functions. If the result is not checked, assert statement is added. Differential Revision: https://reviews.llvm.org/D43010 llvm-svn: 325551	2018-02-20 05:41:26 +00:00
Amara Emerson	db211892ed	[AArch64][GlobalISel] When copying from a gpr32 to an fpr16 reg, convert to fpr32 first. This is a follow on commit to r[x] where we fix the other direction of copy. For this case, after converting the source from gpr32 -> fpr32, we use a subregister copy, which is essentially what EXTRACT_SUBREG does in SDAG land. https://reviews.llvm.org/D43444 llvm-svn: 325550	2018-02-20 05:11:57 +00:00
Craig Topper	41f64c3204	[X86] Mark XOP vpmac* and vpmadc intrinsics as being commutative so that tablegen will generate patterns with the load in operand 0. This allows loads to be folded during isel without the peephole pass. llvm-svn: 325548	2018-02-20 03:58:14 +00:00
Craig Topper	a05ed17316	[X86] Make XOP VPCOM instructions commutable to fold loads during isel. llvm-svn: 325547	2018-02-20 03:58:13 +00:00
Craig Topper	9b64bf54b9	[X86] Make a helper function for commuting AVX512 VPCMP immediates since we do it in two places. llvm-svn: 325546	2018-02-20 03:58:11 +00:00
Aditya Nandakumar	bab2d3e2b9	[GISel]: Add pattern matchers for G_BITCAST/PTRTOINT/INTTOPTR Adds pattern matchers for the above along with unit tests for the same. https://reviews.llvm.org/D43479 llvm-svn: 325542	2018-02-19 23:11:53 +00:00
Sanjay Patel	2816560b2c	[InstCombine] use CreateWithCopiedFlags to reduce code; NFCI Also, move the folds with constants closer to make it easier to follow. llvm-svn: 325541	2018-02-19 23:09:03 +00:00
Brian Gesiak	d1eabb1810	Revert "[mem2reg] Use range loops (NFCI)" This reverts commit r325532. llvm-svn: 325539	2018-02-19 22:48:51 +00:00
Craig Topper	b195ed8ce3	[X86] Use vpmovq2m/vpmovd2m for truncate to vXi1 when possible. Previously we used vptestmd, but the scheduling data for SKX says vpmovq2m/vpmovd2m is lower latency. We already used vpmovb2m/vpmovw2m for byte/word truncates. So this is more consistent anyway. llvm-svn: 325534	2018-02-19 22:07:31 +00:00
Sanjay Patel	1d14779aed	[InstCombine] allow fdiv with constant dividend folds with less than full -ffast-math It's possible that we could allow this either 'arcp' or 'reassoc' alone, but this should be conservatively better than what we have right now. GCC allows this with only -freciprocal-math. The last test is changed to show a case that is expected to fold, but we need D43398. llvm-svn: 325533	2018-02-19 21:46:52 +00:00

1 2 3 4 5 ...

160433 Commits