llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	e7e4cc5f98	[InstCombine] add tests for missing icmp fold (PR32524) llvm-svn: 299557	2017-04-05 16:21:38 +00:00
Dmitry Preobrazhensky	45db65037f	[AMDGPU][MC] Fix for Bug 28167 + LIT tests Corrected src0 for v_writelane_b32: - Enabled inline constants and literals for SI/CI (VOP2) - Enabled inline constants for VI (VOP3) Reviewers: vpykhtin, arsenm https://reviews.llvm.org/D31463 llvm-svn: 299555	2017-04-05 16:08:21 +00:00
Nirav Dave	aa65a2beb8	[SystemZ] Prevent Merging Bitcast with non-normal loads Fixes PR32505. Reviewers: uweigand, jonpa Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31609 llvm-svn: 299552	2017-04-05 15:42:48 +00:00
Davide Italiano	dfeea506c0	[yaml2obj] Factor out error handling code. llvm-svn: 299551	2017-04-05 15:18:16 +00:00
Davide Italiano	de87d145cc	[llvm-ar] Remove unneeded std::, NFCI. This makes it more consistent with other exit() calls in llvm-ar (and the tools in general). llvm-svn: 299549	2017-04-05 15:05:05 +00:00
Davide Italiano	79ebe31c65	[llvm-ar] errors go on stderr and not on stdout. llvm-svn: 299548	2017-04-05 14:52:17 +00:00
Jonathan Roelofs	ff9a2f9175	Respect CMAKE_INSTALL_MANDIR for sphinx generated manpages This is a re-work of r297516, which was reverted in r297545. https://reviews.llvm.org/D30906 llvm-svn: 299547	2017-04-05 14:49:46 +00:00
Davide Italiano	91f00258be	[yaml2obj] Improve error message when output file cannot be opened. Patch by Sam Clegg! Differential Revision: https://reviews.llvm.org/D31351 llvm-svn: 299546	2017-04-05 14:44:00 +00:00
Matthew Simpson	1a4d5c9860	[LV] Make test case more robust This test case depends on the loop being vectorized without forcing the vectorization factor. If the profitability ever changes in the future (due to cost model improvements), the test may no longer work as intended. Instead of checking the resulting IR, we should just check the instruction costs. The costs will be computed regardless if vectorization is profitable. llvm-svn: 299545	2017-04-05 14:34:13 +00:00
Sanjay Patel	b2f1621bb1	[DAGCombiner] add and use TLI hook to convert and-of-seteq / or-of-setne to bitwise logic+setcc (PR32401) This is a generic combine enabled via target hook to reduce icmp logic as discussed in: https://bugs.llvm.org/show_bug.cgi?id=32401 It's likely that other targets will want to enable this hook for scalar transforms, and there are probably other patterns that can use bitwise logic to reduce comparisons. Note that we are missing an IR canonicalization for these patterns, and we will probably prefer the pair-of-compares form in IR (shorter, more likely to fold). Differential Revision: https://reviews.llvm.org/D31483 llvm-svn: 299542	2017-04-05 14:09:39 +00:00
Jonas Paulsson	38a2da92bc	[DAGCombiner] Don't make a BUILD_VECTOR with operands of illegal type. When DAGCombiner visits a SIGN_EXTEND_INREG of a BUILD_VECTOR with constant operands, a new BUILD_VECTOR node will be created transformed constants. Llvm-stress found a case where the new BUILD_VECTOR had constant operands of an illegal type, because the (legal) element type is in fact not a legal scalar type. This patch changes this so that the new BUILD_VECTOR has the same operand type as the old one. Review: Eli Friedman, Nirav Dave https://bugs.llvm.org//show_bug.cgi?id=32422 llvm-svn: 299540	2017-04-05 13:45:37 +00:00
Sanjay Patel	8090e6f004	[InstCombine] add tests for missing add canonicalization; NFC llvm-svn: 299539	2017-04-05 13:33:10 +00:00
Daniel Sanders	4f3eb249cf	[globalisel][tablegen] Fix patterns involving multiple ComplexPatterns. Summary: Temporaries are now allocated to operands instead of predicates and this allocation is used to correctly pair up the rendered operands with the matched operands. Previously, ComplexPatterns were allocated temporaries independently in the Src Pattern and Dst Pattern, leading to mismatches. Additionally, the Dst Pattern failed to account for the allocated index and therefore always used temporary 0, 1, ... when it should have used base+0, base+1, ... Thanks to Aditya Nandakumar for noticing the bug. Depends on D30539 Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar Reviewed By: rovka Subscribers: igorb, dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D31054 llvm-svn: 299538	2017-04-05 13:14:03 +00:00
Sam Kolton	34e29784fb	[AMDGPU] SDWA peephole: enable by default Reviewers: vpykhtin, rampitec, arsenm Subscribers: qcolombet, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31671 llvm-svn: 299536	2017-04-05 12:00:45 +00:00
Alexander Kornienko	014ac69f2e	Fix WebAssembly after r299529. llvm-svn: 299535	2017-04-05 11:50:43 +00:00
Simon Pilgrim	5fbd93b21a	[X86][SSE] Renamed combine to make it clear that it only handles the vector shift by immediate opcodes. NFCI llvm-svn: 299532	2017-04-05 10:44:42 +00:00
James Molloy	9d42334e02	[AArch64] Crypto requires FP. So if FP is disabled, crypto should also be disabled. llvm-svn: 299531	2017-04-05 10:44:38 +00:00
Alex Bradbury	866113c2ea	Add MCContext argument to MCAsmBackend::applyFixup for error reporting A number of backends (AArch64, MIPS, ARM) have been using MCContext::reportError to report issues such as out-of-range fixup values in their TgtAsmBackend. This is great, but because MCContext couldn't easily be threaded through to the adjustFixupValue helper function from its usual callsite (applyFixup), these backends ended up adding an MCContext* argument and adding another call to applyFixup to processFixupValue. Adding an MCContext parameter to applyFixup makes this unnecessary, and even better - applyFixup can take a reference to MCContext rather than a potentially null pointer. Differential Revision: https://reviews.llvm.org/D30264 llvm-svn: 299529	2017-04-05 10:16:14 +00:00
James Molloy	37dd4d7aaa	[LAA] Correctly return a half-open range in expandBounds This is a latent bug that's been hanging around for a while. For a loop-invariant pointer, expandBounds would return the range {Ptr, Ptr}, but this was interpreted as a half-open range, not a closed range. So we ended up planting incorrect bounds checks. Even worse, they were tautological, so we ended up incorrectly executing the optimized loop. llvm-svn: 299526	2017-04-05 09:24:26 +00:00
Gor Nishanov	06fdf48a59	[coroutines] Add syntax coloring to examples in Coroutines.rst Subscribers: EricWF Differential Revision: https://reviews.llvm.org/D31699 llvm-svn: 299517	2017-04-05 05:26:26 +00:00
Akira Hatanaka	75be84f3c2	[ObjCArc] Do not dereference an invalidated iterator. Fix a bug in ARC contract pass where an iterator that pointed to a deleted instruction was dereferenced. It appears that tryToContractReleaseIntoStoreStrong was incorrectly assuming that a call to objc_retain would not immediately follow a call to objc_release. rdar://problem/25276306 llvm-svn: 299507	2017-04-05 03:44:09 +00:00
Lang Hames	0932cdb049	[RuntimeDyld] Remove an unused static member left over from r299449. llvm-svn: 299497	2017-04-05 01:43:59 +00:00
Bob Haarman	6de8134784	ThinLTOBitcodeWriter: handle aliases first in filterModule Summary: This change fixes a "local linkage requires default visibility" assert when attempting to build LLVM with ThinLTO on Windows. Reviewers: pcc, tejohnson, mehdi_amini Reviewed By: pcc Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31632 llvm-svn: 299491	2017-04-05 00:42:07 +00:00
Ahmed Bougacha	ec8b1fb539	[X86] Relax assert in broadcast-of-subvector lowering. Before r294774, there was a problem when lowering broadcasts to use 128-bit subvectors. When we looked through a bitcast to find the broadcast input, we'd keep using the original type, so you'd end up with things like: (v8f32 (broadcast (v4f32 (extract_subvector (v8i32 V), ...)) )) r294774 fixed it to always emit subvectors with the scalar type of the original source. It also introduced some asserts, to check that we use scalars with the same size, and vectors with the same number of elements. The scalar size equality is checked earlier when looking through bitcasts, and is a useful assert. However, the number of elements don't have to be identical: we're always going to extract a 128-bit subvector, and we can have different size inputs if we looked through a concat_vector to find a 256-bit source. Relax the overzealous assert. Replace it with a check of the original source vector being 256 or 512 bits. If it's 128 bits, we can't extract_subvector from it. Fixes PR32371. llvm-svn: 299490	2017-04-05 00:14:39 +00:00
Matt Arsenault	7b0d947404	Allow targets to opt-in to codegen in SCC order Decouple this setting from EnableIRPA. To support function calls on AMDGPU, it is necessary to report the global register usage throughout the kernel's call graph, so callees need to be handled first. llvm-svn: 299487	2017-04-04 23:44:46 +00:00
Daniel Berlin	e33bc31df4	Re-apply MemorySSA: Add support for caching clobbering access in stores with some fixes. Summary: This enables us to cache the clobbering access for stores, despite the fact that we can't rewrite the use-def chains themselves. Early testing shows that, after this change, for larger testcases, it will be a significant net positive (memory and time) to remove the walker caching. Reviewers: george.burgess.iv, davide Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31567 llvm-svn: 299486	2017-04-04 23:43:10 +00:00
Daniel Berlin	f49d4c45a1	Revert "MemorySSA: Add support for caching clobbering access in stores" This reverts revision r299322. llvm-svn: 299485	2017-04-04 23:43:04 +00:00
Petr Hosek	880cfd45fc	[MC] Set defaults based on section names and support name suffixes Set correct default flags and section type based on its name for .text, .data, .bss, .init_array, .fini_array, .preinit_array, .tdata, and .tbss and support section name suffixes for .data., .rodata., .text., .bss., .tdata.* and .tbss.* which matches the behavior of GAS. Fixes PR31888. Differential Revision: https://reviews.llvm.org/D30229 llvm-svn: 299484	2017-04-04 23:32:45 +00:00
Ahmed Bougacha	d3c03a5ddd	[AArch64] Avoid partial register deps on insertelt of load into lane 0. This improves upon r246462: that prevented FMOVs from being emitted for the cross-class INSERT_SUBREGs by disabling the formation of INSERT_SUBREGs of LOAD. But the ld1.s that we started selecting caused us to introduce partial dependencies on the vector register. Avoid that by using SCALAR_TO_VECTOR: it's a first-class citizen that is folded away by many patterns, including the scalar LDRS that we want in this case. Credit goes to Adam for finding the issue! llvm-svn: 299482	2017-04-04 22:55:53 +00:00
Evgeniy Stepanov	12de7b2446	Change section flag character for SHF_LINK_ORDER to "o". GAS uses "m" as a compatibility alias for "M" (SHF_MERGE). "o" is free, except on ia64, where it already means SHF_LINK_ORDER. llvm-svn: 299479	2017-04-04 22:35:08 +00:00
Craig Topper	1534495ffd	[InstCombine] Add test cases for various add/subtracts of constants(scalar, splat, and vector) with phis and selects. Improvements coming in a future commit. llvm-svn: 299476	2017-04-04 22:22:30 +00:00
Rafael Espindola	07503baf3a	[lit] Add a minimum export implementation. llvm-svn: 299475	2017-04-04 22:20:18 +00:00
Sanjay Patel	0bf0abedf6	[InstCombine] rename variable for easier reading; NFC We usually give constants a 'C' somewhere in the name... llvm-svn: 299474	2017-04-04 22:06:03 +00:00
Craig Topper	c745b6a1f6	[InstCombine] Turn subtract of vectors of i1 into xor like we do for scalar i1. Matches what we already do for add. llvm-svn: 299472	2017-04-04 21:44:56 +00:00
Balaram Makam	b3120b6d3f	[AArch64] Add missing schedinfo, check completeness for Falkor. llvm-svn: 299468	2017-04-04 21:15:53 +00:00
Keno Fischer	282c62495f	[ExecutionDepsFix] Don't revisit true dependencies If an instruction has a true dependency, it makes sense for to use that register for any undef read operands in the same instruction (we'll have to wait for that register to become available anyway). This logic was already implemented. However, the code would then still try to revisit that instruction and break the dependency (and always fail, since by definition a true dependency has to be live before the instruction). Avoid revisiting such instructions as a performance optimization. No functional change. Differential Revision: https://reviews.llvm.org/D30173 llvm-svn: 299467	2017-04-04 20:30:47 +00:00
Craig Topper	86173600ec	[InstCombine] Support folding and/or/xor with a constant vector RHS into selects and phis Currently we only fold with ConstantInt RHS. This generalizes to any Constant RHS. Differential Revision: https://reviews.llvm.org/D31610 llvm-svn: 299466	2017-04-04 20:26:25 +00:00
Petr Hosek	9eb0a1e09b	[AArch64][Fuchsia] Allow -mcmodel=kernel for --target=aarch64-fuchsia This mode is just like -mcmodel=small except that it moves the thread pointer from TPIDR_EL0 to TPIDR_EL1. Patch by Roland McGrath. Differential Revision: https://reviews.llvm.org/D31624 llvm-svn: 299462	2017-04-04 19:51:53 +00:00
Craig Topper	11791a723b	[InstCombine] Add test cases for missing combines of phis with and/or/xor with constant argument. NFC llvm-svn: 299460	2017-04-04 19:31:21 +00:00
Yi Kong	57019dc9b2	Implement host CPU detection for AArch64 This shares detection logic with ARM(32), since AArch64 capable CPUs may also run in 32-bit system mode. We observe weird /proc/cpuinfo output for MSM8992 and MSM8994, where they report all CPU cores as one single model, depending on which CPU core the kernel is running on. As a workaround, we hardcode the known CPU part name for these SoCs. For big.LITTLE systems, this patch would only return the part name of the first core (usually the little core). Proper support will be added in a follow-up change. Differential Revision: D31675 llvm-svn: 299458	2017-04-04 19:06:04 +00:00
Matt Arsenault	3333968771	Verifier: Check some amdgpu calling convention restrictions llvm-svn: 299457	2017-04-04 18:43:11 +00:00
Balaram Makam	7b5c098cfa	[AArch64] Refine Falkor Machine Model - Part 2 llvm-svn: 299456	2017-04-04 18:42:14 +00:00
Coby Tayree	3847be9410	[X86][inline-asm] Add support for MS 'EVEN' directive MS assembly syntax provide us with the 'EVEN' directive as a synonymous to at&t '.even'. This patch include the (small, simple) changes need to allow it. Test is provided at the following (clang-side) review: https://reviews.llvm.org/D27418 Differential Revision: https://reviews.llvm.org/D27417 llvm-svn: 299453	2017-04-04 17:57:23 +00:00
Craig Topper	78cfbc1635	[InstCombine] Add more test cases for missing combines of selects with and/or/xor with constant argument. NFC llvm-svn: 299450	2017-04-04 17:48:08 +00:00
Lang Hames	d22badef45	[RuntimeDyld] Make RuntimeDyld honor the ProcessAllSections flag. When the ProcessAllSections flag (introduced in r204398) is set RuntimeDyld is supposed to make a call to the client's memory manager for every section in each object that is loaded. Due to some missing checks, this was not happening in all cases. This patch adds the missing cases, and fixes the Orc unit test that verifies correct behavior for ProcessAllSections (The unit test had been silently bailing out due to an ordering issue: a change in the test order meant that this unit-test was running before the native target was registered. This issue has also been fixed in this patch). This fixes <rdar://problem/22789965> llvm-svn: 299449	2017-04-04 17:03:49 +00:00
Sanjay Patel	ac618383e3	[x86] remove dead select-of-constants transform; NFCI https://reviews.llvm.org/D30537 / https://reviews.llvm.org/rL296977 added these transforms and other related transforms to the generic DAGCombiner (with a hook that x86 sets to true), so these patterns should not exist by the time we reach the target-specific combiner hook. llvm-svn: 299448	2017-04-04 16:54:58 +00:00
Rong Xu	48596b6f7a	[PGO] Memory intrinsic calls optimization based on profiled size This patch optimizes two memory intrinsic operations: memset and memcpy based on the profiled size of the operation. The high level transformation is like: mem_op(..., size) ==> switch (size) { case s1: mem_op(..., s1); goto merge_bb; case s2: mem_op(..., s2); goto merge_bb; ... default: mem_op(..., size); goto merge_bb; } merge_bb: Differential Revision: http://reviews.llvm.org/D28966 llvm-svn: 299446	2017-04-04 16:42:20 +00:00
Matt Arsenault	3e90f84806	AMDGPU: Remove legacy export intrinsic llvm-svn: 299444	2017-04-04 16:34:39 +00:00
Matt Arsenault	236da200f1	AMDGPU: Remove legacy image intrinsics llvm-svn: 299443	2017-04-04 16:34:35 +00:00
Coby Tayree	2cb497afa4	[X86][MS-compatability]Allow named synonymous for MS-assembly operators This patch enhances X86AsmParser's immediate expression parsing abilities, to include a named synonymous for selected binary/unary bitwise operators: {and,shl,shr,or,xor,not}, ultimately achieving better MS-compatability MASM reference: https://msdn.microsoft.com/en-us/library/94b6khh4.aspx Differential Revision: D31277 llvm-svn: 299439	2017-04-04 14:43:23 +00:00
Simon Pilgrim	448222d8ba	Strip trailing whitespace llvm-svn: 299438	2017-04-04 14:40:53 +00:00
Daniel Sanders	9e4817d49e	[globalisel][tablegen] Fix non-determinism introduced in r299430. This should fix the last issue on llvm-clang-x86_64-expensive-checks-win. llvm-svn: 299436	2017-04-04 14:27:06 +00:00
Daniel Sanders	db7ed37c7a	[globalisel][tablegen] Try to make MSVC happy with r299430 Fix other cases of 'const StringRef' creeping back in at the same time. This should fix the llvm-clang-x86_64-expensive-checks-win buildbot. llvm-svn: 299433	2017-04-04 13:52:00 +00:00
Michael Zuckerman	88fb171015	[X86][LLVM] Converting __mm{\|256\|512}_movm_epi{8\|16\|32\|64} LLVMIR call into generic intrinsics. This patch is a part one of two reviews, one for the clang and the other for LLVM. The patch deletes the back-end intrinsics and adds support for them in the auto upgrade. Differential Revision: https://reviews.llvm.org/D31393 llvm-svn: 299432	2017-04-04 13:32:14 +00:00
Daniel Sanders	bee5739a7c	[tablegen][globalisel] Add support for nested instruction matching. Summary: Lift the restrictions that prevented the tree walking introduced in the previous change and add support for patterns like: (G_ADD (G_MUL (G_SEXT $src1), (G_SEXT $src2)), $src3) -> SMADDWrrr $dst, $src1, $src2, $src3 Also adds support for G_SEXT and G_ZEXT to support these cases. One particular aspect of this that I should draw attention to is that I've tried to be overly conservative in determining the safety of matches that involve non-adjacent instructions and multiple basic blocks. This is intended to be used as a cheap initial check and we may add a more expensive check in the future. The current rules are: * Reject if any instruction may load/store (we'd need to check for intervening memory operations. * Reject if any instruction has implicit operands. * Reject if any instruction has unmodelled side-effects. See isObviouslySafeToFold(). Reviewers: t.p.northover, javed.absar, qcolombet, aditya_nandakumar, ab, rovka Reviewed By: ab Subscribers: igorb, dberris, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30539 llvm-svn: 299430	2017-04-04 13:25:23 +00:00
Simon Dardis	0a47edb153	[mips] Deal with empty blocks in the mips hazard scheduler This patch teaches the hazard scheduler how to handle empty blocks when search for the next real instruction when dealing with forbidden slots. Reviewers: slthakur Differential Revision: https://reviews.llvm.org/D31293 llvm-svn: 299427	2017-04-04 11:28:53 +00:00
Oren Ben Simhon	568fb197da	[X86] Add 64 bit pattern matching for PSADBW PSADBW pattern currently supports the 32 bit IR pattern and only GLT (greather than) comparison. The patch extends the pattern to catch also 64 bit IR pattern and includes all other comparison types (not only GLT). Differential Revision: https://reviews.llvm.org/D31577 llvm-svn: 299425	2017-04-04 10:23:18 +00:00
Jonas Hahnfeld	1f9b00117c	Align all scalar numbers to LLVM_YAML_IS_FLOW_SEQUENCE_VECTOR Otherwise, yamlize in YAMLTraits.h might be wrongly defined. This makes some AMDGPU tests fail when LLVM_LINK_LLVM_DYLIB is set. Differential Revision: https://reviews.llvm.org/D30508 llvm-svn: 299415	2017-04-04 06:02:32 +00:00
Craig Topper	e06b6bcfa1	[InstCombine] Use setAllBits in place of getAllOnesValue since we know the bitwidths are the same. NFCI llvm-svn: 299413	2017-04-04 05:03:02 +00:00
Zvi Rackover	82bf48d8b9	InstCombine: Use the InstSimplify hook for shufflevector Summary: Start using the recently added InstSimplify hook for shuffles in the respective InstCombine visitor. Reviewers: spatel, RKSimon, craig.topper, majnemer Reviewed By: majnemer Subscribers: majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D31526 llvm-svn: 299412	2017-04-04 04:47:57 +00:00
Reid Kleckner	13fc411e39	[PDB] Save one type record copy Summary: The TypeTableBuilder provides stable storage for type records. We don't need to copy all of the bytes into a flat vector before adding it to the TpiStreamBuilder. This makes addTypeRecord take an ArrayRef<uint8_t> and a hash code to go with it, which seems like a simplification. Reviewers: ruiu, zturner, inglorion Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31634 llvm-svn: 299406	2017-04-04 00:56:34 +00:00
Reid Kleckner	c4b5d794f1	[codeview] Cope with unsorted streams in type merging Summary: MASM can produce type streams that are not topologically sorted. It can even produce type streams with circular references, but those are not common in practice. Reviewers: inglorion, ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31629 llvm-svn: 299403	2017-04-03 23:58:15 +00:00
Reid Kleckner	67cecd1e1c	[Fuzzer] Flush std::cout before aborting in CxxStringEqTest On Windows, abort() does not appear to flush std::cout. Should fix red sanitizer-windows bot. llvm-svn: 299398	2017-04-03 23:00:25 +00:00
Sanjay Patel	a4546efbc8	add/move codegen tests for and/or of setcc; NFC llvm-svn: 299396	2017-04-03 22:45:46 +00:00
Tim Northover	4e3cc794d5	Update stale doxygen links in ProgrammersManual.rst Patch by Wei-Ren Chen. llvm-svn: 299395	2017-04-03 22:24:32 +00:00
Zvi Rackover	8f460655a2	InstSimplify: Add a hook for shufflevector Summary: Add a hook for simplification of shufflevector's with the following rules: - Constant folding - NFC, as it was already being done by the default handler. - If only one of the operands is constant, constant fold the shuffle if the mask does not select elements from the variable operand - to show the hook is firing and affecting the test-cases. Reviewers: RKSimon, craig.topper, spatel, sanjoy, nlopes, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31525 llvm-svn: 299393	2017-04-03 22:05:30 +00:00
Weiming Zhao	74a7fa0594	Reland r298901 with modifications (reverted in r298932) Dont emit Mapping symbols for sections that contain only data. Summary: Dont emit mapping symbols for sections that contain only data. Reviewers: rengolin, weimingz, kparzysz, t.p.northover, peter.smith Reviewed By: t.p.northover Patched by Shankar Easwaran <shankare@codeaurora.org> Subscribers: alekseyshl, t.p.northover, llvm-commits Differential Revision: https://reviews.llvm.org/D30724 llvm-svn: 299392	2017-04-03 21:50:04 +00:00
Matt Arsenault	b600e138cc	AMDGPU: Remove llvm.SI.vs.load.input llvm-svn: 299391	2017-04-03 21:45:13 +00:00
Matt Arsenault	c82768290d	DAG: Fix missing legalization for any_extend_vector_inreg operands llvm-svn: 299389	2017-04-03 21:28:13 +00:00
Reid Kleckner	1c3b5087b7	[codeview] Add support for label type records MASM can produce these type records. llvm-svn: 299388	2017-04-03 21:25:20 +00:00
Simon Pilgrim	af33757b5d	[X86][SSE]] Lower BUILD_VECTOR with repeated elts as BUILD_VECTOR + VECTOR_SHUFFLE It can be costly to transfer from the gprs to the xmm registers and can prevent loads merging. This patch splits vXi16/vXi32/vXi64 BUILD_VECTORS that use the same operand in multiple elements into a BUILD_VECTOR with only a single insertion of each of those elements and then performs an unary shuffle to duplicate the values. There are a couple of minor regressions this patch unearths due to some missing MOVDDUP/BROADCAST folds that I will address in a future patch. Note: Now that vector shuffle lowering and combining is pretty good we should be reusing that instead of duplicating so much in LowerBUILD_VECTOR - this is the first of several patches to address this. Differential Revision: https://reviews.llvm.org/D31373 llvm-svn: 299387	2017-04-03 21:06:51 +00:00
Craig Topper	1604f0773b	[InstCombine] Remove canonicalization for (X & C1) \| C2 --> (X \| C2) & (C1\|C2) when C1 & C2 have common bits. It turns out that SimplifyDemandedInstructionBits will get called earlier and remove bits from C1 first. Effectively doing (X & (C1&C2)) \| C2. So by the time it got to this check there could be no common bits. I think the DAGCombiner has the same check but its check can be executed because it handles demanded bits later. I'll look at it next. llvm-svn: 299384	2017-04-03 20:41:47 +00:00
Amjad Aboud	0389f62879	x86 interrupt calling convention: re-align stack pointer on 64-bit if an error code was pushed The x86_64 ABI requires that the stack is 16 byte aligned on function calls. Thus, the 8-byte error code, which is pushed by the CPU for certain exceptions, leads to a misaligned stack. This results in bugs such as Bug 26413, where misaligned movaps instructions are generated. This commit fixes the misalignment by adjusting the stack pointer in these cases. The adjustment is done at the beginning of the prologue generation by subtracting another 8 bytes from the stack pointer. These additional bytes are popped again in the function epilogue. Fixes Bug 26413 Patch by Philipp Oppermann. Differential Revision: https://reviews.llvm.org/D30049 llvm-svn: 299383	2017-04-03 20:28:45 +00:00
Jun Bum Lim	dee5565869	[CodeGenPrep] move aarch64-type-promotion to CGP Summary: Move the aarch64-type-promotion pass within the existing type promotion framework in CGP. This change also support forking sexts when a new sext is required for promotion. Note that change is based on D27853 and I am submitting this out early to provide a better idea on D27853. Reviewers: jmolloy, mcrosier, javed.absar, qcolombet Reviewed By: qcolombet Subscribers: llvm-commits, aemerson, rengolin, mcrosier Differential Revision: https://reviews.llvm.org/D28680 llvm-svn: 299379	2017-04-03 19:20:07 +00:00
Craig Topper	3882613956	[DAGCombine][InstCombine] Fix inverted if condition in equivalent comments in DAGCombine and InstCombine. NFC llvm-svn: 299378	2017-04-03 19:18:48 +00:00
Joel Jones	0a5e55e819	Fix LLVMBuild.txt typo. NFC llvm-svn: 299373	2017-04-03 18:21:50 +00:00
Matt Arsenault	754dd3eaef	AMDGPU: Remove legacy bfe intrinsics llvm-svn: 299372	2017-04-03 18:08:08 +00:00
Graydon Hoare	3875cfafa1	[Support] Make printAllJSONValues public, for custom output. Summary: This changes the static method TimerGroup::printAllJSONValues from private to public, to match the static method TimerGroup::printAll. When trying to drive the reporting machinery by hand, the existing API is _almost_ flexible enough, but this entrypoint is required to intermix printing timers with other non-timer output. The underlying motive here is a Swift change to consolidate the collection of timers, LLVM statistics and other (non-assert-dependent) counters into JSON files, which requires a bit of manual intervention in LLVM's stat and timer output routines. See https://github.com/apple/swift/pull/8477 for details. Reviewers: MatzeB Reviewed By: MatzeB Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31566 llvm-svn: 299371	2017-04-03 18:04:15 +00:00
Peter Collingbourne	f5af778389	Bitcode: Remove reader support for MODULE_CODE_PURGEVALS. Support for writing this module code was removed in r73220, which was well before the LLVM 3.0 release, so we do not need to be able to understand it for backwards compatibility. Differential Revision: https://reviews.llvm.org/D31563 llvm-svn: 299370	2017-04-03 17:58:48 +00:00
Craig Topper	8698fc09de	[InstCombine] Add test cases showing how we fail to fold vector constants into selects the way we do with scalars. llvm-svn: 299369	2017-04-03 17:49:15 +00:00
Zvi Rackover	d76a4d0ac6	Revert "[DAGCombine] A shuffle of a splat is always the splat itself" This reverts commit r299047 which is incorrect because the simplification may result in incorrect propogation of undefs to users of the folded shuffle. Thanks to Andrea Di Biagio for pointing this out. llvm-svn: 299368	2017-04-03 17:41:19 +00:00
Krzysztof Parzyszek	44173f7d02	[Hexagon] Factor out some common code in HexagonEarlyIfConv.cpp, NFC llvm-svn: 299367	2017-04-03 17:26:40 +00:00
Craig Topper	79120e80b8	Revert r299337 "[InstCombine] Remove redundant combine from visitAnd" One of the tsan bots started failing at this commit. I don't see anything obviously wrong with the commit so trying this to see if it recovers. Failing log: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/6792 llvm-svn: 299366	2017-04-03 17:22:23 +00:00
Sanjay Patel	77bf622db6	[InstCombine] fix formatting for foldLogOpOfMaskedICmps and related bits; NFCI 1. Improve enum, function, and variable names. 2. Improve comments. 3. Fix variable capitalization. 4. Run clang-format. As an existing code comment suggests, this should work with vector types / splat constants too, so making this look right first will reduce the diffs needed for that change. llvm-svn: 299365	2017-04-03 16:53:12 +00:00
Craig Topper	d33ee1b960	[APInt] Move isMask and isShiftedMask out of APIntOps and into the APInt class. Implement them without memory allocation for multiword This moves the isMask and isShiftedMask functions to be class methods. They now use the MathExtras.h function for single word size and leading/trailing zeros/ones or countPopulation for the multiword size. The previous implementation made multiple temorary memory allocations to do the bitwise arithmetic operations to match the MathExtras.h implementation. Differential Revision: https://reviews.llvm.org/D31565 llvm-svn: 299362	2017-04-03 16:34:59 +00:00
Simon Pilgrim	9daf9c047d	[DAGCombiner] Check limits before accessing array element (PR32502) llvm-svn: 299361	2017-04-03 15:27:49 +00:00
Sjoerd Meijer	1179470ff8	ARMAsmParser: clean up of isImmediate functions - we are now using immediate AsmOperands so that the range check functions are tablegen'ed. - Big bonus is that error messages become much more accurate, i.e. instead of a useless "invalid operand" error message it will not say that the immediate operand must in range [x,y], which is why regression tests needed updating. More tablegen operand descriptions could probably benefit from using immediateAsmOperand, but this is a first good step to get rid of most of the nearly identical range check functions. I will address the remaining immediate operands in next clean ups. Differential Revision: https://reviews.llvm.org/D31333 llvm-svn: 299358	2017-04-03 14:50:04 +00:00
Craig Topper	d0b053d229	[InstCombine] Make foldOpWithConstantIntoOperand take a BinaryOperator instead of a generic Instruction. It blindly assumes there are two operands so make it explicit. llvm-svn: 299351	2017-04-03 07:08:08 +00:00
Craig Topper	07944f891c	[InstCombine] Remove a And transform that should be handled by SimplifyDemandedInstructionBits. NFCI llvm-svn: 299349	2017-04-03 06:02:09 +00:00
NAKAMURA Takumi	823b506403	Trailing whitespace. llvm-svn: 299344	2017-04-02 23:57:17 +00:00
NAKAMURA Takumi	37bc9f3786	Reformat. llvm-svn: 299343	2017-04-02 23:57:10 +00:00
Craig Topper	00b47eec0f	[APInt] Make use of whichWord and maskBit to simplify some code. NFC llvm-svn: 299342	2017-04-02 19:35:18 +00:00
Craig Topper	55229b780d	[APInt] Add a public typedef for the internal type of APInt use it instead of integerPart. Make APINT_BITS_PER_WORD and APINT_WORD_SIZE public. This patch is one step to attempt to unify the main APInt interface and the tc functions used by APFloat. This patch adds a WordType to APInt and uses that in all the tc functions. I've added temporary typedefs to APFloat to alias it to integerPart to keep the patch size down. I'll work on removing that in a future patch. In future patches I hope to reuse the tc functions to implement some of the main APInt functionality. I may remove APINT_ from BITS_PER_WORD and WORD_SIZE constants so that we don't have the repetitive APInt::APINT_ externally. Differential Revision: https://reviews.llvm.org/D31523 llvm-svn: 299341	2017-04-02 19:17:22 +00:00
Craig Topper	70e4f434ae	[InstCombine] Make InstCombiner::OptAndOp take a BinaryOperator instead of an Instruction. The callers have already performed the necessary cast before calling. This allows us to remove a comment that says the instruction must be a BinaryOperator and make it explicit in the argument type. Had to add a default case to the switch because BinaryOperator::getOpcode() returns a BinaryOps enum. llvm-svn: 299339	2017-04-02 17:57:30 +00:00
Simon Pilgrim	0e2f8cd875	[X86][MMX] Improve support for folding fptosi from XMM to MMX llvm-svn: 299338	2017-04-02 17:45:41 +00:00
Craig Topper	d133591a7e	[InstCombine] Remove redundant combine from visitAnd As far as I can tell this combine is fully handled by SimplifyDemandedInstructionBits. I was only looking at this because it is the only user of APIntOps::isShiftedMask which is itself broken. As demonstrated by r299187. I was going to fix isShiftedMask and needed to make sure we had coverage for the new cases it would expose to this combine. But looks like we can nuke it instead. Differential Revision: https://reviews.llvm.org/D31543 llvm-svn: 299337	2017-04-02 17:34:30 +00:00
Simon Pilgrim	ba28263b03	[X86][MMX] Simplify tablegen patterns by always combining MOVDQ2Q from v2i64 llvm-svn: 299336	2017-04-02 16:20:34 +00:00
Simon Pilgrim	e56a2d7b4c	[X86][MMX] Added support for subvector extraction to MMX register llvm-svn: 299335	2017-04-02 15:52:28 +00:00
NAKAMURA Takumi	2f181584cb	APInt.h: Prune \param(s) in \returns. [-Wdocumentation] llvm-svn: 299334	2017-04-02 15:05:18 +00:00
Simon Pilgrim	7fc08a8117	Regenerate test with codegen. NFCI. llvm-svn: 299333	2017-04-02 14:21:14 +00:00
Simon Pilgrim	841ecebd7c	Regenerate test with codegen. NFCI. llvm-svn: 299332	2017-04-02 13:59:37 +00:00
Simon Pilgrim	637182f262	Regenerate test. NFCI. llvm-svn: 299331	2017-04-02 13:50:44 +00:00
Daniel Berlin	07daac8a36	NewGVN: Handle coercion of constant stores, loads, memory insts. Summary: Depends on D30928. This adds support for coercion of stores and memory instructions that do not require insertion to process. Another few tests down. I added the relevant tests from rle.ll Reviewers: davide Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D30929 llvm-svn: 299330	2017-04-02 13:23:44 +00:00
Nikolai Bozhenov	fca527af5c	[BypassSlowDivision] Do not bypass division of hash-like values Disable bypassing if one of the operands looks like a hash value. Slow division often occurs in hashtable implementations and fast division is never taken there because a hash value is extremely unlikely to have enough upper bits set to zero. A value is considered to be hash-like if it is produced by 1) XOR operation 2) Multiplication by a constant wider than the shorter type 3) PHI node with all incoming values being hash-like Differential Revision: https://reviews.llvm.org/D28200 llvm-svn: 299329	2017-04-02 13:14:30 +00:00
Simon Pilgrim	dddce31eb4	[X86][MMX] Add generic fptosi 4f32-4i32 test llvm-svn: 299328	2017-04-02 13:10:20 +00:00
Zvi Rackover	e479980686	Add another interesting shufflevector test case for InstSimplify. NFC. Test case shows opportunity to constant fold a shuffle with one variable input vector operand. llvm-svn: 299327	2017-04-02 10:42:21 +00:00
Craig Topper	15e484aa2b	[X86] Use tcAdd/tcSubtract to implement the slow case of operator+=/operator-=. llvm-svn: 299326	2017-04-02 06:59:43 +00:00
Craig Topper	b8f1068765	[APInt] Combine declaration and initialization. NFC llvm-svn: 299325	2017-04-02 06:59:41 +00:00
Craig Topper	b7d8faa231	[APInt] Simplify some code by using operator+=(uint64_t) instead of doing a more complex assignment into a temporary APInt just to use the APInt operator+=. llvm-svn: 299324	2017-04-02 06:59:38 +00:00
Craig Topper	d7ed50de26	[APInt] Fix typo in comment. NFC llvm-svn: 299323	2017-04-02 06:59:36 +00:00
Daniel Berlin	8a00270838	MemorySSA: Add support for caching clobbering access in stores Summary: This enables us to cache the clobbering access for stores, despite the fact that we can't rewrite the use-def chains themselves. Early testing shows that, after this change, for larger testcases, it will be a significant net positive (memory and time) to remove the walker caching. Reviewers: george.burgess.iv, davide Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31567 llvm-svn: 299322	2017-04-02 05:09:15 +00:00
Craig Topper	68a3ed2e1b	[APInt] Use conditional operator to simplify some code. NFC llvm-svn: 299320	2017-04-01 21:50:10 +00:00
Craig Topper	a742cb5fc8	[APInt] Implement flipAllBitsSlowCase with tcComplement. NFCI llvm-svn: 299319	2017-04-01 21:50:08 +00:00
Craig Topper	99cfe4f99d	[APInt] Fix indentation. NFC llvm-svn: 299318	2017-04-01 21:50:06 +00:00
Craig Topper	b2aaa5da42	[APInt] Implement AndAssignSlowCase using tcAnd. Do the same for Or and Xor. NFCI llvm-svn: 299317	2017-04-01 21:50:03 +00:00
Craig Topper	278ebd2f98	[APInt] Allow GreatestCommonDivisor to take rvalue inputs efficiently. Use moves instead of copies in the loop. Summary: GreatestComonDivisor currently makes a copy of both its inputs. Then in the loop we do one move and two copies, plus any allocation the urem call does. This patch changes it to take its inputs by value so that we can do a move of any rvalue inputs instead of copying. Then in the loop we do 3 move assignments and no copies. This way the only possible allocations we have in the loop is from the urem call. Reviewers: dblaikie, RKSimon, hans Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31572 llvm-svn: 299314	2017-04-01 20:30:57 +00:00
Davide Italiano	16fe5822f2	[WASM] Remove other comparison of unsigned expression >= 0. This should finally fix the GCC 7 build with -Werror. llvm-svn: 299313	2017-04-01 19:47:52 +00:00
Davide Italiano	deede839f2	[WASM] Remove a set but never used variable. llvm-svn: 299312	2017-04-01 19:40:51 +00:00
Davide Italiano	543760218a	[WASM] Remove an assertion that can never fire. uint* is by definition always >=0. llvm-svn: 299311	2017-04-01 19:37:15 +00:00
Davide Italiano	c88169e61b	[AMDGPU] Garbage collect now unused dead code. NFCI. llvm-svn: 299310	2017-04-01 19:30:17 +00:00
Sanjay Patel	8b5ad3f00e	[InstSimplify] add constant folding for fdiv/frem Also, add a helper function so we don't have to repeat this code for each binop. llvm-svn: 299309	2017-04-01 19:05:11 +00:00
Sanjay Patel	ee0f5cc41f	[InstSimplify] add tests for missed constant folding; NFC llvm-svn: 299308	2017-04-01 18:44:03 +00:00
Sanjay Patel	1fd16f073d	fix formatting; NFC llvm-svn: 299307	2017-04-01 18:40:30 +00:00
Sanjay Patel	5e94bb00b3	fix formatting; NFC llvm-svn: 299305	2017-04-01 15:53:12 +00:00
Sanjay Patel	665021e7ee	[DAGCombiner] enable vector transforms for any/all {sign} bits set/clear The code already allowed vector types in via "isInteger" (which might want a more specific name), so use splat-friendly constant predicates to match those types. llvm-svn: 299304	2017-04-01 15:05:54 +00:00
Sanjay Patel	fe9340c168	[PowerPC, x86] add vector tests for any/all {sign} bits set/clear; NFC llvm-svn: 299303	2017-04-01 14:32:18 +00:00
Daniel Berlin	43e13a2d98	MemorySSA: Update expensive checking version of def_chain_iterator for templating changes llvm-svn: 299301	2017-04-01 10:04:28 +00:00
Daniel Berlin	9a9c9ff260	NewGVN: Don't try to kill off the stored value of stores when processing the congruence class of the store. Because we use the stored value of a store as the def, it isn't dead just because it appears as a def when it comes from a store. Note: I have not hit any cases with the memory code as it is where this breaks anything, just because of what memory congruences we actually allow. In a followup that improves memory congruence, this bug actually breaks real stuff (but the verifier catches it). llvm-svn: 299300	2017-04-01 09:44:33 +00:00
Daniel Berlin	9b4984926c	NewGVN: Clean up GVNExpression memory hierarchy, restructure hash computation a bit so we don't have to redefine it for loads, stores, and calls llvm-svn: 299299	2017-04-01 09:44:29 +00:00
Daniel Berlin	871ecd90ca	NewGVN: Use def_chain iterator in singleReachablePhiPath instead of recursion llvm-svn: 299298	2017-04-01 09:44:24 +00:00
Daniel Berlin	07275c3065	Move def_chain iterator to MemorySSA.h so it can be reused llvm-svn: 299297	2017-04-01 09:44:19 +00:00
Daniel Berlin	f56cc07fd2	MemorySSA.h - make clang-format happy llvm-svn: 299296	2017-04-01 09:44:14 +00:00
Daniel Berlin	d042031f0f	MemorySSA: Push const correctness further. llvm-svn: 299295	2017-04-01 09:01:12 +00:00
Daniel Berlin	7500c5641e	MemorySSA: Kill the WalkTargetCache now that we have getBlockDefs. llvm-svn: 299294	2017-04-01 08:59:45 +00:00
Craig Topper	473a80ffcb	[APInt] Implement operator! using operator==(uint64_t). NFCI llvm-svn: 299293	2017-04-01 06:50:00 +00:00
Craig Topper	9ab8d7f9c3	[APInt] Remove the mul/urem/srem/udiv/sdiv functions from the APIntOps namespace. Replace the few usages with calls to the class methods. NFC llvm-svn: 299292	2017-04-01 05:08:57 +00:00
Craig Topper	73250168e7	[DAGCombiner] Fix fold (or (shuf A, V_0, MA), (shuf B, V_0, MB)) -> (shuf A, B, Mask) to explicitly ensure that only one of the inputs of each shuffle is a zero vector. This can only happen when we have a mix of zero and undef elements and the two vectors have a different arrangement of zeros/undefs. The shuffle should eventually be constant folded to all zeros. Fixes PR32484. llvm-svn: 299291	2017-04-01 04:26:20 +00:00
Quentin Colombet	49d70d0529	Revert "Feature generic option to setup start/stop-after/before" This reverts commit r299282. Didn't intend to commit this :( llvm-svn: 299288	2017-04-01 01:26:24 +00:00
Quentin Colombet	fc8f048c13	Revert "Localizer fun" This reverts commit r299283. Didn't intend to commit this :( llvm-svn: 299287	2017-04-01 01:26:21 +00:00
Quentin Colombet	35a47010b1	Revert "Instrument SDISel C++ patterns" This reverts commit r299284. Didn't intend to commit this :( llvm-svn: 299286	2017-04-01 01:26:17 +00:00
Quentin Colombet	7f64318938	[RegBankSelect] Support REG_SEQUENCE for generic mapping REG_SEQUENCE falls into the same category as COPY for operands mapping: - They don't have MCInstrDesc with register constraints - The input variable could use whatever register classes - It is possible to have register class already assigned to the operands In particular, given REG_SEQUENCE are always target specific because of the subreg indices. Those indices must apply to the register class of the definition of the REG_SEQUENCE and therefore, the target must set a register class to that definition. As a result, the generic code can always use that register class to derive a valid mapping for a REG_SEQUENCE. llvm-svn: 299285	2017-04-01 01:26:14 +00:00
Quentin Colombet	b43da15602	Instrument SDISel C++ patterns llvm-svn: 299284	2017-04-01 01:21:32 +00:00
Quentin Colombet	3c40b366c5	Localizer fun WIP llvm-svn: 299283	2017-04-01 01:21:28 +00:00
Quentin Colombet	ffe3053a66	Feature generic option to setup start/stop-after/before This patch refactors the code used in llc such that all the users of the addPassesToEmitFile API have access to a homogeneous way of handling start/stop-after/before options right out of the box. Previously each user would have needed to duplicate this logic and set up its own options. NFC llvm-svn: 299282	2017-04-01 01:21:24 +00:00
Peter Collingbourne	3420e4bf94	Fix a test to check assembly output instead of bitcode. llvm-svn: 299279	2017-03-31 23:22:19 +00:00
Eric Christopher	60a245e0ff	Reduce the number of times we query the subtarget for the same information. llvm-svn: 299278	2017-03-31 23:12:27 +00:00
Eric Christopher	cf965f2f03	Small cleanup to remove extraneous cast. llvm-svn: 299277	2017-03-31 23:12:24 +00:00
Konstantin Zhuravlyov	16a3b1ca76	AMDGPU/llvm-readobj: Rename RuntimeMDNoteType -> CodeObjectMetadataNoteType to match the new metadata. NFC. llvm-svn: 299275	2017-03-31 22:36:39 +00:00
Craig Topper	47fd2de304	[APInt] Fix bugs in isShiftedMask to match behavior of the similar function in MathExtras.h This removes a parameter from the routine that was responsible for a lot of the issue. It was a bit count that had to be set to the BitWidth of the APInt and would get passed to getLowBitsSet. This guaranteed the call to getLowBitsSet would create an all ones value. This was then compared to (V \| (V-1)). So the only shifted masks we detected had to have the MSB set. The one in tree user is a transform in InstCombine that never fires due to earlier transforms covering the case better. I've submitted a patch to remove it completely, but for now I've just adapted it to the new interface for isShiftedMask. llvm-svn: 299273	2017-03-31 22:23:42 +00:00
Konstantin Zhuravlyov	b55b465611	[AMDGPU] Fix typo in test filename. NFC. llvm-svn: 299271	2017-03-31 22:14:54 +00:00

1 2 3 4 5 ...

147190 Commits