llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	5d16cd9d63	[AVX512] Remove masked VPERMD/VPERMQ/VPERMILPS/VPERMILPD intrinsics. They were autoupgraded to native IR in r274498 and r274506. llvm-svn: 274519	2016-07-04 19:58:38 +00:00
Jan Vesely	991dfd7b07	AMDGPU/R600: Add indentation to VTX and TEX fetch asm strings These are printed as part of Fetch clauses. Differential Revision: http://reviews.llvm.org/D21730 llvm-svn: 274517	2016-07-04 19:45:00 +00:00
James Molloy	c3b4ed4a70	Revert "[Thumb] Reapply r272251 with a fix for PR28348" This reverts commit r274510 - it made green dragon unhappy. llvm-svn: 274512	2016-07-04 17:14:24 +00:00
James Molloy	9f019835ef	[Thumb] Reapply r272251 with a fix for PR28348 We were using DAG->getConstant instead of DAG->getTargetConstant. This meant that we could inadvertently increase the use count of a constant if stars aligned, which it did in this testcase. Increasing the use count of the constant could cause ISel to fall over (because DAGToDAG lowering assumed the constant had only one use!) Original commit message: [Thumb] Select a BIC instead of AND if the immediate can be encoded more optimally negated If an immediate is only used in an AND node, it is possible that the immediate can be more optimally materialized when negated. If this is the case, we can negate the immediate and use a BIC instead; int i(int a) { return a & 0xfffffeec; } Used to produce: ldr r1, [CONSTPOOL] ands r0, r1 CONSTPOOL: 0xfffffeec And now produces: movs r1, #255 adds r1, #20 ; Less costly immediate generation bics r0, r1 llvm-svn: 274510	2016-07-04 16:35:41 +00:00
Simon Pilgrim	02d435d2f4	[X86][AVX512] Autoupgrade the VPERMPD/VPERMQ intrinsics llvm-svn: 274506	2016-07-04 14:19:05 +00:00
Simon Pilgrim	9fca300cbe	[X86][AVX512] Autoupgrade the VPERMILPD/VPERMILPS intrinsics llvm-svn: 274498	2016-07-04 12:40:54 +00:00
Eric Liu	e617adea12	Fixed warning caused by r274402. llvm-svn: 274497	2016-07-04 12:10:08 +00:00
Nicolai Haehnle	84c9f9919a	Add writeonly IR attribute Summary: This complements the earlier addition of IntrWriteMem and IntrWriteArgMem LLVM intrinsic properties, see D18291. Also start using the attribute for memset, memcpy, and memmove intrinsics, and remove their special-casing in BasicAliasAnalysis. Reviewers: reames, joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D18714 llvm-svn: 274485	2016-07-04 08:01:29 +00:00
Craig Topper	d83f818a3e	[CodeGen] Make the code that detects if a shuffle is really a concatenation of the inputs more general purpose. We can now handle concatenation of each source multiple times. The previous code just checked for each source to appear once in either order. This also now handles an entire source vector sized piece having undef indices correctly. We now concat with UNDEF instead of using one of the sources. This is responsible for the test case change. llvm-svn: 274483	2016-07-04 06:19:35 +00:00
NAKAMURA Takumi	4cb46e6747	Reformat blank lines. llvm-svn: 274481	2016-07-04 01:26:33 +00:00
NAKAMURA Takumi	f252951e90	Reformat comment lines. llvm-svn: 274480	2016-07-04 01:26:27 +00:00
NAKAMURA Takumi	940cd9368d	Untabify. llvm-svn: 274479	2016-07-04 01:26:21 +00:00
NAKAMURA Takumi	f4c6441b01	Reformat. llvm-svn: 274478	2016-07-04 01:26:14 +00:00
Simon Pilgrim	c804751a18	[X86] Add shuffle mask rescaling helper function. NFCI. llvm-svn: 274476	2016-07-03 21:28:17 +00:00
Simon Pilgrim	8e84fcf118	[X86][AVX2] Merge unary permute matching behind the same V2.isUndef() condition. NFCI. llvm-svn: 274474	2016-07-03 20:39:42 +00:00
Simon Pilgrim	7f096de0b8	[X86][AVX512] Add support for 512-bit shuffle lowering to VPERMPD/VPERMQ llvm-svn: 274473	2016-07-03 19:50:06 +00:00
Craig Topper	d1eca0f32c	[CodeGen] Teach OR combine of shuffles involving zero vectors to better handle undef indices. Undef indices can now be treated as zeros. Or if it's undef ORed with zero, we will keep the undef. llvm-svn: 274472	2016-07-03 19:37:12 +00:00
Haicheng Wu	b71b2f622a	[MBB] add a missing corner case in UpdateTerminator() After the block placement, if a block ends with a conditional branch, but the next block is not its successor. The conditional branch should be changed to unconditional branch. This patch fixes PR28307, PR28297, PR28402. Differential Revision: http://reviews.llvm.org/D21811 llvm-svn: 274470	2016-07-03 19:14:17 +00:00
Simon Pilgrim	68ea80649b	[X86][AVX512] Add support for VPERMPD/VPERMQ masked shuffle comments llvm-svn: 274469	2016-07-03 18:40:24 +00:00
Simon Pilgrim	a0d73835b2	[X86][AVX512] Add support for 512-bit shuffle decoding of VPERMPD/VPERMQ llvm-svn: 274468	2016-07-03 18:27:37 +00:00
Simon Pilgrim	5080e7f56c	[X86][AVX] Renamed VPERMILPI shuffle comment macros to be more specific llvm-svn: 274467	2016-07-03 18:02:43 +00:00
Simon Pilgrim	dbd6db0dc7	[X86][AVX512] Add support for VPALIGNR/PSHUFD/PSHUFHW/PSHUFLW masked shuffle comments llvm-svn: 274466	2016-07-03 15:00:51 +00:00
Sanjay Patel	cbaac41856	[InstCombine] enable vector select of bools -> logic folds llvm-svn: 274465	2016-07-03 14:34:39 +00:00
Simon Pilgrim	598bdb6bfe	[X86][AVX512] Add support for UNPCK masked shuffle comments llvm-svn: 274464	2016-07-03 14:26:21 +00:00
Sanjay Patel	a1a4e100be	fix formatting; NFC llvm-svn: 274463	2016-07-03 14:08:19 +00:00
Simon Pilgrim	1f59076196	[X86][AVX512] Add support for VPERM/VSHUF masked shuffle comments llvm-svn: 274462	2016-07-03 13:55:41 +00:00
Simon Pilgrim	68f438a036	[X86][AVX512] Add support for PMOVZX masked shuffle comments llvm-svn: 274461	2016-07-03 13:33:28 +00:00
Simon Pilgrim	7c2fbdc101	[X86][AVX512] Add support for masked shuffle comments This patch adds support for including the avx512 mask register information in the mask/maskz versions of shuffle instruction comments. This initial version just adds support for MOVDDUP/MOVSHDUP/MOVSLDUP to reduce the mass of test regenerations, other shuffle instructions can be added in due course. Differential Revision: http://reviews.llvm.org/D21953 llvm-svn: 274459	2016-07-03 13:08:29 +00:00
Simon Pilgrim	129b720c18	[X86][AVX512] Add support for lowering shuffles to VPERMILPS llvm-svn: 274458	2016-07-03 12:47:21 +00:00
Sean Silva	fa6db90164	PR28400: Partly undo r274440 to bring test-suite back to life with the new PM PR28400 seems to be not an isolated issue, but a general problem related to caching analyses. We will need to discuss on llvm-dev. A test case is in the PR. llvm-svn: 274457	2016-07-03 03:35:06 +00:00
Sean Silva	997cbea05b	[PM] Some preparatory refactoring to minimize the diff of D21921 llvm-svn: 274456	2016-07-03 03:35:03 +00:00
Sean Silva	45835e731d	Remove dead TLI arg of isKnownNonNull and propagate deadness. NFC. This actually uncovered a surprisingly large chain of ultimately unused TLI args. From what I can gather, this argument is a remnant of when isKnownNonNull would look at the TLI directly. The current approach seems to be that InferFunctionAttrs runs early in the pipeline and uses TLI to annotate the TLI-dependent non-null information as return attributes. This also removes the dependence of functionattrs on TLI altogether. llvm-svn: 274455	2016-07-02 23:47:27 +00:00
Xinliang David Li	8a021317a2	[PM] Port LoopAccessInfo analysis to new PM It is implemented as a LoopAnalysis pass as discussed and agreed upon. llvm-svn: 274452	2016-07-02 21:18:40 +00:00
Simon Pilgrim	a7329dac6f	Fix spelling. llvm-svn: 274451	2016-07-02 20:21:39 +00:00
Simon Pilgrim	99e8a1aa0b	[X86][AVX512] Add support for lowering shuffles to VPERMILPD llvm-svn: 274450	2016-07-02 20:20:12 +00:00
Sean Silva	0fb7774f91	[PM] Some preparatory refactoring to minimize the diff of D21921 The main change here is just moving stuff to static functions. llvm-svn: 274446	2016-07-02 19:12:56 +00:00
Sean Silva	e2133e7c32	[PM] Preparatory cleanups to ArgumentPromotion. This pulls some obvious changes out of http://reviews.llvm.org/D21921 to minimize the diff. llvm-svn: 274445	2016-07-02 18:59:51 +00:00
Simon Pilgrim	cde7c54baa	[X86][AVX512] Add support for 512-bit PSHUFB lowering llvm-svn: 274444	2016-07-02 18:14:31 +00:00
Simon Pilgrim	77dda7c2e0	[X86][AVX512] Converted the MOVDDUP/MOVSLDUP/MOVSHDUP masked intrinsics to generic IR llvm-svn: 274443	2016-07-02 17:16:41 +00:00
Sean Silva	f2db01c626	[PM] Fix a small typo from when I ported JumpThreading llvm-svn: 274440	2016-07-02 16:16:44 +00:00
Simon Pilgrim	19adee9d84	[X86][AVX512] Autoupgrade the MOVDDUP/MOVSLDUP/MOVSHDUP intrinsics llvm-svn: 274439	2016-07-02 14:42:35 +00:00
Benjamin Kramer	52a692d28d	[DIBuilder] Remove dead code. NFC. llvm-svn: 274438	2016-07-02 13:18:38 +00:00
Benjamin Kramer	4d9d2cc77f	[Hexagon] Create global std::map lazily. This could of course be a simple binary search with no global state involved at all if someone cares enough. Just don't make everyone linking the hexagon backend pay for it on process startup and shutdown. llvm-svn: 274437	2016-07-02 13:05:12 +00:00
Simon Pilgrim	f040d8c061	[X86][AVX512] Add support for lowering shuffles to MOVDDUP/MOVSLDUP/MOVSHDUP llvm-svn: 274436	2016-07-02 12:45:03 +00:00
Benjamin Kramer	3bc1edf95b	Use arrays or initializer lists to feed ArrayRefs instead of SmallVector where possible. No functionality change intended. llvm-svn: 274431	2016-07-02 11:41:39 +00:00
Qin Zhao	b463c23c10	[esan|cfrag] Add counters for struct array accesses Summary: Adds one counter to the struct counter array for counting struct array accesses. Adds instrumentation to insert counter update for struct array accesses. Reviewers: aizatsky Subscribers: llvm-commits, bruening, eugenis, kcc, zhaoqin, vitalybuka Differential Revision: http://reviews.llvm.org/D21594 llvm-svn: 274420	2016-07-02 03:25:37 +00:00
Marcin Koscielnicki	32e8734e41	[SystemZ] Move misplaced SystemZ::TDC to non-memory opcode range. llvm-svn: 274417	2016-07-02 02:20:40 +00:00
Pirama Arumuga Nainar	9c3aec2035	Add RenderScript ArchType Summary: Add renderscript32 and renderscript64 ArchTypes. This is to configure the ABI requirement on 32-bit RenderScript that 'long' types have 64-bit size and alignment. 64-bit RenderScript is the same as AArch64, but is added here for completeness. Reviewers: echristo, rsmith Subscribers: aemerson, jfb, rampitec, dschuff, mehdi_amini, llvm-commits, srhines Differential Revision: http://reviews.llvm.org/D21333 llvm-svn: 274412	2016-07-02 00:23:09 +00:00
Michael Kuperstein	071d8306b0	[PM] Port ConstantHoisting to the new Pass Manager Differential Revision: http://reviews.llvm.org/D21945 llvm-svn: 274411	2016-07-02 00:16:47 +00:00
Reid Kleckner	e092dad72c	[codeview] Set the Nested and Scoped ClassOptions based on the scope chain These are set on both the declaration record and the definition record. llvm-svn: 274410	2016-07-02 00:11:07 +00:00
Matt Arsenault	3add3a40a4	LoadStoreVectorizer: Fix warning about extra semicolon llvm-svn: 274406	2016-07-01 23:26:54 +00:00
Matt Arsenault	accddacb70	TII: Fix inlineasm size counting comments as insts The main problem was counting comments on their own line as instructions. llvm-svn: 274405	2016-07-01 23:26:50 +00:00
Matt Arsenault	28aaf45c10	PeepholeOptimizer: Relax assert Allow implicit defs llvm-svn: 274402	2016-07-01 23:15:06 +00:00
David Majnemer	08bd744c2c	[CodeView] Include the offset of nested members Given something like: struct S { int a; struct { int b; }; }; We would fail to give 'b' offset 4. Instead, we would give it the offset it has inside of its struct. llvm-svn: 274400	2016-07-01 23:12:48 +00:00
David Majnemer	6bdc24e7b6	[CodeView] Pretty print anonymous scopes A namespace without a name should be written out as `anonymous namespace' while a tag type without a name should be written out as <unnamed-tag>. llvm-svn: 274399	2016-07-01 23:12:45 +00:00
Matt Arsenault	7f681ac7a9	AMDGPU: Add feature for unaligned access llvm-svn: 274398	2016-07-01 23:03:44 +00:00
Matt Arsenault	8af47a09e5	AMDGPU: Expand unaligned accesses early Due to visit order problems, in the case of an unaligned copy the legalized DAG fails to eliminate extra instructions introduced by the expansion of both unaligned parts. llvm-svn: 274397	2016-07-01 22:55:55 +00:00
Evgeniy Stepanov	b736335dc3	[msan] Fix __msan_maybe_ for non-standard type sizes. Fix incorrect calculation of the type size for __msan_maybe_warning_N call that resulted in an invalid (narrowing) zext instruction and "Assertion `castIsValid(op, S, Ty) && "Invalid cast!"' failed." Only happens in very large functions (with more than 3500 MSan checks) operating on integer types that are not power-of-two. llvm-svn: 274395	2016-07-01 22:49:59 +00:00
Matt Arsenault	327bb5ad82	AMDGPU: Improve load/store of illegal types. There was a combine before to handle the simple copy case. Split this into handling loads and stores separately. We might want to change how this handles some of the vector extloads, since this can result in large code size increases. llvm-svn: 274394	2016-07-01 22:47:50 +00:00
Reid Kleckner	ad56ea3129	[codeview] Don't record UDTs for anonymous structs MSVC makes up names for these anonymous structs, but we don't (yet). Eventually Clang should use getTypedefNameForAnonDecl() to put some name in the debug info, and we can update the test case when that happens. llvm-svn: 274391	2016-07-01 22:24:51 +00:00
Alina Sbirlea	8d8aa5dd6c	Address two correctness issues in LoadStoreVectorizer Summary: GetBoundryInstruction returns the last instruction as the instruction which follows or end(). Otherwise the last instruction in the boundry set is not being tested by isVectorizable(). Partially solve reordering of instructions. More extensive solution to follow. Reviewers: tstellarAMD, llvm-commits, jlebar Subscribers: escha, arsenm, mzolotukhin Differential Revision: http://reviews.llvm.org/D21934 llvm-svn: 274389	2016-07-01 21:44:12 +00:00
Krzysztof Parzyszek	1bba89612b	[Hexagon] Revert r274381: that was actually wrong llvm-svn: 274384	2016-07-01 20:45:19 +00:00
Krzysztof Parzyszek	a17250d8e0	[Hexagon] Use MachineOperand::readsReg instead of isUse llvm-svn: 274381	2016-07-01 20:28:30 +00:00
Reid Kleckner	6e96a4c64a	[pdb] Check the display name for <unnamed-tag>, not the linkage name This issue was encountered on libcmt.pdb, which has a type record that looks like this: Struct (0x1094) { TypeLeafKind: LF_STRUCTURE (0x1505) MemberCount: 3 Properties [ (0x200) HasUniqueName (0x200) ] FieldList: <field list> (0x1093) DerivedFrom: 0x0 VShape: 0x0 SizeOf: 4 Name: <unnamed-tag> LinkageName: .?AU<unnamed-tag>@@ } The checks for startswith/endswith "<unnamed-tag>" should look at the display name, not the linkage name. llvm-svn: 274376	2016-07-01 18:43:29 +00:00
Reid Kleckner	c92e9469c4	[codeview] Assert that our CV type records are valid We were asserting that our type records were valid when emitting assembly, but not when emitting an object file. I've been seeing lots of LNK1285 errors (corrupt PDB) during incremental debug self-host builds with the MSVC linker, and hopefully this will catch some of them earlier. llvm-svn: 274373	2016-07-01 18:05:56 +00:00
Matt Arsenault	105c2a204c	AMDGPU/SI: Enable testing several variants for si scheduler Enable testing different scheduling variants if sgpr usage is very high. It was previously disabled because of a bug in handleMove, but it has been fixed since. Patch by Axel Davy llvm-svn: 274372	2016-07-01 18:03:46 +00:00
Hans Wennborg	a3bb5f1594	Revert r274347 "[ARM] Refactor Thumb2 mul instruction descs" This caused PR28387: Assertion "#operands for dag node doesn't match .td file!" llvm-svn: 274367	2016-07-01 17:26:42 +00:00
Duncan P. N. Exon Smith	4a876eb645	CodeGen: Use MachineInstr& in RegisterCoalescer, NFC Remove a few more implicit iterator to pointer conversions by preferring MachineInstr&. llvm-svn: 274363	2016-07-01 16:43:13 +00:00
Sanjay Patel	887aa6d6ef	fix documentation comments; NFC llvm-svn: 274362	2016-07-01 16:41:59 +00:00
Duncan P. N. Exon Smith	aae6f3c95e	CodeGen: Avoid implicit conversions in TargetInstrInfo, NFC Avoid implicit conversions from MachineBasicBlock::iterator to MachineInstr* in TargetInstrInfo. llvm-svn: 274361	2016-07-01 16:38:28 +00:00
Duncan P. N. Exon Smith	b77911be02	CodeGen: Use MachineInstr& in ScheduleDAGIntrs, NFC Use MachineInstr& to avoid implicit conversions from MachineBasicBlock::iterator to MachineInstr. In one case, this could use a range-based for loop, but the other loops iterated in reverse order. One of the reverse-loops checked the MachineInstr for nullptr, a condition that is provably unreachable. (And even if my proof has a flaw, UBSan would catch the bug.) llvm-svn: 274360	2016-07-01 16:21:48 +00:00
Dehao Chen	ad2b4e1334	Do not count debug instructions when counting number of uses to reorder frame objects. Summary: The code generation should be independent of the debug info. Reviewers: zansari, davidxl, mkuper, majnemer Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D21911 llvm-svn: 274357	2016-07-01 15:40:25 +00:00
Duncan P. N. Exon Smith	eda8f5d592	CodeGen: Avoid iterator conversion in UnreachableBlockElim, NFC Avoid an unnecessary (and implicit) iterator to pointer conversion in UnreachableBlockElim by using the post-increment operator. llvm-svn: 274355	2016-07-01 15:13:09 +00:00
Duncan P. N. Exon Smith	ef105caea9	CodeGen: Use MachineInstr& in SlotIndexes.cpp, NFC Avoid implicit conversions from iterator to pointer by preferring MachineInstr& and using range-based for loops. llvm-svn: 274354	2016-07-01 15:08:52 +00:00
Duncan P. N. Exon Smith	44ed0de298	CodeGen: Use MachineInstr& in RegAllocFast, NFC Use MachineInstr& instead of MachineInstr* in RegAllocFast to avoid implicit conversions from MachineInstrBundleIterator. RAFast::spillAll and RAFast::spillVirtReg still take iterators, since their argument may be an end iterator from MachineBasicBlock::getFirstTerminator. llvm-svn: 274353	2016-07-01 15:03:37 +00:00
Sam Parker	06692203ed	[ARM] Refactor Thumb2 mul instruction descs No functional changes. Just created wrapper classes around the 3 and 4 reg mult and mac instruction classes. Differential Revision: http://reviews.llvm.org/D21549 llvm-svn: 274347	2016-07-01 12:55:49 +00:00
Benjamin Kramer	b0b52fc4c6	function_refify. NFC. While there use emplace_back to create an expensive pair. llvm-svn: 274344	2016-07-01 11:05:15 +00:00
Nikolay Haustov	beb24f5b20	Resubmit r268719 - AMDGPU/SI: Add amdgpu_kernel calling convention. Part 2. This was reverted in r268740 because of problems with corresponding Clang change. Clang change was updated and resubmitted in r274220. Check calling convention in AMDGPUMachineFunction::isKernel This will be used for AMDGPU_HSA_KERNEL symbol type in output ELF. Also, in the future unused non-kernels may be optimized. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19917 llvm-svn: 274341	2016-07-01 10:00:58 +00:00
Sam Kolton	5196b88f07	[AMDGPU] Assembler: support SDWA for VOPC instructions Summary: dst_sel and dst_unused disabled for VOPC as they have no effect on result Reviewers: artem.tamazov, tstellarAMD, vpykhtin Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D21376 llvm-svn: 274340	2016-07-01 09:59:21 +00:00
NAKAMURA Takumi	566597330a	Update libdeps; AMDGPUCodeGen requires LLVMVectorize. llvm-svn: 274339	2016-07-01 09:55:23 +00:00
Craig Topper	90d7664a22	[CodeGen] Cleanup getVectorShuffle a bit to take advantage of its new ArrayRef argument and its begin/end iterators. Also use 'int' type for number of elements and loop iterators to remove several typecasts. No functional change intended. llvm-svn: 274338	2016-07-01 06:54:51 +00:00
Craig Topper	2bd8b4b180	[CodeGen,Target] Remove the version of DAG.getVectorShuffle that takes a pointer to a mask array. Convert all callers to use the ArrayRef version. No functional change intended. For the most part this simplifies all callers. There were two places in X86 that needed an explicit makeArrayRef to shorten a statically sized array. llvm-svn: 274337	2016-07-01 06:54:47 +00:00
Eric Christopher	36e601c6dc	Add support for allowing us to create uniquely identified "COMDAT" or "ELF Group" sections while lowering. In particular, for ELF sections this is useful for creating function-specific groups that get merged into the same named section. Also use const Twine& instead of StringRef for the getELF functions while we're here. Differential Revision: http://reviews.llvm.org/D21743 llvm-svn: 274336	2016-07-01 06:07:38 +00:00
Eric Christopher	0b6537e6e5	80-column and comment fixups. llvm-svn: 274335	2016-07-01 06:07:31 +00:00
Xinliang David Li	94734eef33	[PM] refactor LoopAccessInfo code part-2 Differential Revision: http://reviews.llvm.org/D21636 llvm-svn: 274334	2016-07-01 05:59:55 +00:00
Xinliang David Li	93926acbb2	[MBP] method interface cleanup Make worklist and ehworklist member of the class so that they don't need to be passed around. llvm-svn: 274333	2016-07-01 05:46:48 +00:00
Matt Arsenault	908b9e26a6	AMDGPU: Add option to run the load/store vectorizer llvm-svn: 274329	2016-07-01 03:33:52 +00:00
Reid Kleckner	b5af11dfa3	[codeview] Add DISubprogram::ThisAdjustment Summary: This represents the adjustment applied to the implicit 'this' parameter in the prologue of a virtual method in the MS C++ ABI. The adjustment is always zero unless multiple inheritance is involved. This increases the size of DISubprogram by 8 bytes, unfortunately. The adjustment really is a signed 32-bit integer. If this size increase is too much, we could probably win it back by splitting out a subclass with info specific to virtual methods (virtuality, vindex, thisadjustment, containingType). Reviewers: aprantl, dexonsmith Subscribers: aaboud, amccarth, llvm-commits Differential Revision: http://reviews.llvm.org/D21614 llvm-svn: 274325	2016-07-01 02:41:21 +00:00
Matt Arsenault	a8576706e3	LoadStoreVectorizer: improvements: better pointer analysis If OpB has an ADD NSW/NUW, we can use that to prove that adding 1 to OpA won't wrap if OpA + 1 == OpB. Patch by Fiona Glaser llvm-svn: 274324	2016-07-01 02:16:24 +00:00
Matt Arsenault	0101ecade0	LoadStoreVectorizer: Don't increase alignment with no align set If no alignment was set on the load/stores, it would vectorize to the new type even though this increases the default alignment. llvm-svn: 274323	2016-07-01 02:09:38 +00:00
Matt Arsenault	370e8226c7	LoadStoreVectorizer: Check TTI for vec reg bit width llvm-svn: 274322	2016-07-01 02:07:22 +00:00
Matt Arsenault	42ad17059a	LoadStoreVectorizer: Fix assert when merging pointer ops This needs to use inttoptr/ptrtoint if combining an int and pointer load. If a pointer is used always do an integer load. llvm-svn: 274321	2016-07-01 01:55:52 +00:00
Duncan P. N. Exon Smith	9d1f156418	Revert "code hoisting pass based on GVN" This reverts commit r274305, since it breaks self-hosting: http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_build/22349/ http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules/builds/17232 Note that the blamelist on lab.llvm.org:8011 is incorrect. The previous build was r274299, but somehow r274305 wasn't included in the blamelist: http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules llvm-svn: 274320	2016-07-01 01:51:40 +00:00
Duncan P. N. Exon Smith	d26fdc83c9	CodeGen: Use MachineInstr& in LiveVariables API, NFC Change all the methods in LiveVariables that expect non-null MachineInstr* to take MachineInstr& and update the call sites. This clarifies the API, and designs away a class of iterator to pointer implicit conversions. llvm-svn: 274319	2016-07-01 01:51:32 +00:00
Matt Arsenault	241f34cde8	LoadStoreVectorizer: Use AA metadata This was not passing the full instruction with metadata to the alias query. llvm-svn: 274318	2016-07-01 01:47:46 +00:00
Duncan P. N. Exon Smith	1df1d1dcfc	CodeGen: Remove implicit iterator conversions in PHIElimination, NFC llvm-svn: 274317	2016-07-01 01:27:19 +00:00
Duncan P. N. Exon Smith	762c5ca3ee	CodeGen: Use MachineInstr& in PostRASchedulerList, NFC Remove another unnecessary iterator to pointer conversion. llvm-svn: 274315	2016-07-01 01:18:53 +00:00
Matt Arsenault	0994bd57fb	AMDGPU: Implement getLoadStoreVecRegBitWidth llvm-svn: 274312	2016-07-01 00:56:27 +00:00
Duncan P. N. Exon Smith	286d94884b	CodeGen: Use MachineInstr& in PostRAHazardRecognizer, NFC Convert a loop to a range-based for, using MachineInstr& instead of MachineInstr* and removing an implicit conversion from iterator to pointer. llvm-svn: 274311	2016-07-01 00:50:29 +00:00
Duncan P. N. Exon Smith	6e3ac34202	CodeGen: Use MachineInstr& in PrologEpilogInserter, NFC Use MachineInstr& over MachineInstr* to avoid implicit iterator to pointer conversions. MachineInstr*-as-nullptr was being used as a flag for whether the for loop terminated normally; I added an explicit `bool` instead. llvm-svn: 274310	2016-07-01 00:40:57 +00:00
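
The Thumb commit listed above (r274510) relies on an AND with 0xfffffeec giving the same result as a BIC (bit clear) with the negated immediate, which the commit builds as 255 + 20. The following is a minimal sketch of that identity, not LLVM code: the names MASK32, AND_IMMEDIATE, BIC_IMMEDIATE, and_form and bic_form are hypothetical, and the 32-bit register width is modelled with an explicit mask because Python integers are unbounded.

```python
MASK32 = 0xFFFFFFFF  # assumption: model 32-bit registers explicitly

AND_IMMEDIATE = 0xFFFFFEEC  # mask from the commit message's C example
BIC_IMMEDIATE = 255 + 20    # value built by "movs r1, #255" then "adds r1, #20"


def and_form(a: int) -> int:
    """Model `ands r0, r1` with r1 holding the constant-pool value."""
    return (a & AND_IMMEDIATE) & MASK32


def bic_form(a: int) -> int:
    """Model `bics r0, r1`: clear every bit of a that is set in r1."""
    return (a & ~BIC_IMMEDIATE) & MASK32


if __name__ == "__main__":
    # The BIC immediate is the 32-bit bitwise complement of the AND mask.
    assert BIC_IMMEDIATE == (~AND_IMMEDIATE & MASK32)
    # Both forms agree on a handful of sample inputs.
    for sample in (0x0, 0x113, 0xFFFFFFFF, 0x12345678, 0xDEADBEEF):
        assert and_form(sample) == bic_form(sample)
    print("AND and BIC forms agree")
```

The small immediate 0x113 can be materialized with two short instructions, whereas 0xfffffeec needed a constant-pool load, which is the saving the commit message describes.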


92296 Commits