llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	3add3a40a4	LoadStoreVectorizer: Fix warning about extra semicolon llvm-svn: 274406	2016-07-01 23:26:54 +00:00
Matt Arsenault	accddacb70	TII: Fix inlineasm size counting comments as insts The main problem was counting comments on their own line as instructions. llvm-svn: 274405	2016-07-01 23:26:50 +00:00
Matt Arsenault	28aaf45c10	PeepholeOptimizer: Relax assert Allow implicit defs llvm-svn: 274402	2016-07-01 23:15:06 +00:00
David Majnemer	08bd744c2c	[CodeView] Include the offset of nested members Given something like: struct S { int a; struct { int b; }; }; We would fail to give 'b' offset 4. Instead, we would give it the offset it has inside of it's struct. llvm-svn: 274400	2016-07-01 23:12:48 +00:00
David Majnemer	6bdc24e7b6	[CodeView] Pretty print anonymous scopes A namespace without a name should be written out as `anonymous namespace' while a tag type without a name should be written out as <unnamed-tag>. llvm-svn: 274399	2016-07-01 23:12:45 +00:00
Matt Arsenault	7f681ac7a9	AMDGPU: Add feature for unaligned access llvm-svn: 274398	2016-07-01 23:03:44 +00:00
Matt Arsenault	8af47a09e5	AMDGPU: Expand unaligned accesses early Due to visit order problems, in the case of an unaligned copy the legalized DAG fails to eliminate extra instructions introduced by the expansion of both unaligned parts. llvm-svn: 274397	2016-07-01 22:55:55 +00:00
Evgeniy Stepanov	b736335dc3	[msan] Fix __msan_maybe_ for non-standard type sizes. Fix incorrect calculation of the type size for __msan_maybe_warning_N call that resulted in an invalid (narrowing) zext instruction and "Assertion `castIsValid(op, S, Ty) && "Invalid cast!"' failed." Only happens in very large functions (with more than 3500 MSan checks) operating on integer types that are not power-of-two. llvm-svn: 274395	2016-07-01 22:49:59 +00:00
Matt Arsenault	327bb5ad82	AMDGPU: Improve load/store of illegal types. There was a combine before to handle the simple copy case. Split this into handling loads and stores separately. We might want to change how this handles some of the vector extloads, since this can result in large code size increases. llvm-svn: 274394	2016-07-01 22:47:50 +00:00
Reid Kleckner	ad56ea3129	[codeview] Don't record UDTs for anonymous structs MSVC makes up names for these anonymous structs, but we don't (yet). Eventually Clang should use getTypedefNameForAnonDecl() to put some name in the debug info, and we can update the test case when that happens. llvm-svn: 274391	2016-07-01 22:24:51 +00:00
Alina Sbirlea	8d8aa5dd6c	Address two correctness issues in LoadStoreVectorizer Summary: GetBoundryInstruction returns the last instruction as the instruction which follows or end(). Otherwise the last instruction in the boundry set is not being tested by isVectorizable(). Partially solve reordering of instructions. More extensive solution to follow. Reviewers: tstellarAMD, llvm-commits, jlebar Subscribers: escha, arsenm, mzolotukhin Differential Revision: http://reviews.llvm.org/D21934 llvm-svn: 274389	2016-07-01 21:44:12 +00:00
Krzysztof Parzyszek	1bba89612b	[Hexagon] Revert r274381: that was actually wrong llvm-svn: 274384	2016-07-01 20:45:19 +00:00
Krzysztof Parzyszek	a17250d8e0	[Hexagon] Use MachineOperand::readsReg instead of isUse llvm-svn: 274381	2016-07-01 20:28:30 +00:00
Reid Kleckner	6e96a4c64a	[pdb] Check the display name for <unnamed-tag>, not the linkage name This issue was encountered on libcmt.pdb, which has a type record that looks like this: Struct (0x1094) { TypeLeafKind: LF_STRUCTURE (0x1505) MemberCount: 3 Properties [ (0x200) HasUniqueName (0x200) ] FieldList: <field list> (0x1093) DerivedFrom: 0x0 VShape: 0x0 SizeOf: 4 Name: <unnamed-tag> LinkageName: .?AU<unnamed-tag>@@ } The checks for startswith/endswith "<unnamed-tag>" should look at the display name, not the linkage name. llvm-svn: 274376	2016-07-01 18:43:29 +00:00
Reid Kleckner	c92e9469c4	[codeview] Assert that our CV type records are valid We were asserting that our type records were valid when emitting assembly, but not when emitting an object file. I've been seeing lots of LNK1285 errors (corrupt PDB) during incremental debug self-host builds with the MSVC linker, and hopefully this will catch some of them earlier. llvm-svn: 274373	2016-07-01 18:05:56 +00:00
Matt Arsenault	105c2a204c	AMDGPU/SI: Enable testing several variants for si scheduler Enable testing different scheduling variants if sgpr usage is very high. It was previously disabled because of a bug in handleMove, but it has been fixed since. Patch by Axel Davy llvm-svn: 274372	2016-07-01 18:03:46 +00:00
Hans Wennborg	a3bb5f1594	Revert r274347 "[ARM] Refactor Thumb2 mul instruction descs" This caused PR28387: Assertion "#operands for dag node doesn't match .td file!" llvm-svn: 274367	2016-07-01 17:26:42 +00:00
Duncan P. N. Exon Smith	4a876eb645	CodeGen: Use MachineInstr& in RegisterCoalescer, NFC Remove a few more implicit iterator to pointer conversions by preferring MachineInstr&. llvm-svn: 274363	2016-07-01 16:43:13 +00:00
Sanjay Patel	887aa6d6ef	fix documentation comments; NFC llvm-svn: 274362	2016-07-01 16:41:59 +00:00
Duncan P. N. Exon Smith	aae6f3c95e	CodeGen: Avoid implicit conversions in TargetInstrInfo, NFC Avoid implicit conversions from MachineBasicBlock::iterator to MachineInstr* in TargetInstrInfo. llvm-svn: 274361	2016-07-01 16:38:28 +00:00
Duncan P. N. Exon Smith	b77911be02	CodeGen: Use MachineInstr& in ScheduleDAGIntrs, NFC Use MachineInstr& to avoid implicit conversions from MachineBasicBlock::iterator to MachineInstr. In one case, this could use a range-based for loop, but the other loops iterated in reverse order. One of the reverse-loops checked the MachineInstr for nullptr, a condition that is provably unreachable. (And even if my proof has a flaw, UBSan would catch the bug.) llvm-svn: 274360	2016-07-01 16:21:48 +00:00
Dehao Chen	ad2b4e1334	Do not count debug instructions when counting number of uses to reorder frame objects. Summary: The code generation should be independent of the debug info. Reviewers: zansari, davidxl, mkuper, majnemer Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D21911 llvm-svn: 274357	2016-07-01 15:40:25 +00:00
Duncan P. N. Exon Smith	eda8f5d592	CodeGen: Avoid iterator conversion in UnreachableBlockElim, NFC Avoid an unnecessary (and implicit) iterator to pointer conversion in UnreachableBlockElim by using the post-increment operator. llvm-svn: 274355	2016-07-01 15:13:09 +00:00
Duncan P. N. Exon Smith	ef105caea9	CodeGen: Use MachineInstr& in SlotIndexes.cpp, NFC Avoid implicit conversions from iterator to pointer by preferring MachineInstr& and using range-based for loops. llvm-svn: 274354	2016-07-01 15:08:52 +00:00
Duncan P. N. Exon Smith	44ed0de298	CodeGen: Use MachineInstr& in RegAllocFast, NFC Use MachineInstr& instead of MachineInstr* in RegAllocFast to avoid implicit conversions from MachineInstrBundleIterator. RAFast::spillAll and RAFast::spillVirtReg still take iterators, since their argument may be an end iterator from MachineBasicBlock::getFirstTerminator. llvm-svn: 274353	2016-07-01 15:03:37 +00:00
Sam Parker	06692203ed	[ARM] Refactor Thumb2 mul instruction descs No functional changes. Just created wrapper classes around the 3 and 4 reg mult and mac instruction classes. Differential Revision: http://reviews.llvm.org/D21549 llvm-svn: 274347	2016-07-01 12:55:49 +00:00
Benjamin Kramer	b0b52fc4c6	function_refify. NFC. While there use emplace_back to create an expensive pair. llvm-svn: 274344	2016-07-01 11:05:15 +00:00
Nikolay Haustov	beb24f5b20	Resubmit r268719 - AMDGPU/SI: Add amdgpu_kernel calling convention. Part 2. This was reverted in r268740 because of problems with corresponding Clang change. Clang change was updated and resubmitted in r274220. Check calling convention in AMDGPUMachineFunction::isKernel This will be used for AMDGPU_HSA_KERNEL symbol type in output ELF. Also, in the future unused non-kernels may be optimized. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19917 llvm-svn: 274341	2016-07-01 10:00:58 +00:00
Sam Kolton	5196b88f07	[AMDGPU] Assembler: support SDWA for VOPC instructions Summary: dst_sel and dst_unused disabled for VOPC as they have no effect on result Reviewers: artem.tamazov, tstellarAMD, vpykhtin Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D21376 llvm-svn: 274340	2016-07-01 09:59:21 +00:00
NAKAMURA Takumi	566597330a	Update libdeps; AMDGPUCodeGen requires LLVMVectorize. llvm-svn: 274339	2016-07-01 09:55:23 +00:00
Craig Topper	90d7664a22	[CodeGen] Cleanup getVectorShuffle a bit to take advantage of its new ArrayRef argument and its begin/end iterators. Also use 'int' type for number of elements and loop iterators to remove several typecasts. No functional change intended. llvm-svn: 274338	2016-07-01 06:54:51 +00:00
Craig Topper	2bd8b4b180	[CodeGen,Target] Remove the version of DAG.getVectorShuffle that takes a pointer to a mask array. Convert all callers to use the ArrayRef version. No functional change intended. For the most part this simplifies all callers. There were two places in X86 that needed an explicit makeArrayRef to shorten a statically sized array. llvm-svn: 274337	2016-07-01 06:54:47 +00:00
Eric Christopher	36e601c6dc	Add support for allowing us to create uniquely identified "COMDAT" or "ELF Group" sections while lowering. In particular, for ELF sections this is useful for creating function-specific groups that get merged into the same named section. Also use const Twine& instead of StringRef for the getELF functions while we're here. Differential Revision: http://reviews.llvm.org/D21743 llvm-svn: 274336	2016-07-01 06:07:38 +00:00
Eric Christopher	0b6537e6e5	80-column and comment fixups. llvm-svn: 274335	2016-07-01 06:07:31 +00:00
Xinliang David Li	94734eef33	[PM] refactor LoopAccessInfo code part-2 Differential Revision: http://reviews.llvm.org/D21636 llvm-svn: 274334	2016-07-01 05:59:55 +00:00
Xinliang David Li	93926acbb2	[MBP] method interface cleanup Make worklist and ehworklist member of the class so that they don't need to be passed around. llvm-svn: 274333	2016-07-01 05:46:48 +00:00
Matt Arsenault	908b9e26a6	AMDGPU: Add option to run the load/store vectorizer llvm-svn: 274329	2016-07-01 03:33:52 +00:00
Reid Kleckner	b5af11dfa3	[codeview] Add DISubprogram::ThisAdjustment Summary: This represents the adjustment applied to the implicit 'this' parameter in the prologue of a virtual method in the MS C++ ABI. The adjustment is always zero unless multiple inheritance is involved. This increases the size of DISubprogram by 8 bytes, unfortunately. The adjustment really is a signed 32-bit integer. If this size increase is too much, we could probably win it back by splitting out a subclass with info specific to virtual methods (virtuality, vindex, thisadjustment, containingType). Reviewers: aprantl, dexonsmith Subscribers: aaboud, amccarth, llvm-commits Differential Revision: http://reviews.llvm.org/D21614 llvm-svn: 274325	2016-07-01 02:41:21 +00:00
Matt Arsenault	a8576706e3	LoadStoreVectorizer: improvements: better pointer analysis If OpB has an ADD NSW/NUW, we can use that to prove that adding 1 to OpA won't wrap if OpA + 1 == OpB. Patch by Fiona Glaser llvm-svn: 274324	2016-07-01 02:16:24 +00:00
Matt Arsenault	0101ecade0	LoadStoreVectorizer: Don't increase alignment with no align set If no alignment was set on the load/stores, it would vectorize to the new type even though this increases the default alignment. llvm-svn: 274323	2016-07-01 02:09:38 +00:00
Matt Arsenault	370e8226c7	LoadStoreVectorizer: Check TTI for vec reg bit width llvm-svn: 274322	2016-07-01 02:07:22 +00:00
Matt Arsenault	42ad17059a	LoadStoreVectorizer: Fix assert when merging pointer ops This needs to use inttoptr/ptrtoint if combining an int and pointer load. If a pointer is used always do an integer load. llvm-svn: 274321	2016-07-01 01:55:52 +00:00
Duncan P. N. Exon Smith	9d1f156418	Revert "code hoisting pass based on GVN" This reverts commit r274305, since it breaks self-hosting: http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_build/22349/ http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules/builds/17232 Note that the blamelist on lab.llvm.org:8011 is incorrect. The previous build was r274299, but somehow r274305 wasn't included in the blamelist: http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules llvm-svn: 274320	2016-07-01 01:51:40 +00:00
Duncan P. N. Exon Smith	d26fdc83c9	CodeGen: Use MachineInstr& in LiveVariables API, NFC Change all the methods in LiveVariables that expect non-null MachineInstr* to take MachineInstr& and update the call sites. This clarifies the API, and designs away a class of iterator to pointer implicit conversions. llvm-svn: 274319	2016-07-01 01:51:32 +00:00
Matt Arsenault	241f34cde8	LoadStoreVectorizer: Use AA metadata This was not passing the full instruction with metadata to the alias query. llvm-svn: 274318	2016-07-01 01:47:46 +00:00
Duncan P. N. Exon Smith	1df1d1dcfc	CodeGen: Remove implicit iterator conversions in PHIElimination, NFC llvm-svn: 274317	2016-07-01 01:27:19 +00:00
Duncan P. N. Exon Smith	762c5ca3ee	CodeGen: Use MachineInstr& in PostRASchedulerList, NFC Remove another unnecessary iterator to pointer conversion. llvm-svn: 274315	2016-07-01 01:18:53 +00:00
Matt Arsenault	0994bd57fb	AMDGPU: Implement getLoadStoreVecRegBitWidth llvm-svn: 274312	2016-07-01 00:56:27 +00:00
Duncan P. N. Exon Smith	286d94884b	CodeGen: Use MachineInstr& in PostRAHazardRecognizer, NFC Convert a loop to a range-based for, using MachineInstr& instead of MachineInstr* and removing an implicit conversion from iterator to pointer. llvm-svn: 274311	2016-07-01 00:50:29 +00:00
Duncan P. N. Exon Smith	6e3ac34202	CodeGen: Use MachineInstr& in PrologEpilogInserter, NFC Use MachineInstr& over MachineInstr* to avoid implicit iterator to pointer conversions. MachineInstr*-as-nullptr was being used as a flag for whether the for loop terminated normally; I added an explicit `bool` instead. llvm-svn: 274310	2016-07-01 00:40:57 +00:00
Reid Kleckner	64b16171df	[pdb] Avoid reporting an error when the module symbol stream is empty llvm-svn: 274309	2016-07-01 00:37:49 +00:00
Reid Kleckner	7aa95a9fca	[PDB] Indicate which type record failed hash validation llvm-svn: 274308	2016-07-01 00:37:25 +00:00
Matt Arsenault	d7e8898bdd	LoadStoreVectorizer: if one element of a vector is integer, default to integer. Fixes issues on some architectures where we use arithmetic ops to build vectors, which can cause bad things to happen for loads/stores of mixed types. Patch by Fiona Glaser llvm-svn: 274307	2016-07-01 00:37:01 +00:00
Matt Arsenault	8a4ab5e19f	LoadStoreVectorizer: Fix crashes on sub-byte types llvm-svn: 274306	2016-07-01 00:36:54 +00:00
Sebastian Pop	5c5798c57c	code hoisting pass based on GVN This pass hoists duplicated computations in the program. The primary goal of gvn-hoist is to reduce the size of functions before inline heuristics to reduce the total cost of function inlining. Pass written by Sebastian Pop, Aditya Kumar, Xiaoyu Hu, and Brian Rzycki. Important algorithmic contributions by Daniel Berlin under the form of reviews. Differential Revision: http://reviews.llvm.org/D19338 llvm-svn: 274305	2016-07-01 00:24:31 +00:00
Duncan P. N. Exon Smith	632987296f	Target: Remove unused arguments from overrideSchedPolicy, NFC TargetSubtargetInfo::overrideSchedPolicy takes two MachineInstr* arguments (begin and end) that invite implicit conversions from MachineInstrBundleIterator. One option would be to change their type to an iterator, but since they don't seem to have been used since the API was added in 2010, I'm deleting the dead code. llvm-svn: 274304	2016-07-01 00:23:27 +00:00
Duncan P. N. Exon Smith	cb38ffa74d	CodeGen: Use MachineInstr& in MachineSink, NFC Use MachineInstr& instead of MachineInstr* in MachineSinker to help avoid implicit conversions from iterator to pointer. llvm-svn: 274303	2016-07-01 00:11:48 +00:00
Adam Nemet	f45594c912	[LAA] Fix alphabetical sorting of headers. NFC llvm-svn: 274302	2016-07-01 00:09:02 +00:00
Duncan P. N. Exon Smith	5a7538be61	CodeGen: Use MachineInstr& more in MachineTraceMetrics, NFC Push MachineInstr& through helper APIs for consistency. This doesn't remove any more implicit conversions, but it's a nice cleanup after r274300. llvm-svn: 274301	2016-07-01 00:05:40 +00:00
Duncan P. N. Exon Smith	5d2b938bdb	CodeGen: Use MachineInstr& in MachineTraceMetrics, NFC This avoids an implicit conversion from iterator to pointer. llvm-svn: 274300	2016-06-30 23:53:20 +00:00
Matt Arsenault	079d0f19a2	LoadStoreVectorizer: Check skipFunction first. Also add test I forgot to add to r274296. llvm-svn: 274299	2016-06-30 23:50:18 +00:00
Duncan P. N. Exon Smith	c73850c702	CodeGen: Use MachineInstr& in LocalStackSlotAllocation, NFC Avoid a number of implicit conversions from iterator to pointer by using range-based for and MachineInstr&. llvm-svn: 274298	2016-06-30 23:39:46 +00:00
Duncan P. N. Exon Smith	07acb3e382	CodeGen: Use range-based for in LiveVariables, NFC Avoid an implicit iterator to pointer conversion in LiveVariables::runOnBlock by switching to a range-based for. llvm-svn: 274297	2016-06-30 23:33:35 +00:00
Matt Arsenault	2cbe52b990	LoadStoreVectorizer: Skip optnone functions llvm-svn: 274296	2016-06-30 23:30:29 +00:00
Duncan P. N. Exon Smith	9129873a93	CodeGen: Use MachineInstr& in HoistSpillHelper, NFC Avoid another few implicit conversions from iterator to pointer. llvm-svn: 274295	2016-06-30 23:28:15 +00:00
Duncan P. N. Exon Smith	fb612acff7	CodeGen: Use MachineInstr& in LDVImpl::handleDebugValue, NFC Avoid another implicit conversion from iterator to pointer. llvm-svn: 274294	2016-06-30 23:13:38 +00:00
Matt Arsenault	08debb0244	Add LoadStoreVectorizer pass This was contributed by Apple, and I've been working on minimal cleanups and generalizing it. llvm-svn: 274293	2016-06-30 23:11:38 +00:00
Duncan P. N. Exon Smith	a62287b323	CodeGen: Use MachineInstr& in ExpandISelPseudos, NFC Avoid another implicit conversion from MachineInstrBundleIterator to MachineInstr* by using MachineInstr&. llvm-svn: 274292	2016-06-30 23:09:39 +00:00
Duncan P. N. Exon Smith	0490cdeb33	CodeGen: Use MachineInstr& in IfConversion, NFC Switch to a range-based for in IfConverter::PredicateBlock and take MachineInstr& in MaySpeculate to avoid an implicit conversion from MachineBasicBlock::iterator to MachineInstr*. llvm-svn: 274290	2016-06-30 23:04:51 +00:00
Duncan P. N. Exon Smith	e4f5e4f4d1	CodeGen: Use MachineInstr& in TargetLowering, NFC This is a mechanical change to make TargetLowering API take MachineInstr& (instead of MachineInstr), since the argument is expected to be a valid MachineInstr. In one case, changed a parameter from MachineInstr to MachineBasicBlock::iterator, since it was used as an insertion point. As a side effect, this removes a bunch of MachineInstr* to MachineBasicBlock::iterator implicit conversions, a necessary step toward fixing PR26753. llvm-svn: 274287	2016-06-30 22:52:52 +00:00
David L Kreitzer	29711c0d83	Test commit. llvm-svn: 274284	2016-06-30 21:43:11 +00:00
Matt Arsenault	2ec640a62f	Don't use unchecked dyn_cast llvm-svn: 274282	2016-06-30 21:18:06 +00:00
Matt Arsenault	727e279ac4	SLPVectorizer: Move propagateMetadata to VectorUtils This will be re-used by the LoadStoreVectorizer. Fix handling of range metadata and testcase by Justin Lebar. llvm-svn: 274281	2016-06-30 21:17:59 +00:00
Matt Arsenault	c1142725bd	AMDGPU: Add m0 vgpr load loop block as successor This shows up as a verifier error when I move this earlier, not sure why it didn't before. llvm-svn: 274275	2016-06-30 20:49:28 +00:00
Mike Aizatsky	8ba86a5a48	[libFuzzer] Let user specify extra stats file. Summary: If AFL_DRIVER_EXTRA_STATS_FILENAME is set and valid, write to it peak_rss_mb and slowest_unit_time_sec. These are both stats that libFuzzer can print but afl cannot. Reviewers: kcc, aizatsky, metzman Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21742 llvm-svn: 274273	2016-06-30 20:43:06 +00:00
Yunzhong Gao	b386955adc	Add an artificial line-0 debug location when the compiler emits a call to __stack_chk_fail(). This avoids a compiler crash. Differential Revision: http://reviews.llvm.org/D21818 llvm-svn: 274263	2016-06-30 18:49:04 +00:00
Wei Mi	95685faeee	Refine the set of UniformAfterVectorization instructions. Except the seed uniform instructions (conditional branch and consecutive ptr instructions), dependencies to be added into uniform set should only be used by existing uniform instructions or intructions outside of current loop. Differential Revision: http://reviews.llvm.org/D21755 llvm-svn: 274262	2016-06-30 18:42:56 +00:00
Rafael Espindola	d86e8bb0ed	Delete MCCodeGenInfo. MC doesn't really care about CodeGen stuff, so this was just complicating target initialization. llvm-svn: 274258	2016-06-30 18:25:11 +00:00
Etienne Bergeron	078d8f69b6	revert http://reviews.llvm.org/D21101 llvm-svn: 274251	2016-06-30 17:52:24 +00:00
Zachary Turner	ab58ae8730	[pdb] Re-add code to write PDB files. Somehow all the functionality to write PDB files got removed, probably accidentally when uploading the patch perhaps the wrong one got uploaded. This re-adds all the code, as well as the corresponding test. llvm-svn: 274248	2016-06-30 17:43:00 +00:00
Etienne Bergeron	47cf4eabe6	[exceptions] Upgrade exception handlers when stack protector is used Summary: MSVC provide exception handlers with enhanced information to deal with security buffer feature (/GS). To be more secure, the security cookies (GS and SEH) are validated when unwinding the stack. The following code: ``` void f() {} void foo() { __try { f(); } __except(1) { f(); } } ``` Reviewers: majnemer, rnk Subscribers: thakis, llvm-commits, chrisha Differential Revision: http://reviews.llvm.org/D21101 llvm-svn: 274239	2016-06-30 15:36:59 +00:00
Sanjay Patel	7521e1b880	fix formatting, add TODO; NFC llvm-svn: 274238	2016-06-30 15:32:45 +00:00
Jun Bum Lim	596a3bd9ec	[DSE] Fix bug in partial overwrite tracking Summary: Found cases where DSE incorrectly add partially-overwritten intervals. Please see the test case for details. Reviewers: mcrosier, eeckstein, hfinkel Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D21859 llvm-svn: 274237	2016-06-30 15:32:20 +00:00
Sanjay Patel	7c6eab5777	[InstCombine] shrink switch conditions better (PR24766) https://llvm.org/bugs/show_bug.cgi?id=24766#c2 This removes a hack that was added for the benefit of x86 codegen. It prevented shrinking the switch condition even to smaller legal (DataLayout) types. We have a safety mechanism in CGP after: http://reviews.llvm.org/rL251857 ...so we're free to use the optimal (smallest) IR type now. Differential Revision: http://reviews.llvm.org/D12965 llvm-svn: 274233	2016-06-30 14:51:21 +00:00
Elliot Colp	bda4cb6091	Test commit llvm-svn: 274232	2016-06-30 14:42:47 +00:00
Sanjay Patel	4520d9a1f5	[InstCombine] use ConstantExpr::getBitCast() instead of creating useless instruction llvm-svn: 274229	2016-06-30 14:27:41 +00:00
Sanjay Patel	7ad98babfa	[InstCombine] extend matchSelectFromAndOr() to work with i1 scalar types If the incoming types are i1, then we don't have to pattern match any sext ops. Differential Revision: http://reviews.llvm.org/D21740 llvm-svn: 274228	2016-06-30 14:18:18 +00:00
Rafael Espindola	222a9d09f3	Don't repeat names in comments. NFC. llvm-svn: 274226	2016-06-30 12:44:52 +00:00
Rafael Espindola	db6bd02185	Delete unused includes. NFC. llvm-svn: 274225	2016-06-30 12:19:16 +00:00
Jonas Paulsson	25e193da4c	[SystemZ] Let z13 also support FeatureMiscellaneousExtensions. This processor feature had been left out by mistake from the z13 ProcessorModel. This time with updated test case. Thanks, Hans. Reviewed by Ulrich Weigand. llvm-svn: 274216	2016-06-30 07:13:56 +00:00
Pankaj Gode	f4b25547cf	[AArch64] Add Broadcom Vulcan scheduling model. Adding scheduling model for new Broadcom Vulcan core (ARMv8.1A). Differential Revision: http://reviews.llvm.org/D21728 llvm-svn: 274213	2016-06-30 06:42:31 +00:00
Craig Topper	bc56e3ba53	Use ShuffleVectorSDNode::isSplat member method instead of static method isSplatMask where the mask came directly from getMask() on a shuffle node. llvm-svn: 274208	2016-06-30 04:38:51 +00:00
David Majnemer	9319cbc045	[CodeView] Implement support for bitfields in LLVM CodeView need to know the offset of the storage allocation for a bitfield. Encode this via the "extraData" field in DIDerivedType and introduced a new flag, DIFlagBitField, to indicate whether or not a member is a bitfield. This fixes PR28162. Differential Revision: http://reviews.llvm.org/D21782 llvm-svn: 274200	2016-06-30 03:00:20 +00:00
Sanjoy Das	0da2d14766	[SCEV] Compute max be count from shift operator only if all else fails In particular, check to see if we can compute a precise trip count by exhaustively simulating the loop first. llvm-svn: 274199	2016-06-30 02:47:28 +00:00
George Burgess IV	d86e38e1db	[CFLAA] Add support for ModRef queries. This patch makes CFLAA answer some ModRef queries. Because we don't distinguish between reading/writing when making StratifiedSets, we're unable to offer any of the readonly-related answers. Patch by Jia Chen. Differential Revision: http://reviews.llvm.org/D21858 llvm-svn: 274197	2016-06-30 02:11:26 +00:00
Matthias Braun	f7493393fc	RegisterScavenging: Code cleanup; NFC - Use range based for loops - No need for some !Reg checks: isPhysicalRegister() reports false for NoRegister anyway - Do not repeat function name in documentation comment. - Do not repeat documentation comment in implementation when we already have one at the declaration. - Factor some common subexpressions out. - Change file comments to use doxygen syntax. llvm-svn: 274194	2016-06-30 00:23:54 +00:00
Marcin Koscielnicki	68747ac78e	[SystemZ] Split up PerformDAGCombine. [NFC] This function is already a bit too long, and I'm about to make it worse. llvm-svn: 274191	2016-06-30 00:08:54 +00:00
Duncan P. N. Exon Smith	9cfc75c214	CodeGen: Use MachineInstr& in TargetInstrInfo, NFC This is mostly a mechanical change to make TargetInstrInfo API take MachineInstr& (instead of MachineInstr* or MachineBasicBlock::iterator) when the argument is expected to be a valid MachineInstr. This is a general API improvement. Although it would be possible to do this one function at a time, that would demand a quadratic amount of churn since many of these functions call each other. Instead I've done everything as a block and just updated what was necessary. This is mostly mechanical fixes: adding and removing `` and `&` operators. The only non-mechanical change is to split ARMBaseInstrInfo::getOperandLatencyImpl out from ARMBaseInstrInfo::getOperandLatency. Previously, the latter took a `MachineInstr` which it updated to the instruction bundle leader; now, the latter calls the former either with the same `MachineInstr&` or the bundle leader. As a side effect, this removes a bunch of MachineInstr* to MachineBasicBlock::iterator implicit conversions, a necessary step toward fixing PR26753. Note: I updated WebAssembly, Lanai, and AVR (despite being off-by-default) since it turned out to be easy. I couldn't run tests for AVR since llc doesn't link with it turned on. llvm-svn: 274189	2016-06-30 00:01:54 +00:00
Matthias Braun	14cdab6492	PrologEpilogInserter: Some code cleanup; NFC - Use range based for - Use the more common variable names MBB and MF for MachineBasicBlock/MachineFunction variables. - Add a few const modifiers llvm-svn: 274187	2016-06-29 23:54:42 +00:00
Peter Collingbourne	8ec68fad33	Object: Replace NewArchiveIterator with a simpler NewArchiveMember class. NFCI. The NewArchiveIterator class has a problem: it requires too much context. Any memory buffers added to the archive must be stored within an Archive::Member, which must have an associated Archive. This makes it harder than necessary to create new archive members (or new archives entirely) from scratch using memory buffers. This patch replaces NewArchiveIterator with a NewArchiveMember class that stores just the memory buffer and the information that goes into the archive member header. Differential Revision: http://reviews.llvm.org/D21721 llvm-svn: 274183	2016-06-29 22:27:42 +00:00
Adam Nemet	e1af3c635c	[LV] Improve accuracy and formatting of function comment llvm-svn: 274182	2016-06-29 22:04:10 +00:00
Zachary Turner	07670b3e98	Resubmit "Update llvm command line parser to support subcommands." This fixes an issue where occurrence counts would be unexpectedly reset when parsing different parts of a command line multiple times. ORIGINAL COMMIT MESSAGE This allows command line tools to use syntaxes like the following: llvm-foo.exe command1 -o1 -o2 llvm-foo.exe command2 -p1 -p2 Where command1 and command2 contain completely different sets of valid options. This is backwards compatible with previous uses of llvm cl which did not support subcommands, as any option which specifies no optional subcommand (e.g. all existing code) goes into a special "top level" subcommand that expects dashed options to appear immediately after the program name. For example, code which is subcommand unaware would generate a command line such as the following, where no subcommand is specified: llvm-foo.exe -q1 -q2 The top level subcommand can co-exist with actual subcommands, as it is implemented as an actual subcommand which is searched if no explicit subcommand is specified. So llvm-foo.exe as specified above could be written so as to support all three aforementioned command lines simultaneously. There is one additional "special" subcommand called AllSubCommands, which can be used to inject an option into every subcommand. This is useful to support things like help, so that commands such as: llvm-foo.exe --help llvm-foo.exe command1 --help llvm-foo.exe command2 --help All work and display the help for the selected subcommand without having to explicitly go and write code to handle each one separately. This patch is submitted without an example of anything actually using subcommands, but a followup patch will convert the llvm-pdbdump tool to use subcommands. Reviewed By: beanz llvm-svn: 274171	2016-06-29 21:48:26 +00:00
Artem Belevich	4d5d7be8cc	Revert r273313 "[NVPTX] Improve lowering of byval args of device functions." The change causes llvm crash in some unoptimized builds. llvm-svn: 274163	2016-06-29 20:51:15 +00:00
Evgeniy Stepanov	a5da256f92	StackColoring for SafeStack. This is a fix for PR27842. An IR-level implementation of stack coloring tailored to work with SafeStack. It is a bit weaker than the MI implementation in that it does not the "lifetime start at first access" logic. This can be improved in the future. This patch also replaces the naive implementation of stack frame layout with a greedy algorithm that can split existing stack slots and even fit small objects inside the alignment padding of other objects. llvm-svn: 274162	2016-06-29 20:37:43 +00:00
Kevin Enderby	c60a321c6b	Change Archive::create() from ErrorOr<...> to Expected<...> and update its clients. This commit will break the next lld builds. I’ll be committing the matching change for lld next. llvm-svn: 274160	2016-06-29 20:35:44 +00:00
Tim Shen	aec68b263d	[InstCombine] Simplify and correct folding fcmps with the same children Summary: Take advantage of FCmpInst::Predicate's bit pattern and handle (fcmp , x, y) \| (fcmp , x, y) and (fcmp , x, y) & (fcmp , x, y) more consistently. Also fold more FCmpInst::FCMP_FALSE and FCmpInst::FCMP_TRUE to constants. Currently InstCombine wrongly folds (fcmp ogt, x, y) \| (fcmp ord, x, y) to (fcmp ogt, x, y); this patch also fixes that. Reviewers: spatel Subscribers: llvm-commits, iteratee, echristo Differential Revision: http://reviews.llvm.org/D21775 llvm-svn: 274156	2016-06-29 20:10:17 +00:00
Tim Shen	860a67eb4c	[InstCombine, NFC] Change the generated variable names by creating new instructions This removes some noise for D21775's test changes. llvm-svn: 274155	2016-06-29 20:10:13 +00:00
Davide Italiano	901269c8c9	[Triple] Reimplement isLittleEndian(). Now it works for arm too. Differential Revision: http://reviews.llvm.org/D21846 llvm-svn: 274154	2016-06-29 20:01:39 +00:00
Nirav Dave	8e10380b73	Permit memory operands in ins/outs instructions [x86] (PR15455) While (ins\|outs)[bwld] instructions do not take %dx as a memory operand, various unofficial references do and objdump disassembles to this format. Extend special treatment of similar (in\|out)[bwld] operations. Reviewers: craig.topper, rnk, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18837 llvm-svn: 274152	2016-06-29 19:54:27 +00:00
Nico Weber	12fdf60b75	Revert r272251, it caused PR28348. llvm-svn: 274141	2016-06-29 17:33:41 +00:00
Ahmed Bougacha	15a2f6d58c	[X86] Lower blended PACKUSes using appropriate types. When lowering two blended PACKUS, we used to disregard the types of the PACKUS inputs, indiscriminately generating a v16i8 PACKUS. This leads to non-selectable things like: (v16i8 (PACKUS (v4i32 v0), (v4i32 v1))) Instead, check that the PACKUSes have the same type, and use that as the final result type. llvm-svn: 274138	2016-06-29 16:56:09 +00:00
Benjamin Kramer	832d042078	[ManagedStatic] Reimplement double-checked locking with std::atomic. This gets rid of the memory fence in the hot path (dereferencing the ManagedStatic), trading for an extra mutex lock in the cold path (when the ManagedStatic was uninitialized). Since this only happens on the first accesses it shouldn't matter much. On strict architectures like x86 this removes any atomic instructions from the hot path. Also remove the tsan annotations, tsan knows how standard atomics work so they should be unnecessary now. llvm-svn: 274131	2016-06-29 15:04:07 +00:00
Rafael Espindola	a99ccfce1a	Drop support for creating $stubs. They are created by ld64 since OS X 10.5. llvm-svn: 274130	2016-06-29 14:59:50 +00:00
Elena Demikhovsky	5e21c94f25	Reverted patch 273864 llvm-svn: 274115	2016-06-29 10:01:06 +00:00
Marcin Koscielnicki	518cbc7cc3	[SystemZ] Add floating-point test data class instructions. These are not used by CodeGen yet - ISD combiners creating the new node will come in subsequent patches. llvm-svn: 274108	2016-06-29 07:29:07 +00:00
Vedant Kumar	34e4e477c8	Revert "[Coverage] Move logic to encode filenames and mappings into llvm (NFC)" This reverts commit 520a8298d8ef676b5da617ba3d2c7fa37381e939 (r273055). This is breaking stage2 instrumented builds with "malformed coverage data" errors. llvm-svn: 274106	2016-06-29 05:33:26 +00:00
Vedant Kumar	a30139d50c	Revert "[Coverage] Clarify ownership of a MemoryBuffer in the reader (NFC)" This reverts commit 1037ef2574adde2103ad221d63834c3e1df4a776. llvm-svn: 274105	2016-06-29 05:33:24 +00:00
Craig Topper	df7454f94b	Revert "[ValueTracking] Teach computeKnownBits for PHI nodes to compute sign bit for a recurrence with a NSW addition." This is breaking an optimizaton remark test in clang. I've identified a couple fixes for that, but want to understand it better before I commit to anything. llvm-svn: 274102	2016-06-29 04:57:00 +00:00
Adam Nemet	ad437fff53	[Diag] Add getter shouldAlwaysPrint. NFC For the new hotness attribute, the API will take the pass rather than the pass name so we can no longer play the trick of AlwaysPrint being a special pass name. This adds a getter to help the transition. There is also a corresponding clang patch. llvm-svn: 274100	2016-06-29 04:55:19 +00:00
Craig Topper	2cc199baff	[ValueTracking] Teach computeKnownBits for PHI nodes to compute sign bit for a recurrence with a NSW addition. If a operation for a recurrence is an addition with no signed wrap and both input sign bits are 0, then the result sign bit must also be 0. Similar for the negative case. I found this deficiency while playing around with a loop in the x86 backend that contained a signed division that could be optimized into an unsigned division if we could prove both inputs were positive. One of them being the loop induction variable. With this patch we can perform the conversion for this case. One of the test cases here is a contrived variation of the loop I was looking at. Differential revision: http://reviews.llvm.org/D21493 llvm-svn: 274098	2016-06-29 03:46:47 +00:00
Craig Topper	3a011de10c	[DAGCombine] Teach DAG combine to handle ORs of shuffles involving zero vectors where the zero vector is the first operand to the shuffle instead of the second. llvm-svn: 274097	2016-06-29 03:29:12 +00:00
Craig Topper	f067a043fb	[CodeGen] Make ShuffleVectorSDNode::commuteMask take a MutableArrayRef instead of SmallVectorImpl. NFC. llvm-svn: 274095	2016-06-29 03:29:06 +00:00
Eric Christopher	0c58837b1f	Revert "[InstCombine] Avoid combining the bitcast of a var that is used as both address and result of load instructions" Revert "[InstCombine] Combine A->B->A BitCast" as this appears to cause PR27996 and as discussed in http://reviews.llvm.org/D20847 This reverts commits r270135 and r263734. llvm-svn: 274094	2016-06-29 03:05:58 +00:00
Davide Italiano	941685e9f4	[Triple] Add isLittleEndian(). This allows us to query about the endianness without having to look at DataLayout. The API will be used (and tested) in lld, in order to find out the endianness of BitcodeFiles. Briefly discussed with Rafael. llvm-svn: 274090	2016-06-29 01:56:27 +00:00
Vedant Kumar	1ead14b147	[Object] Fix a -Wpessimizing-move error; clang-format; NFC llvm-svn: 274085	2016-06-29 00:37:13 +00:00
Kevin Enderby	42398051d8	Finish cleaning up most of the error handling in libObject’s MachOUniversalBinary and its clients to use the new llvm::Error model for error handling. Changed getAsArchive() from ErrorOr<...> to Expected<...> so now all interfaces there use the new llvm::Error model for return values. In the two places it had if (!Parent) this is actually a program error so changed from returning errorCodeToError(object_error::parse_failed) to calling report_fatal_error() with a message. In getObjectForArch() added error messages to its two llvm::Error return values instead of returning errorCodeToError(object_error::arch_not_found) with no error message. For the llvm-obdump, llvm-nm and llvm-size clients since the only binary files in Mach-O Universal Binaries that are supported are Mach-O files or archives with Mach-O objects, updated their logic to generate an error when a slice contains something like an ELF binary instead of ignoring it. And added a test case for that. The last error stuff to be cleaned up for libObject’s MachOUniversalBinary is the use of errorOrToExpected(Archive::create(ObjBuffer)) which needs Archive::create() to be changed from ErrorOr<...> to Expected<...> first, which I’ll work on next. llvm-svn: 274079	2016-06-28 23:16:13 +00:00
Kyle Butt	82c2290e0f	Codegen: [MBP] Add messages to asserts. NFC llvm-svn: 274075	2016-06-28 22:50:54 +00:00
Weiming Zhao	5410edddb1	[ARM] Fix 28282: cost computation for constant hoisting Summary: This fixes bug: https://llvm.org/bugs/show_bug.cgi?id=28282 Currently the cost model of constant hoisting checks the bit width of the data type of the constants. However, the actual immediate value is small enough and not need to be hoisted. This patch checks for the actual bit width needed for the constant. Reviewers: t.p.northover, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D21668 llvm-svn: 274073	2016-06-28 22:30:45 +00:00
Manman Ren	d16490dfd1	Revert r274054 to try to appease the bot llvm-svn: 274072	2016-06-28 22:20:17 +00:00
Dehao Chen	8cd84aaa6f	Relax the clearance calculating for breaking partial register dependency. Summary: LLVM assumes that large clearance will hide the partial register spill penalty. But in our experiment, 16 clearance is too small. As the inserted XOR is normally fairly cheap, we should have a higher clearance threshold to aggressively insert XORs that is necessary to break partial register dependency. Reviewers: wmi, davidxl, stoklund, zansari, myatsina, RKSimon, DavidKreitzer, mkuper, joerg, spatel Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D21560 llvm-svn: 274068	2016-06-28 21:19:34 +00:00
Chris Bieneman	92b2e8a295	[YAML] Fix YAML tags appearing before the start of sequence elements Our existing yaml::Output code writes tags immediately when mapTag is called, without any state handling. This results in tags on sequence elements being written before the element itself. For example, we see this: SomeArray: !elem_type - key1: 1 key2: 2 !elem_type2 - key3: 3 key4: 4 We should instead see: SomeArray: - !elem_type key1: 1 key2: 2 - !elem_type2 key3: 3 key4: 4 Our reader handles reading properly, so this bug only impacts writing yaml sequences with tagged elements. As a test for this I've modified the Mach-O yaml encoding to allways apply the !mach-o tag when encoding MachOYAML::Object entries. This results in the !mach-o tag appearing as expected in dumped fat files. llvm-svn: 274067	2016-06-28 21:10:26 +00:00
Zhan Jun Liau	347db3e18e	[SystemZ] Use NILL instruction instead of NILF where possible Summary: SystemZ shift instructions only use the last 6 bits of the shift amount. When the result of an AND operation is used as a shift amount, this means that we can use the NILL instruction (which operates on the last 16 bits) rather than NILF (which operates on the last 32 bits) for a 16-bit savings in instruction size. Reviewers: uweigand Subscribers: llvm-commits Author: colpell Committing on behalf of Elliot. Differential Revision: http://reviews.llvm.org/D21686 llvm-svn: 274066	2016-06-28 21:03:19 +00:00
Matthias Braun	0b9a07883d	X86FrameLowering: Check subregs when deciding prolog kill flags llvm-svn: 274057	2016-06-28 20:31:56 +00:00
Rafael Espindola	b1556c42ce	Use isPositionIndependent in a few more places. I think this converts all the simple cases that really just care about the generated code being position independent or not. The remaining uses are a bit more complicated and are checking things like "is this a library or executable" or "can this symbol be preempted". llvm-svn: 274055	2016-06-28 20:13:36 +00:00
Zachary Turner	2012d744f4	Update llvm command line parser to support subcommands. This allows command line tools to use syntaxes like the following: llvm-foo.exe command1 -o1 -o2 llvm-foo.exe command2 -p1 -p2 Where command1 and command2 contain completely different sets of valid options. This is backwards compatible with previous uses of llvm cl which did not support subcommands, as any option which specifies no optional subcommand (e.g. all existing code) goes into a special "top level" subcommand that expects dashed options to appear immediately after the program name. For example, code which is subcommand unaware would generate a command line such as the following, where no subcommand is specified: llvm-foo.exe -q1 -q2 The top level subcommand can co-exist with actual subcommands, as it is implemented as an actual subcommand which is searched if no explicit subcommand is specified. So llvm-foo.exe as specified above could be written so as to support all three aforementioned command lines simultaneously. There is one additional "special" subcommand called AllSubCommands, which can be used to inject an option into every subcommand. This is useful to support things like help, so that commands such as: llvm-foo.exe --help llvm-foo.exe command1 --help llvm-foo.exe command2 --help All work and display the help for the selected subcommand without having to explicitly go and write code to handle each one separately. This patch is submitted without an example of anything actually using subcommands, but a followup patch will convert the llvm-pdbdump tool to use subcommands. Reviewed By: beanz Differential Revision: http://reviews.llvm.org/D21485 llvm-svn: 274054	2016-06-28 20:09:47 +00:00
Krzysztof Parzyszek	5db97acfa2	Fix typo llvm-svn: 274051	2016-06-28 19:12:28 +00:00
Artur Pilipenko	7ad95ec22d	Support arbitrary addrspace pointers in masked load/store intrinsics This is a resubmittion of 263158 change after fixing the existing problem with intrinsics mangling (see LTO and intrinsics mangling llvm-dev thread for details). This patch fixes the problem which occurs when loop-vectorize tries to use @llvm.masked.load/store intrinsic for a non-default addrspace pointer. It fails with "Calling a function with a bad signature!" assertion in CallInst constructor because it tries to pass a non-default addrspace pointer to the pointer argument which has default addrspace. The fix is to add pointer type as another overloaded type to @llvm.masked.load/store intrinsics. Reviewed By: reames Differential Revision: http://reviews.llvm.org/D17270 llvm-svn: 274043	2016-06-28 18:27:25 +00:00
Matt Arsenault	eb9025d698	AMDGPU: Fix global isel crashes llvm-svn: 274039	2016-06-28 17:42:09 +00:00
Chad Rosier	02e831cf2f	Typos. NFC. llvm-svn: 274038	2016-06-28 17:19:10 +00:00
Michael Kuperstein	85de98fd24	[X86] Reorder source list alphabetically. NFC. llvm-svn: 274036	2016-06-28 17:11:15 +00:00
Matt Arsenault	254a6450dd	AMDGPU: Fix typo llvm-svn: 274034	2016-06-28 16:59:53 +00:00
Matt Arsenault	3d846501fb	AMDGPU: Remove unused function llvm-svn: 274033	2016-06-28 16:59:49 +00:00
David Majnemer	1c7d532cde	[X86] Make WRPKRU/RDPKRU pass -verify-machineinstrs The original implementation attempted to zero registers using XOR %foo, %foo. This is problematic because it constitutes a read-modify-write of a register which might not be defined. Instead, use MOV32r0 to avoid these problems; expandPostRAPseudo does the right thing here. llvm-svn: 274024	2016-06-28 16:04:46 +00:00
Rafael Espindola	5ac8f5c379	Don't pass a Reloc::Model to GVIsIndirectSymbol. It already has access to it. While at it, rename it to isGVIndirectSymbol. llvm-svn: 274023	2016-06-28 15:38:13 +00:00
Rafael Espindola	82f4631c88	Don't pass Reloc::Model to places that already have it. NFC. llvm-svn: 274022	2016-06-28 15:18:26 +00:00
Rafael Espindola	b30e66b82c	Convert more cases to isPositionIndependent(). NFC. llvm-svn: 274021	2016-06-28 14:33:28 +00:00
Rafael Espindola	6f7c280a3d	Delete dead code. NFC. llvm-svn: 274020	2016-06-28 14:26:39 +00:00
Marcin Koscielnicki	234e5a809b	[SystemZ] Save/restore r6 and r7 if function contains landing pad. This fixes PR27102. Differential Revision: http://reviews.llvm.org/D18541 llvm-svn: 274017	2016-06-28 14:13:11 +00:00
Simon Pilgrim	5f71c909f0	[X86][AVX] Peek through bitcasts to find the source of broadcasts (reapplied) AVX1 can only broadcast vectors as floats/doubles, so for 256-bit vectors we insert bitcasts if we are shuffling v8i32/v4i64 types. Unfortunately the presence of these bitcasts prevents the current broadcast lowering code from peeking through cases where we have concatenated / extracted vectors to create the 256-bit vectors. This patch allows us to peek through bitcasts as long as the number of elements doesn't change (i.e. element bitwidth is the same) so the broadcast index is not affected. Note this bitcast peek is different from the stage later on which doesn't care about the type and is just trying to find a load node. As we're being more aggressive with bitcasts, we also need to ensure that the broadcast type is correctly bitcasted Differential Revision: http://reviews.llvm.org/D21660 llvm-svn: 274013	2016-06-28 13:24:05 +00:00
Rafael Espindola	248cfb9752	Convert 2 more uses to shouldAssumeDSOLocal(). NFC. llvm-svn: 274009	2016-06-28 12:49:12 +00:00

1 2 3 4 5 ...

92296 Commits