llvm-project

Commit Graph

Author	SHA1	Message	Date
Zachary Turner	60667ca0b2	[pdb] Merge NamedStreamMapBuilder and NamedStreamMap. While the builder pattern has proven useful for certain other larger types, in this case it was hampering the ability to use the data structure, as for runtime access we need a map that we can efficiently read from and write to. So the two are merged into a single data structure that can efficiently be read to, written from, deserialized from bytes, and serialized to bytes. llvm-svn: 292664	2017-01-20 22:41:40 +00:00
Zachary Turner	f04d6e8d52	[PDB] Rename some files to be more intuitive. llvm-svn: 292663	2017-01-20 22:41:15 +00:00
Peter Collingbourne	e02b74e294	IPO, LTO: Plumb the summary from the LTO API into the pass manager. Differential Revision: https://reviews.llvm.org/D28840 llvm-svn: 292661	2017-01-20 22:18:52 +00:00
Sanjay Patel	0c1c70aef4	[ValueTracking] recognize variations of 'clamp' to improve codegen (PR31693) By enhancing value tracking, we allow an existing min/max canonicalization to kick in and improve codegen for several targets that have min/max instructions. Unfortunately, recognizing min/max in value tracking may cause us to hit a hack in InstCombiner::visitICmpInst() more often: http://lists.llvm.org/pipermail/llvm-dev/2017-January/109340.html ...but I'm hoping we can remove that soon. Correctness proofs based on Alive: Name: smaxmin Pre: C1 < C2 %cmp2 = icmp slt i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp slt i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %min => %cmp2 = icmp slt i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp sgt i8 %min, C1 %r = select i1 %cmp1, i8 %min, i8 C1 Name: sminmax Pre: C1 > C2 %cmp2 = icmp sgt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp sgt i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %max => %cmp2 = icmp sgt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp slt i8 %max, C1 %r = select i1 %cmp1, i8 %max, i8 C1 ---------------------------------------- Optimization: smaxmin Done: 1 Optimization is correct! ---------------------------------------- Optimization: sminmax Done: 1 Optimization is correct! Name: umaxmin Pre: C1 u< C2 %cmp2 = icmp ult i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp ult i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %min => %cmp2 = icmp ult i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp ugt i8 %min, C1 %r = select i1 %cmp1, i8 %min, i8 C1 Name: uminmax Pre: C1 u> C2 %cmp2 = icmp ugt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp ugt i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %max => %cmp2 = icmp ugt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp ult i8 %max, C1 %r = select i1 %cmp1, i8 %max, i8 C1 ---------------------------------------- Optimization: umaxmin Done: 1 Optimization is correct! ---------------------------------------- Optimization: uminmax Done: 1 Optimization is correct! llvm-svn: 292660	2017-01-20 22:18:47 +00:00
Peter Collingbourne	d88f928a5c	docs: Document that !absolute_symbol { all-ones, all-ones } means the full set. llvm-svn: 292657	2017-01-20 21:56:37 +00:00
Teresa Johnson	4566c6db87	[ThinLTO] Drop non-prevailing non-ODR weak to declarations Summary: Allow non-ODR weak/linkonce non-prevailing copies to be marked as available_externally in the index. Add support for dropping these to declarations in the backend. Reviewers: mehdi_amini, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28806 llvm-svn: 292656	2017-01-20 21:54:58 +00:00
Sanjay Patel	5eb113d2af	[InstCombine] add tests to show missed canonicalization of min/max; NFC Unfortunately, recognizing these in value tracking may cause us to hit a hack in InstCombiner::visitICmpInst() more often: http://lists.llvm.org/pipermail/llvm-dev/2017-January/109340.html ...but besides being the obviously Right Thing To Do, there's a clear codegen win from identifying these patterns for several targets. llvm-svn: 292655	2017-01-20 21:49:41 +00:00
Peter Collingbourne	f04a390099	LowerTypeTests: Implement importing of type identifiers. To import a type identifier we read the summary and create external references to the symbols defined when exporting. Differential Revision: https://reviews.llvm.org/D28546 llvm-svn: 292654	2017-01-20 21:49:34 +00:00
Daniel Sanders	df0a9a0897	[globalisel] Fix an unused variable warning when NDEBUG is defined. llvm-svn: 292653	2017-01-20 21:40:05 +00:00
Kostya Serebryany	87a3811d32	[libFuzzer] add an assert to protect against LLVMFuzzerInitialize changing argv[0] llvm-svn: 292652	2017-01-20 21:34:24 +00:00
Jan Vesely	f170504c41	AMDGPU/R600: Serialize vector trunc stores to private AS Add DUMMY_CHAIN SDNode to denote stores of interest Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=28915 Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=30411 Differential Revision: https://reviews.llvm.org/D27964 llvm-svn: 292651	2017-01-20 21:24:26 +00:00
Daniel Berlin	68af279533	NewGVN: Remove pr31686.ll, it is tested by pr31594.ll, which is much smaller and simpler llvm-svn: 292649	2017-01-20 21:04:58 +00:00
Daniel Berlin	26addef1a0	NewGVN: Fix PR 31686 and PR 31698 by rewriting store leader handling. Summary: This rewrites store expression/leader handling. We no longer use the value operand as the leader, instead, we store it separately. We also now store the stored value as part of the expression, and compare it when comparing stores for equality. This enables us to get rid of a bunch of our previous hacks and machinations, as the existing machinery takes care of everything except updating the stored value on classes. The only time we have to update it is if the storecount goes to 0, and when we do, we destroy it. Since we no longer use the value operand as the leader, during elimination, we have to use the value operand. Doing this also fixes a bunch of store forwarding cases we were missing. Any value operand we use is guaranteed to either be updated by previous eliminations, or minimized by future ones. (IE the fact that we don't use the most dominating value operand when it's not a constant does not affect anything). Sadly, this change also exposes that we didn't pay attention to the output of the pr31594.ll test, as it also very clearly exposes the same store leader bug we are fixing here. (I added pr31682.ll anyway, but maybe we think that's too large to be useful) On the plus side, propagate-ir-flags.ll now passes due to the corrected store forwarding. This change was 3 stage'd on darwin and linux, with the full test-suite. Reviewers: davide Subscribers: llvm-commits llvm-svn: 292648	2017-01-20 21:04:30 +00:00
Peter Collingbourne	ee1416037e	LowerTypeTests: Compute SizeM1BitWidth in exportTypeId. NFCI. This avoids needing to store it in a separate field in TypeIdLowering. llvm-svn: 292647	2017-01-20 20:57:40 +00:00
Kostya Serebryany	98d592cc91	[libFuzzer] experimental support for 'equivalance fuzzing' llvm-svn: 292646	2017-01-20 20:57:07 +00:00
Dan Gohman	a99b717f52	[WebAssembly] Don't create bitcast-wrappers for varargs. WebAssembly varargs functions use a significantly different ABI than non-varargs functions, and the current code in WebAssemblyFixFunctionBitcasts doesn't handle that difference. For now, just avoid creating wrapper functions in the presence of varargs. llvm-svn: 292645	2017-01-20 20:50:29 +00:00
Mehdi Amini	3bb4d01db9	[ThinLTO] Fix lazy-loading of MDString instruction attachments CFI is using intrinsics that takes MDString as arguments, and this was broken during lazy-loading of metadata. Differential Revision: https://reviews.llvm.org/D28916 llvm-svn: 292641	2017-01-20 20:29:16 +00:00
Sanjay Patel	f9594db218	[x86] add tests to show missed min/max vector codegen (PR31693) llvm-svn: 292640	2017-01-20 20:14:11 +00:00
Chris Bieneman	2e752db47a	[DWARF] [ObjectYAML] Adding APIs for unittesting Summary: This patch adds some new APIs to enable using the YAML DWARF representation in unit tests. The most basic new API is DWARFYAML::EmitDebugSections which converts a YAML string into a series of owned MemoryBuffer objects stored in a StringMap. The string map can then be used to construct a DWARFContext for parsing in place of an ObjectFile. Reviewers: dblaikie, clayborg Subscribers: mgorny, fhahn, jgosnell, aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D28828 llvm-svn: 292634	2017-01-20 19:03:14 +00:00
Haicheng Wu	201b191b82	Recommit "[InlineCost] Use TTI to check if GEP is free." #3 This is the third attemp to recommit r292526. The original summary: Currently, a GEP is considered free only if its indices are all constant. TTI::getGEPCost() can give target-specific more accurate analysis. TTI is already used for the cost of many other instructions. llvm-svn: 292633	2017-01-20 18:51:22 +00:00
Alexey Bataev	4fe77b9329	[SLP] Initial test for fix of PR31690. llvm-svn: 292631	2017-01-20 18:40:21 +00:00
Matthias Braun	856548a616	ARM: tLDR_postidx should be marked mayLoad This fixes -verify-machineinstrs complaints. llvm-svn: 292629	2017-01-20 18:30:28 +00:00
Simon Pilgrim	a50a93fcd0	[InstCombine][X86] Add MULDQ/MULUDQ undef handling llvm-svn: 292627	2017-01-20 18:20:30 +00:00
Alexey Bataev	f5677329a6	[SLP] A new test for horizontal vectorization for non-power-of-2 instructions. llvm-svn: 292626	2017-01-20 18:04:29 +00:00
Matthias Braun	2e8c11e4b3	AArch64LoadStoreOptimizer: Update kill flags when merging stores Kill flags need to be updated correctly when moving stores up/down to form store pair instructions. Those invalid flags have been ignored before but as of r290014 they are recognized when using -mllvm -verify-machineinstrs. Also simplifies test/CodeGen/AArch64/ldst-opt-dbg-limit.mir, renames it to ldst-opt.mir test and adds a new tests for this change. Differential Revision: https://reviews.llvm.org/D28875 llvm-svn: 292625	2017-01-20 18:04:27 +00:00
Petar Jovanovic	dbb39356b4	[mips] Fix debug information for __thread variable This patch fixes debug information for __thread variable on Mips using .dtprelword and .dtpreldword directives. Patch by Aleksandar Beserminji. Differential Revision: http://reviews.llvm.org/D28770 llvm-svn: 292624	2017-01-20 17:53:30 +00:00
Eugene Zelenko	734bb7bb09	[AMDGPU] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 292623	2017-01-20 17:52:16 +00:00
Wei Mi	d5f7eebd42	[RegisterCoalescing] Recommit the patch "Remove partial redundent copy". The recommit fixes a bug related with live interval update after the partial redundent copy is moved. The original patch is to solve the performance problem described in PR27827. Register coalescing sometimes cannot remove a copy because of interference. But if we can find a reverse copy in one of the predecessor block of the copy, the copy is partially redundent and we may remove the copy partially by moving it to the predecessor block without the reverse copy. Differential Revision: https://reviews.llvm.org/D28585 llvm-svn: 292621	2017-01-20 17:38:54 +00:00
Simon Pilgrim	06f125230f	[InstCombine][SSE] Tests showing missed opportunities to handle muldq/muludq with undef arguments Fixed a typo in existing test names at the same time llvm-svn: 292619	2017-01-20 17:06:38 +00:00
Haicheng Wu	71ef5bc0ff	Revert "Recommit "[InlineCost] Use TTI to check if GEP is free." #2" This reverts commit r292616 because the test case still has problem. llvm-svn: 292618	2017-01-20 16:52:22 +00:00
Haicheng Wu	8f34ae2aae	Recommit "[InlineCost] Use TTI to check if GEP is free." #2 This is the second attemp to recommit r292526. The original summary: Currently, a GEP is considered free only if its indices are all constant. TTI::getGEPCost() can give target-specific more accurate analysis. TTI is already used for the cost of many other instructions. llvm-svn: 292616	2017-01-20 16:36:34 +00:00
Simon Pilgrim	3e5b525699	Remove trailing whitespace. NFCI. llvm-svn: 292613	2017-01-20 15:15:59 +00:00
Simon Pilgrim	0da4d2bc03	[CostModel][X86] Removed unused cost. NFCI. SHL v8i32 is already handled in the SSE41 cost table llvm-svn: 292612	2017-01-20 15:14:38 +00:00
Simon Pilgrim	2817b476e8	[InstCombine][SSE] Tests showing missed opportunities to constant fold packss/packus llvm-svn: 292609	2017-01-20 13:21:30 +00:00
Sjoerd Meijer	2db2a947f6	[Thumb] Add support for tMUL in the compare instruction peephole optimizer. We also want to optimise tests like this: return a*b == 0. The MULS instruction is flag setting, so we don't need the CMP instruction but can instead branch on the result of the MULS. The generated instructions sequence for this example was: MULS, MOVS, MOVS, CMP. The MOVS instruction load the boolean values resulting from the select instruction, but these MOVS instructions are flag setting and were thus preventing this optimisation. Now we first reorder and move the MULS to before the CMP and generate sequence MOVS, MOVS, MULS, CMP so that the optimisation could trigger. Reordering of the MULS and MOVS is safe to do because the subsequent MOVS instructions just set the CPSR register and don't use it, i.e. the CPSR is dead. Differential Revision: https://reviews.llvm.org/D27990 llvm-svn: 292608	2017-01-20 13:10:12 +00:00
Simon Pilgrim	8942722cbc	[InstCombine][SSE] Tests showing missed opportunities to handle packss/packus with undef arguments llvm-svn: 292601	2017-01-20 11:28:07 +00:00
Benjamin Kramer	11590b8281	Pacify -Wreorder. llvm-svn: 292599	2017-01-20 10:37:53 +00:00
Mehdi Amini	2737989245	Add an assertion to PlaceholderQueue destructor, ensuring it has been flushed llvm-svn: 292597	2017-01-20 10:18:32 +00:00
Sam Kolton	07dbde214b	[AMDGPU] Add subtarget features for SDWA/DPP Reviewers: vpykhtin, artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28900 llvm-svn: 292596	2017-01-20 10:01:25 +00:00
Chandler Carruth	3cdf650770	[PM] Tidy up the spacing of this new, much nicer test file. llvm-svn: 292592	2017-01-20 09:30:03 +00:00
Simon Pilgrim	51b3b98e3a	[InstCombine][SSE] Add DemandedElts support for PACKSS/PACKUS instructions Simplify a packss/packus truncation based on the elements of the mask that are actually demanded. Differential Revision: https://reviews.llvm.org/D28777 llvm-svn: 292591	2017-01-20 09:28:21 +00:00
Chandler Carruth	e9b18e3d34	[PM] Port LoopSink to the new pass manager. Like several other loop passes (the vectorizer, etc) this pass doesn't really fit the model of a loop pass. The critical distinction is that it isn't intended to be pipelined together with other loop passes. I plan to add some documentation to the loop pass manager to make this more clear on that side. LoopSink is also different because it doesn't really need a lot of the infrastructure of our loop passes. For example, if there aren't loop invariant instructions causing a preheader to exist, there is no need to form a preheader. It also doesn't need LCSSA because this pass is only involved in sinking invariant instructions from a preheader into the loop, not reasoning about live-outs. This allows some nice simplifications to the pass in the new PM where we can directly walk the loops once without restructuring them. Differential Revision: https://reviews.llvm.org/D28921 llvm-svn: 292589	2017-01-20 08:42:19 +00:00
Chandler Carruth	1725c8c315	[LoopSink] Trivial comment cleanup. llvm-svn: 292588	2017-01-20 08:42:14 +00:00
Diana Picus	bd66b7dc87	[ARM] Use helpers for adding pred / CC operands. NFC Hunt down some of the places where we use bare addReg(0) or addImm(AL).addReg(0) and replace with add(condCodeOp()) and add(predOps()). This should make it easier to understand what those operands represent (without having to look at the definition of the instruction that we're adding to). Differential Revision: https://reviews.llvm.org/D27984 llvm-svn: 292587	2017-01-20 08:15:24 +00:00
Craig Topper	ae78b5dcff	[AVX-512] Fix a couple test cases to not pass an undef mask to gather intrinsic. This could break if any future optimizations taken advantage of the undef. llvm-svn: 292585	2017-01-20 07:12:30 +00:00
Jonas Paulsson	e48d7d5554	[TargetLowering] Improve comment for setOperationAction(). Add a sentence that says that the type argument can refer to either the type of a result, or that of an operand. Review: Eli Friedman. llvm-svn: 292584	2017-01-20 06:48:47 +00:00
Daniel Berlin	89fea6fd9d	NewGVN: Fix PR 31682, an overactive assert. Part of the assert has been left active for further debugging. The other part has been turned into a stat for tracking for the moment. llvm-svn: 292583	2017-01-20 06:38:41 +00:00
Mohammad Shahid	5dc021bf45	[SLP] Add a base test for jumbled store Change-Id: I905ce08a02c76a6896dcfd9629547417c99adc4a llvm-svn: 292581	2017-01-20 06:05:33 +00:00
Saleem Abdulrasool	d3cbf2683d	llvm-cxxfilt: add missing includes from previous change llvm-svn: 292580	2017-01-20 05:33:22 +00:00
Saleem Abdulrasool	f1790c2c30	llvm-cxxfilt: fix program description Fix a silly copy-paste error in the tool description. Take the opportunity to add crash stack printing which will hopefully never be needed. llvm-svn: 292579	2017-01-20 05:27:09 +00:00
Saleem Abdulrasool	a63fd043fd	llvm-cxxfilt: support `-t` to demangle types By default c++filt demangles functions, though you can optionally pass `-t` to have it decode types as well, behaving nearly identical to `__cxa_demangle`. Add support for this mode. llvm-svn: 292576	2017-01-20 04:25:26 +00:00
Matthias Braun	24553c5e55	BitVector: Fix undefined behaviour Calling reset() on an empty BitVector would call memset with a nullptr argument which is undefined behaviour. This should fix the sanitizer bot. llvm-svn: 292575	2017-01-20 04:23:08 +00:00
Matthias Braun	d9217c0b86	Revert "LiveRegUnits: Add accumulateBackward() function" This seems to be breaking some bots. This reverts commit r292543. llvm-svn: 292574	2017-01-20 03:58:42 +00:00
Saleem Abdulrasool	4d0d252288	Revert "Demangle: only demangle mangled symbols" This reverts SVN r286795. This was incorrect the demangler is expected to be able to demangle types as well as functions. This makes the behaviour of itaniumDemangle similar to __cxa_demangle once more. llvm-svn: 292573	2017-01-20 03:54:04 +00:00
Haicheng Wu	8f2aca388b	Revert "Recommit "[InlineCost] Use TTI to check if GEP is free."" This reverts commit r292570. The test still has problem. llvm-svn: 292572	2017-01-20 03:40:41 +00:00
Haicheng Wu	1af1f071ea	Recommit "[InlineCost] Use TTI to check if GEP is free." This recommits r292526 which is reverted in r292529 after fixing the test case. The original summary: Currently, a GEP is considered free only if its indices are all constant. TTI::getGEPCost() can give target-specific more accurate analysis. TTI is already used for the cost of many other instructions. llvm-svn: 292570	2017-01-20 03:09:11 +00:00
Chandler Carruth	f002264d49	[LoopInfo] Add helper methods to compute two useful orderings of the loops in a function. These are relatively confusing to talk about and compute correctly so it seems really good to write down their implementation in one place. I've replaced one place we needed this in the loop PM infrastructure and I have another place in a pending patch that wants it. We can't quite use this for the core loop PM walk because there we're sometimes working on a sub-forest. I'll add the expected unittests before committing this but wanted to make sure folks were happy with these names / comments. Credit goes to Richard Smith for the idea for naming the order where siblings are in reverse program order but the tree traversal remains preorder. Differential Revision: https://reviews.llvm.org/D28932 llvm-svn: 292569	2017-01-20 02:41:20 +00:00
Greg Parker	2873766788	[test] Remove a unwanted match for `XFAIL:`. llvm-svn: 292567	2017-01-20 02:01:04 +00:00
Ahmed Bougacha	d294823930	[AArch64][GlobalISel] Widen scalar int->fp conversions. It's incorrect to ignore the higher bits of the integer source. Teach the legalizer how to widen it. llvm-svn: 292563	2017-01-20 01:37:24 +00:00
Michael Kuperstein	568027aabb	[PM] Attempt to pacify windows bots. Another difference in type pretty-printing, this one windows-specific. llvm-svn: 292556	2017-01-20 00:47:32 +00:00
Stanislav Mekhanoshin	6ec3e3a728	[AMDGPU] Prevent spills before exec mask is restored Inline spiller can decide to move a spill as early as possible in the basic block. It will skip phis and label, but we also need to make sure it skips instructions in the basic block prologue which restore exec mask. Added isPositionLike callback in TargetInstrInfo to detect instructions which shall be skipped in addition to common phis, labels etc. Differential Revision: https://reviews.llvm.org/D27997 llvm-svn: 292554	2017-01-20 00:44:31 +00:00
Justin Bogner	e094cc4372	GlobalISel: Add a note about how we're being a bit loose with memory operands The logic in r292461 is conservatively correct, but we should revisit this later. Add a TODO so we don't forget. llvm-svn: 292553	2017-01-20 00:30:17 +00:00
Ahmed Bougacha	19252ac6f0	[AArch64][GlobalISel] Split FP conversion legalizer tests. NFC. Big functions with large vreg # are quite unwieldy to update. Change it to have one function per test (it does increase boilerplate, but makes the core hopefully more readable and maintanable). llvm-svn: 292552	2017-01-20 00:30:09 +00:00
Ahmed Bougacha	b0de0d1bfc	[AArch64][GlobalISel] Split legalizer combine tests. NFC. Big functions with large vreg # are quite unwieldy to update. This test also relied on legal s8 operations which we're considering removing. Change it to have one function per test (it does increase boilerplate, but makes the core hopefully more readable and maintanable), and use 100% legal operations throughout. llvm-svn: 292551	2017-01-20 00:30:06 +00:00
Ahmed Bougacha	bf480554df	[MIRParser] Allow generic register specification on operand. This completes r292321 by adding support for generic registers, e.g.: %2:_(s32) = G_ADD %0, %1 llvm-svn: 292550	2017-01-20 00:29:59 +00:00
Kuba Mracek	30881272e1	[lit] Limit parallelism of sanitizer tests on Darwin [llvm part, take 2] Running lit tests and unit tests of ASan and TSan on macOS has very bad performance when running with a high number of threads. This is caused by xnu (the macOS kernel), which currently doesn't handle mapping and unmapping of sanitizer shadow regions (reserved VM which are several terabytes large) very well. The situation is so bad that increasing the number of threads actually makes the total testing time larger. The macOS buildbots are affected by this. Note that we can't easily limit the number of sanitizer testing threads without affecting the rest of the tests. This patch adds a special "group" into lit, and limits the number of concurrently running tests in this group. This helps solve the contention problem, while still allowing other tests to run in full, that means running lit with -j8 will still with 8 threads, and parallelism is only limited in sanitizer tests. Differential Revision: https://reviews.llvm.org/D28420 llvm-svn: 292548	2017-01-20 00:24:32 +00:00
Justin Bogner	680663931c	GlobalISel: Only set FailedISel on dropped dbg intrinsics when using fallback It's easier to test the non-fallback path if we just drop these intrinsics for now, like we did before we added the fallback path. We'll obviously need to fix this properly, but the fixme for that is already here. llvm-svn: 292547	2017-01-20 00:24:30 +00:00
Anna Thomas	698f0deea9	[AliasAnalysis] Fences do not modify constant memory location Summary: Fence instructions are currently marked as `ModRef` for all memory locations. We can improve this for constant memory locations (such as constant globals), since fence instructions cannot modify these locations. This helps us to forward constant loads across fences (added test case in GVN). There were no changes in behaviour for similar test cases in early-cse and licm. Reviewers: dberlin, sanjoy, reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28914 llvm-svn: 292546	2017-01-20 00:21:33 +00:00
Justin Bogner	23acaf07de	GlobalISel: Pass the MachineFunction in to reportSelectionError directly Rather than trying to find MF based on the possibly-null MI we've passed in here, just pass it in directly. It's already available at all callers anyway. llvm-svn: 292544	2017-01-20 00:16:19 +00:00
Matthias Braun	3ffeb68869	LiveRegUnits: Add accumulateBackward() function This function can be used to accumulate the set of all read and modified register in a sequence of instructions. Use this code in AArch64A57FPLoadBalancing::scavengeRegister() to prove the concept. - The AArch64A57LoadBalancing code is using a backwards analysis now which is irrespective of kill flags. This is the main motivation for this change. Differential Revision: http://reviews.llvm.org/D22082 llvm-svn: 292543	2017-01-20 00:16:17 +00:00
Matthias Braun	710a4c1f3d	CodeGen: Add/Factor out LiveRegUnits class; NFCI This is a set of register units intended to track register liveness, it is similar in spirit to LivePhysRegs. You can also think of this as the liveness tracking parts of the RegisterScavenger factored out into an own class. This was proposed in http://llvm.org/PR27609 Differential Revision: http://reviews.llvm.org/D21916 llvm-svn: 292542	2017-01-20 00:16:14 +00:00
Tim Northover	3babfef11d	AArch64: fall back to DAG ISel for inline assembly. We can't currently handle "calls" to inlineasm strings so it's better to let the DAG handle it than generate rubbish. llvm-svn: 292540	2017-01-19 23:59:35 +00:00
Zachary Turner	a332fa38e9	Fix a few more build errors. llvm-svn: 292538	2017-01-19 23:44:14 +00:00
Zachary Turner	d54deaee6c	Fix incorrectly formed assert statement. llvm-svn: 292537	2017-01-19 23:41:11 +00:00
Michael Kuperstein	853e3337db	[PM] Make default pipeline test for the new PM strict Use CHECK-NEXT to verify that a test breaks whenever unexpected passes, analyses, or invalidations show up in default pipelines. The test case is constructed so that we don't expect to invalidate anything, and needs to be kept that way. The test is slightly less strict than we'd like because of differences in type pretty-printing. (Right now it does show some invalidations - all of those are intentional and temporary.) Differential Revision: https://reviews.llvm.org/D28887 llvm-svn: 292536	2017-01-19 23:39:28 +00:00
Zachary Turner	11036a909f	[pdb] Add HashTable data structure. This was being parsed / serialized ad-hoc inside the code for a specific PDB stream. But this data structure is used in multiple ways / places within the PDB format. To be able to re-use it we need to raise this code out and make it more generic. In doing so, a number of bugs are fixed in the original implementation, and support is added for growing the hash table and deleting items from the hash table, which had either been omitted or incorrect implemented in the initial version. Differential Revision: https://reviews.llvm.org/D28715 llvm-svn: 292535	2017-01-19 23:31:24 +00:00
Michael Kuperstein	c9bb572b73	Revert r292530 since it breaks buildbots. llvm-svn: 292534	2017-01-19 23:22:55 +00:00
Dehao Chen	94f369fc7f	clang-format SampleProfile.cpp (NFC) llvm-svn: 292533	2017-01-19 23:20:31 +00:00
Peter Collingbourne	58ffcfbffa	LTO: Flush the resolution file after writing to it. Without this the file could be truncated if the linker crashes. llvm-svn: 292532	2017-01-19 23:10:14 +00:00
Davide Italiano	6c2c3e07bf	[SCCP] Teach the pass how to handle `div` with overdefined operands. This can prove that: extern int f; int g() { int x = 0; for (int i = 0; i < 365; ++i) { x /= f; } return x; } always returns zero. Thanks to Sanjoy for confirming this transformation actually made sense (bugs are mine). llvm-svn: 292531	2017-01-19 23:07:51 +00:00
Michael Kuperstein	5a52af0f63	[PM] Make default pipeline test for the new PM strict Use CHECK-NEXT to verify that a test breaks whenever unexpected passes, analyses, or invalidations show up in default pipelines. The test case is constructed so that we don't expect to invalidate anything, and needs to be kept that way. (Right now it does show some invalidations - all of those are intentional and temporary.) Differential Revision: https://reviews.llvm.org/D28887 llvm-svn: 292530	2017-01-19 22:55:46 +00:00
Haicheng Wu	e036df4723	Revert "[InlineCost] Use TTI to check if GEP is free." This reverts commit r292526. The test case has problem. llvm-svn: 292529	2017-01-19 22:51:03 +00:00
Simon Pilgrim	fb32eea1b4	[SelectionDAG] Improve knownbits handling of UMIN/UMAX (PR31293) This patch improves the knownbits logic for unsigned integer min/max opcodes. For UMIN we know that the result will have the maximum of the inputs' known leading zero bits in the result, similarly for UMAX the maximum of the inputs' leading one bits. This is particularly useful for simplifying clamping patterns,. e.g. as SSE doesn't have a uitofp instruction we want to use sitofp instead where possible and for that we need to confirm that the top bit is not set. Differential Revision: https://reviews.llvm.org/D28853 llvm-svn: 292528	2017-01-19 22:41:22 +00:00
Haicheng Wu	da556345dc	[InlineCost] Use TTI to check if GEP is free. Currently, a GEP is considered free only if its indices are all constant. TTI::getGEPCost() can give target-specific more accurate analysis. TTI is already used for the cost of many other instructions. Differential Revision: https://reviews.llvm.org/D28693 llvm-svn: 292526	2017-01-19 22:28:34 +00:00
Stanislav Mekhanoshin	68257700f8	[AMDGPU] Add exec copy to LiveIntervals in SILowerControlFlow::emitElse This instruction is missing from LiveIntervals. I'm not aware of any problems because of this though. Differential Revision: https://reviews.llvm.org/D28879 llvm-svn: 292521	2017-01-19 21:26:22 +00:00
Kostya Serebryany	a44ebf4d06	[libFuzzer] ensure that entries in PersistentAutoDictionary are not empty llvm-svn: 292520	2017-01-19 21:14:47 +00:00
Davide Italiano	93c6c18a85	[SCCP] Update comment in visitBinaryOp() after recent changes. llvm-svn: 292519	2017-01-19 21:07:42 +00:00
Serge Rogatch	f83d2a25bf	[XRay][Arm] Repair XRay table emission on Arm32 and add tests to identify such problem earlier Summary: Emission of XRay table was occasionally disabled for Arm32, but this bug was not then detected because earlier (also by mistake) testing of XRay was occasionally disabled on 32-bit Arm targets. This patch should fix that problem and detect such problems in the future. This patch is one of a series, see also - https://reviews.llvm.org/D28623 Reviewers: rengolin, dberris Reviewed By: dberris Subscribers: llvm-commits, aemerson, rengolin, dberris, iid_iunknown Differential Revision: https://reviews.llvm.org/D28624 llvm-svn: 292516	2017-01-19 20:24:23 +00:00
Chad Rosier	9245e12f95	[Assembler] Improve error when unable to evaluate expression. Add a SMLoc to MCExpr. Most code does not generate or consume the SMLoc (yet). Patch by Sanne Wouda <sanne.wouda@arm.com>! Differential Revision: https://reviews.llvm.org/D28861 llvm-svn: 292515	2017-01-19 20:06:32 +00:00
Evgeniy Stepanov	f2d9a46b5f	Fix aliases to thumbfunc-based exprs to be thumbfunc. If F is a Thumb function symbol, and G = F + const, and G is a function symbol, then G is Thumb. Because what else could it be? Differential Revision: https://reviews.llvm.org/D28878 llvm-svn: 292514	2017-01-19 20:04:11 +00:00
Kostya Serebryany	38b5d3ca54	[libFuzzer] improve -minimize_crash: honor -artifact_prefix= and don't special case 2-byte inputs llvm-svn: 292511	2017-01-19 19:38:12 +00:00
Xin Tong	5ee40ba400	Improve what can be promoted in LICM. Summary: In case of non-alloca pointers, we check for whether it is a pointer from malloc-like calls and it is not captured. In such case, we can promote the pointer, as the caller will have no way to access this pointer even if there is unwinding in middle of the loop. Reviewers: hfinkel, sanjoy, reames, eli.friedman Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28834 llvm-svn: 292510	2017-01-19 19:31:40 +00:00
Kostya Serebryany	6e47a10ec7	[libFuzzer] add two tests for experimenting with equivalence fuzzing llvm-svn: 292509	2017-01-19 19:07:26 +00:00
Easwaran Raman	6c8f511f82	Add an interface to scale the frequencies of a set of blocks. The scaling is done with reference to the the new frequency of a reference block. Differential Revision: https://reviews.llvm.org/D28535 llvm-svn: 292507	2017-01-19 18:53:16 +00:00
Davide Italiano	2ef8c4e708	[InstCombine] Simplify gep (gep p, a), (b-a) Patch by Andrea Canciani. Differential Revision: https://reviews.llvm.org/D27413 llvm-svn: 292506	2017-01-19 18:51:56 +00:00
Simon Pilgrim	db101e4d57	[X86][SSE] Improve comments describing combineTruncatedArithmetic. NFCI. llvm-svn: 292502	2017-01-19 18:18:32 +00:00
Kevin Enderby	650fca5f28	Remove this test from the r292500 commit till Chris and I figure out why it is failing on a couple of build bots. llvm-svn: 292501	2017-01-19 18:07:22 +00:00
Kevin Enderby	a4579c4184	Add support for the new LC_NOTE load command. It describes a region of arbitrary data included in a Mach-O file. Its initial use is to record extra data in MH_CORE files. rdar://30001545 rdar://30001731 llvm-svn: 292500	2017-01-19 17:36:31 +00:00
Simon Pilgrim	5f2f53b106	[X86][SSE] Attempt to pre-truncate arithmetic operations that have already been extended As discussed on D28219 - it is profitable to combine trunc(binop (s/zext(x), s/zext(y)) to binop(trunc(s/zext(x)), trunc(s/zext(y))) assuming the trunc(ext()) will simplify further llvm-svn: 292493	2017-01-19 16:25:02 +00:00
Sanjay Patel	291c3d8ff2	[InstCombine] icmp Pred (shl nsw X, C1), C0 --> icmp Pred X, C0 >> C1 Try harder to fold icmp with shl nsw as discussed here: http://lists.llvm.org/pipermail/llvm-dev/2017-January/108749.html This is similar to the 'shl nuw' transforms that were added with D25913. This may eventually help solve: https://llvm.org/bugs/show_bug.cgi?id=30773 Differential Revision: https://reviews.llvm.org/D28406 llvm-svn: 292492	2017-01-19 16:12:10 +00:00
Simon Pilgrim	3b23eac71f	[X86][SSE] Added tests for pre-truncating arithmetic operations that have already been extended As discussed on D28219 - it is profitable to combine trunc(binop (s/zext(x), s/zext(y)) to binop(trunc(s/zext(x)), trunc(s/zext(y))) assuming the trunc(ext()) will simplify further llvm-svn: 292487	2017-01-19 15:03:00 +00:00
Mikael Holmen	2074e7497b	[DAG] Don't increase SDNodeOrder for dbg.value/declare. Summary: The SDNodeOrder is saved in the IROrder field in the SDNode, and this field may affects scheduling. Thus, letting dbg.value/declare increase the order numbers may in turn affect scheduling. Because of this change we also need to update the code deciding when dbg values should be output, in ScheduleDAGSDNodes.cpp/ProcessSDDbgValues. Dbg values now have the same order as the SDNode they are connected to, not the following orders. Test cases provided by Florian Hahn. Reviewers: bogner, aprantl, sunfish, atrick Reviewed By: atrick Subscribers: fhahn, probinson, andreadb, llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D25318 llvm-svn: 292485	2017-01-19 13:55:55 +00:00
Malcolm Parsons	386171ba55	[docs] Tell Doxygen to expand LLVM_ALIGNAS to nothing Summary: Docs for clang::Decl and clang::TemplateSpecializationType have not been generated since LLVM_ALIGNAS was added to them. Tell Doxygen to expand LLVM_ALIGNAS to nothing as described at https://www.stack.nl/~dimitri/doxygen/manual/preprocessing.html Reviewers: aaron.ballman, klimek, alexfh Subscribers: ioeric, cfe-commits Differential Revision: https://reviews.llvm.org/D28850 llvm-svn: 292483	2017-01-19 13:37:42 +00:00
Mikael Holmen	8bf15614fb	Test commit access, remove trailing whitespace llvm-svn: 292482	2017-01-19 13:35:13 +00:00
Kristof Beyls	e9412b4d47	[GlobalISel] Pointers are legal operands for G_SELECT on AArch64 Differential Revision: https://reviews.llvm.org/D28805 llvm-svn: 292481	2017-01-19 13:32:14 +00:00
Elena Demikhovsky	e01512cecf	Recommiting unsigned saturation with a bugfix. A test case that crached is added to avx512-trunc.ll. (PR31589) llvm-svn: 292479	2017-01-19 12:08:21 +00:00
Daniel Sanders	d64d5024a4	Re-commit: [globalisel] Tablegen-erate current Register Bank Information Summary: Adds a RegisterBank tablegen class that can be used to declare the register banks and an associated tablegen pass to generate the necessary code. Changes since first commit attempt: * Added missing guards * Added more missing guards * Found and fixed a use-after-free bug involving Twine locals Reviewers: t.p.northover, ab, rovka, qcolombet Reviewed By: qcolombet Subscribers: aditya_nandakumar, rengolin, kristof.beyls, vkalintiris, mgorny, dberris, llvm-commits, rovka Differential Revision: https://reviews.llvm.org/D27338 llvm-svn: 292478	2017-01-19 11:15:55 +00:00
Justin Bogner	ddb80aee7e	GlobalISel: Implement widening for shifts llvm-svn: 292476	2017-01-19 07:51:17 +00:00
Craig Topper	b8e92f775d	[AVX-512] Add test cases that show where we are using two subvector inserts to broadcast a 128-bit subvector into a 512-bit vector. We'd be better off using something like SHUFF32X4. If the subvector comes from a load, we convert to SUBV_BROADCAST and use a broadcast instruction. But if there is no load we keep the inserts. I think we should create the SUBV_BROADCAST even without the load and let isel use the fallback patterns that are used if the load can't be folded. This will use the SHUFF32X4 or similar instruction for the 128-bit into 512-bit case and a single insert for 128 into 256 or 256 into 512. This should be fixed so subvector broadcast intrinsics can be replaced with native IR since some of those currently lower directly to SHUFF32X4. llvm-svn: 292475	2017-01-19 07:37:45 +00:00
Craig Topper	200ea31684	[AVX-512] Support ADD/SUB/MUL of mask vectors Summary: Currently we expand and scalarize these operations, but I think we should be able to implement ADD/SUB with KXOR and MUL with KAND. We already do this for scalar i1 operations so I just extended it to vectors of i1. Reviewers: zvi, delena Reviewed By: delena Subscribers: guyblank, llvm-commits Differential Revision: https://reviews.llvm.org/D28888 llvm-svn: 292474	2017-01-19 07:12:35 +00:00
Matt Arsenault	3e6f9b5773	AMDGPU: Disable some fneg combines unless nsz For -(x + y) -> (-x) + (-y), if x == -y, this would change the result from -0.0 to 0.0. Since the fma/fmad combine is an extension of this problem it also applies there. fmul should be fine, and I don't think any of the unary operators or conversions should be a problem either. llvm-svn: 292473	2017-01-19 06:35:27 +00:00
Matt Arsenault	3b99f12a4e	AMDGPU: Remove modifiers from v_div_scale_* They seem to produce nonsense results when used. This should be applied to the release branch. llvm-svn: 292472	2017-01-19 06:04:12 +00:00
Craig Topper	c227529105	[X86] Merge LowerADD and LowerSUB into a single LowerADD_SUB since they are identical. llvm-svn: 292469	2017-01-19 03:49:29 +00:00
Mike Aizatsky	7da919b8b0	[sancov] applying blacklist to covered points too Differential Revision: https://reviews.llvm.org/D28872 llvm-svn: 292468	2017-01-19 03:49:18 +00:00
Saleem Abdulrasool	c8bcda2b56	llvm-cxxfilt: filter out invalid manglings c++filt does not attempt to demangle symbols which do not match its expected format. This means that the symbol must start with _Z or ___Z (block invocation function extension). Any other symbols are returned as is. Note that this is different from the behaviour of __cxa_demangle which will demangle fragments. llvm-svn: 292467	2017-01-19 02:58:46 +00:00
Craig Topper	b561e66384	[AVX-512] Use VSHUF instructions instead of two inserts as fallback for subvector broadcasts that can't fold the load. llvm-svn: 292466	2017-01-19 02:34:29 +00:00
Craig Topper	044662d14b	[AVX-512] Add additional test cases for broadcast intrinsics that demonstates that we don't fold the loads to use a broadcast instruction. llvm-svn: 292465	2017-01-19 02:34:25 +00:00
Michael Kuperstein	8ecc38ef85	[PM] Add LoopVectorize to the default module pipeline LV no longer "requires" LCSSA and LoopSimplify, and instead forms them internally as required. So, there's nothing preventing it from being enabled. llvm-svn: 292464	2017-01-19 02:21:54 +00:00
Peter Collingbourne	22d9d3cdce	LowerTypeTests: Implement exporting of type identifiers. Type identifiers are exported by: - Adding coarse-grained information about how to test the type identifier to the summary. - Creating symbols in the object file (aliases and absolute symbols) containing fine-grained information about the type identifier. Differential Revision: https://reviews.llvm.org/D28424 llvm-svn: 292462	2017-01-19 01:20:11 +00:00
Justin Bogner	d09c3ce6c0	GlobalISel: Implement narrowing for G_LOAD llvm-svn: 292461	2017-01-19 01:05:48 +00:00
Justin Bogner	1f5c505437	GlobalISel: Fix text wrapping in a comment. NFC llvm-svn: 292460	2017-01-19 01:04:46 +00:00
Matthias Braun	58f99615d6	Use an actual valid register in test llvm-svn: 292459	2017-01-19 01:04:08 +00:00
Dehao Chen	1ce8d6ca59	Add -debug-info-for-profiling to emit more debug info for sample pgo profile collection Summary: SamplePGO binaries built with -gmlt to collect profile. The current -gmlt debug info is limited, and we need some additional info: * start line of all subprograms * linkage name of all subprograms * standalone subprograms (functions that has neither inlined nor been inlined) This patch adds these information to the -gmlt binary. The impact on speccpu2006 binary size (size increase comparing with -g0 binary, also includes data for -g binary, which does not change with this patch): -gmlt(orig) -gmlt(patched) -g 433.milc 4.68% 5.40% 19.73% 444.namd 8.45% 8.93% 45.99% 447.dealII 97.43% 115.21% 374.89% 450.soplex 27.75% 31.88% 126.04% 453.povray 21.81% 26.16% 92.03% 470.lbm 0.60% 0.67% 1.96% 482.sphinx3 5.77% 6.47% 26.17% 400.perlbench 17.81% 19.43% 73.08% 401.bzip2 3.73% 3.92% 12.18% 403.gcc 31.75% 34.48% 122.75% 429.mcf 0.78% 0.88% 3.89% 445.gobmk 6.08% 7.92% 42.27% 456.hmmer 10.36% 11.25% 35.23% 458.sjeng 5.08% 5.42% 14.36% 462.libquantum 1.71% 1.96% 6.36% 464.h264ref 15.61% 16.56% 43.92% 471.omnetpp 11.93% 15.84% 60.09% 473.astar 3.11% 3.69% 14.18% 483.xalancbmk 56.29% 81.63% 353.22% geomean 15.60% 18.30% 57.81% Debug info size change for -gmlt binary with this patch: 433.milc 13.46% 444.namd 5.35% 447.dealII 18.21% 450.soplex 14.68% 453.povray 19.65% 470.lbm 6.03% 482.sphinx3 11.21% 400.perlbench 8.91% 401.bzip2 4.41% 403.gcc 8.56% 429.mcf 8.24% 445.gobmk 29.47% 456.hmmer 8.19% 458.sjeng 6.05% 462.libquantum 11.23% 464.h264ref 5.93% 471.omnetpp 31.89% 473.astar 16.20% 483.xalancbmk 44.62% geomean 16.83% Reviewers: davidxl, echristo, dblaikie Reviewed By: echristo, dblaikie Subscribers: aprantl, probinson, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D25434 llvm-svn: 292457	2017-01-19 00:44:11 +00:00
Michael Kuperstein	230867e583	[LV] Run loop-simplify and LCSSA explicitly instead of "requiring" them This changes the vectorizer to explicitly use the loopsimplify and lcssa utils, instead of "requiring" the transformations as if they were analyses. This is not NFC, since it changes the LCSSA behavior - we no longer run LCSSA for all loops, but rather only for the loops we expect to modify. Differential Revision: https://reviews.llvm.org/D28868 llvm-svn: 292456	2017-01-19 00:42:28 +00:00
Matthias Braun	9f21a8d787	LiveIntervalAnalysis: Cleanup; NFC - Fix doxygen comments: Do not repeat name, remove duplicated doxygen comment (on declaration + implementation), etc. - Use more range based for llvm-svn: 292455	2017-01-19 00:32:13 +00:00
Artem Belevich	3d3f6190ab	[NVPTX] Fix lowering of fp16 ISD::FNEG. There's no neg.f16 instruction, so negation has to be done via subtraction from zero. Differential Revision: https://reviews.llvm.org/D28876 llvm-svn: 292452	2017-01-19 00:14:45 +00:00
Eli Friedman	f1f49c8265	[SCEV] Make getUDivExactExpr handle non-nuw multiplies correctly. To avoid regressions, make ScalarEvolution::createSCEV a bit more clever. Also get rid of some useless code in ScalarEvolution::howFarToZero which was hiding this bug. No new testcase because it's impossible to actually expose this bug: we don't have any in-tree users of getUDivExactExpr besides the two functions I just mentioned, and they both dodged the problem. I'll try to add some interesting users in a followup. Differential Revision: https://reviews.llvm.org/D28587 llvm-svn: 292449	2017-01-18 23:56:42 +00:00
Eli Friedman	0a2174533e	Preserve domtree and loop-simplify for runtime unrolling. Mostly straightforward changes; we just didn't do the computation before. One sort of interesting change in LoopUnroll.cpp: we weren't handling dominance for children of the loop latch correctly, but foldBlockIntoPredecessor hid the problem for complete unrolling. Currently punting on loop peeling; made some minor changes to isolate that problem to LoopUnrollPeel.cpp. Adds a flag -unroll-verify-domtree; it verifies the domtree immediately after we finish updating it. This is on by default for +Asserts builds. Differential Revision: https://reviews.llvm.org/D28073 llvm-svn: 292447	2017-01-18 23:26:37 +00:00
Krzysztof Parzyszek	de44c9d857	Treat segment [B, E) as not overlapping block with boundaries [A, B) llvm-svn: 292446	2017-01-18 23:12:19 +00:00
Krzysztof Parzyszek	954dd8d9ba	[Hexagon] Remove dead defs from the live set when expanding wstores llvm-svn: 292445	2017-01-18 23:11:40 +00:00
Michael Kuperstein	d3d2925933	Revert r291670 because it introduces a crash. r291670 doesn't crash on the original testcase from PR31589, but it crashes on a slightly more complex one. PR31589 has the new reproducer. llvm-svn: 292444	2017-01-18 23:05:58 +00:00
Mehdi Amini	062b3fed4c	Improve the `-filter-print-funcs` option to skip the banner for CGSCC pass when nothing is to be printed Before, it would print a sequence of: * IR Dump After Function Integration/Inlining ** * IR Dump After Function Integration/Inlining **** * IR Dump After Function Integration/Inlining ****** ... for every single function in the module. llvm-svn: 292442	2017-01-18 21:37:11 +00:00
Sanjay Patel	cfb8a45942	[InstCombine] add tests for shl nsw with icmp eq/ne; NFCI These should be fixed with D28406. llvm-svn: 292441	2017-01-18 21:31:21 +00:00
Sanjay Patel	ae23d65a7d	[InstCombine] add an assert to make a shl+icmp transform assumption explicit; NFCI llvm-svn: 292440	2017-01-18 21:16:12 +00:00
Haicheng Wu	8ce2d14356	[CodeGenPrepare] Fix a typo in the comment. NFC. encode => endcode. Differential Revision: https://reviews.llvm.org/D28866 llvm-svn: 292438	2017-01-18 21:12:10 +00:00
Sanjay Patel	589de5ea4e	[InstCombine] remove a redundant check; NFCI I missed deleting this check when I refactored this chunk in: https://reviews.llvm.org/rL292260 llvm-svn: 292433	2017-01-18 20:09:59 +00:00
Peter Collingbourne	20a00933fb	ThinLTOBitcodeWriter: Clear comdats on filtered globals. Differential Revision: https://reviews.llvm.org/D28839 llvm-svn: 292431	2017-01-18 20:03:02 +00:00
Peter Collingbourne	10e3b12c7a	Cloning: Copy comdats when cloning globals. Differential Revision: https://reviews.llvm.org/D28838 llvm-svn: 292430	2017-01-18 20:02:31 +00:00
Michael Kuperstein	0de990da16	Fix up a comment. NFC. llvm-svn: 292425	2017-01-18 19:05:48 +00:00
Michael Kuperstein	7cefb409b0	[LV] Allow reductions that have several uses outside the loop We currently check whether a reduction has a single outside user. We don't really need to require that - we just need to make sure a single value is used externally. The number of external users of that value shouldn't actually matter. Differential Revision: https://reviews.llvm.org/D28830 llvm-svn: 292424	2017-01-18 19:02:52 +00:00
Justin Bogner	2ceeb30eb6	cmake: Only sanitize use-after-scope if the host compiler supports it In r292256, we started adding -fsanitize-use-after-scope when using the address sanitizer, but that flag wasn't always available. This fixes the config to only add the flag if the host compiler supports it. llvm-svn: 292423	2017-01-18 19:01:58 +00:00
Evandro Menezes	7960b2e19a	[AArch64] Generate literals by the little end ARM seems to prefer that long literals be formed from their little end in order to promote the fusion of the instrs pairs MOV/MOVK and MOVK/MOVK on Cortex A57 and others (v. "Cortex A57 Software Optimisation Guide", section 4.14). Differential revision: https://reviews.llvm.org/D28697 llvm-svn: 292422	2017-01-18 18:57:08 +00:00
Davide Italiano	bca9d73309	[NewGVN] We don't use postdom info anymore. Update. Differential Revision: https://reviews.llvm.org/D28842 llvm-svn: 292421	2017-01-18 18:42:28 +00:00
Mehdi Amini	67d2cc1fad	[ThinLTO] Add a recursive step in Metadata lazy-loading Summary: Without this, we're stressing the RAUW of unique nodes, which is a costly operation. This is intended to limit the number of RAUW, and is very effective on the total link-time of opt with ThinLTO, before: real 4m4.587s user 15m3.401s sys 0m23.616s after: real 3m25.261s user 12m22.132s sys 0m24.152s Reviewers: tejohnson, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28751 llvm-svn: 292420	2017-01-18 18:36:21 +00:00
Graydon Hoare	ae5d7bb4f5	[lit] Support sharding testsuites, for parallel execution. Summary: This change equips lit.py with two new options, --num-shards=M and --run-shard=N (set by default from env vars LIT_NUM_SHARDS and LIT_RUN_SHARD). The options must be used together, and N must be in 1..M. Together these options effect only test selection: they partition the testsuite into M equal-sized "shards", then select only the Nth shard. They can be used in a cluster of test machines to achieve a very crude (static) form of parallelism, with minimal configuration work. Reviewers: modocache, ddunbar Reviewed By: ddunbar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28789 llvm-svn: 292417	2017-01-18 18:12:20 +00:00
Alexey Bataev	f86cca1a42	[SLP] Add a tests for a fix for PR30787. Add a test for PR30787: Failure to beneficially vectorize 'copyable' elements in integer binary ops. llvm-svn: 292416	2017-01-18 18:07:46 +00:00
Stanislav Mekhanoshin	a4e63ead4b	[AMDGPU] Do not allow register coalescer to create big superregs Limit register coalescer by not allowing it to artificially increase size of registers beyond dword. Such super-registers are in fact register sequences and not distinct HW registers. With more super-regs we would need to allocate adjacent registers and constraint regalloc more than needed. Moreover, our super registers are overlapping. For instance we have VGPR0_VGPR1_VGPR2, VGPR1_VGPR2_VGPR3, VGPR2_VGPR3_VGPR4 etc, which complicates registers allocation even more, resulting in excessive spilling. Differential Revision: https://reviews.llvm.org/D28782 llvm-svn: 292413	2017-01-18 17:30:05 +00:00
Justin Bogner	fde0104649	GlobalISel: Implement narrowing for G_STORE Legalize stores of types that are too wide by breaking them up into sequences of smaller stores. llvm-svn: 292412	2017-01-18 17:29:54 +00:00
Justin Bogner	cb60161a25	GlobalISel: Correct copy-pasted comment. NFC llvm-svn: 292411	2017-01-18 17:28:41 +00:00
Teresa Johnson	2d384ac381	Don't create a comdat group for a dropped def with initializer Non-prevailing weak/linkonce odr symbols will be dropped by ThinLTO to available_externally when possible. If they had an initializer in the global_ctors list, a comdat group was being created. This code already had logic to skip available_externally defs, but now the EliminateAvailableExternally pass will drop these symbols to declarations earlier. Change the check to skip all declarations for linker (which includes available_externally along with declarations). Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28737 llvm-svn: 292408	2017-01-18 16:58:43 +00:00
Kirill Bobyrev	6afbaf0944	Revert 292404 due to buildbot failures. llvm-svn: 292407	2017-01-18 16:34:25 +00:00
Kirill Bobyrev	9ad06dbe17	[X86] Minor code cleanup to fix several clang-tidy warnings. NFC llvm-svn: 292404	2017-01-18 16:15:47 +00:00
Sam Parker	b0de00d545	[ARM] Create SubtargetFeatures from build attrs An ELFObjectFile can now create SubtargetFeatures from the available ARM build attributes, in a similar manner to MIPS. I've moved the MIPS code into its own function and the ARM handler also has a separate function. Differential Revision: https://reviews.llvm.org/D28291 llvm-svn: 292403	2017-01-18 15:52:11 +00:00
Pavel Labath	97c7cf1d5c	raw_fd_ostream: Make file handles non-inheritable by default Summary: This makes the file descriptors on unix platform non-inheritable (O_CLOEXEC). There is no change in behavior on windows, as the handles were already non-inheritable there. Reviewers: rnk, rafael Subscribers: llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D28854 llvm-svn: 292401	2017-01-18 15:46:50 +00:00
Chad Rosier	771db6f895	[Assembler] Fix crash when assembling .quad for AArch32. A 64-bit relocation does not exist in 32-bit ARMELF. Report an error instead of crashing. PR23870 Patch by Sanne Wouda (sanwou01). Differential Revision: https://reviews.llvm.org/D28851 llvm-svn: 292373	2017-01-18 15:02:54 +00:00
Florian Hahn	8485cecd3f	[thumb,framelowering] Reset NoVRegs in Thumb1FrameLowering::emitPrologue. Summary: In this function, virtual registers can be introduced (for example through calls to emitThumbRegPlusImmInReg). doScavengeFrameVirtualRegs will replace those virtual registers with concrete registers later on in PrologEpilogInserter, which sets NoVRegs again. This patch fixes the Codegen/Thumb/segmented-stacks.ll test case which failed with expensive checks. https://llvm.org/bugs/show_bug.cgi?id=27484 Reviewers: rnk, bkramer, olista01 Reviewed By: olista01 Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D28829 llvm-svn: 292372	2017-01-18 15:01:22 +00:00
Simon Pilgrim	fe2c0ed4cf	[InstCombine][AVX2] Add DemandedElts support for VPERMD/VPERMPS shuffles Simplify a vpermv shuffle mask based on the elements of the mask that are actually demanded. llvm-svn: 292371	2017-01-18 14:47:49 +00:00
Daniel Sanders	af76f989b5	Re-revert: [globalisel] Tablegen-erate current Register Bank Information More missing guards. My build didn't notice it due to a stale file left over from a Global ISel build. llvm-svn: 292369	2017-01-18 14:26:12 +00:00
Simon Pilgrim	970d67c653	[InstCombine][AVX2] Tests showing missed opportunities to pass demanded elts through a vpermd/vpermps shuffle llvm-svn: 292368	2017-01-18 14:23:06 +00:00
Daniel Sanders	517b61cb69	Re-commit: [globalisel] Tablegen-erate current Register Bank Information Summary: Adds a RegisterBank tablegen class that can be used to declare the register banks and an associated tablegen pass to generate the necessary code. Changes since last commit: The new tablegen pass is now correctly guarded by LLVM_BUILD_GLOBAL_ISEL and this should fix the buildbots however it may not be the whole fix. The previous buildbot failures suggest there may be a memory bug lurking that I'm unable to reproduce (including when using asan) or spot in the source. If they re-occur on this commit then I'll need assistance from the bot owners to track it down. Reviewers: t.p.northover, ab, rovka, qcolombet Reviewed By: qcolombet Subscribers: aditya_nandakumar, rengolin, kristof.beyls, vkalintiris, mgorny, dberris, llvm-commits, rovka Differential Revision: https://reviews.llvm.org/D27338 llvm-svn: 292367	2017-01-18 14:17:50 +00:00
Sam Parker	df7c6ef96f	[ARM] Create objdump subtarget from build attrs Enable an ELFObjectFile to read the its arm build attributes to produce a target triple with a specific ARM architecture. llvm-objdump now uses this functionality to automatically produce a more accurate target. Differential Revision: https://reviews.llvm.org/D28769 llvm-svn: 292366	2017-01-18 13:52:12 +00:00
Simon Pilgrim	a22c3a1c0f	[InstCombine] Remove unnecessary intrinsics demanded elts handling As discussed on D28777 - we don't need to handle 'all element' shuffles inside InstCombiner::visitCallInst as InstCombiner::SimplifyDemandedVectorElts will do everything we need. llvm-svn: 292365	2017-01-18 13:44:04 +00:00
Simon Pilgrim	4b51989635	Fixed parser error on windows shell evaluation of RUN script line llvm-svn: 292363	2017-01-18 11:40:28 +00:00
Simon Pilgrim	d0ccf5e2e3	[X86][SSE] Simplify umax knownbits test combineSRA doesn't detect sign bits splats that it does itself so just use -1 as the demanded input so that its already splatted llvm-svn: 292361	2017-01-18 11:20:31 +00:00
Michael Zuckerman	0c0240ce84	[X86] Improve mul combine for negative multiplayer (2^c - 1) This patch improves the mul instruction combine function (combineMul) by adding new layer of logic. In this patch, we are adding the ability to fold (mul x, -((1 << c) -1)) or (mul x, -((1 << c) +1)) into (neg(X << c) -x) or (neg((x << c) + x) respective. Differential Revision: https://reviews.llvm.org/D28232 llvm-svn: 292358	2017-01-18 09:31:13 +00:00
Renato Golin	03c5e69d07	Revert "[XRay][Arm] Repair XRay table emission on Arm32 and add tests to identify such problem earlier" This reverts commit r292210, as it broke the Thumb buldbot with: clang-5.0: error: the clang compiler does not support '-fxray-instrument on thumbv7-unknown-linux-gnueabihf'. llvm-svn: 292357	2017-01-18 09:08:43 +00:00
Jonas Paulsson	a9bb00d82b	[SystemZ] Proper handling of undef flag while expanding pseudo. During post-RA pseudo expansion, an 'undef' flag of the source operand should be propagated by emitGRX32Move(). Review: Ulrich Weigand llvm-svn: 292353	2017-01-18 08:32:54 +00:00
Marina Yatsina	197db00e3e	[X86] Fix for bugzilla 31576 - add support for "data32" instruction prefix This patch fixes bugzilla 31576 (https://llvm.org/bugs/show_bug.cgi?id=31576). "data32" instruction prefix was not defined in the llvm. An exception had to be added to the X86 tablegen and AsmPrinter because both "data16" and "data32" are encoded to 0x66 (but in different modes). Differential Revision: https://reviews.llvm.org/D28468 llvm-svn: 292352	2017-01-18 08:07:51 +00:00
Chandler Carruth	8aaad7c4d9	[LoopDeletion] (cleanup, NFC) Fix one more local variable that didn't follow LLVM's naming conventions while I'm here. Again, sorry I didn't spot this earlier to coalesce with other cleanup changes. llvm-svn: 292333	2017-01-18 02:43:01 +00:00
Chandler Carruth	d50c5fb13f	[PM] Teach LoopDeletion to correctly update the LPM when loops are deleted. I've expanded its test coverage a bit including adding one test that will crash clearly without this change. llvm-svn: 292332	2017-01-18 02:41:26 +00:00
Chandler Carruth	88c36d7852	[LoopDeletion] (cleanup, NFC) Make this test actually test what it claims to test. LoopSimplify was unifying the multiple exits in this test case, making it never even test the multiple exit handling of LoopDeletion. Doh. Now it works (thanks to a great idea from mkuper) and will fail if we ever change something to make it stop working. llvm-svn: 292331	2017-01-18 02:29:35 +00:00
Matt Arsenault	f411071d63	DAG: Consider nnan in isKnownNeverNaN llvm-svn: 292328	2017-01-18 02:10:08 +00:00
Wei Mi	ce9d04ce58	Revert rL292292 since it causes a SEGV on sanitizer-x86_64-linux-fuzzer build bot. llvm-svn: 292327	2017-01-18 01:53:53 +00:00
Kostya Serebryany	bb91170cb5	[libFuzzer] remove stale code llvm-svn: 292325	2017-01-18 01:10:18 +00:00
Pengxuan Zheng	ac6595c960	[test-release.sh] Add Polly to the list of projects Reviewers: zinob, hans, grosser Reviewed By: hans, grosser Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28712 llvm-svn: 292323	2017-01-18 01:03:29 +00:00
Dan Gohman	73e3aaa61e	[WebAssembly] Update grow_memory's return type. The grow_memory instruction now returns the previous memory size. Add the return type to the LLVM intrinsic. llvm-svn: 292322	2017-01-18 01:02:45 +00:00
Matthias Braun	de5fea2c30	MIRParser: Allow regclass specification on operand You can now define the register class of a virtual register on the operand itself avoiding the need to use a "registers:" block. Example: "%0:gr64 = COPY %rax" Differential Revision: https://reviews.llvm.org/D22398 llvm-svn: 292321	2017-01-18 00:59:19 +00:00
Eugene Zelenko	34c23279c2	[Target, Transforms] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 292320	2017-01-18 00:57:48 +00:00
Kostya Serebryany	9d0f02af3d	[libFuzzer] exit(1) on failed merge llvm-svn: 292319	2017-01-18 00:55:29 +00:00
Kostya Serebryany	924978bb43	[libFuzzer] better link for trophies llvm-svn: 292318	2017-01-18 00:45:02 +00:00
Justin Lebar	1cf6bf4989	[NVPTX] Support global variables of integer type larger than i64. Reviewers: tra, majnemer Subscribers: llvm-commits, jholewinski Differential Revision: https://reviews.llvm.org/D28825 llvm-svn: 292316	2017-01-18 00:29:53 +00:00
Xin Tong	58e8142f0e	2 returns next to each other =). NFC llvm-svn: 292315	2017-01-18 00:26:17 +00:00
Xin Tong	99c3da0e8b	Skip loop header while we can when computing loop safety info llvm-svn: 292310	2017-01-18 00:15:11 +00:00
Eric Fiselier	f744e7e15a	[LIT] Make util.executeCommand python3 friendly Summary: The parameter `input` to `subprocess.Popen.communicate(...)` must be an object of type `bytes` . This is strictly enforced in python3. This patch (1) allows `to_bytes` to be safely called redundantly. (2) Explicitly convert `input` within `executeCommand`. This allows for usages like `executeCommand(['clang++', '-'], input='int main() {}\n')`. Reviewers: ddunbar, BinaryKhaos, modocache, dim, EricWF Reviewed By: EricWF Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28736 llvm-svn: 292308	2017-01-18 00:12:41 +00:00
Justin Lebar	9c46450dbb	[NVPTX] Standardize asm printer on "foo \tbar". Some instructions were printed as "foo\tbar", but most are printed as "foo \bar". Standardize on the latter form. llvm-svn: 292306	2017-01-18 00:09:36 +00:00
Justin Lebar	2a2d6f0ddd	[NVPTX] Clean up nested !strconcat calls. !strconcat is a variadic function; it will concatenate an arbitrary number of strings. There's no need to nest it. llvm-svn: 292305	2017-01-18 00:09:19 +00:00
Justin Lebar	cc938fc197	[NVPTX] Implement min/max in tablegen, rather than with custom DAGComine logic. Summary: This change also lets us use max.{s,u}16. There's a vague warning in a test about this maybe being less efficient, but I could not come up with a case where the resulting SASS (sm_35 or sm_60) was different with or without max.{s,u}16. It's true that nvcc seems to emit only max.{s,u}32, but even ptxas 7.0 seems to have no problem generating efficient SASS from max.{s,u}16 (the casts up to i32 and back down to i16 seem to be implicit and nops, happening via register aliasing). In the absence of evidence, better to have fewer special cases, emit more straightforward code, etc. In particular, if a new GPU has 16-bit min/max instructions, we want to be able to use them. Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D28732 llvm-svn: 292304	2017-01-18 00:09:01 +00:00
Justin Lebar	7dc3d6c341	[NVPTX] Lower integer absolute value idiom to abs instruction. Summary: Previously we lowered it literally, to shifts and xors. Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D28722 llvm-svn: 292303	2017-01-18 00:08:44 +00:00
Justin Lebar	1091a9f566	[NVPTX] Improve lowering of llvm.ctpop. Summary: Avoid an unnecessary conversion operation when using the result of ctpop.i32 or ctpop.i16 as an i32, as in both cases the ptx instruction we run returns an i32. (Previously if we used the value as an i32, we'd do an unnecessary zext+trunc.) Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D28721 llvm-svn: 292302	2017-01-18 00:08:27 +00:00
Justin Lebar	c7d20128bd	[NVPTX] Add lowering for llvm.bitreverse. Reviewers: tra Subscribers: llvm-commits, jholewinski Differential Revision: https://reviews.llvm.org/D28720 llvm-svn: 292301	2017-01-18 00:08:10 +00:00
Justin Lebar	47087814f1	[NVPTX] Fix function names in ctlz.ll test. Test-only change. Looks like a copy/paste mistake, all the functions in ctlz.ll were named "ctpop". llvm-svn: 292300	2017-01-18 00:07:52 +00:00
Justin Lebar	d17de5380b	[NVPTX] Improve lowering of llvm.ctlz. Summary: * Disable "ctlz speculation", which inserts a branch on every ctlz(x) which has defined behavior on x == 0 to check whether x is, in fact zero. * Add DAG patterns that avoid re-truncating or re-expanding the result of the 16- and 64-bit ctz instructions. Reviewers: tra Subscribers: llvm-commits, jholewinski Differential Revision: https://reviews.llvm.org/D28719 llvm-svn: 292299	2017-01-18 00:07:35 +00:00
Justin Lebar	33139053da	[IR] Grammar police: "intact" is one word. NFC llvm-svn: 292298	2017-01-18 00:07:18 +00:00
Sanjay Patel	0e9f681dee	[InstCombine] add tests to show missed shrinkage; NFC A patch to partially solve this: https://reviews.llvm.org/D28625 llvm-svn: 292296	2017-01-18 00:03:23 +00:00
Kostya Serebryany	3344f3517f	[libFuzzer] add ATTRIBUTE_NO_SANITIZE_MEMORY to sanitizer hooks llvm-svn: 292295	2017-01-17 23:50:21 +00:00
Dehao Chen	c3f87f02b1	Introduce -unroll-partial-threshold to separate PartialThreshold from Threshold in loop unorller. Summary: Partial unrolling should have separate threshold with full unrolling. Reviewers: efriedma, mzolotukhin Reviewed By: efriedma, mzolotukhin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28831 llvm-svn: 292293	2017-01-17 23:39:33 +00:00
Wei Mi	8f4178a59e	[RegisterCoalescing] Remove partial redundent copy. The patch is to solve the performance problem described in PR27827. Register coalescing sometimes cannot remove a copy because of interference. But if we can find a reverse copy in one of the predecessor block of the copy, the copy is partially redundent and we may remove the copy partially by moving it to the predecessor block without the reverse copy. Differential Revision: https://reviews.llvm.org/D28585 llvm-svn: 292292	2017-01-17 23:39:07 +00:00
Mehdi Amini	485db58b84	Fix GettingStarted doc so that the example build command for cmake LLVM_ENABLE_PROJECTS works on linux I tested the previous one on macOS, however building libc++ on Linux requires libcxxabi as well. llvm-svn: 292290	2017-01-17 23:23:08 +00:00
Mike Aizatsky	0e37f8e41d	[libfuzzer] fixing collected pc addresses for coverage Summary: The causes google/ossfuzz#84 Reviewers: kcc Subscribers: mgorny Differential Revision: https://reviews.llvm.org/D28827 llvm-svn: 292289	2017-01-17 23:11:32 +00:00
Zachary Turner	c095f6a037	[ADT] Add SparseBitVector::find_last(). Differential Revision: https://reviews.llvm.org/D28817 llvm-svn: 292288	2017-01-17 23:09:21 +00:00
Kostya Serebryany	1d8c2ce97e	[libFuzzer] use table of recent compares for memcmp/strcmp (to unify the code between cmp and memcmp handling) llvm-svn: 292287	2017-01-17 23:09:05 +00:00
Kostya Serebryany	138ed2b068	[libFuzzer] copy the options inside MutationDispatcher to avoid use-after-scope in mutator tests llvm-svn: 292286	2017-01-17 23:05:07 +00:00
Tim Northover	33a1a0b001	GlobalISel: fix comparison order for G_FCMP As with G_ICMP we'd written the CSET instructions backwards. llvm-svn: 292285	2017-01-17 23:04:01 +00:00
Tim Northover	509091f9e0	GlobalISel: add callseq instructions to record stack usage llvm-svn: 292284	2017-01-17 22:43:34 +00:00
Tim Northover	d943354216	GlobalISel: correctly handle varargs Some platforms (notably iOS) use a different calling convention for unnamed vs named parameters in varargs functions, so we need to keep track of this information when translating calls. Since not many platforms are involved, the guts of the special handling is in the ValueHandler class (with a generic implementation that should work for most targets). llvm-svn: 292283	2017-01-17 22:30:10 +00:00
Chandler Carruth	80de5e6e01	[LoopDeletion] (cleanup, NFC) Use the dedicated helper to get a single unique exit block if available rather than rolling it ourselves. This is a little disappointing because that helper doesn't do anything clever to short-circuit the (surprisingly expensive) computation of all exit blocks. What's worse is that the way we compute this is hopelessly, hilariously inefficient. We're literally computing the same information two different ways and multiple times each way: - hasDedicatedExits computes the exit block set and then looks at the predecessors of each - getExitingBlocks computes the set of loop blocks which have exiting successors - getUniqueExitBlock(s) computes the set of non-loop blocks reached from loop blocks (sound familiar?) Anyways, at some point we should clean all of this up in the LoopInfo API, but for now just simplifying the user I'm about to touch. llvm-svn: 292282	2017-01-17 22:28:52 +00:00
Matthew Simpson	e2c9ad9483	[LV] Add requires asserts to test case llvm-svn: 292280	2017-01-17 22:21:33 +00:00
Chandler Carruth	aa885c990b	[LoopDeletion] (cleanup, NFC) Fix another variable name to match LLVM conventions, missed this one in a previous cleanup patch (sorry). llvm-svn: 292279	2017-01-17 22:19:56 +00:00
Tim Northover	b6636fd392	[GlobalISel] track predecessor mapping during switch lowering. Correctly populating Machine PHIs relies on knowing exactly how the IR level CFG was lowered to MachineIR. This needs to be tracked by any translation phases that meddle (currently only SwitchInst handling). This reapplies r291973 which was reverted because of testing failures. Fixes: + Don't return an ArrayRef to a local temporary. + Incorporate Kristof's suggested comment improvements. llvm-svn: 292278	2017-01-17 22:13:50 +00:00
Simon Pilgrim	421f2d9af8	[X86][SSE] Split UMIN and UMAX known bits tests llvm-svn: 292277	2017-01-17 22:12:25 +00:00
Chandler Carruth	bd551e9674	[LoopDeletion] (cleanup, NFC) Remove a pointless comment. I hope that for any code, it is changed only with good reason and only when the author knows what they are doing... There is of course good reason to comment here about the subtlety of the process, and I've left that comment in tact. llvm-svn: 292275	2017-01-17 22:09:28 +00:00
Chandler Carruth	26169f001c	[LoopDeletion] (cleanup, NFC) Make simple helper functions static instead of members. No state was being provided by the object so this seems strictly simpler. I've also tried to improve the name and comments for the functions to more thoroughly document what they are doing. llvm-svn: 292274	2017-01-17 22:07:26 +00:00
Chandler Carruth	bb7e4b46e9	[LoopDeletion] (cleanup, NFC) Stop passing around reference to a vector that we know has exactly one element when all we are going to do is get that one element out of it. Instead, pass around that one element. There are more simplifications to come in this code... llvm-svn: 292273	2017-01-17 22:00:52 +00:00
Chandler Carruth	04a73879a8	[PM] Clean up variable and parameter names to match modern LLVM naming conventions more conistently before hacking on this code to integrate nicely with new PM's loop pass infrastructure. NFC. llvm-svn: 292272	2017-01-17 21:51:39 +00:00
Aaron Ballman	b3c5151327	Silence some Sphinx diagnostics in an attempt to get the documentation builder back to green (http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/1895 ). llvm-svn: 292271	2017-01-17 21:48:31 +00:00
Xin Tong	0bc2977874	Add a test case for LICM when promoting locals that may be read after the throw within the loop. NFCI. Summary: Add a test case for LICM when promoting locals that may be read after the throw within the loop. Reviewers: eli.friedman, hfinkel, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28822 llvm-svn: 292261	2017-01-17 21:26:36 +00:00
Sanjay Patel	14715b3c2a	[InstCombine] refactor foldICmpShlConstant(); NFCI This reduces the size of and increases the symmetry with the planned functional change in: https://reviews.llvm.org/D28406 llvm-svn: 292260	2017-01-17 21:25:16 +00:00
Alexei Starovoitov	efefbc4a19	[bpf] fix stack-use-after-scope Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 292258	2017-01-17 21:14:00 +00:00
Vitaly Buka	729a039ec2	Enabled -fsanitize-address-use-after-scope for -DLLVM_USE_SANITIZER=Address Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D28823 llvm-svn: 292256	2017-01-17 21:04:23 +00:00
Michal Gorny	608319101a	[cmake] Update SOVERSION for the new versioning scheme Update SOVERSION to use just the major version number rather than major+minor, to match the new versioning scheme where only major is used to indicate API/ABI version. Since two-digit SOVERSIONs were introduced post 3.9 branching, this change does not risk any SOVERSION collisions. In the past, two-component X.Y SOVERSIONs were shortly used but those will not interfere with the new ones since the new versions start at 4. Differential Revision: https://reviews.llvm.org/D28730 llvm-svn: 292255	2017-01-17 21:04:19 +00:00
Matthew Simpson	3fbdaa5906	[LV] Mark non-consecutive-like pointers non-uniform If a memory instruction will be vectorized, but it's pointer operand is non-consecutive-like, the instruction is a gather or scatter operation. Its pointer operand will be non-uniform. This should fix PR31671. Reference: https://llvm.org/bugs/show_bug.cgi?id=31671 Differential Revision: https://reviews.llvm.org/D28819 llvm-svn: 292254	2017-01-17 20:51:39 +00:00
Dan Gohman	1209c7ac16	[WebAssembly] Add triple support for the new wasm object format Differential Revision: https://reviews.llvm.org/D26701 llvm-svn: 292252	2017-01-17 20:34:09 +00:00
Xin Tong	8343b5096d	Rename scalar_promote.ll to scalar-promote.ll and scalar_promote-unwind.ll to scalar-promote-unwind.ll. NFCI llvm-svn: 292251	2017-01-17 20:28:36 +00:00
Xin Tong	43c1a26400	Refactor out LoopInfo computation so that it can be used by other test cases. Summary: Refactor out LoopInfo computation so that it can be used by other test cases. So i am changing this test proactively for later commit, which will use this function. Reviewers: sanjoy, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28778 llvm-svn: 292250	2017-01-17 20:24:39 +00:00
Sanjoy Das	6de072a712	[EarlyCSE] Don't DSE across readnone functions that may throw Summary: Depends on D28740 Reviewers: dberlin, chandlerc, hfinkel, majnemer Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D28741 llvm-svn: 292249	2017-01-17 20:15:47 +00:00
Sanjay Patel	f42a955c3e	[InstCombine] add tests for shl nsw + icmp sle; NFC We want to handle these cases similarly to icmp sgt, so add the tests for it. See: https://reviews.llvm.org/D28406 llvm-svn: 292248	2017-01-17 20:15:26 +00:00
Ahmed Bougacha	86b680a372	[TLI] Appease spurious MSVC warning using llvm_unreachable. NFC. r292188 confused MSVC because of the combined lack of a default case and return statement. Move the unreachable outside of the NumLibFuncs case, to make it obvious that all cases should be handled. llvm_unreachable is __declspec(noreturn), so I'm assuming this does appease MSVC. llvm-svn: 292246	2017-01-17 19:54:18 +00:00
Joerg Sonnenberger	270dd41f75	Remove an overeager assert from r288844. llvm-svn: 292244	2017-01-17 19:29:15 +00:00
Bob Wilson	f2d0b68b3b	Revert r291640 change to fold X86 comparison with atomic_load_add. Even with the fix from r291630, this still causes problems. I get widespread assertion failures in the Swift runtime's WeakRefCount::increment() function. I sent a reduced testcase in reply to the commit. llvm-svn: 292242	2017-01-17 19:18:57 +00:00
Chandler Carruth	b6e32daa81	[PM] Teach the LoopPassManager to automatically canonicalize loops by runnig LCSSA over them prior to running the loop pipeline. This also teaches the loop PM to verify that LCSSA form is preserved throughout the pipeline's run across the loop nest. Most of the test updates just leverage this new functionality. One has to be relaxed with the new PM as IVUsers is less powerful when it sees LCSSA input. Differential Revision: https://reviews.llvm.org/D28743 llvm-svn: 292241	2017-01-17 19:18:12 +00:00
Sanjay Patel	9666996563	[ValueTracking] recognize a 'not' of an assumed condition as false Also, add the corresponding match to the AssumptionCache's 'Affected Values' list. Differential Revision: https://reviews.llvm.org/D28485 llvm-svn: 292239	2017-01-17 18:15:49 +00:00
David Majnemer	de55c606d1	[InstCombine] Fold ((C1 OP zext(X)) & C2) -> zext((C1 OP X) & C2) This further extends r292179 to support additional binary operators beyond subtraction. llvm-svn: 292238	2017-01-17 18:08:06 +00:00
Kuba Mracek	e7d1f92344	Revert r292231. llvm-svn: 292237	2017-01-17 18:06:38 +00:00
Simon Pilgrim	60662cb5d0	[X86][AVX512] Add all_of/any_of avx512vl tests llvm-svn: 292235	2017-01-17 17:33:18 +00:00
Chad Rosier	8520429bdd	[ValueTracking] Extend known bits to understand @llvm.bitreverse. Differential Revision: https://reviews.llvm.org/D28780 llvm-svn: 292233	2017-01-17 17:23:51 +00:00
Kuba Mracek	53013e9e6f	[lit] Limit parallelism of sanitizer tests on Darwin [llvm part] Running lit tests and unit tests of ASan and TSan on macOS has very bad performance when running with a high number of threads. This is caused by xnu (the macOS kernel), which currently doesn't handle mapping and unmapping of sanitizer shadow regions (reserved VM which are several terabytes large) very well. The situation is so bad that increasing the number of threads actually makes the total testing time larger. The macOS buildbots are affected by this. Note that we can't easily limit the number of sanitizer testing threads without affecting the rest of the tests. This patch adds a special "group" into lit, and limits the number of concurrently running tests in this group. This helps solve the contention problem, while still allowing other tests to run in full, that means running lit with -j8 will still with 8 threads, and parallelism is only limited in sanitizer tests. Differential Revision: https://reviews.llvm.org/D28420 llvm-svn: 292231	2017-01-17 17:15:02 +00:00
Sanjay Patel	5424bd2625	[InstCombine] reduce indent; NFCI llvm-svn: 292230	2017-01-17 16:59:09 +00:00
George Rimar	167ca4ae7e	Recommit r292214 "[Support/Compression] - Change zlib API to return Error instead of custom status" No any changes, will follow up with D28807 commit containing APLi change for clang to fix build issues happened. Original commit message: [Support/Compression] - Change zlib API to return Error instead of custom status. Previously API returned custom enum values. Patch changes it to return Error with string description. That should help users to report errors in universal way. Differential revision: https://reviews.llvm.org/D28684 llvm-svn: 292226	2017-01-17 15:45:07 +00:00
Sam Kolton	9dffada98b	[AMDGPU] Assembler: fix v_mac_f16 immediates Reviewers: vpykhtin, artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28802 llvm-svn: 292224	2017-01-17 15:26:02 +00:00
Simon Pilgrim	8b2996fe1a	[X86][SSE] Tests showing horizontal all_of/any_of of vector comparison results llvm-svn: 292223	2017-01-17 15:02:01 +00:00
Krasimir Georgiev	4cbe21a43c	[llvm-objdump tests] Copy the inputs of tests closer to tests. Summary: Tests under tools/llvm-objdump should not use inputs from Object. Copied the required inputs and aligned the new tests to be more consistent with the existing tests in this respect. Reviewers: ioeric Reviewed By: ioeric Subscribers: davide, djasper, cfe-commits Differential Revision: https://reviews.llvm.org/D28799 llvm-svn: 292222	2017-01-17 14:22:29 +00:00
George Rimar	715540f207	Revert r292214 "[Support/Compression] - Change zlib API to return Error instead of custom status." It broked clang: http://lab.llvm.org:8080/green//job/clang-stage1-cmake-RA-incremental_build/34218/consoleFull#46141505449ba4694-19c4-4d7e-bec5-911270d8a58c llvm-svn: 292217	2017-01-17 13:27:58 +00:00
Boris Ulasevich	a2a0213179	BrainF example: fixing output buffering issue Differential Revision: https://reviews.llvm.org/D27824 llvm-svn: 292216	2017-01-17 13:27:28 +00:00
George Rimar	e29a32e9ce	[Support/Compression] - Change zlib API to return Error instead of custom status. Previously API returned custom enum values. Patch changes it to return Error with string description. That should help users to report errors in universal way. Differential revision: https://reviews.llvm.org/D28684 llvm-svn: 292214	2017-01-17 13:20:17 +00:00
Serge Rogatch	50be6b45a9	[XRay][Arm] Repair XRay table emission on Arm32 and add tests to identify such problem earlier Summary: Emission of XRay table was occasionally disabled for Arm32, but this bug was not then detected because earlier (also by mistake) testing of XRay was occasionally disabled on 32-bit Arm targets. This patch should fix that problem and detect such problems in the future. This patch is one of a series, see also - https://reviews.llvm.org/D28623 Reviewers: rengolin, dberris Reviewed By: dberris Subscribers: llvm-commits, aemerson, rengolin, dberris, iid_iunknown Differential Revision: https://reviews.llvm.org/D28624 llvm-svn: 292210	2017-01-17 11:52:10 +00:00
Simon Pilgrim	d4eb800b03	[InstCombine][X86][AVX] Add DemandedElts support for VPERMILPD/VPERMILPS instructions Simplify a vpermilvar shuffle mask based on the elements of the mask that are actually demanded. llvm-svn: 292209	2017-01-17 11:35:03 +00:00
Vasileios Kalintiris	92b3753115	Update the release tester for MIPS. NFC. llvm-svn: 292208	2017-01-17 11:00:28 +00:00
Pavel Labath	9778921a0b	Remove pid_t usage from llvm-xray This type is not available on windows. llvm-svn: 292206	2017-01-17 09:39:31 +00:00
Matt Arsenault	4165efdc58	AMDGPU: Add replacement export intrinsics llvm-svn: 292205	2017-01-17 07:26:53 +00:00
Alexei Starovoitov	e4975487f5	[bpf] error when unknown bpf helper is called Emit error when BPF backend sees a call to a global function or to an external symbol. The kernel verifier only allows calls to predefined helpers from bpf.h which are defined in 'enum bpf_func_id'. Such calls in assembler must look like 'call [1-9]+' where number matches bpf_func_id. Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 292204	2017-01-17 07:26:17 +00:00
Shoaib Meenai	c0857a40e7	[utils] Add libc++ and libc++abi config to llvm-lit This allows us to use bin/llvm-lit to run individual libc++ and libc++abi tests without having to explicitly specify the site config paths, similar to other projects. Differential Revision: https://reviews.llvm.org/D28733 llvm-svn: 292203	2017-01-17 07:10:55 +00:00
Craig Topper	729d30d0ae	[AVX-512] Add support for taking a bitcast between a SUBV_BROADCAST and VSELECT and moving it to the input of the SUBV_BROADCAST if it will help with using a masked operation. llvm-svn: 292201	2017-01-17 06:49:59 +00:00
Craig Topper	7b003b9cf3	[AVX-512] Add test cases showing missed opportunities to fold subvector broadcasts with a mask operation. llvm-svn: 292200	2017-01-17 06:49:54 +00:00
Matt Arsenault	aa09378f1b	llc: Update link components llvm-svn: 292198	2017-01-17 05:47:03 +00:00
Sanjoy Das	679bc32c6a	[InstCombine] Don't DSE across readnone functions that may throw Summary: Depends on D28740 Reviewers: dberlin, chandlerc, hfinkel, majnemer Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D28742 llvm-svn: 292197	2017-01-17 05:45:09 +00:00
Matt Arsenault	3583c8597d	llc: Update LLVMBuild llvm-svn: 292196	2017-01-17 05:34:08 +00:00
Matt Arsenault	3b4b15bd18	llc: Initialize more passes Targets can add these. Initialize them so -print-before/-print-after etc. work. llvm-svn: 292195	2017-01-17 05:11:25 +00:00
Lang Hames	b80ea29667	[Orc][RPC] Return unsupported rpc function errors from the non-retry cases in negotiateFunction. These cases were accidentally left out of r292055, resulting in a less descriptive ECError being returned on these paths. llvm-svn: 292193	2017-01-17 04:07:48 +00:00
Ahmed Bougacha	9e5a085cf1	Revert "[TLI] Robustize SDAG proto checking by merging it into TLI." This reverts commit r292189, as it causes issues on SystemZ bots. llvm-svn: 292191	2017-01-17 03:31:00 +00:00
Ahmed Bougacha	c018efd680	[TLI] Robustize SDAG proto checking by merging it into TLI. SelectionDAGBuilder recognizes libfuncs using some homegrown parameter type-checking. Use TLI instead, removing another heap of redundant code. This isn't strictly NFC, as the SDAG code was too lax. Concretely, this means changes are required to two tests: - calling a non-variadic function via a variadic prototype isn't OK; it just happens to work on x86_64 (but not on, e.g., aarch64). - mempcpy has a size_t parameter; the SDAG code accepts any integer type, which meant using i32 on x86_64 worked. I don't think it's worth supporting either of these (IMO) broken testcases. Instead, fix them to be more correct. llvm-svn: 292189	2017-01-17 03:10:06 +00:00
Ahmed Bougacha	6b9be1dbe1	[TLI] Add prototype checking for all remaining LibFuncs. This is another step towards unifying all LibFunc prototype checks. This work started in r267758 (D19469); add the remaining checks. Also add a unittest that checks each libfunc declared with a known-valid and known-invalid prototype. New libfuncs added in the future are required to have prototype checking in place; the known-valid test will fail otherwise. Differential Revision: https://reviews.llvm.org/D28030 llvm-svn: 292188	2017-01-17 03:10:02 +00:00
Ahmed Bougacha	a65de7e241	[TLI] Alphabetize some of the prototype check switch. NFC. llvm-svn: 292187	2017-01-17 03:10:00 +00:00
Ahmed Bougacha	1539d1de75	[unittests] Alphabetize cmake file list. NFC. llvm-svn: 292186	2017-01-17 03:09:55 +00:00
Alexei Starovoitov	05de2e4818	[bpf] error when BPF stack size exceeds 512 bytes Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 292180	2017-01-17 01:05:17 +00:00
David Majnemer	36d382b773	[InstCombine] Fold ((C1-zext(X)) & C2) -> zext((C1-X) & C2) This is valid if C2 fits within the bitwidth of X thanks to two's complement modulo arithmetic. llvm-svn: 292179	2017-01-17 00:45:57 +00:00
Matt Arsenault	c8cc2be9f8	Add comment to test file I forgot to save llvm-svn: 292178	2017-01-17 00:35:28 +00:00
Matt Arsenault	b948b4d8df	SimplifyLibCalls: Remove checks for fabs Use the intrinsic instead of emitting the libcall which will be replaced by the intrinsic. llvm-svn: 292176	2017-01-17 00:30:31 +00:00
Matt Arsenault	2aab1d45ff	AMDGPU: Remove dead pattern This is the unsafe conversion pattern, but not guarded by an unsafe math check. It is also already done in LegalizeDAG. llvm-svn: 292173	2017-01-17 00:10:43 +00:00
Matt Arsenault	7233344c28	SimplifyLibCalls: Replace fabs libcalls with intrinsics Add missing fabs(fpext) optimzation that worked with the call, and also fixes it creating a second fpext when there were multiple uses. llvm-svn: 292172	2017-01-17 00:10:40 +00:00
Davide Italiano	1825a03f72	[Object] Fixup permissions of input files. They just need to be read/dumped, so no need to set the exec bit on any of them. NFCI, I guess. llvm-svn: 292171	2017-01-16 23:28:58 +00:00
Davide Italiano	eb9ad9831b	[llvm-objdump] Dump PT_NOTE as part of -p. PR: 31641 llvm-svn: 292170	2017-01-16 23:13:46 +00:00
Davide Italiano	cad192779a	[llvm-objdump] Dump PT_GNU_RELRO as part of -p. PR: 31641 llvm-svn: 292169	2017-01-16 22:58:26 +00:00
Davide Italiano	6cc726ead0	[llvm-objdump] Dump PT_OPENBSD_{BOOTDATA,RANDOMIZE,WXNEEDED}. PR: 31641 llvm-svn: 292167	2017-01-16 22:01:41 +00:00
David Blaikie	c8c1b80d06	Add missing header to see if that clears up the build llvm-svn: 292166	2017-01-16 21:40:08 +00:00
Simon Pilgrim	a0b0b96d83	[InstCombine][AVX] Tests showing missed opportunities to pass demanded elts through a permilpd/permilps shuffle mask llvm-svn: 292165	2017-01-16 21:34:22 +00:00
Sanjay Patel	da5682afdd	[InstCombine] use m_APInt instead of faking it llvm-svn: 292164	2017-01-16 21:24:41 +00:00
David Blaikie	435888f6ba	Attempt to fix the MSVC build by using llvm::errc instead of std::errc llvm-svn: 292163	2017-01-16 21:20:51 +00:00
Jan Vesely	334f51a6fe	ADMGPU/EG,CM: Implement _noret global atomics _RTN versions will be a lot more complicated Differential Revision: https://reviews.llvm.org/D28067 llvm-svn: 292162	2017-01-16 21:20:13 +00:00
David Blaikie	87299ad2e7	[XRay] Implement the `llvm-xray graph` subcommand Here we define the `graph` subcommand which generates a graph from the function call information and uses it to present the call information graphically with additional annotations. Reviewers: dblaikie, dberris Differential Revision: https://reviews.llvm.org/D27243 llvm-svn: 292156	2017-01-16 20:36:26 +00:00
David Blaikie	0cd22f9540	Attempt to workaround MSVC build issue where I suspect an enum class constant 0 is considered a possible null pointer I can't reproduce this so far with web compilers, so throwing this at the bots to see if it sticks. llvm-svn: 292155	2017-01-16 20:28:59 +00:00
Tony Jiang	8e8c444d3d	[PowerPC] Expand ISEL instruction into if-then-else sequence. Generally, the ISEL is expanded into if-then-else sequence, in some cases (like when the destination register is the same with the true or false value register), it may just be expanded into just the if or else sequence. llvm-svn: 292154	2017-01-16 20:12:26 +00:00
Sanjay Patel	65cce20caa	[InstCombine] fix names in canEvaluateShiftedShift(); NFC It's not clear what 'First' and 'Second' mean, so use 'Inner' and 'Outer' to match foldShiftedShift() and add comments with formulas, so it's easier to see what's going on. llvm-svn: 292153	2017-01-16 20:05:26 +00:00
Sanjay Patel	ab8b32de71	[InstCombine] use m_APInt to allow shift-shift folds for vectors with splat constants Some existing 'FIXME' tests are still not folded because of splat holes in value tracking. llvm-svn: 292151	2017-01-16 19:35:45 +00:00
Sanjay Patel	cd06f6fe10	[InstCombine] add tests to show missed vector folds; NFC The shift-shift possibilities became easier to see after: https://reviews.llvm.org/rL292145 llvm-svn: 292150	2017-01-16 19:23:34 +00:00
David Blaikie	3eaa7e348c	PR31650: Refer to enum constant when initializing llvm::None constant llvm-svn: 292149	2017-01-16 18:48:52 +00:00
Justin Lebar	e45dd3aace	[NVPTX] Add blank line to NVPTXUsage.rst to appease the Sphinx. Fixes: Warning, treated as error: /home/buildbot/llvm-build-dir/llvm-sphinx-docs/llvm/src/docs/NVPTXUsage.rst:333: ERROR: Error in "code-block" directive: maximum 1 argument(s) allowed, 17 supplied. llvm-svn: 292148	2017-01-16 18:39:15 +00:00
Sanjay Patel	646734a6cd	[InstCombine] refactor shift-of-shift folds; NFCI Reduces code duplication and makes it easier to extend these folds for vectors. llvm-svn: 292145	2017-01-16 17:27:50 +00:00
Simon Pilgrim	87eddf9aaf	[InstCombine][SSE] Tests showing missed opportunities to pass demanded elts through a packss/packus truncation llvm-svn: 292144	2017-01-16 17:26:23 +00:00
Pavel Labath	d79f638512	[llvm-xray] Fix android build std::to_string is not available in the android ndk. Using llvm::to_string instead. Committing as obvious. llvm-svn: 292143	2017-01-16 16:38:23 +00:00
Chad Rosier	58fb5f5e58	[AArch64] Falkor supports Rounding Double Multiply Add/Subtract instructions. Falkor only partially implements the ARMv8.1a extensions, so this patch refactors the support for the SQRDML[A\|S]H instruction into a separate feature. Differential Revision: https://reviews.llvm.org/D28681 llvm-svn: 292142	2017-01-16 16:28:43 +00:00
Daniel Sanders	a83a1a69c5	Revert r292132: [globalisel] Tablegen-erate current Register Bank Information'... Several buildbots encountered a crash in tablegen when building this commit. Reverting while I investigate the cause. llvm-svn: 292136	2017-01-16 15:34:43 +00:00
Hal Finkel	c29d5f1674	Fix use-after-free bug in AffectedValueCallbackVH::allUsesReplacedWith When transferring affected values in the cache from an old value, identified by the value of the current callback, to the specified new value we might need to insert a new entry into the DenseMap which constitutes the cache. Doing so might delete the current callback object. Move the copying logic into a new function, a member of the assumption cache itself, so that we don't run into UB should the callback handle itself be removed mid-copy. Differential Revision: https://reviews.llvm.org/D28749 llvm-svn: 292133	2017-01-16 15:22:01 +00:00
Daniel Sanders	ab8194def0	[globalisel] Tablegen-erate current Register Bank Information Summary: Adds a RegisterBank tablegen class that can be used to declare the register banks and an associated tablegen pass to generate the necessary code. Reviewers: t.p.northover, ab, rovka, qcolombet Subscribers: aditya_nandakumar, rengolin, kristof.beyls, vkalintiris, mgorny, dberris, llvm-commits, rovka Differential Revision: https://reviews.llvm.org/D27338 llvm-svn: 292132	2017-01-16 15:20:43 +00:00
Tony Jiang	8da139a9fd	Revert "[PowerPC] Expand ISEL instruction into if-then-else sequence." This reverts commit 1d0e0374438ca6e153844c683826ba9b82486bb1. llvm-svn: 292131	2017-01-16 15:01:07 +00:00
Simon Pilgrim	3e91519a1c	[SelectionDAG] Add knownbits support for BITREVERSE llvm-svn: 292130	2017-01-16 14:49:26 +00:00
Tony Jiang	7630b8c5ee	[PowerPC] Expand ISEL instruction into if-then-else sequence. Generally, the ISEL is expanded into if-then-else sequence, in some cases (like when the destination register is the same with the true or false value register), it may just be expanded into just the if or else sequence. llvm-svn: 292128	2017-01-16 14:43:12 +00:00
NAKAMURA Takumi	f2b135ac3a	DWARFDebugInfoTest.cpp: Don't use ArrayRef with initializer. It was allocated locally. llvm-svn: 292127	2017-01-16 14:33:37 +00:00
Simon Pilgrim	355cd67d2d	[X86][SSE] Test showing missing BITREVERSE knownbits support llvm-svn: 292118	2017-01-16 13:59:42 +00:00
Simon Dardis	730fdb73a1	[mips] Correct c.cond.fmt instruction definition. Permit explicit $fcc<X> operand in c.cond.fmt instruction. Add c.cond.fmt to the MIPS to microMIPS instruction mapping table. Check that $fcc1 - $fcc7 are unusable for MIPS-I to MIPS-III for c.cond.fmt, bc1t, bc1f. Reviewers: seanbruno, zoran.jovanovic, vkalintiris Differential Revision: https://reviews.llvm.org/D24510 llvm-svn: 292117	2017-01-16 13:55:58 +00:00
Simon Pilgrim	db73dbcc7c	[SelectionDAG] Add support for BITREVERSE constant folding We were relying on constant folding of the legalized instructions to do what constant folding we had previously llvm-svn: 292114	2017-01-16 13:39:00 +00:00

... 4 5 6 7 8 ...

143767 Commits