llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Shen	8b58bdfe6f	[GenericDomTree] Change GenericDomTree to use NodeRef in GraphTraits. NFC. Summary: Looking at the implementation, GenericDomTree has more specific requirements on NodeRef, e.g. NodeRefObject->getParent() should compile, and NodeRef should be a pointer. We can remove the pointer requirement, but it seems to have little gain, given the limited use cases. Also changed GraphTraits<Inverse<Inverse<T>> to be more accurate. Reviewers: dblaikie, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23593 llvm-svn: 278961	2016-08-17 20:01:58 +00:00
Sanjay Patel	84ff18ba92	[InstCombine] minimize tests and autogenerate checks llvm-svn: 278960	2016-08-17 19:56:10 +00:00
Sanjay Patel	daffec91ef	[InstCombine] more clean up of foldICmpXorConstant(); NFCI Use m_APInt for the xor constant, but this is all still guarded by the initial ConstantInt check, so no vector types should make it in here. llvm-svn: 278957	2016-08-17 19:45:18 +00:00
Sanjay Patel	6d5f448746	[InstCombine] clean up foldICmpXorConstant(); NFCI 1. Change variable names 2. Use local variables to reduce code 3. Early exit to reduce indent llvm-svn: 278955	2016-08-17 19:23:42 +00:00
Marina Yatsina	53ce3f9d02	Fix for PR29010 This is a fix for https://llvm.org/bugs/show_bug.cgi?id=29010 Root cause of the bug is that the register class of the machine instruction operand does not fully reflect if this registers that can be allocated. Both for i386 and x86_64 the operand's register class is VR128RegClass and thus contains xmm0-xmm15, though in i386 we can only use xmm0-xmm8. In order to get the actual allocable registers of the class we need to use RegisterClassInfo. Differential Revision: https://reviews.llvm.org/D23613 llvm-svn: 278954	2016-08-17 19:07:40 +00:00
Kostya Serebryany	a7398ba024	[libFuzzer] more mutations llvm-svn: 278950	2016-08-17 18:10:42 +00:00
Adrian Prantl	ccd546e953	Move tests to the appropriate subdirectory. llvm-svn: 278948	2016-08-17 16:55:56 +00:00
Sanjay Patel	63e14a07e8	[InstCombine] use m_APInt to allow icmp (or X, Y), C folds for splat constant vectors This is a sibling of: https://reviews.llvm.org/rL278859 https://reviews.llvm.org/rL278935 llvm-svn: 278945	2016-08-17 16:38:57 +00:00
Sanjay Patel	943e92efde	[InstCombine] clean up foldICmpOrConstant(); NFCI 1. Change variable names 2. Use local variables to reduce code 3. Use ? instead of if/else 4. Use the APInt variable instead of 'RHS' so the removal of the FIXME code will be direct llvm-svn: 278944	2016-08-17 16:30:43 +00:00
Sanjay Patel	f636d762ed	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278943	2016-08-17 16:23:15 +00:00
Adrian Prantl	c19dee734f	Support the DW_AT_noreturn DWARF flag. This is used to mark functions with the C++11 [[ noreturn ]] or C11 _Noreturn attributes. Patch by Victor Leschuk! https://reviews.llvm.org/D23167 llvm-svn: 278940	2016-08-17 16:02:43 +00:00
Chad Rosier	ea7e4647db	Revert "Reassociate: Reprocess RedoInsts after each inst". This reverts commit r258830, which introduced a bug described in PR28367. PR28367 llvm-svn: 278938	2016-08-17 15:54:39 +00:00
Sanjay Patel	4f7eb2aa95	[InstCombine] use m_APInt to allow icmp (add X, Y), C folds for splat constant vectors This is a sibling of: https://reviews.llvm.org/rL278859 llvm-svn: 278935	2016-08-17 15:24:30 +00:00
Andrey Bokhanko	7d7bacbafa	Clarify the statement on using #if 0 ... #endif in CodingStandards. The statement on using #if 0 ... #endif is not very clear (for people like me :-)). This patch clarifies it a bit to avoid confusion. Differential Revision: https://reviews.llvm.org/D23404 llvm-svn: 278932	2016-08-17 14:53:18 +00:00
Simon Dardis	ac96ec7906	[mips] Add l.[sd] and s.[sd] instruction aliases Reviewers: dsanders, vkalintiris Differential Review: https://reviews.llvm.org/D23121 llvm-svn: 278930	2016-08-17 14:45:09 +00:00
Chad Rosier	a6822f64f3	Revert "[Reassociate] Avoid iterator invalidation when negating value." This reverts commit r278928 due to lit test failures. llvm-svn: 278929	2016-08-17 14:31:34 +00:00
Chad Rosier	cf3e8121a6	[Reassociate] Avoid iterator invalidation when negating value. Differential Revision: https://reviews.llvm.org/D23464 PR28367 llvm-svn: 278928	2016-08-17 14:16:45 +00:00
Jonas Paulsson	7a79422536	[LoopStrenghtReduce] Refactoring and addition of a new target cost function. Refactored so that a LSRUse owns its fixups, as oppsed to letting the LSRInstance own them. This makes it easier to rate formulas for LSRUses, since the fixups are available directly. The Offsets vector has been removed since it was no longer necessary. New target hook isFoldableMemAccessOffset(), which is used during formula rating. For SystemZ, this is useful to express that loads and stores with float or vector types with a big/negative offset should be avoided in loops. Without this, LSR will generate a lot of negative offsets that would require extra instructions for loading the address. Updated tests: test/CodeGen/SystemZ/loop-01.ll Reviewed by: Quentin Colombet and Ulrich Weigand. https://reviews.llvm.org/D19152 llvm-svn: 278927	2016-08-17 13:24:19 +00:00
Marina Yatsina	4b22642e6f	Fixing bug committed in rev. 278321 In theory the indices of RC (and thus the index used for LiveRegs) may differ from the indices of OpRC. Fixed the code to extract the correct RC index. OpRC contains the first X consecutive elements of RC, and thus their indices are currently de facto the same, therefore a test cannot be added at this point. Differential Revision: https://reviews.llvm.org/D23491 llvm-svn: 278923	2016-08-17 11:40:21 +00:00
Sam Kolton	c05d7784a6	[AMDGPU] llvm-objdump: Skip amd_kernel_code_t only at the begining of kernel symbol. Summary: This change fix bug in AMDGPU disassembly. Previously, presence of symbols other than kernel symbols caused objdump to skip begining of those symbols. Reviewers: tstellarAMD, vpykhtin, Bigcheese, ruiu Subscribers: kzhuravl, arsenm Differential Revision: http://reviews.llvm.org/D21966 llvm-svn: 278921	2016-08-17 10:17:57 +00:00
Nicolai Haehnle	1cdd6ca6ca	LiveIntervals: add removeRegUnit Summary: See D22198 for the motivation: We have a pass that uses LiveIntervals anyway, and there is now a requirement to track a physical register that is not usually tracked at this point of the compilation. The pass also introduces instructions that affect this physical register, but we want to preserve LiveIntervals. Rather than add brittle and rarely exercised code to keep the tracking of the physical register intact, we want to just remove the corresponding LiveRange -- it didn't exist before anyway, and subsequent passes don't expect it to be there. Reviewers: MatzeB, arsenm Subscribers: llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D22801 llvm-svn: 278920	2016-08-17 09:34:55 +00:00
Ayman Musa	71b43c5c1d	Fix bug in DAGBuilder for getelementptr with expanded vector. Replacing the usage of MVT with EVT in case the vector type is expanded. Differential Revision: https://reviews.llvm.org/D23306 llvm-svn: 278913	2016-08-17 07:52:15 +00:00
Chandler Carruth	5f6d73b1fa	[LTO] Fix a use-after-free introduced in r278907 and caught by ASan. The ASan build bot caught this right away: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/15580/steps/check-llvm%20asan/logs/stdio This was also breaking a Windows build bot I'm pretty sure. llvm-svn: 278912	2016-08-17 07:48:34 +00:00
Ayman Musa	c96f421ad4	First commit (test commit) - Adding empty line. llvm-svn: 278910	2016-08-17 07:37:34 +00:00
Chandler Carruth	bd5ad0df51	Restrict the use of the C++17 attribute to C++17 (at least as best we can given the current __cplusplus definitions). Without this, Clang triggers TONS of warnings about using a C++17 extension. I tried using LLVM_EXTENSION to turn these off and it doesn't work. Suggestions on a better approach are welcome, but at least this makes the build usable for me again. llvm-svn: 278909	2016-08-17 07:18:44 +00:00
Mehdi Amini	970800e0c8	[LTO] Introduce an Output class to wrap the output stream creation (NFC) Summary: While NFC for now, this will allow more flexibility on the client side to hold state necessary to back up the stream. Also when adding caching, this class will grow in complexity. Note I blindly modified the gold-plugin as I can't compile it. Reviewers: tejohnson Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D23542 llvm-svn: 278907	2016-08-17 06:23:09 +00:00
Justin Bogner	14f383e9c4	Fix a use of LLVM_FALLTHROUGH that wasn't even in a switch. I was over-aggressive in my conversions from comments to the fallthrough attribute. llvm-svn: 278903	2016-08-17 05:25:38 +00:00
Justin Bogner	b03fd12cef	Replace "fallthrough" comments with LLVM_FALLTHROUGH This is a mechanical change of comments in switches like fallthrough, fall-through, or fall-thru to use the LLVM_FALLTHROUGH macro instead. llvm-svn: 278902	2016-08-17 05:10:15 +00:00
Chuang-Yu Cheng	f7ba716bcb	[ppc64] Don't apply sibling call optimization if callee has any byval arg This is a quick work around, because in some cases, e.g. caller's stack size > callee's stack size, we are still able to apply sibling call optimization even callee has any byval arg. This patch fix: https://llvm.org/bugs/show_bug.cgi?id=28328 Reviewers: hfinkel kbarton nemanjai amehsan Subscribers: hans, tjablin https://reviews.llvm.org/D23441 llvm-svn: 278900	2016-08-17 03:17:44 +00:00
Zijiao Ma	6ed8da0049	Remove the Triple tests that stressing the TargetParser's behaviour. Now the tests of TargetParser is in place: unittests/Support/TargetParserTest.cpp. So the tests in TripleTest.cpp which actually stressing TargetParser's behavior could be removed. llvm-svn: 278899	2016-08-17 03:17:07 +00:00
Chandler Carruth	67fc52f067	[PM] Port the always inliner to the new pass manager in a much more minimal and boring form than the old pass manager's version. This pass does the very minimal amount of work necessary to inline functions declared as always-inline. It doesn't support a wide array of things that the legacy pass manager did support, but is alse ... about 20 lines of code. So it has that going for it. Notably things this doesn't support: - Array alloca merging - To support the above, bottom-up inlining with careful history tracking and call graph updates - DCE of the functions that become dead after this inlining. - Inlining through call instructions with the always_inline attribute. Instead, it focuses on inlining functions with that attribute. The first I've omitted because I'm hoping to just turn it off for the primary pass manager. If that doesn't pan out, I can add it here but it will be reasonably expensive to do so. The second should really be handled by running global-dce after the inliner. I don't want to re-implement the non-trivial logic necessary to do comdat-correct DCE of functions. This means the -O0 pipeline will have to be at least 'always-inline,global-dce', but that seems reasonable to me. If others are seriously worried about this I'd like to hear about it and understand why. Again, this is all solveable by factoring that logic into a utility and calling it here, but I'd like to wait to do that until there is a clear reason why the existing pass-based factoring won't work. The final point is a serious one. I can fairly easily add support for this, but it seems both costly and a confusing construct for the use case of the always inliner running at -O0. This attribute can of course still impact the normal inliner easily (although I find that a questionable re-use of the same attribute). I've started a discussion to sort out what semantics we want here and based on that can figure out if it makes sense ta have this complexity at O0 or not. One other advantage of this design is that it should be quite a bit faster due to checking for whether the function is a viable candidate for inlining exactly once per function instead of doing it for each call site. Anyways, hopefully a reasonable starting point for this pass. Differential Revision: https://reviews.llvm.org/D23299 llvm-svn: 278896	2016-08-17 02:56:20 +00:00
Matthias Braun	08f4704ec8	IfConversion: Use references instead of pointers where possible; NFC Also put some commonly used subexpressions into variables. llvm-svn: 278895	2016-08-17 02:52:01 +00:00
Matthias Braun	b1e0558df4	IfConversion: Use range based for; NFC Also avoid some pointless use of auto! Because that's friendlier to readers and avoids several types accidentally resolving to unnecessary references here (MachineInstr *&, unsigned &). llvm-svn: 278894	2016-08-17 02:51:59 +00:00
Matthias Braun	2c931798d6	IfConversion: Improve doxygen comments llvm-svn: 278893	2016-08-17 02:51:57 +00:00
Chandler Carruth	f702d8ecb6	[Inliner] Add a flag to disable manual alloca merging in the Inliner. This is off for now while testing can take place to make sure that in fact we do sufficient stack coloring to fully obviate the manual alloca array merging. Some context on why we should be using stack coloring rather than merging allocas in this way: LLVM relies very heavily on analyzing pointers as coming from different allocas in order to make aliasing decisions. These are some of the most powerful aliasing signals available in LLVM. So merging allocas is an extremely destructive operation on the LLVM IR -- it takes away highly valuable and hard to reconstruct information. As a consequence, inlined functions which happen to have array allocas that this pattern matches will fail to be properly interleaved unless SROA manages to hoist everything to an SSA register. Instead, the inliner will have added an unnecessary dependence that one inlined function execute after the other because they will have been rewritten to refer to the same memory. All that said, folks will reasonably want some time to experiment here and make sure there are no significant regressions. A flag should give us an easy knob to test. For more context, see the thread here: http://lists.llvm.org/pipermail/llvm-dev/2016-July/103277.html http://lists.llvm.org/pipermail/llvm-dev/2016-August/103285.html Differential Revision: https://reviews.llvm.org/D23052 llvm-svn: 278892	2016-08-17 02:40:23 +00:00
Zijiao Ma	53d55f45a1	Some places that could using TargetParser in LLVM. NFC. llvm-svn: 278888	2016-08-17 02:08:28 +00:00
Duncan P. N. Exon Smith	4741661573	ADT: Add some missing coverage for iplist::splice These splices are interesting because they involve swapping two nodes in the same list. There are two ways to do this. Assuming: A -> B -> [Sentinel] You can either: - splice B before A, with: L.splice(A, L, B) or - splice A before Sentinel, with: L.splice(L.end(), L, A) to create: B -> A -> [Sentinel] These two swapping-splices are somewhat interesting corner cases for maintaining the list invariants. The tests pass even with my new ilist implementation, but I had some doubts about the latter when I was looking at weird UB effects. Since I can't find equivalent explicit test coverage elsewhere it seems prudent to commit. llvm-svn: 278887	2016-08-17 02:08:08 +00:00
Duncan P. N. Exon Smith	362d120488	Scalar: Avoid dereferencing end() in IndVarSimplify IndVarSimplify::sinkUnusedInvariants calls BasicBlock::getFirstInsertionPt on the ExitBlock and moves instructions before it. This can return end(), so it's not safe to dereference. Add an iterator-based overload to Instruction::moveBefore to avoid the UB. llvm-svn: 278886	2016-08-17 01:54:41 +00:00
George Burgess IV	67c5885d09	[Docs] Fix post-review comments on MemorySSA's docs. Thanks to Sean Silva for bringing these up. :) llvm-svn: 278885	2016-08-17 01:50:54 +00:00
Duncan P. N. Exon Smith	9e3edad932	IPO: Swap \|\| operands to avoid dereferencing end() IsOperandBundleUse conveniently indicates whether std::next(F->arg_begin(),UseIndex) will get to (or past) end(). Check it first to avoid dereferencing end(). llvm-svn: 278884	2016-08-17 01:23:58 +00:00
Duncan P. N. Exon Smith	3bcaa81204	Scalar: Avoid dereferencing end() in InductiveRangeCheckElimination BasicBlock::Create isn't designed to take iterators (which might be end()), but pointers (which might be nullptr). Fix the UB that was converting end() to a BasicBlock* by calling BasicBlock::getNextNode() in the first place. llvm-svn: 278883	2016-08-17 01:16:17 +00:00
Duncan P. N. Exon Smith	6331dc171c	ObjCARC: Don't increment or dereference end() when scanning args When there's only one argument and it doesn't match one of the known functions, return ARCInstKind::CallOrUser rather than falling through to the two argument case. The old behaviour both incremented past and dereferenced end(). llvm-svn: 278881	2016-08-17 01:02:18 +00:00
Duncan P. N. Exon Smith	ec083b59ed	ARM: Avoid dereferencing end() in ARMFrameLowering::emitPrologue llvm::tryFoldSPUpdateIntoPushPop assumes its arguments are valid MachineInstrs. Update ARMFrameLowering::emitPrologue to respect that; when LastPush==end(), it can't possibly be a push instruction anyway. llvm-svn: 278880	2016-08-17 00:53:04 +00:00
Duncan P. N. Exon Smith	00ec93da26	CodeGen: Avoid dereferencing end() in OptimizePHIs::OptimizeBB llvm-svn: 278879	2016-08-17 00:43:59 +00:00
Duncan P. N. Exon Smith	e04fe1a394	Hexagon: Avoid dereferencing end() in HexagonInstrInfo::InsertBranch llvm-svn: 278878	2016-08-17 00:34:00 +00:00
George Burgess IV	3bbeb735ed	[Docs] Add initial MemorySSA documentation. Patch partially by Danny. Differential Revision: https://reviews.llvm.org/D23535 llvm-svn: 278875	2016-08-17 00:17:29 +00:00
Duncan P. N. Exon Smith	db53d99d02	AMDGPU: Avoid looking for the DebugLoc in end() The end() iterator isn't a safe thing to dereference. Pass the DebugLoc into EmitFetchClause and EmitALUClause to avoid it. llvm-svn: 278873	2016-08-17 00:06:43 +00:00
Duncan P. N. Exon Smith	0a12729f99	SimplifyCFG: Avoid dereferencing end() When comparing a User* to a BasicBlock::iterator in passingValueIsAlwaysUndefined, don't dereference the iterator in case it is end(). llvm-svn: 278872	2016-08-16 23:57:56 +00:00
Justin Bogner	39eec466a2	Revert "Write the TPI stream from a PDB to Yaml." This is hitting a "use of undeclared identifier 'skipPadding' error locally and on some bots. This reverts r278869. llvm-svn: 278871	2016-08-16 23:37:10 +00:00
Duncan P. N. Exon Smith	dcbce9c391	CodeGen: Avoid dereferencing end() when unconstifying iterators Rather than doing a funny dance that relies on dereferencing end() not crashing, add some API to MachineInstrBundleIterator to get a non-const version of the iterator. llvm-svn: 278870	2016-08-16 23:34:07 +00:00
Zachary Turner	8321ba5437	Write the TPI stream from a PDB to Yaml. Reviewed By: ruiu, rnk Differential Revision: https://reviews.llvm.org/D23226 llvm-svn: 278869	2016-08-16 23:28:54 +00:00
Justin Bogner	a36f4a25b3	Introduce LLVM_FALLTHROUGH, which expands to the C++17 attribute. This allows you to annotate switch case fallthrough in a better way than a "// FALLTHROUGH" comment. Eventually it would be nice to turn on -Wimplicit-fallthrough, if we can get the code base clean. llvm-svn: 278868	2016-08-16 23:24:13 +00:00
Sanjay Patel	7ad324b396	[InstCombine] add tests for fold with no coverage and missing vector fold llvm-svn: 278867	2016-08-16 23:18:42 +00:00
Kyle Butt	07d61425e3	Codegen: Don't tail-duplicate blocks with un-analyzable fallthrough. If AnalyzeBranch can't analyze a block and it is possible to fallthrough, then duplicating the block doesn't make sense, as only one block can be the layout predecessor for the un-analyzable fallthrough. Submitted wit a test case, but NOTE: the test case doesn't currently fail. However, the test case fails with D20505 and would have saved me some time debugging. llvm-svn: 278866	2016-08-16 22:56:14 +00:00
Sanjay Patel	60ea1b43d6	[InstCombine] clean up foldICmpAddConstant(); NFCI 1. Fix variable names 2. Add local variables to reduce code 3. Fix code comments 4. Add early exit to reduce indentation 5. Remove 'else' after if -> return 6. Hoist common predicate llvm-svn: 278864	2016-08-16 22:34:42 +00:00
Konstantin Zhuravlyov	e0b87181cf	[AMDGPU] Remove duplicate initialization of SIDebuggerInsertNops pass Differential Revision: https://reviews.llvm.org/D23556 llvm-svn: 278863	2016-08-16 22:30:11 +00:00
David Majnemer	744a8753db	Preserve the assumption cache more often We were clearing it out in LoopUnswitch and InlineFunction instead of attempting to preserve it. llvm-svn: 278860	2016-08-16 22:07:32 +00:00
Sanjay Patel	e47df1ac62	[InstCombine] use m_APInt to allow icmp (sub X, Y), C folds for splat constant vectors llvm-svn: 278859	2016-08-16 21:53:19 +00:00
Duncan P. N. Exon Smith	41cf73ce16	CodeGen: Don't dereference end() in MachineBasicBlock::CorrectExtraCFGEdges The current MachineBasicBlock might be the last block, so FallThru may be past the end(). Use getNextNode(), which will convert to nullptr, rather than &*++, which is invalid if we reach the end(). llvm-svn: 278858	2016-08-16 21:46:03 +00:00
Sanjay Patel	904cd39b05	[x86] Allow merging multiple instances of an immediate within a basic block for code size savings, for 64-bit constants. This patch handles 64-bit constants which can be encoded as 32-bit immediates. It extends the functionality added by https://reviews.llvm.org/D11363 for 32-bit constants to 64-bit constants. Patch by Sunita Marathe! Differential Revision: https://reviews.llvm.org/D23391 llvm-svn: 278857	2016-08-16 21:35:16 +00:00
Kostya Serebryany	3044390af1	[libFuzzer] minor speed improvement llvm-svn: 278856	2016-08-16 21:28:05 +00:00
Sanjay Patel	b9aa67bfcf	[InstCombine] fix variable names to match formula comments; NFC llvm-svn: 278855	2016-08-16 21:26:10 +00:00
David Majnemer	110522bc0f	[LoopUnroll] Don't clear out the AssumptionCache on each loop Clearing out the AssumptionCache can cause us to rescan the entire function for assumes. If there are many loops, then we are scanning over the entire function many times. Instead of clearing out the AssumptionCache, register all cloned assumes. llvm-svn: 278854	2016-08-16 21:09:46 +00:00
Reid Kleckner	b99b709068	Revert "Enhance SCEV to compute the trip count for some loops with unknown stride." This reverts commit r278731. It caused http://crbug.com/638314 llvm-svn: 278853	2016-08-16 21:02:04 +00:00
Matt Arsenault	b8037a1bd3	TailDuplicator: Use range loops llvm-svn: 278847	2016-08-16 20:38:05 +00:00
Evandro Menezes	5a5b8dcd32	[AArch64] Adjust the scheduling model for Exynos M1. Refine the model for the FP division unit. llvm-svn: 278846	2016-08-16 20:35:01 +00:00
Evandro Menezes	d03aff2e11	[AArch64] Adjust the scheduling model for Exynos M1. Refine the model for the integer division unit. llvm-svn: 278845	2016-08-16 20:34:58 +00:00
Matt Arsenault	7f19298bfa	AMDGPU: Remove excessive padding from ImmOp and RegOp. The structs ImmOp and RegOp are in AArch64AsmParser.cpp (inside anonymous namespace). This diff changes the order of fields and removes the excessive padding (8 bytes). Patch by Alexander Shaposhnikov llvm-svn: 278844	2016-08-16 20:28:06 +00:00
Reid Kleckner	43231bc19b	Fix an instance of -Wmicrosoft-enum-value by making the enum unsigned llvm-svn: 278843	2016-08-16 20:22:49 +00:00
Haicheng Wu	9780df5385	[BranchFolding] Change a test case of r278575. Rename the operands to make the test less brittle. llvm-svn: 278841	2016-08-16 20:06:25 +00:00
Sjoerd Meijer	15c81b05ea	[MBP] do not reorder and move up loop latch block Do not reorder and move up a loop latch block before a loop header when optimising for size because this will generate an extra unconditional branch. Differential Revision: https://reviews.llvm.org/D22521 llvm-svn: 278840	2016-08-16 19:50:33 +00:00
Kostya Serebryany	d46a59fac4	[libFuzzer] new experimental feature: value profiling. Profiles values that affect control flow and treats new values as new coverage. llvm-svn: 278839	2016-08-16 19:33:51 +00:00
Benjamin Kramer	0464ae83e7	Remove excessive padding from LineNoCacheTy The struct LineNoCacheTy is in SourceMgr.cpp inside anonymous namespace. This diff changes the order of fields and removes the excessive padding (8 bytes). Patch by Alexander Shaposhnikov! Differential revision: https://reviews.llvm.org/D23546 llvm-svn: 278838	2016-08-16 19:20:10 +00:00
David Majnemer	00940fb854	Make MDNode::intersect faster than O(n * m) It is pretty easy to get it down to O(nlogn + mlogm). This implementation has the added benefit of automatically deduplicating entries between the two sets. llvm-svn: 278837	2016-08-16 18:48:37 +00:00
David Majnemer	fa0f1e660b	Don't passively concatenate MDNodes I have audited all the callers of concatenate and none require duplicate entries to service concatenation. These duplicates serve no purpose but to needlessly embiggen the IR. N.B. Layering getMostGenericAliasScope on top of concatenate makes it O(nlogn + mlogm) instead of O(n*m). llvm-svn: 278836	2016-08-16 18:48:34 +00:00
Krzysztof Parzyszek	1d01a79304	[Hexagon] Standardize next batch of pseudo instructions ALIGNA PS_aligna ALLOCA PS_alloca TFR_FI PS_fi TFR_FIA PS_fia TFR_PdFalse PS_false TFR_PdTrue PS_true VMULW PS_vmulw VMULW_ACC PS_vmulw_acc llvm-svn: 278832	2016-08-16 18:08:40 +00:00
Gor Nishanov	74309fa014	[Coroutines] Part 7: Split coroutine into subfunctions Summary: This patch adds simple coroutine splitting logic to CoroSplit pass. Documentation and overview is here: http://llvm.org/docs/Coroutines.html. Upstreaming sequence (rough plan) 1.Add documentation. (https://reviews.llvm.org/D22603) 2.Add coroutine intrinsics. (https://reviews.llvm.org/D22659) ... 7. Split coroutine into subfunctions <= we are here 8. Coroutine Frame Building algorithm 9. Handle coroutine with unwinds 10+. The rest of the logic Reviewers: majnemer Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23461 llvm-svn: 278830	2016-08-16 18:04:14 +00:00
Sanjay Patel	a3f4f0828b	[InstCombine] add helper functions for foldICmpWithConstant; NFCI Besides breaking up a 700 line function to improve readability, this sinks the 'FIXME: ConstantInt' check into each helper. So now we can independently break that restriction within any of the helper functions. As much as possible, the code was only {cut/paste/clang-format}'ed to minimize risk (no functional changes intended), so several more readability improvements are still possible. llvm-svn: 278828	2016-08-16 17:54:36 +00:00
Kostya Serebryany	c98ef718ea	[libFuzzer] refactoring around PCMap, NFC llvm-svn: 278825	2016-08-16 17:37:13 +00:00
Simon Dardis	4893aff94e	[mips] Enforce compact branch restrictions Check both operands for use of the $zero register which cannot be used with a compact branch instruction. Reviewers: dsanders, vkalintris Differential Review: https://reviews.llvm.org/D23547 llvm-svn: 278824	2016-08-16 17:16:11 +00:00
Krzysztof Parzyszek	eabc0d0fd5	[Hexagon] Clean up some miscellaneous V60 intrinsics a bit llvm-svn: 278823	2016-08-16 17:14:44 +00:00
Wolfgang Pieb	8df58f48dd	When the inline spiller rematerializes an instruction, take the debug location from the instruction that immediately follows the rematerialization point. Patch by Andrea DiBiagio. Differential Revision: http://reviews.llvm.org/D23539 llvm-svn: 278822	2016-08-16 17:12:50 +00:00
Wei Mi	db68c9adbd	Remove a stale comment from the test, NFC. llvm-svn: 278821	2016-08-16 16:57:15 +00:00
Vitaly Buka	1ce73ef11c	[Asan] Unpoison red zones even if use-after-scope was disabled with runtime flag Summary: PR27453 Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23481 llvm-svn: 278818	2016-08-16 16:24:10 +00:00
Sanjay Patel	1e5b2d1611	[InstCombine] use m_APInt in foldICmpWithConstant; NFCI There's some formatting and pointer deref ugliness here that I intend to fix in subsequent patches. The overall goal is to refactor the obnoxiously long switch and incrementally remove the restriction to scalar types (allow folds for vector splats). This patch introduces the use of m_APInt which means the RHSV reference is now a pointer (and may have matched a vector splat), but the check of 'RHS' remains, so vector folds are disallowed and no functional change is intended. llvm-svn: 278816	2016-08-16 16:08:11 +00:00
Krzysztof Parzyszek	17aa4136a2	[Hexagon] Standardize vector predicate load/store pseudo instructions - Remove unused instructions: LDriq_pred_vec_V6, STriq_pred_vec_V6, and the 128B counterparts. - Rename: LDriq_pred_V6 PS_vloadrq_ai LDriq_pred_V6_128B PS_vloadrq_ai_128B STriq_pred_V6 PS_vstorerq_ai STriq_pred_V6_128B PS_vstorerq_ai_128B llvm-svn: 278813	2016-08-16 15:43:54 +00:00
Ahmed Bougacha	e4c03abddd	[AArch64][GlobalISel] Select G_MUL. llvm-svn: 278810	2016-08-16 14:37:46 +00:00
Ahmed Bougacha	329a1fea32	[GlobalISel] Fix G_MUL comment. NFC. llvm-svn: 278809	2016-08-16 14:37:43 +00:00
Ahmed Bougacha	59e160a19c	[AArch64][GlobalISel] Factor out unsupported binop check. NFC. We're going to need it for G_MUL, and, if other targets end up using something similar, we can easily put it in the generic selector. llvm-svn: 278808	2016-08-16 14:37:40 +00:00
David Callahan	947be0fa66	[ADCE] Modify data structures to support removing control flow Summary: This is part of a serious of patches to evolve ADCE.cpp to support removing of unnecessary control flow. This patch changes the data structures to hold liveness information to support the additional information we will eventually need. In particular we now have a notion of basic blocks being live because they contain a live operations. This will eventually feed into control dependence analysis of which branches are live. We cater to getting from instructions to associated block information and from blocks to information about their terminators. This patch also changes the structure of the main loop of the algorithm so that it alternates propagating liveness between instructions and usign control dependence information to mark branches live. We force all terminators live for now until we add code to handlinge removing control flow in a later patch. No changes to effective behavior with this patch Previous patches: D23065 [ADCE] Refactor anticipating new functionality (NFC) D23102 [ADCE] Refactoring for new functionality (NFC) Reviewers: nadav, majnemer, mehdi_amini Subscribers: freik, twoh, llvm-commits Differential Revision: https://reviews.llvm.org/D23225 llvm-svn: 278807	2016-08-16 14:31:51 +00:00
Brendon Cahoon	65b6ebccad	[Pipeliner] Fix an asssert due to invalid Phi in the epilog The pipeliner was generating an invalid Phi name for an operand in the epilog block, which caused an assert in the live variable analysis pass. The fix is to the code that generates new Phis in the epilog block. In this case, there is an existing Phi that needs to be reused rather than creating a new Phi instruction. Differential Revision: https://reviews.llvm.org/D23513 llvm-svn: 278805	2016-08-16 14:29:24 +00:00
Ahmed Bougacha	2ac5bf94bc	[AArch64][GlobalISel] Select (variable) shifts. For now, no support for immediates. llvm-svn: 278804	2016-08-16 14:02:47 +00:00
Ahmed Bougacha	7e508a8fcd	[AArch64][GlobalISel] Robustize select tests. NFC. Using the same register means nothing was checking for operand order. llvm-svn: 278803	2016-08-16 14:02:44 +00:00
Ahmed Bougacha	0306b5ef07	[AArch64][GlobalISel] Select p0 G_FRAME_INDEX. And mark it as legal. llvm-svn: 278802	2016-08-16 14:02:42 +00:00
Ahmed Bougacha	66d9dc2f7a	[GlobalISel] Mention pointers in LowLevelType.h. NFC. llvm-svn: 278801	2016-08-16 14:02:36 +00:00
Pierre Gousseau	051db7d838	[x86] Refactor a PowerPC specific ctlz/srl transformation (NFC). Following the discussion on D22038, this refactors a PowerPC specific setcc -> srl(ctlz) transformation so it can be used by other targets. Differential Revision: https://reviews.llvm.org/D23445 llvm-svn: 278799	2016-08-16 13:53:53 +00:00
Simon Pilgrim	25d2506029	[X86][AVX] Fixed typo in zero element insertion llvm-svn: 278798	2016-08-16 13:33:33 +00:00
Ron Lieberman	a481c7db93	[Hexagon] Improve test to check for @PCREL, only run llc, not opt -> llc. llvm-svn: 278796	2016-08-16 13:10:09 +00:00
Sagar Thakur	e311740bde	[MemorySanitizer] [MIPS] Changed memory mapping to support pie executable. Reviewed by eugenis Differential: D22994 llvm-svn: 278795	2016-08-16 12:55:38 +00:00
Simon Pilgrim	cc316f013a	[X86][SSE] Add support for combining v2f64 target shuffles to VZEXT_MOVL byte rotations The combine was only matching v2i64 as it assumed lowering to MOVQ - but we have v2f64 patterns that match in a similar fashion llvm-svn: 278794	2016-08-16 12:52:06 +00:00
Simon Pilgrim	d2d3202532	[X86][AVX512BW] Updated tests to demonstrate AVX512BW's inability to vectorize v64i8 shifts llvm-svn: 278790	2016-08-16 11:05:47 +00:00
Prakhar Bahuguna	a27c4a0e66	Correct the upper bound for a CBZ/CBNZ branch target. Summary: Fix for the upper bound check that was causing a build failure. Reviewers: olista01, rengolin, t.p.northover Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23501 llvm-svn: 278789	2016-08-16 10:41:56 +00:00
Prakhar Bahuguna	15ed7ec5aa	[Thumb] Validate branch target for CBZ/CBNZ instructions. Summary: The assembler currently does not check the branch target for CBZ/CBNZ instructions, which only permit branching forwards with a positive offset. This adds validation for the branch target to ensure negative PC-relative offsets are not encoded into the instruction, whether specified as a literal or as an assembler symbol. Reviewers: rengolin, t.p.northover Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D23312 llvm-svn: 278788	2016-08-16 10:41:52 +00:00
Simon Pilgrim	f16cd361d4	[X86][SSE] Add support for combining target shuffles to PALIGNR byte rotations llvm-svn: 278787	2016-08-16 10:03:23 +00:00
Job Noorman	6cd8c9a9d6	[AVR] Fix compile errors Differential Revision: https://reviews.llvm.org/D23450 llvm-svn: 278784	2016-08-16 08:41:35 +00:00
Guy Blank	722caebdae	[X86] Add xgetbv/xsetbv intrinsics to non-windows platforms Differential Revision: https://reviews.llvm.org/D21958 llvm-svn: 278782	2016-08-16 06:41:00 +00:00
David Majnemer	5c5df6283a	[InstSimplify] Fold gep (gep V, C), (xor V, -1) to C-1 llvm-svn: 278779	2016-08-16 06:13:46 +00:00
Mehdi Amini	88c491ddec	FunctionImport: missed one occurence of ImportListForModule to rename (NFC) llvm-svn: 278778	2016-08-16 05:49:12 +00:00
Mehdi Amini	9b490f10e1	FunctionImport: rename ImportsForModule to ImportList for consistency (NFC) llvm-svn: 278777	2016-08-16 05:47:12 +00:00
Mehdi Amini	cdbcbf7477	[LTO] Simplify APIs and constify (NFC) Summary: Multiple APIs were taking a StringMap for the ImportLists containing the entries for for all the modules while operating on a single entry for the current module. Instead we can pass the desired ModuleImport directly. Also some of the APIs were not const, I believe just to be able to use operator[] on the StringMap. Reviewers: tejohnson Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23537 llvm-svn: 278776	2016-08-16 05:46:05 +00:00
Sanjay Patel	46a68ba618	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278768	2016-08-16 00:48:38 +00:00
Mehdi Amini	acc50c4334	[LTO] Rename variables with meaningul names, i.e. more than one character (NFC) llvm-svn: 278766	2016-08-16 00:44:46 +00:00
Sanjay Patel	f1bf21c56b	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278765	2016-08-16 00:27:12 +00:00
Reid Kleckner	229d32abfc	[AMDGPU] Give enum an explicit 64-bit type to fix MSVC 2013 failures Recall that MSVC always gives enums the type 'int', nothing else. MSVC 2015 does not appear to have this problem anymore. Clang-cl -Wmicrosoft-enum-value flags this, FWIW, so now I have a true positive for my warning. :) llvm-svn: 278762	2016-08-15 23:54:44 +00:00
Teresa Johnson	c44a12244f	[ThinLTO] Fix temp file dumping, enable via llvm-lto and test it Summary: Fixed a bug in ThinLTOCodeGenerator's temp file dumping. The Twine needs to be passed directly as an argument, or a copy saved into a std::string. It doesn't seem there are any consumers of this, so I added a new option to llvm-lto to enable saving of temp files during ThinLTO, and augmented a test to use it to check post-import but pre-opt bitcode. Reviewers: mehdi_amini Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23525 llvm-svn: 278761	2016-08-15 23:24:57 +00:00
Reid Kleckner	a7b04a589e	Don't use %llc_dwarf with -mtriple, they don't combine llvm-svn: 278758	2016-08-15 22:54:26 +00:00
Sanjay Patel	df77a4dbb0	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278757	2016-08-15 22:43:52 +00:00
Justin Bogner	375f71e3a3	Linker: Avoid some ridiculous indentation by using a temporary. NFC This was indented really awkwardly, and clang-format didn't seem to know how to do any better. Avoid the issue with a temporary variable. llvm-svn: 278756	2016-08-15 22:41:42 +00:00
Wolfgang Pieb	12cd6ddef3	Adding the triple for test comitted with r278703. llvm-svn: 278755	2016-08-15 22:39:39 +00:00
Mike Aizatsky	5086417e9f	[sancov] extracting AArch64 test to a separate file. llvm-svn: 278754	2016-08-15 22:30:37 +00:00
Tim Shen	75ca2ac329	[ADT] Fix DepthFirstIterator's std::iterator base to have normal typedefs Summary: This is similiar to r278752, where I found that the std::iterator<...> base can be normal. Reviewers: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23527 llvm-svn: 278753	2016-08-15 22:07:30 +00:00
Tim Shen	e0793db41d	[ADT] Change PostOrderIterator to use NodeRef. NFC. Reviewers: dblaikie Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D23522 llvm-svn: 278752	2016-08-15 21:52:54 +00:00
Sanjay Patel	41520e1712	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278751	2016-08-15 21:47:50 +00:00
Eli Friedman	98151d6440	Fix typo in lowering for fp128 ueq. Regression from r259791. Differential Revision: https://reviews.llvm.org/D23374 llvm-svn: 278750	2016-08-15 21:46:19 +00:00
Jan Vesely	0486f739a4	AMDGPU/R600: Convert buffer id to VTX_READ input Use patterns instead of multiple instructions Add buffer id to asm string https://reviews.llvm.org/D22650 llvm-svn: 278749	2016-08-15 21:38:30 +00:00
Hemant Kulkarni	533aa25e1c	Really fix the issue with 502957cc9cf805dc6093950e8cdcd0db4969d933. Windows %p and FileCheck limitations makes the test linux only llvm-svn: 278748	2016-08-15 21:38:23 +00:00
Sanjay Patel	638b613101	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278747	2016-08-15 21:37:24 +00:00
Tim Northover	28fdc4272d	GlobalISel: support loads and stores of strange types. Before we mischaracterized structs and i1 types as a scalar with size 0 in various ways. llvm-svn: 278744	2016-08-15 21:13:17 +00:00
Teresa Johnson	ad71543972	Remove unnecessary flag from new test Remove -disable-inlining flag that snuck into the test I added for r278739. It doesn't have an effect in ThinLTO mode (something that should be fixed), but in any case the checks depend on inlining currently. llvm-svn: 278743	2016-08-15 21:07:57 +00:00
Sanjay Patel	55d87a88cc	update tests to use FileCheck and exact checking llvm-svn: 278741	2016-08-15 21:02:25 +00:00
Sanjoy Das	78db2963f6	Revert "[ValueTracking] Improve ValueTracking on left shift with nsw flag" This reverts commit r278172. It causes PR28946. llvm-svn: 278740	2016-08-15 21:01:31 +00:00
Teresa Johnson	6107a4195d	[ThinLTO] Remove functions resolved to available_externally from comdats Summary: thinLTOResolveWeakForLinkerModule needs to drop any preempted weak symbols that were converted to available_externally from comdats, otherwise we will get a verification failure (since available_externally is a declaration for the linker, and no declarations can be in a comdat). Reviewers: mehdi_amini Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23015 llvm-svn: 278739	2016-08-15 21:00:04 +00:00
Sanjay Patel	3e9acec2fa	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278737	2016-08-15 20:56:11 +00:00
Hemant Kulkarni	5b140cdd17	Fix a test that failed due to: https://llvm.org/svn/llvm-project/llvm/trunk@278725 91177308-0d34-0410-b5e6-96231b3b80d8 llvm-svn: 278732	2016-08-15 20:36:16 +00:00
David L Kreitzer	7fe18251a5	Enhance SCEV to compute the trip count for some loops with unknown stride. Patch by Pankaj Chawla Differential Revision: https://reviews.llvm.org/D22377 llvm-svn: 278731	2016-08-15 20:21:41 +00:00
Sanjay Patel	b37bd6d7b7	[InstCombine] add test for missing vector icmp fold llvm-svn: 278727	2016-08-15 20:02:40 +00:00
Sanjay Patel	7d98be81cc	[InstCombine] add tests for vector icmp folds llvm-svn: 278726	2016-08-15 19:58:21 +00:00
Hemant Kulkarni	8dfc0b5541	llvm-objdump: Implement source[line numbers] interleaving Differential Revsion: https://reviews.llvm.org/D22932 llvm-svn: 278725	2016-08-15 19:49:24 +00:00
Kostya Serebryany	bdb220c7a0	[libFuzzer] print a verbose message after executing inputs in non-fuzzing mode llvm-svn: 278724	2016-08-15 19:44:04 +00:00
Kostya Serebryany	a0d40a21e7	[libFuzzer] fix the bot llvm-svn: 278721	2016-08-15 19:36:13 +00:00
Sanjay Patel	b860859611	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278717	2016-08-15 19:16:33 +00:00
Sanjay Patel	6866b82a05	update test to use FileCheck and autogenerated checks llvm-svn: 278714	2016-08-15 18:56:10 +00:00
Reid Kleckner	bb8652312a	Fix WAsm test after LSR change in r278658 Now the increment is done in a different location llvm-svn: 278713	2016-08-15 18:51:42 +00:00
Matthias Braun	b948c52416	Revert "[Thumb] Validate branch target for CBZ/CBNZ instructions." This currently breaks the greendragon clang-stage1-configure-RA/ and brotli. It is probably just uncovering a pre-existing problem. Reverting temporarily to get the buildbots green again. A reduced testcase will follow shortly. This reverts commit r278659. llvm-svn: 278711	2016-08-15 18:50:13 +00:00
Sanjay Patel	2044a8eba9	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278709	2016-08-15 18:45:10 +00:00
Sanjay Patel	aaf34d1bfc	[InstCombine] add test for missing vector icmp fold llvm-svn: 278708	2016-08-15 18:39:54 +00:00
Sanjay Patel	195eb9340a	minimize test llvm-svn: 278707	2016-08-15 18:35:44 +00:00
Sanjay Patel	3f506daf8c	remove unnecessary IR comments about uses llvm-svn: 278705	2016-08-15 18:32:50 +00:00
Sanjay Patel	d391b0d69e	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278704	2016-08-15 18:26:56 +00:00
Wolfgang Pieb	dfad9b20c9	Local variables whose address is taken and passed on to a call are described in debug info using their stack slots instead of as an indirection of param reg + 0 offset. This is done by detecting FrameIndexSDNodes in SelectionDAG and generating FrameIndexDbgValues for them. This ultimately generates DBG_VALUEs with stack location operands. Differential Revision: http://reviews.llvm.org/D23283 llvm-svn: 278703	2016-08-15 18:18:26 +00:00
Sanjay Patel	cbd62a082c	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278689	2016-08-15 17:55:39 +00:00
Kostya Serebryany	dfbe59b03d	[libFuzzer] add InsertRepeatedBytes and EraseBytes. New mutation: InsertRepeatedBytes. Updated mutation: EraseByte => EraseBytes. This helps https://github.com/google/sanitizers/issues/710 where libFuzzer was not able to find a known bug. Now it finds it in minutes. Hopefully, the change is general enough to help other targets. llvm-svn: 278687	2016-08-15 17:48:28 +00:00
Sanjay Patel	566b348987	[InstCombine] auto-generate exact checks Note that several of these tests belong in InstSimplify rather than InstCombine because they return existing operands or constants. llvm-svn: 278684	2016-08-15 17:19:07 +00:00
Sanjay Patel	a7b9bb3785	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278683	2016-08-15 17:10:35 +00:00
Yaxun Liu	c7cbd72921	AMDGPU: Update AMDGPURuntimeMetadata.h for enums of address space qualifiers llvm-svn: 278682	2016-08-15 16:54:25 +00:00
Matt Arsenault	3661e90e71	AMDGPU: Don't fold subregister extracts into tied operands llvm-svn: 278676	2016-08-15 16:18:36 +00:00
Reid Kleckner	70a600b8bb	Revert "[SimplifyCFG] Rewrite SinkThenElseCodeToEnd" This reverts commit r278660. It causes downstream assertion failure in InstCombine on shuffle instructions. Comes up in __mm_swizzle_epi32. llvm-svn: 278672	2016-08-15 15:42:31 +00:00
Valery Pykhtin	c761675ef4	[AMDGPU] fix failure on printing of non-existing instruction operands. Differential revision: https://reviews.llvm.org/D23323 llvm-svn: 278665	2016-08-15 10:56:48 +00:00
Sjoerd Meijer	58156715b4	MachineLoop: add methods findLoopControlBlock and findLoopPreheader This adds two new utility functions findLoopControlBlock and findLoopPreheader to MachineLoop and MachineLoopInfo. These functions are refactored and taken from the Hexagon target as they are target independent; thus this is intendend to be a non-functional change. Differential Revision: https://reviews.llvm.org/D22959 llvm-svn: 278661	2016-08-15 08:22:42 +00:00
James Molloy	9a3c82f5cf	[SimplifyCFG] Rewrite SinkThenElseCodeToEnd The new version has several advantages: 1) IMSHO it's more readable and neater 2) It handles loads and stores properly 3) It can handle any number of incoming blocks rather than just two. I'll be taking advantage of this in a followup patch. With this change we can now finally sink load-modify-store idioms such as: if (a) return b += 3; else return b += 4; => %z = load i32, i32* %y %.sink = select i1 %a, i32 5, i32 7 %b = add i32 %z, %.sink store i32 %b, i32* %y ret i32 %b When this works for switches it'll be even more powerful. llvm-svn: 278660	2016-08-15 08:04:56 +00:00
Prakhar Bahuguna	a305a435a6	[Thumb] Validate branch target for CBZ/CBNZ instructions. Summary: The assembler currently does not check the branch target for CBZ/CBNZ instructions, which only permit branching forwards with a positive offset. This adds validation for the branch target to ensure negative PC-relative offsets are not encoded into the instruction, whether specified as a literal or as an assembler symbol. Reviewers: rengolin, t.p.northover Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D23312 llvm-svn: 278659	2016-08-15 07:57:44 +00:00
James Molloy	196ad0823e	[LSR] Don't try and create post-inc expressions on non-rotated loops If a loop is not rotated (for example when optimizing for size), the latch is not the backedge. If we promote an expression to post-inc form, we not only increase register pressure and add a COPY for that IV expression but for all IVs! Motivating testcase: void f(float a, float b, float c, int n) { while (n-- > 0) c++ = a++ + b++; } It's imperative that the pointer increments be located in the latch block and not the header block; if not, we cannot use post-increment loads and stores and we have to keep both the post-inc and pre-inc values around until the end of the latch which bloats register usage. llvm-svn: 278658	2016-08-15 07:53:03 +00:00
Craig Topper	f774de6d54	[X86] PADDUSB/W instructions should be commutable. llvm-svn: 278654	2016-08-15 06:31:57 +00:00
Craig Topper	80c8b80919	[X86] Mark some of the X86 SDNodes as commutative. llvm-svn: 278653	2016-08-15 04:47:30 +00:00
Craig Topper	dbc387cfc9	[X86] X86ISD::FANDN is not commutative or associative. llvm-svn: 278652	2016-08-15 04:47:28 +00:00
David Majnemer	3b47a5a562	[ScopedNoAliasAA] collectMDInDomain should be a free function collectMDInDomain doesn't use any class members, making it a free function is not a functional change. llvm-svn: 278651	2016-08-15 03:56:06 +00:00
David Majnemer	8b8869f8ef	[ScopedNoAliasAA] Only collect noalias nodes if we have alias.scope nodes No functional change is intended. llvm-svn: 278646	2016-08-15 02:23:50 +00:00
David Majnemer	ddc7ab26fc	[ScopedNoAliasAA] Replace !ScopeNodes.size() with ScopeNodes.empty() No functional change is intended. llvm-svn: 278645	2016-08-15 02:23:48 +00:00
David Majnemer	c77a1390de	Revert "[ScopedNoAliasAA] Remove an unneccesary set" This reverts commit r278641. I'm not sure why but this has upset the multistage builders... llvm-svn: 278644	2016-08-15 02:23:46 +00:00
David Majnemer	5ec9c58f13	[ScopedNoAliasAA] Remove an unneccesary set We are trying to prove that one group of operands is a subset of another. We did this by populating two Sets and determining that every element within one was inside the other. However, this is unnecessary. We can simply construct a single set and test if each operand is within it. llvm-svn: 278641	2016-08-15 00:13:04 +00:00
Sanjay Patel	52fe9ae990	[InstCombine] add test for missing vector icmp fold llvm-svn: 278639	2016-08-14 22:56:46 +00:00
Sanjay Patel	7e57b00274	[InstCombine] add tests for vector icmp folds llvm-svn: 278637	2016-08-14 22:44:10 +00:00
Sanjay Patel	8554f70c07	[InstCombine] add test for potentially missing vector icmp fold llvm-svn: 278636	2016-08-14 22:30:07 +00:00
Sanjay Patel	beebe05af1	[InstCombine] add test for missing vector icmp fold llvm-svn: 278635	2016-08-14 22:29:27 +00:00
Sanjay Patel	ba1f9fbddc	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278634	2016-08-14 22:28:50 +00:00
Sanjay Patel	f6559404d5	[InstCombine] remove unnecessary function attributes from tests llvm-svn: 278633	2016-08-14 21:48:21 +00:00
Sanjay Patel	b44ca3bfa9	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278632	2016-08-14 21:36:22 +00:00
Sanjay Patel	bbb3dffd0a	[InstCombine] add test for missing vector icmp fold llvm-svn: 278631	2016-08-14 21:05:08 +00:00
Sanjay Patel	66a3457a4c	[InstCombine] add test for missing vector icmp fold llvm-svn: 278630	2016-08-14 20:39:42 +00:00
Craig Topper	37e8c5443c	[AVX-512] Mark VPMADDWD as commutable to match SSE/AVX version. llvm-svn: 278629	2016-08-14 17:57:22 +00:00
Craig Topper	c677e97dff	[AVX-512] Add masked commutable floating point max/min instructions to folding tables. llvm-svn: 278628	2016-08-14 17:57:19 +00:00
Craig Topper	29fbdc309a	[AVX-512] Add masked logical operations to memory folding tables. llvm-svn: 278627	2016-08-14 17:57:16 +00:00
Igor Breger	505f2cc468	[AVX512] Fix VFPCLASSSD/VFPCLASSSS intrinsic lowering. The i1 result should be zero extended according to SPEC. Differential Revision: http://reviews.llvm.org/D23489 llvm-svn: 278626	2016-08-14 13:58:57 +00:00
Igor Breger	6fc00b0acf	autogenerate checks llvm-svn: 278624	2016-08-14 09:34:39 +00:00
Igor Breger	8672408db0	[AVX512] Fix insertelement i1 lowering. 1. Use shuffle to insert element i1 into vector. The previous implementation was incorrect ( dest_bit OR src_bit , it doesn't clear the bit if src_bit=0 ) 2. Improve shuffle i1 vector, use CVT2MASK if supported instead TRUNCATE. Differential Revision: http://reviews.llvm.org/D23347 llvm-svn: 278623	2016-08-14 05:25:07 +00:00
Saleem Abdulrasool	98541b09f4	Revert "gold: add a cast to appease std::max NFC" This was fixed differently by Teresa and this should no longer be needed. llvm-svn: 278622	2016-08-14 05:07:20 +00:00
Diana Picus	68be1eb885	Revert "CodeGen: If Convert blocks that would form a diamond when tail-merged." This reverts commit r278287. This commit broke the clang-cmake-thumbv7-a15-full-sh bot. See https://llvm.org/bugs/show_bug.cgi?id=28949 llvm-svn: 278621	2016-08-14 02:10:18 +00:00
Diana Picus	35ccf53e75	Revert "Codegen: Don't tail-duplicate blocks with un-analyzable fallthrough." This reverts commit r278288. r278287 broke the clang-cmake-thumbv7-a15-full-sh bot. Revert this so we can get to r278287. llvm-svn: 278620	2016-08-14 02:10:12 +00:00
Sanjoy Das	35459f0e34	[IRCE] Change variable grouping; NFC llvm-svn: 278619	2016-08-14 01:04:50 +00:00
Sanjoy Das	2143447c73	[IRCE] Create llvm::Loop instances for cloned out loops llvm-svn: 278618	2016-08-14 01:04:46 +00:00
Sanjoy Das	7a18a238c6	[IRCE] Don't iterate on loops that were cloned out IRCE has the ability to further version pre-loops and post-loops that it created, but this isn't useful at all. This change teaches IRCE to leave behind some metadata in the loops it creates (by cloning the main loop) so that these new loops are not re-processed by IRCE. Today this bug is hidden by another bug -- IRCE does not update LoopInfo properly so the loop pass manager does not re-invoke IRCE on the loops it split out. However, once the latter is fixed the bug addressed in this change causes IRCE to infinite-loop in some cases (e.g. it splits out a pre-loop, a pre-pre-loop from that, a pre-pre-pre-loop from that and so on). llvm-svn: 278617	2016-08-14 01:04:36 +00:00
Sanjoy Das	43fdc54303	[IRCE] Add better DEBUG diagnostic; NFC NFC meaning IRCE should not _do_ anything different, but -debug-only=irce will be a little friendlier. llvm-svn: 278616	2016-08-14 01:04:31 +00:00
Mehdi Amini	a71002e7f1	Fix bitcode auto-upgrade when using bitcode lazy loading The auto-upgrade path could be called before the VST (global names) was fully parsed, and thus intrinsic names were not available and the autoupgrade logic could not operate. Fix link failures with ThinLTO. This is a recommit of r278610 with a different fix. llvm-svn: 278615	2016-08-14 00:01:27 +00:00
Ron Lieberman	822ee88ab8	Fix unsupported relocation type R_HEX_6_X' for symbol .rodata LowerTargetConstantPool is not properly setting the TargetFlag to indicate desired relocation. Coding error, the offset parameter was omitted, so the TargetFlag was used as the offset, and the TargetFlag defaulted to zero. This only affects -fpic compilation, and only those items created in a Constant Pool, for example a vector of constants. Halide ran into this issue. llvm-svn: 278614	2016-08-13 23:41:11 +00:00
Mehdi Amini	466a64e298	Revert "Fix bitcode auto-upgrade when using bitcode lazy loading" This reverts commit r278610. Tests are broken llvm-svn: 278613	2016-08-13 23:39:14 +00:00
Sanjoy Das	1b1272f515	[IRCE] Fix test case; NFC The (negative) test case is supposed to check that IRCE does not muck with range checks it cannot handle, not that it does the right thing in the absence of profiling information. llvm-svn: 278612	2016-08-13 23:36:40 +00:00
Sanjoy Das	2a2f14d7ab	[IRCE] Be resilient in the face of non-simplified loops Loops containing `indirectbr` may not be in simplified form, even after running LoopSimplify. Reject then gracefully, instead of tripping an assert. llvm-svn: 278611	2016-08-13 23:36:35 +00:00
Mehdi Amini	e62aaf2303	Fix bitcode auto-upgrade when using bitcode lazy loading The auto-upgrade path could be called before the VST (global names) was fully parsed, and thus intrinsic names were not available and the autoupgrade logic could not operate. Fix link failures with ThinLTO. llvm-svn: 278610	2016-08-13 23:31:53 +00:00
Mehdi Amini	8c629ecf3a	Revert "Revert "Invariant start/end intrinsics overloaded for address space"" This reverts commit 32fc6488e48eafc0ca1bac1bd9cbf0008224d530. llvm-svn: 278609	2016-08-13 23:31:24 +00:00
Mehdi Amini	164ac651da	Revert "Invariant start/end intrinsics overloaded for address space" This reverts commit r276447. llvm-svn: 278608	2016-08-13 23:27:32 +00:00

... 2 3 4 5 6 ...

136850 Commits