llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	9b8a4e301a	Split test in two to handle building without x86. llvm-svn: 231886	2015-03-10 23:44:12 +00:00
Rafael Espindola	b03bc79bed	Add missing section symbol to COFF's .debug_types.dwo. Should bring the cygwin bots back. I added a triple to the test that was failing so that it would have failed on Linux. llvm-svn: 231882	2015-03-10 23:06:32 +00:00
Philip Reames	71c4035c18	If a conditional branch jumps to the same target, remove the condition Given that large parts of inst combine is restricted to instructions which have one use, getting rid of a use on the condition can help the effectiveness of the optimizer. Also, it allows the condition to potentially be deleted by instcombine rather than waiting for another pass. I noticed this completely by accident in another test case. It's not anything that actually came from a real workload. p.s. We should probably do the same thing for switch instructions. Differential Revision: http://reviews.llvm.org/D8220 llvm-svn: 231881	2015-03-10 22:52:37 +00:00
Paul Robinson	857b4434df	Emit correct linkage-name attribute based on DWARF version. There are still 4 tests that check for DW_AT_MIPS_linkage_name, because they specify DWARF 2 or 3 in the module metadata. So, I didn't create an explicit version-based test for the attribute. Differential Revision: http://reviews.llvm.org/D8227 llvm-svn: 231880	2015-03-10 22:44:45 +00:00
Philip Reames	1c29227144	Infer known bits from dominating conditions This patch adds limited support in ValueTracking for inferring known bits of a value from conditional expressions which must be true to reach the instruction we're trying to optimize. At this time, the feature is off by default. Once landed, I'm hoping for feedback from others on both profitability and compile time impact. Forms of conditional value propagation have been tried in LLVM before and have failed due to compile time problems. In an attempt to side step that, this patch only considers conditions where the edge leaving the branch dominates the context instruction. It does not attempt full dataflow. Even with that restriction, it handles many interesting cases: * Early exits from functions * Early exits from loops (for context instructions in the loop and after the check) * Conditions which control entry into loops, including multi-version loops (such as those produced during vectorization, IRCE, loop unswitch, etc..) Possible applications include optimizing using information provided by constructs such as: preconditions, assumptions, null checks, & range checks. This patch implements two approaches to the problem that need further benchmarking. Approach 1 is to directly walk the dominator tree looking for interesting conditions. Approach 2 is to inspect other uses of the value being queried for interesting comparisons. From initial benchmarking, it appears that Approach 2 is faster than Approach 1, but this needs to be further validated. Differential Revision: http://reviews.llvm.org/D7708 llvm-svn: 231879	2015-03-10 22:43:20 +00:00
Quentin Colombet	1b274f99ad	[CodeGenPrepare] Refine the cost model provided by the promotion helper. - Use TargetLowering to check for the actual cost of each extension. - Provide a factorized method to check for the cost of an extension: TargetLowering::isExtFree. - Provide a virtual method TargetLowering::isExtFreeImpl for targets to be able to tune the cost of non-free extensions. This refactoring offers a better granularity to model what really happens on different targets. No performance changes and very few code differences. Part of <rdar://problem/19267165> llvm-svn: 231855	2015-03-10 21:48:15 +00:00
Nemanja Ivanovic	0adf26b9b0	Add support for part-word atomics for PPC http://reviews.llvm.org/D8090#inline-67337 llvm-svn: 231843	2015-03-10 20:51:07 +00:00
Ahmed Bougacha	fab5892f8b	[AArch64] Avoid going through GPRs for across-vector instructions. This adds new node types for each intrinsic. For instance, for addv, we have AArch64ISD::UADDV, such that: (v4i32 (uaddv ...)) is the same as (v4i32 (scalar_to_vector (i32 (int_aarch64_neon_uaddv ...)))) that is, (v4i32 (INSERT_SUBREG (v4i32 (IMPLICIT_DEF)), (i32 (int_aarch64_neon_uaddv ...)), ssub) In a combine, we transform all such across-vector-lanes intrinsics to: (i32 (extract_vector_elt (uaddv ...), 0)) This has one big advantage: by making the extract_element explicit, we enable the existing patterns for lane-aware instructions to fire. This lets us avoid needlessly going through the GPRs. Consider: uint32x4_t test_mul(uint32x4_t a, uint32x4_t b) { return vmulq_n_u32(a, vaddvq_u32(b)); } We now generate: addv.4s s1, v1 mul.4s v0, v0, v1[0] instead of the previous: addv.4s s1, v1 fmov w8, s1 dup.4s v1, w8 mul.4s v0, v1, v0 rdar://20044838 llvm-svn: 231840	2015-03-10 20:45:38 +00:00
Bruno Cardoso Lopes	b3a58b4c3c	[AsmPrinter][TLOF] Reintroduce AArch64 test Follow up from r231505. Fix the non-determinism by using a MapVector and reintroduce the AArch64 testcase. Defer deleting the got candidates up to the end and remove them in a bulk, avoiding linear time removal of each element. Thanks to Renato Golin for trying it out on other platforms. llvm-svn: 231830	2015-03-10 20:05:23 +00:00
Kit Barton	20d3981e15	Change the generation of the vmuluwm instruction to be based on the MUL opcode. Phabricator review: http://reviews.llvm.org/D8185 llvm-svn: 231827	2015-03-10 19:49:38 +00:00
Adam Nemet	58913d65ad	[LoopAccesses 3/3] Print the dependences with -analyze The dependences are now expose through the new getInterestingDependences API so we can use that with -analyze too and fix the FIXME. This lets us remove the test that relied on -debug to check the dependences. llvm-svn: 231807	2015-03-10 17:40:43 +00:00
Igor Laevsky	85f7f727d3	Teach lowering to correctly handle invoke statepoint and gc results tied to them. Note that we still can not lower gc.relocates for invoke statepoints. Also it extracts getCopyFromRegs helper function in SelectionDAGBuilder as we need to be able to customize type of the register exported from basic block during lowering of the gc.result. (Resubmitting this change after not being able to reproduce buildbot failure) Differential Revision: http://reviews.llvm.org/D7760 llvm-svn: 231800	2015-03-10 16:26:48 +00:00
Sanjay Patel	19792fb270	[X86, AVX] replace vinsertf128 intrinsics with generic shuffles We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is the sibling patch for the Clang half of this change: http://reviews.llvm.org/D8088 Differential Revision: http://reviews.llvm.org/D8086 llvm-svn: 231794	2015-03-10 16:08:36 +00:00
Karthik Bhat	8d7f7eda14	Fix a memory corruption in Dependency Analysis. This crash occurs due to memory corruption when trying to update dependency direction based on Constraints. This crash was observed during lnt regression of Polybench benchmark test case dynprog. Review: http://reviews.llvm.org/D8059 llvm-svn: 231788	2015-03-10 14:32:02 +00:00
Karthik Bhat	8d0099bdab	Fix a crash in Dependency Analysis. This crash in Dependency analysis is because we assume here that in case of UsefulGEP both source and destination have the same number of operands which may not be true. This incorrect assumption results in crash while populating Pairs. Fix the same. This crash was observed during lnt regression for code such as- struct s{ int A[10][10]; int C[10][10][10]; } S; void dep_constraint_crash_test(int k,int N) { for( int i=0;i<N;i++) for( int j=0;j<N;j++) S.A[0][0] = S.C[0][0][k]; } Review: http://reviews.llvm.org/D8162 llvm-svn: 231784	2015-03-10 13:31:03 +00:00
Owen Anderson	58364dc4da	Fix a crash in InstCombine where we could try to truncate a switch comparison to zero width. llvm-svn: 231761	2015-03-10 06:51:39 +00:00
Owen Anderson	e90f992b21	Fix a stack overflow in the assembler when checking that GEPs must be over sized types. We failed to use a marking set to properly handle recursive types, which caused use to recurse infinitely and eventually overflow the stack. llvm-svn: 231760	2015-03-10 06:34:57 +00:00
Owen Anderson	3e7e67b5ed	Fix an issue in the verifier where we could try to read information out of a malformed statepoint intrinsic. In this situation we would always have already flagged an error on the statepoint intrinsic, but then we carry on to parse other, related GC intrinsics, and could end up crashing during that verification when they try to access data from the malformed statepoint. llvm-svn: 231759	2015-03-10 05:58:21 +00:00
Owen Anderson	51b75b8c34	Fix an infinite loop in InstCombine when an instruction with no users and side effects can be constant folded. ReplaceInstUsesWith needs to return nullptr when the input has no users, because in that case it does not mutate the program. Otherwise, we can get stuck in an infinite loop of repeatedly attempting to constant fold and instruction with no users. llvm-svn: 231755	2015-03-10 05:13:47 +00:00
Rafael Espindola	fcc2821882	Use a better name for compile unit labels. They mark the start of a compile unit, so name them .Lcu_*. Using Section->getLabelBeginName() makes it looks like they mark the start of the section. While at it, switch to createTempSymbol to avoid collisions with labels created in inline assembly. Not sure if a "don't crash" test is worth it. With this getLabelBeginName is dead, delete it. llvm-svn: 231750	2015-03-10 03:58:36 +00:00
George Burgess IV	ab03af277b	Added ConstantExpr support to CFLAA. CFLAA didn't know how to properly handle ConstantExprs; it would silently ignore them. This was a problem if the ConstantExpr is, say, a GEP of a global, because CFLAA wouldn't realize that there's a global there. :) llvm-svn: 231743	2015-03-10 02:58:15 +00:00
George Burgess IV	b54a8d62a4	Added special handling for inttoptr in CFLAA. We now treat pointers given to ptrtoint and pointers retrieved from inttoptr as similar to arguments or globals (can alias anything, etc.) This solves some of the problems we were having with giving incorrect results. llvm-svn: 231741	2015-03-10 02:40:06 +00:00
Kostya Serebryany	48a4023f40	[sanitizer] fix instrumentation with -mllvm -sanitizer-coverage-block-threshold=0 to actually do something useful. llvm-svn: 231736	2015-03-10 01:58:27 +00:00
Frederic Riss	0e9a50f5b5	DwarfAccelTable: Fix handling of hash collisions. It turns out accelerator tables where totally broken if they contained entries with colliding hashes. The failure mode is pretty bad, as it not only impacted the colliding entries, but would basically make all the entries after the first hash collision pointing in the wrong place. The testcase uses the symbol names that where found to collide during a clang build. From a performance point of view, the patch adds a sort and a linear walk over each bucket contents. While it has a measurable impact on the accelerator table emission, it's not showing up significantly in clang profiles (and I'd argue that correctness is priceless :-)). llvm-svn: 231732	2015-03-10 00:46:31 +00:00
Colin LeMahieu	fa79110cc7	[Hexagon] Removing unused patterns. llvm-svn: 231723	2015-03-09 23:08:46 +00:00
David Blaikie	8d75794bfb	LLParser: gep: Simplify parsing error handling llvm-svn: 231722	2015-03-09 23:08:44 +00:00
Ahmed Bougacha	c809761dc0	[CodeGen] Replace the reused stores' chain for extractelt expansion. This fixes a subtle issue that was introduced in r205153. When reusing a store for the extractelement expansion (to load directly from it, inserting of going through the stack), later stores to the same location might have overwritten the data we were expecting to extract from. To fix that, we need to explicitly replace the chain going out of the reused store, so that later stores also have an explicit dependency on the generated element-extracting loads, and can't clobber them. rdar://20066785 Differential Revision: http://reviews.llvm.org/D8180 llvm-svn: 231721	2015-03-09 22:51:05 +00:00
Ahmed Bougacha	540469d8a2	[X86] Add nounwind to vector-idiv.ll testcases. NFC. In preparation for a patch where cfi directives get in the way. llvm-svn: 231720	2015-03-09 22:46:02 +00:00
Reid Kleckner	be0a05060f	Reland r229944: EH: Prune unreachable resume instructions during Dwarf EH preparation Fix the double-deletion of AnalysisResolver when delegating through to Dwarf EH preparation by creating one from scratch. Hopefully the new pass manager simplifies this. This reverts commit r229952. llvm-svn: 231719	2015-03-09 22:45:16 +00:00
Rafael Espindola	4f4ef15ade	Use a MapVector instead of an extra sort. This also has the advantage of not depending on the brittle getLabelBeginName. llvm-svn: 231714	2015-03-09 22:08:37 +00:00
Colin LeMahieu	2efa2d01d7	[Hexagon] Reapply r231699. Remove assumption that second operand is an immediate when checking if A2_tfrsi is combinable. llvm-svn: 231710	2015-03-09 21:48:13 +00:00
Sanjoy Das	91b5477aad	[SCEV] Unify getUnsignedRange and getSignedRange Summary: This removes some duplicated code, and also helps optimization: e.g. in the test case added, `%idx ULT 128` in `@x` is not currently optimized to `true` by `-indvars` but will be, after this change. The only functional change in ths commit is that for add recurrences, ScalarEvolution::getRange will be more aggressive -- computing the unsigned (resp. signed) range for a SCEVAddRecExpr will now look at the NSW (resp. NUW) bits and check for signed (resp. unsigned) overflow. This can be a strict improvement in some cases (such as the attached test case), and should be no worse in other cases. Reviewers: atrick, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8142 llvm-svn: 231709	2015-03-09 21:43:43 +00:00
Sanjoy Das	f257452986	[SCEV] Add a `scalar-evolution-print-constant-ranges' option Summary: Unused in this commit, but will be used in a subsequent change (D8142) by a FileCheck test. Reviewers: atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8143 llvm-svn: 231708	2015-03-09 21:43:39 +00:00
Colin LeMahieu	ed853397c6	[Hexagon] Reverting r231699 llvm-svn: 231703	2015-03-09 21:19:02 +00:00
Colin LeMahieu	8c4dfaa13b	[Hexagon] Updating constant set to simpler versions. llvm-svn: 231699	2015-03-09 20:33:12 +00:00
Rafael Espindola	14862d3e37	Don't prime the section map. This was just creating unused labels for .text when the module had no functions. llvm-svn: 231694	2015-03-09 20:09:58 +00:00
Colin LeMahieu	96bfaa9766	[Hexagon] Eliminating immediate condition set. llvm-svn: 231693	2015-03-09 19:57:18 +00:00
Justin Bogner	24ee64ba87	InstrProf: Use the proftext format for these coverage tests This format's easier to understand and update by hand. llvm-svn: 231686	2015-03-09 18:54:58 +00:00
Justin Bogner	f95ca0758c	InstrProf: Allow hexadecimal function hashes in proftext format llvm-svn: 231685	2015-03-09 18:54:49 +00:00
Rafael Espindola	a60017902c	Print jump tables before exception tables. In the case where just tables are part of the function section, this produces more readable assembly by avoiding switching to the eh section and back to .text. This would also break with non unique section names, as trying to switch to a unique section actually creates a new one. llvm-svn: 231677	2015-03-09 18:29:12 +00:00
Reed Kotler	07d3a2f6b2	Add logical ops to Mips fast-isel Summary: Code is mostly copied from AArch64 port and modified where needed for Mips. This handles the "non" legal cases of logical ops. Legal cases are handled by tablegen patterns. Test Plan: Make check test logopm.ll All of test-suite passes at O0/O2 and mips32 r1/r2 with this new change. Reviewers: dsanders Reviewed By: dsanders Subscribers: echristo, llvm-commits, aemerson, rfuhler Differential Revision: http://reviews.llvm.org/D6599 llvm-svn: 231665	2015-03-09 16:28:10 +00:00
Marek Olsak	4d00dd2b93	R600/SI: Limit SGPRs to 80 on Tonga and Iceland This is a candidate for stable. llvm-svn: 231659	2015-03-09 15:48:09 +00:00
Andrea Di Biagio	228d9d4399	Fix line ending in test CodeGen/X86/pr22774.ll. NFC. Also, replaced line with 'target triple' with flag -mtriple on the RUN line. Removed the data layout string as it is not needed. llvm-svn: 231654	2015-03-09 15:02:01 +00:00
Kevin Qin	65b07b8e1b	Revert r231630 - Run LICM pass after loop unrolling pass. As it broke llvm bootstrap. llvm-svn: 231635	2015-03-09 07:26:37 +00:00
Owen Anderson	f8f259df48	Fix a bug in the LLParser where we failed to diagnose landingpads with non-constant clause operands. Fixing this also exposed a related issue where the landingpad under construction was not cleaned up when an error was raised, which would cause bad reference errors before the error could actually be printed. llvm-svn: 231634	2015-03-09 07:13:42 +00:00
Kevin Qin	aef68418de	[AArch64] Enable partial & runtime unrolling on cortex-a57 For inner one of nested loops, it is more likely to be a hot loop, and the runtime check can be promoted out from patch 0001, so the overhead is less, we can try a doubled threshold to unroll more loops. llvm-svn: 231632	2015-03-09 06:14:28 +00:00
Kevin Qin	715b01e979	Introduce runtime unrolling disable matadata and use it to mark the scalar loop from vectorization. Runtime unrolling is an expensive optimization which can bring benefit only if the loop is hot and iteration number is relatively large enough. For some loops, we know they are not worth to be runtime unrolled. The scalar loop from vectorization is one of the cases. llvm-svn: 231631	2015-03-09 06:14:18 +00:00
Kevin Qin	a998735def	Run LICM pass after loop unrolling pass. Runtime unrollng will introduce a runtime check in loop prologue. If the unrolled loop is a inner loop, then the proglogue will be inside the outer loop. LICM pass can help to promote the runtime check out if the checked value is loop invariant. llvm-svn: 231630	2015-03-09 06:14:07 +00:00
Mehdi Amini	eb242a5041	InstCombine: fix fold "fcmp x, undef" to account for NaN Summary: See the two test cases. ; Can fold fcmp with undef on one side by choosing NaN for the undef ; Can fold fcmp with undef on both side ; fcmp u_pred undef, undef -> true ; fcmp o_pred undef, undef -> false ; because whatever you choose for the first undef ; you can choose NaN for the other undef Reviewers: hfinkel, chandlerc, majnemer Reviewed By: majnemer Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D7617 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231626	2015-03-09 03:20:25 +00:00
Owen Anderson	7e621e9d5e	Teach DataLayout to infer a plausible alignment for things even when nothing is specified by the user. llvm-svn: 231613	2015-03-08 21:53:59 +00:00

1 2 3 4 5 ...

29043 Commits