llvm-project

Commit Graph

Author	SHA1	Message	Date
David Majnemer	7679300d93	EarlyCSE: It isn't safe to CSE across synchronization boundaries This fixes PR22514. llvm-svn: 228760	2015-02-10 23:09:43 +00:00
Tim Northover	43c0d2db50	DeadArgElim: arguments affect all returned sub-values by default. Unless we meet an insertvalue on a path from some value to a return, that value will be live if any of the return's components are live, so all of those components must be added to the MaybeLiveUses. Previously we were deleting arguments if sub-value 0 turned out to be dead. llvm-svn: 228731	2015-02-10 19:49:18 +00:00
Chandler Carruth	2496910325	Revert r228556: InstCombine: propagate nonNull through assume This commit isn't using the correct context, and is transfoming calls that are operands to loads rather than calls that are operands to an icmp feeding into an assume. I've replied on the original review thread with a very reduced test case and some thoughts on how to rework this. llvm-svn: 228677	2015-02-10 08:07:32 +00:00
Philip Reames	7e7dc3e9df	Adjust how we avoid poll insertion inside the poll function (NFC) I realized that my early fix for this was overly complicated. Rather than scatter checks around in a bunch of places, just exit early when we visit the poll function itself. Thinking about it a bit, the whole inlining mechanism used with gc.safepoint_poll could probably be cleaned up a bit. Originally, poll insertion was fused with gc relocation rewriting. It might be worth going back to see if we can simplify the chain of events now that these two are seperated. As one thought, maybe it makes sense to rewrite calls inside the helper function before inlining it to the many callers. This would require us to visit the poll function before any other functions though.. llvm-svn: 228634	2015-02-10 00:04:53 +00:00
Adrian Prantl	34e7590e0d	Debug info: When updating debug info during SROA, do not emit debug info for any padding introduced by SROA. In particular, do not emit debug info for an alloca that represents only the padding introduced by a previous iteration. Fixes PR22495. llvm-svn: 228632	2015-02-09 23:57:22 +00:00
Adrian Prantl	27bd01f71c	Debug info: Use DW_OP_bit_piece instead of DW_OP_piece in the intermediate representation. This - increases consistency by using the same granularity everywhere - allows for pieces < 1 byte - DW_OP_piece didn't actually allow storing an offset. Part of PR22495. llvm-svn: 228631	2015-02-09 23:57:15 +00:00
Ramkumar Ramachandra	3edf74fe29	[Statepoint] Improve two asserts, fix some style (NFC) Summary: It's important that our users immediately know what gc.safepoint_poll is. Also fix the style of the declaration of CreateGCStatepoint, in preparation for another change that will wrap it. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7517 llvm-svn: 228626	2015-02-09 23:02:10 +00:00
Ramkumar Ramachandra	2e4b9e0a37	PlaceSafepoints: modernize gc.result.* -> gc.result Differential Revision: http://reviews.llvm.org/D7516 llvm-svn: 228625	2015-02-09 23:00:40 +00:00
Philip Reames	d4a912fefd	Update file comment to clarify points highlighted in review (NFC) llvm-svn: 228621	2015-02-09 22:44:03 +00:00
Philip Reames	a29de87ea4	Use range for loops in PlaceSafepoints (NFC) llvm-svn: 228620	2015-02-09 22:26:11 +00:00
Duncan P. N. Exon Smith	bd75ad4d0c	IR: Take uint64_t in DIBuilder::createExpression() `DIExpression` deals with `uint64_t`, so it doesn't make sense that `createExpression()` is created from `int64_t`. Switch to `uint64_t` to unify them. I've temporarily left in the `int64_t` version, which forwards to the `uint64_t` version. I'll delete it once I've updated the callers. llvm-svn: 228619	2015-02-09 22:13:27 +00:00
Philip Reames	b1ed02f728	Add basic tests for PlaceSafepoints This is just adding really simple tests which should have been part of the original submission. When doing so, I discovered that I'd mistakenly removed required pieces when preparing the patch for upstream submission. I fixed two such bugs in this submission. llvm-svn: 228610	2015-02-09 21:48:05 +00:00
Akira Hatanaka	8d3cb829ce	Fix a bug in DemoteRegToStack where a reload instruction was inserted into the wrong basic block. This would happen when the result of an invoke was used by a phi instruction in the invoke's normal destination block. An instruction to reload the invoke's value would get inserted before the critical edge was split and a new basic block (which is the correct insertion point for the reload) was created. This commit fixes the bug by splitting the critical edge before all the reload instructions are inserted. Also, hoist up the code which computes the insertion point to the only place that need that computation. rdar://problem/15978721 llvm-svn: 228566	2015-02-09 06:38:23 +00:00
Tim Northover	705d2af9e1	DeadArgElim: fix mismatch in accounting of array return types. Some parts of DeadArgElim were only considering the individual fields of StructTypes separately, but others (where insertvalue & extractvalue instructions occur) also looked into ArrayTypes. This one is an actual bug; the mismatch can lead to an argument being considered used by a return sub-value that isn't being tracked (and hence is dead by default). It then gets incorrectly eliminated. llvm-svn: 228559	2015-02-09 01:21:00 +00:00
Tim Northover	854c927de5	DeadArgElim: assess uses of entire return value aggregate. Previously, a non-extractvalue use of an aggregate return value meant the entire return was considered live (the algorithm gave up entirely). This was correct, but conservative. It's better to actually look at that Use, making the analysis results apply to all sub-values under consideration. E.g. %val = call { i32, i32 } @whatever() [...] ret { i32, i32 } %val The return is using the entire aggregate (sub-values 0 and 1). We can still simplify @whatever if we can prove that this return is itself unused. Also unifies the logic slightly between aggregate and non-aggregate cases.. llvm-svn: 228558	2015-02-09 01:20:53 +00:00
Ramkumar Ramachandra	a021ee62ca	InstCombine: propagate nonNull through assume Make assume (load (call\|invoke) != null) set nonNull return attribute for the call and invoke. Also include tests. Differential Revision: http://reviews.llvm.org/D7107 llvm-svn: 228556	2015-02-09 01:13:13 +00:00
Bjorn Steinbrink	5ec7522771	Correctly combine alias.scope metadata by a union instead of intersecting Summary: The alias.scope metadata represents sets of things an instruction might alias with. When generically combining the metadata from two instructions the result must be the union of the original sets, because the new instruction might alias with anything any of the original instructions aliased with. Reviewers: hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7490 llvm-svn: 228525	2015-02-08 17:07:14 +00:00
Benjamin Kramer	f094d77de8	LoopIdiom: Use utility functions. The only difference between deleteIfDeadInstruction and RecursivelyDeleteTriviallyDeadInstructions is that the former also manually invalidates SCEV. That's unnecessary because SCEV automatically gets informed when an instruction is deleted via a ValueHandle. NFC. llvm-svn: 228508	2015-02-07 21:37:08 +00:00
Bjorn Steinbrink	71bf3b800a	Properly update AA metadata when performing call slot optimization Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7482 llvm-svn: 228500	2015-02-07 17:54:36 +00:00
Evgeniy Stepanov	4e12057760	[msan] Fix "missing origin" in atomic store. An atomic store always make the target location fully initialized (in the current implementation). It should not store origin. Initialized memory can't have meaningful origin, and, due to origin granularity (4 bytes) there is a chance that this extra store would overwrite meaningfull origin for an adjacent location. llvm-svn: 228444	2015-02-06 21:47:39 +00:00
Michael Zolotukhin	7af83c1f39	Use estimated number of optimized insns in unroll-threshold computation. If complete-unroll could help us to optimize away N% of instructions, we might want to do this even if the final size would exceed loop-unroll threshold. However, we don't want to unroll huge loop, and we are add AbsoluteThreshold to avoid that - this threshold will never be crossed, even if we expect to optimize 99% instructions after that. llvm-svn: 228434	2015-02-06 20:20:40 +00:00
Michael Zolotukhin	4e8598eee3	[InstSimplify] Add SimplifyFPBinOp function. It is a variation of SimplifyBinOp, but it takes into account FastMathFlags. It is needed in inliner and loop-unroller to accurately predict the transformation's outcome (previously we dropped the flags and were too conservative in some cases). Example: float foo(float a, float b) { float r; if (a[1] b) r = /* a lot of expensive computations /; else r = 1; return r; } float boo(float a) { return foo(a, 0.0); } Without this patch, we don't inline 'foo' into 'boo'. llvm-svn: 228432	2015-02-06 20:02:51 +00:00
Adam Nemet	7206d7a5d2	[LV] Move addRuntimeCheck to LoopAccessAnalysis This will allow it to be shared with the new Loop Distribution pass. getFirstInst is currently duplicated across LoopVectorize.cpp and LoopAccessAnalysis.cpp. This is a short-term work-around until we figure out a better solution. NFC. (The code moved is adjusted a bit for the name of the Loop member and that PtrRtCheck is now a reference rather than a pointer.) llvm-svn: 228418	2015-02-06 18:31:04 +00:00
Benjamin Kramer	970eac40bf	Make helper functions/classes/globals static. NFC. llvm-svn: 228410	2015-02-06 17:51:54 +00:00
Matthias Braun	2e404597f4	InstCombine: Combine select sequences into a single select Normalize select(C0, select(C1, a, b), b) -> select((C0 & C1), a, b) select(C0, a, select(C1, a, b)) -> select((C0 \| C1), a, b) This normal form may enable further combines on the And/Or and shortens paths for the values. Many targets prefer the other but can go back easily in CodeGen. Differential Revision: http://reviews.llvm.org/D7399 llvm-svn: 228409	2015-02-06 17:49:36 +00:00
Benjamin Kramer	39f76acb5c	IRCE: Demote template to ArrayRef and SmallVector to array. NFC. llvm-svn: 228398	2015-02-06 14:43:49 +00:00
Alexey Samsonov	19763c48df	[ASan] Enable -asan-stack-dynamic-alloca by default. By default, store all local variables in dynamic alloca instead of static one. It reduces the stack space usage in use-after-return mode (dynamic alloca will not be called if the local variables are stored in a fake stack), and improves the debug info quality for local variables (they will not be described relatively to %rbp/%rsp, which are assumed to be clobbered by function calls). llvm-svn: 228336	2015-02-05 19:39:20 +00:00
Hans Wennborg	8b4dbdf15d	LowerSwitch: Use ConstantInt for CaseRange::{Low,High} Case values are always ConstantInt. This allows us to remove a bunch of casts. NFC. llvm-svn: 228312	2015-02-05 16:58:10 +00:00
Hans Wennborg	8c82fbcb73	LowerSwitch: remove default args from CaseRange ctor; NFC llvm-svn: 228311	2015-02-05 16:50:27 +00:00
Aaron Ballman	94d4d33a38	Removing an unused variable warning I accidentally introduced with my last warning fix; NFC. llvm-svn: 228295	2015-02-05 13:52:42 +00:00
Aaron Ballman	1b072b340b	Silencing an MSVC warning about a switch statement with no cases; NFC. llvm-svn: 228294	2015-02-05 13:40:04 +00:00
Michael Zolotukhin	a9aadd2903	Implement new heuristic for complete loop unrolling. Complete loop unrolling can make some loads constant, thus enabling a lot of other optimizations. To catch such cases, we look for loads that might become constants and estimate number of instructions that would be simplified or become dead after substitution. Example: Suppose we have: int a[] = {0, 1, 0}; v = 0; for (i = 0; i < 3; i ++) v += b[i]a[i]; If we completely unroll the loop, we would get: v = b[0]a[0] + b[1]a[1] + b[2]a[2] Which then will be simplified to: v = b[0]* 0 + b[1]* 1 + b[2]* 0 And finally: v = b[1] llvm-svn: 228265	2015-02-05 02:34:00 +00:00
Tom Stellard	080209d573	StructurizeCFG: Remove obsolete fix for loop backedge detection This is no longer needed now that we are using a reverse post-order traversal. llvm-svn: 228187	2015-02-04 20:49:47 +00:00
Tom Stellard	071ec90b68	StructurizeCFG: Use a reverse post-order traversal We were previously doing a post-order traversal and operating on the list in reverse, however this would occasionaly cause backedges for loops to be visited before some of the other blocks in the loop. We know use a reverse post-order traversal, which avoids this issue. The reverse post-order traversal is not completely ideal, so we need to manually fixup the list to ensure that inner loop backedges are visited before outer loop backedges. llvm-svn: 228186	2015-02-04 20:49:44 +00:00
Duncan P. N. Exon Smith	920df5c1bb	Utils: Resolve cycles under distinct MDNodes Track unresolved nodes under distinct `MDNode`s during `MapMetadata()`, and resolve them at the end. Previously, these cycles wouldn't get resolved. llvm-svn: 228180	2015-02-04 19:44:34 +00:00
Reid Kleckner	c26a17a822	Add range adapters predecessors() and successors() for BBs Use them in two isolated transforms so we know they work and aren't dead code. llvm-svn: 228173	2015-02-04 19:14:57 +00:00
Alexey Samsonov	b9b8027cee	SpecialCaseList: Add support for parsing multiple input files. Summary: This change allows users to create SpecialCaseList objects from multiple local files. This is needed to implement a proper support for -fsanitize-blacklist flag (allow users to specify multiple blacklists, in addition to default blacklist, see PR22431). DFSan can also benefit from this change, as DFSan instrumentation pass now accepts ABI-lists both from -fsanitize-blacklist= and -mllvm -dfsan-abilist flags. Go bindings are fixed accordingly. Test Plan: regression test suite Reviewers: pcc Subscribers: llvm-commits, axw, kcc Differential Revision: http://reviews.llvm.org/D7367 llvm-svn: 228155	2015-02-04 17:39:48 +00:00
Aaron Ballman	34c325e749	Fixing a -Wsign-compare warning; NFC llvm-svn: 228142	2015-02-04 14:01:08 +00:00
Philip Reames	72634d6af0	Fix a warning in non-asserts builds llvm-svn: 228114	2015-02-04 05:11:20 +00:00
Kostya Serebryany	77cc729ad7	[sanitizer] add another workaround for PR 17409: when over a threshold emit coverage instrumentation as calls. llvm-svn: 228102	2015-02-04 01:21:45 +00:00
Philip Reames	5a9685dba6	Clang format of a file introduced in 228090 (NFC) llvm-svn: 228091	2015-02-04 00:39:57 +00:00
Philip Reames	47cc673e1f	Add a pass for inserting safepoints into (nearly) arbitrary IR This pass is responsible for figuring out where to place call safepoints and safepoint polls. It doesn't actually make the relocations explicit; that's the job of the RewriteStatepointsForGC pass (http://reviews.llvm.org/D6975). Note that this code is not yet finalized. Its moving in tree for incremental development, but further cleanup is needed and will happen over the next few days. It is not yet part of the standard pass order. Planned changes in the near future: - I plan on restructuring the statepoint rewrite to use the functions add to the IRBuilder a while back. - In the current pass, the function "gc.safepoint_poll" is treated specially but is not an intrinsic. I plan to make identifying the poll function a property of the GCStrategy at some point in the near future. - As follow on patches, I will be separating a collection of test cases we have out of tree and submitting them upstream. - It's not explicit in the code, but these two patches are introducing a new state for a statepoint which looks a lot like a patchpoint. There's no a transient form which doesn't yet have the relocations explicitly represented, but does prevent reordering of memory operations. Once this is in, I need to update actually make this explicit by reserving the 'unused' argument of the statepoint as a flag, updating the docs, and making the code explicitly check for such a thing. This wasn't really planned, but once I split the two passes - which was done for other reasons - the intermediate state fell out. Just reminds us once again that we need to merge statepoints and patchpoints at some point in the not that distant future. Future directions planned: - Identifying more cases where a backedge safepoint isn't required to ensure timely execution of a safepoint poll. - Tweaking the insertion process to generate easier to optimize IR. (For example, investigating making SplitBackedge) the default. - Adding opt-in flags for a GCStrategy to use this pass. Once done, add this pass to the actual pass ordering. Differential Revision: http://reviews.llvm.org/D6981 llvm-svn: 228090	2015-02-04 00:37:33 +00:00
Adam Nemet	5add5d9d85	[LV] Split off memcheck block really at the first check I've noticed this while trying to move addRuntimeCheck to LoopAccessAnalysis. I think that the intention was to early exit from the overflow checking before the code for the memchecks. This is the entire reason why we compute FirstCheckInst but then we don't use that as the splitting instruction but the final check. Looks like an oversight. llvm-svn: 228056	2015-02-03 22:45:39 +00:00
Daniel Berlin	487aed0d77	Allow PRE to insert no-cost phi nodes llvm-svn: 228024	2015-02-03 20:37:08 +00:00
Jingyue Wu	d7966ff3b9	Add straight-line strength reduction to LLVM Summary: Straight-line strength reduction (SLSR) is implemented in GCC but not yet in LLVM. It has proven to effectively simplify statements derived from an unrolled loop, and can potentially benefit many other cases too. For example, LLVM unrolls #pragma unroll foo (int i = 0; i < 3; ++i) { sum += foo((b + i) * s); } into sum += foo(b * s); sum += foo((b + 1) * s); sum += foo((b + 2) * s); However, no optimizations yet reduce the internal redundancy of the three expressions: b * s (b + 1) * s (b + 2) * s With SLSR, LLVM can optimize these three expressions into: t1 = b * s t2 = t1 + s t3 = t2 + s This commit is only an initial step towards implementing a series of such optimizations. I will implement more (see TODO in the file commentary) in the near future. This optimization is enabled for the NVPTX backend for now. However, I am more than happy to push it to the standard optimization pipeline after more thorough performance tests. Test Plan: test/StraightLineStrengthReduce/slsr.ll Reviewers: eliben, HaoLiu, meheff, hfinkel, jholewinski, atrick Reviewed By: jholewinski, atrick Subscribers: karthikthecool, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7310 llvm-svn: 228016	2015-02-03 19:37:06 +00:00
Adam Nemet	b60295a525	[LoopVectorize] Fix rebase glitch in r227751 LoopVectorizationLegality::{getNumLoads,getNumStores} should forward to LoopAccessAnalysis now. Thanks to Takumi for noticing this! llvm-svn: 227992	2015-02-03 17:59:53 +00:00
Renato Golin	af213728cc	Adding AArch64 support to ASan instrumentation For the time being, it is still hardcoded to support only the 39 VA bits variant, I plan to work on supporting 42 and 48 VA bits variants, but I don't have access to such hardware at the moment. Patch by Chrystophe Lyon. llvm-svn: 227965	2015-02-03 11:20:45 +00:00
NAKAMURA Takumi	c7f8bfc5e5	Resurrect initializers for NumLoads and NumStores in LoopVectorizationLegality to suppress undefined behavior. FIXME: Shall they be managed in LAA? llvm-svn: 227940	2015-02-03 03:55:06 +00:00
Jingyue Wu	49a766e468	Resurrect the assertion removed by r227717 Summary: MSVC can compile "LoopID->getOperand(0) == LoopID" when LoopID is MDNode*. Test Plan: no regression Reviewers: mkuper Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7327 llvm-svn: 227853	2015-02-02 20:41:11 +00:00
Erik Eckstein	7330e358f6	Fix: SLPVectorizer crashes with assertion when vectorizing a cmp instruction. The commit r225977 uncovered this bug. The problem was that the vectorizer tried to read the second operand of an already deleted instruction. The bug didn't show up before r225977 because the freed memory still contained a non-null pointer. With r225977 deletion of instructions is delayed and the read operand pointer is always null. llvm-svn: 227800	2015-02-02 12:45:34 +00:00

1 2 3 4 5 ...

12479 Commits