llvm-project

Commit Graph

Author	SHA1	Message	Date
Patrik Hagglund	e98b7a0389	Revert EVT->MVT changes, r169836-169851, due to buildbot failures. llvm-svn: 169854	2012-12-11 11:14:33 +00:00
Patrik Hagglund	cbc9d4d0f9	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 169840	2012-12-11 09:39:09 +00:00
Chandler Carruth	e41e7b7901	Add a new visitor for walking the uses of a pointer value. This visitor provides infrastructure for recursively traversing the use-graph of a pointer-producing instruction like an alloca or a malloc. It maintains a worklist of uses to visit, so it can handle very deep recursions. It automatically looks through instructions which simply translate one pointer to another (bitcasts and GEPs). It tracks the offset relative to the original pointer as long as that offset remains constant and exposes it during the visit as an APInt offset. Finally, it performs conservative escape analysis. However, currently it has some limitations that should be addressed going forward: 1) It doesn't handle vectors of pointers. 2) It doesn't provide a cheaper visitor when the constant offset tracking isn't needed. 3) It doesn't support non-instruction pointer values. The current functionality is exactly what is required to implement the SROA pointer-use visitors in terms of this one, rather than in terms of their own ad-hoc base visitor, which was always very poorly specified. SROA has been converted to use this, and the code there deleted which this utility now provides. Technically speaking, using this new visitor allows SROA to handle a few more cases than it previously did. It is now more aggressive in ignoring chains of instructions which look like they would defeat SROA, but in fact do not because they never result in a read or write of memory. While this is "neat", it shouldn't be interesting for real programs as any such chains should have been removed by others passes long before we get to SROA. As a consequence, I've not added any tests for these features -- it shouldn't be part of SROA's contract to perform such heroics. The goal is to extend the functionality of this visitor going forward, and re-use it from passes like ASan that can benefit from doing a detailed walk of the uses of a pointer. Thanks to Ben Kramer for the code review rounds and lots of help reviewing and debugging this patch. llvm-svn: 169728	2012-12-10 08:28:39 +00:00
Chandler Carruth	e45f4658a3	Fix PR14548: SROA was crashing on a mixture of i1 and i8 loads and stores. When SROA was evaluating a mixture of i1 and i8 loads and stores, in just a particular case, it would tickle a latent bug where we compared bits to bytes rather than bits to bits. As a consequence of the latent bug, we would allow integers through which were not byte-size multiples, a situation the later rewriting code was never intended to handle. In release builds this could trigger all manner of oddities, but the reported issue in PR14548 was forming invalid bitcast instructions. The only downside of this fix is that it makes it more clear that SROA in its current form is not capable of handling mixed i1 and i8 loads and stores. Sometimes with the previous code this would work by luck, but usually it would crash, so I'm not terribly worried. I'll watch the LNT numbers just to be sure. llvm-svn: 169719	2012-12-10 00:54:45 +00:00
Chandler Carruth	93ff2447ec	Switch SROA to pop Uses off the back of its visitors' queues. This will more closely match the behavior of the new PtrUseVisitor that I am adding. Hopefully this will not change the actual behavior in any way, but by making the processing order more similar help in debugging. llvm-svn: 169697	2012-12-09 11:56:01 +00:00
Shuxin Yang	95de7c37e2	- Re-enable population count loop idiom recognization - fix a bug which cause sigfault. - add two testing cases which was causing crash llvm-svn: 169687	2012-12-09 03:12:46 +00:00
Chandler Carruth	91e47532fe	Revert the patches adding a popcount loop idiom recognition pass. There are still bugs in this pass, as well as other issues that are being worked on, but the bugs are crashers that occur pretty easily in the wild. Test cases have been sent to the original commit's review thread. This reverts the commits: r169671: Fix a logic error. r169604: Move the popcnt tests to an X86 subdirectory. r168931: Initial commit adding the pass. llvm-svn: 169683	2012-12-08 22:18:29 +00:00
Shuxin Yang	9c5c97647f	Fix an inadvertent typo error. llvm-svn: 169671	2012-12-08 05:00:59 +00:00
Bill Wendling	e94d843e43	s/AttrListPtr/AttributeSet/g to better label what this class is going to be in the near future. llvm-svn: 169651	2012-12-07 23:16:57 +00:00
Bill Wendling	ab417b644c	Set the 'MadeChange' variable if we are deleting blocks. llvm-svn: 169455	2012-12-06 00:30:20 +00:00
Matt Beaumont-Gay	abfc446063	Add 'using' declarations to suppress -Woverloaded-virtual warnings. llvm-svn: 169214	2012-12-04 05:41:27 +00:00
Nadav Rotem	2349531def	Teach the jump threading optimization to stop scanning the basic block when calculating the cost after passing the threshold. llvm-svn: 169135	2012-12-03 17:34:44 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Chandler Carruth	f02b8bf11b	Remove some buggy and apparantly unnecessary code from SROA. The partitioning logic attempted to handle uses of an alloca with an offset starting before the alloca so long as the use had some overlap with the alloca itself. However, there was a bug where we tested '(uint64_t)Offset >= AllocSize' without first checking whether 'Offset' was positive. As a consequence, essentially every negative offset (that is, starting before the alloca does) would be thrown out, even if it was overlapping. The subsequent code to throw out negative offsets which were actually non-overlapping was essentially dead. The code to handle overlapping negative offsets was actually dead! I've just removed all of this, and taught SROA to discard any uses which start prior to the alloca from the beginning. It has the lovely property of simplifying the code. =] All the tests still pass, and in fact no new tests are needed as this is already covered by our testsuite. Fixing the code so that negative offsets work the way the comments indicate they were supposed to work causes regressions. That's how I found this. Anyways, this is all progress in the correct direction -- tightening up SROA to be maximally aggressive. Some day, I really hope to turn out-of-bounds accesses to an alloca into 'unreachable'. llvm-svn: 169120	2012-12-03 10:59:55 +00:00
Benjamin Kramer	47534c7440	SROA: Avoid struct and array types early to avoid creating an overly large integer type. Fixes PR14465. Differential Revision: http://llvm-reviews.chandlerc.com/D148 llvm-svn: 169084	2012-12-01 11:53:32 +00:00
Bill Wendling	c786b31233	Replace r168930 with a more reasonable patch. The original patch removed a bunch of code that the SjLjEHPrepare pass placed into the entry block if all of the landing pads were removed during the CodeGenPrepare class. The more natural way of doing things is to run the CGP before we run the SjLjEHPrepare pass. Make it so! llvm-svn: 169044	2012-11-30 22:08:55 +00:00
Meador Inge	e3f2b26bfa	Move library call simplification statistic to instcombine The simplify-libcalls pass maintained a statistic to count the number of library calls that have been simplified. Now that library call simplification is being carried out in instcombine the statistic should be moved to there. llvm-svn: 168975	2012-11-30 04:05:06 +00:00
Chandler Carruth	dbd6958183	Move the InstVisitor utility into VMCore where it belongs. It heavily depends on the IR infrastructure, there is no sense in it being off in Support land. This is in preparation to start working to expand InstVisitor into more special-purpose visitors that are still generic and can be re-used across different passes. The expansion will go into the Analylis tree though as nothing in VMCore needs it. llvm-svn: 168972	2012-11-30 03:08:41 +00:00
Shuxin Yang	abcc370423	rdar://12100355 (part 1) This revision attempts to recognize following population-count pattern: while(a) { c++; ... ; a &= a - 1; ... }, where <c> and <a>could be used multiple times in the loop body. TODO: On X8664 and ARM, __buildin_ctpop() are not expanded to a efficent instruction sequence, which need to be improved in the following commits. Reviewed by Nadav, really appreciate! llvm-svn: 168931	2012-11-29 19:38:54 +00:00
Bill Wendling	a4a77edf2e	Handle the situation where CodeGenPrepare removes a reference to a BB that has the last invoke instruction in the function. This also removes the last landing pad in an function. This is fine, but with SjLj EH code, we've already placed a bunch of code in the 'entry' block, which expects the landing pad to stick around. When we get to the situation where CGP has removed the last landing pad, go ahead and nuke the SjLj instructions from the 'entry' block. <rdar://problem/12721258> llvm-svn: 168930	2012-11-29 19:38:06 +00:00
Meador Inge	75798bb7fe	instcombine: Migrate puts optimizations This patch migrates the puts optimizations from the simplify-libcalls pass into the instcombine library call simplifier. All the simplifiers from simplify-libcalls have now been migrated to instcombine. Yay! Just a few other bits to migrate (prototype attribute inference and a few statistics) and simplify-libcalls can finally be put to rest. llvm-svn: 168925	2012-11-29 19:15:17 +00:00
Meador Inge	f8e725081c	instcombine: Migrate fputs optimizations This patch migrates the fputs optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168893	2012-11-29 15:45:43 +00:00
Meador Inge	bc84d1a4f5	instcombine: Migrate fwrite optimizations This patch migrates the fwrite optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168892	2012-11-29 15:45:39 +00:00
Meador Inge	1009cecca0	instcombine: Migrate fprintf optimizations This patch migrates the fprintf optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168891	2012-11-29 15:45:33 +00:00
Bill Wendling	f3614fd8e2	When we delete a dead basic block, see if any of its successors are dead and delete those as well. llvm-svn: 168829	2012-11-28 23:23:48 +00:00
Meador Inge	25c9b3b6e4	instcombine: Migrate sprintf optimizations This patch migrates the sprintf optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168677	2012-11-27 05:57:54 +00:00
Meador Inge	08ca115abd	instcombine: Migrate printf optimizations This patch migrates the printf optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168604	2012-11-26 20:37:20 +00:00
Meador Inge	604937d1cc	instcombine: Migrate toascii optimizations This patch migrates the toascii optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168580	2012-11-26 03:38:52 +00:00
Meador Inge	a62a39e0e9	instcombine: Migrate isascii optimizations This patch migrates the isascii optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168579	2012-11-26 03:10:07 +00:00
Meador Inge	9a59ab6133	instcombine: Migrate isdigit optimizations This patch migrates the isdigit optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168578	2012-11-26 02:31:59 +00:00
Meador Inge	a0b6d87879	instcombine: Migrate abs optimizations This patch migrates the abs optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168574	2012-11-26 00:24:07 +00:00
Meador Inge	7415f8403d	instcombine: Migrate ffs* optimizations This patch migrates the ffs* optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168571	2012-11-25 20:45:27 +00:00
Benjamin Kramer	455fa35e51	CodeGenPrepare: Move ret duplication out of the instruction iteration loop. It can delete the block, and the loop continues on free'd memory. No change in output. Found by valgrind. llvm-svn: 168525	2012-11-23 19:17:06 +00:00
Chandler Carruth	845b73c06f	PR14055: Implement support for sub-vector operations in SROA. Now if we can transform an alloca into a single vector value, but it has subvector, non-element accesses, we form the appropriate shufflevectors to allow SROA to proceed. This fixes PR14055 which pointed out a very common pattern that SROA couldn't handle -- mixed vec3 and vec4 operations on a single alloca. llvm-svn: 168418	2012-11-21 08:16:30 +00:00
Chandler Carruth	b7915f7fbf	Use LLVM_ENABLE_DUMP for the variables used in printing as well as the printing functions themselves. Part of PR14324 (which should have just been a patch to the list, but hey...) llvm-svn: 168362	2012-11-20 10:23:07 +00:00
Chandler Carruth	3e994a26e2	Fix PR14132 and handle OOB loads speculated throuh PHI nodes. The issue is that we may end up with newly OOB loads when speculating a load into the predecessors of a PHI node, and this confuses the new integer splitting logic in some cases, triggering an assertion failure. In fact, the branch in question must be dead code as it loads from a too-narrow alloca. Add code to handle this gracefully and leave the requisite FIXMEs for both optimizing more aggressively and doing more to aid sanitizing invalid code which triggers these patterns. llvm-svn: 168361	2012-11-20 10:02:19 +00:00
Chandler Carruth	9324c96064	Add a comment to associate a FIXME with a PR where it is matters. llvm-svn: 168347	2012-11-20 01:27:48 +00:00
Chandler Carruth	18db795b05	Rework the rewriting of loads and stores for vector and integer allocas to properly handle the combinations of these with split integer loads and stores. This essentially replaces Evan's r168227 by refactoring the code in a different way, and trynig to mirror that refactoring in both the load and store sides of the rewriting. Generally speaking there was some really problematic duplicated code here that led to poorly founded assumptions and then subtle bugs. Now much of the code actually flows through and follows a more consistent style and logical path. There is still a tiny bit of duplication on the store side of things, but it is much less bad. This also changes the logic to never re-use a load or store instruction as that was simply too error prone in practice. I've added a few tests (one a reduction of the one in Evan's original patch, which happened to be the same as the report in PR14349). I'm going to look at adding a few more tests for things I found and fixed in passing (such as the volatile tests in the vectorizable predicate). This patch has survived bootstrap, and modulo one bugfix survived Duncan's test suite, but let me know if anything else explodes. llvm-svn: 168346	2012-11-20 01:12:50 +00:00
Duncan Sands	79cf530d56	Remove the last bit of constant folding from LinearizeExprTree (most of it was removed in commit 168035, but I missed this bit). llvm-svn: 168292	2012-11-18 20:15:36 +00:00
Duncan Sands	20bd7fa0f7	Fix PR14060, an infinite loop in reassociate. The problem was that one of the operands of the expression being written was wrongly thought to be reusable as an inner node of the expression resulting in it turning up as both an inner node and a leaf, creating a cycle in the def-use graph. This would have caused the verifier to blow up if things had gotten that far, however it managed to provoke an infinite loop first. llvm-svn: 168291	2012-11-18 19:27:01 +00:00
Evan Cheng	f1b6177b62	Teach SROA rewriteVectorizedStoreInst to handle cases when the loaded value is narrower than the stored value. rdar://12713675 llvm-svn: 168227	2012-11-17 00:05:06 +00:00
Duncan Sands	ac852c742f	Fix a crash observed by Shuxin Yang. The issue here is that LinearizeExprTree, the utility for extracting a chain of operations from the IR, thought that it might as well combine any constants it came across (rather than just returning them along with everything else). On the other hand, the factorization code would like to see the individual constants (this is quite reasonable: it is much easier to pull a factor of 3 out of 2*3 than it is to pull it out of 6; you may think 6/3 isn't so hard, but due to overflow it's not as easy to undo multiplications of constants as it may at first appear). This patch therefore makes LinearizeExprTree stupider: it now leaves optimizing to the optimization part of reassociate, and sticks to just analysing the IR. llvm-svn: 168035	2012-11-15 09:58:38 +00:00
Meador Inge	193e035b9c	instcombine: Migrate math library call simplifications This patch migrates the math library call simplifications from the simplify-libcalls pass into the instcombine library call simplifier. I have typically migrated just one simplifier at a time, but the math simplifiers are interdependent because: 1. CosOpt, PowOpt, and Exp2Opt all depend on UnaryDoubleFPOpt. 2. CosOpt, PowOpt, Exp2Opt, and UnaryDoubleFPOpt all depend on the option -enable-double-float-shrink. These two factors made migrating each of these simplifiers individually more of a pain than it would be worth. So, I migrated them all together. llvm-svn: 167815	2012-11-13 04:16:17 +00:00
Shuxin Yang	c94c3bb5d0	revert r167740 llvm-svn: 167787	2012-11-13 00:08:49 +00:00
Shuxin Yang	1c442f5ec6	This change is to fix rdar://12571717 which is about assertion in Reassociate pass. The assertion is trigged when the Reassociater tries to transform expression ... + 2 * n * 3 + 2 * m + ... into: ... + 2 * (n3 + m). In the process of the transformation, a helper routine folds the constant 23 into 6, confusing optimizer which is trying the to eliminate the common factor 2, and cannot find 2 any more. Review is pending. But I'd like commit first in order to help those who are waiting for this fix. llvm-svn: 167740	2012-11-12 19:34:11 +00:00
Meador Inge	f963a8ffcc	Delete a stale comment. No functional change. llvm-svn: 167698	2012-11-12 00:28:15 +00:00
Meador Inge	d4825780ed	instcombine: Migrate memset optimizations This patch migrates the memset optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 167689	2012-11-11 06:49:03 +00:00
Meador Inge	9cf328b526	instcombine: Migrate memmove optimizations This patch migrates the memmove optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 167687	2012-11-11 06:22:40 +00:00
Meador Inge	dd9234a10a	instcombine: Migrate memcpy optimizations This patch migrates the memcpy optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 167686	2012-11-11 05:54:34 +00:00
Meador Inge	4d2827c10d	instcombine: Migrate memcmp optimizations This patch migrates the memcmp optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 167683	2012-11-11 05:11:20 +00:00

1 2 3 4 5 ...

5590 Commits