llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	58d59fe394	Fix a crash in scalarrepl for memcpy/memmove where the source and destination are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848	2010-01-19 04:32:48 +00:00
Dan Gohman	fb4193625a	Delete useless trailing semicolons. llvm-svn: 92740	2010-01-05 17:55:26 +00:00
Bob Wilson	62a84ea8e3	Generalize SROA to allow the first index of a GEP to be non-zero. Add a missing check that an array reference doesn't go past the end of the array, and remove some redundant checks for in-bound array and vector references that are no longer needed. llvm-svn: 91897	2009-12-22 06:57:14 +00:00
Bob Wilson	532cd232fb	Reapply 91459 with a simple fix for the problem that broke the x86_64-darwin bootstrap. This also replaces the WeakVH references that Chris objected to with normal Value references. llvm-svn: 91711	2009-12-18 20:14:40 +00:00
Bob Wilson	f3927b7994	Re-revert 91459. It's breaking the x86_64 darwin bootstrap. llvm-svn: 91607	2009-12-17 18:34:24 +00:00
Daniel Dunbar	ab42d42390	Reapply r91459, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91559	2009-12-16 20:09:53 +00:00
Daniel Dunbar	133efc317e	Revert "Reapply 91184 with fixes and an addition to the testcase to cover the problem", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. This reverts commit db22309800b224a9f5f51baf76071d7a93ce59c9. llvm-svn: 91534	2009-12-16 10:56:17 +00:00
Bob Wilson	e44756d7c2	Reapply 91184 with fixes and an addition to the testcase to cover the problem found last time. Instead of trying to modify the IR while iterating over it, I've change it to keep a list of WeakVH references to dead instructions, and then delete those instructions later. I also added some special case code to detect and handle the situation when both operands of a memcpy intrinsic are referencing the same alloca. llvm-svn: 91459	2009-12-15 22:00:51 +00:00
Shantonu Sen	0c20054cc4	Remove empty file completely llvm-svn: 91277	2009-12-14 14:15:15 +00:00
Chris Lattner	aaa6ac10a6	revert r91184, because it causes a crash on a .bc file I just sent to Bob. llvm-svn: 91268	2009-12-14 05:11:02 +00:00
Bob Wilson	895f364ae6	Revise scalar replacement to be more flexible about handle bitcasts and GEPs. While scanning through the uses of an alloca, keep track of the current offset relative to the start of the alloca, and check memory references to see if the offset & size correspond to a component within the alloca. This has the nice benefit of unifying much of the code from isSafeUseOfAllocation, isSafeElementUse, and isSafeUseOfBitCastedAllocation. The code to rewrite the uses of a promoted alloca, after it is determined to be safe, is reorganized in the same way. Also, when rewriting GEP instructions, mark them as "in-bounds" since all the indices are known to be safe. llvm-svn: 91184	2009-12-11 23:47:40 +00:00
Chris Lattner	2226db66ab	fix PR5436 by making the 'simple' case of SRoA not promote out of range array indexes. The "complex" case of SRoA still handles them, and correctly. This fixes a weirdness where we'd correctly avoid transforming A[0][42] if the 42 was too large, but we'd only do it if it was one gep, not two separate ones. llvm-svn: 90007	2009-11-27 16:37:41 +00:00
Chris Lattner	92ba18e9e4	filecheckize llvm-svn: 90006	2009-11-27 16:31:59 +00:00
Kenneth Uildriks	90fedc6ef9	Make opt default to not adding a target data string and update tests that depend on target data to supply it within the test llvm-svn: 85900	2009-11-03 15:29:06 +00:00
Dan Gohman	1880092722	Change tests from "opt %s" to "opt < %s" so that opt doesn't see the input filename so that opt doesn't print the input filename in the output so that grep lines in the tests don't unintentionally match strings in the input filename. llvm-svn: 81537	2009-09-11 18:01:28 +00:00
Dan Gohman	72a13d2476	Use opt -S instead of piping bitcode output through llvm-dis. llvm-svn: 81257	2009-09-08 22:34:10 +00:00
Dan Gohman	9737a63ed8	Change these tests to feed the assembly files to opt directly, instead of using llvm-as, now that opt supports this. llvm-svn: 81226	2009-09-08 16:50:01 +00:00
Nick Lewycky	aa464002f0	Don't crash trying to promote VLAs. llvm-svn: 79226	2009-08-17 05:37:31 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Eli Friedman	ee94e3cc9e	PR4286: Make RewriteLoadUserOfWholeAlloca and RewriteStoreUserOfWholeAlloca deal with tail padding because isSafeUseOfBitCastedAllocation expects them to. Otherwise, we crash trying to erase the bitcast. llvm-svn: 72688	2009-06-01 09:14:32 +00:00
Chris Lattner	c48091f141	fix RewriteStoreUserOfWholeAlloca to use the correct type size method, fixing a crash on PR4146. While the store will ultimately overwrite the "padded size" number of bits in memory, the stored value may be a subset of this size. This function only wants to handle the case where all bits are stored. llvm-svn: 71224	2009-05-08 15:54:41 +00:00
Chris Lattner	69223bb7f5	fix a crash on a pointless but valid zero-length memset, rdar://6808691 llvm-svn: 69680	2009-04-21 16:52:12 +00:00
Zhou Sheng	64a6a092b1	Fix a bug. If I->use_empty(), this method should return false. llvm-svn: 67180	2009-03-18 07:56:13 +00:00
Chris Lattner	21a84f3054	teach SROA to handle promoting vector allocas with a memset into them into a vector type instead of into an integer type. llvm-svn: 66368	2009-03-08 04:17:04 +00:00
Chris Lattner	c009757761	Enhance SROA to "promote to scalar" allocas which are memcpy/memmove'd into or out of. This fixes a serious perf issue that Nate ran into. llvm-svn: 66366	2009-03-08 04:04:21 +00:00
Devang Patel	25b625165f	While converting an aggregate to scalare, ignore and remove aggregate's debug info. llvm-svn: 66262	2009-03-06 07:03:54 +00:00
Chris Lattner	5c204c92a4	Fix PR3720 by properly propagating alignment information from memcpy/memmove onto element accesses. llvm-svn: 66053	2009-03-04 19:20:50 +00:00
Chris Lattner	3c4f6be2b4	adjust for asmprinter change. llvm-svn: 65741	2009-03-01 00:26:51 +00:00
Devang Patel	caf4485781	Enable scalar replacement of AllocaInst whose one of the user is dbg info. llvm-svn: 64207	2009-02-10 07:00:59 +00:00
Chris Lattner	bbbb74372b	fix PR3489, use bits instead of bytes. llvm-svn: 63916	2009-02-06 04:34:07 +00:00
Chris Lattner	ef37dc8511	teach "convert from scalar" to handle loads of fca's. llvm-svn: 63659	2009-02-03 21:08:45 +00:00
Chris Lattner	18f56c295c	make scalar conversion handle stores of first class aggregate values. loads are not yet handled (coming soon to an sroa near you). llvm-svn: 63649	2009-02-03 19:30:11 +00:00
Chris Lattner	73eff2e6e8	Make SROA produce a vector only when the alloca is actually accessed at least once as a vector. This prevents it from compiling the example in not-a-vector into: define double @test(double %A, double %B) { %tmp4 = insertelement <7 x double> undef, double %A, i32 0 %tmp = insertelement <7 x double> %tmp4, double %B, i32 4 %tmp2 = extractelement <7 x double> %tmp, i32 4 ret double %tmp2 } instead, producing the integer code. Producing vectors when they aren't otherwise in the program is dangerous because a lot of other code treats them carefully and doesn't want to break them down. OTOH, many things want to break down tasty i448's. llvm-svn: 63638	2009-02-03 18:15:05 +00:00
Chris Lattner	8fc6561993	this produces an undefined result, just check that the alloca is gone and that sroa doesn't crash. llvm-svn: 63637	2009-02-03 18:13:00 +00:00
Chris Lattner	80810b4c2d	add another case of undefined behavior without crashing, PR3466. llvm-svn: 63620	2009-02-03 07:08:57 +00:00
Chris Lattner	6aa6b1f263	Teach ConvertUsesToScalar to handle memset, allowing it to handle crazy cases like: struct f { int A, B, C, D, E, F; }; short test4() { struct f A; A.A = 1; memset(&A.B, 2, 12); return A.C; } llvm-svn: 63596	2009-02-03 02:01:43 +00:00
Chris Lattner	09b65ab288	rearrange how SRoA handles promotion of allocas to vectors. With the new world order, it can handle cases where the first store into the alloca is an element of the vector, instead of requiring the first analyzed store to have the vector type itself. This allows us to un-xfail test/CodeGen/X86/vec_ins_extract.ll. llvm-svn: 63590	2009-02-03 01:30:09 +00:00
Chris Lattner	a0ce5f060d	this test produces an undefined value, we don't care what it is, but we do want the alloca promoted. llvm-svn: 63587	2009-02-03 01:13:52 +00:00
Chris Lattner	64217e6a28	update test llvm-svn: 63532	2009-02-02 18:12:58 +00:00
Chris Lattner	18eba4f211	Fix a bug which caused us to miscompile a couple of Ada tests. Thanks for the beautiful reduced testcase Duncan! llvm-svn: 63529	2009-02-02 18:02:59 +00:00
Chris Lattner	ec99c46d44	Simplify and generalize the SROA "convert to scalar" transformation to be able to handle ANY alloca that is poked by loads and stores of bitcasts and GEPs with constant offsets. Before the code had a number of annoying limitations and caused it to miss cases such as storing into holes in structs and complex casts (as in bitfield-sroa) where we had unions of bitfields etc. This also handles a number of important cases that are exposed due to the ABI lowering stuff we do to pass stuff by value. One case that is pretty great is that we compile 2006-11-07-InvalidArrayPromote.ll into: define i32 @func(<4 x float> %v0, <4 x float> %v1) nounwind { %tmp10 = call <4 x i32> @llvm.x86.sse2.cvttps2dq(<4 x float> %v1) %tmp105 = bitcast <4 x i32> %tmp10 to i128 %tmp1056 = zext i128 %tmp105 to i256 %tmp.upgrd.43 = lshr i256 %tmp1056, 96 %tmp.upgrd.44 = trunc i256 %tmp.upgrd.43 to i32 ret i32 %tmp.upgrd.44 } which turns into: _func: subl $28, %esp cvttps2dq %xmm1, %xmm0 movaps %xmm0, (%esp) movl 12(%esp), %eax addl $28, %esp ret Which is pretty good code all things considering :). One effect of this is that SROA will start generating arbitrary bitwidth integers that are a multiple of 8 bits. In the case above, we got a 256 bit integer, but the codegen guys assure me that it can handle the simple and/or/shift/zext stuff that we're doing on these operations. This addresses rdar://6532315 llvm-svn: 63469	2009-01-31 02:28:54 +00:00
Chris Lattner	df17987c19	Fix some issues with volatility, move "CanConvertToScalar" check after the others. llvm-svn: 63227	2009-01-28 20:16:43 +00:00
Chris Lattner	1498e62117	strengthen this test. llvm-svn: 63222	2009-01-28 19:29:30 +00:00
Chris Lattner	ae0e857b98	Fix PR3304 llvm-svn: 61995	2009-01-09 18:18:43 +00:00
Chris Lattner	c518dfd11b	This implements the second half of the fix for PR3290, handling loads from allocas that cover the entire aggregate. This handles some memcpy/byval cases that are produced by llvm-gcc. This triggers a few times in kc++ (with std::pair<std::_Rb_tree_const_iterator <kc::impl_abstract_phylum*>,bool>) and once in 176.gcc (with %struct..0anon). llvm-svn: 61915	2009-01-08 05:42:05 +00:00
Chris Lattner	f2b8c82ad1	Implement the first half of PR3290: if there is a store of an integer to a (transitive) bitcast the alloca and if that integer has the full size of the alloca, then it clobbers the whole thing. Handle this by extracting pieces out of the stored integer and filing them away in the SROA'd elements. This triggers fairly frequently because the CFE uses integers to pass small structs by value and the inliner exposes these. For example, in kimwitu++, I see a bunch of these with i64 stores to "%struct.std::pair<std::_Rb_tree_const_iterator<kc::impl_abstract_phylum*>,bool>" In 176.gcc I see a few i32 stores to "%struct..0anon". In the testcase, this is a difference between compiling test1 to: _test1: subl $12, %esp movl 20(%esp), %eax movl %eax, 4(%esp) movl 16(%esp), %eax movl %eax, (%esp) movl (%esp), %eax addl 4(%esp), %eax addl $12, %esp ret vs: _test1: movl 8(%esp), %eax addl 4(%esp), %eax ret The second half of this will be to handle loads of the same form. llvm-svn: 61853	2009-01-07 08:11:13 +00:00
Matthijs Kooijman	cbe5e16eb5	Allow scalarrepl to treat an all-zero GEP just as bitcast. This includes not marking a GEP involving a vector as unsafe, but only when it has all zero indices. This allows scalarrepl to work in a few more cases. llvm-svn: 57177	2008-10-06 16:23:31 +00:00
Matthijs Kooijman	be4fc9b9fb	Add a testcase showing that scalarrepl supports first class structs. I originally made this script to show that scalarrepl didn't support them, but it turned out it does. Better to still add the testcase then. llvm-svn: 56781	2008-09-29 10:42:13 +00:00
Chris Lattner	3f972c9150	Fix PR2423 by checking all indices for out of range access, not only indices that start with an array subscript. x->field[10000] is just as bad as (*X)[14][10000]. llvm-svn: 55226	2008-08-23 05:21:06 +00:00
Chris Lattner	6ff85681e4	Fix PR2369 by making scalarrepl more careful about promoting structures. Its default threshold is to promote things that are smaller than 128 bytes, which is sane. However, it is not sane to do this for things that turn into 128 registers. Add a cap on the number of registers introduced, defaulting to 128/4=32. llvm-svn: 52611	2008-06-22 17:46:21 +00:00
Evan Cheng	2d788ce3fb	Fix some tests. llvm-svn: 52245	2008-06-12 21:23:38 +00:00
Matthijs Kooijman	a2f743eaff	Fix some escaping and quoting in RUN lines, mainly involving { and <. In two cases quoting of <{ didn't work out, so I changed the grep to check for }> instead. This fixes 7 testcases that were not properly running before. llvm-svn: 52182	2008-06-10 16:04:47 +00:00
Matthijs Kooijman	812989b147	Learn ScalarReplAggregrates how stores and loads of first class aggregrates work and how to replace them into individual values. Also, when trying to replace an aggregrate that is used by load or store with a single (large) integer, don't crash (but don't replace the aggregrate either). Also adds a testcase for both structs and arrays. llvm-svn: 51997	2008-06-05 12:51:53 +00:00
Gabor Greif	1e427c3264	sabre brings to my attention that the 'tr' suffix is also obsolete llvm-svn: 51349	2008-05-20 21:00:03 +00:00
Gabor Greif	f45ff35bfe	Rename the last test with .llx extension to .ll, resolve duplicate test by renaming to isnan2. Now that no test has llx ending there is no need to search for them from dg.exp too. llvm-svn: 51328	2008-05-20 19:52:04 +00:00
Tanya Lattner	4e59897d3d	Upgrade tests to not use llvm-upgrade. llvm-svn: 48484	2008-03-18 04:14:37 +00:00
Chris Lattner	c966cebe93	fix a bug Anders ran into where scalarrepl would crash when promoting a union containing a vector and an array whose elements were smaller than the vector elements. this means we need to compile the load of the array elements into an extract element plus a truncate. llvm-svn: 47752	2008-02-29 07:12:06 +00:00
Chris Lattner	b9e5b8fb9e	Fix a bug where scalarrepl would discard offset if type would match. In practice this can only happen on code with already undefined behavior, but this is still a good thing to handle correctly. llvm-svn: 46539	2008-01-30 00:39:15 +00:00
Duncan Sands	399d97987b	Change uses of getTypeSize to getABITypeSize, getTypeStoreSize or getTypeSizeInBits as appropriate in ScalarReplAggregates. The right change to make was not always obvious, so it would be good to have an sroa guru review this. While there I noticed some bugs, and fixed them: (1) arrays of x86 long double have holes due to alignment padding, but this wasn't being spotted by HasStructPadding (renamed to HasPadding). The same goes for arrays of oddly sized ints. Vectors also suffer from this, in fact the problem for vectors is much worse because basic vector assumptions seem to be broken by vectors of type with alignment padding. I didn't try to fix any of these vector problems. (2) The code for extracting smaller integers from larger ones (in the "int union" case) was wrong on big-endian machines for integers with size not a multiple of 8, like i1. Probably this is impossible to hit via llvm-gcc, but I fixed it anyway while there and added a testcase. I also got rid of some trailing whitespace and changed a function name which had an obvious typo in it. llvm-svn: 43672	2007-11-04 14:43:57 +00:00
John Criswell	2660cef6d7	Convert .cvsignore files llvm-svn: 37801	2007-06-29 16:35:07 +00:00
Chris Lattner	ed24b3b2fa	Testcase for PR1421 llvm-svn: 37357	2007-05-30 06:10:46 +00:00
Chris Lattner	49a34fcca7	testcase for PR1446 llvm-svn: 37325	2007-05-24 18:42:47 +00:00
Chris Lattner	4dc9a76ad8	Move Mem2Reg/DifferingTypes.ll -> ScalarRepl/DifferingTypes.ll. -scalarrepl implements this xform. llvm-svn: 36804	2007-05-05 22:22:03 +00:00
Chris Lattner	7ebda6ba37	new testcase, should be able to eliminate the alloca and memcpy llvm-svn: 36428	2007-04-25 06:29:34 +00:00
Reid Spencer	6e87ec4351	For PR1319: Remove && from the end of the lines to prevent tests from throwing run lines into the background. Also, clean up places where the same command is run multiple times by using a temporary file. llvm-svn: 36142	2007-04-16 17:36:08 +00:00
Reid Spencer	a551c041f9	For PR1319: Upgrade to use new Tcl exec based test harness. This exposes 3 bugs that were previously not being reported: test/Transforms/GlobalDCE/2002-08-17-FunctionDGE.ll test/Transforms/GlobalOpt/memset.ll test/Transforms/IndVarsSimplify/exit_value_tests.llx llvm-svn: 36065	2007-04-15 09:21:47 +00:00
Reid Spencer	d029c7e666	Make the llvm-runtest function much more amenable by eliminating all the global variables that needed to be passed in. This makes it possible to add new global variables with only a couple changes (Makefile and llvm-dg.exp) instead of touching every single dg.exp file. llvm-svn: 35918	2007-04-11 19:56:59 +00:00
Reid Spencer	44259a29c0	Remove use of implementation keyword. llvm-svn: 35412	2007-03-28 02:38:26 +00:00
Chris Lattner	4776ebc195	new testcase llvm-svn: 35397	2007-03-28 01:31:33 +00:00
Chris Lattner	50fce05a21	add a testcase the resent patches fail on. llvm-svn: 35167	2007-03-19 18:25:48 +00:00
Chris Lattner	23dd31a3af	add PR# llvm-svn: 35151	2007-03-19 00:17:19 +00:00
Chris Lattner	dcd44dbbb0	add pr# llvm-svn: 35149	2007-03-19 00:15:43 +00:00
Chris Lattner	ee3c5d1b78	new testcase llvm-svn: 35148	2007-03-19 00:11:30 +00:00
Chris Lattner	2c0f36bc39	testcase for SROA with memset etc llvm-svn: 35147	2007-03-19 00:09:00 +00:00
Reid Spencer	83b3d82672	Regression is gone, don't try to find it on clean target. llvm-svn: 33296	2007-01-17 07:59:14 +00:00

1 2 3

125 Commits