llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Liao	5922979e49	Teach DAG combine to fold (buildvec (Xint2fp x)) to (Xint2fp (buildvec x)) - If more than 1 elemennts are defined and target supports the vectorized conversion, use the vectorized one instead to reduce the strength on conversion operation. llvm-svn: 166546	2012-10-24 04:14:18 +00:00
Michael Liao	c5af149e70	Add custom conversion from v2u32 to v2f32 in 32-bit mode - As there's no 64-bit GPRs in 32-bit mode, a custom conversion from v2u32 to v2f32 is added to improve the efficiency of the code generated. llvm-svn: 166545	2012-10-24 04:09:32 +00:00
Akira Hatanaka	868b3a333b	[mips] Make sure sret argument is returned in register V0. llvm-svn: 166539	2012-10-24 02:10:54 +00:00
Rafael Espindola	4e6e537314	Change x86_fastcallcc to require inreg markers. This allows it to known the difference from "int x" (which should go in registers and "struct y {int x;}" (which should not). Clang will be updated in the next patches. llvm-svn: 166536	2012-10-24 01:58:48 +00:00
Jakub Staszak	a6addc2741	Keep coding standard. Don't evaluate getNumOperands() every time. llvm-svn: 166531	2012-10-24 00:38:25 +00:00
Richard Smith	1f6f455f7c	Fix ODR violations: a virtual function must be defined, even if it's never called. Provide an (asserting) definition of Operator's private destructor. Remove destructors from all classes derived from Operator. We don't need them for safety, because their implicit definitions would be ill-formed (they'd call Operator's private destructor), and we don't need them to avoid emitting vtables, because we don't do anything with Operator subclasses which would trigger vtable instantiation. The Operator hierarchy is still a complete disaster with regard to undefined behavior, but this at least allows LLVM to link when using Clang's -fcatch-undefined-behavior with a new vptr-based type checking mechanism. llvm-svn: 166530	2012-10-24 00:30:41 +00:00
Chad Rosier	a623524487	[ms-inline asm] Offset operator - the size should be based on the size of a pointer, not the size of the variable. Part of rdar://12470317 llvm-svn: 166526	2012-10-23 23:42:06 +00:00
Chad Rosier	eac2b2003e	[ms-inline asm] Clean up comment. llvm-svn: 166525	2012-10-23 23:34:28 +00:00
Chad Rosier	146310a1c1	[ms-inline asm] When parsing inline assembly we set the base register to a non-zero value as we don't know the actual value at this point. This is necessary to get the matching correct in some cases. However, the actual value set as the base register doesn't matter, since we're just matching not emitting. llvm-svn: 166523	2012-10-23 23:31:33 +00:00
Michael Liao	6d106b7bfd	Clean up code and put transformation on (build_vec (ext x)) into a helper func llvm-svn: 166519	2012-10-23 23:06:52 +00:00
Michael J. Spencer	8e95113597	[Support/StringSet] Fix memory leak when inserted key already exists. llvm-svn: 166517	2012-10-23 22:55:54 +00:00
Kevin Enderby	dccdac6a06	Make branch heavy code for generating marked up disassembly simpler and easier to read by adding a couple helper functions. Suggestion by Chandler Carruth and seconded by Meador Inge! llvm-svn: 166515	2012-10-23 22:52:52 +00:00
Michael Liao	2843625bb5	Fix PR14161 - Check index being extracted to be constant 0 before simplfiying. Otherwise, retain the original sequence. llvm-svn: 166504	2012-10-23 21:40:15 +00:00
Jordan Rose	c7b09dd1d1	CMake: Include private headers / tablegen files in generated Xcode projects. llvm-svn: 166503	2012-10-23 21:36:55 +00:00
Nadav Rotem	33e034a4b3	Make the indirect branch optimization deterministic. No functionality change. Patch by Daniel Reynaud. llvm-svn: 166501	2012-10-23 21:05:33 +00:00
Matt Beaumont-Gay	bdcebd323a	Silence -Wsign-compare llvm-svn: 166494	2012-10-23 19:46:36 +00:00
Pete Cooper	41ef09a4b5	Change DenseMap to use a power of 2 growth if one is given instead of the next power of 2. This was causing DenseMaps to grow 4x instead of 2x. I'll keep an eye on the buildbots as this could impact performance llvm-svn: 166493	2012-10-23 19:34:36 +00:00
Pete Cooper	8ba353da39	Fixed bug in SmallDenseMap where it wouldn't leave enough space for an empty bucket if the number of values was exactly equal to the small capacity. This led to an infinite loop when finding a non-existent element llvm-svn: 166492	2012-10-23 18:47:35 +00:00
Nadav Rotem	5bed7b4fad	Use the AliasAnalysis isIdentifiedObj because it also understands mallocs and c++ news. PR14158. llvm-svn: 166491	2012-10-23 18:44:18 +00:00
Bill Wendling	5858b56ce3	Ignore unreachable blocks when doing memory dependence analysis on non-local loads. It's not really profitable and may result in GVN going into an infinite loop when it hits constructs like this: %x = gep %some.type %x, ... Found via an LTO build of LLVM. llvm-svn: 166490	2012-10-23 18:37:11 +00:00
Chad Rosier	37e755cee2	[ms-inline asm] Add an implementation of the offset operator. This is a follow on patch to r166433. rdar://12470317 llvm-svn: 166488	2012-10-23 17:43:43 +00:00
Michael Liao	c03c03d56e	Add custom UINT_TO_FP from v4i8/v4i16/v8i8/v8i16 to v4f32/v8f32 - Replace v4i8/v8i8 -> v8f32 DAG combine with custom lowering to reduce DAG combine overhead. - Extend the support to v4i16/v8i16 as well. llvm-svn: 166487	2012-10-23 17:36:08 +00:00
Michael Liao	1be96bb5ce	Enable lowering ZERO_EXTEND/ANY_EXTEND to PMOVZX from SSE4.1 llvm-svn: 166486	2012-10-23 17:34:00 +00:00
Eric Christopher	c33f622c6f	Grammar. llvm-svn: 166485	2012-10-23 17:19:15 +00:00
Lang Hames	dd8574e8fb	Use ilist rather than std::list for Node and Edge lists in the PBQP graph. This should fix an issue (described at http://stackoverflow.com/questions/10065384/instantiation-of-a-list-with-an-incomplete-type-in-a-typedef) that was preventing LLVMCodeGen from building with libc++ in C++11 mode. llvm-svn: 166484	2012-10-23 17:10:51 +00:00
Quentin Colombet	301d05a55c	Test commit access llvm-svn: 166481	2012-10-23 16:03:18 +00:00
Bill Schmidt	57d6de5fd9	This is another TLC patch for separating code for the Darwin and ELF ABIs for the PowerPC target, and factoring the results. This will ease future maintenance of both subtargets. PPCTargetLowering::LowerCall_Darwin_Or_64SVR4() has grown a lot of special-case code for the different ABIs, making maintenance difficult. This is getting worse as we repair errors in the 64-bit ELF ABI implementation, while avoiding changes to the Darwin ABI logic. This patch splits the routine into LowerCall_Darwin() and LowerCall_64SVR4(), allowing both versions to be significantly simplified. I've factored out chunks of similar code where it made sense to do so. I also performed similar factoring on LowerFormalArguments_Darwin() and LowerFormalArguments_64SVR4(). There are no functional changes in this patch, and therefore no new test cases have been developed. Built and tested on powerpc64-unknown-linux-gnu with no new regressions. llvm-svn: 166480	2012-10-23 15:51:16 +00:00
Duncan Sands	5ed3900d77	Fix typo that somehow escaped both testing and code inspection. llvm-svn: 166475	2012-10-23 09:07:02 +00:00
Duncan Sands	533c8ae79f	Transform code like this %V = mul i64 %N, 4 %t = getelementptr i8* bitcast (i32* %arr to i8), i32 %V into %t1 = getelementptr i32 %arr, i32 %N %t = bitcast i32* %t1 to i8* incorporating the multiplication into the getelementptr. This happens all the time in dragonegg, for example for int foo(int A, int N) { return A[N]; } because gcc turns this into byte pointer arithmetic before it hits the plugin: D.1590_2 = (long unsigned int) N_1(D); D.1591_3 = D.1590_2 4; D.1592_5 = A_4(D) + D.1591_3; D.1589_6 = D.1592_5; return D.1589_6; The D.1592_5 line is a POINTER_PLUS_EXPR, which is turned into a getelementptr on a bitcast of A_4 to i8, so this becomes exactly the kind of IR that the transform fires on. An analogous transform (with no testcases!) already existed for bitcasts of arrays, so I rewrote it to share code with this one. llvm-svn: 166474	2012-10-23 08:28:26 +00:00
Richard Smith	6289a4e85e	Per the C++ standard, we need to include the definition of llvm::Calculate in every TU where it's implicitly instantiated, even if there's an implicit instantiation for the same types available in another TU. llvm-svn: 166470	2012-10-23 06:19:46 +00:00
Nadav Rotem	58df27cb2e	Add a comment which explains why the assert fired and how to fix it. llvm-svn: 166467	2012-10-23 04:35:40 +00:00
Reed Kotler	164bb37c7b	implement setXX patterns llvm-svn: 166459	2012-10-23 01:35:48 +00:00
Julien Lerouge	a302b6d95e	Fix typo. llvm-svn: 166456	2012-10-23 00:38:15 +00:00
Julien Lerouge	d7fa5e420d	Explain why DenseMap is still used here instead of MapVector. llvm-svn: 166454	2012-10-23 00:23:46 +00:00
Eli Friedman	0f4871d487	[ms-inline-asm] Implement _emit directive (which is roughly equivalent to .byte). <rdar://problem/12470345>. llvm-svn: 166451	2012-10-22 23:58:19 +00:00
Bill Wendling	12cda50f1f	When a block ends in an indirect branch, add its successors to the machine basic block. The CFG of the machine function needs to know that the targets of the indirect branch are successors to the indirect branch. <rdar://problem/12529625> llvm-svn: 166448	2012-10-22 23:30:04 +00:00
Kevin Enderby	62183c4e18	Add support for annotated disassembly output for X86 and arm. Per the October 12, 2012 Proposal for annotated disassembly output sent out by Jim Grosbach this set of changes implements this for X86 and arm. The llvm-mc tool now has a -mdis option to produced the marked up disassembly and a couple of small example test cases have been added. rdar://11764962 llvm-svn: 166445	2012-10-22 22:31:46 +00:00
Eli Friedman	15e9b33678	[ms-inline asm] Don't rewrite out parts of an inline-asm skipped by .if 0 and friends. It's unnecessary and makes the generated assembly less faithful to the original source. llvm-svn: 166440	2012-10-22 20:50:25 +00:00
Chad Rosier	5bca3f9b8e	[ms-inline asm] Add the isOffsetOf() function. Part of rdar://12470317 llvm-svn: 166436	2012-10-22 19:50:35 +00:00
Julien Lerouge	8cf84fa4e2	Iterating over a DenseMap<std::pair<BasicBlock, unsigned>, PHINode> is not deterministic, replace it with a DenseMap<std::pair<unsigned, unsigned>, PHINode*> (we already have a map from BasicBlock to unsigned). <rdar://problem/12541389> llvm-svn: 166435	2012-10-22 19:43:56 +00:00
Chad Rosier	c14ed95da4	[ms-inline asm] Add support for parsing the offset operator. Callback for CodeGen in the front-end not implemented yet. rdar://12470317 llvm-svn: 166433	2012-10-22 19:42:52 +00:00
Nadav Rotem	1c7fc71e69	Don't crash if the load/store pointer is not a GEP. Fix by Shivarama Rao <Shivarama.Rao@amd.com> llvm-svn: 166427	2012-10-22 18:27:56 +00:00
Nadav Rotem	2f758cf8ba	Add a testcase for the previous commit. llvm-svn: 166425	2012-10-22 18:16:55 +00:00
Argyrios Kyrtzidis	54ff5e81a1	Revert r166407 because it caused analyzer tests to crash and broke self-host bots. llvm-svn: 166424	2012-10-22 18:16:14 +00:00
Hal Finkel	931c52b84c	BBVectorize should ignore unreachable blocks. Unreachable blocks can have invalid instructions. For example, jump threading can produce self-referential instructions in unreachable blocks. Also, we should not be spending time optimizing unreachable code. Fixes PR14133. llvm-svn: 166423	2012-10-22 18:00:55 +00:00
Nadav Rotem	dbf4783634	Add the "ForceSizeOpt" attribute. Patch by Quentin Colombet <qcolombet@apple.com> Original description: """ The attached patch is the first step to have a better control on Oz related optimizations. The Oz optimization level focuses on code size, thus I propose to add an attribute called ForceSizeOpt. """ llvm-svn: 166422	2012-10-22 17:33:31 +00:00
Nadav Rotem	f17cd27362	Rename a variable. llvm-svn: 166410	2012-10-22 04:53:05 +00:00
Nadav Rotem	03011f1393	Vectorizer: optimize the generation of selects. If the condition is uniform, generate a scalar-cond select (i1 as selector). llvm-svn: 166409	2012-10-22 04:38:00 +00:00
Nadav Rotem	c9741887c3	Update the loop vectorizer docs. llvm-svn: 166408	2012-10-22 03:52:53 +00:00
Nick Lewycky	8b67e1e0b9	Reapply r166405, teaching tailcallelim to be smarter about nocapture, with a very small but very important bugfix: bool shouldExplore(Use U) { Value V = U->get(); if (isa<CallInst>(V) \|\| isa<InvokeInst>(V)) [...] should have read: bool shouldExplore(Use U) { Value V = U->getUser(); if (isa<CallInst>(V) \|\| isa<InvokeInst>(V)) Fixes PR14143! llvm-svn: 166407	2012-10-22 03:03:52 +00:00
NAKAMURA Takumi	60d56d2eea	Revert r166405, "Teach TailRecursionElimination to consider 'nocapture' when deciding whether" It broke selfhosting stage2 in several builders. llvm-svn: 166406	2012-10-22 00:48:51 +00:00
Nick Lewycky	2d28f2bf83	Teach TailRecursionElimination to consider 'nocapture' when deciding whether calls can be marked tail. llvm-svn: 166405	2012-10-21 23:51:22 +00:00
Hal Finkel	8884dc323f	DataLayout should use itself when calculating the size of a vector. This is important for vectors of pointers because only DataLayout, not the underlying vector type, knows how to calculate the size of the pointers in the vector. Fixes PR14138. llvm-svn: 166401	2012-10-21 20:38:03 +00:00
Benjamin Kramer	f77f224df9	Revert r166390 "LoopIdiom: Replace custom dependence analysis with LoopDependenceAnalysis." It passes all tests, produces better results than the old code but uses the wrong pass, LoopDependenceAnalysis, which is old and unmaintained. "Why is it still in tree?", you might ask. The answer is obviously: "To confuse developers." Just swapping in the new dependency pass sends the pass manager into an infinte loop, I'll try to figure out why tomorrow. llvm-svn: 166399	2012-10-21 19:31:16 +00:00
Jakob Stoklund Olesen	fd4ced2c52	Don't crash when the Assignments vector is empty. Reported by Vincent Lejeune using an out-of-tree target. llvm-svn: 166398	2012-10-21 19:05:03 +00:00
Anders Carlsson	7d8991c778	Avoid an extra hash lookup when inserting a value into the widen map. llvm-svn: 166395	2012-10-21 16:26:35 +00:00
Jakub Staszak	baa063bd03	Simplify code. No functionality change. llvm-svn: 166393	2012-10-21 15:36:03 +00:00
Jakub Staszak	9694ab8ffa	Simplify code. No functionality change. llvm-svn: 166392	2012-10-21 15:29:19 +00:00
Benjamin Kramer	3ae8bc68af	LoopIdiom: Replace custom dependence analysis with LoopDependenceAnalysis. Requires a lot less code and complexity on loop-idiom's side and the more precise analysis can catch more cases, like the one I included as a test case. This also fixes the edge-case miscompilation from PR9481. I'm not entirely sure that all cases are handled that the old checks handled but LDA will certainly become smarter in the future. llvm-svn: 166390	2012-10-21 15:03:07 +00:00
Nadav Rotem	fe88c67161	Fix a bug in the vectorization of wide load/store operations. We used a SCEV to detect that A[X] is consecutive. We assumed that X was the induction variable. But X can be any expression that uses the induction for example: X = i + 2; llvm-svn: 166388	2012-10-21 06:49:10 +00:00
Nadav Rotem	c1679a95b6	Add support for reduction variables that do not start at zero. This is important for nested-loop reductions such as : In the innermost loop, the induction variable does not start with zero: for (i = 0 .. n) for (j = 0 .. m) sum += ... llvm-svn: 166387	2012-10-21 05:52:51 +00:00
Nadav Rotem	364bd30641	Document change. Describe the pass and some papers that inspired the design of the pass. llvm-svn: 166386	2012-10-21 04:04:25 +00:00
Nadav Rotem	7e1084d36c	Vectorizer: fix a bug in the classification of induction/reduction phis. llvm-svn: 166384	2012-10-21 02:38:01 +00:00
Lang Hames	cdd40bdc05	Allow the commuted form of tied-operand constraints in tablegen ("$dst = $src", rather than "$src = $dst"). llvm-svn: 166382	2012-10-20 22:44:13 +00:00
Nadav Rotem	e5dc57d4fb	Fix an infinite loop in the loop-vectorizer. PR14134. llvm-svn: 166379	2012-10-20 20:45:01 +00:00
Dmitri Gribenko	9fb49d2b30	Document current Doxygen use practices in Coding Standards. Mostly it is obvious stuff and most new code being committed conforms to that. Some old code does not; this might cause confusion and this is the motivation to document the correct guidelines. llvm-svn: 166378	2012-10-20 13:27:43 +00:00
Benjamin Kramer	a74129adad	Symbol hygiene: Make sure declarations and definitions match, make helper functions static. llvm-svn: 166376	2012-10-20 12:53:26 +00:00
Benjamin Kramer	7ddd70527c	SROA: Simplify code. No functionality change. llvm-svn: 166375	2012-10-20 12:04:57 +00:00
Benjamin Kramer	f55b592cc8	InstCombine: Fix an edge case where constant icmps could sneak into ConstantFoldInstOperands and crash. Have to refactor the ConstantFolder interface one day to define bugs like this away. Fixes PR14131. llvm-svn: 166374	2012-10-20 08:43:52 +00:00
Nadav Rotem	d189b82a9b	Vectorize: teach cavVectorizeMemory to distinguish between A[i]+=x and A[B[i]]+=x. If the pointer is consecutive then it is safe to read and write. If the pointer is non-loop-consecutive then it is unsafe to vectorize it because we may hit an ordering issue. llvm-svn: 166371	2012-10-20 08:26:33 +00:00
Nadav Rotem	3940bafb54	Fix a typo llvm-svn: 166367	2012-10-20 05:03:27 +00:00
Nadav Rotem	f70ca3ceed	Vectorizer: refactor the memory checks to a new function. No functionality change. llvm-svn: 166366	2012-10-20 04:59:06 +00:00
Nadav Rotem	895b003f88	Vectorization docs. llvm-svn: 166364	2012-10-20 02:34:34 +00:00
Chad Rosier	3017a06c8f	[ms-inline asm] Rename AsmOpRewrite to just AsmRewrite to be more generic. No functional change intended. llvm-svn: 166360	2012-10-20 01:02:45 +00:00
Chad Rosier	eda70b3451	[ms-inline asm] If the state of the parser is ignore, then don't parse the inline assembly. Also make sure the remove the ignored statements from the IR. llvm-svn: 166357	2012-10-20 00:47:08 +00:00
Nadav Rotem	550f7f7e19	LoopVectorize: Keep the IRBuilder on the stack. llvm-svn: 166354	2012-10-19 23:27:19 +00:00
Chad Rosier	ce09f6b9ef	[ms-inline asm] Continue parsing even when we're in an ignore block. llvm-svn: 166352	2012-10-19 23:15:00 +00:00
Nadav Rotem	4f7f72702b	Vectorizer: Add support for loop reductions. For example: for (i=0; i<n; i++) sum += A[i] + B[i] + i; llvm-svn: 166351	2012-10-19 23:05:40 +00:00
Shuxin Yang	1479fcdef1	1. Remove noreturn attribute from __builtin_debugtrap(). (The change at Clang side was committed in r166345) 2. Cosmetic change in order to conform to coding standards. llvm-svn: 166350	2012-10-19 23:00:20 +00:00
Chad Rosier	f1f6a72901	[ms-inline asm] Reset the opcode prior to parsing a statement. llvm-svn: 166349	2012-10-19 22:57:33 +00:00
Akira Hatanaka	0c7d131a7b	[mips] Use 64-bit registers to return an sret pointer if target ABI is N64. llvm-svn: 166344	2012-10-19 22:11:40 +00:00
Eric Christopher	5d7e15edc5	Grammar-o. llvm-svn: 166343	2012-10-19 22:10:54 +00:00
Akira Hatanaka	90131ac26c	[mips] Add code to do tail call optimization. Currently, it is enabled only if option "enable-mips-tail-calls" is given and all of the callee's arguments are passed in registers. llvm-svn: 166342	2012-10-19 21:47:33 +00:00
Akira Hatanaka	59a32e91f9	[mips] Fix TAILCALL's operand node type. llvm-svn: 166341	2012-10-19 21:30:15 +00:00
Nadav Rotem	4dc976fbcb	revert r166264 because the LTO build is still failing llvm-svn: 166340	2012-10-19 21:28:43 +00:00
Akira Hatanaka	c046243488	[mips] Delete MipsFunctionInfo::MaxCallFrameSize which is no longer used. llvm-svn: 166339	2012-10-19 21:18:38 +00:00
Akira Hatanaka	d03d68a3ba	[mips] Add tail call instructions. llvm-svn: 166338	2012-10-19 21:14:34 +00:00
Akira Hatanaka	0c5e357d87	[mips] Make the branch nodes used in jump instructions a template parameter. llvm-svn: 166337	2012-10-19 21:11:03 +00:00
Akira Hatanaka	91318df0cc	Add node and enum for mips tail call. llvm-svn: 166318	2012-10-19 20:59:39 +00:00
Chad Rosier	0f48c55e70	[ms-inline asm] Have the TargetParser callback to Sema to determine the size of a memory operand. Retain this information and then add the sizing directives to the IR. This allows the backend to do proper instruction selection. llvm-svn: 166316	2012-10-19 20:57:14 +00:00
Michael Liao	14c43496c4	Add 'IntrNoReturn' for longjmp intrinsics llvm-svn: 166314	2012-10-19 20:43:54 +00:00
Benjamin Kramer	317d6c621d	SimplifyLibcalls: The return value of ffsll is always i32, even when the input is zero. Fixes PR13028. llvm-svn: 166313	2012-10-19 20:43:44 +00:00
Micah Villmow	8bbb758f37	Fix a build error for ocaml bindings that was introduced with the TargetData --> DataLayout changes. llvm-svn: 166309	2012-10-19 20:36:22 +00:00
Chad Rosier	7a05864b48	[ms-inline asm] Add a MCAsmParserSemaCallback to the TargetAsmParser. llvm-svn: 166308	2012-10-19 20:35:42 +00:00
Daniel Dunbar	a3d9cabf8f	lit: Rename the valgrind leaks feature to match what is currently used (vg_leak). llvm-svn: 166306	2012-10-19 20:29:30 +00:00
Daniel Dunbar	a3514551a1	lit: Remove support for XTARGET. - The XTARGET feature (inherited from old DG tests) was just confusing (and barely ever used). The same effect can now be achieved with a combination of the more useful REQUIRES and XFAIL. llvm-svn: 166305	2012-10-19 20:29:27 +00:00
Daniel Dunbar	519a349c8a	lit: Add 'valgrind' and 'valgrind-leaks' features when valgrind is used. - These can be used with the XFAIL options. llvm-svn: 166303	2012-10-19 20:12:00 +00:00
Daniel Dunbar	37c4275a92	tests: Stop mangling '-vg' into the triple, we don't use this currently. - Also, lit is going to get a valgrind feature, instead. llvm-svn: 166302	2012-10-19 20:11:56 +00:00
Shuxin Yang	cdde059a34	This patch is to fix radar://8426430. It is about llvm support of __builtin_debugtrap() which is supposed to consistently raise SIGTRAP across all systems. In contrast, __builtin_trap() behave differently on different systems. e.g. it raises SIGTRAP on ARM, and SIGILL on X86. The purpose of __builtin_debugtrap() is to consistently provide "trap" functionality, in the mean time preserve the compatibility with on gcc on __builtin_trap(). The X86 backend is already able to handle debugtrap(). This patch is to: 1) make front-end recognize "__builtin_debugtrap()" (emboddied in the one-line change to Clang). 2) In DAG legalization phase, by default, "debugtrap" will be replaced with "trap", which make the __builtin_debugtrap() "available" to all existing ports without the hassle of changing their code. 3) If trap-function is specified (via -trap-func=xyz to llc), both __builtin_debugtrap() and __builtin_trap() will be expanded into the function call of the specified trap function. This behavior may need change in the future. The provided testing-case is to make sure 2) and 3) are working for ARM port, and we already have a testing case for x86. llvm-svn: 166300	2012-10-19 20:11:16 +00:00
Chad Rosier	75f0b2f2bd	[ms-inline asm] Add the isParsingInlineAsm() function to the MCAsmTargetParser. llvm-svn: 166292	2012-10-19 17:57:49 +00:00
Benjamin Kramer	f1088a37cb	Indvars: Don't recursively delete instruction during BB iteration. This can invalidate the iterators leading to use after frees and crashes. Fixes PR12536. llvm-svn: 166291	2012-10-19 17:53:54 +00:00
Daniel Dunbar	bc4a4565de	Fix some doc-os. llvm-svn: 166290	2012-10-19 17:23:39 +00:00
Daniel Dunbar	315bcbd145	lit: Propagate TERM variable in environment, some tools can do really obscure odd things if this is missing. llvm-svn: 166289	2012-10-19 17:23:34 +00:00
Michael Liao	4b7ccfcaad	Lower BUILD_VECTOR to SHUFFLE + INSERT_VECTOR_ELT for X86 - If INSERT_VECTOR_ELT is supported (above SSE2, either by custom sequence of legal insn), transform BUILD_VECTOR into SHUFFLE + INSERT_VECTOR_ELT if most of elements could be built from SHUFFLE with few (so far 1) elements being inserted. llvm-svn: 166288	2012-10-19 17:15:18 +00:00
Benjamin Kramer	a225ed8d2b	SCEVExpander: Don't crash when trying to merge two constant phis. Just constant fold them so they can't cause any trouble. Fixes PR12627. llvm-svn: 166286	2012-10-19 16:37:30 +00:00
Alexey Samsonov	8418442ff1	[ASan] Support comments in ASan/TSan blacklist file as lines starting with # llvm-svn: 166283	2012-10-19 15:24:46 +00:00
Evgeniy Stepanov	8eb77d847e	Move SplitBlockAndInsertIfThen to BasicBlockUtils. llvm-svn: 166278	2012-10-19 10:48:31 +00:00
Benjamin Kramer	319cb771b2	LoopVectorize: Keep the IRBuilder on the stack. No functionality change. llvm-svn: 166274	2012-10-19 08:42:02 +00:00
Stepan Dyatkovskiy	dab8043048	ARM: Removed extra stack frame object for fixed byval arguments, VarArgsStyleRegisters invocation was reworked due to some improper usage in past. PR14099 also demonstrates it. llvm-svn: 166273	2012-10-19 08:23:06 +00:00
Nick Lewycky	ac612277cb	Pacify -Wnon-virtual-dtor. llvm-svn: 166270	2012-10-19 07:00:09 +00:00
Kostya Serebryany	0995994989	[asan] make sure asan erases old unused allocas after it created a new one. This became important after the recent move from ModulePass to FunctionPass because no cleanup is happening after asan pass any more. llvm-svn: 166267	2012-10-19 06:20:53 +00:00
Nadav Rotem	4985ddc5e0	recommit the patch that makes LSR and LowerInvoke use the TargetTransform interface. llvm-svn: 166264	2012-10-19 04:27:49 +00:00
Michael Liao	2c2358036d	Simplify condition checking as CONCAT assume all inputs of the same type. llvm-svn: 166260	2012-10-19 03:17:00 +00:00
Nadav Rotem	ced93f3a05	vectorizer: Add support for reading and writing from the same memory location. llvm-svn: 166255	2012-10-19 01:24:18 +00:00
Nadav Rotem	5dc203e8f4	Reapply the TargerTransformInfo changes, minus the changes to LSR and Lowerinvoke. llvm-svn: 166248	2012-10-18 23:22:48 +00:00
Nadav Rotem	1667324f22	cleanup the comment. llvm-svn: 166247	2012-10-18 23:21:01 +00:00
Jordan Rose	ed8560b09c	Fix case for include of Compiler.h. llvm-svn: 166243	2012-10-18 22:36:01 +00:00
Jordan Rose	ec43d52336	Add move constructors for OwningPtr and OwningArrayPtr. While LLVM itself is still C++03, there's no reason why tools built on top of it can't use C++11 features. llvm-svn: 166242	2012-10-18 22:22:58 +00:00
Jordan Rose	43805d73c7	Add a T&& constructor to llvm::Optional. This allows llvm::Optional to be used with movable-but-not-copyable types. While LLVM itself is still C++03, there's no reason why tools built on top of it can't use C++11 features. llvm-svn: 166241	2012-10-18 22:22:55 +00:00
Bob Wilson	c7a4a2aa33	Mark bugpoint tests with XFAIL when building with LTO. <rdar://problem/12473675> The LTO Internalize pass is hiding symbols needed by the bugpoint-passes plug-in. We need to add a flag to control whether Internalize should be run. This is a temporary workaround to make these tests pass in the meantime. llvm-svn: 166239	2012-10-18 22:03:31 +00:00
Kevin Enderby	b23926d395	Fix a bug where a 32-bit address with the high bit does not get symbolicated because the value is incorrectly being signed extended when passed to SymbolLookUp(). llvm-svn: 166234	2012-10-18 21:49:18 +00:00
Nadav Rotem	d45a6b93df	fix a naming typo llvm-svn: 166232	2012-10-18 21:45:31 +00:00
Daniel Dunbar	9909415181	test: Add a lit config variable to check if LTO is enabled. llvm-svn: 166225	2012-10-18 20:43:11 +00:00
Daniel Dunbar	f1706edf5d	lit: Allow XFAIL: lines to also refer to "features". llvm-svn: 166224	2012-10-18 20:43:04 +00:00
Chad Rosier	d48d078487	[ms-inline asm] Add a size argument to the LookupInlineAsmIdentifier() callback, which will be used by the asm matcher in the near future. llvm-svn: 166222	2012-10-18 20:27:15 +00:00
Bob Wilson	0f295dec0d	Use an export list when building JIT unittests. <rdar://problem/12473675> When building with LTO, the internalize pass is hiding some global symbols that are necessary for the JIT unittests. It seems like that may be a bug in LTO to do that by default, but until that gets fixed, this change makes sure that we export the necessary symbols for the tests to pass. llvm-svn: 166220	2012-10-18 20:25:36 +00:00
Sebastian Pop	5ea6969422	Use pre-python 2.5 syntax in lit.cfg. Author: Quentin Neill <qneill@codeaurora.org> llvm-svn: 166217	2012-10-18 19:58:28 +00:00
Sebastian Pop	127777d686	Clear unknown mem ops when merging stack slots (pr14090) When merging stack slots, if StackColoring::remapInstructions gets a value back from GetUnderlyingObject that it does not know about or is not itself a stack slot, clear the memory operand in case it aliases the merged slot. This prevents the introduction of incorrect aliasing information. Author: Matthew Curtis <mcurtis@codeaurora.org> llvm-svn: 166216	2012-10-18 19:53:48 +00:00
Sebastian Pop	fdd94d4955	Change MachineFrameInfo::StackObject::Alloca from Value* to AllocaInst* This more accurately reflects what is actually being stored in the field. No functionality change intended. Author: Matthew Curtis <mcurtis@codeaurora.org> llvm-svn: 166215	2012-10-18 19:53:45 +00:00
Chad Rosier	f641baa030	[ms-inline asm] Have the LookupInlineAsmIdentifier() callback function return a *NamedDecl. In turn, build the expressions after we're finished parsing the asm. This avoids a crasher if the lookup fails. llvm-svn: 166212	2012-10-18 19:39:30 +00:00
Bob Wilson	3b8d7bc1e8	Revert "We need this symbol after an LTO build." This reverts commit 165776. The plug-in uses this symbol; it does not define it. It needs to be exported from bugpoint itself, not from the plug-in. llvm-svn: 166207	2012-10-18 18:52:54 +00:00
Nadav Rotem	f8a1396882	Avoid reconstructing the pointer set when searching for duplicated read/write pointers. llvm-svn: 166205	2012-10-18 18:34:50 +00:00
Micah Villmow	c8702fc261	Update the LangRef documentation for the per pointer address space support. llvm-svn: 166201	2012-10-18 18:18:17 +00:00
Meador Inge	2332615f53	Cosmetic change -- move two simplifiers to the right commented statement group. llvm-svn: 166199	2012-10-18 18:12:43 +00:00
Meador Inge	000dbccfc6	instcombine: Migrate strcpy optimizations This patch migrates the strcpy optimizations from the simplify-libcalls pass into the instcombine library call simplifier. Note also that StrCpyChkOpt has been updated with a few simplifications that were being done in the simplify-libcalls version of StrCpyOpt, but not in the migrated implementation of StrCpyOpt. There is no reason to overload StrCpyOpt with fortified and regular simplifications in the new model since there is already a dedicated simplifier for __strcpy_chk. llvm-svn: 166198	2012-10-18 18:12:40 +00:00
Eli Bendersky	71a19ecd41	test commit: verifying access from new address llvm-svn: 166197	2012-10-18 18:12:05 +00:00
Nadav Rotem	d5f8859672	In SimplifySelectOps we pulled two loads through a select node despite the fact that one was dependent on the other. rdar://12513091 llvm-svn: 166196	2012-10-18 18:06:48 +00:00
Nadav Rotem	a031c57417	When looking for a vector representation of a scalar, do a single lookup. Also, cache the result of the broadcast instruction. No functionality change. llvm-svn: 166191	2012-10-18 17:31:49 +00:00
Chad Rosier	8bce664144	[ms-inline asm] Move most of the AsmParsing logic in clang back into the MC layer. Add the ParseMSInlineAsm() function, which is the new interface to clang. Also expose the new MCAsmParserSemaCallback interface, which is used by the back-end to do name lookup in Sema. Finally, remove the now defunct APIs introduced in r165946. llvm-svn: 166183	2012-10-18 15:49:34 +00:00
Ulrich Weigand	d34b5bd610	This patch fixes failures in the SingleSource/Regression/C/uint64_to_float test case on PowerPC caused by rounding errors when converting from a 64-bit integer to a single-precision floating point. The reason for this are double-rounding effects, since on PowerPC we have to convert to an intermediate double-precision value first, which gets rounded to the final single-precision result. The patch fixes the problem by preparing the 64-bit integer so that the first conversion step to double-precision will always be exact, and the final rounding step will result in the correctly-rounded single-precision result. The generated code sequence is equivalent to what GCC would generate. When -enable-unsafe-fp-math is in effect, that extra effort is omitted and we accept possible rounding errors (just like GCC does as well). llvm-svn: 166178	2012-10-18 13:16:11 +00:00
Chandler Carruth	59ff93afe6	Refactor insert and extract of sub-integers into static helpers that operate purely on values. Sink the alloca loading and storing logic into the rewrite routines that are specific to alloca-integer-rewrite driving. This is just a refactoring here, but the subsequent step will be to reuse the insertion and extraction logic when rewriting integer loads and stores that have been split and decomposed into narrower loads and stores. No functionality changed other than different names for instructions. llvm-svn: 166176	2012-10-18 09:56:08 +00:00
Chandler Carruth	e793a50f45	This FIXME was fixed some time ago. =] llvm-svn: 166175	2012-10-18 09:56:06 +00:00
Chandler Carruth	e8479e15f5	Introduce a BarrierNoop pass, a hack designed to allow some control over the implicitly-formed-and-nesting CGSCC pass manager and function pass managers, especially when using them on the opt commandline or using extension points in the module builder. The '-barrier' opt flag (or the pass itself) will create a no-op module pass in the pipeline, resetting the pass manager stack, and allowing the creation of a new pipeline of function passes or CGSCC passes to be created that is independent from any previous pipelines. For example, this can be used to test running two CGSCC passes in independent CGSCC pass managers as opposed to in the same CGSCC pass manager. It also allows us to introduce a further hack into the PassManagerBuilder to separate the O0 pipeline extension passes from the always-inliner's CGSCC pass manager, which they likely do not want to participate in... At the very least none of the Sanitizer passes want this behavior. This fixes a bug with ASan at O0 currently, and I'll commit the ASan test which covers this pass. I'm happy to add a test case that this pass exists and works, but not sure how much time folks would like me to spend adding test cases for the details of its behavior of partition pass managers.... The whole thing is just vile, and mostly intended to unblock ASan, so I'm hoping to rip this all out in a brave new pass manager world. llvm-svn: 166172	2012-10-18 08:05:46 +00:00
Nadav Rotem	7a1728094c	remove unused variable to fix a warning. llvm-svn: 166170	2012-10-18 06:09:21 +00:00
Nadav Rotem	00cadb3ed7	Add a small example which shows a vectorizable loop with a non-pow-of-two count llvm-svn: 166169	2012-10-18 05:46:16 +00:00
Bob Wilson	d6d9ccca38	Temporarily revert the TargetTransform changes. The TargetTransform changes are breaking LTO bootstraps of clang. I am working with Nadav to figure out the problem, but I am reverting it for now to get our buildbots working. This reverts svn commits: 165665 165669 165670 165786 165787 165997 and I have also reverted clang svn 165741 llvm-svn: 166168	2012-10-18 05:43:52 +00:00
Nadav Rotem	642efbcdd8	Remove the use of dominators and AA. llvm-svn: 166167	2012-10-18 05:33:02 +00:00
Nadav Rotem	b52f717411	Vectorizer: Add support for loops with an unknown count. For example: for (i=0; i<n; i++){ a[i] = b[i+1] + c[i+3]; } llvm-svn: 166165	2012-10-18 05:29:12 +00:00
Bill Wendling	4be9013ec1	Revert r166157 because some tests fail... llvm-svn: 166159	2012-10-17 23:56:05 +00:00
Bill Wendling	7c2342467d	Check that the operand of the GEP is not the GEP itself. This occurred during an LTO build of LLVM. llvm-svn: 166157	2012-10-17 23:54:19 +00:00
Michael Liao	3ac8201ea4	Revert part of r166049 back and enable test case in r166125. - Folding (trunc (concat ... X )) to (concat ... (trunc X) ...) is valid when '...' are all 'undef's. - r166125 relies on this transformation. llvm-svn: 166155	2012-10-17 23:45:54 +00:00
NAKAMURA Takumi	7857415785	LoopVectorize.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 166153	2012-10-17 23:40:15 +00:00
Michael Liao	f630f4cadc	Disable extract-concat test case temporarily llvm-svn: 166141	2012-10-17 23:08:19 +00:00
Jakub Staszak	68e5dfddcb	Remove redundant SetInsertPoint call. llvm-svn: 166138	2012-10-17 23:06:37 +00:00
Michael Liao	c87d98dbc8	Revert r166049 - In general, it's unsafe for this transformation. llvm-svn: 166135	2012-10-17 22:41:15 +00:00
Reed Kotler	6743924a32	Add conditional branch instructions and their patterns. llvm-svn: 166134	2012-10-17 22:29:54 +00:00
Roman Divacky	4955ec317c	Fix some typos and wrong indenting. llvm-svn: 166128	2012-10-17 21:07:35 +00:00
Michael Liao	7a442c8031	Teach DAG combine to fold (extract_subvec (concat v1, ..) i) to v_i - If the extracted vector has the same type of all vectored being concatenated together, it should be simplified directly into v_i, where i is the index of the element being extracted. llvm-svn: 166125	2012-10-17 20:48:33 +00:00
Jakob Stoklund Olesen	7a9f0c09de	Switch MRI::UsedPhysRegs to a register unit bit vector. This is a more compact, less redundant representation, and it avoids scanning long lists of aliases for ARM D-registers, for example. llvm-svn: 166124	2012-10-17 20:26:33 +00:00
Nadav Rotem	89a452a9a6	Update the release notes about how to enable the loop vectorizer. llvm-svn: 166123	2012-10-17 19:49:21 +00:00
Evan Cheng	839fb650b2	Add a really faster pre-RA scheduler (-pre-RA-sched=linearize). It doesn't use any scheduling heuristics nor does it build up any scheduling data structure that other heuristics use. It essentially linearize by doing a DFA walk but it does handle glues correctly. IMPORTANT: it probably can't handle all the physical register dependencies so it's not suitable for x86. It also doesn't deal with dbg_value nodes right now so it's definitely is still WIP. rdar://12474515 llvm-svn: 166122	2012-10-17 19:39:36 +00:00
Jakob Stoklund Olesen	0736442683	Merge MRI::isPhysRegOrOverlapUsed() into isPhysRegUsed(). All callers of these functions really want the isPhysRegOrOverlapUsed() functionality which also checks aliases. For historical reasons, targets without register aliases were calling isPhysRegUsed() instead. Change isPhysRegUsed() to also check aliases, and switch all isPhysRegOrOverlapUsed() callers to isPhysRegUsed(). llvm-svn: 166117	2012-10-17 18:44:18 +00:00
Nadav Rotem	ec92817e05	Update the release notes about the store-merge dag optimization. llvm-svn: 166116	2012-10-17 18:35:21 +00:00
Nadav Rotem	d9779f15cf	Update the release notes about the new TargetTransformInfo API changes. llvm-svn: 166115	2012-10-17 18:33:50 +00:00
Nadav Rotem	c260387050	Update the release notes about the new loop vectorizer. llvm-svn: 166113	2012-10-17 18:30:09 +00:00
Nadav Rotem	6b94c2a09b	Add a loop vectorizer. llvm-svn: 166112	2012-10-17 18:25:06 +00:00
Jakob Stoklund Olesen	a10c09804d	Check for empty YMM use-def lists in X86VZeroUpper. The previous MRI.isPhysRegUsed(YMM0) would also return true when the function contains a call to a function that may clobber YMM0. That's most of them. Checking the use-def chains allows us to skip functions that don't explicitly mention YMM registers. llvm-svn: 166110	2012-10-17 17:52:35 +00:00
Anton Korobeynikov	0a69176ce0	Fix fallout from RegInfo => FrameLowering refactoring on MSP430. Patch by Job Noorman! llvm-svn: 166108	2012-10-17 17:37:11 +00:00
Andrew Trick	0b1d8d04b9	misched: Better handling of invalid latencies in the machine model llvm-svn: 166107	2012-10-17 17:27:10 +00:00
Sean Silva	13c64c08c2	docs: Add link to integrated assembler HowTo llvm-svn: 166106	2012-10-17 16:36:27 +00:00
Daniel Dunbar	511479ddb4	Support: Don't remove special files on signals. - Similar to Path::eraseFromDisk(), we don't want LLVM to remove things like /dev/null, even if it has the permission. llvm-svn: 166105	2012-10-17 16:30:54 +00:00
Kostya Serebryany	20343351be	[asan] better debug diagnostics in asan compiler module llvm-svn: 166102	2012-10-17 13:40:06 +00:00
Chandler Carruth	6fab42aa39	This just in, it is a bad idea to use 'udiv' on an offset of a pointer. A very bad idea. Let's not do that. Fixes PR14105. Note that this wasn't that glaring of an oversight. Originally, these routines were only called on offsets within an alloca, which are intrinsically positive. But over the evolution of the pass, they ended up being called for arbitrary offsets, and things went downhill... llvm-svn: 166095	2012-10-17 09:23:48 +00:00
Bill Wendling	003516b592	Marked this variable as 'used' so that LTO doesn't get rid of it. llvm-svn: 166092	2012-10-17 08:08:06 +00:00
Chandler Carruth	40617f593e	Fix a really annoying "bug" introduced in r165941. The change from that revision makes no sense. We cannot use the address space of the post indexed type to conclude anything about a pre indexed pointer type's size. More importantly, this index can never be over a pointer. We are indexing over arrays and vectors here. Of course, I have no test case here. Neither did the original patch. =/ llvm-svn: 166091	2012-10-17 07:22:16 +00:00
Craig Topper	1958f04bda	Remove LLVM_DELETED_FUNCTION from destructors that override non-deleted base class destructors. This isn't legal by the C++11 standard and clang now checks for it. Curiously gcc didn't catch this, possibly because of the template usage. llvm-svn: 166089	2012-10-17 05:15:58 +00:00
Michael Liao	cef9541dac	Check SSSE3 instead of SSE4.1 - All shuffle insns required, especially PSHUB, are added in SSSE3. llvm-svn: 166086	2012-10-17 03:59:18 +00:00
Michael Liao	6f7206132f	Fix setjmp on models with non-Small code model nor non-Static relocation model - MBB address is only valid as an immediate value in Small & Static code/relocation models. On other models, LEA is needed to load IP address of the restore MBB. - A minor fix of MBB in MC lowering is added as well to enable target relocation flag being propagated into MC. llvm-svn: 166084	2012-10-17 02:22:27 +00:00
Jakob Stoklund Olesen	a2136be107	Use a SparseSet instead of a BitVector for UsedInInstr in RAFast. This is just as fast, and it makes it possible to avoid leaking the UsedPhysRegs BitVector implementation through MachineRegisterInfo::addPhysRegsUsed(). llvm-svn: 166083	2012-10-17 01:37:59 +00:00
Eric Christopher	494109b055	Use a typedef to reduce some typing and reformat code accordingly. llvm-svn: 166077	2012-10-16 23:46:25 +00:00
Eric Christopher	02509481f6	Variable name cleanup. llvm-svn: 166076	2012-10-16 23:46:23 +00:00
Eric Christopher	3680f8826e	Formatting and 80-col. llvm-svn: 166075	2012-10-16 23:46:21 +00:00
Eric Christopher	587e153197	Spacing. llvm-svn: 166074	2012-10-16 23:46:19 +00:00
Jakob Stoklund Olesen	4df59a9ff8	Avoid rematerializing a redef immediately after the old def. PR14098 contains an example where we would rematerialize a MOV8ri immediately after the original instruction: %vreg7:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7 %vreg22:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7 Besides being pointless, it is also wrong since the original instruction only redefines part of the register, and the value read by the new instruction is wrong. The problem was the LiveRangeEdit::allUsesAvailableAt() didn't special-case OrigIdx == UseIdx and found the wrong SSA value. llvm-svn: 166068	2012-10-16 22:51:58 +00:00
Jakob Stoklund Olesen	2043329e67	Revert r166046 "Switch back to the old coalescer for now to fix the 32 bit bit" A fix for PR14098, including the test case is in the next commit. llvm-svn: 166067	2012-10-16 22:51:55 +00:00
Michael Gottesman	02a1141e5a	[InstCombine] Teach InstCombine how to handle an obfuscated splat. An obfuscated splat is where the frontend poorly generates code for a splat using several different shuffles to create the splat, i.e., %A = load <4 x float>* %in_ptr, align 16 %B = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef> %C = shufflevector <4 x float> %B, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 4, i32 undef> %D = shufflevector <4 x float> %C, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 2, i32 4> llvm-svn: 166061	2012-10-16 21:29:38 +00:00
Chad Rosier	e4ad2a0b96	[ms-inline asm] Add the helper function, isParseringInlineAsm(). To be used in a future commit. llvm-svn: 166054	2012-10-16 20:16:20 +00:00
Jakub Staszak	8f46e914fb	Simplify code. No functionality change. llvm-svn: 166053	2012-10-16 19:52:32 +00:00
Michael Liao	d6f3168a08	Check .rela instead of ELF64 for the compensation vaue resetting llvm-svn: 166051	2012-10-16 19:49:51 +00:00
Jakub Staszak	25dcab1eaa	80-col fixup. llvm-svn: 166050	2012-10-16 19:39:40 +00:00
Michael Liao	19006206a1	Teach DAG combine to fold (trunc (fptoXi x)) to (fptoXi x) llvm-svn: 166049	2012-10-16 19:38:35 +00:00
Rafael Espindola	b58be2c593	Switch back to the old coalescer for now to fix the 32 bit bit llvm+clang+compiler-rt bootstrap. llvm-svn: 166046	2012-10-16 19:34:06 +00:00
Jakub Staszak	ba34fdb0e4	Simplify potentially quadratic behavior while erasing elements from std::vector. llvm-svn: 166045	2012-10-16 19:32:31 +00:00
Bill Wendling	7b9eb5c64a	And now we can call the other 'get' method from this one and not duplicate the code. llvm-svn: 166037	2012-10-16 18:20:09 +00:00
Michael Liao	02ca34541e	Support v8f32 to v8i8/vi816 conversion through custom lowering - Add custom FP_TO_SINT on v8i16 (and v8i8 which is legalized as v8i16 due to vector element-wise widening) to reduce DAG combiner and its overhead added in X86 backend. llvm-svn: 166036	2012-10-16 18:14:11 +00:00
Bill Wendling	53a6f63c26	Use the appropriate Attributes::get method to create an Attributes object. llvm-svn: 166035	2012-10-16 18:06:06 +00:00
Owen Anderson	544284eb31	Speculative fix the mask constants to be of type uintptr_t. I don't know of any case where the old form was incorrect, but I'm more confident that such cases don't exist in this version. llvm-svn: 166031	2012-10-16 17:10:33 +00:00
Dmitri Gribenko	610a86e6bf	Fix function parameter spelling in comments. Caught by -Wdocumentation. llvm-svn: 166024	2012-10-16 15:37:50 +00:00
Bill Schmidt	48081cad0d	This patch addresses PR13949. For the PowerPC 64-bit ELF Linux ABI, aggregates of size less than 8 bytes are to be passed in the low-order bits ("right-adjusted") of the doubleword register or memory slot assigned to them. A previous patch addressed this for aggregates passed in registers. However, small aggregates passed in the overflow portion of the parameter save area are still being passed left-adjusted. The fix is made in PPCTargetLowering::LowerCall_Darwin_Or_64SVR4 on the caller side, and in PPCTargetLowering::LowerFormalArguments_64SVR4 on the callee side. The main fix on the callee side simply extends existing logic for 1- and 2-byte objects to 1- through 7-byte objects, and correcting a constant left over from 32-bit code. There is also a fix to a bogus calculation of the offset to the following argument in the parameter save area. On the caller side, again a constant left over from 32-bit code is fixed. Additionally, some code for 1, 2, and 4-byte objects is duplicated to handle the 3, 5, 6, and 7-byte objects for SVR4 only. The LowerCall_Darwin_Or_64SVR4 logic is getting fairly convoluted trying to handle both ABIs, and I propose to separate this into two functions in a future patch, at which time the duplication can be removed. The patch adds a new test (structsinmem.ll) to demonstrate correct passing of structures of all seven sizes. Eight dummy parameters are used to force these structures to be in the overflow portion of the parameter save area. As a side effect, this corrects the case when aggregates passed in registers are saved into the first eight doublewords of the parameter save area: Previously they were stored left-justified, and now are properly stored right-justified. This requires changing the expected output of existing test case structsinregs.ll. llvm-svn: 166022	2012-10-16 13:30:53 +00:00
Stepan Dyatkovskiy	e59a920b0c	Issue: Stack is formed improperly for long structures passed as byval arguments for EABI mode. If we took AAPCS reference, we can found the next statements: A: "If the argument requires double-word alignment (8-byte), the NCRN (Next Core Register Number) is rounded up to the next even register number." (5.5 Parameter Passing, Stage C, C.3). B: "The alignment of an aggregate shall be the alignment of its most-aligned component." (4.3 Composite Types, 4.3.1 Aggregates). So if we have structure with doubles (9 double fields) and 3 Core unused registers (r1, r2, r3): caller should use r2 and r3 registers only. Currently r1,r2,r3 set is used, but it is invalid. Callee VA routine should also use r2 and r3 regs only. All is ok here. This behaviour is guessed by rounding up SP address with ADD+BFC operations. Fix: Main fix is in ARMTargetLowering::HandleByVal. If we detected AAPCS mode and 8 byte alignment, we waste odd registers then. P.S.: I also improved LDRB_POST_IMM regression test. Since ldrb instruction will not generated by current regression test after this patch. llvm-svn: 166018	2012-10-16 07:16:47 +00:00

... 2 3 4 5 6 ...

86085 Commits