llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	93af332819	pass/return structs of char and short as i8/i16 to avoid aweful through-memory coersion, just like we do for i32 now. llvm-svn: 107078	2010-06-28 21:59:07 +00:00
Chris Lattner	d776fb150e	more tidying up. llvm-svn: 107076	2010-06-28 21:43:59 +00:00
Ted Kremenek	e60d304d13	Remove state assertion. llvm-svn: 107064	2010-06-28 20:44:49 +00:00
Ted Kremenek	ea78375ed4	Don't crash in InitializePreprocessor() when there is no valid PTHManager. Fixes <rdar://problem/8098441>. llvm-svn: 107061	2010-06-28 20:32:40 +00:00
Chris Lattner	0cf2419cd7	random acts of tidying. llvm-svn: 107050	2010-06-28 20:05:43 +00:00
Chris Lattner	a7d81ab7f3	X86-64: pass/return structs of float/int as float/i32 instead of double/i64 to make the code generated for ABI cleaner. Passing in the low part of a double is the same as passing in a float. For example, we now compile: struct DeclGroup { float NumDecls; }; float foo(DeclGroup D); void bar(DeclGroup D) { foo(D); } into: %struct.DeclGroup = type { float } define void @_Z3barP9DeclGroup(%struct.DeclGroup* %D) nounwind { entry: %D.addr = alloca %struct.DeclGroup, align 8 ; <%struct.DeclGroup> [#uses=2] %agg.tmp = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] store %struct.DeclGroup* %D, %struct.DeclGroup %D.addr %tmp = load %struct.DeclGroup %D.addr ; <%struct.DeclGroup> [#uses=1] %tmp1 = bitcast %struct.DeclGroup %agg.tmp to i8* ; <i8> [#uses=1] %tmp2 = bitcast %struct.DeclGroup %tmp to i8* ; <i8> [#uses=1] call void @llvm.memcpy.p0i8.p0i8.i64(i8 %tmp1, i8* %tmp2, i64 4, i32 4, i1 false) %coerce.dive = getelementptr %struct.DeclGroup* %agg.tmp, i32 0, i32 0 ; <float> [#uses=1] %0 = load float %coerce.dive, align 1 ; <float> [#uses=1] %call = call float @_Z3foo9DeclGroup(float %0) ; <float> [#uses=0] ret void } instead of: %struct.DeclGroup = type { float } define void @_Z3barP9DeclGroup(%struct.DeclGroup* %D) nounwind { entry: %D.addr = alloca %struct.DeclGroup, align 8 ; <%struct.DeclGroup> [#uses=2] %agg.tmp = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] %tmp3 = alloca double ; <double> [#uses=2] store %struct.DeclGroup %D, %struct.DeclGroup %D.addr %tmp = load %struct.DeclGroup %D.addr ; <%struct.DeclGroup> [#uses=1] %tmp1 = bitcast %struct.DeclGroup %agg.tmp to i8* ; <i8> [#uses=1] %tmp2 = bitcast %struct.DeclGroup %tmp to i8* ; <i8> [#uses=1] call void @llvm.memcpy.p0i8.p0i8.i64(i8 %tmp1, i8* %tmp2, i64 4, i32 4, i1 false) %coerce.dive = getelementptr %struct.DeclGroup* %agg.tmp, i32 0, i32 0 ; <float> [#uses=1] %0 = bitcast double %tmp3 to float* ; <float> [#uses=1] %1 = load float %coerce.dive ; <float> [#uses=1] store float %1, float* %0, align 1 %2 = load double* %tmp3 ; <double> [#uses=1] %call = call float @_Z3foo9DeclGroup(double %2) ; <float> [#uses=0] ret void } which is this machine code (at -O0): __Z3barP9DeclGroup: subq $24, %rsp movq %rdi, 16(%rsp) movq 16(%rsp), %rdi leaq 8(%rsp), %rax movl (%rdi), %ecx movl %ecx, (%rax) movss 8(%rsp), %xmm0 callq __Z3foo9DeclGroup addq $24, %rsp ret vs this: __Z3barP9DeclGroup: subq $24, %rsp movq %rdi, 16(%rsp) movq 16(%rsp), %rdi leaq 8(%rsp), %rax movl (%rdi), %ecx movl %ecx, (%rax) movss 8(%rsp), %xmm0 movss %xmm0, (%rsp) movsd (%rsp), %xmm0 callq __Z3foo9DeclGroup addq $24, %rsp ret At -O3, it is the difference between this now: __Z3barP9DeclGroup: movss (%rdi), %xmm0 jmp __Z3foo9DeclGroup # TAILCALL vs this before: __Z3barP9DeclGroup: movl (%rdi), %eax movd %rax, %xmm0 jmp __Z3foo9DeclGroup # TAILCALL llvm-svn: 107048	2010-06-28 19:56:59 +00:00
Fariborz Jahanian	c42461e145	Minor refactorin of my last patch (radar 7860965 related). llvm-svn: 107047	2010-06-28 19:42:10 +00:00
Fariborz Jahanian	36ad0e99d5	Have __func__ and siblings point to block's implementation function name. Fixes radar 7860965. llvm-svn: 107044	2010-06-28 18:58:34 +00:00
Chris Lattner	d250b8e9a8	tweak test to pass on windows llvm-svn: 107040	2010-06-28 18:29:14 +00:00
Daniel Dunbar	ebf9da9849	tests: Rewrite test to check intent instead of implementation. llvm-svn: 107024	2010-06-28 17:43:18 +00:00
Rafael Espindola	0e1fb4f341	Set the default arch based on the triple. llvm-svn: 107021	2010-06-28 17:18:09 +00:00
Chris Lattner	c1028f689e	Fix UnitTests/2004-02-02-NegativeZero.c, which regressed when I broke negate of FP values. llvm-svn: 107019	2010-06-28 17:12:37 +00:00
Chris Lattner	18bcf76f05	fix a silly fixme. llvm-svn: 107018	2010-06-28 17:11:58 +00:00
Dan Gohman	3bc1b0c882	llvm::errs() is non-buffered, so it doesn't need to be flushed. llvm-svn: 107012	2010-06-28 15:56:07 +00:00
Craig Silverstein	282ddc646b	Add support for traversing initializer lists (in constructors), which we ignoring before. To give access to the names on the initializer, which aren't a type or an expr or a decl, I've introduced a new TraverseInitializer. By default, it just traverses on the expr that the name is being initialized to. Reviewed by chandlerc. Tested via clang's 'make test'. llvm-svn: 107008	2010-06-28 15:37:14 +00:00
Sebastian Redl	f9463106b3	Introduce Expr::Classify and Expr::ClassifyModifiable, which determine the classification of an expression under the C++0x taxology (value category). Reimplement isLvalue and isModifiableLvalue using these functions. No regressions in the test suite from this, and my rough performance check doesn't show any regressions either. llvm-svn: 107007	2010-06-28 15:09:07 +00:00
Argyrios Kyrtzidis	99a226dde4	Support CXXPseudoDestructorExpr for PCH. llvm-svn: 106999	2010-06-28 09:32:03 +00:00
Argyrios Kyrtzidis	cd444d1af7	Support DependentScopeDeclRefExpr for PCH. llvm-svn: 106998	2010-06-28 09:31:56 +00:00
Argyrios Kyrtzidis	b5288de67c	Refactor PCH reading/writing of template arguments passed to expressions. llvm-svn: 106997	2010-06-28 09:31:48 +00:00
Argyrios Kyrtzidis	ddf5f211d0	Fix PCH emitting/reading for template arguments that contain expressions. llvm-svn: 106996	2010-06-28 09:31:42 +00:00
Argyrios Kyrtzidis	0b0369a6b3	Fix various bugs in recent commits for C++ PCH. llvm-svn: 106995	2010-06-28 09:31:34 +00:00
Chandler Carruth	2d69ec7a72	Partial fix for PR7267 based on comments by John McCall on an earlier patch. This is more targeted, as it simply provides toggle actions for the parser to turn access checking on and off. We then use these to suppress access checking only while we parse the template-id (included scope specifier) of an explicit instantiation and explicit specialization of a class template. The specialization behavior is an extension, as it seems likely a defect that the standard did not exempt them as it does explicit instantiations. This allows the very common practice of specializing trait classes to work for private, internal types. This doesn't address instantiating or specializing function templates, although those apparently already partially work. The naming and style for the Action layer isn't my favorite, comments and suggestions would be appreciated there. llvm-svn: 106993	2010-06-28 08:39:25 +00:00
Jordy Rose	61176897ba	Pointer comparisons (and pointer-pointer subtraction). Basically filling in SimpleSValuator::EvalBinOpLL(). llvm-svn: 106992	2010-06-28 08:26:15 +00:00
Chandler Carruth	b6f991787b	Suppress diagnosing access violations while looking up deallocation functions much as we already do for allocation function lookup. Explicitly check access for the function we actually select in one case that was previously missing, but being caught behind the blanket diagnostics for all overload candidates. This fixs PR7436. llvm-svn: 106986	2010-06-28 00:30:51 +00:00
Rafael Espindola	b1ef8ffb15	Use softfp for linux gnueabi, keep the warning for everything else. llvm-svn: 106984	2010-06-27 18:29:21 +00:00
Anders Carlsson	3f48c603fb	Correctly destroy reference temporaries with global storage. Remove ErrorUnsupported call when binding a global reference to a non-lvalue. Fixes PR7326. llvm-svn: 106983	2010-06-27 17:52:15 +00:00
Anders Carlsson	18c205ecdf	Add a CreateReferenceTemporary that will do the right thing for variables with global storage. llvm-svn: 106982	2010-06-27 17:23:46 +00:00
Anders Carlsson	2969c8c69d	Simplify CodeGenFunction::EmitReferenceBindingToExpr as a first step towards fixing PR7326. llvm-svn: 106981	2010-06-27 16:56:04 +00:00
Anders Carlsson	ca68d357d4	Reduce indentation. llvm-svn: 106980	2010-06-27 15:24:55 +00:00
Chris Lattner	818efb64a3	misc tidying llvm-svn: 106978	2010-06-27 07:40:06 +00:00
Chris Lattner	5e016ae983	finally get around to doing a significant cleanup to irgen: have CGF create and make accessible standard int32,int64 and intptr types. This fixes a ton of 80 column violations introduced by LLVMContextification and cleans up stuff a lot. llvm-svn: 106977	2010-06-27 07:15:29 +00:00
Chris Lattner	e000907e13	tidy up OrderGlobalInits llvm-svn: 106976	2010-06-27 06:32:58 +00:00
Chris Lattner	055097f024	If coercing something from int or pointer type to int or pointer type (potentially after unwrapping it from a struct) do it without going through memory. We now compile: struct DeclGroup { unsigned NumDecls; }; int foo(DeclGroup D) { return D.NumDecls; } into: %struct.DeclGroup = type { i32 } define i32 @_Z3foo9DeclGroup(i64) nounwind ssp noredzone { entry: %D = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] %coerce.dive = getelementptr %struct.DeclGroup %D, i32 0, i32 0 ; <i32> [#uses=1] %coerce.val.ii = trunc i64 %0 to i32 ; <i32> [#uses=1] store i32 %coerce.val.ii, i32 %coerce.dive %tmp = getelementptr inbounds %struct.DeclGroup* %D, i32 0, i32 0 ; <i32> [#uses=1] %tmp1 = load i32 %tmp ; <i32> [#uses=1] ret i32 %tmp1 } instead of: %struct.DeclGroup = type { i32 } define i32 @_Z3foo9DeclGroup(i64) nounwind ssp noredzone { entry: %D = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] %tmp = alloca i64 ; <i64> [#uses=2] %coerce.dive = getelementptr %struct.DeclGroup* %D, i32 0, i32 0 ; <i32> [#uses=1] store i64 %0, i64 %tmp %1 = bitcast i64* %tmp to i32* ; <i32> [#uses=1] %2 = load i32 %1, align 1 ; <i32> [#uses=1] store i32 %2, i32* %coerce.dive %tmp1 = getelementptr inbounds %struct.DeclGroup* %D, i32 0, i32 0 ; <i32> [#uses=1] %tmp2 = load i32 %tmp1 ; <i32> [#uses=1] ret i32 %tmp2 } ... which is quite a bit less terrifying. llvm-svn: 106975	2010-06-27 06:26:04 +00:00
Chris Lattner	895c52ba8b	Same patch as the previous on the store side. Before we compiled this: struct DeclGroup { unsigned NumDecls; }; int foo(DeclGroup D) { return D.NumDecls; } to: %struct.DeclGroup = type { i32 } define i32 @_Z3foo9DeclGroup(i64) nounwind ssp noredzone { entry: %D = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] %tmp = alloca i64 ; <i64> [#uses=2] store i64 %0, i64* %tmp %1 = bitcast i64* %tmp to %struct.DeclGroup* ; <%struct.DeclGroup> [#uses=1] %2 = load %struct.DeclGroup %1, align 1 ; <%struct.DeclGroup> [#uses=1] store %struct.DeclGroup %2, %struct.DeclGroup* %D %tmp1 = getelementptr inbounds %struct.DeclGroup* %D, i32 0, i32 0 ; <i32> [#uses=1] %tmp2 = load i32 %tmp1 ; <i32> [#uses=1] ret i32 %tmp2 } which caused fast isel bailouts due to the FCA load/store of %2. Now we generate this just blissful code: %struct.DeclGroup = type { i32 } define i32 @_Z3foo9DeclGroup(i64) nounwind ssp noredzone { entry: %D = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] %tmp = alloca i64 ; <i64> [#uses=2] %coerce.dive = getelementptr %struct.DeclGroup* %D, i32 0, i32 0 ; <i32> [#uses=1] store i64 %0, i64 %tmp %1 = bitcast i64* %tmp to i32* ; <i32> [#uses=1] %2 = load i32 %1, align 1 ; <i32> [#uses=1] store i32 %2, i32* %coerce.dive %tmp1 = getelementptr inbounds %struct.DeclGroup* %D, i32 0, i32 0 ; <i32> [#uses=1] %tmp2 = load i32 %tmp1 ; <i32> [#uses=1] ret i32 %tmp2 } This avoids fastisel bailing out and is groundwork for future patch. This reduces bailouts on CGStmt.ll to 911 from 935. llvm-svn: 106974	2010-06-27 06:04:18 +00:00
Chris Lattner	1cd6698a7c	improve CreateCoercedLoad a bit to generate slightly less awful IR when handling X86-64 by-value struct stuff. For example, we use to compile this: struct DeclGroup { unsigned NumDecls; }; int foo(DeclGroup D); void bar(DeclGroup D) { foo(D); } into: define void @_Z3barP9DeclGroup(%struct.DeclGroup* %D) ssp nounwind { entry: %D.addr = alloca %struct.DeclGroup, align 8 ; <%struct.DeclGroup> [#uses=2] %agg.tmp = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] %tmp3 = alloca i64 ; <i64> [#uses=2] store %struct.DeclGroup %D, %struct.DeclGroup %D.addr %tmp = load %struct.DeclGroup %D.addr ; <%struct.DeclGroup> [#uses=1] %tmp1 = bitcast %struct.DeclGroup %agg.tmp to i8* ; <i8> [#uses=1] %tmp2 = bitcast %struct.DeclGroup %tmp to i8* ; <i8> [#uses=1] call void @llvm.memcpy.p0i8.p0i8.i64(i8 %tmp1, i8* %tmp2, i64 4, i32 4, i1 false) %0 = bitcast i64* %tmp3 to %struct.DeclGroup* ; <%struct.DeclGroup> [#uses=1] %1 = load %struct.DeclGroup %agg.tmp ; <%struct.DeclGroup> [#uses=1] store %struct.DeclGroup %1, %struct.DeclGroup* %0, align 1 %2 = load i64* %tmp3 ; <i64> [#uses=1] call void @_Z3foo9DeclGroup(i64 %2) ret void } which would cause fastisel to bail out due to the first class aggregate load %1. With this patch we now compile it into the (still awful): define void @_Z3barP9DeclGroup(%struct.DeclGroup* %D) nounwind ssp noredzone { entry: %D.addr = alloca %struct.DeclGroup, align 8 ; <%struct.DeclGroup> [#uses=2] %agg.tmp = alloca %struct.DeclGroup, align 4 ; <%struct.DeclGroup> [#uses=2] %tmp3 = alloca i64 ; <i64> [#uses=2] store %struct.DeclGroup %D, %struct.DeclGroup %D.addr %tmp = load %struct.DeclGroup %D.addr ; <%struct.DeclGroup> [#uses=1] %tmp1 = bitcast %struct.DeclGroup %agg.tmp to i8* ; <i8> [#uses=1] %tmp2 = bitcast %struct.DeclGroup %tmp to i8* ; <i8> [#uses=1] call void @llvm.memcpy.p0i8.p0i8.i64(i8 %tmp1, i8* %tmp2, i64 4, i32 4, i1 false) %coerce.dive = getelementptr %struct.DeclGroup* %agg.tmp, i32 0, i32 0 ; <i32> [#uses=1] %0 = bitcast i64 %tmp3 to i32* ; <i32> [#uses=1] %1 = load i32 %coerce.dive ; <i32> [#uses=1] store i32 %1, i32* %0, align 1 %2 = load i64* %tmp3 ; <i64> [#uses=1] %call = call i32 @_Z3foo9DeclGroup(i64 %2) noredzone ; <i32> [#uses=0] ret void } which doesn't bail out. On CGStmt.ll, this reduces fastisel bail outs from 958 to 935, and is the precursor of better things to come. llvm-svn: 106973	2010-06-27 05:56:15 +00:00
Jordy Rose	7f8ea4d677	Implicitly compare symbolic expressions to zero when they're being used as constraints. Part of PR7491. llvm-svn: 106972	2010-06-27 01:20:56 +00:00
Chris Lattner	e01d966ce2	merge two tests. llvm-svn: 106971	2010-06-27 01:08:03 +00:00
Chris Lattner	3fcc790cd8	Change IR generation for return (in the simple case) to avoid doing silly load/store nonsense in the epilog. For example, for: int foo(int X) { int A[100]; return A[X]; } we used to generate: %arrayidx = getelementptr inbounds [100 x i32]* %A, i32 0, i64 %idxprom ; <i32> [#uses=1] %tmp1 = load i32 %arrayidx ; <i32> [#uses=1] store i32 %tmp1, i32* %retval %0 = load i32* %retval ; <i32> [#uses=1] ret i32 %0 } which codegen'd to this code: _foo: ## @foo ## BB#0: ## %entry subq $408, %rsp ## imm = 0x198 movl %edi, 400(%rsp) movl 400(%rsp), %edi movslq %edi, %rax movl (%rsp,%rax,4), %edi movl %edi, 404(%rsp) movl 404(%rsp), %eax addq $408, %rsp ## imm = 0x198 ret Now we generate: %arrayidx = getelementptr inbounds [100 x i32]* %A, i32 0, i64 %idxprom ; <i32> [#uses=1] %tmp1 = load i32 %arrayidx ; <i32> [#uses=1] ret i32 %tmp1 } and: _foo: ## @foo ## BB#0: ## %entry subq $408, %rsp ## imm = 0x198 movl %edi, 404(%rsp) movl 404(%rsp), %edi movslq %edi, %rax movl (%rsp,%rax,4), %eax addq $408, %rsp ## imm = 0x198 ret This actually does matter, cutting out 2000 lines of IR from CGStmt.ll for example. Another interesting effect is that altivec.h functions which are dead now get dce'd by the inliner. Hence all the changes to builtins-ppc-altivec.c to ensure the calls aren't dead. llvm-svn: 106970	2010-06-27 01:06:27 +00:00
Chris Lattner	726b3d09cd	reduce indentation llvm-svn: 106967	2010-06-26 23:13:19 +00:00
Chris Lattner	6c5abe88bf	Implement rdar://7530813 - collapse multiple GEP instructions in IRgen This avoids generating two gep's for common array operations. Before we would generate something like: %tmp = load i32* %X.addr ; <i32> [#uses=1] %arraydecay = getelementptr inbounds [100 x i32]* %A, i32 0, i32 0 ; <i32> [#uses=1] %arrayidx = getelementptr inbounds i32 %arraydecay, i32 %tmp ; <i32> [#uses=1] %tmp1 = load i32 %arrayidx ; <i32> [#uses=1] Now we generate: %tmp = load i32* %X.addr ; <i32> [#uses=1] %arrayidx = getelementptr inbounds [100 x i32]* %A, i32 0, i32 %tmp ; <i32> [#uses=1] %tmp1 = load i32 %arrayidx ; <i32> [#uses=1] Less IR is better at -O0. llvm-svn: 106966	2010-06-26 23:03:20 +00:00
Ted Kremenek	f00eac5cff	Allow '__extension__' to be analyzed in a lvalue context. llvm-svn: 106964	2010-06-26 22:40:52 +00:00
Chris Lattner	57ce97151f	minor cleanup: don't emit the base of an array subscript until after we're done diddling around with the index stuff. Use a cheaper type comparison. llvm-svn: 106963	2010-06-26 22:40:46 +00:00
Chris Lattner	431bef4409	fix inc/dec to honor -fwrapv and -ftrapv, implementing PR7426. llvm-svn: 106962	2010-06-26 22:18:28 +00:00
Chris Lattner	05dc78c096	move scalar inc/dec codegen into ScalarExprEmitter instead of being in CGF. No functionality change. llvm-svn: 106961	2010-06-26 22:09:34 +00:00
Chris Lattner	fa20e95043	use more efficient type comparison predicates. llvm-svn: 106958	2010-06-26 21:52:32 +00:00
Chris Lattner	0bf27620f0	Fix unary minus to trap on overflow with -ftrapv, refactoring binop code so we can use it from VisitUnaryMinus. llvm-svn: 106957	2010-06-26 21:48:21 +00:00
Chris Lattner	51924e517b	Implement support for -fwrapv, rdar://7221421 As part of this, pull together trapv handling into the same enum. This also add support for NSW multiplies. This also makes PCH disagreement on overflow behavior silent, since it really doesn't matter except for warnings and codegen (no macros get defined etc). llvm-svn: 106956	2010-06-26 21:25:03 +00:00
Chris Lattner	217e056e40	implement rdar://7432000 - signed negate should codegen as NSW. While I'm in there, adjust pointer to member adjustments as well. llvm-svn: 106955	2010-06-26 20:27:24 +00:00
Chris Lattner	30c924b3e8	Implement support for #pragma message, patch by Michael Spencer! llvm-svn: 106950	2010-06-26 17:11:39 +00:00
Anders Carlsson	04775f8413	Change EmitReferenceBindingToExpr to take a decl instead of a boolean. llvm-svn: 106949	2010-06-26 16:35:32 +00:00

1 2 3 4 5 ...

23658 Commits