llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	8971440e56	Lowering a memcpy to the stack is killing PPC. The ARM and X86 backends already have their own custom memcpy lowering code. This code needs to be factored out into a target-independent lowering method with hooks to the backend. In the meantime, just call memcpy if we're trying to copy onto a stack. llvm-svn: 43262	2007-10-23 21:30:25 +00:00
Ted Kremenek	bd3501887f	Added preliminary implementation of generic object serialization to bitcode. llvm-svn: 43261	2007-10-23 21:29:33 +00:00
Owen Anderson	9c614117da	Make DomTree and PostDomTree thin wrappers around DomTreeBase, rather than inheriting from it. llvm-svn: 43259	2007-10-23 20:58:37 +00:00
Evan Cheng	5d7032bb08	It's possible to commute instructions with more than 3 operands. llvm-svn: 43256	2007-10-23 20:14:40 +00:00
Evan Cheng	847d42a85c	isSubRegOf() is a dup of isSubRegister. llvm-svn: 43249	2007-10-23 06:51:50 +00:00
Evan Cheng	ec271b104c	Temporary solution: added a different set of BCTRL_Macho / BCTRL_ELF with right callee-saved defs set for ppc64. llvm-svn: 43248	2007-10-23 06:42:42 +00:00
Evan Cheng	1f2dd35898	Fix memcpy lowering when addresses are 4-byte aligned but size is not multiple of 4. llvm-svn: 43234	2007-10-22 22:11:27 +00:00
Dan Gohman	d09f1c40a2	The #include <iterator> isn't needed in this header. llvm-svn: 43232	2007-10-22 20:44:10 +00:00
Dan Gohman	e0c3d9f338	Strength reduction improvements. - Avoid attempting stride-reuse in the case that there are users that aren't addresses. In that case, there will be places where the multiplications won't be folded away, so it's better to try to strength-reduce them. - Several SSE intrinsics have operands that strength-reduction can treat as addresses. The previous item makes this more visible, as any non-address use of an IV can inhibit stride-reuse. - Make ValidStride aware of whether there's likely to be a base register in the address computation. This prevents it from thinking that things like stride 9 are valid on x86 when the base register is already occupied. Also, XFAIL the 2007-08-10-LEA16Use32.ll test; the new logic to avoid stride-reuse eliminates the LEA in the loop, so the test is no longer testing what it was intended to test. llvm-svn: 43231	2007-10-22 20:40:42 +00:00
Dan Gohman	bf474959a3	Fix the folding of multiplication into addresses on x86, which was broken by the recent {U,S}MUL_LOHI changes. llvm-svn: 43230	2007-10-22 20:22:24 +00:00
Evan Cheng	bdbed66333	Use ptr type in the immediate field of a BxA instruction so we don't end up selecting 32-bit call instruction for ppc64. llvm-svn: 43228	2007-10-22 19:46:19 +00:00
Evan Cheng	5163a8f53e	Add missing parentheses. llvm-svn: 43227	2007-10-22 19:42:28 +00:00
Duncan Sands	941db4da0a	Support for expanding extending loads of integers with funky bit-widths. llvm-svn: 43225	2007-10-22 19:00:05 +00:00
Dan Gohman	a37eaf2bf9	Move the SCEV object factors from being static members of the individual SCEV subclasses to being non-static member functions of the ScalarEvolution class. llvm-svn: 43224	2007-10-22 18:31:58 +00:00
Duncan Sands	8fc995069b	Fix up the logic for result expanding the various extension operations so they work right for integers with funky bit-widths. For example, consider extending i48 to i64 on a 32 bit machine. The i64 result is expanded to 2 x i32. We know that the i48 operand will be promoted to i64, then also expanded to 2 x i32. If we had the expanded promoted operand to hand, then expanding the result would be trivial. Unfortunately at this stage we can only get hold of the promoted operand. So instead we kind of hand-expand, doing explicit shifting and truncating to get the top and bottom halves of the i64 operand into 2 x i32, which are then used to expand the result. This is harmless, because when the promoted operand is finally expanded all this bit fiddling turns into trivial operations which are eliminated either by the expansion code itself or the DAG combiner. llvm-svn: 43223	2007-10-22 18:26:21 +00:00
Evan Cheng	c92446af1f	Fix an unfolding bug. llvm-svn: 43212	2007-10-22 03:03:20 +00:00
Evan Cheng	8557603781	- Only perform the unfolding optimization when the folding in question is modref. - Remove a bogus assertion. llvm-svn: 43211	2007-10-22 03:01:44 +00:00
Chris Lattner	fd6f3257b8	add a mechanism for the JIT to invoke a function to lazily create functions as they are referenced. llvm-svn: 43210	2007-10-22 02:50:12 +00:00
Chris Lattner	bf5e958ba0	llvm-gcc3 is dead, along with it __main. llvm-svn: 43209	2007-10-22 02:39:47 +00:00
Anton Korobeynikov	7499a3b092	Reg2Mem cleanup and optimizations: - enable phi instructions demotion to stack - create alloca instructions in the entry block llvm-svn: 43208	2007-10-21 23:05:16 +00:00
Chris Lattner	edaf0b4651	LoadLibraryPermanently doesn't throw. llvm-svn: 43207	2007-10-21 22:58:11 +00:00
Chris Lattner	b5163bb9f0	Add a convenience method for creating EE's. llvm-svn: 43206	2007-10-21 22:57:11 +00:00
Dale Johannesen	8ee70112ea	Allow for copysign having f80 second argument. Fixes 5550319. llvm-svn: 43205	2007-10-21 01:07:44 +00:00
Chris Lattner	36f06c80e6	Add promote operand support for [su]int_to_fp. llvm-svn: 43204	2007-10-20 22:57:56 +00:00
Chris Lattner	2ba4b148f3	Add result promotion of FP_TO_*INT, fixing CodeGen/X86/trunc-to-bool.ll with the new legalizer. llvm-svn: 43199	2007-10-20 04:32:38 +00:00
Chris Lattner	1c87f0c620	simplify some code. llvm-svn: 43198	2007-10-20 04:09:48 +00:00
Chris Lattner	2bcac640b7	Implement promote and expand for operands of memcpy and friends. This fixes CodeGen/X86/mem*.ll. llvm-svn: 43197	2007-10-20 04:07:07 +00:00
Evan Cheng	f12967124c	Added missing curly braces which renders the if clause useless in debug build. llvm-svn: 43196	2007-10-20 04:01:47 +00:00
Dale Johannesen	771188cf60	Fix a few places vector operations were not getting the operand's type from the right place. llvm-svn: 43195	2007-10-20 00:07:52 +00:00
Evan Cheng	45e096c77e	Resolve unfold tables ambiguity. llvm-svn: 43194	2007-10-19 23:50:58 +00:00
Evan Cheng	35ff79370b	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Bill Wendling	ac5c93040f	Don't branch fold inline asm statements. llvm-svn: 43191	2007-10-19 21:09:55 +00:00
Duncan Sands	a87c9e4b75	Add support for a few more nodes. llvm-svn: 43190	2007-10-19 20:29:48 +00:00
Dale Johannesen	6802d0c96f	Redo "last ppc long double fix" as Chris wants. llvm-svn: 43189	2007-10-19 20:29:00 +00:00
Chris Lattner	064c31ebac	Fix a really nasty vector miscompilation bill recently introduced. llvm-svn: 43181	2007-10-19 16:47:35 +00:00
Chris Lattner	3ea519e56d	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Rafael Espindola	18a831d783	split LowerMEMCPY into LowerMEMCPYCall and LowerMEMCPYInline in the ARM backend. llvm-svn: 43176	2007-10-19 14:35:17 +00:00
Duncan Sands	a9953e4d0a	Support for expanding ADDE and SUBE. llvm-svn: 43175	2007-10-19 13:06:17 +00:00
Duncan Sands	d9834b29dd	If the value types are equal then this routine asserts in later checks rather than producing the ordinary load it is supposed to. Avoid all such hassles by directly returning an ordinary load in this case. llvm-svn: 43174	2007-10-19 13:05:40 +00:00
Rafael Espindola	846c19dd70	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	e5a6448533	Implement a few new operations. llvm-svn: 43171	2007-10-19 04:46:45 +00:00
Chris Lattner	e31365eecc	Implement expansion of SINT_TO_FP and UINT_TO_FP operands. llvm-svn: 43170	2007-10-19 04:32:47 +00:00
Chris Lattner	9081d08083	implement support for custom expansion of any node type, in one place. llvm-svn: 43169	2007-10-19 04:14:36 +00:00
Chris Lattner	b193576bc6	comment fixes llvm-svn: 43168	2007-10-19 04:08:28 +00:00
Chris Lattner	d01b8ea4a5	Make use of TLI.ExpandOperation, remove softfloat stuff. llvm-svn: 43167	2007-10-19 03:58:25 +00:00
Chris Lattner	3c7ee41c78	add expand support for bit_convert result, even allowing custom expansion. llvm-svn: 43166	2007-10-19 03:33:14 +00:00
Chris Lattner	579db81f1c	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Chris Lattner	5d979d57ae	Add an easy microoptimization I noticed. llvm-svn: 43164	2007-10-19 03:29:26 +00:00
Bill Wendling	de16ad1446	Negative indices aren't allowed here. llvm-svn: 43161	2007-10-19 01:10:49 +00:00
Dale Johannesen	10432e5a67	More ppcf128 issues (maybe the last)? llvm-svn: 43160	2007-10-19 00:59:18 +00:00
Evan Cheng	463e2ab0ac	- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but only returns the opcode of the instruction post unfolding. - Fix some copy+paste bugs. llvm-svn: 43153	2007-10-18 22:40:57 +00:00
Evan Cheng	aa9a225699	Use SmallVectorImpl instead of SmallVector with hardcoded size in MRegister public interface. llvm-svn: 43150	2007-10-18 21:29:24 +00:00
Devang Patel	df49cf52e2	Try again. Instead of loading small global string from memory, use integer constant. llvm-svn: 43148	2007-10-18 19:52:32 +00:00
Owen Anderson	09b83ba6f1	Allow GVN to eliminate redundant calls to functions without side effects. llvm-svn: 43147	2007-10-18 19:39:33 +00:00
Christopher Lamb	79dfbed6f6	Fix a misnamed parameter. llvm-svn: 43145	2007-10-18 19:29:45 +00:00
Christopher Lamb	7f68cf0d57	Fix a typo llvm-svn: 43144	2007-10-18 19:28:55 +00:00
Chris Lattner	9715d9fb59	Fix PR1735 and Transforms/DeadArgElim/2007-10-18-VarargsReturn.ll by fixing some obviously broken code :( llvm-svn: 43141	2007-10-18 18:49:29 +00:00
Chris Lattner	ef6500992f	this doesn't need dynamic_cast. llvm-svn: 43133	2007-10-18 16:26:24 +00:00
Chris Lattner	9afb8e4e29	Reduce reliance on rtti info llvm-svn: 43130	2007-10-18 16:11:18 +00:00
Chris Lattner	9b6ec77647	fix typo llvm-svn: 43129	2007-10-18 16:10:48 +00:00
Chris Lattner	1b88e3c2dd	This requires rtti info because tblgen uses commandline, and tblgen requires rtti. llvm-svn: 43127	2007-10-18 15:57:29 +00:00
Gordon Henriksen	ea31de8dc1	Work around downrev gccs which do not inherit visibility of the Registry<>::iterator member class. llvm-svn: 43122	2007-10-18 11:53:05 +00:00
Bill Wendling	070aca5d25	Pointer arithmetic should be done with the index the same size as the pointer. llvm-svn: 43120	2007-10-18 08:32:37 +00:00
Duncan Sands	cb7aca0dcb	Support for ADDC/SUBC. llvm-svn: 43119	2007-10-18 08:22:16 +00:00
Evan Cheng	e6a41c066a	Really fix PR1734. Carefully track which register uses are sub-register uses by traversing inverse register coalescing map. llvm-svn: 43118	2007-10-18 07:49:59 +00:00
Chris Lattner	84f3461c49	legalizing the ret operation on f64 shouldn't introduce a new i64 bit convert needlessly. llvm-svn: 43116	2007-10-18 06:17:07 +00:00
Owen Anderson	ca831a829d	Move Split<...>() into DomTreeBase. This should make the #include's of DominatorInternals.h in CodeExtractor and LoopSimplify unnecessary. Hartmut, could you confirm that this fixes the issues you were seeing? llvm-svn: 43115	2007-10-18 05:13:52 +00:00
Evan Cheng	cdcc1d0444	Reverting r43070 for now. It's causing llc test failures. llvm-svn: 43103	2007-10-17 23:51:13 +00:00
Gordon Henriksen	ef5d08f4ea	Switching TargetMachineRegistry to use the new generic Registry. llvm-svn: 43094	2007-10-17 21:28:48 +00:00
Devang Patel	b3dac3f5d9	Do not raise free() call that is called through invoke instruction. llvm-svn: 43083	2007-10-17 20:12:58 +00:00
Hartmut Kaiser	2f842e613f	Fixed linker errors (unresolved externals: split<>(...)) when compiling with VC++. Please review. llvm-svn: 43081	2007-10-17 18:37:09 +00:00
Dan Gohman	07159205dd	Define a helper function ConstantVector::getSplatValue for testing for and working with broadcasted constants. llvm-svn: 43076	2007-10-17 17:51:30 +00:00
Dan Gohman	8f518b9875	Add support for ISD::SELECT in SplitVectorOp. llvm-svn: 43072	2007-10-17 14:48:28 +00:00
Duncan Sands	d42c812f4a	Return Expand from getOperationAction for all extended types. This is needed for SIGN_EXTEND_INREG at least. It is not clear if this is correct for other operations. On the other hand, for the various load/store actions it seems correct to return the type action, as is currently done. Also, it seems that SelectionDAG::getValueType can be called for extended value types; introduce a map for holding these, since we don't really want to extend the vector to be 2^32 pointers long! Generalize DAGTypeLegalizer::PromoteResult_TRUNCATE and DAGTypeLegalizer::PromoteResult_INT_EXTEND to handle the various funky possibilities that apints introduce, for example that you can promote to a type that needs to be expanded. llvm-svn: 43071	2007-10-17 13:49:58 +00:00
Devang Patel	91ff13edcc	Apply "Instead of loading small c string constant, use integer constant directly" transformation while processing load instruction. llvm-svn: 43070	2007-10-17 07:24:40 +00:00
Evan Cheng	0dde6e5761	Apply Chris' suggestions. llvm-svn: 43069	2007-10-17 06:53:44 +00:00
Chris Lattner	12d5da49d3	Change fp to sint legalization on x86-32 to do 2 x i32 loads instead of 1 x i64 loads. This doesn't change any functionality yet. llvm-svn: 43068	2007-10-17 06:17:29 +00:00
Chris Lattner	693cbeadff	fix some funny indentation, add comments. llvm-svn: 43066	2007-10-17 06:02:13 +00:00
Evan Cheng	c8b5397000	One more extract_subreg coalescing bug fix. llvm-svn: 43065	2007-10-17 05:29:37 +00:00
Evan Cheng	9b0a44a2ce	Fix MergeValueInAsValue(). It allows overlapping live ranges but should replace their value numbers with the specified value number. llvm-svn: 43062	2007-10-17 02:13:29 +00:00
Evan Cheng	a6fd8bc97e	Clean up code that calculate MBB live-in's. llvm-svn: 43061	2007-10-17 02:12:22 +00:00
Evan Cheng	8b8c7c9927	Clean up code that calculate MBB live-in's. llvm-svn: 43060	2007-10-17 02:10:22 +00:00
Owen Anderson	84490d44ec	Move splitBlock into DomTreeBase from DomTree. llvm-svn: 43059	2007-10-17 02:03:17 +00:00
Devang Patel	8d818f5e80	Use immediate stores. llvm-svn: 43055	2007-10-16 23:44:18 +00:00
Dale Johannesen	e5facd51cb	Disable attempts to constant fold PPC f128. Remove the assumption that this will happen from various places. llvm-svn: 43053	2007-10-16 23:38:29 +00:00
Evan Cheng	8f644cef0f	Some clean up. llvm-svn: 43043	2007-10-16 21:09:14 +00:00
Owen Anderson	4187801f85	Template DominatorTreeBase by node type. This is the next major step towards having dominator information on MBB's. llvm-svn: 43036	2007-10-16 19:59:25 +00:00
Evan Cheng	fab7ca89d5	Fix PR1734. llvm-svn: 43035	2007-10-16 19:29:47 +00:00
Dale Johannesen	e5530a35d4	Check for invalid cc's in f80 select. llvm-svn: 43033	2007-10-16 18:09:08 +00:00
Chris Lattner	1366653e2f	Fix a bug handling frame references in ppc inline asm when the frame offset doesn't fit into 16 bits. llvm-svn: 43032	2007-10-16 18:00:18 +00:00
Duncan Sands	bbbfbe95f7	Initial infrastructure for arbitrary precision integer codegen support. This should have no effect on codegen for other types. Debatable bits: (1) the use (abuse?) of a set in SDNode::getValueTypeList; (2) the length of getTypeToTransformTo, which maybe should be refactored with a non-inline part for extended value types. llvm-svn: 43030	2007-10-16 09:56:48 +00:00
Duncan Sands	052c843559	Fixes due to lack of type-safety for ValueType: (1) ValueType being passed instead of an opcode; (2) ValueType being passed for isVolatile (!) in getLoad. llvm-svn: 43028	2007-10-16 09:07:20 +00:00
Arnold Schwaighofer	b3d58b98d0	Correction to tail call optimization code. The new return address was stored to the actual stack slot before the parameters were lowered to their stack slot. This could cause arguments to be overwritten by the return address if the called function had less parameters than the caller function. The update should remove the last failing test case of llc-beta: SPASS. llvm-svn: 43027	2007-10-16 09:05:00 +00:00
Evan Cheng	ecf62cb763	Code clean up. llvm-svn: 43026	2007-10-16 08:04:24 +00:00
Chris Lattner	cece03dd89	implement promotion of select and select_cc, allowing MallocBench/gs to work with type promotion on x86. llvm-svn: 43025	2007-10-16 03:00:22 +00:00
Dan Gohman	9aa4fc5cd6	Teach IntrinsicLowering.cpp about the sin, cos, and pow intrinsics. llvm-svn: 43020	2007-10-15 22:07:31 +00:00
Evan Cheng	04c44712d3	Make CalcLatency() non-recursive. llvm-svn: 43017	2007-10-15 21:33:22 +00:00
Chris Lattner	06a4954e6e	Change LowerFP_TO_SINT to create the specific code it needs instead of unconditionally creating an i64 bitcast. With the future legalizer design, operation legalization can't introduce new nodes with illegal types. This fixes the rest of olden on ppc32. llvm-svn: 43005	2007-10-15 20:14:52 +00:00
Evan Cheng	7bcfd8f880	LowerFP_TO_SINT must not create a stack object if it's not needed. llvm-svn: 43004	2007-10-15 20:11:21 +00:00
Devang Patel	324fe8904f	Add removeModuleProvider() llvm-svn: 43002	2007-10-15 19:56:32 +00:00
Evan Cheng	a5abba65b6	Fix PR1729: watch out for val# with no def. llvm-svn: 42996	2007-10-15 18:33:50 +00:00
Chris Lattner	d6f7d44eae	Move CreateStackTemporary out to SelectionDAG llvm-svn: 42995	2007-10-15 17:48:57 +00:00
Chris Lattner	9eb7a829e6	add a new CreateStackTemporary helper method. llvm-svn: 42994	2007-10-15 17:47:20 +00:00
Chris Lattner	9d5b131e70	implement promotion of BR_CC operands, fixing bisort on ppc. llvm-svn: 42992	2007-10-15 17:16:12 +00:00
Chris Lattner	8555e69def	updates from duncan llvm-svn: 42991	2007-10-15 16:46:29 +00:00
Devang Patel	bff4aea328	Achieve same result but use fewer lines of code. llvm-svn: 42985	2007-10-15 15:31:35 +00:00
Neil Booth	9130551996	Fast-track obviously over-large and over-small exponents during decimal-> integer conversion. In some such cases this makes us one or two orders of magnitude faster than NetBSD's libc. Glibc seems to have a similar fast path. Also, tighten up some upper bounds to save a bit of memory. llvm-svn: 42984	2007-10-15 15:00:55 +00:00
Duncan Sands	f6977d9842	Fix some typos. Call getTypeToTransformTo rather than getTypeToExpandTo. The difference is that getTypeToExpandTo gives the final result of expansion (eg: i128 -> i32 on a 32 bit machine) while getTypeToTransformTo does just one step (i128 -> i64). llvm-svn: 42982	2007-10-15 13:30:18 +00:00
Chris Lattner	3cfb56d489	One mundane change: Change ReplaceAllUsesOfValueWith to optionally take a deleted nodes vector, instead of requiring it. One more significant change: Implement the start of a legalizer that just works on types. This legalizer is designed to run before the operation legalizer and ensure just that the input dag is transformed into an output dag whose operand and result types are all legal, even if the operations on those types are not. This design/impl has the following advantages: 1. When finished, this will significantly reduce the amount of code in LegalizeDAG.cpp. It will remove all the code related to promotion and expansion as well as splitting and scalarizing vectors. 2. The new code is very simple, idiomatic, and modular: unlike LegalizeDAG.cpp, it has no 3000 line long functions. :) 3. The implementation is completely iterative instead of recursive, good for hacking on large dags without blowing out your stack. 4. The implementation updates nodes in place when possible instead of deallocating and reallocating the entire graph that points to some mutated node. 5. The code nicely separates out handling of operations with invalid results from operations with invalid operands, making some cases simpler and easier to understand. 6. The new -debug-only=legalize-types option is very very handy :), allowing you to easily understand what legalize types is doing. This is not yet done. Until the ifdef added to SelectionDAGISel.cpp is enabled, this does nothing. However, this code is sufficient to legalize all of the code in 186.crafty, olden and freebench on an x86 machine. The biggest issues are: 1. Vectors aren't implemented at all yet 2. SoftFP is a mess, I need to talk to Evan about it. 3. No lowering to libcalls is implemented yet. 4. Various operations are missing etc. 5. There are FIXME's for stuff I hax0r'd out, like softfp. Hey, at least it is a step in the right direction :). If you'd like to help, just enable the #ifdef in SelectionDAGISel.cpp and compile code with it. If this explodes it will tell you what needs to be implemented. Help is certainly appreciated. Once this goes in, we can do three things: 1. Add a new pass of dag combine between the "type legalizer" and "operation legalizer" passes. This will let us catch some long-standing isel issues that we miss because operation legalization often obfuscates the dag with target-specific nodes. 2. We can rip out all of the type legalization code from LegalizeDAG.cpp, making it much smaller and simpler. When that happens we can then reimplement the core functionality left in it in a much more efficient and non-recursive way. 3. Once the whole legalizer is non-recursive, we can implement whole-function selectiondags maybe... llvm-svn: 42981	2007-10-15 06:10:22 +00:00
Chris Lattner	b193517eed	One xform performed by LegalizeDAG is transformation of "store of fp" to "store of int". Make two changes: 1) only xform "store of f32" if i32 is a legal type for the target. 2) only xform "store of f64" if either i64 or i32 are legal for the target. 3) if i64 isn't legal, manually lower to 2 stores of i32 instead of letting a later pass of legalize do it. This is ugly, but helps future changes I'm about to commit. llvm-svn: 42980	2007-10-15 05:46:06 +00:00
Chris Lattner	2b827fd70d	avoid an APFloat copy. llvm-svn: 42979	2007-10-15 05:34:10 +00:00
Chris Lattner	90e0b271df	Add a (disabled by default) way to view the ID of a node. llvm-svn: 42978	2007-10-15 05:32:43 +00:00
Dale Johannesen	207bd4d90e	Handle PPC long double in CBackend. llvm-svn: 42972	2007-10-15 01:05:37 +00:00
Chris Lattner	fbbe570994	remove misleading comment. llvm-svn: 42970	2007-10-14 20:35:12 +00:00
Chris Lattner	ebe491ea9c	If a target doesn't have HasMULHU or HasUMUL_LOHI, ExpandOp would return without lo/hi set. Fall through to making a libcall instead. llvm-svn: 42969	2007-10-14 18:35:05 +00:00
Neil Booth	5fe658b21d	Consolidate logic for creating NaNs. Silence compiler warning. llvm-svn: 42966	2007-10-14 10:39:51 +00:00
Neil Booth	06077e7c3c	Whether arithmetic is supported is a property of the semantics. Make it so, and clean up the checks by putting them in an inline function. llvm-svn: 42965	2007-10-14 10:29:28 +00:00
Neil Booth	4ed401b898	Separate out parsing of decimal number. Use this to only allocate memory for the significand once up-front. Also ignore insignificant trailing zeroes; this saves unnecessary multiplications later. llvm-svn: 42964	2007-10-14 10:16:12 +00:00
Evan Cheng	4099f4f91a	Unbreak x86-64. llvm-svn: 42962	2007-10-14 10:09:39 +00:00
Evan Cheng	8d6da9142c	When coalescing an EXTRACT_SUBREG and the dst register is a physical register, the source register will be coalesced to the super register of the LHS. Properly merge in the live ranges of the resulting coalesced interval that were part of the original source interval to the live interval of the super-register. llvm-svn: 42961	2007-10-14 10:08:34 +00:00
Evan Cheng	cdf3609130	Revert 42908 for now. llvm-svn: 42960	2007-10-14 05:57:21 +00:00
Dale Johannesen	2f6b6d6fb0	Fix type mismatch error in PPC Altivec (only causes a problem when asserts are on). From vecLib. llvm-svn: 42959	2007-10-14 01:58:32 +00:00
Dale Johannesen	19db093b35	Disable some compile-time optimizations on PPC long double. llvm-svn: 42958	2007-10-14 01:56:47 +00:00
Duncan Sands	29af26f147	Clarify that fastcc has a problem with nested function trampolines, rather than with nested functions themselves. llvm-svn: 42955	2007-10-13 07:38:37 +00:00
Chris Lattner	f47e30627a	Enhance the truncstore optimization code to handle shifted values and propagate demanded bits through them in simple cases. This allows this code: void foo(char *P) { strcpy(P, "abc"); } to compile to: _foo: ldrb r3, [r1] ldrb r2, [r1, #+1] ldrb r12, [r1, #+2]! ldrb r1, [r1, #+1] strb r1, [r0, #+3] strb r2, [r0, #+1] strb r12, [r0, #+2] strb r3, [r0] bx lr instead of: _foo: ldrb r3, [r1, #+3] ldrb r2, [r1, #+2] orr r3, r2, r3, lsl #8 ldrb r2, [r1, #+1] ldrb r1, [r1] orr r2, r1, r2, lsl #8 orr r3, r2, r3, lsl #16 strb r3, [r0] mov r2, r3, lsr #24 strb r2, [r0, #+3] mov r2, r3, lsr #16 strb r2, [r0, #+2] mov r3, r3, lsr #8 strb r3, [r0, #+1] bx lr testcase here: test/CodeGen/ARM/truncstore-dag-combine.ll This also helps occasionally for X86 and other cases not involving unaligned load/stores. llvm-svn: 42954	2007-10-13 06:58:48 +00:00
Chris Lattner	5e6fe054a2	Add a simple optimization to simplify the input to truncate and truncstore instructions, based on the knowledge that they don't demand the top bits. llvm-svn: 42952	2007-10-13 06:35:54 +00:00
Neil Booth	c799fe9ed9	If the power of 5 is exact, and the reciprocal exact, the error is zero not one half-ulps. This prevents an infinite loop in rare cases. llvm-svn: 42950	2007-10-13 03:34:08 +00:00
Evan Cheng	b63076504e	Local spiller optimization: Turn this: movswl %ax, %eax movl %eax, -36(%ebp) xorl %edi, -36(%ebp) into movswl %ax, %eax xorl %edi, %eax movl %eax, -36(%ebp) by unfolding the load / store xorl into an xorl and a store when we know the value in the spill slot is available in a register. This doesn't change the number of instructions but reduce the number of times memory is accessed. Also unfold some load folding instructions and reuse the value when similar situation presents itself. llvm-svn: 42947	2007-10-13 02:50:24 +00:00
Evan Cheng	7082dcf605	Change unfoldMemoryOperand(). User is now responsible for passing in the register used by the unfolded instructions. User can also specify whether to unfold the load, the store, or both. llvm-svn: 42946	2007-10-13 02:35:06 +00:00
Evan Cheng	9490e0d078	Optionally create a MachineInstr without default implicit operands. llvm-svn: 42945	2007-10-13 02:23:01 +00:00
Arnold Schwaighofer	e8d0bf2669	Correcting the corrections. Bad bad baaad emacs! llvm-svn: 42935	2007-10-12 21:53:12 +00:00
Arnold Schwaighofer	1f0da1fefb	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Devang Patel	371e6ca690	Dest type is always i8 *. This allows some simplification. Do not filter memmove. llvm-svn: 42930	2007-10-12 20:10:21 +00:00
Duncan Sands	a6286bd502	Due to the new tail call optimization, trampolines can no longer be created for fastcc functions. llvm-svn: 42925	2007-10-12 19:37:31 +00:00
Dale Johannesen	61c574fc51	ppc long double. Implement fabs and fneg. llvm-svn: 42924	2007-10-12 19:02:17 +00:00
Evan Cheng	409fa443fc	Update. llvm-svn: 42922	2007-10-12 18:22:55 +00:00
Chris Lattner	ad618f66e6	Fix a bug in my patch last night that broke InstCombine/2007-10-12-Crash.ll llvm-svn: 42920	2007-10-12 18:05:47 +00:00
Dale Johannesen	a1a4a9ebfa	Implement i64->ppcf128 conversions. llvm-svn: 42919	2007-10-12 17:52:03 +00:00
Evan Cheng	1410b8512c	Did mean to leave this in. INSERT_SUBREG isn't being coalesced yet. llvm-svn: 42916	2007-10-12 17:16:50 +00:00
Neil Booth	d502a82092	Remove duplicate comment. llvm-svn: 42913	2007-10-12 16:05:57 +00:00
Neil Booth	b93d90e98c	Implement correctly-rounded decimal->binary conversion, i.e. conversion from user input strings. Such conversions are more intricate and subtle than they may appear; it is unlikely I have got it completely right first time. I would appreciate being informed of any bugs and incorrect roundings you might discover. llvm-svn: 42912	2007-10-12 16:02:31 +00:00
Neil Booth	e9dbe094aa	Remove a field that was never used. llvm-svn: 42911	2007-10-12 15:35:10 +00:00
Neil Booth	146fdb3eeb	If we're trying to be arbitrary precision, unsigned char clearly won't cut it. Needed for dec->bin conversions. llvm-svn: 42910	2007-10-12 15:33:27 +00:00
Neil Booth	7e74b17ad2	Don't attempt to mask no bits llvm-svn: 42909	2007-10-12 15:31:31 +00:00
Dan Gohman	dc35bd79ca	Change the names used for internal labels to use the current function symbol name instead of a codegen-assigned function number. Thanks Evan! :-) llvm-svn: 42908	2007-10-12 14:53:36 +00:00
Dan Gohman	e3583817ac	Fix some corner cases with vectors in copyToRegs and copyFromRegs. llvm-svn: 42907	2007-10-12 14:33:11 +00:00
Dan Gohman	4f056f3c10	Add support to SplitVectorOp for powi, where the second operand is a scalar integer. llvm-svn: 42906	2007-10-12 14:13:46 +00:00
Dan Gohman	8d978da3b0	Mark vector ctpop, cttz, and ctlz as Expand on x86. llvm-svn: 42905	2007-10-12 14:09:42 +00:00
Dan Gohman	9013eaff9a	Mark vector pow, ctpop, cttz, and ctlz as Expand on PowerPC. llvm-svn: 42904	2007-10-12 14:08:57 +00:00
Evan Cheng	11330f7526	Restrict EXTRACT_SUBREG coalescing to avoid negative performance impact. llvm-svn: 42903	2007-10-12 09:15:53 +00:00

20582 Commits