llvm-project

Commit Graph

Author	SHA1	Message	Date
Cameron Zwarich	f03fa189ca	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Lang Hames	5a00499e87	Add functions 'hasPredecessor' and 'hasPredecessorHelper' to SDNode. The hasPredecessorHelper function allows predecessors to be cached to speed up repeated invocations. This fixes PR10186. X.isPredecessorOf(Y) now just calls Y.hasPredecessor(X) Y.hasPredecessor(X) calls Y.hasPredecessorHelper(X, Visited, Worklist) with empty Visited and Worklist sets (i.e. no caching over invocations). Y.hasPredecessorHelper(X, Visited, Worklist) caches search state in Visited and Worklist to speed up repeated calls. The Visited set is searched for X before going to the worklist to further search the DAG if necessary. llvm-svn: 134592	2011-07-07 04:31:51 +00:00
Eric Christopher	ea336c797c	Grammar and 80-col. llvm-svn: 134555	2011-07-06 22:41:18 +00:00
Benjamin Kramer	e1fc29b6ac	Don't allocate empty read-only SmallVectors during SelectionDAG deallocation. llvm-svn: 133348	2011-06-18 13:13:44 +00:00
Devang Patel	5de2375db8	Remove dead code. llvm-svn: 131974	2011-05-24 18:27:52 +00:00
Evan Cheng	88f9137fd7	- Teach SelectionDAG::isKnownNeverZero to return true (op x, c) when c is non-zero. - Teach X86 cmov optimization to eliminate the cmov from ctlz, cttz extension when the source of X86ISD::BSR / X86ISD::BSF is proven to be non-zero. rdar://9490949 llvm-svn: 131948	2011-05-24 01:48:22 +00:00
Devang Patel	efec7715ec	Revert 121907 (it causes llc crash) and apply original patch from PR9817. llvm-svn: 131926	2011-05-23 22:04:42 +00:00
Devang Patel	c4d9a84159	While replacing all uses of a SDValue with another value, do not forget to transfer SDDbgValue. llvm-svn: 131907	2011-05-23 17:35:08 +00:00
Owen Anderson	66fd073974	Other parts of the SelectionDAG framework assume that targets use their pointer type for vector indices. Make the vector unrolling code respect that. llvm-svn: 130733	2011-05-02 22:25:45 +00:00
Evan Cheng	c5c2cfa381	sext(undef) = 0, because the top bits will all be the same. zext(undef) = 0, because the top bits will be zero. llvm-svn: 127649	2011-03-15 02:22:10 +00:00
Evan Cheng	37139edc8c	BIT_CONVERT has been renamed to BITCAST. llvm-svn: 127600	2011-03-14 18:19:52 +00:00
Evan Cheng	d2f3b01797	Minor optimization. sign-ext/anyext of undef is still undef. llvm-svn: 127598	2011-03-14 18:15:55 +00:00
Owen Anderson	cd526fa15e	Use the correct LHS type when determining the legalization of a shift's RHS type. llvm-svn: 127163	2011-03-07 18:29:47 +00:00
Bob Wilson	24b3ba5990	Avoid exponential blow-up when printing DAGs. David Greene changed CannotYetSelect() to print the full DAG including multiple copies of operands reached through different paths in the DAG. Unfortunately this blows up exponentially in some cases. The depth limit of 100 is way too high to prevent this -- I'm seeing a message string of 150MB with a depth of only 40 in one particularly bad case, even though the DAG has less than 200 nodes. Part of the problem is that the printing code is following chain operands, so if you fail to select an operation with a chain, the printer will follow all the chained operations back to the entry node. llvm-svn: 126899	2011-03-02 23:38:06 +00:00
Owen Anderson	b2c80da4ae	Allow targets to specify the type of the RHS of a shift parameterized on the type of the LHS. llvm-svn: 126518	2011-02-25 21:41:48 +00:00
Cameron Zwarich	3cf9280214	Add a getNumSignBits() method to APInt. llvm-svn: 126379	2011-02-24 10:00:20 +00:00
Devang Patel	b7ae3ccb84	Do not lose debug info of an inlined function argument even if the argument is only used through GEPs. This time with a fix that avoids using invalidated DenseMap iterator. llvm-svn: 125984	2011-02-18 22:43:42 +00:00
Cameron Zwarich	0a1a36dc46	Roll out r125794 to help diagnose the llvm-gcc-i386-linux-selfhost failure. llvm-svn: 125830	2011-02-18 04:58:10 +00:00
Devang Patel	f922a431ee	Do not lose debug info of an inlined function argument even if the argument is only used through GEPs. llvm-svn: 125794	2011-02-17 23:33:27 +00:00
Stuart Hastings	81c4306005	Swap VT and DebugLoc operands of getExtLoad() for consistency with other getNode() methods. Radar 9002173. llvm-svn: 125665	2011-02-16 16:23:55 +00:00
Chris Lattner	eaa8341d3b	fix two comment thinkos llvm-svn: 125481	2011-02-14 06:14:42 +00:00
Chris Lattner	46c01a30f4	Enhance ComputeMaskedBits to know that aligned frameindexes have their low bits set to zero. This allows us to optimize out explicit stack alignment code like in stack-align.ll:test4 when it is redundant. Doing this causes the code generator to start turning FI+cst into FI|cst all over the place, which is general goodness (that is the canonical form) except that various pieces of the code generator don't handle OR aggressively. Fix this by introducing a new SelectionDAG::isBaseWithConstantOffset predicate, and using it in places that are looking for ADD(X,CST). The ARM backend in particular was missing a lot of addressing mode folding opportunities around OR. llvm-svn: 125470	2011-02-13 22:25:43 +00:00
Chris Lattner	e95d195014	Revisit my fix for PR9028: the issue is that DAGCombine was generating i8 shift amounts for things like i1024 types. Add an assert in getNode to prevent this from occurring in the future, fix the buggy transformation, revert my previous patch, and document this gotcha in ISDOpcodes.h llvm-svn: 125465	2011-02-13 19:09:16 +00:00
Devang Patel	639dd997eb	Remove comment about an argument that was removed couple of years ago. llvm-svn: 125054	2011-02-07 21:58:52 +00:00
Matt Beaumont-Gay	29c8c8fe92	Take Bill Wendling's suggestion for structuring a couple of asserts. llvm-svn: 124688	2011-02-01 22:12:50 +00:00
Devang Patel	1cec755494	Speculatively revert r124380. llvm-svn: 124397	2011-01-27 19:15:01 +00:00
Devang Patel	3b266a2780	While legalizing SDValues do not drop SDDbgValues, transfer them to new legal nodes. Take 2. This includes fix for dragonegg crash. llvm-svn: 124380	2011-01-27 17:43:53 +00:00
Matt Beaumont-Gay	a148c59231	Try harder to not have unused variables. llvm-svn: 124350	2011-01-27 02:39:27 +00:00
Matt Beaumont-Gay	0cddbf2bdf	Opt-mode -Wunused-variable cleanup llvm-svn: 124346	2011-01-27 01:47:50 +00:00
David Greene	bab5e6ed0e	[AVX] Add INSERT_SUBVECTOR and support it on x86. This provides a default implementation for x86, going through the stack in a similar fashion to how the codegen implements BUILD_VECTOR. Eventually this will get matched to VINSERTF128 if AVX is available. llvm-svn: 124307	2011-01-26 19:13:22 +00:00
David Greene	b6f1611928	[AVX] Support EXTRACT_SUBVECTOR on x86. This provides a default implementation of EXTRACT_SUBVECTOR for x86, going through the stack in a similar fashion to how the codegen implements BUILD_VECTOR. Eventually this will get matched to VEXTRACTF128 if AVX is available. llvm-svn: 124292	2011-01-26 15:38:49 +00:00
Devang Patel	efc6b16e4b	Provide an interface to transfer SDDbgValue from one SDNode to another. llvm-svn: 124245	2011-01-25 23:27:42 +00:00
Ted Kremenek	3c4408ceb6	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Anton Korobeynikov	2f93128109	Rename TargetFrameInfo into TargetFrameLowering. Also, put couple of FIXMEs and fixes here and there. llvm-svn: 123170	2011-01-10 12:39:04 +00:00
Jakob Stoklund Olesen	1331a15b0c	Replace TargetRegisterInfo::printReg with a PrintReg class that also works without a TRI instance. Print virtual registers numbered from 0 instead of the arbitrary FirstVirtualRegister. The first virtual register is printed as %vreg0. TRI::NoRegister is printed as %noreg. llvm-svn: 123107	2011-01-09 03:05:53 +00:00
Evan Cheng	3ae2b79aa3	Re-implement r122936 with proper target hooks. Now getMaxStoresPerMemcpy etc. takes an option OptSize. If OptSize is true, it would return the inline limit for functions with attribute OptSize. llvm-svn: 122952	2011-01-06 06:52:41 +00:00
Evan Cheng	c052ba7ff3	Revert r122936. I'll re-implement the change. llvm-svn: 122949	2011-01-06 06:17:53 +00:00
Evan Cheng	06536e7158	r105228 reduced the memcpy / memset inline limit to 4 with -Os to avoid blowing up freebsd bootloader. However, this doesn't make much sense for Darwin, whose -Os is meant to optimize for size only if it doesn't hurt performance. rdar://8821501 llvm-svn: 122936	2011-01-06 01:04:47 +00:00
Benjamin Kramer	25e6e06e42	Try to reuse the value when lowering memset. This allows us to compile: void test(char *s, int a) { __builtin_memset(s, a, 15); } into 1 mul + 3 stores instead of 3 muls + 3 stores. llvm-svn: 122710	2011-01-02 19:57:05 +00:00
Benjamin Kramer	2fdea4c8f1	Lower the i8 extension in memset to a multiply instead of a potentially long series of shifts and ors. We could implement a DAGCombine to turn x * 0x0101 back into logic operations on targets that don't support the multiply or where it is slow (p4) if someone cares enough. Example code: void test(char *s, int a) { __builtin_memset(s, a, 4); } before: _test: ## @test movzbl 8(%esp), %eax movl %eax, %ecx shll $8, %ecx orl %eax, %ecx movl %ecx, %eax shll $16, %eax orl %ecx, %eax movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret after: _test: ## @test movzbl 8(%esp), %eax imull $16843009, %eax, %eax ## imm = 0x1010101 movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret llvm-svn: 122707	2011-01-02 19:44:58 +00:00
Chris Lattner	3e5fbd74ed	rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310	2010-12-21 02:38:05 +00:00
Chris Lattner	440b2804ff	teach MaskedValueIsZero how to analyze ADDE. This is enough to teach it that ADDE(0,0) is known 0 except the low bit, for example. llvm-svn: 122191	2010-12-19 20:38:28 +00:00
Duncan Sands	d2e70b5442	Catch attempts to remove a deleted node from the CSE maps. Better to catch this here rather than later after accessing uninitialized memory etc. Fires when compiling the testcase in PR8237. llvm-svn: 121635	2010-12-12 13:22:50 +00:00
Jay Foad	583abbc4df	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Benjamin Kramer	31920b0a2a	Remove unneeded zero arrays. llvm-svn: 120910	2010-12-04 15:28:22 +00:00
Jay Foad	25a5e4ca1f	PR5207: Rename overloaded APInt methods set(), clear(), flip() to setAllBits(), setBit(unsigned), etc. llvm-svn: 120564	2010-12-01 08:53:58 +00:00
Michael J. Spencer	447762da85	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Wesley Peck	527da1b6e2	Renaming ISD::BIT_CONVERT to ISD::BITCAST to better reflect the LLVM IR concept. llvm-svn: 119990	2010-11-23 03:31:01 +00:00
Benjamin Kramer	f6fb58a216	Silence Release build warnings about unused functions. llvm-svn: 119903	2010-11-20 15:53:24 +00:00
Duncan Sands	7c601ded34	On X86, MEMBARRIER, MFENCE, SFENCE, LFENCE are not target memory intrinsics, so don't claim they are. They are allocated using DAG.getNode, so attempts to access MemSDNode fields results in reading off the end of the allocated memory. This fixes crashes with "llc -debug" due to debug code trying to print MemSDNode fields for these barrier nodes (since the crashes are not deterministic, use valgrind to see this). Add some nasty checking to try to catch this kind of thing in the future. llvm-svn: 119901	2010-11-20 11:25:00 +00:00
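
The memset lowering in 2fdea4c8f1 replaces a shift-and-or sequence with a multiply by 0x01010101 (16843009, the `imull` immediate in that commit's assembly). The sketch below only illustrates the arithmetic for a 32-bit word. The helper names `splatByteMul` and `splatByteShift` are hypothetical and are not LLVM APIs.

```cpp
#include <cstdint>

// Hypothetical helper (not an LLVM API): replicate the low byte of `value`
// into all four bytes of a 32-bit word with a single multiply.
constexpr std::uint32_t splatByteMul(std::uint32_t value) {
    // Mask to 8 bits first, mirroring the movzbl in the commit's assembly.
    return (value & 0xFFu) * 0x01010101u;
}

// The same result built from shifts and ors, as in the "before" sequence.
constexpr std::uint32_t splatByteShift(std::uint32_t value) {
    std::uint32_t word = value & 0xFFu;
    word |= word << 8;   // low byte now fills the low 16 bits
    word |= word << 16;  // low 16 bits now fill all 32 bits
    return word;
}
```

Because the masked input is at most 0xFF, the product never exceeds 0xFFFFFFFF, so the multiply cannot overflow a 32-bit word and both helpers agree for every input byte.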

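Commit 5a00499e87 describes `hasPredecessorHelper` as caching search state in `Visited` and `Worklist` so that repeated queries resume the traversal instead of restarting it. The following is a minimal sketch of that idea using standard containers. The `Node` and `SearchCache` types are assumptions made for illustration, and the code is not the SDNode implementation.

```cpp
#include <unordered_set>
#include <vector>

// Hypothetical stand-in for a DAG node; operands are the nodes it depends on.
struct Node {
    std::vector<const Node*> operands;
};

// Search state shared across repeated queries from the same starting node.
struct SearchCache {
    std::unordered_set<const Node*> visited;
    std::vector<const Node*> worklist;
};

// Returns true if `target` is reachable from `self` through operand edges.
// The cache must always be reused with the same `self`.
bool hasPredecessorHelper(const Node* self, const Node* target,
                          SearchCache& cache) {
    if (cache.visited.empty()) {
        cache.worklist.push_back(self);  // first query: seed the search
    } else if (cache.visited.count(target) != 0) {
        return true;                     // answered from earlier work
    }
    while (!cache.worklist.empty()) {
        const Node* current = cache.worklist.back();
        cache.worklist.pop_back();
        bool found = false;
        // Record every operand before returning so the cache stays complete.
        for (const Node* operand : current->operands) {
            if (cache.visited.insert(operand).second) {
                cache.worklist.push_back(operand);
            }
            if (operand == target) {
                found = true;
            }
        }
        if (found) {
            return true;
        }
    }
    return false;
}

// Uncached query, matching the commit's description of hasPredecessor.
bool hasPredecessor(const Node* self, const Node* target) {
    SearchCache cache;
    return hasPredecessorHelper(self, target, cache);
}
```

In this sketch each node is pushed onto the worklist at most once per cache, so a series of queries from one starting node costs a single traversal in total.
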
1038 Commits