llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	6c2d178957	add plumbing for handling multiple result nodes in some more places. llvm-svn: 99366	2010-03-24 00:41:19 +00:00
Chris Lattner	f57437abb1	don't form a RecordChild or CheckChildType for child #'s over 7, we don't have enums for them. llvm-svn: 98597	2010-03-16 00:35:11 +00:00
Chris Lattner	c95d58d31a	turn off debug spew llvm-svn: 97912	2010-03-07 07:21:24 +00:00
Chris Lattner	a160389290	more factoring. llvm-svn: 97911	2010-03-07 07:20:49 +00:00
Chris Lattner	b9071a2d5d	teach tblgen to be more aggressive when factoring CheckType nodes. Now it will factor things like this: CheckType i32 ... CheckOpcode ISD::AND CheckType i64 ... into: SwitchType: i32: ... i64: CheckOpcode ISD::AND ... This shrinks the table by a few bytes, nothing spectacular. llvm-svn: 97908	2010-03-07 07:01:28 +00:00
Chris Lattner	3e1ffd06fc	introduce a new SwitchTypeMatcher node (which is analogous to SwitchOpcodeMatcher) and have DAGISelMatcherOpt form it. This speeds up selection, particularly for X86 which has lots of variants of instructions with only type differences. llvm-svn: 97645	2010-03-03 06:28:15 +00:00
Chris Lattner	1e634e31de	resolve a fixme by having the .td file parser reject things like (set GPR, somecomplexpattern) if somecomplexpattern doesn't declare what it can match. llvm-svn: 97513	2010-03-01 22:29:19 +00:00
Chris Lattner	00f2e4b6d6	remove dead code, simplify. llvm-svn: 97510	2010-03-01 22:19:47 +00:00
Chris Lattner	664ac989ff	tolerate factoring the last node for CellSPU. llvm-svn: 97508	2010-03-01 22:04:33 +00:00
Chris Lattner	053a28a397	eliminate the CheckMultiOpcodeMatcher code and have each ComplexPattern at the root be generated multiple times, once for each opcode they are part of. This encourages factoring because the opcode checks get treated just like everything else in the matcher. llvm-svn: 97439	2010-03-01 07:17:40 +00:00
Chris Lattner	f4d1775263	add a new OPC_SwitchOpcode which is semantically equivalent to a scope where every child starts with a CheckOpcode, but executes more efficiently. Enhance DAGISelMatcherOpt to form it. This also fixes a bug in CheckOpcode: apparently the SDNodeInfo objects are not pointer comparable, we have to compare the enum name. llvm-svn: 97438	2010-03-01 06:59:22 +00:00
Chris Lattner	db5b73a77f	pull MarkFlagResult out from between an EmitNode/CompleteMatch pair. This encourages MorphNodeTo formation, this gets us 200 more MorphNodeTo's on X86 and shrinks the table a bit. llvm-svn: 97434	2010-03-01 02:33:14 +00:00
Chris Lattner	4634d9beef	enhance RecordNode and RecordChild comments to indicate what slot they're recording into, no functionality change. llvm-svn: 97433	2010-03-01 02:24:17 +00:00
Chris Lattner	90b1b9d984	Emit redundant opcode checks for andimm and orimm tests at root so that we get grouping at the top level. Add an optimization to reorder type check & record nodes after opcode checks. We prefer to expose tree shape matching which improves grouping and will enhance the next optimization. llvm-svn: 97432	2010-03-01 02:15:34 +00:00
Chris Lattner	560169d5c4	simplify some code now that chain/flag results are not stored in the vtlist for emitnode. llvm-svn: 97429	2010-02-28 23:00:47 +00:00
Chris Lattner	a838264af1	enhance the EmitNode/MorphNodeTo operands to take a bit that specifies whether there is an output flag or not. Use this instead of redundantly encoding the chain/flag results in the output vtlist. llvm-svn: 97419	2010-02-28 21:53:42 +00:00
Chris Lattner	9d67dcadce	use MorphNodeTo instead of SelectNodeTo. SelectNodeTo is just a silly wrapper around MorphNodeTo. llvm-svn: 97416	2010-02-28 20:55:18 +00:00
Chris Lattner	102a8a01e6	enhance the new isel to use SelectNodeTo for most patterns, even some the old isel didn't. There are several parts of this that make me feel dirty, but it's no worse than the old isel. I'll clean up the parts I can do without ripping out the old one next. llvm-svn: 97415	2010-02-28 20:49:53 +00:00
Chris Lattner	abb1c7912e	enhance EmitNodeMatcher to keep track of the recorded slot numbers it will populate. llvm-svn: 97363	2010-02-28 02:41:25 +00:00
Chris Lattner	c3f80e0806	add infrastructure to support forming selectnodeto. Not used yet because I have to go on another detour first. llvm-svn: 97362	2010-02-28 02:31:26 +00:00
Chris Lattner	278606b06e	change CheckOpcodeMatcher to hold the SDNodeInfo instead of the opcode name. This gives the optimizer more semantic info. llvm-svn: 97346	2010-02-27 21:48:43 +00:00
Chris Lattner	21a7bf3fab	fix logic in DEBUG. llvm-svn: 97315	2010-02-27 08:13:23 +00:00
Chris Lattner	2586c86a8a	teach the optimizer that opcode == ISD::STORE is contradictory with getType() == MVT::i32 etc. Teach it that two different integer constants are contradictory. This cuts 1K off the X86 table, down to 98k llvm-svn: 97314	2010-02-27 08:11:15 +00:00
Chris Lattner	c577b816fc	Teach the grouper some simple tricks about looking for contradictory predicates. For example if we have: Scope: CheckType i32 ABC CheckType f32 DEF CheckType i32 GHI Then we know that we can transform this into: Scope: CheckType i32 Scope ABC GHI CheckType f32 DEF This reorders the check for the 'GHI' predicate above the check for the 'DEF' predicate. However it is safe to do this in this situation because we know that a node cannot have both an i32 and f32 type. We're now doing more factoring than the old isel did. llvm-svn: 97312	2010-02-27 07:49:13 +00:00
Chris Lattner	d9e1e83e3b	implement a new optimization to sink pattern predicates (like isSSE1) as deeply into the pattern as we can get away with. In practice, this means "all the way to the emitter code, but not across ComplexPatterns". This substantially increases the amount of factoring we get. llvm-svn: 97305	2010-02-27 06:22:57 +00:00
Chris Lattner	4f9a671a30	switch from my nice hashtable based merging solution to a gross little neighbor merging implementation. This one has the benefit of not violating the ordering of patterns, so it generates code that passes tests again. llvm-svn: 97218	2010-02-26 08:08:41 +00:00
Chris Lattner	be5b634cea	finish off the factoring optimization along the lines of the current design. This generates a matcher that successfully runs, but it turns out that the factoring we're doing violates the ordering of patterns, so we end up matching (e.g.) movups where we want movaps. This won't do, but I'll address this in a follow on patch. It's nice to not be on by default yet! :) llvm-svn: 97215	2010-02-26 07:36:37 +00:00
Chris Lattner	f7fc2d8b86	change the scope node to include a list of children to be checked instead of to have a chained series of scope nodes. This makes the generated table smaller, improves the efficiency of the interpreter, and makes the factoring optimization much more reasonable to implement. llvm-svn: 97160	2010-02-25 19:00:39 +00:00
Chris Lattner	62702da070	Implement the first half of redundancy factoring: efficiently splitting all the patterns under scope nodes into equality sets based on their first node. The second step is to rewrite the graph into a form that exposes the sharing. Before I do this, I want to redesign the Scope node. llvm-svn: 97130	2010-02-25 07:45:24 +00:00
Chris Lattner	2c3f6492b0	rename fooMatcherNode to fooMatcher. llvm-svn: 97096	2010-02-25 02:04:40 +00:00
Chris Lattner	c36ab923c6	add some noop code to push it out of my tree. llvm-svn: 97094	2010-02-25 01:57:41 +00:00
Chris Lattner	ac55f9df88	rename PushMatcherNode -> ScopeMatcherNode to more accurately reflect what it does. Switch the sense of the Next and the Check arms to be more logical. No functionality change. llvm-svn: 97093	2010-02-25 01:56:48 +00:00
Chris Lattner	0c95baa848	contract movechild+checktype into a new checkchild node, shrinking the x86 table by 1200 bytes. llvm-svn: 97053	2010-02-24 20:15:25 +00:00
Chris Lattner	6b79232138	split the movechild/record/moveparent -> recordchild optzn into a movechild/record -> recordchild/movechild and movechild/moveparent -> noop xforms. This slightly shrinks the tables (x86 to 117454) and enables adding future improvements. llvm-svn: 97051	2010-02-24 19:52:48 +00:00
Chris Lattner	ab41756c2e	implement a simple proof-of-concept optimization for the new isel: fold movechild+record+moveparent into a single recordchild N node. This shrinks the X86 table from 125443 to 117502 bytes. llvm-svn: 97031	2010-02-24 07:31:45 +00:00
Chris Lattner	e7327435a7	The new isel passes all tests, time to start making it go fast. Also add an easy macro at the top of DAGISelEmitter.cpp to enable it. Let's see if I can avoid accidentally turning it on :) llvm-svn: 97029	2010-02-24 07:06:50 +00:00

36 Commits
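
The grouping described in commit c577b816fc (hoisting a later pattern past earlier ones only when their leading predicates are contradictory) can be illustrated with a small sketch. This is not tblgen's code: it is a hypothetical Python model in which each pattern is a list of tuples, `is_contradictory` and `factor_scope` are made-up names, and only `CheckType` against `CheckType` is treated as contradictory (the real optimizer knows more cases, such as the opcode/type conflicts mentioned in 2586c86a8a).

```python
def is_contradictory(a, b):
    """Assumption for this sketch: only two CheckType predicates on
    different types are known to be mutually exclusive."""
    return a[0] == "CheckType" and b[0] == "CheckType" and a[1] != b[1]


def factor_scope(children):
    """Group patterns under a scope by their first node.

    A pattern may join an earlier group only if every group it is hoisted
    past starts with a predicate that contradicts its own first node, so
    the relative order of patterns that could both match is preserved.
    Returns a list of (first_node, [tails]) pairs.
    """
    groups = []
    for pattern in children:
        head, tail = pattern[0], pattern[1:]
        target = None
        # Walk backwards over the groups formed so far.
        for i in range(len(groups) - 1, -1, -1):
            group_head = groups[i][0]
            if group_head == head:
                target = i
                break
            if not is_contradictory(group_head, head):
                break  # unsafe to move above a group that might also match
        if target is None:
            groups.append((head, [tail]))
        else:
            groups[target][1].append(tail)
    return groups


# The example from the commit message: ABC and GHI share "CheckType i32".
scope = [
    [("CheckType", "i32"), ("ABC",)],
    [("CheckType", "f32"), ("DEF",)],
    [("CheckType", "i32"), ("GHI",)],
]
factored = factor_scope(scope)
```

In this model `factored` holds two groups, with the `GHI` tail merged under the first `CheckType i32` group ahead of `DEF`. A pattern separated from its match by an unrelated predicate (for example a `CheckOpcode` node, which this sketch does not treat as contradictory) is left in place.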