llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	1c8ace0e89	Teach the BranchProbabilityInfo pass to print its results, and use that to bring it under direct test instead of merely indirectly testing it in the BlockFrequencyInfo pass. The next step is to start adding tests for the various heuristics employed, and to start fixing those heuristics once they're under test. llvm-svn: 142778	2011-10-23 21:21:50 +00:00
Bill Wendling	addcfcac5c	Rename the script to indicate that this is for the TEST=simple tests. llvm-svn: 142764	2011-10-23 20:14:06 +00:00
Bill Wendling	e10675a39d	Resurrect the 'find regressions for the TEST=nightly tests' script. llvm-svn: 142763	2011-10-23 20:13:14 +00:00
Chandler Carruth	fd7475e906	Now that we have comparison on probabilities, add some static functions to get important constant branch probabilities and use them for finding the best branch out of a set of possibilities. llvm-svn: 142762	2011-10-23 20:10:34 +00:00
Chandler Carruth	446210b616	Remove a commented out line of code that snuck by my auditing. llvm-svn: 142761	2011-10-23 20:10:30 +00:00
Benjamin Kramer	cc0ed6ba04	Print branch probabilities as percentages. 50% is much more readable than 5.000000e-01. llvm-svn: 142752	2011-10-23 11:32:54 +00:00
Benjamin Kramer	929f53f65c	Add compare operators to BranchProbability and use it to determine if an edge is hot. llvm-svn: 142751	2011-10-23 11:19:14 +00:00
Chandler Carruth	bd1be4d01c	Completely re-write the algorithm behind MachineBlockPlacement based on discussions with Andy. Fundamentally, the previous algorithm is both counter productive on several fronts and prioritizing things which aren't necessarily the most important: static branch prediction. The new algorithm uses the existing loop CFG structure information to walk through the CFG itself to layout blocks. It coalesces adjacent blocks within the loop where the CFG allows based on the most likely path taken. Finally, it topologically orders the block chains that have been formed. This allows it to choose a (mostly) topologically valid ordering which still priorizes fallthrough within the structural constraints. As a final twist in the algorithm, it does violate the CFG when it discovers a "hot" edge, that is an edge that is more than 4x hotter than the competing edges in the CFG. These are forcibly merged into a fallthrough chain. Future transformations that need te be added are rotation of loop exit conditions to be fallthrough, and better isolation of cold block chains. I'm also planning on adding statistics to model how well the algorithm does at laying out blocks based on the probabilities it receives. The old tests mostly still pass, and I have some new tests to add, but the nested loops are still behaving very strangely. This almost seems like working-as-intended as it rotated the exit branch to be fallthrough, but I'm not convinced this is actually the best layout. It is well supported by the probabilities for loops we currently get, but those are pretty broken for nested loops, so this may change later. llvm-svn: 142743	2011-10-23 09:18:45 +00:00
Craig Topper	980d59832a	Add X86 RORX instruction llvm-svn: 142741	2011-10-23 07:34:00 +00:00
Cameron Zwarich	057fbb1a10	The element insertion code in scalar replacement doesn't handle incorrect element types, even though the element extraction code does. It is surprising that this bug has been here for so long. Fixes <rdar://problem/10318778>. llvm-svn: 142740	2011-10-23 07:02:10 +00:00
Craig Topper	e94d277db8	Add X86 MULX instruction for disassembler. llvm-svn: 142738	2011-10-23 00:33:32 +00:00
Craig Topper	7412aa9886	Remove some duplicate specifying of neverHasSideEffects and mayLoad from X86 multiply instructions. llvm-svn: 142737	2011-10-22 23:13:53 +00:00
Nick Lewycky	52340ac5f8	Oops! Fix test I forgot to submit as part of r142735. llvm-svn: 142736	2011-10-22 22:07:31 +00:00
Nick Lewycky	32f8051d66	A non-escaping malloc in the entry block is not unlike an alloca. Do dead-store elimination on them too. llvm-svn: 142735	2011-10-22 21:59:35 +00:00
Nick Lewycky	a6674c7fc9	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Nadav Rotem	aa6fab2484	Fix a typo.w llvm-svn: 142729	2011-10-22 18:44:51 +00:00
Jim Grosbach	dfc072d452	Minor updates. llvm-svn: 142728	2011-10-22 18:17:32 +00:00
Nadav Rotem	9f5ca0ba7d	Added my name to CREDITS.TXT llvm-svn: 142727	2011-10-22 17:51:04 +00:00
Benjamin Kramer	0d6d098841	Move various generated tables into read-only memory, fixing up const correctness along the way. llvm-svn: 142726	2011-10-22 16:50:00 +00:00
Nadav Rotem	e649d66552	Fix pr11193. SHL inserts zeros from the right, thus even when the original sign_extend_inreg value was of 1-bit, we need to sra. llvm-svn: 142724	2011-10-22 12:39:25 +00:00
Bill Wendling	94e6643fce	The different flavors of ARM have different valid subsets of registers. Check that the set of callee-saved registers is correct for the specific platform. <rdar://problem/10313708> & ctor_dtor_count & ctor_dtor_count-2 llvm-svn: 142706	2011-10-22 00:29:28 +00:00
Jim Grosbach	11c0b347c6	Assembly parsing for 4-register sequential variant of VLD2. llvm-svn: 142704	2011-10-21 23:58:57 +00:00
Jim Grosbach	118b38cbf1	Assembly parsing for 2-register sequential variant of VLD2. llvm-svn: 142691	2011-10-21 22:21:10 +00:00
Bill Wendling	b1c430886b	Make sure that the landing pads themselves have no PHI instructions in them. The assumption in the back-end is that PHIs are not allowed at the start of the landing pad block for SjLj exceptions. <rdar://problem/10313708> llvm-svn: 142689	2011-10-21 22:08:56 +00:00
Benjamin Kramer	606a50a9f8	Extend the floating point heuristic to consider NaN checks unlikely. llvm-svn: 142687	2011-10-21 21:13:47 +00:00
Tanya Lattner	8a8d6466ff	Revert r141657 for now. This has broken css and changed links on llvm.org. I'd like to understand exactly why the links have changed and if a newer doxygen is required. This may be reapplied once we upgrade on llvm.org and it is fully tested. llvm-svn: 142686	2011-10-21 20:51:54 +00:00
Eli Friedman	688db1d6d0	Remap blockaddress correctly when inlining a function. Fixes PR10162. llvm-svn: 142684	2011-10-21 20:45:19 +00:00
Owen Anderson	500ebeb87f	Use LLVMBool for a function that logically returns a boolean value. llvm-svn: 142683	2011-10-21 20:35:58 +00:00
Jim Grosbach	846bcff7c7	Assembly parsing for 4-register variant of VLD1. llvm-svn: 142682	2011-10-21 20:35:01 +00:00
Owen Anderson	8b81be0e4a	Fix typo. llvm-svn: 142681	2011-10-21 20:28:19 +00:00
Benjamin Kramer	1e731a10d0	BranchProbabilityInfo: floating point equality is unlikely. This is from the same paper from Ball and Larus as the rest of the currently implemented heuristics. llvm-svn: 142677	2011-10-21 20:12:47 +00:00
Jim Grosbach	c4360fe575	Assembly parsing for 3-register variant of VLD1. llvm-svn: 142675	2011-10-21 20:02:19 +00:00
Owen Anderson	cef5641982	STABS symbols are debug symbols. llvm-svn: 142673	2011-10-21 19:26:54 +00:00
Eli Friedman	303c81c773	Minor simplification: use ShuffleVectorInst::getMaskValue instead of a more expensive helper. llvm-svn: 142672	2011-10-21 19:11:34 +00:00
Eli Friedman	ce818277fc	Extend instcombine's shufflevector simplification to handle more cases where the input and output vectors have different sizes. Patch by Xiaoyi Guo. llvm-svn: 142671	2011-10-21 19:06:29 +00:00
Jim Grosbach	2f2e3c4737	ARM VLD parsing and encoding. Next step in the ongoing saga of NEON load/store assmebly parsing. Handle VLD1 instructions that take a two-register register list. Adjust the instruction definitions to only have the single encoded register as an operand. The super-register from the pseudo is kept as an implicit def, so passes which come after pseudo-expansion still know that the instruction defines the other subregs. llvm-svn: 142670	2011-10-21 18:54:25 +00:00
Owen Anderson	03a173eb71	Don't automatically set the "fc" bits on MSR instructions if the user didn't ask for them. This is a divergence from gas' behavior, but it is correct per the documentation and allows us to forge ahead with roundtrip testing. llvm-svn: 142669	2011-10-21 18:43:28 +00:00
Owen Anderson	07bfdbb233	Bind libObject API for obtaining the section containing a Symbol. llvm-svn: 142667	2011-10-21 18:21:22 +00:00
Owen Anderson	f239db40d5	Expand the coverage of the libObject C bindings to include more SectionRef accessors as well as Symbol iterators. llvm-svn: 142661	2011-10-21 17:50:59 +00:00
Nadav Rotem	5e00bb5feb	Fix pr11194. When promoting and splitting integers we need to use ZExtPromotedInteger and SExtPromotedInteger based on the operation we legalize. SetCC return type needs to be legalized via PromoteTargetBoolean. llvm-svn: 142660	2011-10-21 17:35:19 +00:00
Jim Grosbach	e6d88c9a51	Nuke an #if0 that got accidentally left in. llvm-svn: 142658	2011-10-21 16:59:08 +00:00
Jim Grosbach	20cb505e2f	whitespace. llvm-svn: 142657	2011-10-21 16:56:40 +00:00
Chandler Carruth	70a38058b1	Don't hard code the desired alignment for loops -- it isn't 16-bytes on all x86 systems. Sorry for the breakage. llvm-svn: 142656	2011-10-21 16:41:39 +00:00
Jim Grosbach	e3013dd62d	Remove some outdated comments. llvm-svn: 142653	2011-10-21 16:14:12 +00:00
Nadav Rotem	d315157f12	1. Fix the widening of SETCC in WidenVecOp_SETCC. Use the correct return CC type. 2. Fix a typo in CONCAT_VECTORS which exposed the bug in #1. llvm-svn: 142648	2011-10-21 11:42:07 +00:00
Duncan Sands	12a16dbcb0	Ensure timestamps are not embedded into files when doing a release build. llvm-svn: 142647	2011-10-21 09:47:14 +00:00
Anton Korobeynikov	b27f11e01d	Fix build on mingw-w64. Patch by Ruben Van Boxem! llvm-svn: 142646	2011-10-21 09:38:50 +00:00
Chandler Carruth	8b9737cb54	Add loop aligning to MachineBlockPlacement based on review discussion so it's a bit more plausible to use this instead of CodePlacementOpt. The code for this was shamelessly stolen from CodePlacementOpt, and then trimmed down a bit. There doesn't seem to be much utility in returning true/false from this pass as we may or may not have rewritten all of the blocks. Also, the statistic of counting how many loops were aligned doesn't seem terribly important so I removed it. If folks would like it to be included, I'm happy to add it back. This was probably the most egregious of the missing features, and now I'm going to start gathering some performance numbers and looking at specific loop structures that have different layout between the two. Test is updated to include both basic loop alignment and nested loop alignment. llvm-svn: 142645	2011-10-21 08:57:37 +00:00
Chandler Carruth	ddfeaafdfb	Add a very basic test for MachineBlockPlacement. This is essentially the canonical example I used when developing it, and is one of the primary motivating real-world use cases for __builtin_expect (when burried under a macro). I'm working on more test cases here, but I'm trying to make sure both that the pass is doing the right thing with the test cases and that they aren't too brittle to changes elsewhere in the code generation pipeline. Feedback and/or suggestions on how to test this are very welcome. Especially feedback on whether testing the block comments is a good strategy; I couldn't find any good examples to steal from but all the other ideas I had were a lot uglier or more fragile. llvm-svn: 142644	2011-10-21 08:01:56 +00:00
Bill Wendling	7e9a7c4a7f	Modify the script to output the regressions and passes into categories. My Python-fu could use some improving... llvm-svn: 142643	2011-10-21 06:58:01 +00:00

1 2 3 4 5 ...

77192 Commits