llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	e8e0f5cac6	Make analyzeBranch family of instruction names consistent analyzeBranch was renamed to use lowercase first, rename the related set to match. llvm-svn: 281506	2016-09-14 17:24:15 +00:00
Kyle Butt	64e428147f	Branch Folding: Accept explicit threshold for tail merge size. This is prep work for allowing the threshold to be different during layout, and to enforce a single threshold between merging and duplicating during layout. No observable change intended. llvm-svn: 279117	2016-08-18 18:57:29 +00:00
Sjoerd Meijer	15c81b05ea	[MBP] do not reorder and move up loop latch block Do not reorder and move up a loop latch block before a loop header when optimising for size because this will generate an extra unconditional branch. Differential Revision: https://reviews.llvm.org/D22521 llvm-svn: 278840	2016-08-16 19:50:33 +00:00
David Majnemer	c700490f48	Use the range variant of remove_if instead of unpacking begin/end No functionality change is intended. llvm-svn: 278475	2016-08-12 04:32:37 +00:00
David Majnemer	0d955d0bf5	Use the range variant of find instead of unpacking begin/end If the result of the find is only used to compare against end(), just use is_contained instead. No functionality change is intended. llvm-svn: 278433	2016-08-11 22:21:41 +00:00
Kyle Butt	02d8d054ab	Codegen: MachineBlockPlacement Improve probability layout. The following pattern was being layed out poorly: A / \ B C / \ / \ D E ? (Doesn't matter) Where A->B is far more likely than A->C, and prob(B->D) = prob(B->E) The current algorithm gives: A,B,C,E (D goes on worklist) It does this even if C has a frequency count of 0. This patch adjusts the layout calculation so that if freq(B->E) >> freq(C->E) then we go ahead and layout E rather than C. Fallthrough half the time is better than fallthrough never, or fallthrough very rarely. The resulting layout is: A,B,E, (C and D are in a worklist) llvm-svn: 277187	2016-07-29 18:09:28 +00:00
Sjoerd Meijer	5e11a18f5a	[MBP] Added some more debug messages and some clean ups /NFC Differential Revision: https://reviews.llvm.org/D22669 llvm-svn: 276849	2016-07-27 08:49:23 +00:00
Sjoerd Meijer	fd0ad4e193	[MBP] Clean up of the comments, and a first attempt to better describe a part of the algorithm. Differential Revision: https://reviews.llvm.org/D22364 llvm-svn: 275595	2016-07-15 18:41:56 +00:00
Jacques Pienaar	71c30a14b7	Rename AnalyzeBranch* to analyzeBranch*. Summary: NFC. Rename AnalyzeBranch/AnalyzeBranchPredicate to analyzeBranch/analyzeBranchPredicate to follow LLVM coding style and be consistent with TargetInstrInfo's analyzeCompare and analyzeSelect. Reviewers: tstellarAMD, mcrosier Subscribers: mcrosier, jholewinski, jfb, arsenm, dschuff, jyknight, dsanders, nemanjai Differential Revision: https://reviews.llvm.org/D22409 llvm-svn: 275564	2016-07-15 14:41:04 +00:00
Xinliang David Li	93926acbb2	[MBP] method interface cleanup Make worklist and ehworklist member of the class so that they don't need to be passed around. llvm-svn: 274333	2016-07-01 05:46:48 +00:00
Kyle Butt	82c2290e0f	Codegen: [MBP] Add messages to asserts. NFC llvm-svn: 274075	2016-06-28 22:50:54 +00:00
Xinliang David Li	449cdfd00a	[MBP] show function name in debug dump llvm-svn: 273744	2016-06-24 22:54:21 +00:00
Kyle Butt	b3875ea71b	Codegen: [MBP] Add assert strings. NFC llvm-svn: 273067	2016-06-17 22:40:19 +00:00
Xinliang David Li	e34ed833e5	[MBP] add comments and bug fix Document the new parameter and threshod computation model. Also fix a bug when the threshold parameter is set to be different from the default. llvm-svn: 272749	2016-06-15 03:03:30 +00:00
Dehao Chen	9f2bdfb40f	Set machine block placement hot prob threshold for both static and runtime profile. Summary: With runtime profile, we have more confidence in branch probability, thus during basic block layout, we set a lower hot prob threshold so that blocks can be layouted optimally. Reviewers: djasper, davidxl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20991 llvm-svn: 272729	2016-06-14 22:27:17 +00:00
Xinliang David Li	52530a72c9	[MBP] Interface cleanups /NFC Save machine function pointer so that the reference does not need to be passed around. This also gives other methods access to machine function for information such as entry count etc. llvm-svn: 272594	2016-06-13 22:23:44 +00:00
Xinliang David Li	cbf1214f76	[MBP] Code cleanup #3 /NFC This is third patch to clean up the code. Included in this patch: 1. Further unclutter trace/chain formation main routine; 2. Isolate the logic to compute global cost/conflict detection into its own method; 3. Heavily document the selection algorithm; 4. Added helper hook to allow PGO specific logic to be added in the future. llvm-svn: 272582	2016-06-13 20:24:19 +00:00
Xinliang David Li	071d0f1807	[MBP] Code cleanup /NFC This is second patch to clean up the code. In this patch, the logic to determine block outlinining is refactored and more comments are added. llvm-svn: 272514	2016-06-12 16:54:03 +00:00
Xinliang David Li	594ffa3d36	[MBP] Code cleanup /NFC This is one of the patches to clean up the code so that it is in a better form to make future enhancements easier. In htis patch, the logic to collect viable successors are extrated as a helper to unclutter the caller which gets very large recenty. Also cleaned up BP adjustment code. llvm-svn: 272482	2016-06-11 18:35:40 +00:00
Haicheng Wu	5b458cc1f6	Reapply "[MBP] Reduce code size by running tail merging in MBP."" This reapplies commit r271930, r271915, r271923. They hit a bug in Thumb which is fixed in r272258 now. The original message: The code layout that TailMerging (inside BranchFolding) works on is not the final layout optimized based on the branch probability. Generally, after BlockPlacement, many new merging opportunities emerge. This patch calls Tail Merging after MBP and calls MBP again if Tail Merging merges anything. llvm-svn: 272267	2016-06-09 15:24:29 +00:00
Dehao Chen	769219b11a	Revive http://reviews.llvm.org/D12778 to handle forward-hot-prob and backward-hot-prob consistently. Summary: Consider the following diamond CFG: A / \ B C \/ D Suppose A->B and A->C have probabilities 81% and 19%. In block-placement, A->B is called a hot edge and the final placement should be ABDC. However, the current implementation outputs ABCD. This is because when choosing the next block of B, it checks if Freq(C->D) > Freq(B->D) * 20%, which is true (if Freq(A) = 100, then Freq(B->D) = 81, Freq(C->D) = 19, and 19 > 8120%=16.2). Actually, we should use 25% instead of 20% as the probability here, so that we have 19 < 8125%=20.25, and the desired ABDC layout will be generated. Reviewers: djasper, davidxl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20989 llvm-svn: 272203	2016-06-08 21:30:12 +00:00
Haicheng Wu	4fa9f3ae45	Revert "[MBP] Reduce code size by running tail merging in MBP." This reverts commit r271930, r271915, r271923. They break a thumb selfhosting bot. llvm-svn: 272017	2016-06-07 15:17:21 +00:00
Haicheng Wu	77ea344786	[MBP] Reduce code size by running tail merging in MBP. The code layout that TailMerging (inside BranchFolding) works on is not the final layout optimized based on the branch probability. Generally, after BlockPlacement, many new merging opportunities emerge. This patch calls Tail Merging after MBP and calls MBP again if Tail Merging merges anything. Differential Revision: http://reviews.llvm.org/D20276 llvm-svn: 271925	2016-06-06 18:36:07 +00:00
Xinliang David Li	ff2873742e	Replace hard coded probability threshold with parameter /NFC llvm-svn: 271751	2016-06-03 23:48:36 +00:00
Haicheng Wu	90a55651e6	[MBP] Factor out the optimizations on branch conditions and unanalyzable branches. NFCI. The benefits of this patch are -- We call AnalyzeBranch() to optimize unanalyzable branches, but the result of AnalyzeBranch() is not used. Now the result is useful. -- Before the layout of all the MBBs is set, the result of AnalyzeBranch() is not correct and needs to be fixed before using it to optimize the branch conditions. Now this optimization is called after the layout, the code used to fix the result of AnalyzeBranch() is not needed. -- The branch condition of the last block is not optimized before. Now it is optimized. Differential Revision: http://reviews.llvm.org/D20177 llvm-svn: 270623	2016-05-24 22:16:14 +00:00
Haicheng Wu	c01919e796	[MBP] Remove a redundant skipFunction(). NFC. skipFunction() is called twice. Differential Revision: http://reviews.llvm.org/D20377 llvm-svn: 269994	2016-05-18 22:34:45 +00:00
Xinliang David Li	b840bb8714	Fix option description /NFC llvm-svn: 269307	2016-05-12 16:39:02 +00:00
Xinliang David Li	f0ab6dfedc	[Layout] Add a new option (NFC) Currently cost based loop rotation algo can only be turned on with two conditions: the function has real profile data, and -precise-rotation-cost flag is turned on. This is not convenient for developers to experiment when profile is not available. Add a new option to force the new rotation algorithm -force-precise-rotation-cost llvm-svn: 269266	2016-05-12 02:04:41 +00:00
Andrew Kaylor	50271f787e	Add opt-bisect support to additional passes that can be skipped Differential Revision: http://reviews.llvm.org/D19882 llvm-svn: 268457	2016-05-03 22:32:30 +00:00
Quentin Colombet	776e6de516	[MachineBlockPlacement] Let the target optimize the branches at the end. After the layout of the basic blocks is set, the target may be able to get rid of unconditional branches to fallthrough blocks that the generic code does not catch. This happens any time TargetInstrInfo::AnalyzeBranch is not able to analyze all the branches involved in the terminators sequence, while still understanding a few of them. In such situation, AnalyzeBranch can directly modify the branches if it has been instructed to do so. This patch takes advantage of that. llvm-svn: 268328	2016-05-02 22:58:59 +00:00
Haicheng Wu	4afe0425db	[MBP] Use Function::optForSize() instead of checking OptimizeForSize directly. Fix a FIXME. Disable loop alignment if compiled with -Oz now. llvm-svn: 268121	2016-04-29 22:01:10 +00:00
Haicheng Wu	e749ce53d4	[MBP] Split placement and alignment into two functions. NFC. Cut and Paste. llvm-svn: 268067	2016-04-29 17:06:44 +00:00
Andrew Kaylor	aa641a5171	Re-commit optimization bisect support (r267022) without new pass manager support. The original commit was reverted because of a buildbot problem with LazyCallGraph::SCC handling (not related to the OptBisect handling). Differential Revision: http://reviews.llvm.org/D19172 llvm-svn: 267231	2016-04-22 22:06:11 +00:00
Vedant Kumar	6013f45f92	Revert "Initial implementation of optimization bisect support." This reverts commit r267022, due to an ASan failure: http://lab.llvm.org:8080/green/job/clang-stage2-cmake-RgSan_check/1549 llvm-svn: 267115	2016-04-22 06:51:37 +00:00
Andrew Kaylor	f0f279291c	Initial implementation of optimization bisect support. This patch implements a optimization bisect feature, which will allow optimizations to be selectively disabled at compile time in order to track down test failures that are caused by incorrect optimizations. The bisection is enabled using a new command line option (-opt-bisect-limit). Individual passes that may be skipped call the OptBisect object (via an LLVMContext) to see if they should be skipped based on the bisect limit. A finer level of control (disabling individual transformations) can be managed through an addition OptBisect method, but this is not yet used. The skip checking in this implementation is based on (and replaces) the skipOptnoneFunction check. Where that check was being called, a new call has been inserted in its place which checks the bisect limit and the optnone attribute. A new function call has been added for module and SCC passes that behaves in a similar way. Differential Revision: http://reviews.llvm.org/D19172 llvm-svn: 267022	2016-04-21 17:58:54 +00:00
Amaury Sechet	c53ad4f3b2	Do not select EhPad BB in MachineBlockPlacement when there is regular BB to schedule Summary: EHPad BB are not entered the classic way and therefor do not need to be placed after their predecessors. This patch make sure EHPad BB are not chosen amongst successors to form chains, and are selected as last resort when selecting the best candidate. EHPad are scheduled in reverse probability order in order to have them flow into each others naturally. Reviewers: chandlerc, majnemer, rafael, MatzeB, escha, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17625 llvm-svn: 265726	2016-04-07 21:29:39 +00:00
Amaury Sechet	33c161c02f	[BlockPlacement] Remove an unnecessary continue NFC. llvm-svn: 265643	2016-04-07 06:35:00 +00:00
Amaury Sechet	9ee4ddd710	[MBP] Remove an unused function parameter NFC. llvm-svn: 265642	2016-04-07 06:34:47 +00:00
Amaury Sechet	41474a52e7	Revert "[BlockPlacement] Remove an unnecessary continue" and "[MBP] Remove an unused function parameter" llvm-svn: 265638	2016-04-07 04:28:40 +00:00
Haicheng Wu	1951cf24a7	[MBP] Remove an unused function parameter NFC. llvm-svn: 265596	2016-04-06 20:38:20 +00:00
Haicheng Wu	3618fa786f	[BlockPlacement] Remove an unnecessary continue NFC. llvm-svn: 265407	2016-04-05 15:37:08 +00:00
Amaury Sechet	eae09c2c2a	Factor out MachineBlockPlacement::fillWorkLists. NFC Summary: There are places in MachineBlockPlacement where a worklist is filled in pretty much identical way. The code is duplicated. This refactor it so that the same code is used in both scenarii. Reviewers: chandlerc, majnemer, rafael, MatzeB, escha, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18077 llvm-svn: 263495	2016-03-14 21:24:11 +00:00
Junmo Park	4ba6cf69e4	Minor code cleanup. NFC. llvm-svn: 263196	2016-03-11 05:07:07 +00:00
Philip Reames	ae27b2380f	[MBP] Renaming a confusing variable and add clarifying comments Was discussed as part of http://reviews.llvm.org/D17830 llvm-svn: 262571	2016-03-03 00:58:43 +00:00
Philip Reames	23d933982a	[MBP] Avoid placing random blocks between loop preheader and header If we have a loop with a rarely taken path, we will prune that from the blocks which get added as part of the loop chain. The problem is that we weren't then recognizing the loop chain as schedulable when considering the preheader when forming the function chain. We'd then fall to various non-predecessors before finally scheduling the loop chain (as if the CFG was unnatural.) The net result was that there could be lots of garbage between a loop preheader and the loop, even though we could have directly fallen into the loop. It also meant we separated hot code with regions of colder code. The particular reason for the rejection of the loop chain was that we were scanning predecessor of the header, seeing the backedge, believing that was a globally more important predecessor (true), but forgetting to account for the fact the backedge precessor was already part of the existing loop chain (oops!. Differential Revision: http://reviews.llvm.org/D17830 llvm-svn: 262547	2016-03-03 00:01:42 +00:00
Philip Reames	02e1132afb	[MBP] Remove overly verbose debug output llvm-svn: 262531	2016-03-02 22:40:51 +00:00
Philip Reames	b9688f4382	[MBP] Adjust debug output to be more focused and approachable llvm-svn: 262522	2016-03-02 21:45:13 +00:00
Chad Rosier	406808e344	Partially revert "Add command line options to force function/loop alignments." This partially reverts r256571 in favor of the solution in r258409. llvm-svn: 258421	2016-01-21 18:49:15 +00:00
Geoff Berry	10494aca05	[BlockPlacement] Add option to align all non-fall-through blocks. Summary: This option is being added for testing purposes. Reviewers: mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16410 llvm-svn: 258409	2016-01-21 17:25:52 +00:00
Chad Rosier	6b4326367a	Add command line options to force function/loop alignments. These are being added for testing purposes. http://reviews.llvm.org/D15648 llvm-svn: 256571	2015-12-29 18:18:07 +00:00

1 2 3 4

158 Commits