llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexander Kornienko	f00654e31b	Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC) Apparently, the style needs to be agreed upon first. llvm-svn: 240390	2015-06-23 09:49:53 +00:00
Alexander Kornienko	70bc5f1398	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-*,llvm-namespace-comment -header-filter='llvm/.*|clang/.*' \ llvm/lib/ Thanks to Eugene Kosov for the original patch! llvm-svn: 240137	2015-06-19 15:57:42 +00:00
Benjamin Kramer	76e37aa334	unique_ptrs are unique already, no need to unique them any further. llvm-svn: 232178	2015-03-13 16:59:29 +00:00
Eric Christopher	6f1e5680f6	Remove subtarget dependence in pass pipeline setup for AArch64. llvm-svn: 231165	2015-03-03 23:22:40 +00:00
Eric Christopher	1cdefae9c4	Rewrite MachineOperand::print and MachineInstr::print to avoid uses of TM->getSubtargetImpl and propagate to all calls. This could be a debugging regression in places where we had a TargetMachine and/or MachineFunction but don't have it as part of the MachineInstr. Fixing this would require passing a MachineFunction/Function down through the print operator, but none of the existing uses in tree seem to do this. llvm-svn: 230710	2015-02-27 00:11:34 +00:00
Chad Rosier	b23c4dd3a4	[AArch64] Make AArch64A57FPLoadBalancing output stable. Add tie breaker to colorChainSet() sort so that processing order doesn't depend on std::set order, which depends on pointer order, which is unstable from run to run. No test case as this is nearly impossible to reproduce. Phabricator Review: http://reviews.llvm.org/D7265 Patch by Geoff Berry <gberry@codeaurora.org>! llvm-svn: 227606	2015-01-30 19:55:40 +00:00
Eric Christopher	125898a2a1	Clean up some uses of getSubtarget in AArch64. llvm-svn: 227530	2015-01-30 01:10:24 +00:00
Chad Rosier	11d943d32c	[AArch64] Add INITIALIZE_PASS macros to AArch64A57FPLoadBalancing. These are needed so this pass will produce output when e.g. -print-after-all is used. Phabricator Review: http://reviews.llvm.org/D7264 Patch by Geoff Berry <gberry@codeaurora.org>! llvm-svn: 227506	2015-01-29 22:57:37 +00:00
Chandler Carruth	d9903888d9	[cleanup] Re-sort all the #include lines in LLVM using utils/sort_includes.py. I clearly haven't done this in a while, so more changed than usual. This even uncovered a missing include from the InstrProf library that I've added. No functionality changed here, just mechanical cleanup of the include order. llvm-svn: 225974	2015-01-14 11:23:27 +00:00
Chad Rosier	ba0e0664ff	[AArch64] Fix clobber computation in A57LoadBalancing pass. Extremely difficult to reproduce, so no test case included. PR21637 llvm-svn: 222677	2014-11-24 18:57:58 +00:00
Benjamin Kramer	e12a6bac32	Eliminate some deep std::vector copies. NFC. llvm-svn: 218999	2014-10-03 18:33:16 +00:00
James Molloy	05ce999134	[A57FPLoadBalancing] Modify r217689 - actually we do need to check defs ... Just make sure we check uses first so we see the kill first. It turns out ignoring defs gives some pretty nasty runtime failures. I'm certain this is the fix but I'm still reducing a testcase. llvm-svn: 217735	2014-09-14 18:24:26 +00:00
James Molloy	4689647dbb	[A57FPLoadBalancing] Remove support for vector types Vector MUL/MLAs have tied operands, which gives us extra constraints that we currently can't handle. Instead of silently doing the wrong thing, remove support to be readded later properly. llvm-svn: 217690	2014-09-12 16:55:32 +00:00
James Molloy	a6e05a789e	[A57FPLoadBalancing] Ignore <def>s when checking if a chain may be killed. Defs are seen before uses, so a def without the kill flag doesn't necessarily mean that the register is not killed on that instruction. It may be killed in a later use operand. llvm-svn: 217689	2014-09-12 16:55:26 +00:00
James Molloy	f0de7e58f6	[A57LoadBalancing] unique_ptr-ify. Thanks to David Blakie for the in-depth review! llvm-svn: 217682	2014-09-12 14:35:17 +00:00
Arnaud A. de Grandmaison	6afbf2aa5e	[AArch64] FPLoadBalancing: move ownership of the chain to its current accumulator register and forget about the previously used accumulator. Coming up with a simple testcase is not easy, as this highly depends on what the register allocator is doing: this issue showed up while working with the PBQP allocator, which produced a different allocation scheme. A testcase would need to come up with chain starting in D[0-7], then moving to D[8-15], followed by a call to a function whose regmask clobbers the starting accumulator in D[0-7], then another use of the chain. Fixed some formatting, added some invariant checks while there. llvm-svn: 216721	2014-08-29 09:54:11 +00:00
James Molloy	36b8a88188	Change the return value of "getEnd()" from a MachineInstr* to a MachineBasicBlock::iterator. It seems on Darwin the illegal round-trip ::iterator -> MachineInstr* -> ::iterator breaks execution horribly when the iterator is not a real MachineInstr, like ::end(). llvm-svn: 216455	2014-08-26 13:41:31 +00:00
Tim Northover	e42fac5191	AArch64: avoid deleting the current iterator in a loop. std::map invalidates the iterator to any element that gets deleted, which means we can't increment it correctly afterwards. This was causing Darwin test failures. llvm-svn: 215233	2014-08-08 17:31:52 +00:00
NAKAMURA Takumi	08e30fd3d2	AArch64A57FPLoadBalancing.cpp: Define ColorNames in !NDEBUG. llvm-svn: 215226	2014-08-08 17:00:59 +00:00
James Molloy	3feea9c11a	[AArch64] Add an FP load balancing pass for Cortex-A57 For best-case performance on Cortex-A57, we should try to use a balanced mix of odd and even D-registers when performing a critical sequence of independent, non-quadword FP/ASIMD floating-point multiply or multiply-accumulate operations. This pass attempts to detect situations where the register allocation may adversely affect this load balancing and to change the registers used so as to better utilize the CPU. Ideally we'd just take each multiply or multiply-accumulate in turn and allocate it alternating even or odd registers. However, multiply-accumulates are most efficiently performed in the same functional unit as their accumulation operand. Therefore this pass tries to find maximal sequences ("Chains") of multiply-accumulates linked via their accumulation operand, and assign them all the same "color" (oddness/evenness). This optimization affects S-register and D-register floating point multiplies and FMADD/FMAs, as well as vector (floating point only) muls and FMADD/FMA. Q register instructions (and 128-bit vector instructions) are not affected. llvm-svn: 215199	2014-08-08 12:33:21 +00:00
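
The chain-coloring idea described in 3feea9c11a can be illustrated with a toy model. The sketch below is not the code from AArch64A57FPLoadBalancing.cpp: it assumes each chain is reduced to its length (the number of multiply-accumulates linked through one accumulator), and it swaps in a simple greedy heuristic that gives every chain a single color while keeping the even and odd totals close. The names `Color` and `colorChains` are hypothetical. The index tie-breaker in the sort mirrors the motivation given in b23c4dd3a4, where processing order must not depend on pointer order.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Toy model only, not the LLVM pass. A chain is represented by its length.
enum class Color { Even, Odd };

// Assign one color per chain, longest chains first, always picking the color
// that currently carries fewer instructions. The result is indexed like the
// input vector.
std::vector<Color> colorChains(const std::vector<std::size_t> &chainLengths) {
  std::vector<std::size_t> order(chainLengths.size());
  for (std::size_t i = 0; i < order.size(); ++i)
    order[i] = i;

  // Sort by descending length; break ties on the index so the outcome is
  // deterministic from run to run.
  std::sort(order.begin(), order.end(), [&](std::size_t a, std::size_t b) {
    if (chainLengths[a] != chainLengths[b])
      return chainLengths[a] > chainLengths[b];
    return a < b;
  });

  std::vector<Color> colors(chainLengths.size(), Color::Even);
  std::size_t evenLoad = 0, oddLoad = 0;
  for (std::size_t idx : order) {
    if (evenLoad <= oddLoad) {
      colors[idx] = Color::Even;
      evenLoad += chainLengths[idx];
    } else {
      colors[idx] = Color::Odd;
      oddLoad += chainLengths[idx];
    }
  }
  assert(colors.size() == chainLengths.size());
  return colors;
}
```

Under these assumptions, calling `colorChains({4, 1, 3, 2})` places the chains of length 4 and 1 on even registers and those of length 3 and 2 on odd registers, so each color carries five instructions. The real pass works on MachineInstrs and must also respect register availability and clobbers, which this sketch ignores.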

20 Commits
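
The iterator-invalidation problem fixed in e42fac5191 is a general std::map pitfall: erasing the element an iterator points to invalidates that iterator, so incrementing it afterwards is undefined behavior. A minimal sketch of the usual safe pattern follows. It is not the code from the commit, and the helper name `eraseZeroEntries` and the int-to-int map are assumptions made for illustration.

```cpp
#include <map>

// Hypothetical helper, not code from the LLVM tree: erase every entry whose
// value is zero while iterating over the map.
inline void eraseZeroEntries(std::map<int, int> &m) {
  for (auto it = m.begin(); it != m.end();) {
    if (it->second == 0)
      it = m.erase(it); // since C++11, erase returns the next valid iterator
    else
      ++it; // only advance when nothing was erased
  }
}
```

The loop header deliberately has no increment clause, so the iterator is advanced exactly once per step, either by the value returned from `erase` or by `++it`.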