Go to file
Hal Finkel b359b735d6 [PowerPC] Enable late partial unrolling on the POWER7
The P7 benefits from not have really-small loops so that we either have
multiple dispatch groups in the loop and/or the ability to form more-full
dispatch groups during scheduling. Setting the partial unrolling threshold to
44 seems good, empirically, for the P7. Compared to using no late partial
unrolling, this yields the following test-suite speedups:

SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding
	-66.3253% +/- 24.1975%
SingleSource/Benchmarks/Misc-C++/oopack_v1p8
	-44.0169% +/- 29.4881%
SingleSource/Benchmarks/Misc/pi
	-27.8351% +/- 12.2712%
SingleSource/Benchmarks/Stanford/Bubblesort
	-30.9898% +/- 22.4647%

I've speculatively added a similar setting for the P8. Also, I've noticed that
the unroller does not quite calculate the unrolling factor correctly for really
tiny loops because it neglects to account for the fact that not every loop body
replicant contains an ending branch and counter increment. I'll fix that later.

llvm-svn: 225522
2015-01-09 15:51:16 +00:00
clang Sema: Don't crash when variable is redefined as a constexpr function 2015-01-09 10:33:23 +00:00
clang-tools-extra clang-tidy: [misc-use-override] Simplify regular expressions in test. 2015-01-09 14:10:15 +00:00
compiler-rt [asan] Adding a regression test for https://code.google.com/p/address-sanitizer/issues/detail?id=368 2015-01-09 00:26:45 +00:00
debuginfo-tests New round of fixes for "Always compile debuginfo-tests for the host triple" 2014-10-18 23:47:59 +00:00
libclc Use amdgcn triple for SI+ GPUs 2015-01-06 20:42:12 +00:00
libcxx In early C++11 standard drafts, std::function derived from std::unary_function or std::binary_function if there was only one (or two) parameters. Before C++11 shipped, this restiction was lifted, but libc++ still does this (which is fine). However, the tests still check for this outdated requiremnt. Change then to check for the nested typedefs instead (which are still required by the standard). No change to the library. 2015-01-08 06:36:41 +00:00
libcxxabi Fix some formatting I messed up in r225187. 2015-01-05 19:31:59 +00:00
lld PE/COFF: add support to import functions in ARM NT 2015-01-08 04:19:08 +00:00
lldb [InlineTest] getRerunArgs returns an empty string if the test was skipped. 2015-01-09 01:54:44 +00:00
llgo [llgo] irgen: generate switch instructions 2015-01-08 07:49:28 +00:00
llvm [PowerPC] Enable late partial unrolling on the POWER7 2015-01-09 15:51:16 +00:00
openmp I apologise in advance for the size of this check-in. At Intel we do 2014-10-07 16:25:50 +00:00
polly Adjust test for the new 'distinct' metadata nodes 2015-01-09 08:10:36 +00:00