llvm-project

Commit Graph

Author	SHA1	Message	Date
Chen Li	9f27fc0599	[LoopUnswitch] Add block frequency analysis to recognize hot/cold regions Summary: This patch adds block frequency analysis to LoopUnswitch pass to recognize hot/cold regions. For cold regions the pass only performs trivial unswitches since they do not increase code size, and for hot regions everything works as before. This helps to minimize code growth in cold regions and be more aggressive in hot regions. Currently the default cold regions are blocks with frequencies below 20% of function entry frequency, and it can be adjusted via -loop-unswitch-cold-block-frequency flag. The entire feature is controlled via -loop-unswitch-with-block-frequency flag and it is off by default. Reviewers: broune, silvas, dnovillo, reames Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D11605 llvm-svn: 248777	2015-09-29 05:03:32 +00:00
Craig Topper	55e39a7cf3	Simplify or remove calls to makeArrayRef based on feedback from David Blaikie. NFC. llvm-svn: 248776	2015-09-29 04:53:28 +00:00
John McCall	8460bcaa33	Honor the casted-to alignment of an explicit cast even when Sema thinks the cast is a no-op, as it does when (e.g.) the only thing that changes is an alignment attribute. Fixed PR24944. llvm-svn: 248775	2015-09-29 04:37:40 +00:00
Craig Topper	8674c5cf70	Remove 'const' from some ArrayRef arguments since they're passed by value anyway. NFC llvm-svn: 248774	2015-09-29 04:30:07 +00:00
Craig Topper	9798b931fa	Pass ArrayRef by value. NFC. llvm-svn: 248773	2015-09-29 04:30:05 +00:00
Alexey Bataev	5f600d6a49	[OPENMP 4.1] Codegen for ‘simd’ clause in ‘ordered’ directive. Description. If the simd clause is specified, the ordered regions encountered by any thread will use only a single SIMD lane to execute the ordered regions in the order of the loop iterations. Restrictions. An ordered construct with the simd clause is the only OpenMP construct that can appear in the simd region. An ordered directive with ‘simd’ clause is generated as an outlined function and corresponding function call to prevent this part of code from vectorization later in backend. llvm-svn: 248772	2015-09-29 03:48:57 +00:00
NAKAMURA Takumi	0c12a3949e	[CMake] X86AsmParser: Prune redundant LINK_LIBS. It is described in LLVMBuild.txt. llvm-svn: 248771	2015-09-29 01:25:01 +00:00
Evgeniy Stepanov	d8b86f7cdc	Move dbg.declare intrinsics when merging and replacing allocas. Place new and update dbg.declare calls immediately after the corresponding alloca. Current code in replaceDbgDeclareForAlloca puts the new dbg.declare at the end of the basic block. LLVM codegen has problems emitting debug info in a situation when dbg.declare appears after all uses of the variable. This usually kinda works for inlining and ASan (two users of this function) but not for SafeStack (see the pending change in http://reviews.llvm.org/D13178). llvm-svn: 248769	2015-09-29 00:30:19 +00:00
Cameron Esfahani	e211e204da	NFC: Fix spelling error in comment. llvm-svn: 248768	2015-09-29 00:21:38 +00:00
Matthias Braun	99ae16217e	RegisterPressure: LiveRegSet tracks register units not physregs There are always more physical registers and register units so the previous behaviour was correct but we can do with less memory. llvm-svn: 248767	2015-09-29 00:20:32 +00:00
David Blaikie	6163f67ad0	Remove unnecessary default dtor. The base dtor is already virtual and the derived dtor adds nothing. llvm-svn: 248765	2015-09-29 00:12:50 +00:00
David Blaikie	8e9ea2a439	Make Polly -Wdeprecated clean by explicitly making BlockGenerator copy constructible This is a bit of an awkward API and I'm not sure what the right solution is. Having a publicly copy constructible base class makes it easy to accidentally slice derived objects in a number of contexts. llvm-svn: 248764	2015-09-29 00:00:29 +00:00
Reid Kleckner	c71d6275ca	[WinEH] Fix ip2state table emission with funclets Previously we were hijacking the old LandingPadInfo data structures to communicate our state numbers. Now we don't need that anymore. llvm-svn: 248763	2015-09-28 23:56:30 +00:00
Adrian Prantl	525c013921	Unique-pointerify these pointers an plug a memory leak. Thanks to echristo for noticing! llvm-svn: 248762	2015-09-28 23:56:10 +00:00
David Blaikie	a8173bad4e	Remove the only use of LookupResult's implicit copy ctor LookupResult should not be copyable, it's not readily copyable and can only be copied when it's in specific states (in a query state, without any results, basically). Instead, just extract the /query/ state and pass that across the copy boundary, then build a new LookupResult on the other side. I wonder if a better API (one in which the query state is separate from the result state - essentialyl making QueryState a first class part of the Lookup API - pass a QueryState, get a LookupResult, rather than mutating the LookupResult in place (LookupResult could contain a QueryState if it's particularly helpful to be able to observe the query parameters while also examining the result)) might be a good idea here. Future patches will probably make LookupResult actually non-copyable (transition the CXXBasePaths to unique_ptr, for example) and hopefully we'll enable -Wdeprecated in LLVM soon to avoid issues like this. llvm-svn: 248761	2015-09-28 23:48:55 +00:00
David Blaikie	a9aa270c67	Add an explicitly defaulted copy ctor (and FIXME) to Command since its copy ctor is currently used (and actually slices derived objects... ) in some diagnostics handling. Justin mentioned he'd look into this. llvm-svn: 248760	2015-09-28 23:48:52 +00:00
David Blaikie	27a1bc0eae	Remove trivial dtor that was making uses of the copy ctor -Wdeprecated unclean And also remove an unused variable that's now diagnosed by -Wunused-variable since the dtor is now trivial. llvm-svn: 248759	2015-09-28 23:48:49 +00:00
Chris Bieneman	5def2a8a44	[CMake] [Darwin] [builtins] Apply OS and OS-arch filters to cc_kext builtin libraries. We don't want to filter out the builtins that are present in libSystem like we do for the normal builtins because kexts can't link libSystem, but we should filter out all the builtins that are generally not supported on the OS and architecture. llvm-svn: 248756	2015-09-28 23:09:46 +00:00
Jason Molenda	c8badb717b	Remove one of the three spaces after a period in one of the breakpoint set help messages. <rdar://problem/22870082> llvm-svn: 248755	2015-09-28 23:02:00 +00:00
Richard Trieu	e778e87d2a	Fix unused variable warning in non-debug builds. llvm-svn: 248754	2015-09-28 22:54:43 +00:00
Artem Belevich	236cfdc4be	[CUDA] 32-bit NVPTX should have 32-bit long type. Currently it's 64-bit which will lead to mismatch between host and device code if we compile for i386. Differential Revision: http://reviews.llvm.org/D13181 llvm-svn: 248753	2015-09-28 22:54:08 +00:00
Chris Bieneman	88d8534e87	[CMake] Accidentally committed code that was using a variable only populated in some earlier code. NFC. llvm-svn: 248752	2015-09-28 22:20:25 +00:00
Chris Bieneman	1341472f92	[CMake] [Darwin] [builtins] Fix building builtins for Darwin simulator platforms. For Darwin simulator platforms we shouldn't build the cc_kext builtins at all because they aren't applicable, and we should includ the simulator builtins as slices inside the main platform builtin library. llvm-svn: 248751	2015-09-28 22:18:31 +00:00
Sanjay Patel	4e6527682a	tidy up comments; NFC llvm-svn: 248750	2015-09-28 22:14:51 +00:00
Rafael Espindola	2732235508	Try to fix gcc warning llvm-svn: 248749	2015-09-28 22:12:54 +00:00
Rafael Espindola	5f19f1224c	Delete dead code. llvm-svn: 248747	2015-09-28 22:07:52 +00:00
Sanjay Patel	3a14f1a338	add a FIXME for a CPU model check that should have an attribute instead llvm-svn: 248746	2015-09-28 22:00:24 +00:00
Sanjay Patel	5e5f0e9756	move one-use check under the comment that describes it; NFCI llvm-svn: 248745	2015-09-28 21:44:46 +00:00
Chris Bieneman	4b16003590	[CMake] [Darwin] [builtins] Exclude aeabi builtins from iOS arm32 architectures since we don't support aeabi on iOS. llvm-svn: 248744	2015-09-28 21:15:46 +00:00
Sanjoy Das	4f1c45952c	[SCEV] Don't crash on pointer comparisons `ScalarEvolution::isImpliedCondOperandsViaNoOverflow` tries to cast the operand type of the comparison it is given to an `IntegerType`. This is incorrect because it could actually be simplifying a comparison between two pointers. Switch it to using `getTypeSizeInBits` instead, which does the right thing for both pointers and integers. Fixed PR24956. llvm-svn: 248743	2015-09-28 21:14:32 +00:00
Matt Arsenault	ba6aae785a	AMDGPU: Factor switch into separate function llvm-svn: 248742	2015-09-28 20:54:57 +00:00
Matt Arsenault	73aa8f687a	AMDGPU: Fix splitting x16 SMRD loads When used recursively, this would set the kill flag on the intermediate step from first splitting x16 to x8. llvm-svn: 248741	2015-09-28 20:54:52 +00:00
Matt Arsenault	e5d042cd56	AMDGPU: Fix moving SMRD loads with literal offsets on CI llvm-svn: 248740	2015-09-28 20:54:46 +00:00
Matt Arsenault	dd49c5fc1b	AMDGPU: Fix splitting SMRD with large offset The splitting of > 4 dword SMRD instructions if using an offset in an SGPR instead of an immediate was not setting the destination register, resulting an an instruction missing an operand which would assert later. Test will be included in a following commit which fixes a related issue. llvm-svn: 248739	2015-09-28 20:54:42 +00:00
Matt Arsenault	b378f075a2	AMDGPU: Add testcases Make sure we are testing moving users of the moved and split SMRD loads. llvm-svn: 248738	2015-09-28 20:54:38 +00:00
Matt Arsenault	f3c91f573f	AMDGPU: Cleanup test Run instnamer on it, and rename check prefix. This is in preparation for adding new testcases to cover bugs on other subtargets. llvm-svn: 248737	2015-09-28 20:54:32 +00:00
Lang Hames	9a4c94ec5a	[lld][MachO] Fix a think-o to get the twolevel/dynamic_lookup test passing. llvm-svn: 248736	2015-09-28 20:52:21 +00:00
Andrew Kaylor	16c4da03d5	Improved the interface of methods commuting operands, improved X86-FMA3 mem-folding&coalescing. Patch by Slava Klochkov (vyacheslav.n.klochkov@intel.com) Differential Revision: http://reviews.llvm.org/D11370 llvm-svn: 248735	2015-09-28 20:33:22 +00:00
Piotr Padlewski	69dc971527	Generate assume loads only with -fstrict-vtable-pointers Temporary fix till InstCombine and other possible passes will be efficient to handle multiple assumes. llvm-svn: 248734	2015-09-28 20:30:22 +00:00
Rafael Espindola	4b2ca85c1a	Implement --allow-multiple-definition. Patch by George Rimar! llvm-svn: 248733	2015-09-28 20:30:11 +00:00
Lang Hames	5c692009bc	[lld][MachO] Initial implementation of -flat_namespace and -undefined. This is a basic initial implementation of the -flat_namespace and -undefined options for LLD-darwin. It ignores several subtlties, but the result is close enough that we can now link LLVM (but not clang) on Darwin and pass all regression tests. llvm-svn: 248732	2015-09-28 20:25:14 +00:00
Rafael Espindola	085a8f5ea7	Don't include fully resolved relocations in the dynamic relocation table. llvm-svn: 248731	2015-09-28 20:18:40 +00:00
Rafael Espindola	46e01554b1	Add support for got/ptl referring to symbols defined in .o files. llvm-svn: 248730	2015-09-28 19:48:34 +00:00
Aaron Ballman	176a1a3931	Fixing a sphinx warning. llvm-svn: 248729	2015-09-28 19:27:37 +00:00
Sean Silva	ace7818ce6	[GlobalOpt] Sort members of llvm.used deterministically Patch by Jake VanAdrighem! Summary: Fix the way we sort the llvm.used and llvm.compiler.used members. This bug seems to have been introduced in rL183756 through a set of improper casts to GlobalValue*. In subsequent patches this problem was missed and transformed into a getName call on a ConstantExpr. Reviewers: silvas Subscribers: silvas, llvm-commits Differential Revision: http://reviews.llvm.org/D12851 llvm-svn: 248728	2015-09-28 19:02:11 +00:00
Fiona Glaser	f74cc40e34	Improve performance of SimplifyInstructionsInBlock 1. Use a worklist, not a recursive approach, to avoid needless revisitation and being repeatedly forced to jump back to the start of the BB if a handle is invalidated. 2. Only insert operands to the worklist if they become unused after a dead instruction is removed, so we don’t have to visit them again in most cases. 3. Use a SmallSetVector to track the worklist. 4. Instead of pre-initting the SmallSetVector like in DeadCodeEliminationPass, only put things into the worklist if they have to be revisited after the first run-through. This minimizes how much the actual SmallSetVector gets used, which saves a lot of time. llvm-svn: 248727	2015-09-28 18:56:07 +00:00
Rafael Espindola	dfc7200b18	Add support for local absolute symbols. llvm-svn: 248726	2015-09-28 18:29:47 +00:00
Daniel Sanders	7727e1098c	[mips][p5600] Added P5600 processor and initial scheduler. Summary: The P5600 is an out-of-order, superscalar implementation of the MIPS32R5 architecture. The scheduler has a few missing details (see the 'Tricky Instructions' section and some quirks of the P5600 are deliberately omitted due to implementation difficulty and low chance of significant benefit (e.g. the predicate on P5600WriteEitherALU). However, testing on SingleSource is showing significant performance benefits on some apps (seven in the 10-30% range) and only one significant regression (12%) when -pre-RA-sched=linearize is given. Without -pre-RA-sched=linearize the results are more variable. Some do even better (up to 55% improvement) but increased numbers of copies are slowing others down (up to 12%). Overall, the scheduler as it currently stands is a 2.4% win with -pre-RA-sched=linearize and a 2.7% win without -pre-RA-sched=linearize. I'm sure we can improve on this further. For completeness, the FPGA this was tested on shows some failures with and without the P5600 scheduler. These appear to be scheduling related since the two test runs have fairly different sets of failing tests even after accounting for other factors (e.g. spurious connection failures) however it's not P5600 specific since we also get some for the generic scheduler. Reviewers: vkalintiris Subscribers: mpf, llvm-commits, atrick, vkalintiris Differential Revision: http://reviews.llvm.org/D12193 llvm-svn: 248725	2015-09-28 18:24:08 +00:00
Rui Ueyama	c5e22d906b	ELF2: Include file names in error messages. llvm-svn: 248724	2015-09-28 18:20:41 +00:00
Guillaume Papin	68b591079a	[clang-tidy] add option to specify build path Summary: compile_commands.json is usually generated in the build directory. Projects like LLVM/Clang enforce out-of-source builds. This option allow allow such projects to work out of the box, without moving the compilation database manually. The naming of the option is similar to the one use by other tools: clang-{check,modernize,query,rename,tidy} -p=<build_path> <...> Reviewers: alexfh Differential Revision: http://reviews.llvm.org/D13199 llvm-svn: 248723	2015-09-28 17:53:04 +00:00

... 3 4 5 6 7 ...

211670 Commits All Branches Search

211670 Commits

All Branches