llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	e038552c8a	[PM] Port TTI to the new pass manager, introducing a TargetIRAnalysis to produce it. This adds a function to the TargetMachine that produces this analysis via a callback for each function. This in turn faves the way to produce a different TTI per-function with the correct subtarget cached. I've also done the necessary wiring in the opt tool to thread the target machine down and make it available to the pass registry so that we can construct this analysis from a target machine when available. llvm-svn: 227721	2015-02-01 10:11:22 +00:00
Craig Topper	2844ca7319	[X86] Add a few target specific nodes to 'getTargetNodeName' llvm-svn: 227720	2015-02-01 10:00:37 +00:00
Elena Demikhovsky	534a99d878	AVX2: Added 2 more tests for gather intrinsics. llvm-svn: 227718	2015-02-01 08:52:15 +00:00
Michael Kuperstein	a691f3e921	Removed assert that doesn't typecheck and breaks debug MSVC build. llvm-svn: 227717	2015-02-01 08:46:20 +00:00
Chandler Carruth	2dc92e90d8	[PM] Refactor the analysis registration and pass pipeline parsing to live in a class. While this isn't really significant right now, I need to expose some state to the pass construction expressions, and making them get evaluated within a class context is a nice way to collect members that they may need to access. llvm-svn: 227715	2015-02-01 07:40:05 +00:00
Jingyue Wu	6c26bb63fe	[SeparateConstOffsetFromGEP] skip optnone functions llvm-svn: 227705	2015-02-01 02:34:41 +00:00
Jingyue Wu	6e091c8eab	[SeparateConstOffsetFromGEP] set PreservesCFG flag SeparateConstOffsetFromGEP does not change the shape of the control flow graph. llvm-svn: 227704	2015-02-01 02:33:02 +00:00
Jingyue Wu	0220df0dfd	[NVPTX] Emit .pragma "nounroll" for loops marked with nounroll Summary: CUDA driver can unroll loops when jit-compiling PTX. To prevent CUDA driver from unrolling a loop marked with llvm.loop.unroll.disable is not unrolled by CUDA driver, we need to emit .pragma "nounroll" at the header of that loop. This patch also extracts getting unroll metadata from loop ID metadata into a shared helper function. Test Plan: test/CodeGen/NVPTX/nounroll.ll Reviewers: eliben, meheff, jholewinski Reviewed By: jholewinski Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7041 llvm-svn: 227703	2015-02-01 02:27:45 +00:00
Adrian Prantl	152ac396db	Fix PR22393. When recursively replacing an aggregate with a smaller aggregate or scalar, the debug info needs to refer to the absolute offset (relative to the entire variable) instead of storing the offset inside the smaller aggregate. llvm-svn: 227702	2015-02-01 00:58:04 +00:00
Adrian Prantl	02d6f22c93	Add missing tags. llvm-svn: 227701	2015-02-01 00:57:31 +00:00
NAKAMURA Takumi	7ae226dfaf	[CMake] LLVMLTO requires Intrinsics.gen since r227685 introduced llvm/Analysis/TargetTransformInfo.h. llvm-svn: 227700	2015-02-01 00:55:43 +00:00
NAKAMURA Takumi	d75e020342	[CMake] LLVMTarget requires Intrinsics.gen since r227669 introduced llvm/Analysis/TargetTransformInfo.h. llvm-svn: 227699	2015-02-01 00:55:32 +00:00
Chandler Carruth	d8b3e9a420	[PM] Remove a bunch of stale TTI creation method declarations. I nuked their definitions, but forgot to clean up all the declarations which are in different files. llvm-svn: 227698	2015-02-01 00:22:15 +00:00
Matt Arsenault	25f61a6f89	Fix typo llvm-svn: 227697	2015-01-31 23:37:27 +00:00
Matt Arsenault	08ad328ae2	R600/SI: Only select cvt_flr/cvt_rpi with no NaNs. These have different behavior from cvt_i32_f32 on NaN. llvm-svn: 227693	2015-01-31 21:28:13 +00:00
Saleem Abdulrasool	3475dc358c	X86: silence a GCC warning GCC 4.9 gives the following warning: warning: enumeral and non-enumeral type in conditional expression Cast the enumeral value to an integer within the ternary operation. NFC. llvm-svn: 227692	2015-01-31 17:56:11 +00:00
Diego Novillo	6253035c18	Remove unused variable. Summary: This variable is only used inside an assert. This breaks builds with asserts disabled. OK for trunk? Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7314 llvm-svn: 227691	2015-01-31 17:17:33 +00:00
Aaron Ballman	a3bcd37c02	Removed a spurious semicolon; NFC llvm-svn: 227690	2015-01-31 15:18:47 +00:00
Simon Pilgrim	43fbaada8e	Removed SSE lane blend findCommutedOpIndices overrides. NFCI. The default op indices frmo TargetInstrInfo::findCommutedOpIndices are being commuted so we don't need to do this. llvm-svn: 227689	2015-01-31 15:16:30 +00:00
Simon Pilgrim	9c76b47469	[X86][SSE] Shuffle mask decode support for zero extend, scalar float/double moves and integer load instructions This patch adds shuffle mask decodes for integer zero extends (pmovzx** and movq xmm,xmm) and scalar float/double loads/moves (movss/movsd). Also adds shuffle mask decodes for integer loads (movd/movq). Differential Revision: http://reviews.llvm.org/D7228 llvm-svn: 227688	2015-01-31 14:09:36 +00:00
Chandler Carruth	93dcdc47db	[PM] Switch the TargetMachine interface from accepting a pass manager base which it adds a single analysis pass to, to instead return the type erased TargetTransformInfo object constructed for that TargetMachine. This removes all of the pass variants for TTI. There is now a single TTI pass in the Analysis layer. All of the Analysis <-> Target communication is through the TTI's type erased interface itself. While the diff is large here, it is nothing more that code motion to make types available in a header file for use in a different source file within each target. I've tried to keep all the doxygen comments and file boilerplate in line with this move, but let me know if I missed anything. With this in place, the next step to making TTI work with the new pass manager is to introduce a really simple new-style analysis that produces a TTI object via a callback into this routine on the target machine. Once we have that, we'll have the building blocks necessary to accept a function argument as well. llvm-svn: 227685	2015-01-31 11:17:59 +00:00
Kumar Sukhani	9559a5c05e	[asan][mips] Fix MIPS64 Asan mapping llvm-svn: 227684	2015-01-31 10:43:18 +00:00
Owen Anderson	be2edf30e4	Replace another std::set in the core of CodeGenRegister, this time with sorted arrays. The hot path through this region of code does lots of batch inserts into sets. By storing them as sorted arrays, we can defer the sorting to the end of the batch, which is dramatically more efficient. This reduces tblgen runtime by 25% on my worst-case target. llvm-svn: 227682	2015-01-31 09:13:36 +00:00
Owen Anderson	a366d7b217	Change more of the guts of CodeGenRegister's RegUnit tracking to be based on bit vectors. This is a continuation of my prior work to move some of the inner workings for CodeGenRegister to use bit vectors when computing about register units. This is highly beneficial to TableGen runtime on targets with large, dense register files. This patch represents a ~40% runtime reduction over and above my earlier improvement on a stress test of this case. llvm-svn: 227678	2015-01-31 07:49:41 +00:00
Saleem Abdulrasool	9fd41316ad	llvm-readobj: add a test case for ARM_MOV32(T) base relocation Add a trivial binary (int main() { return 0; }) built for Windows on ARM to ensure that we can correctly identify ARM_MOV32(T) base relocations. Addresses post-commit review comments. llvm-svn: 227673	2015-01-31 04:46:50 +00:00
Saleem Abdulrasool	d90e64ede5	ARM: make a table more readable (NFC) This adds some comments and splits the flag calculation on type boundaries to make the table more readable. Addresses some post-commit review comments to SVN r227603. NFC. llvm-svn: 227670	2015-01-31 04:12:06 +00:00
Chandler Carruth	705b185f90	[PM] Change the core design of the TTI analysis to use a polymorphic type erased interface and a single analysis pass rather than an extremely complex analysis group. The end result is that the TTI analysis can contain a type erased implementation that supports the polymorphic TTI interface. We can build one from a target-specific implementation or from a dummy one in the IR. I've also factored all of the code into "mix-in"-able base classes, including CRTP base classes to facilitate calling back up to the most specialized form when delegating horizontally across the surface. These aren't as clean as I would like and I'm planning to work on cleaning some of this up, but I wanted to start by putting into the right form. There are a number of reasons for this change, and this particular design. The first and foremost reason is that an analysis group is complete overkill, and the chaining delegation strategy was so opaque, confusing, and high overhead that TTI was suffering greatly for it. Several of the TTI functions had failed to be implemented in all places because of the chaining-based delegation making there be no checking of this. A few other functions were implemented with incorrect delegation. The message to me was very clear working on this -- the delegation and analysis group structure was too confusing to be useful here. The other reason of course is that this is much more natural fit for the new pass manager. This will lay the ground work for a type-erased per-function info object that can look up the correct subtarget and even cache it. Yet another benefit is that this will significantly simplify the interaction of the pass managers and the TargetMachine. See the future work below. The downside of this change is that it is very, very verbose. I'm going to work to improve that, but it is somewhat an implementation necessity in C++ to do type erasure. =/ I discussed this design really extensively with Eric and Hal prior to going down this path, and afterward showed them the result. No one was really thrilled with it, but there doesn't seem to be a substantially better alternative. Using a base class and virtual method dispatch would make the code much shorter, but as discussed in the update to the programmer's manual and elsewhere, a polymorphic interface feels like the more principled approach even if this is perhaps the least compelling example of it. ;] Ultimately, there is still a lot more to be done here, but this was the huge chunk that I couldn't really split things out of because this was the interface change to TTI. I've tried to minimize all the other parts of this. The follow up work should include at least: 1) Improving the TargetMachine interface by having it directly return a TTI object. Because we have a non-pass object with value semantics and an internal type erasure mechanism, we can narrow the interface of the TargetMachine to just do what we need: build and return a TTI object that we can then insert into the pass pipeline. 2) Make the TTI object be fully specialized for a particular function. This will include splitting off a minimal form of it which is sufficient for the inliner and the old pass manager. 3) Add a new pass manager analysis which produces TTI objects from the target machine for each function. This may actually be done as part of #2 in order to use the new analysis to implement #2. 4) Work on narrowing the API between TTI and the targets so that it is easier to understand and less verbose to type erase. 5) Work on narrowing the API between TTI and its clients so that it is easier to understand and less verbose to forward. 6) Try to improve the CRTP-based delegation. I feel like this code is just a bit messy and exacerbating the complexity of implementing the TTI in each target. Many thanks to Eric and Hal for their help here. I ended up blocked on this somewhat more abruptly than I expected, and so I appreciate getting it sorted out very quickly. Differential Revision: http://reviews.llvm.org/D7293 llvm-svn: 227669	2015-01-31 03:43:40 +00:00
Saleem Abdulrasool	fb8a66fbc5	ARM: support stack probe size on Windows on ARM Now that -mstack-probe-size is piped through to the backend via the function attribute as on Windows x86, honour the value to permit handling of non-default values for stack probes. This is needed /Gs with the clang-cl driver or -mstack-probe-size with the clang driver when targeting Windows on ARM. llvm-svn: 227667	2015-01-31 02:26:37 +00:00
Kostya Serebryany	e8cee11570	[fuzzer] add flags to run fuzzer in multiple parallel processes llvm-svn: 227664	2015-01-31 01:14:40 +00:00
Kevin Enderby	f6d258537d	Add the -section option to llvm-objdump used with -macho that takes the argument segname,sectname to specify a Mach-O section to print. The printing is based on the section type or section attributes. The printing of the module initialization and termination section types is printed with this change. Printing of other section types will be added next. llvm-svn: 227649	2015-01-31 00:37:11 +00:00
Eric Christopher	295596a0a7	Remove the last vestiges of resetOperationActions. llvm-svn: 227648	2015-01-31 00:21:17 +00:00
Eric Christopher	a6734178a7	Reuse a bunch of cached subtargets and remove getSubtarget calls without a Function argument. llvm-svn: 227647	2015-01-31 00:06:45 +00:00
David Blaikie	c52b4944ad	Add PPC test for r227481, but XFAIL because this is actually more work than it appeared to be. Same sort of bug as on ARM where the cmp+branch are lowered to br_cc (choosing the branch's debugloc for the br_cc's debugloc) then expanded out to a cmp and a br, but both using the debug loc of the br_cc, thus losing fidelity. llvm-svn: 227645	2015-01-30 23:52:19 +00:00
Eric Christopher	f5e9406243	Reuse a bunch of cached subtargets and remove getSubtarget calls without a Function argument. llvm-svn: 227644	2015-01-30 23:46:43 +00:00
Eric Christopher	1e51334459	Avoid using the cast and use the templated accessor function. llvm-svn: 227643	2015-01-30 23:46:40 +00:00
Ahmed Bougacha	aab9677f65	[AArch64] Add a few more DUP testcases. NFC. Also, don't lie about testing index 0. llvm-svn: 227642	2015-01-30 23:41:15 +00:00
Philip Reames	1ffa9377e1	Factor out statepoint verification into separate function. (NFC) Patch by: Igor Laevsky "Simple refactoring. This is done in preparation to support verification of invokable statepoints." Differential Revision: http://reviews.llvm.org/D7276 llvm-svn: 227640	2015-01-30 23:28:05 +00:00
Kostya Serebryany	71672552db	[fuzzer] Add a gtest-style test Summary: Add one gtest-style test. Test Plan: run on bot Reviewers: samsonov Reviewed By: samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7287 llvm-svn: 227639	2015-01-30 23:26:57 +00:00
Eric Christopher	7792e32b64	Reuse a bunch of cached subtargets and remove getSubtarget calls without a Function argument. llvm-svn: 227638	2015-01-30 23:24:40 +00:00
Philip Reames	c2f99b421b	Fix statepoint verifier tests to actually test verifier. Patch by: Igor Laevsky "Statepoint verifier tests were using wrong names for the statepoint and gc.relocate intrinsics. This change renames them to use correct names and fixes all uncovered issues." Differential Revision: http://reviews.llvm.org/D7266 llvm-svn: 227636	2015-01-30 23:18:42 +00:00
Ahmed Bougacha	77b76542eb	[AArch64] Robustize neon-scalar-copy.ll tests. NFC. Some of those didn't even have run lines: they were removed inadvertently during the Great Merge of 2014. They used to check for DUPs, but now we go through W-regs? Filed PR22418 for that potential regression. For now, just make the tests explicit, so we now where we stand. llvm-svn: 227635	2015-01-30 23:13:57 +00:00
David Blaikie	3ef249c9c0	Add ARM test for r227489, but XFAIL because this is actually more work than it appeared to be. Also revert r227489 since it didn't actually fix the thing I thought I was fixing (since the test case was targeting the wrong architecture initially). The change might be correct & demonstrated by other test cases, but it's not a priority for me to find those test cases right now. Filed PR22417 for the failure. llvm-svn: 227632	2015-01-30 23:04:39 +00:00
Lang Hames	8d3f8d75cd	[PBQP] Fix transposed worst row/column check in handleAdd/RemoveNode in the PBQP allocator. Patch by Jonas Paulsson. Thanks Jonas! llvm-svn: 227628	2015-01-30 22:28:49 +00:00
Chris Bieneman	d77bbab393	NFC. Making printOptionValues an API on the parser class. llvm-svn: 227626	2015-01-30 22:16:01 +00:00
Alexey Samsonov	964f4dbaae	Fix memory leak in WinEHPrepare introduced in r227405. This leak was detected by ASan bootstrap of LLVM. llvm-svn: 227625	2015-01-30 22:07:05 +00:00
Eric Christopher	b1874ebd32	Remove unused function. llvm-svn: 227624	2015-01-30 22:02:36 +00:00
Eric Christopher	6d5c47c3c2	Remove extraneous forward declaration. llvm-svn: 227623	2015-01-30 22:02:34 +00:00
Eric Christopher	cccae7951c	Use the cached subtargets and remove calls to getSubtarget/getSubtargetImpl without a Function argument. llvm-svn: 227622	2015-01-30 22:02:31 +00:00
Eric Christopher	6d9fcc38be	Add a similar templated cast for getSubtarget off of the MachineFunction to save typing a lot of static_casts. llvm-svn: 227621	2015-01-30 22:02:19 +00:00
Michael Liao	3fd78353fb	Add one more vim swap file pattern llvm-svn: 227620	2015-01-30 21:59:28 +00:00

1 2 3 4 5 ...

112534 Commits