llvm-project

Commit Graph

Author	SHA1	Message	Date
Ahmed Bougacha	e892d13d90	[CodeGen] Add hook/combine to form vector extloads, enabled on X86. The combine that forms extloads used to be disabled on vector types, because "None of the supported targets knows how to perform load and sign extend on vectors in one instruction." That's not entirely true, since at least SSE4.1 X86 knows how to do those sextloads/zextloads (with PMOVS/ZX). But there are several aspects to getting this right. First, vector extloads are controlled by a profitability callback. For instance, on ARM, several instructions have folded extload forms, so it's not always beneficial to create an extload node (and trying to match extloads is a whole 'nother can of worms). The interesting optimization enables folding of s/zextloads to illegal (splittable) vector types, expanding them into smaller legal extloads. It's not ideal (it introduces some legalization-like behavior in the combine) but it's better than the obvious alternative: form illegal extloads, and later try to split them up. If you do that, you might generate extloads that can't be split up, but have a valid ext+load expansion. At vector-op legalization time, it's too late to generate this kind of code, so you end up forced to scalarize. It's better to just avoid creating egregiously illegal nodes. This optimization is enabled unconditionally on X86. Note that the splitting combine is happy with "custom" extloads. As is, this bypasses the actual custom lowering, and just unrolls the extload. But from what I've seen, this is still much better than the current custom lowering, which does some kind of unrolling at the end anyway (see for instance load_sext_4i8_to_4i64 on SSE2, and the added FIXME). Also note that the existing combine that forms extloads is now also enabled on legal vectors. This doesn't have a big effect on X86 (because sext+load is usually combined to sext_inreg+aextload). On ARM it fires on some rare occasions; that's for a separate commit. Differential Revision: http://reviews.llvm.org/D6904 llvm-svn: 228325	2015-02-05 18:31:02 +00:00
Ahmed Bougacha	db1da7a54c	[CodeGen] Add isLoadExtLegalOrCustom helper to TargetLowering. llvm-svn: 228322	2015-02-05 18:15:59 +00:00
Michael Kuperstein	d2b6fdbc31	Teach isDereferenceablePointer() to look through bitcast constant expressions. This fixes a LICM regression due to the new load+store pair canonicalization. Differential Revision: http://reviews.llvm.org/D7411 llvm-svn: 228284	2015-02-05 09:15:37 +00:00
Matt Arsenault	f28cf0cbaf	Add addrspacecast node to tablegen The node is still defined oddly so that the address spaces are not operands and not accessible from tablegen, but as-is this can now be used to write a ComplexPattern with an addrspacecast root node. llvm-svn: 228270	2015-02-05 03:35:34 +00:00
Matt Arsenault	d931642cc7	Add support for double / float to EndianStream Also add new unit tests for endian::Writer llvm-svn: 228269	2015-02-05 03:30:08 +00:00
Cameron Esfahani	17177d1e84	Value soft float calls as more expensive in the inliner. Summary: When evaluating floating point instructions in the inliner, ask the TTI whether it is an expensive operation. By default, it's not an expensive operation. This keeps the default behavior the same as before. The ARM TTI has been updated to return back TCC_Expensive for targets which don't have hardware floating point. Reviewers: chandlerc, echristo Reviewed By: echristo Subscribers: t.p.northover, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D6936 llvm-svn: 228263	2015-02-05 02:09:33 +00:00
Duncan P. N. Exon Smith	252b62d3c2	IR: Split out getOperandAs(), NFC llvm-svn: 228250	2015-02-05 01:07:47 +00:00
Sean Silva	0e1fe184c8	[MC] Remove various unused MCAsmInfo parameters. llvm-svn: 228244	2015-02-05 00:58:51 +00:00
Duncan P. N. Exon Smith	5a914a8c63	ADT: Add int64_t interoperability to APSInt Add some API to `APSInt` to make it easier to compare with `int64_t`. - `APSInt::compareValues(APSInt, APSInt)` returns 1, -1 or 0 for greater, lesser, or equal, doing the right thing for mismatched "has-sign" and bitwidths. This is just like `isSameValue()` (and is now the implementation of it). - `APSInt::get(int64_t)` gets a signed `APSInt`. - `operator<(int64_t)`, etc., are implemented trivially via `get()` and `compareValues()`. - Also added `APSInt::getUnsigned(uint64_t)` to make it easier to test `compareValues()`. llvm-svn: 228239	2015-02-05 00:17:43 +00:00
Reid Kleckner	2c1990778d	Remove useless call to isOSCygMing() This used to do something when we modeled the Cygwin and MinGW environments as distinct OSs, but now it is not needed. llvm-svn: 228229	2015-02-04 23:17:19 +00:00
Matthias Braun	26e7ea6267	MachineCSE: Clear dead-def flag on CSE. In case CSE reuses a previoulsy unused register the dead-def flag has to be cleared on the def operand, as exposed by the arm64-cse.ll test. This fixes PR22439 and the corresponding rdar://19694987 Differential Revision: http://reviews.llvm.org/D7395 llvm-svn: 228178	2015-02-04 19:35:16 +00:00
Reid Kleckner	c26a17a822	Add range adapters predecessors() and successors() for BBs Use them in two isolated transforms so we know they work and aren't dead code. llvm-svn: 228173	2015-02-04 19:14:57 +00:00
Juergen Ributzka	719615f6dd	Add missing include. llvm-svn: 228161	2015-02-04 18:16:53 +00:00
Alexey Samsonov	b9b8027cee	SpecialCaseList: Add support for parsing multiple input files. Summary: This change allows users to create SpecialCaseList objects from multiple local files. This is needed to implement a proper support for -fsanitize-blacklist flag (allow users to specify multiple blacklists, in addition to default blacklist, see PR22431). DFSan can also benefit from this change, as DFSan instrumentation pass now accepts ABI-lists both from -fsanitize-blacklist= and -mllvm -dfsan-abilist flags. Go bindings are fixed accordingly. Test Plan: regression test suite Reviewers: pcc Subscribers: llvm-commits, axw, kcc Differential Revision: http://reviews.llvm.org/D7367 llvm-svn: 228155	2015-02-04 17:39:48 +00:00
Rafael Espindola	a5eb775c4d	Fix warning: "function declaration isn’t a prototype" llvm-svn: 228139	2015-02-04 13:30:28 +00:00
Philip Reames	47cc673e1f	Add a pass for inserting safepoints into (nearly) arbitrary IR This pass is responsible for figuring out where to place call safepoints and safepoint polls. It doesn't actually make the relocations explicit; that's the job of the RewriteStatepointsForGC pass (http://reviews.llvm.org/D6975). Note that this code is not yet finalized. Its moving in tree for incremental development, but further cleanup is needed and will happen over the next few days. It is not yet part of the standard pass order. Planned changes in the near future: - I plan on restructuring the statepoint rewrite to use the functions add to the IRBuilder a while back. - In the current pass, the function "gc.safepoint_poll" is treated specially but is not an intrinsic. I plan to make identifying the poll function a property of the GCStrategy at some point in the near future. - As follow on patches, I will be separating a collection of test cases we have out of tree and submitting them upstream. - It's not explicit in the code, but these two patches are introducing a new state for a statepoint which looks a lot like a patchpoint. There's no a transient form which doesn't yet have the relocations explicitly represented, but does prevent reordering of memory operations. Once this is in, I need to update actually make this explicit by reserving the 'unused' argument of the statepoint as a flag, updating the docs, and making the code explicitly check for such a thing. This wasn't really planned, but once I split the two passes - which was done for other reasons - the intermediate state fell out. Just reminds us once again that we need to merge statepoints and patchpoints at some point in the not that distant future. Future directions planned: - Identifying more cases where a backedge safepoint isn't required to ensure timely execution of a safepoint poll. - Tweaking the insertion process to generate easier to optimize IR. (For example, investigating making SplitBackedge) the default. - Adding opt-in flags for a GCStrategy to use this pass. Once done, add this pass to the actual pass ordering. Differential Revision: http://reviews.llvm.org/D6981 llvm-svn: 228090	2015-02-04 00:37:33 +00:00
Justin Bogner	26b3142d34	InstrProf: Make CounterMappingRegions less confusing to construct Creating empty and expansion regions is awkward with the current API. Expose static methods to make this simpler. llvm-svn: 228075	2015-02-03 23:59:33 +00:00
Arnaud A. de Grandmaison	10797c5707	[PBQP] Provide more information in the debug prints Based on a patch by Jonas Paulsson llvm-svn: 228068	2015-02-03 23:40:24 +00:00
Arnaud A. de Grandmaison	1f4448ad51	[PBQP] Constify Graph::getEdgeNode1Id and Graph::getEdgeNode2Id llvm-svn: 228048	2015-02-03 22:02:45 +00:00
Duncan P. N. Exon Smith	4e4aa70535	IR: Assembly and bitcode for GenericDebugNode llvm-svn: 228041	2015-02-03 21:54:14 +00:00
Justin Bogner	de15817ea2	InstrProf: Remove CoverageMapping::HasCodeBefore, it isn't used It's not entirely clear to me what this field was meant for, but it's always false. Remove it. llvm-svn: 228034	2015-02-03 21:35:36 +00:00
Duncan P. N. Exon Smith	6f5546cdee	Support: Add string => unsigned mapping for DW_TAG Add `dwarf::getTag()` to translate from `StringRef` to `unsigned`. llvm-svn: 228031	2015-02-03 21:16:49 +00:00
Duncan P. N. Exon Smith	981811efc8	Support: Re-implement dwarf::TagString() using a .def file, NFC Also re-implements the `dwarf::Tag` enumerator. I've moved the mock tags into the enumerator since there's no other way to do this. Really they shouldn't be used at all (they're just a hack to identify `MDNode`s, but we have a class hierarchy for that now). llvm-svn: 228030	2015-02-03 21:13:16 +00:00
Colin LeMahieu	cd9cb023d7	[Hexagon] Converting XTYPE/SHIFT intrinsics. Cleaning out old intrinsic patterns and updating tests. llvm-svn: 228026	2015-02-03 20:40:52 +00:00
Jingyue Wu	d7966ff3b9	Add straight-line strength reduction to LLVM Summary: Straight-line strength reduction (SLSR) is implemented in GCC but not yet in LLVM. It has proven to effectively simplify statements derived from an unrolled loop, and can potentially benefit many other cases too. For example, LLVM unrolls #pragma unroll foo (int i = 0; i < 3; ++i) { sum += foo((b + i) * s); } into sum += foo(b * s); sum += foo((b + 1) * s); sum += foo((b + 2) * s); However, no optimizations yet reduce the internal redundancy of the three expressions: b * s (b + 1) * s (b + 2) * s With SLSR, LLVM can optimize these three expressions into: t1 = b * s t2 = t1 + s t3 = t2 + s This commit is only an initial step towards implementing a series of such optimizations. I will implement more (see TODO in the file commentary) in the near future. This optimization is enabled for the NVPTX backend for now. However, I am more than happy to push it to the standard optimization pipeline after more thorough performance tests. Test Plan: test/StraightLineStrengthReduce/slsr.ll Reviewers: eliben, HaoLiu, meheff, hfinkel, jholewinski, atrick Reviewed By: jholewinski, atrick Subscribers: karthikthecool, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7310 llvm-svn: 228016	2015-02-03 19:37:06 +00:00
Rafael Espindola	a5ef4905a5	Fix duplicated symbol error. llvm-svn: 228012	2015-02-03 19:25:53 +00:00
Manman Ren	8121e1db91	[LTO API] split lto_codegen_compile to lto_codegen_optimize and lto_codegen_compile_optimized. Also add lto_api_version. Before this commit, we can only dump the optimized bitcode after running lto_codegen_compile, but it includes some impacts of running codegen passes, one example is StackProtector pass. We will get assertion failure when running llc on the optimized bitcode, because StackProtector is effectively run twice. After splitting lto_codegen_compile, the linker can choose to dump the bitcode before running lto_codegen_compile_optimized. lto_api_version is added so ld64 can check for runtime-availability of the new API. rdar://19565500 llvm-svn: 228000	2015-02-03 18:39:15 +00:00
Adam Nemet	b60295a525	[LoopVectorize] Fix rebase glitch in r227751 LoopVectorizationLegality::{getNumLoads,getNumStores} should forward to LoopAccessAnalysis now. Thanks to Takumi for noticing this! llvm-svn: 227992	2015-02-03 17:59:53 +00:00
Eric Christopher	36fe028a2a	Only access TLOF via the TargetMachine, not TargetLowering. llvm-svn: 227949	2015-02-03 07:22:52 +00:00
Lang Hames	d48bf3f912	[PBQP Regalloc] Pre-spill vregs that have no legal physregs. The PBQP::RegAlloc::MatrixMetadata class assumes that matrices have at least two rows/columns (for the spill option plus at least one physreg). This patch ensures that that invariant is met by pre-spilling vregs that have no physreg options so that no node (and no corresponding edges) need be added to the PBQP graph. This fixes a bug in an out-of-tree target that was identified by Jonas Paulsson. Thanks for tracking this down Jonas! llvm-svn: 227942	2015-02-03 06:14:06 +00:00
Justin Bogner	195a4f08ea	InstrProf: Simplify RawCoverageMappingReader's API slightly This is still kind of a weird API, but dropping the (partial) update of the passed in CoverageMappingRecord makes it a little easier to understand and use. llvm-svn: 227900	2015-02-03 00:20:11 +00:00
Jingyue Wu	49a766e468	Resurrect the assertion removed by r227717 Summary: MSVC can compile "LoopID->getOperand(0) == LoopID" when LoopID is MDNode*. Test Plan: no regression Reviewers: mkuper Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7327 llvm-svn: 227853	2015-02-02 20:41:11 +00:00
Duncan P. N. Exon Smith	9146fc8fd6	IR: Allow GenericDebugNode construction from MDString Allow `GenericDebugNode` construction directly from `MDString`, rather than requiring `StringRef`s. I've refactored the `StringRef` constructors to use these. There's no real functionality change here, except for exposing the lower-level API. The purpose of this is to simplify construction of string operands when reading bitcode. It's unnecessarily indirect to parse an `MDString` ID, lookup the `MDString` in the bitcode reader list, get the `StringRef` out of that, and then have `GenericDebugNode::getImpl()` use `MDString::get()` to acquire the original `MDString`. Instead, this allows the bitcode reader to directly pass in the `MDString`. llvm-svn: 227848	2015-02-02 20:01:03 +00:00
Duncan P. N. Exon Smith	61e62a5b04	IR: Extract DEFINE_MDNODE_GET(), NFC llvm-svn: 227847	2015-02-02 19:55:21 +00:00
Duncan P. N. Exon Smith	442ec0223b	IR: Separate helpers for string operands, NFC llvm-svn: 227846	2015-02-02 19:54:05 +00:00
Duncan P. N. Exon Smith	d9901ff586	IR: Split out DebugInfoMetadata.h, NFC Move debug-info-centred `Metadata` subclasses into their own header/source file. A couple of private template functions are needed from both `Metadata.cpp` and `DebugInfoMetadata.cpp`, so I've moved them to `lib/IR/MetadataImpl.h`. llvm-svn: 227835	2015-02-02 18:53:21 +00:00
David Blaikie	440a045606	STLExtras: Provide less/equal functors with templated function call operators, plus a deref'ing functor template utility Similar to the C++14 void specializations of these templates, useful as a stop-gap until LLVM switches to '14. Example use-cases in tblgen because I saw some functors that looked like they could be simplified/refactored. Reviewers: dexonsmith Differential Revision: http://reviews.llvm.org/D7324 llvm-svn: 227828	2015-02-02 18:35:10 +00:00
Duncan P. N. Exon Smith	71db64258d	Fix some file headers, NFC llvm-svn: 227826	2015-02-02 18:20:15 +00:00
Eric Christopher	c22f83a590	Remove unnecessary forward declaration. llvm-svn: 227813	2015-02-02 17:38:40 +00:00
Lang Hames	29dff9837a	[Orc] Make the ObjectLinkingLayer take ownership of object files until finalization time. As currently implemented, RuntimeDyldELF requires the original object file to be avaible when relocations are being resolved. This patch ensures that the ObjectLinkingLayer preserves it until then. In the future RuntimeDyldELF should be rewritten to remove this requirement, at which point this patch can be reverted. Regression test cases for Orc (which include coverage of this bug) will be committed shortly. llvm-svn: 227778	2015-02-02 04:32:17 +00:00
Lang Hames	15be546140	[Orc] Add sensible defaults for the ObjectLinkingLayer constructor. llvm-svn: 227776	2015-02-02 01:03:10 +00:00
Benjamin Kramer	813fc59b9f	FoldingSetVectorIterator is just a subset of pointee_iterator, remove it. llvm-svn: 227761	2015-02-01 19:26:05 +00:00
Adam Nemet	a9dc8d817d	Include cstddef in EquivalenceClasses.h This is to try to appease bots complaining that ptrdiff_t is undefined in LoopAccessAnalysis.cpp. llvm-svn: 227757	2015-02-01 17:21:06 +00:00
Adam Nemet	0456327cfb	[LoopVectorize] Move LoopAccessAnalysis to its own module Other than moving code and adding the boilerplate for the new files, the code being moved is unchanged. There are a few global functions that are shared with the rest of the LoopVectorizer. I moved these to the new module as well (emitLoopAnalysis, stripIntegerCast, replaceSymbolicStrideSCEV) along with the Report class used by emitLoopAnalysis. There is probably room for further improvement in this area. I kept DEBUG_TYPE "loop-vectorize" because it's used as the PassName with emitOptimizationRemarkAnalysis. This will obviously have to change. NFC. This is part of the patchset that splits out the memory dependence logic from LoopVectorizationLegality into a new class LoopAccessAnalysis. LoopAccessAnalysis will be used by the new Loop Distribution pass. llvm-svn: 227756	2015-02-01 16:56:15 +00:00
Adam Nemet	fababbded9	[LoopVectorize] Make hasVectorInstrinsicScalarOpd inline VectorUtils.h needs to be included in LoopAccessAnalysis.cpp for getIntrinsicIDForCall but hasVectorInstrinsicScalarOpd is not used by this module. NFC. This is part of the patchset that splits out the memory dependence logic from LoopVectorizationLegality into a new class LoopAccessAnalysis. LoopAccessAnalysis will be used by the new Loop Distribution pass. llvm-svn: 227753	2015-02-01 16:56:05 +00:00
Michael Kuperstein	13fbd45263	[X86] Convert esp-relative movs of function arguments to pushes, step 2 This moves the transformation introduced in r223757 into a separate MI pass. This allows it to cover many more cases (not only cases where there must be a reserved call frame), and perform rudimentary call folding. It still doesn't have a heuristic, so it is enabled only for optsize/minsize, with stack alignment <= 8, where it ought to be a fairly clear win. (Re-commit of r227728) Differential Revision: http://reviews.llvm.org/D6789 llvm-svn: 227752	2015-02-01 16:56:04 +00:00
Michael Kuperstein	e86aa9a8a4	Revert r227728 due to bad line endings. llvm-svn: 227746	2015-02-01 16:15:07 +00:00
Chandler Carruth	21fc195c13	[multiversion] Kill FunctionTargetTransformInfo, TTI itself is now per-function and supports the exact desired interface. llvm-svn: 227743	2015-02-01 14:37:03 +00:00
Chandler Carruth	ab5cb36c40	[multiversion] Remove the function parameter from the unrolling preferences interface on TTI now that all of TTI is per-function. llvm-svn: 227741	2015-02-01 14:31:23 +00:00
Chandler Carruth	c956ab6603	[multiversion] Switch the TTI queries from TargetMachine to Subtarget now that we have a correct and cached subtarget specific to the function. Also, finish providing a cached per-function subtarget in the core LLVMTargetMachine -- that layer hadn't switched over yet. The only use of the TargetMachine was to re-lookup a subtarget for a particular function to work around the fact that TTI was immutable. Now that it is per-function and we haved a cached subtarget, use it. This still leaves a few interfaces with real warts on them where we were passing Function objects through the TTI interface. I'll remove these and clean their usage up in subsequent commits now that this isn't necessary. llvm-svn: 227738	2015-02-01 14:22:17 +00:00

1 2 3 4 5 ...

22426 Commits