llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	82607f56bd	[WebAssembly] Add support for using a wasm global for the stack pointer. This replaces the __stack_pointer variable which was allocated in linear memory. llvm-svn: 296201	2017-02-24 23:46:05 +00:00
Dan Gohman	d934cb8806	[WebAssembly] Basic support for Wasm object file encoding. With the "wasm32-unknown-unknown-wasm" triple, this allows writing out simple wasm object files, and is another step in a larger series toward migrating from ELF to general wasm object support. Note that this code and the binary format itself is still experimental. llvm-svn: 296190	2017-02-24 23:18:00 +00:00
Stanislav Mekhanoshin	42259cf35e	Revert "Correct register pressure calculation in presence of subregs" This reverts commit r296009. It broke one out of tree target and also does not account for all partial lines added or removed when calculating PressureDiff. llvm-svn: 296182	2017-02-24 21:56:16 +00:00
Dan Gohman	9e188e3366	[WebAssembly] Define an initial set of relocation types for Wasm. This set will likely evolve, along with the Wasm linking ABI. llvm-svn: 296177	2017-02-24 21:21:44 +00:00
Lang Hames	630d2639f9	[Orc][RPC] Accept both const char* and char* arguments for string serialization. llvm-svn: 296168	2017-02-24 20:56:43 +00:00
Simon Pilgrim	cdf2bd656a	Revert: r296141 [APInt] Add APInt::extractBits() method to extract APInt subrange The current pattern for extract bits in range is typically: Mask.lshr(BitOffset).trunc(SubSizeInBits); Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation of memory for the temporary variable. This is another of the compile time issues identified in PR32037 (see also D30265). This patch adds the APInt::extractBits() helper method which avoids the temporary memory allocation. Differential Revision: https://reviews.llvm.org/D30336 llvm-svn: 296147	2017-02-24 18:31:04 +00:00
Simon Pilgrim	bd9fb2ae95	[APInt] Add APInt::extractBits() method to extract APInt subrange The current pattern for extract bits in range is typically: Mask.lshr(BitOffset).trunc(SubSizeInBits); Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation of memory for the temporary variable. This is another of the compile time issues identified in PR32037 (see also D30265). This patch adds the APInt::extractBits() helper method which avoids the temporary memory allocation. Differential Revision: https://reviews.llvm.org/D30336 llvm-svn: 296141	2017-02-24 17:46:18 +00:00
Daniel Sanders	066ebbfd46	[globalisel] Decouple src pattern operands from dst pattern operands. Summary: This isn't testable for AArch64 by itself so this patch also adds support for constant immediates in the pattern and physical register uses in the result. The new IntOperandMatcher matches the constant in patterns such as '(set $rd:GPR32, (G_XOR $rs:GPR32, -1))'. It's always safe to fold immediates into an instruction so this is the first rule that will match across multiple BB's. The Renderer hierarchy is responsible for adding operands to the result instruction. Renderers can copy operands (CopyRenderer) or add physical registers (in particular %wzr and %xzr) to the result instruction in any order (OperandMatchers now import the operand names from SelectionDAG to allow renderers to access any operand). This allows us to emit the result instruction for: %1 = G_XOR %0, -1 --> %1 = ORNWrr %wzr, %0 %1 = G_XOR -1, %0 --> %1 = ORNWrr %wzr, %0 although the latter is untested since the matcher/importer has not been taught about commutativity yet. Added BuildMIAction which can build new instructions and mutate them where possible. W.r.t the mutation aspect, MatchActions are now told the name of an instruction they can recycle and BuildMIAction will emit mutation code when the renderers are appropriate. They are appropriate when all operands are rendered using CopyRenderer and the indices are the same as the matcher. This currently assumes that all operands have at least one matcher. Finally, this change also fixes a crash in AArch64InstructionSelector::select() caused by an immediate operand passing isImm() rather than isCImm(). This was uncovered by the other changes and was detected by existing tests. Depends on D29711 Reviewers: t.p.northover, ab, qcolombet, rovka, aditya_nandakumar, javed.absar Reviewed By: rovka Subscribers: aemerson, dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D29712 llvm-svn: 296131	2017-02-24 15:43:30 +00:00
Simon Pilgrim	aed352273e	[APInt] Add APInt::setBits() method to set all bits in range The current pattern for setting bits in range is typically: Mask \|= APInt::getBitsSet(MaskSizeInBits, LoPos, HiPos); Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation memory for the temporary variable. This is one of the key compile time issues identified in PR32037. This patch adds the APInt::setBits() helper method which avoids the temporary memory allocation completely, this first implementation uses setBit() internally instead but already significantly reduces the regression in PR32037 (~10% drop). Additional optimization may be possible. I investigated whether there is need for APInt::clearBits() and APInt::flipBits() equivalents but haven't seen these patterns to be particularly common, but reusing the code would be trivial. Differential Revision: https://reviews.llvm.org/D30265 llvm-svn: 296102	2017-02-24 10:15:29 +00:00
Craig Topper	f2529c188b	[AVX-512] Remove lzcnt intrinsics and autoupgrade them to generic ctlz intrinsics with select. Clang has been emitting cltz intrinsics for a while now. llvm-svn: 296091	2017-02-24 05:35:04 +00:00
Justin Bogner	369ba753aa	OptDiag: Summarize the instruction count in asm-printer Add an optimization remark to asm-printer that summarizes the number of instructions emitted per function. llvm-svn: 296053	2017-02-24 00:19:22 +00:00
Justin Bogner	0bb740540b	OptDiag: Use DiagnosticLocation in MachineOptimizationRemarks DiagnosticInfo switched from DebugLoc to DiagnosticLocation in r295519, update these subclasses to match. llvm-svn: 296052	2017-02-24 00:19:18 +00:00
Adrian McCarthy	649b8e0c45	Implement some methods for NativeRawSymbol This allows the ability to call IPDBSession::getGlobalScope with a NativeSession and to then query it for some basic fields from the PDB's InfoStream. Note that the symbols now have non-const references back to the Session so that NativeRawSymbol can access the PDBFile through the Session. Differential Revision: https://reviews.llvm.org/D30314 llvm-svn: 296049	2017-02-24 00:10:47 +00:00
Adam Nemet	25e92e29ba	[OptDiag] Comment about the legacy status of emitOptimizationRemark* functions llvm-svn: 296039	2017-02-23 23:11:23 +00:00
Adam Nemet	ca8f40f27a	[OptDiag] Remove hotness parameter from legacy remark ctors Anything using hotness should be using ORE. llvm-svn: 296038	2017-02-23 23:11:21 +00:00
Adam Nemet	de53bfb94d	[OptDiag] Hide legacy remark ctors These are only used when emitting remarks without ORE directly using the free functions emitOptimizationRemark*. llvm-svn: 296037	2017-02-23 23:11:11 +00:00
Bryant Wong	332e6e5ae5	[ADT] Fix zip iterator interface. This commit provides `zip_{first,shortest}` with the standard member types and methods expected of iterators (e.g., `difference_type`), in order for zip to be used with other adaptors, such as `make_filter_range`. Support for reverse iteration has also been added. Differential Revision: https://reviews.llvm.org/D30246 llvm-svn: 296036	2017-02-23 23:00:46 +00:00
Sanjoy Das	aa722ae84c	[IR] Add a Instruction::dropPoisonGeneratingFlags helper Summary: The helper will be used in a later change. This change itself is NFC since the only user of this new function is its unit test. Reviewers: majnemer, efriedma Reviewed By: efriedma Subscribers: efriedma, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D30184 llvm-svn: 296035	2017-02-23 22:50:52 +00:00
Dehao Chen	cc75d2441d	Add call branch annotation for ICP promoted direct call in SamplePGO mode. Summary: SamplePGO uses branch_weight annotation to represent callsite hotness. When ICP promotes an indirect call to direct call, we need to make sure the direct call is annotated with branch_weight in SamplePGO mode, so that downstream function inliner can use hot callsite heuristic. Reviewers: davidxl, eraman, xur Reviewed By: davidxl, xur Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D30282 llvm-svn: 296028	2017-02-23 22:15:18 +00:00
Adam Nemet	2531df3d49	[ORE] Remove ORE.emit{{.+}} functions Last use was killed in my previous patch. The preferred way is now to construct the remark, pipe things to it and pass it to ORE.emit. llvm-svn: 296019	2017-02-23 21:32:53 +00:00
Adam Nemet	41b019a39c	[LAA] Remove unused LoopAccessReport The need for this removed when I converted everything to use the opt-remark classes directly with the streaming interface. llvm-svn: 296017	2017-02-23 21:17:36 +00:00
Evgeniy Stepanov	ee2d77f6d6	Disable TLS for stack protector on Android API<17. The TLS slot did not exist back then. llvm-svn: 296014	2017-02-23 21:06:35 +00:00
Ahmed Bougacha	ae9dadecf3	[GlobalISel] Emit opt remarks on isel fallbacks. Having more fine-grained information on the specific construct that caused us to fallback is valuable for large-scale data collection. We still have the fallback warning, that's also used for FastISel. We still need to remove the fallback warning, and teach FastISel to also emit remarks (it currently has a combination of the warning, stats, and debug prints: the remarks could unify all three). The abort-on-fallback path could also be better handled using remarks: one could imagine a "-Rpass-error", analoguous to "-Werror", which would promote missed/failed remarks to errors. It's not clear whether that would be useful for other remarks though, so we're not there yet. llvm-svn: 296013	2017-02-23 21:05:42 +00:00
Ahmed Bougacha	272739751d	[CodeGen] Teach opt remarks how to print MI instructions. This will be used with GISel opt remarks. llvm-svn: 296012	2017-02-23 21:05:33 +00:00
Ahmed Bougacha	97119d48db	[CodeGen] Print MI without a newline when skipping debugloc. NFC. This matches the behavior for skip-operands. While there, document it. This is a follow-up to r296007. llvm-svn: 296011	2017-02-23 21:05:29 +00:00
Ahmed Bougacha	0cf8cfa165	[CodeGen] Use const MBBs in the opt remark diagnostics. NFC. llvm-svn: 296010	2017-02-23 21:05:23 +00:00
Stanislav Mekhanoshin	ce3ddd2de4	Correct register pressure calculation in presence of subregs If a subreg is used in an instruction it counts as a whole superreg for the purpose of register pressure calculation. This patch corrects improper register pressure calculation by examining operand's lane mask. Differential Revision: https://reviews.llvm.org/D29835 llvm-svn: 296009	2017-02-23 20:19:44 +00:00
Ahmed Bougacha	851125dca9	[ORE] Use const CodeRegions in the remark diagnostics. NFC. llvm-svn: 296008	2017-02-23 19:17:34 +00:00
Ahmed Bougacha	4319224628	[CodeGen] Add a way to SkipDebugLoc in MachineInstr::print(). NFC. llvm-svn: 296007	2017-02-23 19:17:31 +00:00
Sanjay Patel	4a4fbe162f	[DAG] add convenience function to get -1 constant; NFCI llvm-svn: 296004	2017-02-23 19:02:33 +00:00
Adam Nemet	b516cf3f3f	[LazyMachineBFI] Reimplement with getAnalysisIfAvailable Since LoopInfo is not available in machine passes as universally as in IR passes, using the same approach for OptimizationRemarkEmitter as we did for IR will run LoopInfo and DominatorTree unnecessarily. (LoopInfo is not used lazily by ORE.) To fix this, I am modifying the approach I took in D29836. LazyMachineBFI now uses its client passes including MachineBFI itself that are available or otherwise compute them on the fly. So for example GreedyRegAlloc, since it's already using MBFI, will reuse that instance. On the other hand, AsmPrinter in Justin's patch will generate DT, LI and finally BFI on the fly. (I am of course wondering now if the simplicity of this approach is even preferable in IR. I will do some experiments.) Testing is provided by an updated version of D29837 which requires Justin's patch to bring ORE to the AsmPrinter. Differential Revision: https://reviews.llvm.org/D30128 llvm-svn: 295996	2017-02-23 17:30:01 +00:00
Matt Arsenault	3e9ce44eee	TargetOptions: Fix not accounting for NoSignedZerosFPMath in == llvm-svn: 295928	2017-02-23 03:16:44 +00:00
Matt Arsenault	f5262256a1	AMDGPU: Add replacement bfe intrinsics llvm-svn: 295899	2017-02-22 23:04:58 +00:00
Eugene Zelenko	db56e5a89a	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 295893	2017-02-22 22:32:51 +00:00
Daniel Berlin	fccbda967a	PredicateInfo: Support switch statements Summary: Depends on D29606 and D29682 Makes us pass GVN's edge.ll (we also will pass a few other testcases they just need cleaning up). Thoughts on the Predicate* hiearchy of classes especially welcome :) (it's not clear to me how best to organize it, and currently, the getBlock* seems ... uglier than maybe wasting a field somewhere or something). Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29747 llvm-svn: 295889	2017-02-22 22:20:58 +00:00
Daniel Berlin	211a1209a5	Add pair conversion functions to BasicBlockEdge. llvm-svn: 295888	2017-02-22 22:20:53 +00:00
Daniel Berlin	17e8d0eae2	Move updating functions to MemorySSAUpdater. Add updater to passes that now need it. Move around code in MemorySSA to expose needed functions. Summary: Mostly cleanup Reviewers: george.burgess.iv Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D30221 llvm-svn: 295887	2017-02-22 22:19:55 +00:00
Krzysztof Parzyszek	65971d97b0	[Hexagon] Add intrinsics for masked vector stores Patch by Harsha Jagasia. llvm-svn: 295879	2017-02-22 21:23:09 +00:00
Dan Gohman	7ea5adfff4	[WebAssembly] Implement the wasm binary container header. Also, update the version number to 0x1, which is what engines are now expecting. llvm-svn: 295860	2017-02-22 18:50:20 +00:00
Justin Bogner	8281c81413	OptDiag: Add const to some interfaces that don't modify anything. NFC This needed a const_cast for the dominator tree recalculation in OptimizationRemarkEmitter, but we do that all over the place already and it's safe. llvm-svn: 295812	2017-02-22 07:38:17 +00:00
Dan Gohman	18eafb6c68	[WebAssembly] Add skeleton MC support for the Wasm container format This just adds the basic skeleton for supporting a new object file format. All of the actual encoding will be implemented in followup patches. Differential Revision: https://reviews.llvm.org/D26722 llvm-svn: 295803	2017-02-22 01:23:18 +00:00
Matt Arsenault	1f17c66890	AMDGPU: Add cvt.pkrtz intrinsic Convert llvm.SI.packf16 test uses llvm-svn: 295797	2017-02-22 00:27:34 +00:00
Eugene Zelenko	49e2fc4f5f	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 295773	2017-02-21 22:07:52 +00:00
Zachary Turner	392ed9d342	[Support] Add a function to check if a file resides locally. Differential Revision: https://reviews.llvm.org/D30010 llvm-svn: 295768	2017-02-21 20:55:47 +00:00
Rafael Espindola	23a76be5ad	Don't modify archive members unless really needed. For whatever reason ld64 requires that member headers (not the member themselves) should be aligned. The only way to do that is to edit the previous member so that it ends at an aligned boundary. Since modifying data put in an archive is an undesirable property, llvm-ar should only do it when it is absolutely necessary. llvm-svn: 295765	2017-02-21 20:40:54 +00:00
Zachary Turner	43313b3e89	Try to fix line endings. llvm-svn: 295759	2017-02-21 19:52:57 +00:00
Zachary Turner	3788818730	Remove svn:eol-style property from 2 files. There are still over 3400 files remaining with this property set, but there are tens of thousands more with the property not set. Until we decide what to do on a global scale, this at least unblocks me temporarily. llvm-svn: 295756	2017-02-21 19:29:56 +00:00
Xin Tong	a05a6c101d	More comments for getUniqueExitBlocks. NFCI llvm-svn: 295750	2017-02-21 19:08:03 +00:00
Geoff Berry	5d534b6a11	[CodeGenPrepare] Sink and duplicate more 'and' instructions. Summary: Rework the code that was sinking/duplicating (icmp and, 0) sequences into blocks where they were being used by conditional branches to form more tbz instructions on AArch64. The new code is more general in that it just looks for 'and's that have all icmp 0's as users, with a target hook used to select which subset of 'and' instructions to consider. This change also enables 'and' sinking for X86, where it is more widely beneficial than on AArch64. The 'and' sinking/duplicating code is moved into the optimizeInst phase of CodeGenPrepare, where it can take advantage of the fact the OptimizeCmpExpression has already sunk/duplicated any icmps into the blocks where they are used. One minor complication from this change is that optimizeLoadExt needed to be updated to always mark 'and's it has determined should be in the same block as their feeding load in the InsertedInsts set to avoid an infinite loop of hoisting and sinking the same 'and'. This change fixes a regression on X86 in the tsan runtime caused by moving GVNHoist to a later place in the optimization pipeline (see PR31382). Reviewers: t.p.northover, qcolombet, MatzeB Subscribers: aemerson, mcrosier, sebpop, llvm-commits Differential Revision: https://reviews.llvm.org/D28813 llvm-svn: 295746	2017-02-21 18:53:14 +00:00
Matthias Braun	9ab403942b	ScheduleDAG: Cleanup; NFC - Fix doxygen comments (do not repeat documented name, remove definition comment if there is already one at the declaration, add \p, ...) - Add some const modifiers - Use range based for llvm-svn: 295688	2017-02-21 01:27:33 +00:00

1 2 3 4 5 ...

30443 Commits