llvm-project

Commit Graph

Author	SHA1	Message	Date
Philip Reames	f388050105	[RewriteStatepointsForGC] Minor code cleanup [NFC] We can use builders to simplify part of the code and we only check for the existance of the metadata value; this enables us to delete some redundant code. llvm-svn: 242751	2015-07-21 00:49:55 +00:00
Matt Arsenault	f849bb49cc	AMDGPU: Set isMoveImm on s_movk_i32 llvm-svn: 242747	2015-07-21 00:40:08 +00:00
Matthias Braun	a50d2203fa	ARMLoadStoreOpt: Merge subs/adds into LDRD/STRD; Factor out common code Re-apply of r241928 which had to be reverted because of the r241926 revert. This commit factors out common code from MergeBaseUpdateLoadStore() and MergeBaseUpdateLSMultiple() and introduces a new function MergeBaseUpdateLSDouble() which merges adds/subs preceding/following a strd/ldrd instruction into an strd/ldrd instruction with writeback where possible. Differential Revision: http://reviews.llvm.org/D10676 llvm-svn: 242743	2015-07-21 00:19:01 +00:00
Matthias Braun	e40d89ef9b	ARMLoadStoreOptimizer: Create LDRD/STRD on thumb2 Re-apply r241926 with an additional check that r13 and r15 are not used for LDRD/STRD. See http://llvm.org/PR24190. This also already includes the fix from r241951. Differential Revision: http://reviews.llvm.org/D10623 llvm-svn: 242742	2015-07-21 00:18:59 +00:00
Akira Hatanaka	42427d2c38	Revert r242737. This caused builds to fail with the following error message: error:Too many subtarget features! Bump MAX_SUBTARGET_FEATURES. llvm-svn: 242740	2015-07-20 23:51:12 +00:00
Akira Hatanaka	7482d40cd5	[ARM] Define subtarget feature "reserve-r9", which is used to decide whether register r9 should be reserved. This change is needed because we cannot use a backend option to set cl::opt "arm-reserve-r9" when doing LTO. Out-of-tree projects currently using cl::opt option "-arm-reserve-r9" to reserve r9 should make changes to add subtarget feature "reserve-r9" to the IR. rdar://problem/21529937 Differential Revision: http://reviews.llvm.org/D11320 llvm-svn: 242737	2015-07-20 23:21:30 +00:00
Matthias Braun	731e359e70	Revert "ARMLoadStoreOptimizer: Create LDRD/STRD on thumb2" This reverts commit r241926. This caused http://llvm.org/PR24190 llvm-svn: 242735	2015-07-20 23:17:20 +00:00
Matthias Braun	84e289702a	Revert "ARMLoadStoreOpt: Merge subs/adds into LDRD/STRD; Factor out common code" This reverts commit r241928. This caused http://llvm.org/PR24190 llvm-svn: 242734	2015-07-20 23:17:16 +00:00
Matthias Braun	22f3960759	Revert "ARM: Use SpecificBumpPtrAllocator to fix leak introduced in r241920" This reverts commit r241951. It caused http://llvm.org/PR24190 llvm-svn: 242733	2015-07-20 23:17:14 +00:00
Matthias Braun	c8b67e656b	AArch64: Restrict macroop fusion heuristics to cyclone. Even though this is just some hinting for the scheduler it doesn't make sense to do that unless you know the target can perform the fusion. llvm-svn: 242732	2015-07-20 23:11:42 +00:00
JF Bastien	e4d22d59d1	Targets: commonize some stack realignment code This patch does the following: * Fix FIXME on `needsStackRealignment`: it is now shared between multiple targets, implemented in `TargetRegisterInfo`, and isn't `virtual` anymore. This will break out-of-tree targets, silently if they used `virtual` and with a build error if they used `override`. * Factor out `canRealignStack` as a `virtual` function on `TargetRegisterInfo`, by default only looks for the `no-realign-stack` function attribute. Multiple targets duplicated the same `needsStackRealignment` code: - Aarch64. - ARM. - Mips almost: had extra `DEBUG` diagnostic, which the default implementation now has. - PowerPC. - WebAssembly. - x86 almost: has an extra `-force-align-stack` option, which the default implementation now has. The default implementation of `needsStackRealignment` used to just return `false`. My current patch changes the behavior by simply using the above shared behavior. This affects: - AMDGPU - BPF - CppBackend - MSP430 - NVPTX - Sparc - SystemZ - XCore - Out-of-tree targets This is a breaking change! `make check` passes. The only implementation of the `virtual` function (besides the slight different in x86) was Hexagon (which did `MF.getFrameInfo()->getMaxAlignment() > 8`), and potentially some out-of-tree targets. Hexagon now uses the default implementation. `needsStackRealignment` was being overwritten in `<Target>GenRegisterInfo.inc`, to return `false` as the default also did. That was odd and is now gone. Reviewers: sunfish Subscribers: aemerson, llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D11160 llvm-svn: 242727	2015-07-20 22:51:32 +00:00
Reid Kleckner	87d03450a5	Don't try to instrument allocas used by outlined SEH funclets Summary: Arguments to llvm.localescape must be static allocas. They must be at some statically known offset from the frame or stack pointer so that other functions can access them with localrecover. If we ever want to instrument these, we can use more indirection to recover the addresses of these local variables. We can do it during clang irgen or with the asan module pass. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11307 llvm-svn: 242726	2015-07-20 22:49:44 +00:00
Matthias Braun	e536f4f681	AArch64: Add aditional Cyclone macroop fusion opportunities Related to rdar://19205407 Differential Revision: http://reviews.llvm.org/D10746 llvm-svn: 242724	2015-07-20 22:34:47 +00:00
Matthias Braun	2bd6dd8d54	MachineScheduler: Restrict macroop fusion to data-dependent instructions. Before creating a schedule edge to encourage MacroOpFusion check that: - The predecessor actually writes a register that the branch reads. - The predecessor has no successors in the ScheduleDAG so we can schedule it in front of the branch. This avoids skewing the scheduling heuristic in cases where macroop fusion cannot happen. Differential Revision: http://reviews.llvm.org/D10745 llvm-svn: 242723	2015-07-20 22:34:44 +00:00
Geoff Berry	e41c2df0ef	Fix comment typo (test commit). NFC llvm-svn: 242719	2015-07-20 22:03:52 +00:00
Quentin Colombet	71a71485f4	[ARM] Refactor the prologue/epilogue emission to be more robust. This is the first step toward supporting shrink-wrapping for this target. The changes could be summarized by these items: - Expand the tail-call return as part of the expand pseudo pass. - Get rid of the assumptions that the epilogue is the exit block: * Do not assume which registers are free in the epilogue. (This indirectly improve the lowering of the code for the segmented stacks, see the test cases.) * Take into account that the basic block can be empty. Related to <rdar://problem/20821730> llvm-svn: 242714	2015-07-20 21:42:14 +00:00
Jingyue Wu	48a9bdc6aa	[NVPTX] make load on global readonly memory to use ldg Summary: [NVPTX] make load on global readonly memory to use ldg Summary: As describe in [1], ld.global.nc may be used to load memory by nvcc when __restrict__ is used and compiler can detect whether read-only data cache is safe to use. This patch will try to check whether ldg is safe to use and use them to replace ld.global when possible. This change can improve the performance by 18~29% on affected kernels (ratt_kernel and rwdot_kernel) in S3D benchmark of shoc [2]. Patched by Xuetian Weng. [1] http://docs.nvidia.com/cuda/kepler-tuning-guide/#read-only-data-cache [2] https://github.com/vetter/shoc Test Plan: test/CodeGen/NVPTX/load-with-non-coherent-cache.ll Reviewers: jholewinski, jingyue Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D11314 llvm-svn: 242713	2015-07-20 21:28:54 +00:00
Krzysztof Parzyszek	921722049d	[Hexagon] Generate MUX from conditional transfers when dot-new not possible llvm-svn: 242711	2015-07-20 21:23:25 +00:00
Alex Lorenz	ab98049947	MIR Serialization: Initial serialization of machine constant pools. This commit implements the initial serialization of machine constant pools and the constant pool index machine operands. The constant pool is serialized using a YAML sequence of YAML mappings that represent the constant values. The target-specific constant pool items aren't serialized by this commit. Reviewers: Duncan P. N. Exon Smith llvm-svn: 242707	2015-07-20 20:51:18 +00:00
Sanjoy Das	93d608c3c3	[ImplicitNullChecks] Work with implicit defs. Summary: This change generalizes the implicit null checks pass to work with instructions that don't have any explicit register defs. This lets us use X86's `cmp` against memory as faulting load instructions. Reviewers: reames, JosephTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11286 llvm-svn: 242703	2015-07-20 20:31:39 +00:00
Alex Lorenz	b29554dab9	MIR Parser: Add support for quoted named global value operands. This commit extends the machine instruction lexer and implements support for the quoted global value tokens. With this change the syntax for the global value identifier tokens becomes identical to the syntax for the global identifier tokens from the LLVM's assembly language. Reviewers: Duncan P. N. Exon Smith llvm-svn: 242702	2015-07-20 20:31:01 +00:00
Chad Rosier	3da0ea7f5d	[AArch64] Change EON pattern to match more often. Phabricator: http://reviews.llvm.org/D11359 Patch by Geoff Berry <gberry@codeaurora.org> llvm-svn: 242694	2015-07-20 18:42:27 +00:00
Tom Stellard	70580f83cc	AMDGPU/SI: Add VI patterns to select FLAT instructions for global memory ops Summary: The MUBUF addr64 bit has been removed on VI, so we must use FLAT instructions when the pointer is stored in VGPRs. Reviewers: arsenm Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11067 llvm-svn: 242673	2015-07-20 14:28:41 +00:00
Vasileios Kalintiris	974d409259	[mips] Added support for the ERETNC instruction. Summary: This required adding the instruction predicate HasMips32r5. Patch by Scott Egerton. Reviewers: dsanders, vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11136 llvm-svn: 242666	2015-07-20 12:28:56 +00:00
Arnold Schwaighofer	764d6de823	Revert "MergeFuncs: Transfer the function parameter attributes to the call site" It is okay to not transfer parameter attributes. This reverts commit r242558. llvm-svn: 242646	2015-07-19 19:30:43 +00:00
Yaron Keren	c66c06b899	Narrow Callee scope, suggestion from David Blaikie. llvm-svn: 242644	2015-07-19 15:48:07 +00:00
Simon Pilgrim	e2c244f3b4	[X86][SSE] Reordered cast vectorization costs. NFCI. Reordered the data tables at the top and placed the lookups after. The first stage in the yak shaving necessary to get more accurate costs for a variety of targets given the recent improvements to SINT_TO_FP/UINT_TO_FP/SIGN_EXTEND vector lowering. llvm-svn: 242643	2015-07-19 15:36:12 +00:00
Yaron Keren	611f614ee1	De-duplicate CS.getCalledFunction() expression. Not sure if the optimizer will save the call as getCalledFunction() is not a trivial access function but the code is clearer this way. llvm-svn: 242641	2015-07-19 11:52:02 +00:00
Simon Pilgrim	4ef0576c40	[DAGCombiner] Fixed minor typo that was missed in D9097. We don't bitcast the UNDEFs - that is done in visitVECTOR_SHUFFLE, and the getValueType should come from the operand's SDValue not the SDNode. llvm-svn: 242640	2015-07-19 11:31:40 +00:00
Michael Kuperstein	69e40a4c85	[X86] Add support for tbyte memory operand size for Intel-syntax x86 assembly Differential Revision: http://reviews.llvm.org/D11257 Patch by: marina.yatsina@intel.com llvm-svn: 242639	2015-07-19 11:03:08 +00:00
Simon Pilgrim	ba51d116c4	Remove TargetInstrInfo::canFoldMemoryOperand canFoldMemoryOperand is not actually used anywhere in the codebase - all existing users instead call foldMemoryOperand directly when they wish to fold and can correctly deduce what they need from the return value. This patch removes the canFoldMemoryOperand base function and the target implementations; only x86 had a real (bit-rotted) implementation, although AMDGPU had a preparatory stub that had never needed to be completed. Differential Revision: http://reviews.llvm.org/D11331 llvm-svn: 242638	2015-07-19 10:50:53 +00:00
Elena Demikhovsky	17b906058e	AVX-512: Floating point conversions for SKX - DAG Lowering. SKX supports conversion for all FP types. Integer types include doublewords and quardwords. I added "Legal" status for these nodes and a bunch of tests. I added "NoVLX" for AVX DAG selection to force VLX instructions selection when VLX is supported. Differential Revision: http://reviews.llvm.org/D11255 llvm-svn: 242637	2015-07-19 10:17:33 +00:00
Simon Pilgrim	3aca32ea4a	Use SDValue bool check. NFCI. llvm-svn: 242636	2015-07-19 09:56:36 +00:00
Simon Pilgrim	59764dccfb	[X86][SSE] Updated SHL/LSHR i64 vectorization costs. This was missed in D8416. llvm-svn: 242621	2015-07-18 20:06:30 +00:00
Benjamin Kramer	c9436ad659	[AggressiveAntiDepBreaker] Use range loops for multimap access. No functionality change intended. llvm-svn: 242620	2015-07-18 20:05:10 +00:00
Yaron Keren	3d49f6df94	Rangify for loops in GlobalDCE, NFC. llvm-svn: 242619	2015-07-18 19:57:34 +00:00
Benjamin Kramer	9a5d788948	[Hexagon] Use composition instead of inheritance from STL types The standard containers are not designed to be inherited from, as illustrated by the MSVC hacks for NodeOrdering. No functional change intended. llvm-svn: 242616	2015-07-18 17:43:23 +00:00
Chandler Carruth	9f2bf1aff5	[PM/AA] Remove the addEscapingUse update API that won't be easy to directly model in the new PM. This also was an incredibly brittle and expensive update API that was never fully utilized by all the passes that claimed to preserve AA, nor could it reasonably have been extended to all of them. Any number of places add uses of values. If we ever wanted to reliably instrument this, we would want a callback hook much like we have with ValueHandles, but doing this for every use addition seems extremely expensive in terms of compile time. The only user of this update mechanism is GlobalsModRef. The idea of using this to keep it up to date doesn't really work anyways as its analysis requires a symmetric analysis of two different memory locations. It would be very hard to make updates be sufficiently rigorous to guarantee symmetric analysis in this way, and it pretty certainly isn't true today. However, folks have been using GMR with this update for a long time and seem to not be hitting the issues. The reported issue that the update hook fixes isn't even a problem any more as other changes to GetUnderlyingObject worked around it, and that issue stemmed from many years ago. As a consequence, a prior patch provided a flag to control the unsafe behavior of GMR, and this patch removes the update mechanism that has questionable compile-time tradeoffs and is causing problems with moving to the new pass manager. Note the lack of test updates -- not one test in tree actually requires this update, even for a contrived case. All of this was extensively discussed on the dev list, this patch will just enact what that discussion decides on. I'm sending it for review in part to show what I'm planning, and in part to show the amazing amount of work this avoids. Every call to the AA here is something like three to six indirect function calls, which in the non-LTO pipeline never do any work! =[ Differential Revision: http://reviews.llvm.org/D11214 llvm-svn: 242605	2015-07-18 03:26:46 +00:00
Kostya Serebryany	86e4a3e0a3	[libFuzzer] require the files and directories passed to the fuzzer to exist llvm-svn: 242596	2015-07-18 00:03:37 +00:00
Evgeniy Stepanov	9cb08f823f	[asan] Fix shadow mapping on Android/AArch64. Instrumentation and the runtime library were in disagreement about ASan shadow offset on Android/AArch64. This fixes a large number of existing tests on Android/AArch64. llvm-svn: 242595	2015-07-17 23:51:18 +00:00
Matthias Braun	9e85980658	ARM: Enable MachineScheduler and disable PostRAScheduler for swift. Reapply r242500 now that the swift schedmodel includes LDRLIT. This is mostly done to disable the PostRAScheduler which optimizes for instruction latencies which isn't a good fit for out-of-order architectures. This also allows to leave out the itinerary table in swift in favor of the SchedModel ones. This change leads to performance improvements/regressions by as much as 10% in some benchmarks, in fact we loose 0.4% performance over the llvm-testsuite for reasons that appear to be unknown or out of the compilers control. rdar://20803802 documents the investigation of these effects. While it is probably a good idea to perform the same switch for the other ARM out-of-order CPUs, I limited this change to swift as I cannot perform the benchmark verification on the other CPUs. Differential Revision: http://reviews.llvm.org/D10513 llvm-svn: 242588	2015-07-17 23:18:30 +00:00
Matthias Braun	141d1c9d8f	ARM: Add scheduling information for LDRLIT instructions to swift scheduling model These pseudo instructions are only lowered after register allocation and are therefore still present when the machine scheduler runs. Add a run: line to a testcase that uses the uncommon flags necessary to actually produce a LDRLIT instruction on swift. llvm-svn: 242587	2015-07-17 23:18:26 +00:00
Quentin Colombet	11922946fe	[RAGreedy] Add an experimental deferred spilling feature. The idea of deferred spilling is to delay the insertion of spill code until the very end of the allocation. A "candidate" to spill variable might not required to be spilled because of other evictions that happened after this decision was taken. The spirit is similar to the optimistic coloring strategy implemented in Preston and Briggs graph coloring algorithm. For now, this feature is highly experimental. Although correct, it would require much more modification to properly model the effect of spilling. Anyway, this early patch helps prototyping this feature. Note: The test case cannot unfortunately be reduced and is probably fragile. llvm-svn: 242585	2015-07-17 23:04:06 +00:00
Alex Lorenz	484903ecd2	MIR Parser: Allow the dollar characters in all of the identifier tokens. This commit modifies the machine instruction lexer so that it now accepts the '$' characters in identifier tokens. This change makes the syntax for unquoted global value tokens consistent with the syntax for the global idenfitier tokens in the LLVM's assembly language. llvm-svn: 242584	2015-07-17 22:48:04 +00:00
Alex Lorenz	d225595dcf	AsmParser: Add a function to parse a standalone constant value. This commit extends the interface provided by the AsmParser library by adding a function that allows the user to parse a standalone contant value. This change is useful for MIR serialization, as it will allow the MIR Parser to parse the constant values in a machine constant pool. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10280 llvm-svn: 242579	2015-07-17 22:07:03 +00:00
Kuba Brecka	7f54753180	[asan] Add a comment explaining why non-instrumented allocas are moved. Addition to r242510. llvm-svn: 242561	2015-07-17 19:20:21 +00:00
Arnold Schwaighofer	690cd87dcd	MergeFuncs: Transfer the function parameter attributes to the call site rdar://21516488 llvm-svn: 242558	2015-07-17 18:59:08 +00:00
Adam Nemet	5a6d5bc17b	Revert "ARM: Enable MachineScheduler and disable PostRAScheduler for swift." This reverts commit r242500. It broke some internal tests and Matthias asked me to revert it while he is investigating. llvm-svn: 242553	2015-07-17 18:14:19 +00:00
Matthias Braun	244a6773c7	Use llvm_unreachable() instead of report_fatal_error() if the machine model is incomplete This error is for developers only so it makes sense to abort and get a backtrace. llvm-svn: 242551	2015-07-17 17:50:11 +00:00
James Molloy	a6702e2f14	[ARM] Use [SU]ABSDIFF nodes instead of intrinsics for VABD/VABA No functional change, but it preps codegen for the future when SABSDIFF will start getting generated in anger. llvm-svn: 242546	2015-07-17 17:10:55 +00:00

1 2 3 4 5 ...

81428 Commits