llvm-project

Commit Graph

Author	SHA1	Message	Date
Lang Hames	f49bc3f1b1	[X86] Optimize stackmap shadows on X86. This patch minimizes the number of nops that must be emitted on X86 to satisfy stackmap shadow constraints. To minimize the number of nops inserted, the X86AsmPrinter now records the size of the most recent stackmap's shadow in the StackMapShadowTracker class, and tracks the number of instruction bytes emitted since the that stackmap instruction was encountered. Padding is emitted (if it is required at all) immediately before the next stackmap/patchpoint instruction, or at the end of the basic block. This optimization should reduce code-size and improve performance for people using the llvm stackmap intrinsic on X86. <rdar://problem/14959522> llvm-svn: 213892	2014-07-24 20:40:55 +00:00
Reid Kleckner	9a412d13c1	Replace an assertion with a fatal error Frontends are responsible for putting inalloca on parameters that would be passed in memory and not registers. llvm-svn: 213891	2014-07-24 19:53:33 +00:00
Saleem Abdulrasool	c61ed0474e	X86: correct library call setup for Windows itanium This target is identical to the Windows MSVC (and follows Microsoft ABI for C). Correct the library call setup for this target. The same set of library calls are missing on this environment. llvm-svn: 213883	2014-07-24 17:46:36 +00:00
Matt Arsenault	83592a2d32	R600: Add FMA instructions for Evergreen llvm-svn: 213882	2014-07-24 17:41:01 +00:00
Saleem Abdulrasool	34610e33ae	X86: silence sign comparison warning GCC 4.8 detected a signed compare [-Wsign-compare]. Add a cast for the destination index. Add an assert to catch a potential overflow however unlikely it may be. llvm-svn: 213878	2014-07-24 17:12:06 +00:00
Matt Arsenault	83e60581c3	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877	2014-07-24 17:10:35 +00:00
Joerg Sonnenberger	c7dbc13e77	Include relative path for header outside the current directory. llvm-svn: 213872	2014-07-24 16:04:46 +00:00
Tim Northover	7324e845a4	AArch64: refactor ReconstructShuffle function Quite a bit of cruft had accumulated as we realised the various different cases it had to handle and squeezed them in where possible. This refactoring mostly flattens the logic and special-cases. The result is slightly longer, but I think clearer. Should be no functionality change. llvm-svn: 213867	2014-07-24 15:39:55 +00:00
Hal Finkel	cc39b67530	AA metadata refactoring (introduce AAMDNodes) In order to enable the preservation of noalias function parameter information after inlining, and the representation of block-level __restrict__ pointer information (etc.), additional kinds of aliasing metadata will be introduced. This metadata needs to be carried around in AliasAnalysis::Location objects (and MMOs at the SDAG level), and so we need to generalize the current scheme (which is hard-coded to just one TBAA MDNode). This commit introduces only the necessary refactoring to allow for the introduction of other aliasing metadata types, but does not actually introduce any (that will come in a follow-up commit). What it does introduce is a new AAMDNodes structure to hold all of the aliasing metadata nodes associated with a particular memory-accessing instruction, and uses that structure instead of the raw MDNode in AliasAnalysis::Location, etc. No functionality change intended. llvm-svn: 213859	2014-07-24 12:16:19 +00:00
NAKAMURA Takumi	8d745ca7cc	Prune redundant libdeps. llvm-svn: 213857	2014-07-24 11:45:27 +00:00
NAKAMURA Takumi	98d18be5fe	Prune dependency to MC from each target disassembler. llvm-svn: 213856	2014-07-24 11:45:11 +00:00
Tilmann Scheller	96ef72e54a	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STRH instructions. The ARM ARM prohibits STRH instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STRH instructions with unpredictable behavior. llvm-svn: 213850	2014-07-24 09:55:46 +00:00
Daniel Sanders	bdcfab117c	[mips] Fix ll and sc instructions Summary: The ll and sc instructions for r6 and non-r6 are misplaced. This patch fixes that. Patch by Jyun-Yan You Differential Revision: http://reviews.llvm.org/D4578 llvm-svn: 213847	2014-07-24 09:47:14 +00:00
Matt Arsenault	9acb978105	R600: Match rcp node on pre-SI llvm-svn: 213844	2014-07-24 06:59:24 +00:00
Matt Arsenault	0daeb63f03	R600: Fix LowerSDIV24 Use ComputeNumSignBits instead of checking for i8 / i16 which only worked when AMDIL was lying about having legal i8 / i16. If an integer is known to fit in 24-bits, we can do division faster with float ops. llvm-svn: 213843	2014-07-24 06:59:20 +00:00
NAKAMURA Takumi	9c3bd7618a	Update library dependencies. llvm-svn: 213832	2014-07-24 02:10:42 +00:00
Matt Arsenault	034d666bb7	R600: Implement enableClusterLoads() llvm-svn: 213831	2014-07-24 02:10:17 +00:00
Kevin Qin	9a2a2c502b	[AArch64] Fix a bug generating incorrect instruction when building small vector. This bug is introduced by r211144. The element of operand may be smaller than the element of result, but previous commit can only handle the contrary condition. This commit is to handle this scenario and generate optimized codes like ZIP1. llvm-svn: 213830	2014-07-24 02:05:42 +00:00
Jiangning Liu	451f30e89f	[AArch64] Disable some optimization cases for type conversion from sint to fp, because those optimization cases are micro-architecture dependent and only make sense for Cyclone. A new predicate Cyclone is introduced in .td file. llvm-svn: 213827	2014-07-24 01:29:59 +00:00
Filipe Cabecinhas	933cccf3fa	Fixed PR20411 - bug in getINSERTPS() When we had a vector_shuffle where we had an input from each vector, we could miscompile it because we were assuming the input from V2 wouldn't be moved from where it was on the vector. Added a test case. llvm-svn: 213826	2014-07-24 01:28:21 +00:00
Rafael Espindola	45bcf8a59c	Fix the build when building with only the ARM backend. llvm-svn: 213814	2014-07-23 22:54:28 +00:00
Rafael Espindola	5addace56d	Finish inverting the MC -> Object dependency. There were still some disassembler bits in lib/MC, but their use of Object was only visible in the includes they used, not in the symbols. llvm-svn: 213808	2014-07-23 22:26:07 +00:00
Jim Grosbach	724e438c62	[X86,AArch64] Extend vcmp w/ unary op combine to work w/ more constants. The transform to constant fold unary operations with an AND across a vector comparison applies when the constant is not a splat of a scalar as well. llvm-svn: 213800	2014-07-23 20:41:43 +00:00
Jim Grosbach	8f6f0858ec	X86: restrict combine to when type sizes are safe. The folding of unary operations through a vector compare and mask operation is only safe if the unary operation result is of the same size as its input. For example, it's not safe for [su]itofp from v4i32 to v4f64. llvm-svn: 213799	2014-07-23 20:41:38 +00:00
Justin Holewinski	2cb5e181d1	[NVPTX] Silence a GCC warning found by the buildbots The cast to NVPTXTargetLowering was missing a 'const', but let's just access the right pointer through the subtarget anyway. llvm-svn: 213793	2014-07-23 20:23:47 +00:00
Juergen Ributzka	1b014504ab	[FastISel][AArch64] Fix return type in FastLowerCall. I used the wrong method to obtain the return type inside FinishCall. This fix simply uses the return type from FastLowerCall, which we already determined to be a valid type. Reduced test case from Chad. Thanks. llvm-svn: 213788	2014-07-23 20:03:13 +00:00
Justin Holewinski	ecca715b3c	[NVPTX] mul.wide generation works for any smaller integer source types, not just the next smaller power of two llvm-svn: 213784	2014-07-23 18:46:03 +00:00
Justin Holewinski	511664dc76	[NVPTX] Make sure we do not generate MULWIDE ISD nodes when optimizations are disabled With optimizations disabled, we disable the isel patterns for mul.wide; but we were still generating MULWIDE ISD nodes. Now, we only try to generate MULWIDE ISD nodes in DAGCombine if the optimization level is not zero. llvm-svn: 213773	2014-07-23 17:40:45 +00:00
Chad Rosier	17020f96c7	[AArch64] Lower sdiv x, pow2 using add + select + shift. The target-independent DAGcombiner will generate: asr w1, X, #31 w1 = splat sign bit. add X, X, w1, lsr #28 X = X + 0 or pow2-1 asr w0, X, asr #4 w0 = X/pow2 However, the add + shifts is expensive, so generate: add w0, X, 15 w0 = X + pow2-1 cmp X, wzr X - 0 csel X, w0, X, lt X = (X < 0) ? X + pow2-1 : X; asr w0, X, asr 4 w0 = X/pow2 llvm-svn: 213758	2014-07-23 14:57:52 +00:00
Robert Khasanov	74acbb7767	[SKX] Enabling mask instructions: encoding, lowering KMOVB, KMOVW, KMOVD, KMOVQ, KNOTB, KNOTW, KNOTD, KNOTQ Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com> llvm-svn: 213757	2014-07-23 14:49:42 +00:00
Tim Northover	14ff2df05c	ARM: spot SBFX-compatbile code expressed with sign_extend_inreg We were assuming all SBFX-like operations would have the shl/asr form, but often when the field being extracted is an i8 or i16, we end up with a SIGN_EXTEND_INREG acting on a shift instead. Simple enough to check for though. llvm-svn: 213754	2014-07-23 13:59:12 +00:00
Tim Northover	7ad2a0e0c2	ARM: add patterns for [su]xta[bh] from just a shift. Although the final shifter operand is a rotate, this actually only matters for the half-word extends when the amount == 24. Otherwise folding a shift in is just as good. llvm-svn: 213753	2014-07-23 13:59:07 +00:00
James Molloy	bc9fed82cc	Enable partial libcall inlining for all targets by default. This pass attempts to speculatively use a sqrt instruction if one exists on the target, falling back to a libcall if the target instruction returned NaN. This was enabled for MIPS and System-Z, but is well guarded and is good for most targets - GCC does this for (that I've checked) X86, ARM and AArch64. llvm-svn: 213752	2014-07-23 13:33:00 +00:00
Tilmann Scheller	2727279117	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STRB instructions. The ARM ARM prohibits STRB instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STRB instructions with unpredictable behavior. llvm-svn: 213750	2014-07-23 13:03:47 +00:00
Tim Northover	35910d7fa8	AArch64: remove "arm64_be" support in favour of "aarch64_be". There really is no arm64_be: it was a useful fiction to test big-endian support while both backends existed in parallel, but now the only platform that uses the name (iOS) doesn't have a big-endian variant, let alone one called "arm64_be". llvm-svn: 213748	2014-07-23 12:58:11 +00:00
Tilmann Scheller	3352a58ddc	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STR instructions. The ARM ARM prohibits STR instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STR instructions with unpredictable behavior. llvm-svn: 213745	2014-07-23 12:38:17 +00:00
Tim Northover	e19bed7d33	AArch64: remove arm64 triple enumerator. Having both Triple::arm64 and Triple::aarch64 is extremely confusing, and invites bugs where only one is checked. In reality, the only legitimate difference between the two (arm64 usually means iOS) is also present in the OS part of the triple and that's what should be checked. We still parse the "arm64" triple, just canonicalise it to Triple::aarch64, so there aren't any LLVM-side test changes. llvm-svn: 213743	2014-07-23 12:32:47 +00:00
Andrea Di Biagio	842355e900	Revert r211771. It was: "[X86] Improve the selection of SSE3/AVX addsub instructions". This chang fully reverts r211771. That revision added a canonicalization rule which has the potential to causes a combine-cycle in the target-independent canonicalizing DAG combine. The plan is to move the logic that forms target specific addsub nodes as part of the lowering of shuffles. llvm-svn: 213736	2014-07-23 11:20:24 +00:00
Tilmann Scheller	c28f0d587d	[ARM] Add earlyclobber constraint to pre/post-indexed ARM STRH instructions. The post-indexed instructions were missing the constraint, causing unpredictable STRH instructions to be emitted. The earlyclobber constraint on the pre-indexed STR instructions is not strictly necessary, as the instruction selection for pre-indexed STR instructions goes through an additional layer of pseudo instructions which have the constraint defined, however it doesn't hurt to specify the constraint directly on the pre-indexed instructions as well, since at some point someone might create instances of them programmatically and then the constraint is definitely needed. llvm-svn: 213729	2014-07-23 08:12:51 +00:00
Juergen Ributzka	2581fa505f	[FastIsel][AArch64] Add support for the FastLowerCall and FastLowerIntrinsicCall target-hooks. This commit modifies the existing call lowering functions to be used as the FastLowerCall and FastLowerIntrinsicCall target-hooks instead. This enables patchpoint intrinsic lowering for AArch64. This fixes <rdar://problem/17733076> llvm-svn: 213704	2014-07-22 23:14:58 +00:00
Tim Northover	0942e39061	X86: drop relocations on __eh_frame sections globally. Without this, we produce non-extern relocations when targeting older OS X versions that ld64 can't cope with in the particular context of __eh_frame sections (who'd want generic relocation-processing anyway?). This means that an updated linker (ld64 from Xcode 3.2.6 or later) may be needed when targeting such platforms with a modern version of LLVM, but this is probably the case anyway and a reasonable requirement. PR20212, rdar://problem/17544795 llvm-svn: 213665	2014-07-22 15:47:09 +00:00
Sasa Stankovic	319f0ff3b7	[mips] Fix two patterns that select i32's (for MIPS32r6) / i64's (for MIPS64r6) from setne comparison with an i32. The patterns that are fixed: * (select (i32 (setne i32, immZExt16)), i32, i32) (for MIPS32r6) * (select (i32 (setne i32, immZExt16)), i64, i64) (for MIPS64r6) llvm-svn: 213653	2014-07-22 13:36:02 +00:00
Elena Demikhovsky	f164859efc	AVX-512: Fixed intrinsic of VSQRTPS/PD instructions. I set number and types of parameters according to GCC intrinsics. llvm-svn: 213640	2014-07-22 11:07:31 +00:00
Saleem Abdulrasool	913666f9bc	R600: silence GCC warning GCC believes it may be possible to not return a value from the switch: lib/Target/R600/SIRegisterInfo.cpp:187:1: warning: control reaches end of non-void function [-Wreturn-type] Add an unreachable label to indicate that this is not possible and still permit switch coverage checking. llvm-svn: 213572	2014-07-21 17:52:00 +00:00
Tom Stellard	bda32c9e47	R600/SI: Refactor VOP3 instruction definitions llvm-svn: 213571	2014-07-21 17:44:29 +00:00
Tom Stellard	e5a1cdab47	R600/SI: Separate encoding and operand definitions into their own classes llvm-svn: 213570	2014-07-21 17:44:28 +00:00
Tom Stellard	f757b5ddc2	R600/SI: Initailize encoding fields of unused VOP3 modifiers to 0 llvm-svn: 213564	2014-07-21 17:12:40 +00:00
Tom Stellard	ca000c6c7b	R600/SI: Initialize unused VOP3 sources to 0 instead of SIOperand.ZERO llvm-svn: 213563	2014-07-21 17:12:37 +00:00
Tom Stellard	1aaad6970c	R600/SI: Add instruction shrinking pass This pass converts 64-bit instructions to 32-bit when possible. llvm-svn: 213561	2014-07-21 16:55:33 +00:00
Tom Stellard	63797d4a23	R600/SI: VOPC instructions explicitly define VCC Therefore we don't need to add it to the implict defs list. llvm-svn: 213558	2014-07-21 16:27:24 +00:00

1 2 3 4 5 ...

29289 Commits