llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrea Di Biagio	a03624d8ab	[X86] Add a check for 'isMOVHLPSMask' within method 'isShuffleMaskLegal'. Before this change, method 'isShuffleMaskLegal' didn't know that shuffles implementing a 'movhlps' operation were perfectly legal for SSE targets. This patch adds the missing check for 'isMOVHLPSMask' inside method 'isShuffleMaskLegal' to fix the problem. The reason why it is important to do this is because the DAGCombiner conservatively avoids combining a pair of shuffles if the resulting shuffle node has an illegal mask. Before this patch, shuffles with a MOVHLPS mask were wrongly considered not to be legal. This was the root cause of some poor-code generation bugs. llvm-svn: 213137	2014-07-16 11:29:39 +00:00
Justin Bogner	0c274aed23	unittests: Actually test reverse iterators in Path tests This re-enables some #if 0'd code (since 2010) in the Path unittests and makes at least a weak effort at testing sys::path's rbegin/rend. This change was inspired by some test failures near uses of rbegin and rend here: http://lab.llvm.org:8011/builders/clang-x86_64-linux-vg/builds/3209 The "valgrind was whining" comment looked promising in terms of a simpler to debug case of the same errors. However, it appears that the valgrind complaints the comment was referring to are distinct from the ones in the frontend, since this updated test isn't complaining for me under valgrind. In any case, the disabled tests weren't helping anybody. llvm-svn: 213125	2014-07-16 08:18:58 +00:00
Reid Kleckner	56b56ea15b	Roundtrip the inalloca bit on allocas through bitcode This was an oversight in the original support. As it is, I stuffed this bit into the alignment. The alignment is stored in log2 form, so it doesn't need more than 5 bits, given that Value::MaximumAlignment is 1 << 29. Reviewers: nicholas Differential Revision: http://reviews.llvm.org/D3943 llvm-svn: 213118	2014-07-16 01:34:27 +00:00
Manuel Jacob	b4db99c76a	Fix comment in InstCombiner::visitAddrSpaceCast. In the original version of the patch the behaviour was like described in the comment. This behaviour was changed before committing it without updating the comment. llvm-svn: 213117	2014-07-16 01:34:21 +00:00
Hans Wennborg	21f0f13394	Perform wildcard expansion in Process::GetArgumentVector on Windows (PR17098) On Windows, wildcard expansion isn't performed by the shell, but left to the program itself. The common way to do this is to link with setargv.obj, which performs the expansion on argc/argv before main is entered. However, we don't use argv in Clang on Windows, but instead call GetCommandLineW so we can handle unicode arguments. This means we have to do wildcard expansion ourselves. A test case will be added on the Clang side. Differential Revision: http://reviews.llvm.org/D4529 llvm-svn: 213114	2014-07-16 00:52:11 +00:00
Tyler Nowicki	641d8a06bd	Emit warnings if vectorization is forced and fails. This patch modifies the existing DiagnosticInfo system to create a generic base class that is inherited to produce diagnostic-based warnings. This is used by the loop vectorizer to trigger a warning when vectorization is forced and fails. Several tests have been added to verify this behavior. Reviewed by: Arnold Schwaighofer llvm-svn: 213110	2014-07-16 00:36:00 +00:00
Juergen Ributzka	480872b4ce	Remove TLI from isInTailCallPosition's arguments. NFC. There is no need to pass on TLI separately to the function. As Eric pointed out the Target Machine already provides everything we need. llvm-svn: 213108	2014-07-16 00:01:22 +00:00
Matt Arsenault	22ca3f8860	R600/SI: Allow using f32 rcp / rsq when denormals not handled. These are precise enough to use for OpenCL unless denormals are handled. llvm-svn: 213107	2014-07-15 23:50:10 +00:00
David Majnemer	3821ff03cd	X86: Simplify X86WindowsTargetObjectFile::getSectionForConstant There exists a helper function to abstract away the various differences between ConstantVector, ConstantDataVector, ConstantAggregateZero, etc. Use it to simplify X86WindowsTargetObjectFile::getSectionForConstant. llvm-svn: 213104	2014-07-15 23:01:10 +00:00
Sanjay Patel	a2f658d69d	Move Post RA Scheduling flag bit into SchedMachineModel Refactoring; no functional changes intended Removed PostRAScheduler bits from subtargets (X86, ARM). Added PostRAScheduler bit to MCSchedModel class. This bit is set by a CPU's scheduling model (if it exists). Removed enablePostRAScheduler() function from TargetSubtargetInfo and subclasses. Fixed the existing enablePostMachineScheduler() method to use the MCSchedModel (was just returning false!). Added methods to TargetSubtargetInfo to allow overrides for AntiDepBreakMode, CriticalPathRCs, and OptLevel for PostRAScheduling. Added enablePostRAScheduler() function to PostRAScheduler class which queries the subtarget for the above values. Preserved existing scheduler behavior for ARM, MIPS, PPC, and X86: a. ARM overrides the CPU's postRA settings by enabling postRA for any non-Thumb or Thumb2 subtarget. b. MIPS overrides the CPU's postRA settings by enabling postRA for everything. c. PPC overrides the CPU's postRA settings by enabling postRA for everything. d. X86 is the only target that actually has postRA specified via sched model info. Differential Revision: http://reviews.llvm.org/D4217 llvm-svn: 213101	2014-07-15 22:39:58 +00:00
Peter Collingbourne	9947c49812	[dfsan] Introduce further optimization to reduce the number of union queries. Specifically, do not compute a union if it is statically known that one shadow set subsumes the other. llvm-svn: 213100	2014-07-15 22:13:19 +00:00
Alp Toker	b9aadfa673	CMake: avoid a reconfigure loop from r213091 Removing the native CMakeCache.txt causes the target to get re-run needlessly on some systems. We'll want another solution for that part of the fix. llvm-svn: 213099	2014-07-15 22:11:54 +00:00
Matt Arsenault	0d89e849bd	R600/SI: Fix select on i1 llvm-svn: 213096	2014-07-15 21:44:37 +00:00
David Blaikie	87e92eb439	Try out FileCheck's new (in r212810) -implicit-check-not in a DebugInfo test. Just tried this on a few tests and this was the only one that was easily ported to use the new feature, so we'll go with that for now. Hopefully can act as inspiration/reminder for other tests. Not all debug info tests need to check for every DW_TAG or NULL child terminator, but perhaps they should (just to ensure they don't accidentally end up with tags nested inside other tags without the test failing, for example) llvm-svn: 213092	2014-07-15 21:06:37 +00:00
Alp Toker	a507bb5903	CMake: fix cross-compilation with external source directories This adds support for building native artifacts when cross-compiling using the popular side-by-side source directory layout (no symlinks, no nested repositories). llvm-svn: 213091	2014-07-15 21:04:12 +00:00
Duncan P. N. Exon Smith	f51601c856	ADT: Add MapVector::remove_if Add a `MapVector::remove_if()` that erases items in bulk in linear time, as opposed to quadratic time for repeated calls to `MapVector::erase()`. llvm-svn: 213090	2014-07-15 20:24:56 +00:00
Matt Arsenault	e9fa3b8e6b	R600/SI: Implement less wrong f32 fdiv Assuming single precision denormals and accurate sqrt/div are not reported, this passes the OpenCL conformance test. llvm-svn: 213089	2014-07-15 20:18:31 +00:00
Matt Arsenault	1d077749ea	R600: Add predicate for UnsafeFPMath llvm-svn: 213088	2014-07-15 20:18:24 +00:00
Matt Arsenault	84446a026b	R600: Remove intrinsics that appear to be unused llvm-svn: 213087	2014-07-15 20:10:27 +00:00
Lang Hames	84bc818baf	[RuntimeDyld] Revert r211652 - MachO object GDB registration support. The registration scheme used in r211652 violated the read-only contract of MemoryBuffer. This caused crashes in llvm-rtdyld where macho objects were backed by read-only mmap'd memory. llvm-svn: 213086	2014-07-15 19:35:22 +00:00
Duncan P. N. Exon Smith	db88e31e1a	ADT: Fix MapVector::erase() Actually update the changed indexes in the map portion of `MapVector` when erasing from the middle. Add a unit test that checks for this. Note that `MapVector::erase()` is a linear time operation (it was and still is). I'll commit a new method in a moment called `MapVector::remove_if()` that deletes multiple entries in linear time, which should be slightly less painful. llvm-svn: 213084	2014-07-15 18:32:30 +00:00
Duncan P. N. Exon Smith	34e5ea6c9f	ADT: Add "end namespace" comment This keeps clang-format from deleting the preceding newline. llvm-svn: 213082	2014-07-15 18:06:56 +00:00
Chris Bieneman	03695ab57e	[RegisterCoalescer] Add new subtarget hook allowing targets to opt-out of coalescing. The coalescer is very aggressive at propagating constraints on the register classes, and the register allocator doesn’t know how to split sub-registers later to recover. This patch provides an escape valve for targets that encounter this problem to limit coalescing. This patch also implements such for ARM to lower register pressure when using lots of large register classes. This works around PR18825. llvm-svn: 213078	2014-07-15 17:18:41 +00:00
Tilmann Scheller	fc14cefeea	[AArch64] Add negative tests for the SIMD & FP LDP instructions. LDP is unpredictable if the registers in the pair are identical, these tests check that we don't assemble instructions like that and error out instead. llvm-svn: 213074	2014-07-15 16:33:24 +00:00
Cameron McInally	44f3e30cf2	Revert r213070. It's breaking the build in MCELFStreamer::EmitInstToData(...). llvm-svn: 213073	2014-07-15 16:24:24 +00:00
Jan Vesely	6ddb8dd442	R600: Implement zero undef variants of ctlz/cttz v2: use ffbh/l if available v3: Rebase on top of Matt's SI patches Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 213072	2014-07-15 15:51:09 +00:00
Daniel Sanders	a6e125f07e	[mips] Correct .MIPS.abiflags fp_abi field for -mfpxx and without .module Summary: Previously all the test cases set it after initialization with '.module fp=xx'. Differential Revision: http://reviews.llvm.org/D4489 llvm-svn: 213071	2014-07-15 15:31:39 +00:00
Cameron McInally	53bc7a3330	Add x86 patterns to match a specific add-with-carry. llvm-svn: 213070	2014-07-15 15:03:32 +00:00
Andrea Di Biagio	bd5555cc3f	[DAGCombiner] Add more rules to fold shuffles. This patch adds two new rules to the DAGCombiner: 1. shuffle (shuffle A, Undef, M0), B, M1 -> shuffle A, B, M2 2. shuffle (shuffle A, Undef, M0), A, M1 -> shuffle A, Undef, M2 We only do this if the combined shuffle is legal for the target. Example: ;; define <4 x float> @test(<4 x float> %a, <4 x float> %b) { %1 = shufflevector <4 x float> %a, <4 x float> undef, <4 x i32><i32 6, i32 0, i32 1, i32 7> %2 = shufflevector <4 x float> %1, <4 x float> %b, <4 x i32><i32 1, i32 2, i32 4, i32 5> ret <4 x i32> %2 } ;; (using llc -mcpu=corei7 -march=x86-64) Before, the x86 backend generated: pshufd $120, %xmm0, %xmm0 shufps $-108, %xmm0, %xmm1 movaps %xmm1, %xmm0 Now the x86 backend generates: movsd %xmm1, %xmm0 llvm-svn: 213069	2014-07-15 13:26:28 +00:00
NAKAMURA Takumi	04b8b37f56	Prune Redundant libdeps in CMake's target_link_libraries and LLVMBuild.txt. I checked this with Release+Asserts on x86_64-mingw32. Please restore partially if this were overkill. llvm-svn: 213064	2014-07-15 11:37:03 +00:00
Andrea Di Biagio	04d5a7b337	Silence a warning in conditional expression. Fixes a gcc warning caused by a typo. A redundant assignment operation was accidentally used as the third operand of a conditional expression. No functional change intended. llvm-svn: 213061	2014-07-15 10:53:44 +00:00
Stepan Dyatkovskiy	dee612d4f6	MergeFunc patch from Björn Steinbrink. Phabricator ticket: D4246, Don't merge functions with different range metadata on call/invoke. Thanks! llvm-svn: 213060	2014-07-15 10:46:51 +00:00
Tim Northover	e4b8e138e1	AArch64: fall back to generic code for out of range extract/insert. rdar://problem/17624784 llvm-svn: 213059	2014-07-15 10:00:26 +00:00
David Majnemer	d4d9944416	Fix typo in comment No functionality changed. llvm-svn: 213052	2014-07-15 07:11:32 +00:00
Juergen Ributzka	8f073c8d60	[FastISel][X86] Remove no longer needed functions. llvm-svn: 213051	2014-07-15 06:35:53 +00:00
Juergen Ributzka	3566c08dd9	[FastISel][X86] Implement the FastLowerIntrinsicCall hook. Rename X86VisitIntrinsicCall -> FastLowerIntrinsicCall, which effectively implements the target hook. llvm-svn: 213050	2014-07-15 06:35:50 +00:00
Juergen Ributzka	23d43318c7	[FastISel][X86] Implement the FastLowerCall hook. This implements the FastLowerCall hook, which is based on the DoSelectCall function. The implementation is very similar, but the target-independent call lowering part has been factored out. This should also enable patchpoint intrinsic lowering for FastISel on X86. Related to <rdar://problem/17427052>. llvm-svn: 213049	2014-07-15 06:35:47 +00:00
Juergen Ributzka	5ee9d90248	Revert "[FastISel][X86] Remove no longer needed functions." Revert "[FastISel][X86] Implement the FastLowerIntrinsicCall hook." Revert "[FastISel][X86] Implement the FastLowerCall hook." This reverts commit r213035, r213036, and r213037 to make the buildbots happy again. llvm-svn: 213048	2014-07-15 05:23:40 +00:00
Peter Collingbourne	705a1ae3c8	[dfsan] Introduce an optimization to reduce the number of union queries. Specifically, when building a union query, if we are dominated by an identical query then use the result of that query instead. llvm-svn: 213047	2014-07-15 04:41:17 +00:00
Peter Collingbourne	83def1cbe7	[dfsan] Move combineShadows to DFSanFunction in preparation for it to use a domtree. llvm-svn: 213046	2014-07-15 04:41:14 +00:00
Peter Collingbourne	818f5c4837	Give SplitBlockAndInsertIfThen the ability to update a domtree. llvm-svn: 213045	2014-07-15 04:40:27 +00:00
David Majnemer	e0ae47f06c	Some targets don't prefix private symbols with dot llvm-svn: 213042	2014-07-15 03:00:41 +00:00
David Majnemer	1d8fe265f5	Specify a more specific triple for constant-pool-remat-0.ll Instead of specifying 32-bit x86, specify 32-bit x86 linux. This test is testing a very specific behavior which changed with WinCOFF's constant pools. llvm-svn: 213041	2014-07-15 03:00:39 +00:00
David Majnemer	ee9447cc0e	Relax tests expecting to see CPI symbols WinCOFF doesn't use CPI symbols, it has a different scheme for naming constant pool entries. Update tests to handle either appearing. llvm-svn: 213039	2014-07-15 02:44:49 +00:00
David Majnemer	4e3ccc0505	CodeGen: Handle ConstantVector and undef in WinCOFF constant pools The constant pool entry code for WinCOFF assumed that vector constants would be formed using ConstantDataVector, it did not expect to see a ConstantVector. Furthermore, it did not expect undef as one of the elements of the vector. ConstantVectors should be handled like ConstantDataVectors, treat Undef as zero. llvm-svn: 213038	2014-07-15 02:34:12 +00:00
Juergen Ributzka	9fbf33d70f	[FastISel][X86] Remove no longer needed functions. llvm-svn: 213037	2014-07-15 02:22:56 +00:00
Juergen Ributzka	170f9354bb	[FastISel][X86] Implement the FastLowerIntrinsicCall hook. Rename X86VisitIntrinsicCall -> FastLowerIntrinsicCall, which effectively implements the target hook. llvm-svn: 213036	2014-07-15 02:22:53 +00:00
Juergen Ributzka	a9cced8a94	[FastISel][X86] Implement the FastLowerCall hook. This implements the FastLowerCall hook, which is based on the DoSelectCall function. The implementation is very similar, but the target-independent call lowering part has been factored out. This should also enable patchpoint intrinsic lowering for FastISel on X86. Related to <rdar://problem/17427052>. llvm-svn: 213035	2014-07-15 02:22:49 +00:00
Juergen Ributzka	718bb71ade	[FastISel] Insert patchpoint instruction before the target generated call instruction. The patchpoint instruction should have been inserted before the target generated call instruction to be inside the ADJSTACKDOWN/ADJSTACKUP call sequence window. llvm-svn: 213034	2014-07-15 02:22:46 +00:00
Juergen Ributzka	a415943590	[FastISel] Fix patchpoint lowering to set the result register. Always update the value map with the result register (if there is one), for the patchpoint instruction we created to replace the target-specific call instruction. llvm-svn: 213033	2014-07-15 02:22:43 +00:00
Matt Arsenault	ca3976f7ae	R600: Add dag combine for copy of an illegal type. This helps avoid redundant instructions to unpack, and repack the vectors. Ideally we could recognize that pattern and eliminate it. Currently v4i8 and other small element type vectors are scalarized, so this has the added bonus of avoiding that. llvm-svn: 213031	2014-07-15 02:06:31 +00:00
Matt Arsenault	f1a7e62033	Teach computeKnownBits to look through addrspacecast. This fixes inferring alignment through an addrspacecast. llvm-svn: 213030	2014-07-15 01:55:03 +00:00
Andrea Di Biagio	138551bb15	Improve test 'CodeGen/X86/combine-vec-shuffle-3.ll'. Now functions 'test4', 'test9', 'test14' and 'test19' correctly perform a move of two packed values from the high quadword of vector %b to the low quadword of vector %a (movhlps idiom). No functional change intended. llvm-svn: 213029	2014-07-15 01:29:27 +00:00
Reid Kleckner	15fe7a530d	Document the maximum LLVM IR alignment, which is 1 << 29 or 0.5 GiB Add verifier checks. We already check these in the assembly parser, but a frontend producing IR in memory wouldn't hit those checks. llvm-svn: 213027	2014-07-15 01:16:09 +00:00
Matt Arsenault	70f4db88d5	Teach GetUnderlyingObject / BasicAA about addrspacecast llvm-svn: 213025	2014-07-15 00:56:40 +00:00
Nick Lewycky	7a63c3b389	Revert r212572 "improve BasicAA CS-CS queries", it causes PR20303. llvm-svn: 213024	2014-07-15 00:53:38 +00:00
Matt Arsenault	ee54aaef0c	Convert test to FileCheck. Check the individual test functions for more useful failure errors. llvm-svn: 213021	2014-07-15 00:07:27 +00:00
Andrea Di Biagio	2152a6c78b	[DAGCombiner] Avoid calling method 'isShuffleMaskLegal' on illegal vector types. This patch fixes a crasher in method 'DAGCombiner::visitOR' due to an invalid call to method 'isShuffleMaskLegal'. On x86, method 'isShuffleMaskLegal' always expects a legal vector value type in input. With this patch, we immediately check if the input OR dag node has a legal vector type; we only try to fold a OR dag node into a single shufflevector if we know that the resulting shuffle will have a legal type. This is to avoid calling method 'isShuffleMaskLegal' on a potentially illegal vector value type. Added a new test-case to file 'CodeGen/X86/combine-or.ll' to verify that DAGCombiner doesn't crash in the attempt to check/combine an OR between shuffles with illegal types. llvm-svn: 213020	2014-07-15 00:02:32 +00:00
Matt Arsenault	f171cf23b8	R600: Add denormal handling subtarget features. llvm-svn: 213018	2014-07-14 23:40:49 +00:00
Matt Arsenault	c6ae7b4763	R600/SI: Default to no single precision denormals. llvm-svn: 213017	2014-07-14 23:40:43 +00:00
Alp Toker	6be46b8ae5	Revert "Revert "Move clang feature flags settings out of LLVM core and into cfe"" It turns out this commit was fine. The problem was in the legacy build system (fixed r213010). This reverts commit r213005. llvm-svn: 213015	2014-07-14 23:30:31 +00:00
Lang Hames	c832ae3eae	[RuntimeDyld] Handle endiannes differences between the host and target while reading MachO files magic numbers in RuntimeDyld. This is required now that we're testing cross-platform JITing (via RuntimeDyldChecker), and should fix some issues that David Fang has seen on PPC builds. llvm-svn: 213012	2014-07-14 23:19:50 +00:00
Adam Nemet	cf7c905cfb	[X86] Specify all TSFlags bit-offsets symbolically No functional change. The offsets for the other bitfields are specified symbolically. I need to increase the size for one of the earlier fields which is easier after this cleanup. Why these bits are relative to VEXShift is a bit strange but that is for another cleanup. I made sure that the values for the enums are unchanged after this change. llvm-svn: 213011	2014-07-14 23:18:39 +00:00
David Majnemer	8bce66b093	CodeGen: Stick constant pool entries in COMDAT sections for WinCOFF COFF lacks a feature that other object file formats support: mergeable sections. To work around this, MSVC sticks constant pool entries in special COMDAT sections so that each constant is in it's own section. This permits unused constants to be dropped and it also allows duplicate constants in different translation units to get merged together. This fixes PR20262. Differential Revision: http://reviews.llvm.org/D4482 llvm-svn: 213006	2014-07-14 22:57:27 +00:00
Alp Toker	2034ac8181	Revert "Move clang feature flags settings out of LLVM core and into cfe" This broke one of the builds, presumably side-by-side modular CMake. Investigating. This reverts commit r212998. llvm-svn: 213005	2014-07-14 22:54:22 +00:00
Alp Toker	6a10223d4a	Fix a -Wunused-local-typedefs warning llvm-svn: 213002	2014-07-14 22:46:45 +00:00
Andrea Di Biagio	3960a9571f	[DAGCombiner] Add more rules to combine shuffle vector dag nodes. This patch teaches the DAGCombiner how to fold a pair of shuffles according to rules: 1. shuffle(shuffle A, B, M0), B, M1) -> shuffle(A, B, M2) 2. shuffle(shuffle A, B, M0), A, M1) -> shuffle(A, B, M3) The new rules would only trigger if the resulting shuffle has legal type and legal mask. Added test 'combine-vec-shuffle-3.ll' to verify that DAGCombiner correctly folds shuffles on x86 when the resulting mask is legal. Also added some negative cases to verify that we avoid introducing illegal shuffles. llvm-svn: 213001	2014-07-14 22:46:26 +00:00
Matt Arsenault	caa9c71f32	Look through addrspacecast in IsConstantOffsetFromGlobal llvm-svn: 213000	2014-07-14 22:39:26 +00:00
Matt Arsenault	fd78d0c934	Look through addrspacecast in GetPointerBaseWithConstantOffset llvm-svn: 212999	2014-07-14 22:39:22 +00:00
Alp Toker	7746d9b033	Move clang feature flags settings out of LLVM core and into cfe clang r212997 incorporated these settings into its own build system. They no longer need to be set from LLVM. llvm-svn: 212998	2014-07-14 22:19:04 +00:00
David Majnemer	5a1c4b8283	CodeGen: Add a getSectionKind method to MachineConstantPoolEntry This is just a helper routine, no functionality has changed. llvm-svn: 212993	2014-07-14 22:06:29 +00:00
Matt Arsenault	eb9e5f41a6	Convert test to FileCheck llvm-svn: 212992	2014-07-14 21:59:26 +00:00
David Majnemer	54b2d64cdc	ADT: Surface LowerCase argument for utohexstr The underlying function. utohex_buffer, already supports an argument for deciding if the hex characters should be upper or lower case. Expose an identical argument for utohexstr. llvm-svn: 212991	2014-07-14 21:56:54 +00:00
Sanjay Patel	a56f8c227c	removed circular definitions in comments llvm-svn: 212990	2014-07-14 21:51:59 +00:00
Justin Bogner	759645ea89	Support: Fix option handling when using cl::Required with aliasopt Until now, attempting to create an alias of a required option would complain if the user supplied the alias, because the required option didn't have a value. Similarly, if you said the alias was required, then using the base option would complain that the alias wasn't supplied. Lastly, if you put required on both, neither option would work. By changning alias to overload addOccurrence and setting cl::Required on the original option, we can get this to behave in a more useful way. I've also added a test and updated a user that was getting this wrong. llvm-svn: 212986	2014-07-14 20:53:57 +00:00
David Majnemer	b8f435ca70	Fix a test broken in r212981 @icmp_sdiv_neg1 should have referred to %a instead of %call, it was renamed at the last second. llvm-svn: 212983	2014-07-14 20:46:04 +00:00
David Majnemer	af9180fd04	InstSimplify: Correct sdiv x / -1 Determining the bounds of x/ -1 would start off with us dividing it by INT_MIN. Suffice to say, this would not work very well. Instead, handle it upfront by checking for -1 and mapping it to the range: [INT_MIN + 1, INT_MAX. This means that the result of our division can be any value other than INT_MIN. llvm-svn: 212981	2014-07-14 20:38:45 +00:00
Sanjay Patel	8c98230248	fixed link llvm-svn: 212977	2014-07-14 19:52:36 +00:00
David Majnemer	5ea4fc0b33	InstSimplify: The upper bound of X / C was missing a rounding step Summary: When calculating the upper bound of X / -8589934592, we would perform the following calculation: Floor[INT_MAX / 8589934592] However, flooring the result would make us wrongly come to the conclusion that 1073741824 was not in the set of possible values. Instead, use the ceiling of the result. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4502 llvm-svn: 212976	2014-07-14 19:49:57 +00:00
Justin Bogner	973b2ff322	Support: Use a range-based for llvm-svn: 212973	2014-07-14 19:24:13 +00:00
Matt Arsenault	199b39e063	Look through addrspacecast when checking isDereferenceablePointer llvm-svn: 212971	2014-07-14 18:54:12 +00:00
Nick Lewycky	703e488ed9	Don't eliminate memcpy's when the address of the pointer may itself be relevant. Fixes PR18304. Patch by David Wiberg! llvm-svn: 212970	2014-07-14 18:52:02 +00:00
Bill Wendling	c80b6c92e2	Unify the lowering of arguments during SjLj prepare. The 'select true, %arg, undef' instruction can be used for both aggregate and non-aggregate arguments. llvm-svn: 212967	2014-07-14 18:21:11 +00:00
Sanjay Patel	b49bf168f2	fixed typo llvm-svn: 212966	2014-07-14 18:21:07 +00:00
Matt Arsenault	d0d6c0b4c9	Use pointer type cast helpers. llvm-svn: 212963	2014-07-14 17:24:38 +00:00
Matt Arsenault	740980ee69	Add CreatePointerBitCastOrAddrSpaceCast to IRBuilder and co. llvm-svn: 212962	2014-07-14 17:24:35 +00:00
Matt Arsenault	62c836fd3b	Update comments to include addrspacecast llvm-svn: 212961	2014-07-14 17:24:31 +00:00
Matt Arsenault	73064e1429	Remove GCC 3.3 workaround llvm-svn: 212960	2014-07-14 17:11:20 +00:00
Saleem Abdulrasool	b51d464f1e	X86: correct 64-bit atomics on 32-bit We would emit a libcall for a 64-bit atomic on x86 after SVN r212119. This was due to the misuse of hasCmpxchg16 to indicate if cmpxchg8b was supported on a 32-bit target. They were added at different times and would result in the border condition being mishandled. This fixes the border case to emit the cmpxchg8b instruction for 64-bit atomic operations on x86 at the cost of restoring a long-standing bug in the codegen. We emit a cmpxchg8b on all x86 targets even where the CPU does not support this instruction (pre-Pentium CPUs). Although this bug should be fixed, this was present prior to SVN r212119 and this change, so this is not really introducing a regression. llvm-svn: 212956	2014-07-14 16:28:13 +00:00
Saleem Abdulrasool	271ac58eb3	CodeGen: add missing include Found during windows unwinding work. This header is indirectly included through a chain leading through Support/Win64EH.h. Explicitly include the header. NFC. llvm-svn: 212955	2014-07-14 16:28:09 +00:00
David Majnemer	8f6b04cb57	llvm-objdump: Handle BSS sections larger than the object file The size of the uninitialized sections, like BSS, can exceed the size of the object file. Do not attempt to grab the contents of such sections. llvm-svn: 212953	2014-07-14 16:20:14 +00:00
Tim Northover	6c647eae8b	X86: remove temporary atomicrmw used during lowering. We construct a temporary "atomicrmw xchg" instruction when lowering atomic stores for widths that aren't supported natively. This isn't on the top-level worklist though, so it won't be removed automatically and we have to do it ourselves once that itself has been lowered. Thanks Saleem for pointing this out! llvm-svn: 212948	2014-07-14 15:31:13 +00:00
Daniel Sanders	41ffa5d1ba	Re-commit: [mips] Correct section alignments and EntrySizes for .bss, .text, .data, .reginfo, .MIPS.options, and .MIPS.abiflags The lld tests will temporarily fail again but Simon Atanasyan will commit a fix for those shortly. llvm-svn: 212946	2014-07-14 15:05:51 +00:00
Daniel Sanders	cb0d36e592	Revert: [mips] Correct section alignments and EntrySizes for .bss, .text, .data, .reginfo, .MIPS.options, and .MIPS.abiflags This commit causes multiple lld tests to fail. Reverting while I investigate the issue. llvm-svn: 212945	2014-07-14 14:43:45 +00:00
Daniel Sanders	8e254166e1	[mips] Correct section alignments and EntrySizes for .bss, .text, .data, .reginfo, .MIPS.options, and .MIPS.abiflags Summary: .bss, .text, and .data are at least 16-byte aligned. .reginfo is 4-byte aligned and has a 24-byte EntrySize. .MIPS.abiflags has an 24-byte EntrySize. .MIPS.options is 8-byte aligned and has 1-byte EntrySize. Using a 1-byte EntrySize for .MIPS.options seems strange because the records are neither 1-byte long nor fixed-length but this matches the value that GAS emits. Differential Revision: http://reviews.llvm.org/D4487 llvm-svn: 212939	2014-07-14 14:02:14 +00:00
Daniel Sanders	7ddb0ab85f	[mips] For the FP64A ABI, odd-numbered double-precision moves must not use mtc1/mfc1. Summary: This is because the FP64A the hardware will redirect 32-bit reads/writes from/to odd-numbered registers to the upper 32-bits of the corresponding even register. In effect, simulating FR=0 mode when FR=0 mode is not available. Unfortunately, we have to make the decision to avoid mfc1/mtc1 before register allocation so we currently do this for even registers too. FPXX has a similar requirement on 32-bit architectures that lack mfhc1/mthc1 so this patch also handles the affected moves from the FPU for FPXX too. Moves to the FPU were supported by an earlier commit. Differential Revision: http://reviews.llvm.org/D4484 llvm-svn: 212938	2014-07-14 13:08:14 +00:00
Daniel Sanders	24e08fd5c0	[mips] Use MFHC1 when it is available (MIPS32r2 and later) for both FP32 and FP64 moves Summary: This is similar to r210771 which did the same thing for MTHC1. Also corrected MTHC1_D32 and MTHC1_D64 which used AFGR64 and FGR64 on the wrong definitions. Differential Revision: http://reviews.llvm.org/D4483 llvm-svn: 212936	2014-07-14 12:41:31 +00:00
NAKAMURA Takumi	6a931f507a	[CMake][Win32.DLL] Let llvm_add_library(SHARED) link dependent libraries as PRIVATE. For example, c-index-test.exe requires just libclang.dll (its import library). When libraries in libclang were not PRIVATE but PUBLIC, c-index-test required libraries transitive by libclang. Note, on mingw with BUILD_SHARED_LIBS, library dependencies would become more strict. In principle, required libraries should be "required in its source file". This will help to detect missing dependencies. llvm-svn: 212934	2014-07-14 12:26:15 +00:00
Tim Northover	3cb24110b1	AArch64: remove unnecessary pseudo-instruction. Sufficiently twisted use of TableGen lets us write patterns directly for f16 (as an i16 promoted to i32) -> f32 conversion. llvm-svn: 212933	2014-07-14 11:16:02 +00:00
Daniel Sanders	9ee2aee859	[mips] Correct the AFL_FLAGS1_ODDSPREG flag in .MIPS.abiflags when no '.module oddspreg' is used Differential Revision: http://reviews.llvm.org/D4486 llvm-svn: 212932	2014-07-14 10:26:15 +00:00
Sasa Stankovic	b976fee83c	[mips] Expand BuildPairF64 to a spill and reload when the O32 FPXX ABI is enabled and mthc1 and dmtc1 are not available (e.g. on MIPS32r1) This prevents the upper 32-bits of a double precision value from being moved to the FPU with mtc1 to an odd-numbered FPU register. This is necessary to ensure that the code generated executes correctly regardless of the current FPU mode. MIPS32r2 and above continues to use mtc1/mthc1, while MIPS-IV and above continue to use dmtc1. Differential Revision: http://reviews.llvm.org/D4465 llvm-svn: 212930	2014-07-14 09:40:29 +00:00
Bill Wendling	151b44d653	Support lowering of empty aggregates. This crash was pretty common while compiling Rust for iOS (armv7). Reason - SjLj preparation step was lowering aggregate arguments as ExtractValue + InsertValue. ExtractValue has assertion which checks that there is some data in value, which is not true in case of empty (no fields) structures. Rust uses them quite extensively so this patch uses a 'select true, %val, undef' instruction to lower the argument. Patch by Valerii Hiora. llvm-svn: 212922	2014-07-14 06:22:36 +00:00
NAKAMURA Takumi	5c40508457	[CMake] LINK_COMPONENTS: Add also corresponding MCTargetDesc and TargetInfo as well, when target names or "nativecodegen" are specified. llvm-svn: 212921	2014-07-14 05:07:07 +00:00
NAKAMURA Takumi	23b702c8de	[CMake] Update libdeps. llvm-svn: 212920	2014-07-14 05:01:53 +00:00
NAKAMURA Takumi	c3b3897e8a	NVPTX/LLVMBuild.txt: Add "Scalar" to required_libraries. It is really referenced. llvm-svn: 212918	2014-07-14 02:52:19 +00:00
NAKAMURA Takumi	a56f860e29	Object/LLVMBuild.txt: Sort required_libraries by alphabetical order. llvm-svn: 212917	2014-07-14 02:52:08 +00:00
Andrea Di Biagio	67d8b2e2b0	[DAGCombiner] Fix a crash caused by a missing check for legal type when trying to fold shuffles. Verify that DAGCombiner does not crash when trying to fold a pair of shuffles according to rule (added at r212539): (shuffle (shuffle A, Undef, M0), Undef, M1) -> (shuffle A, Undef, M2) The DAGCombiner avoids folding shuffles if the resulting shuffle dag node is not legal for the target. That means, the resulting shuffle must have legal type and legal mask. Before, the DAGCombiner only called method 'TargetLowering::isShuffleMaskLegal' to check if it was "safe" to fold according to the above-mentioned rule. However, this caused a crash in the x86 backend since method 'isShuffleMaskLegal' always expects to be called on a legal vector type. llvm-svn: 212915	2014-07-13 21:02:14 +00:00
Saleem Abdulrasool	c7c3cb1f4e	MC: make MCWin64EHInstruction a POD-like struct This is the first of a number of changes designed to generalise MCWin64EHInstruction to support different target architectures. An ordered set (vector) of these instructions is saved per frame to permit the emission of information for Windows NT style unwinding. The only bit of information which is actually target specific here is the Opcode for the unwinding bytecode. The remainder of the information is simply generic information that is relevant to the Windows NT unwinding model. Remove the accessors for the fields, making them const and public instead. Sink the knowledge of the alias'ed name into the single source and sink a single-use check method into the use. llvm-svn: 212914	2014-07-13 19:03:45 +00:00
Saleem Abdulrasool	5705cd8cf6	MC: make helper function be more const-correct Introduce const-ness on parameters, they are used as read-only and should not be modified. NFC. llvm-svn: 212913	2014-07-13 19:03:40 +00:00
Saleem Abdulrasool	3f3cefd392	MC: make DWARF and Windows unwinding handling more similar Rename member variables and functions for the MCStreamer for DWARF-like unwinding management. Rename the Windows ones as well and make the naming and handling similar across the two. No functional change intended. llvm-svn: 212912	2014-07-13 19:03:36 +00:00
Simon Atanasyan	6abd4be930	Add forgotten `break` statement. llvm-svn: 212910	2014-07-13 16:18:56 +00:00
Simon Atanasyan	1cd169f137	[Mips] Support SHT_MIPS_ABIFLAGS section type flag in the llvm-readobj, obj2yaml and yaml2obj tools. llvm-svn: 212908	2014-07-13 15:28:54 +00:00
NAKAMURA Takumi	b5bb1e2fc6	[CMake] Enable loadable modules, aka plugins, with BUILD_SHARED_LIBS on cygming. Loadable modules could be enabled without BUILD_SHARED_LIBS with tweaks in future. llvm-svn: 212907	2014-07-13 13:47:37 +00:00
NAKAMURA Takumi	c4fc6eec53	[CMake] Add LLVM_LINK_COMPONENTS to loadable modules, LLVMHello and BugpointPasses, on Win32. llvm-svn: 212904	2014-07-13 13:36:48 +00:00
NAKAMURA Takumi	44f64eadd0	[CMake] Introduce moddir for MODULE -- corresponding to LIBRARY_OUTPUT_DIRECTORY. On Win32.DLL, it points not lib but bin. LIBRARY_OUTPUT_DIRECTORY affects add_library(MODULE), especially Win32.DLL. llvm-svn: 212903	2014-07-13 13:33:26 +00:00
NAKAMURA Takumi	f0e30f6e5b	bugpoint/ToolRunner.cpp: ProcessFailure(): Close ErrorFD immediately, or it couldn't be reopened on Win32. FIXME: We may have an option in openFileForWrite(), not to use ResultFD but to close it. llvm-svn: 212902	2014-07-13 13:28:18 +00:00
David Majnemer	ebc741168b	IR: Allow comdats to be applied to globals with internal linkage Our verifier check for checking if a global has local linkage was too strict. Forbid private linkage but permit local linkage. Object file formats permit this and forbidding it prevents elimination of unused, internal, vftables under the MSVC ABI. llvm-svn: 212900	2014-07-13 04:56:11 +00:00
David Majnemer	299674e94f	MC: Let non-temporary COFF aliases be in symtab MC was aping a binutils bug where aliases would default their linkage to private instead of internal. I've sent a patch to the binutils maintainers and they've recently applied it to the GNU assembler sources. This fixes PR20152. Differential Revision: http://reviews.llvm.org/D4395 llvm-svn: 212899	2014-07-13 04:31:19 +00:00
Matt Arsenault	c3f6a7e44e	Remove unused include llvm-svn: 212898	2014-07-13 03:08:59 +00:00
Matt Arsenault	d32dbb6a10	R600: Use range for and fix missing consts. llvm-svn: 212897	2014-07-13 03:06:43 +00:00
Matt Arsenault	762af96f46	R600: Make ShaderType private llvm-svn: 212896	2014-07-13 03:06:39 +00:00
Matt Arsenault	7d5e2cb09f	R600: Run more tests with promote alloca disabled. Re-run tests changed in r211110 to test both paths. Also fix broken check line. llvm-svn: 212895	2014-07-13 02:46:17 +00:00
Matt Arsenault	d0b6f3e173	R600: Run private-memory test with and without alloca promote The unpromoted path still needs to be tested since we can't always promote to using LDS. llvm-svn: 212894	2014-07-13 02:18:06 +00:00
Matt Arsenault	d9a23ab20d	R600: Add option to disable promote alloca This can make writing some tests harder, so add a flag to disable it. llvm-svn: 212893	2014-07-13 02:08:26 +00:00
Matt Arsenault	1871d8a080	Try to fix MSVC warning. llvm-svn: 212889	2014-07-12 23:16:26 +00:00
Matt Arsenault	3c0514f34f	Try to fix MSVC build llvm-svn: 212888	2014-07-12 23:09:02 +00:00
Matt Arsenault	93b775f479	Try to fix MSVC build llvm-svn: 212886	2014-07-12 22:19:49 +00:00
Matt Arsenault	4181ea36a9	Templatify DominanceFrontier. Theoretically this should now work for MachineBasicBlocks. llvm-svn: 212885	2014-07-12 21:59:52 +00:00
Saleem Abdulrasool	f74d48a011	AArch64: add support for llvm.aarch64.hint intrinsic This adds a llvm.aarch64.hint intrinsic to mirror the llvm.arm.hint in order to support the various hint intrinsic functions in the ACLE. Add an optional pattern field that permits the subclass to specify the pattern that matches the selection. The intrinsic pattern is set as mayLoad, mayStore, so overload the value for the definition of the hint instruction. llvm-svn: 212883	2014-07-12 21:20:49 +00:00
Saleem Abdulrasool	db514056de	MC: remove use of unnecessary variable Due to the fact that the windows unwinding has the concept of chained frames, we maintain a current frame info pointer that is adjusted on any push and pop of a unwinding context. This just removes an unnecessary variable that was used to mirror the DWARF unwinding code. llvm-svn: 212882	2014-07-12 20:49:13 +00:00
Saleem Abdulrasool	4a1a2f7790	MC: rename MCW64UnwindInfo to MCWinFrameInfo This structure contains information related to the call frame used to generate unwinding information. Rename this to reflect the future use to represent the shared state between various architectures for WinCFI information. llvm-svn: 212881	2014-07-12 20:49:09 +00:00
Simon Atanasyan	8ebb6aed9b	[ELFYAML] Group ELF section type flags to target specific blocks. Recognize only flags which correspond to the current target. llvm-svn: 212880	2014-07-12 18:25:08 +00:00
Owen Anderson	a8d1c3e74e	Fix an issue with the MergeBasicBlockIntoOnlyPred() helper function where it did not properly handle the case where the predecessor block was the entry block to the function. The only in-tree client of this is JumpThreading, which worked around the issue in its own code. This patch moves the solution into the helper so that JumpThreading (and other clients) do not have to replicate the same fix everywhere. llvm-svn: 212875	2014-07-12 07:12:47 +00:00
Alexey Samsonov	15c9669615	[ASan] Collect unmangled names of global variables in Clang to print them in error reports. Currently ASan instrumentation pass creates a string with global name for each instrumented global (to include global names in the error report). Global name is already mangled at this point, and we may not be able to demangle it at runtime (e.g. there is no __cxa_demangle on Android). Instead, create a string with fully qualified global name in Clang, and pass it to ASan instrumentation pass in llvm.asan.globals metadata. If there is no metadata for some global, ASan will use the original algorithm. This fixes https://code.google.com/p/address-sanitizer/issues/detail?id=264. llvm-svn: 212872	2014-07-12 00:42:52 +00:00
Matt Arsenault	6026858158	R600: Add missing tests for some intrinsics llvm-svn: 212870	2014-07-12 00:36:19 +00:00
Duncan P. N. Exon Smith	6075510839	BFI: Add constructor for Weight llvm-svn: 212868	2014-07-12 00:26:00 +00:00
Duncan P. N. Exon Smith	345c287da9	BFI: Clean up BlockMass Implementation is small now -- the interesting logic was moved to `BranchProbability` a while ago. Move it into `bfi_detail` and get rid of the related TODOs. I was originally planning to define it within `BlockFrequencyInfoImpl` (or `BFIIBase`), but it seems cleaner in a namespace. Besides, `isPodLike` needs to be specialized before `BlockMass` can be used in some of the other data structures, and there isn't a clear way to do that. llvm-svn: 212866	2014-07-12 00:21:30 +00:00
Reid Kleckner	9ef63b2836	Option: Propagate flags from groups to options in each group This should make it easy to set a flag for a whole group of clang driver options. llvm-svn: 212865	2014-07-12 00:18:58 +00:00
Lang Hames	5d10284238	[RuntimeDyld] Fix stub size and offset for AArch64 in RuntimeDyldMachO.h. <rdar://problem/17648000> llvm-svn: 212864	2014-07-12 00:16:47 +00:00
Reid Kleckner	fb9519838a	Avoid a warning from MSVC on "*/" in this code by inserting a space llvm-svn: 212862	2014-07-12 00:06:46 +00:00
Duncan P. N. Exon Smith	b5650e5eae	BFI: Mark the end of namespaces llvm-svn: 212861	2014-07-11 23:56:50 +00:00
Lang Hames	cb314ceaa9	[RuntimeDyld] Add GOT support for AArch64 to RuntimeDyldMachO. Test cases to follow once RuntimeDyldChecker supports introspection of stubs. Fixes <rdar://problem/17648000> llvm-svn: 212859	2014-07-11 23:52:07 +00:00
Juergen Ributzka	d755e9f730	Revert "[FastISel][X86] Implement the FastLowerIntrinsicCall hook." This reverts commit r212851, because it broke the memset lowering. llvm-svn: 212855	2014-07-11 23:10:08 +00:00
Juergen Ributzka	04b444913b	[FastISel][X86] Implement the FastLowerIntrinsicCall hook. Rename X86VisitIntrinsicCall -> FastLowerIntrinsicCall, which effectively implements the target hook. llvm-svn: 212851	2014-07-11 22:37:43 +00:00
Alexey Samsonov	08f022ae84	[ASan] Introduce a struct representing the layout of metadata entry in llvm.asan.globals. No functionality change. llvm-svn: 212850	2014-07-11 22:36:02 +00:00
Juergen Ributzka	3d9e6755e4	[FastISel] Add target-independent patchpoint intrinsic support. WIP. This implements the target-independent lowering for the patchpoint intrinsic. Targets have to implement the FastLowerCall hook to support this intrinsic. Related to <rdar://problem/17427052> llvm-svn: 212849	2014-07-11 22:19:02 +00:00
Juergen Ributzka	8179e9e5ad	[FastISel] Add basic infrastructure to support a target-independent call lowering hook in FastISel. WIP The infrastructure mimics the call lowering we have already in place for SelectionDAG, but with limitations. For example structure return demotion and non-simple types are not supported (yet). Currently every backend has its own implementation and duplicated code for call lowering. There is also no specified interface that could be called from target-independent code. The target-hook is opt-in and doesn't affect current implementations. llvm-svn: 212848	2014-07-11 22:01:42 +00:00
Aditya Nandakumar	0b5a674243	When we sink an instruction, this can open up opportunity for the operands to be sunk - add them to the worklist llvm-svn: 212847	2014-07-11 21:49:39 +00:00
Argyrios Kyrtzidis	730abd2f4a	Move the API and implementation of clang::driver::getARMCPUForMArch() to llvm::Triple::getARMCPUForArch(). Suggested by Eric Christopher. llvm-svn: 212846	2014-07-11 21:44:54 +00:00
Juergen Ributzka	4ce9863d0b	[FastISel] Make isInTailCallPosition independent of SelectionDAG. Break out the arguemnts required from SelectionDAG, so that this function can also be used by FastISel. llvm-svn: 212844	2014-07-11 20:50:47 +00:00
Juergen Ributzka	5dd32136b9	[FastISel] Breakout intrinsic lowering into a separate function and add a target-hook. Create a separate helper function for target-independent intrinsic lowering. Also add an target-hook that allows to directly call into a target-sepcific intrinsic lowering method. Currently the implementation is opt-in and doesn't affect existing target implementations. llvm-svn: 212843	2014-07-11 20:42:12 +00:00
Kevin Enderby	fe6ad97ca8	Add the "-s" flag to llvm-nm for Mach-O files that prints symbols only in the specified section. This is same functionality as darwin’s nm(1) "-s" flag. There is one FIXME in the code and I’m all ears to anyone that can help me with that. This option takes exactly two strings and should be allowed anywhere on the command line. Such that "llvm-nm -s __TEXT __text foo.o" would work. But that does not as the CommandLine Library does not have a way to make this work as far as I can tell. For now the "-s __TEXT __text" has to be last on the command line. llvm-svn: 212842	2014-07-11 20:30:00 +00:00
Alp Toker	48bbd061bc	Simplify the raw_svector_ostream tweak from r212816 The memcpy() and overlap helps didn't help much with timings, so clean up the change. The difference at this point is that we now leave growth of the storage buffer up to SmallVector's implementation: - OS.reserve(OS.capacity() * 2); + OS.reserve(OS.size() + 64); llvm-svn: 212837	2014-07-11 18:23:08 +00:00
Ulrich Weigand	0a51abc100	[MC] Constify MCELF::GetVisibility and MCELF::getOther These two routines didn't take a "const MCSymbolData &SD" like the other MCELF::Get routines for some reason ... llvm-svn: 212834	2014-07-11 17:34:44 +00:00
Ulrich Weigand	ea147a9d43	[PowerPC] Fix invalid displacement created by LocalStackAlloc This commit fixes a bug in PPCRegisterInfo::isFrameOffsetLegal that could result in the LocalStackAlloc pass creating an MI instruction out-of-range displacement: %vreg17<def> = LD 33184, %vreg31; mem:LD8[%g](align=32) %G8RC:%vreg17 G8RC_and_G8RC_NOX0:%vreg31 (In final assembler output the top bits are stripped off, resulting in a negative offset loading from below the stack pointer.) Common code expects the isFrameOffsetLegal routine to verify whether adding a given offset to the offset already present in the instruction results in a valid displacement. However, on PowerPC the routine did not take the already present instruction offset into account. This commit fixes isFrameOffsetLegal to add the instruction offset, and updates a local caller (needsFrameBaseReg) to no longer add the instruction offset itself before calling isFrameOffsetLegal. Reviewed by Hal Finkel. llvm-svn: 212832	2014-07-11 17:19:31 +00:00
Marek Olsak	eac5062cc0	R600/SI: Use i32 vectors for resources and samplers This affects new intrinsics only. What surprises me is that v32i8 still works. llvm-svn: 212831	2014-07-11 17:11:52 +00:00
Marek Olsak	d8ecaeec02	R600/SI: add sample and image intrinsics exposing all instruction fields We need the intrinsics with offsets, so why not just add them all. The R128 parameter will also be useful for reducing SGPR usage. GL_ARB_image_load_store also adds some image GLSL modifiers like "coherent", so Mesa will probably translate those to slc, glc, etc. When LLVM 3.5 is released, I'll switch Mesa to these new intrinsics. llvm-svn: 212830	2014-07-11 17:11:46 +00:00
Marek Olsak	ba77c3e4ed	R600/SI: fix shadow mapping for 1D and 2D array textures It was conflicting with def TEX_SHADOW_ARRAY, which also handles them. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 212829	2014-07-11 17:11:39 +00:00
Timur Iskhodzhanov	6de08c3dc4	Add a test case for r212596 llvm-svn: 212828	2014-07-11 16:32:53 +00:00
NAKAMURA Takumi	767ee96534	llvm/test/BugPoint/compile-custom.ll: Use explicit %python to invoke a test script, compile-custom.ll.py, for shebang-incapable hosts. llvm-svn: 212820	2014-07-11 14:44:10 +00:00
NAKAMURA Takumi	ca3b03a574	llvm/test/lit.cfg: Let %python available. llvm-svn: 212819	2014-07-11 14:36:39 +00:00
NAKAMURA Takumi	506c0dca5d	[CMake] add_llvm_library: Add "RUNTIME DESTINATION bin" to install(). It affects add_library(SHARED) for Win32.DLL. llvm-svn: 212818	2014-07-11 14:36:28 +00:00
Alp Toker	bc4d1a3604	raw_svector_ostream: grow and reserve atomically Including the scratch buffer size in the initial reservation eliminates the subsequent malloc+move operation and offers a healthier constant growth with less memory wastage. When doing this, take care to avoid invalidating the source buffer. llvm-svn: 212816	2014-07-11 14:02:04 +00:00
Oliver Stannard	6eda6ffc0c	ARM: Allow __fp16 as a function arg or return type for AArch64 ACLE 2.0 allows __fp16 to be used as a function argument or return type. This enables this for AArch64. llvm-svn: 212812	2014-07-11 13:33:46 +00:00
Alexander Kornienko	56ccdbbd29	Add FileCheck -implicit-check-not option to allow stricter tests without adding too many CHECK-NOTs manually. Summary: Add FileCheck -implicit-check-not option which allows specifying a pattern that should only occur in the input when explicitly matched by a positive check. This feature allows checking tool diagnostics in a way clang -verify does it for compiler diagnostics. The option has been tested on a number of clang-tidy checks, I'll post a link to the clang-tidy patch to this thread. Once there's an agreement on the general direction, I can add tests and documentation. Reviewers: djasper, bkramer Reviewed By: bkramer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4462 llvm-svn: 212810	2014-07-11 12:39:32 +00:00
Quentin Colombet	0f179c4d8a	[X86] Fix the inversion of low and high bits for the lowering of MUL_LOHI. Also add a few comments. <rdar://problem/17581756> llvm-svn: 212808	2014-07-11 12:08:23 +00:00
Marcello Maggioni	83442bb8b2	Added test for commit r212802 that was missing llvm-svn: 212803	2014-07-11 10:36:00 +00:00
Marcello Maggioni	78035b11ec	Fixup PHIs in LowerSwitch when a Leaf node is not emitted. This commit fixes bug http://llvm.org/bugs/show_bug.cgi?id=20103. Thanks to Qwertyuiop for the report and the proposed fix. llvm-svn: 212802	2014-07-11 10:34:36 +00:00
Adam Nemet	26f817497c	[X86] AVX512: Improve readability of isCDisp8 No functional change. As I was trying to understand this function, I found that variables were reused with confusing names and the broadcast case was a bit too implicit. Hopefully, this is an improvement. llvm-svn: 212795	2014-07-11 05:23:25 +00:00
Adam Nemet	e311c3c836	[X86] AVX512: Simplify logic in isCDisp8 It was computing the VL/n case as: MemObjSize = VectorByteSize / ElemByteSize / Divider * ElemByteSize ElemByteSize not only falls out but VectorByteSize/Divider now actually matches the definition of VL/n. Also some formatting fixes. llvm-svn: 212794	2014-07-11 05:23:12 +00:00
David Blaikie	de1e1a60e8	Revert "Reapply "DebugInfo: Ensure that all debug location scope chains from instructions within a function, lead to the function itself."" This reverts commit r212776. Nope, still seems to be failing on the sanitizer bots... but hey, not the msan self-host anymore, it's failing in asan now. I'll start looking there next. llvm-svn: 212793	2014-07-11 02:42:57 +00:00
Mark Heffernan	675d401a26	Partially fix PR20058: reduce compile time for loop unrolling with very high count by reducing calls to SE->forgetLoop llvm-svn: 212782	2014-07-10 23:30:06 +00:00
Lang Hames	87c025bb17	[RuntimeDyld] Replace a crufty old ARM RuntimeDyld test with a new one that uses RuntimeDyldChecker. This allows us to remove one of the six remaining object files in the LLVM source tree. llvm-svn: 212780	2014-07-10 23:29:11 +00:00
Lang Hames	16086b984e	[RuntimeDyld] Improve error diagnostic in RuntimeDyldChecker. The compiler often emits assembler-local labels (beginning with 'L') for use in relocation expressions, however these aren't included in the object files. Teach RuntimeDyldChecker to warn the user if they try to use one of these in an expression, since it will never work. llvm-svn: 212777	2014-07-10 23:26:20 +00:00
David Blaikie	3ca92d2406	Reapply "DebugInfo: Ensure that all debug location scope chains from instructions within a function, lead to the function itself." Committed in r212205 and reverted in r212226 due to msan self-hosting failure, I believe I've got that fixed by r212761 to Clang. Original commit message: "Originally committed in r211723, reverted in r211724 due to failure cases found and fixed (ArgumentPromotion: r211872, Inlining: r212065), committed again in r212085 and reverted again in r212089 after fixing some other cases, such as debug info subprogram lists not keeping track of the function they represent (r212128) and then short-circuiting things like LiveDebugVariables that build LexicalScopes for functions that might not have full debug info. And again, I believe the invariant actually holds for some reasonable amount of code (but I'll keep an eye on the buildbots and see what happens... ). Original commit message: PR20038: DebugInfo: Inlined call sites where the caller has debug info but the call itself has no debug location. This situation does bad things when inlined, so I've fixed Clang not to produce inlinable call sites without locations when the caller has debug info (in the one case where I could find that this occurred). This updates the PR20038 test case to be what clang now produces, and readds the assertion that had to be removed due to this bug. I've also beefed up the debug info verifier to help diagnose these issues in the future, and I hope to add checks to the inliner to just assert-fail if it encounters this situation. If, in the future, we decide we have to cope with this situation, the right thing to do is probably to just remove all the DebugLocs from the inlined instructions." llvm-svn: 212776	2014-07-10 22:59:39 +00:00
David Blaikie	d9bdaf00ff	This test case doesn't actually need the inliner to reproduce the input. llvm-svn: 212775	2014-07-10 22:57:40 +00:00
Jan Vesely	2cb62ce2a0	R600: Implement float to long/ulong Use alg. from LegalizeDAG.cpp Move Expand setting to SIISellowering v2: Extend existing tests instead of creating new ones v3: use separate LowerFPTOSINT function v4: use TargetLowering::expandFP_TO_SINT add comment about using FP_TO_SINT for uints Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 212773	2014-07-10 22:40:21 +00:00
Jan Vesely	eca89d283e	SelectionDAG: Factor FP_TO_SINT lower code out of DAGLegalizer Move the code to a helper function to allow calls from TypeLegalizer. No functionality change intended Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> Reviewed-by: Owen Anderson <resistor@mac.com> llvm-svn: 212772	2014-07-10 22:40:18 +00:00
Brad Smith	733cb6437d	Use the integrated assembler by default on OpenBSD. llvm-svn: 212771	2014-07-10 22:37:28 +00:00
Zoran Jovanovic	f34b454219	[mips] Emit two CFI offset directives per double precision SDC1/LDC1 instead of just one for FR=1 registers Differential Revision: http://reviews.llvm.org/D4310 llvm-svn: 212769	2014-07-10 22:23:30 +00:00
Andrea Di Biagio	2f6d821b88	Extend the test coverage in combine-vec-shuffle-2.ll adding some negative tests. Add test cases where we don't expect to trigger the combine optimizations introduced at revision 212748. No functional change intended. llvm-svn: 212756	2014-07-10 18:59:41 +00:00
Matt Arsenault	3332b70627	Revert "Revert r212640, "Add trunc (select c, a, b) -> select c (trunc a), (trunc b) combine."" Don't try to convert the select condition type. llvm-svn: 212750	2014-07-10 18:21:04 +00:00
Andrea Di Biagio	b2921c7ca0	[DAG] Further improve the logic in DAGCombiner that folds a pair of shuffles into a single shuffle if the resulting mask is legal. This patch teaches the DAGCombiner how to fold shuffles according to the following new rules: 1. shuffle(shuffle(x, y), undef) -> x 2. shuffle(shuffle(x, y), undef) -> y 3. shuffle(shuffle(x, y), undef) -> shuffle(x, undef) 4. shuffle(shuffle(x, y), undef) -> shuffle(y, undef) The backend avoids to combine shuffles according to rules 3. and 4. if the resulting shuffle does not have a legal mask. This is to avoid introducing illegal shuffles that are potentially expanded into a sub-optimal sequence of target specific dag nodes during vector legalization. Added test case combine-vec-shuffle-2.ll to verify that we correctly triggers the new rules when combining shuffles. llvm-svn: 212748	2014-07-10 18:04:55 +00:00
Akira Hatanaka	7cc27649a6	[X86] Mark pseudo instruction TEST8ri_NOEREX as hasSIdeEffects=0. Also, add a case clause in X86InstrInfo::shouldScheduleAdjacent to enable macro-fusion. <rdar://problem/15680770> llvm-svn: 212747	2014-07-10 18:00:53 +00:00
Eric Christopher	54fe1b260c	Add the CSR company and the Kalimba DSP processor to Triple. Patch by Matthew Gardiner with fixes by me. llvm-svn: 212745	2014-07-10 17:26:54 +00:00
Eric Christopher	22405e4bbf	Make it possible for the Subtarget to change between function passes in the mips back end. This, unfortunately, required a bit of churn in the various predicates to use a pointer rather than a reference. llvm-svn: 212744	2014-07-10 17:26:51 +00:00
Duncan P. N. Exon Smith	04934b0fec	InstCombine: Fix a crash in Descale for multiply-by-zero Fix a crash in `InstCombiner::Descale()` when a multiply-by-zero gets created as an argument to a GEP partway through an iteration, causing -instcombine to optimize the GEP before the multiply. rdar://problem/17615671 llvm-svn: 212742	2014-07-10 17:13:27 +00:00
David Majnemer	ed33243e86	IR: Aliases don't belong to an explicit comdat Aliases inherit their comdat from their aliasee, they don't have an explicit comdat. This fixes PR20279. llvm-svn: 212732	2014-07-10 16:26:10 +00:00
Hal Finkel	511fea7acd	Feeding isSafeToSpeculativelyExecute its DataLayout pointer (in Sink) This is the one remaining place I see where passing isSafeToSpeculativelyExecute a DataLayout pointer might matter (at least for loads) -- I think I got the others in r212720. Most of the other remaining callers of isSafeToSpeculativelyExecute only use it for call sites (or otherwise exclude loads). llvm-svn: 212730	2014-07-10 16:07:11 +00:00
David Majnemer	99ef236542	Mips: Silence a -Wcovered-switch-default Remove a default label which covered no enumerators, replace it with a llvm_unreachable. No functionality changed. llvm-svn: 212729	2014-07-10 16:04:04 +00:00
Zoran Jovanovic	255d00dc23	[mips] Added FPXX modeless calling convention. Differential Revision: http://reviews.llvm.org/D4293 llvm-svn: 212726	2014-07-10 15:36:12 +00:00
Arnaud A. de Grandmaison	f643231163	[AArch64] Add logical alias instructions to MC AsmParser This patch teaches the AsmParser to accept some logical+immediate instructions and convert them as shown: bic Rd, Rn, #imm -> and Rd, Rn, #~imm bics Rd, Rn, #imm -> ands Rd, Rn, #~imm orn Rd, Rn, #imm -> orr Rd, Rn, #~imm eon Rd, Rn, #imm -> eor Rd, Rn, #~imm Those instructions are an alternate syntax available to assembly coders, and are needed in order to support code already compiling with some other assemblers. For example, the bic construct is used by the linux kernel. llvm-svn: 212722	2014-07-10 15:12:26 +00:00
Hal Finkel	a995f92627	Feeding isSafeToSpeculativelyExecute its DataLayout pointer isSafeToSpeculativelyExecute can optionally take a DataLayout pointer. In the past, this was mainly used to make better decisions regarding divisions known not to trap, and so was not all that important for users concerned with "cheap" instructions. However, now it also helps look through bitcasts for dereferencable loads, and will also be important if/when we add a dereferencable pointer attribute. This is some initial work to feed a DataLayout pointer through to callers of isSafeToSpeculativelyExecute, generally where one was already available. llvm-svn: 212720	2014-07-10 14:41:31 +00:00
Tim Northover	fee2adefba	AArch64: correctly fast-isel i8 & i16 multiplies We were asking for a register for type i8 or i16 which caused an assert. rdar://problem/17620015 llvm-svn: 212718	2014-07-10 14:18:46 +00:00
Daniel Sanders	7e527423f5	[mips] Add support for -modd-spreg/-mno-odd-spreg Summary: When -mno-odd-spreg is in effect, 32-bit floating point values are not permitted in odd FPU registers. The option also prohibits 32-bit and 64-bit floating point comparison results from being written to odd registers. This option has three purposes: * It allows support for certain MIPS implementations such as loongson-3a that do not allow the use of odd registers for single precision arithmetic. * When using -mfpxx, -mno-odd-spreg is the default and this allows us to statically check that code is compliant with the O32 FPXX ABI since mtc1/mfc1 instructions to/from odd registers are guaranteed not to appear for any reason. Once this has been established, the user can then re-enable -modd-spreg to regain the use of all 32 single-precision registers. * When using -mfp64 and -mno-odd-spreg together, an O32 extension named O32 FP64A is used as the ABI. This is intended to provide almost all functionality of an FR=1 processor but can also be executed on a FR=0 core with the assistance of a hardware compatibility mode which emulates FR=0 behaviour on an FR=1 processor. * Added '.module oddspreg' and '.module nooddspreg' each of which update the .MIPS.abiflags section appropriately * Moved setFpABI() call inside emitDirectiveModuleFP() so that the caller doesn't have to remember to do it. * MipsABIFlags now calculates the flags1 and flags2 member on demand rather than trying to maintain them in the same format they will be emitted in. There is one portion of the -mfp64 and -mno-odd-spreg combination that is not implemented yet. Moves to/from odd-numbered double-precision registers must not use mtc1. I will fix this in a follow-up. Differential Revision: http://reviews.llvm.org/D4383 llvm-svn: 212717	2014-07-10 13:38:23 +00:00
Zinovy Nis	cad431c122	[x32] Add AsmBackend for X32 which uses ELF32 with x86_64 (the author is Pavel Chupin). This is minimal change for backend required to have "hello world" compiled and working on x32 target (x86_64-linux-gnux32). More patches for x32 will follow. Differential Revision: http://reviews.llvm.org/D4181 llvm-svn: 212716	2014-07-10 13:03:26 +00:00
Chandler Carruth	0b666e0648	[x86,SDAG] Introduce any- and sign-extend-vector-inreg nodes analogous to the zero-extend-vector-inreg node introduced previously for the same purpose: manage the type legalization of widened extend operations, especially to support the experimental widening mode for x86. I'm adding both because sign-extend is expanded in terms of any-extend with shifts to propagate the sign bit. This removes the last fundamental scalarization from vec_cast2.ll (a test case that hit many really bad edge cases for widening legalization), although the trunc tests in that file still appear scalarized because the the shuffle legalization is scalarizing. Funny thing, I've been working on that. Some initial experiments with this and SSE2 scenarios is showing moderately good behavior already for sign extension. Still some work to do on the shuffle combining on X86 before we're generating optimal sequences, but avoiding scalarization is a huge step forward. llvm-svn: 212714	2014-07-10 12:32:32 +00:00
Richard Sandiford	02bb0ec368	[SystemZ] Use SystemZCallingConv.td to define callee-saved registers Just a clean-up. No behavioral change intended. llvm-svn: 212711	2014-07-10 11:44:37 +00:00
NAKAMURA Takumi	76d7e08a61	SpecialCaseList.h: Fix -Wdocumentation with \code. llvm-svn: 212710	2014-07-10 11:39:59 +00:00
NAKAMURA Takumi	e808b28ee1	llvm/test/CodeGen/X86/shift-parts.ll: FileCheck-ize. (from r212640) llvm-svn: 212709	2014-07-10 11:37:39 +00:00
NAKAMURA Takumi	f862ce8908	Revert r212640, "Add trunc (select c, a, b) -> select c (trunc a), (trunc b) combine." This caused miscompilation on, at least, x86-64. SExt(i1 cond) confused other optimizations. llvm-svn: 212708	2014-07-10 11:37:28 +00:00
Richard Sandiford	909aa3ad21	[SystemZ] Tweak instruction format classifications There's no real need to have Shift as a separate format type from Binary. The comments for other format types were too specific and in some cases no longer accurate. Just a clean-up, no behavioral change intended. llvm-svn: 212707	2014-07-10 11:29:23 +00:00
Chandler Carruth	df8d0caab7	[x86] Add another combine that is particularly useful for the new vector shuffle lowering: match shuffle patterns equivalent to an unpcklwd or unpckhwd instruction. This allows us to use generic lowering code for v8i16 shuffles and match the unpack pattern late. llvm-svn: 212705	2014-07-10 11:09:29 +00:00
Richard Sandiford	e66e8c8b66	[SystemZ] Add MC support for LEDBRA, LEXBRA and LDXBRA These instructions aren't used for codegen since the original L*DB instructions are suitable for fround. llvm-svn: 212703	2014-07-10 11:00:55 +00:00
Richard Sandiford	ca44614ac0	[SystemZ] Avoid using i8 constants for immediate fields Immediate fields that have no natural MVT type tended to use i8 if the field was small enough. This was a bit confusing since i8 isn't a legal type for the target. Fields for short immediates in a 32-bit or 64-bit operation use i32 or i64 instead, so it would be better to do the same for all fields. No behavioral change intended. llvm-svn: 212702	2014-07-10 10:52:51 +00:00
Richard Sandiford	ac1dba0fdf	[SystemZ] Fix FPR dwarf numbering The dwarf FPR numbers are supposed to have the order F0, F2, F4, F6, F1, F3, F5, F7, F8, etc., which matches the pairing of registers for long doubles. E.g. a long double stored in F0 is paired with F2. llvm-svn: 212701	2014-07-10 10:45:11 +00:00
Daniel Sanders	cbd44c591d	Make it possible for ints/floats to return different values from getBooleanContents() Summary: On MIPS32r6/MIPS64r6, floating point comparisons return 0 or -1 but integer comparisons return 0 or 1. Updated the various uses of getBooleanContents. Two simplifications had to be disabled when float and int boolean contents differ: - ScalarizeVecRes_VSELECT except when the kind of boolean contents is trivially discoverable (i.e. when the condition of the VSELECT is a SETCC node). - visitVSELECT (select C, 0, 1) -> (xor C, 1). Come to think of it, this one could test for the common case of 'C' being a SETCC too. Preserved existing behaviour for all other targets and updated the affected MIPS32r6/MIPS64r6 tests. This also fixes the pi benchmark where the 'low' variable was counting in the wrong direction because it thought it could simply add the result of the comparison. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, jholewinski, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D4389 llvm-svn: 212697	2014-07-10 10:18:12 +00:00
Chandler Carruth	853fa0ac8d	[x86] Expand the target DAG combining for PSHUFD nodes to be able to combine into half-shuffles through unpack instructions that expand the half to a whole vector without messing with the dword lanes. This fixes some redundant instructions in splat-like lowerings for v16i8, which are now getting to be really nice. llvm-svn: 212695	2014-07-10 09:57:36 +00:00
Chandler Carruth	a34a8e230d	[x86] Tweak the v16i8 single input special case lowering for shuffles that splat i8s into i16s. Previously, we would try much too hard to arrange a sequence of i8s in one half of the input such that we could unpack them into i16s and shuffle those into place. This isn't always going to be a cheaper i8 shuffle than our other strategies. The case where it is always going to be cheaper is when we can arrange all the necessary inputs into one half using just i16 shuffles. It happens that viewing the problem this way also makes it much easier to produce an efficient set of shuffles to move the inputs into one half and then unpack them. With this, our splat code gets one step closer to being not terrible with the new experimental lowering strategy. It also exposes two combines missing which I will add next. llvm-svn: 212692	2014-07-10 09:16:40 +00:00
Hal Finkel	a71fe078c8	A test case for not asserting in isDereferenceablePointer upon unsized types This is the test case for r212687. llvm-svn: 212688	2014-07-10 07:04:37 +00:00
Hal Finkel	66e23f126d	Fix isDereferenceablePointer not to try to take the size of an unsized type. I'll add a test-case shortly. llvm-svn: 212687	2014-07-10 06:06:11 +00:00
Hal Finkel	2e42c34d05	Allow isDereferenceablePointer to look through some bitcasts isDereferenceablePointer should not give up upon encountering any bitcast. If we're casting from a pointer to a larger type to a pointer to a small type, we can continue by examining the bitcast's operand. This missing capability was noted in a comment in the function. In order for this to work, isDereferenceablePointer now takes an optional DataLayout pointer (essentially all callers already had such a pointer available). Most code uses isDereferenceablePointer though isSafeToSpeculativelyExecute (which already took an optional DataLayout pointer), and to enable the LICM test case, LICM needs to actually provide its DL pointer to isSafeToSpeculativelyExecute (which it was not doing previously). llvm-svn: 212686	2014-07-10 05:27:53 +00:00
Saleem Abdulrasool	1e76cbdff7	MC: modernise for loop Convert a for loop to range bsaed form. NFC. llvm-svn: 212684	2014-07-10 04:50:09 +00:00
Saleem Abdulrasool	427c08d48b	MC: add and use an accessor for WinCFI This adds a utility method to access the WinCFI information in bulk and uses that to iterate rather than requesting the count and individually iterating them. This is in preparation for restructuring WinCFI handling to enable more clear sharing across architectures to enable unwind information emission for Windows on ARM. llvm-svn: 212683	2014-07-10 04:50:06 +00:00
Peter Collingbourne	8876c3face	Remove move assignment operator to appease older GCCs. llvm-svn: 212682	2014-07-10 04:39:40 +00:00
Chandler Carruth	7d2ffb5492	[x86] Initial improvements to the new shuffle lowering for v16i8 shuffles specifically for cases where a small subset of the elements in the input vector are actually used. This is specifically targetted at improving the shuffles generated for trunc operations, but also helps out splat-like operations. There is still some really low-hanging fruit here that I want to address but this is a huge step in the right direction. llvm-svn: 212680	2014-07-10 04:34:06 +00:00
Peter Collingbourne	05b9ebf2f9	Explicitly define move constructor and move assignment operator to appease MSVC. llvm-svn: 212679	2014-07-10 04:29:06 +00:00
Peter Collingbourne	d5feb7ba42	SpecialCaseList: use std::unique_ptr. llvm-svn: 212678	2014-07-10 03:55:02 +00:00
Hao Liu	71224b02fb	[AArch64]Fix an assertion failure in DAG Combiner about concating 2 build_vector. llvm-svn: 212677	2014-07-10 03:41:50 +00:00
Matt Arsenault	b0df92577d	R600/SI: Add support for llvm.convert.{to\|from}.fp16 llvm-svn: 212676	2014-07-10 03:22:20 +00:00
Matt Arsenault	3e3ddda7a2	Fix types in documentation. The examples were using f32, but the IR type is called float llvm-svn: 212675	2014-07-10 03:22:16 +00:00
Chandler Carruth	b3840a55ae	[x86] Refactor some of the new code for lowering v16i8 shuffles to remove duplication and make it easier to select different strategies. No functionality changed. llvm-svn: 212674	2014-07-10 02:24:26 +00:00
Peter Collingbourne	2e28edf8e1	[dfsan] Handle bitcast aliases. llvm-svn: 212668	2014-07-10 01:30:39 +00:00
Chandler Carruth	d3561f6fec	[SDAG] Make the new zext-vector-inreg node default to expand so targets don't need to set it manually. This is based on feedback from Tom who pointed out that if every target needs to handle this we need to reach out to those maintainers. In fact, it doesn't make sense to duplicate everything when anything other than expand seems unlikely at this stage. llvm-svn: 212661	2014-07-09 22:53:04 +00:00
David Blaikie	029bd3350e	Recommit r212203: Don't try to construct debug LexicalScopes hierarchy for functions that do not have top level debug information. Reverted by Eric Christopher (Thanks!) in r212203 after Bob Wilson reported LTO issues. Duncan Exon Smith and Aditya Nandakumar helped provide a reduced reproduction, though the failure wasn't too hard to guess, and even easier with the example to confirm. The assertion that the subprogram metadata associated with an llvm::Function matches the scope data referenced by the DbgLocs on the instructions in that function is not valid under LTO. In LTO, a C++ inline function might exist in multiple CUs and the subprogram metadata nodes will refer to the same llvm::Function. In this case, depending on the order of the CUs, the first intance of the subprogram metadata may not be the one referenced by the instructions in that function and the assertion will fail. A test case (test/DebugInfo/cross-cu-linkonce-distinct.ll) is added, the assertion removed and a comment added to explain this situation. Original commit message: If a function isn't actually in a CU's subprogram list in the debug info metadata, ignore all the DebugLocs and don't try to build scopes, track variables, etc. While this is possibly a minor optimization, it's also a correctness fix for an incoming patch that will add assertions to LexicalScopes and the debug info verifier to ensure that all scope chains lead to debug info for the current function. Fix up a few test cases that had broken/incomplete debug info that could violate this constraint. Add a test case where this occurs by design (inlining a debug-info-having function in an attribute nodebug function - we want this to work because /if/ the nodebug function is then inlined into a debug-info-having function, it should be fine (and will work fine - we just stitch the scopes up as usual), but should the inlining not happen we need to not assert fail either). llvm-svn: 212649	2014-07-09 21:02:41 +00:00
Alexey Samsonov	b7dd329f2f	Decouple llvm::SpecialCaseList text representation and its LLVM IR semantics. Turn llvm::SpecialCaseList into a simple class that parses text files in a specified format and knows nothing about LLVM IR. Move this class into LLVMSupport library. Implement two users of this class: * DFSanABIList in DFSan instrumentation pass. * SanitizerBlacklist in Clang CodeGen library. The latter will be modified to use actual source-level information from frontend (source file names) instead of unstable LLVM IR things (LLVM Module identifier). Remove dependency edge from ClangCodeGen/ClangDriver to LLVMTransformUtils. No functionality change. llvm-svn: 212643	2014-07-09 19:40:08 +00:00
Tim Northover	0f0a6c1e1d	Use simpler constructor for range adapter. It is a good idea, it's slightly clearer and simpler. Unfortunately the headline news is: we save one line! llvm-svn: 212641	2014-07-09 19:14:34 +00:00
Matt Arsenault	658c5576d1	Add trunc (select c, a, b) -> select c (trunc a), (trunc b) combine. Do this if the truncate is free and the select is legal. llvm-svn: 212640	2014-07-09 19:12:07 +00:00
Jim Grosbach	34cc92b475	AArch64: Better codegen for storing to __fp16. Storing will generally be immediately preceded by rounding from an f32 or f64, so make sure to match those patterns directly to convert into the FPR16 register class directly rather than going through the integer GPRs. This also eliminates an extra step in the convert-from-f64 path which was first converting to f32 and then to f16 from there. rdar://17594379 llvm-svn: 212638	2014-07-09 18:55:52 +00:00
Jim Grosbach	37b8093a8f	Change an assert() to a diagnostic. llvm-svn: 212637	2014-07-09 18:55:49 +00:00
Benjamin Kramer	c560a6cadc	TargetRegisterInfo: Remove function that fell out of use years ago. llvm-svn: 212636	2014-07-09 18:53:57 +00:00
Cameron McInally	0c01caa2ad	Update ReleaseNotes to mention Atomic NAND semantic changes. llvm-svn: 212635	2014-07-09 18:29:55 +00:00
Adam Nemet	2820a5b9e9	[X86] AVX512: Enable it in the Loop Vectorizer This lets us experiment with 512-bit vectorization without passing force-vector-width manually. The code generated for a simple integer memset loop is properly vectorized. Disassembly is still broken for it though :(. llvm-svn: 212634	2014-07-09 18:22:33 +00:00
Louis Gerbarg	1ce0c37bf0	Make AArch64FastISel::EmitIntExt explicitly check its source and destination types This is a follow up to r212492. There should be no functional difference, but this patch makes it clear that SrcVT must be an i1/i8/16/i32 and DestVT must be an i8/i16/i32/i64. rdar://17516686 llvm-svn: 212633	2014-07-09 17:54:32 +00:00
Sanjay Patel	7ae7a831b9	removed duplicate testcase llvm-svn: 212632	2014-07-09 17:49:58 +00:00
Sanjay Patel	58814445d4	Fix for PR20059 (instcombine reorders shufflevector after instruction that may trap) In PR20059 ( http://llvm.org/pr20059 ), instcombine eliminates shuffles that are necessary before performing an operation that can trap (srem). This patch calls isSafeToSpeculativelyExecute() and bails out of the optimization in SimplifyVectorOp() if needed. Differential Revision: http://reviews.llvm.org/D4424 llvm-svn: 212629	2014-07-09 16:34:54 +00:00
Daniel Sanders	c5626f4444	Add Imagination Technologies to the vendors in llvm::Triple Summary: This is a pre-requisite for supporting the mips-img-linux-gnu triple in clang. Differential Revision: http://reviews.llvm.org/D4435 llvm-svn: 212626	2014-07-09 16:03:10 +00:00
Tim Northover	ac002d3e34	Generic: add range-adapter for option parsing. I want to use it in lld, but while I'm here I'll update LLVM uses. llvm-svn: 212615	2014-07-09 13:03:37 +00:00
Chandler Carruth	5865a73a82	[x86] Fix a bug in my new zext-vector-inreg DAG trickery where we were not widening the input type to the node sufficiently to let the ext take place in a register. This would in turn result in a mysterious bitcast assertion failure downstream. First change here is to add back the helpful assert I had in an earlier version of the code to catch this immediately. Next change is to add support to the type legalization to detect when we have widened the operand either too little or too much (for whatever reason) and find a size-matched legal vector type to convert it to first. This can also fail so we get a new fallback path, but that seems OK. With this, we no longer crash on vec_cast2.ll when using widening. I've also added the CHECK lines for the zero-extend cases here. We still need to support sign-extend and trunc (or something) to get plausible code for the other two thirds of this test which is one of the regression tests that showed the most scalarization when widening was force-enabled. Slowly closing in on widening being a viable legalization strategy without it resorting to scalarization at every turn. =] llvm-svn: 212614	2014-07-09 12:36:54 +00:00
Chandler Carruth	14cad41e14	Sink two variables only used in an assert into the assert itself. Should fix the release builds with Werror. llvm-svn: 212612	2014-07-09 11:13:16 +00:00
Benjamin Kramer	d6f1733add	X86: When lowering v8i32 himuls use the correct shuffle masks for AVX2. Turns out my trick of using the same masks for SSE4.1 and AVX2 didn't work out as we have to blend two vectors. While there remove unecessary cross-lane moves from the shuffles so the backend can lower it to palignr instead of vperm. Fixes PR20118, a miscompilation of vector sdiv by constant on AVX2. llvm-svn: 212611	2014-07-09 11:12:39 +00:00
Chandler Carruth	afe4b2507e	[x86] Add a ZERO_EXTEND_VECTOR_INREG DAG node and use it when widening vector types to be legal and a ZERO_EXTEND node is encountered. When we use widening to legalize vector types, extend nodes are a real challenge. Either the input or output is likely to be legal, but in many cases not both. As a consequence, we don't really have any way to represent this situation and the prior code in the widening legalization framework would just scalarize the extend operation completely. This patch introduces a new DAG node to represent doing a zero extend of a vector "in register". The core of the idea is to allow legal but different vector types in the input and output. The output vector must have fewer lanes but wider elements. The operation is defined to zero extend the low elements of the input to the size of the output elements, and drop all of the high elements which don't have a corresponding lane in the output vector. It also includes generic expansion of this node in terms of blending a zero vector into the high elements of the vector and bitcasting across. This in turn yields extremely nice code for x86 SSE2 when we use the new widening legalization logic in conjunction with the new shuffle lowering logic. There is still more to do here. We need to support sign extension, any extension, and potentially int-to-float conversions. My current plan is to continue using similar synthetic nodes to model each of these transitions with generic lowering code for each one. However, with this patch LLVM already reaches performance parity with GCC for the core C loops of the x264 code (assuming you disable the hand-written assembly versions) when compiling for SSE2 and SSE3 architectures and enabling the new widening and lowering logic for vectors. Differential Revision: http://reviews.llvm.org/D4405 llvm-svn: 212610	2014-07-09 10:58:18 +00:00
Daniel Sanders	e31155fd1a	[mips][mips64r6] Correct select patterns that have the condition or true/false values backwards Summary: This bug caused SingleSource/Regression/C/uint64_to_float and SingleSource/UnitTests/2002-05-02-CastTest3 to fail (among others). Differential Revision: http://reviews.llvm.org/D4388 llvm-svn: 212608	2014-07-09 10:47:26 +00:00
Daniel Sanders	dc06718e0b	[mips][mips64r6] Correct cond names in the cmp.cond.[ds] instructions Summary: It seems we accidentally read the wrong column of the table MIPS64r6 spec and used the names for c.cond.fmt instead of cmp.cond.fmt. Differential Revision: http://reviews.llvm.org/D4387 llvm-svn: 212607	2014-07-09 10:40:20 +00:00
Chandler Carruth	ef5dcf571e	[x86] Initialize a pointer to null to fix a bug in r212602. This should restore GCC hosts (which happen to put the bad stuff into the pointer) and MSan, etc. llvm-svn: 212606	2014-07-09 10:36:42 +00:00
Daniel Sanders	f5a5fbd3f4	[mips][mips64r6] Use JALR for indirect branches instead of JR (which is not available on MIPS32r6/MIPS64r6) Summary: This completes the change to use JALR instead of JR on MIPS32r6/MIPS64r6. Reviewers: jkolek, vmedic, zoran.jovanovic, dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4269 llvm-svn: 212605	2014-07-09 10:21:59 +00:00
Daniel Sanders	338513b3fa	[mips][mips64r6] Use JALR for returns instead of JR (which is not available on MIPS32r6/MIPS64r6) Summary: RET, and RET_MM have been replaced by a pseudo named PseudoReturn. In addition a version with a 64-bit GPR named PseudoReturn64 has been added. Instruction selection for a return matches RetRA, which is expanded post register allocation to PseudoReturn/PseudoReturn64. During MipsAsmPrinter, this PseudoReturn/PseudoReturn64 are emitted as: - (JALR64 $zero, $rs) on MIPS64r6 - (JALR $zero, $rs) on MIPS32r6 - (JR_MM $rs) on microMIPS - (JR $rs) otherwise On MIPS32r6/MIPS64r6, 'jr $rs' is an alias for 'jalr $zero, $rs'. To aid development and review (specifically, to ensure all cases of jr are updated), these aliases are temporarily named 'r6.jr' instead of 'jr'. A follow up patch will change them back to the correct mnemonic. Added (JALR $zero, $rs) to MipsNaClELFStreamer's definition of an indirect jump, and removed it from its definition of a call. Note: I haven't accounted for MIPS64 in MipsNaClELFStreamer since it's doesn't appear to account for any MIPS64-specifics. The return instruction created as part of eh_return expansion is now expanded using expandRetRA() so we use the right return instruction on MIPS32r6/MIPS64r6 ('jalr $zero, $rs'). Also, fixed a misuse of isABI_N64() to detect 64-bit wide registers in expandEhReturn(). Reviewers: jkolek, vmedic, mseaborn, zoran.jovanovic, dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4268 llvm-svn: 212604	2014-07-09 10:16:07 +00:00
Daniel Sanders	123c38de3b	Add ability to emit internal instruction representation to CodeGen assembly output. Summary: This patch re-uses the implementation of 'llvm-mc -show-inst' and makes it available to llc as 'llc -asm-show-inst'. This is necessary to test parts of MIPS32r6/MIPS64r6 without resorting to 'llc -filetype=obj' tests. For example, on MIPS32r2 and earlier we use the 'jr $rs' instruction for indirect branches and returns. On MIPS32r6, we no longer have 'jr $rs' and use 'jalr $zero, $rs' instead. The catch is that, on MIPS32r6, 'jr $rs' is an alias for 'jalr $zero, $rs' and is the preferred way of writing this instruction. As a result, all MIPS ISA's emit 'jr $rs' in their assembly output and the assembler encodes this to different opcodes according to the ISA. Using this option, we can check that the MCInst really is a JR or a JALR by matching the emitted comment. This removes the need for a 'llc -filetype=obj' test. Reviewers: rafael, dsanders Reviewed By: dsanders Subscribers: zoran.jovanovic, llvm-commits Differential Revision: http://reviews.llvm.org/D4267 llvm-svn: 212603	2014-07-09 10:07:36 +00:00
Chandler Carruth	2ebc942683	[x86] Re-apply a variant of the x86 side of r212324 now that the rest has settled without incident, removing the x86-specific and overly strict 'isVectorSplat' routine in favor of generic and more powerful splat detection. The primary motivation and result of this is that the x86 backend can now see through splats which contain undef elements. This is essential if we are using a widening form of legalization and I've updated a test case to also run in that mode as before this change the generated code for the test case was completely scalarized. This version of the patch much more carefully handles the undef lanes. - We aren't overly conservative about them in the shift lowering (where we will never use the splat itself). - One place where the splat would have been re-used by the existing code now explicitly constructs a new constant splat that will be safe. - The broadcast lowering is much more reasonable with undefs by doing a correct check of whether the splat is the only user of a loaded value, checking that the splat actually crosses multiple lanes before using a broadcast, and handling broadcasts of non-constant splats. As a consequence of the last bullet, the weird usage of vpshufd instead of vbroadcast is gone, and we actually can lower an AVX splat with vbroadcastss where before we emitted a really strange pattern of a vector load and a manual splat across the vector. llvm-svn: 212602	2014-07-09 10:06:58 +00:00
Timur Iskhodzhanov	e40fb373ef	[ASan/Win] Don't instrument COMDAT globals. Properly fixes PR20244. llvm-svn: 212596	2014-07-09 08:35:33 +00:00
Dmitri Gribenko	a5b27a7128	SourceMgr: consistently use 'unsigned' for the memory buffer ID type llvm-svn: 212595	2014-07-09 08:30:15 +00:00
Alp Toker	696b4a089c	Prospective -fsanitize=memory build fix following r212586 This -f group flag appears to influence linker flags, breaking the usual rules and causing CMake's link invocation to fail during feature detection due to missing link dependencies (msan_*). Let's forcibly add it for now to get things the way they were before feature detection started working. llvm-svn: 212590	2014-07-09 06:27:05 +00:00
Nikola Smiljanic	9fa5444888	Use correct memeber when displaying StringMap's size. llvm-svn: 212588	2014-07-09 05:34:24 +00:00
Alp Toker	7099cd7549	CMake: make __DATE__, __TIME__ etc. macro usage an error When LLVM_ENABLE_TIMESTAMPS has been disabled we can prevent the preprocessor from embedding dates, times and file timestamps. There are a few motivations for this: 1) Validate the recent CMake feature detection bugfix from LLVM r212586 with a flag that's not actually available everywhere. 2) Dogfood clang's new -Wdate-time warning from r210511 when bootstrapping. 3) Encourage reproducible builds. llvm-svn: 212587	2014-07-09 03:39:32 +00:00
Alp Toker	0068397f34	CMake: fix compiler feature detection add_flag_if_supported() and add_flag_or_print_warning() were effectively no-ops, just returning the value of the first result (usually '-fno-omit-frame-pointer') for all subsequent checks for different flags. Due to the way CMake caches feature detection results, we need to provide symbolic variable names which will persist the cached results. This commit fixes feature detection using these two macros. The feature checks now run and get stored correctly, and the correct output can be observed in configure logs: -- Performing Test C_SUPPORTS_FPIC -- Performing Test C_SUPPORTS_FPIC - Success -- Performing Test CXX_SUPPORTS_FPIC -- Performing Test CXX_SUPPORTS_FPIC - Success llvm-svn: 212586	2014-07-09 03:38:19 +00:00
Chandler Carruth	f0a33b71e9	[SDAG] At the suggestion of Hal, switch to an output parameter that tracks which elements of the build vector are in fact undef. This should make actually inpsecting them (likely in my next patch) reasonably pretty. Also makes the output parameter optional as it is clear now that most users are happy with undefs in their splats. llvm-svn: 212581	2014-07-09 00:41:34 +00:00
Ehsan Akhgari	342595f832	[ms-coff] Add a test for proper handling of full Windows path names in the .drectve section Summary: This test ensures that we can correctly specify a full Windows path to the clang ASAN runtime libraries. This is in preparation to fix PR20246. Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4427 llvm-svn: 212580	2014-07-09 00:40:50 +00:00
NAKAMURA Takumi	843c4cb401	MipsTargetStreamer.h: Avoid "using" to appease msc17. llvm-svn: 212577	2014-07-08 23:48:22 +00:00
Kevin Enderby	8da4bd60fb	Changed the lvm-nm alias "-s" for -print-armap to "-M". This will allow the "-s" flag to implemented in the future as it is in darwin’s nm(1) to list symbols only in the specified section. Given a LGTM by Shankar Easwaran who originally implemented the support for lvm-nm’s -print-armap and archive map symbols. llvm-svn: 212576	2014-07-08 23:47:31 +00:00
Jim Grosbach	04691a530d	AArch64: Better codegen for loading from __fp16. Loading will generally extend to an f32 or an 64, so make sure to match those patterns directly to load into the FPR16 register class directly rather than going through the integer GPRs. This also eliminates an extra step in the convert-to-f64 path which was first converting to f32 and then to f64 from there. rdar://17594379 llvm-svn: 212573	2014-07-08 23:28:48 +00:00
Hal Finkel	8ae0f8d618	Improve BasicAA CS-CS queries BasicAA contains knowledge of certain intrinsics, such as memcpy and memset, and uses that information to form more-accurate answers to CallSite vs. Loc ModRef queries. Unfortunately, it did not use this information when answering CallSite vs. CallSite queries. Generically, when an intrinsic takes one or more pointers and the intrinsic is marked only to read/write from its arguments, the offset/size is unknown. As a result, the generic code that answers CallSite vs. CallSite (and CallSite vs. Loc) queries in AA uses UnknownSize when forming Locs from an intrinsic's arguments. While BasicAA's CallSite vs. Loc override could use more-accurate size information for some intrinsics, it did not do the same for CallSite vs. CallSite queries. This change refactors the intrinsic-specific logic in BasicAA into a generic AA query function: getArgLocation, which is overridden by BasicAA to supply the intrinsic-specific knowledge, and used by AA's generic implementation. This allows the intrinsic-specific knowledge to be used by both CallSite vs. Loc and CallSite vs. CallSite queries, and simplifies the BasicAA implementation. Currently, only one function, Mac's memset_pattern16, is handled by BasicAA (all the rest are intrinsics). As a side-effect of this refactoring, BasicAA's getModRefBehavior override now also returns OnlyAccessesArgumentPointees for this function (which is an improvement). llvm-svn: 212572	2014-07-08 23:16:49 +00:00
Tobias Grosser	ca7f76c406	DominanceInfo is strongly preferred over RegionInfo This is and always was strong community consensus. Make this clear in the header in case newcomers may not be aware. llvm-svn: 212570	2014-07-08 22:51:03 +00:00
Kevin Enderby	8c50dbb8cb	Add support for BSD format Archive map symbols (aka the table of contents from a __.SYMDEF or "__.SYMDEF SORTED" archive member). llvm-svn: 212568	2014-07-08 22:10:02 +00:00
Pete Cooper	91e4ba2f88	Revert "GlobalDCE: Delete available_externally initializers if it allows removing the value the initializer is referring to." This reverts commit 5b55a47e94e28fbb56d0cd5d72c3db9105c15b4c. A test case was found to crash after this was applied. I'll file a bug to track fixing this with the test case needed. llvm-svn: 212550	2014-07-08 17:06:03 +00:00
Ulrich Weigand	862d8b8d06	[PowerPC] Implement atomic NAND operations as actual NAND This changes the implementation of atomic NAND operations from "a & ~b" (compatible with GCC < 4.4) to actual "~(a & b)" (compatible with GCC >= 4.4). This is in line with the common-code and ARM back-end change implemented in r212433. llvm-svn: 212547	2014-07-08 16:16:02 +00:00
Andrea Di Biagio	d261e98f3d	[DAG] Teach how to combine a pair of shuffles into a single shuffle if the resulting mask is legal. This patch teaches how to fold a shuffle according to rule: shuffle (shuffle (x, undef, M0), undef, M1) -> shuffle(x, undef, M2) We do this only if the resulting mask M2 is legal; this is to avoid introducing illegal shuffles that are potentially expanded into a sub-optimal sequence of target specific dag nodes. This patch has the advantage of being target independent, since it works on ISD nodes. Therefore, all targets (not only x86) can take advantage of this rule. The idea behind this patch is that most shuffle pairs can be safely combined before we run the legalizer on vector operations. This allows us to combine/simplify dag nodes earlier in the process and not only immediately before instruction selection stage. That said. This patch is not meant to replace any existing target specific combine rules; backends might still introduce new shuffles during legalization stage. Also, this rule is very simple and avoids to aggressively optimize shuffles. llvm-svn: 212539	2014-07-08 15:22:29 +00:00
Benjamin Kramer	cccdadca45	Fix some Twine locals. Two of those are use after frees. Found by clang-tidy, fixed by me. llvm-svn: 212537	2014-07-08 14:55:06 +00:00
Timur Iskhodzhanov	a4212c244a	[ASan/Win] Don't instrument private COMDAT globals until PR20244 is properly fixed llvm-svn: 212530	2014-07-08 13:18:58 +00:00
Daniel Sanders	324ad956e0	[mips] Fixed struct/class mismatch introduced in r212522. Clang emits a warning about this. llvm-svn: 212528	2014-07-08 13:13:42 +00:00
Daniel Sanders	7201a3e3bb	Fix r212522 - [mips] Improve encapsulation of the .MIPS.abiflags implementation and limit scope of related enums Added two lines that should have been in r212522. llvm-svn: 212523	2014-07-08 10:35:52 +00:00
Daniel Sanders	c7dbc630e5	[mips] Improve encapsulation of the .MIPS.abiflags implementation and limit scope of related enums Summary: Follow on to r212519 to improve the encapsulation and limit the scope of the enums. Also merged two very similar parser functions, fixed a bug where ASE's were not being reported, and marked CPR1's as being 128-bit when MSA is enabled. Differential Revision: http://reviews.llvm.org/D4384 llvm-svn: 212522	2014-07-08 10:11:38 +00:00
Renato Golin	b8a86c43c0	Revert "Refactor ARM subarchitecture parsing" This reverts commit 7b4a6882467e7fef4516a0cbc418cbfce0fc6f6d. llvm-svn: 212521	2014-07-08 10:06:16 +00:00
Arnaud A. de Grandmaison	d7827606de	Truncate the immediate in logical operation to the register width And continue to produce an error if the 32 most significant bits are not all ones or zeros. llvm-svn: 212520	2014-07-08 09:53:04 +00:00
Vladimir Medic	fb8a2a95cd	Mips.abiflags is a new implicitly generated section that will be present on all new modules. The section contains a versioned data structure which represents essentially information to allow a program loader to determine the requirements of the application. This patch implements mips.abiflags section and provides test cases for it. llvm-svn: 212519	2014-07-08 08:59:22 +00:00
Chandler Carruth	142e966261	[x86,SDAG] Sink the logic for folding shuffles of splats more aggressively from the x86 shuffle lowering to the generic SDAG vector shuffle formation code. This code already tried to fold away shuffles of splats! It just had lots of bugs and couldn't handle the case my new x86 shuffle lowering needed. First, it failed to correctly compute whether N2 was undef because it pre-computed this, then did transformations which could make N2 undef, then failed to ever re-consider the precomputed state. Second, it didn't look through bitcasts at all, even in the safe cases where they are just element-type bitcasts with no change to the number of elements. Third, it didn't handle all-zero bit casts nicely the way my code in the x86 side of things did, which is essential to getting good zext-shuffle lowerings. But all of these are generic. I just ported the code down to this layer and fixed the surrounding bugs. Tests exercising this in the x86 backend still pass and some silly code in widen_cast-6.ll gets better. I updated that test to be a bit more precise but it's still pretty unclear what the value of the test is in this day and age. llvm-svn: 212517	2014-07-08 08:45:38 +00:00
Chandler Carruth	efbce58775	[SDAG] Actually check for a non-constant splat and clarify comments around the handling of UNDEF lanes in boolean vector content analysis. The code before my changes here also failed to check for non-constant splats in a buildvector. I have no idea how to trigger this, I just spotted by inspection when trying to understand the code. It seems extremely unlikely to be worth the trouble to teach the only caller of this code (DAG combining setcc patterns) how to cleverly handle undef lanes, so I've just commented more thoroughly that we're giving up there. llvm-svn: 212515	2014-07-08 07:44:15 +00:00
Chandler Carruth	b844e72e85	[SDAG] Build up a more rich set of APIs for querying build-vector SDAG nodes about whether they are splats. This is factored out and improved from r212324 which got reverted as it was far too aggressive. The new API should help more conservatively handle buildvectors that are a mixture of splatted and undef values. No functionality change at this point. The hope is to slowly re-introduce the undef-tolerant optimization of splats, but each time being forced to make a concious decision about how to handle the undefs in a way that doesn't lead to contradicting assumptions about the collapsed value. Hal has pointed out in discussions that this may not end up being the desired API and instead it may be more convenient to get a mask of the undef elements or something similar. I'm starting simple and will expand the API as I adapt actual callers and see exactly what they need. llvm-svn: 212514	2014-07-08 07:19:55 +00:00
Alexey Samsonov	c94285a1a0	[ASan] Completely remove sanitizer blacklist file from instrumentation pass. All blacklisting logic is now moved to the frontend (Clang). If a function (or source file it is in) is blacklisted, it doesn't get sanitize_address attribute and is therefore not instrumented. If a global variable (or source file it is in) is blacklisted, it is reported to be blacklisted by the entry in llvm.asan.globals metadata, and is not modified by the instrumentation. The latter may lead to certain false positives - not all the globals created by Clang are described in llvm.asan.globals metadata (e.g, RTTI descriptors are not), so we may start reporting errors on them even if "module" they appear in is blacklisted. We assume it's fine to take such risk: 1) errors on these globals are rare and usually indicate wild memory access 2) we can lazily add descriptors for these globals into llvm.asan.globals lazily. llvm-svn: 212505	2014-07-08 00:50:49 +00:00
Adam Nemet	79580db918	[X86] AVX512: Only allow k1-k7 as predicates to vpcmp* As destination k0 is allowed but not as predicate/writemask. I also modified the test to allow checking of error messages by the assembler. I applied a similar approach to the test ret.s in the same directory. llvm-svn: 212504	2014-07-08 00:22:32 +00:00
Alexey Samsonov	07435c4775	Kill unnecessary include llvm-svn: 212503	2014-07-08 00:03:11 +00:00
Andrea Di Biagio	2620b877b6	[x86] Fix assertion failure caused by a wrong combine of PSHUFD nodes with different types. When combining a sequence of two PSHUFD dag nodes into a single PSHUFD, make sure that we assign the correct type to the resulting PSHUFD. X86ISD::PSHUFD dag nodes can be either MVT::v4i32 or MVT::v4f32. Before this change, an assertion failure was triggered in method 'DAGCombinerInfo::CombineTo' when trying to combine the shuffles from the test below into a single PSHUFD. define <4 x float> @test1(<4 x float> %V) { %1 = shufflevector <4 x float> %V, <4 x float> undef, <4 x i32> <i32 3, i32 0, i32 2, i32 1> %2 = shufflevector <4 x float> %1, <4 x float> undef, <4 x i32> <i32 3, i32 0, i32 2, i32 1> ret <4 x float> %2 } llvm-svn: 212498	2014-07-07 23:25:23 +00:00
Sanjay Patel	70af1fdf9d	fixed some typos llvm-svn: 212495	2014-07-07 22:13:58 +00:00
Juergen Ributzka	665ea71fcd	[FastISel][X86] Fix smul.with.overflow.i8 lowering. Add custom lowering code for signed multiply instruction selection, because the default FastISel instruction selection for ISD::MUL will use unsigned multiply for the i8 type and signed multiply for all other types. This would set the incorrect flags for the overflow check. This fixes <rdar://problem/17549300> llvm-svn: 212493	2014-07-07 21:52:21 +00:00
Louis Gerbarg	4c5b4054b2	Allow AArch64FastISel to degrade graceully in the presence of an MVT::i128 Currently AArch64FastISel crashes if it tries to extend an integer into an MVT::i128. This can happen by creating 128 bit integers like so: typedef unsigned int uint128_t __attribute__((mode(TI))); typedef int sint128_t __attribute__((mode(TI))); This patch makes EmitIntExt check for their presence and then falls back to SelectionDAG. Tests included. rdar://17516686 llvm-svn: 212492	2014-07-07 21:37:51 +00:00
Sanjay Patel	a932da8f35	Fix for PR17073 ( http://llvm.org/pr17073 ), simplifycfg illegally hoists an operation in a phi node that can trap. This patch adds to an existing loop over phi nodes in SimplifyCondBranchToCondBranch() to check for trapping ops and bails out of the optimization if we find one of those. The test cases verify that trapping ops are not hoisted and non-trapping ops are still optimized as expected. llvm-svn: 212490	2014-07-07 21:19:00 +00:00
Rafael Espindola	44cb242dda	Use raw_fd_ostream instead of std::ofstream. llvm-svn: 212483	2014-07-07 20:34:51 +00:00
Renato Golin	1e9c282cd1	Refactor ARM subarchitecture parsing According to a FIXME in ARMMCTargetDesc.cpp the ARM version parsing should be in the Triple helper class. Patch by: Gabor Ballabas llvm-svn: 212479	2014-07-07 20:01:11 +00:00
Ulrich Weigand	5fd91e0c0b	[PowerPC] Fix testcase regression Use -mcpu to avoid different codegen depending on host platform. llvm-svn: 212478	2014-07-07 19:41:54 +00:00
Ulrich Weigand	de8641bfde	[PowerPC] Fix no-assert build r212476 caused a compile failure (unused variable) in a non-assertion build ... llvm-svn: 212477	2014-07-07 19:39:44 +00:00
Ulrich Weigand	ec2bf93895	[PowerPC] Fix "byval align" arguments Arguments passed as "byval align" should get the specified alignment in the parameter save area. There was some code in PPCISelLowering.cpp that attempted to implement this, but this didn't work correctly: while code did update the ArgOffset value, it neglected to update the PtrOff value (which was already computed from the old ArgOffset), and it also neglected to update GPR_idx -- fields skipped due to alignment in the save area must likewise be skipped in GPRs. This patch fixes and simplifies this logic by: - handling argument offset alignment right at the beginning of argument processing, using a new helper routine CalculateStackSlotAlignment (this avoids having to update PtrOff and other derived values later on) - not tracking GPR_idx separately, but always computing the correct GPR_idx for each argument from its ArgOffset - removing some redundant computation in LowerFormalArguments: MinReservedArea must equal ArgOffset after argument processing, so there's no use in computing it twice. [This doesn't change the behavior of the current clang front-end, since that never creates "byval align" arguments at the moment. This will change with a follow-on patch, however.] llvm-svn: 212476	2014-07-07 19:26:41 +00:00
Chandler Carruth	beeacac0b3	[x86] Revert r212324 which was too aggressive w.r.t. allowing undef lanes in vector splats. The core problem here is that undef lanes can't unilaterally be considered to contribute to splats. Their handling needs to be more cautious. There is also a reported failure of the nightly testers (thanks Tobias!) that may well stem from the same core issue. I'm going to fix this theoretical issue, factor the APIs a bit better, and then verify that I don't see anything bad with Tobias's reduction from the test suite before recommitting. Original commit message for r212324: [x86] Generalize BuildVectorSDNode::getConstantSplatValue to work for any constant, constant FP, or undef splat and to tolerate any undef lanes in a splat, then replace all uses of isSplatVector in X86's lowering with it. This fixes issues where undef lanes in an otherwise splat vector would prevent the splat logic from firing. It is a touch more awkward to use this interface, but it is much more accurate. Suggestions for better interface structuring welcome. With this fix, the code generated with the widening legalization strategy for widen_cast-4.ll is dramatically improved as the special lowering strategies for a v16i8 SRA kick in even though the high lanes are undef. We also get a slightly different choice for broadcasting an aligned memory location, and use vpshufd instead of vbroadcastss. This looks like a minor win for pipelining and domain crossing, but a minor loss for the number of micro-ops. I suspect its a wash, but folks can easily tweak the lowering if they want. llvm-svn: 212475	2014-07-07 19:03:32 +00:00
Matt Arsenault	d2c9e08b63	R600: Fix mishandling of load / store chains. Fixes various bugs with reordering loads and stores. Scalarized vector loads weren't collecting the chains at all. llvm-svn: 212473	2014-07-07 18:34:45 +00:00
Matt Arsenault	fda9dad17f	Fix typo, weird indentation llvm-svn: 212472	2014-07-07 18:34:42 +00:00
Tim Northover	aeacbb1b54	[testing]: lld generally lives in tools/, so fix llvm-lit. Otherwise we can't run individual tests directly ("llvm-lit /path/to/test") llvm-svn: 212461	2014-07-07 15:26:53 +00:00
Benjamin Kramer	6cbe670db8	Make helper functions static. llvm-svn: 212460	2014-07-07 14:47:51 +00:00
Tim Northover	3705283b24	X86: revert unintentional change to X86FastISel. This crept in with r212443. llvm-svn: 212459	2014-07-07 14:06:42 +00:00
Evgeniy Stepanov	6fa6c677cc	[asan] Generate asm instrumentation in MC. Generate entire ASan asm instrumentation in MC without relying on runtime helper functions. Patch by Yuri Gorshenin. llvm-svn: 212455	2014-07-07 13:57:37 +00:00
Evgeniy Stepanov	d948a5f3c3	[msan] Fix handling of phi in blacklisted functions. llvm-svn: 212454	2014-07-07 13:28:31 +00:00
Benjamin Kramer	d0993e0077	InstCombine: Simplify code, no functionality change. llvm-svn: 212449	2014-07-07 11:01:16 +00:00
Chandler Carruth	0dcb366268	[x86] Teach the new vector shuffle lowering code to handle what is essentially a DAG combine that never gets a chance to run. We might typically expect DAG combining to remove shuffles-of-splats and other similar patterns, but we don't get a chance to run the DAG combiner when we recursively form sub-shuffles during the lowering of a shuffle. So instead hand-roll a really important combine directly into the lowering code to detect shuffles-of-splats, especially shuffles of an all-zero splat which needn't even have the same element width, etc. This lets the new vector shuffle lowering handle shuffles which implement things like zero-extension really nicely. This will become even more important when I wire the legalization of zero-extension to vector shuffles with the new widening legalization strategy. llvm-svn: 212444	2014-07-07 09:06:58 +00:00
Tim Northover	55beb64bd0	CodeGen: it turns out that NAND is not the same thing as BIC. At all. We've been performing the wrong operation on ARM for "atomicrmw nand" for years, since "a NAND b" is "~(a & b)" rather than ARM's very tempting "a & ~b". This bled over into the generic expansion pass. So I assume no-one has ever actually tried to do an atomic nand in the real world. Oh well. llvm-svn: 212443	2014-07-07 09:06:35 +00:00
Saleem Abdulrasool	763f9a50a5	ARM: properly lower dllimport'ed global values This completes the handling for DLL import storage symbols when lowering instructions. A DLL import storage symbol must have an additional load performed prior to use. This is applicable to variables and functions. This is particularly important for non-function symbols as it is possible to handle function references by emitting a thunk which performs the translation from the unprefixed __imp_ symbol to the proper symbol (although, this is a non-optimal lowering). For a variable symbol, no such thunk can be accommodated. llvm-svn: 212431	2014-07-07 05:18:35 +00:00
Saleem Abdulrasool	220a044888	ARM: correctly mangle dllimport symbols Add support for tracking DLLImport storage class information on a per symbol basis in the ARM instruction selection. Use that information to correctly mangle the symbol (dllimport symbols are referenced via *__imp_<name>). llvm-svn: 212430	2014-07-07 05:18:30 +00:00
Saleem Abdulrasool	1eb4a28b44	ARM: unify symbol name retrieval Ensure that all paths that retrieve the symbol name go through GetARMGVSymbol rather than getSymbol. This is desirable so that any global symbol mangling can be centralised to this function. The motivation for this is handling of symbols that are marked as having dll import dll storage. Such a symbol requires an extra load that is currently handled in the backend and a __imp_ prefix on the symbol name. llvm-svn: 212429	2014-07-07 05:18:22 +00:00
Kevin Qin	4473c1943f	[AArch64] Normalize all constants to build a vector. The value of constant operands will be truncated to fit element width. llvm-svn: 212428	2014-07-07 02:45:40 +00:00
Sanjay Patel	784a5a41e7	fixed typos in comments llvm-svn: 212424	2014-07-06 23:24:53 +00:00
Sanjay Patel	0a2ada7b98	fixed some typos in comments llvm-svn: 212423	2014-07-06 23:10:24 +00:00
Saleem Abdulrasool	97255a017b	AArch64: whitespace cleanup llvm-svn: 212420	2014-07-06 22:13:26 +00:00
Aaron Ballman	586ee604b5	These should be EXPECT_TRUE, not EXPECT_FALSE. Amends r212415. llvm-svn: 212419	2014-07-06 20:20:02 +00:00
Aaron Ballman	e56fe8af86	Fixing compile errors related to changes with MemoryBuffer::getFile. llvm-svn: 212415	2014-07-06 19:34:52 +00:00
Rafael Espindola	adf21f2a56	Update the MemoryBuffer API to use ErrorOr. llvm-svn: 212405	2014-07-06 17:43:13 +00:00
Rafael Espindola	e54d821671	Declare variable on first use. llvm-svn: 212403	2014-07-06 14:31:22 +00:00
Rafael Espindola	a3c65096cf	This only needs a StringRef. llvm-svn: 212402	2014-07-06 14:24:03 +00:00
Rafael Espindola	8026bd0b2a	This only needs a StringRef. llvm-svn: 212401	2014-07-06 14:17:29 +00:00
Alp Toker	51ba5b279e	Fix the MSVC build following r212382 Looks like the casts are needed there after all. llvm-svn: 212399	2014-07-06 10:54:41 +00:00
Alp Toker	a55b95b58a	SourceMgr: make valid buffer IDs start from one Use 0 for the invalid buffer instead of -1/~0 and switch to unsigned representation to enable more idiomatic usage. Also introduce a trivial SourceMgr::getMainFileID() instead of hard-coding 0/1 to identify the main file. llvm-svn: 212398	2014-07-06 10:33:31 +00:00
Alp Toker	54cc62740f	Don't use StringRef iterator functions for data access And also remove some redundant casts from r212371. llvm-svn: 212397	2014-07-06 10:32:55 +00:00
Alp Toker	d7a0d64c14	Remove IntrusiveRefCntPtr::getPtr() function It was deprecated in r212366 and all uses have been switched to get(). llvm-svn: 212382	2014-07-05 22:20:59 +00:00
Matt Arsenault	4261973548	Use cast<> instead of dyn_cast + assert llvm-svn: 212380	2014-07-05 21:16:43 +00:00
Matt Arsenault	258c6e7cd9	Fix grammar llvm-svn: 212379	2014-07-05 21:16:40 +00:00
Saleem Abdulrasool	98fee4984c	ARM: mark matching ARM intrinsics as MSBuiltin A number of the ARM intrinsics are aliased with alternative names in MSVC compatibility mode. This change indicates those intrinsics to permit tablegen to construct an appropriate list of MSBuiltins. With the corresponding change in clang, these intrinsics can then be mapped from the frontend. The tests to validate the intrinsics are aliased correctly will be added with the corresponding clang change. llvm-svn: 212377	2014-07-05 20:09:24 +00:00
Ehsan Akhgari	b3efe0602d	Revert r212375 because of test failures llvm-svn: 212376	2014-07-05 19:46:10 +00:00
Ehsan Akhgari	7c35b0f004	Add a test case for the tilde operator in Microsoft inline assembly llvm-svn: 212375	2014-07-05 19:40:35 +00:00
Simon Atanasyan	5a63aa305d	[llvm-readobj] Fix output of MIPS GOT without local and global entries. llvm-svn: 212374	2014-07-05 19:28:49 +00:00
Rafael Espindola	d5a8efe733	This only needs a StringRef. No functionality change. llvm-svn: 212371	2014-07-05 11:38:52 +00:00
David Majnemer	45647d8f74	ADT: Add a drop_back() helper to ArrayRef The slice(N, M) interface is powerful but not concise when wanting to drop a few elements off of an ArrayRef, fix this by adding a drop_back method. llvm-svn: 212370	2014-07-05 06:12:30 +00:00
Alp Toker	b0c31a56e5	Deprecate IntrusiveRefCntPtr::getPtr() in favour of get() This better aligns with other LLVM-specific and C++ standard library smart pointer types. In particular there are at least a few uses of intrusive refcounting in the frontend where it's worth investigating std::shared_ptr as a more appropriate alternative. llvm-svn: 212366	2014-07-05 03:03:21 +00:00
David Majnemer	82cb0309e2	MC: make MCSymbolData::dump work on const objects This just lets us dump a const MCSymbolData object, no functionality changed. llvm-svn: 212365	2014-07-05 00:39:52 +00:00
Rafael Espindola	8286fbf4c4	Make a helper function static. No functionality change. llvm-svn: 212364	2014-07-05 00:39:08 +00:00
David Majnemer	e0950ee85c	MC: Correct comment in ExportSymbol No functionality changed, just make it so that the code _could_ be uncommented. llvm-svn: 212363	2014-07-04 23:20:46 +00:00
David Majnemer	bee5f754f2	MC: Cleanup COFFAsmParser::ParseSectionFlags Switch a normal for-loop to a range-based for. No functionality changed. llvm-svn: 212362	2014-07-04 23:15:28 +00:00
Rafael Espindola	ba79dba8ed	Make RecordStreamer.h private. llvm-svn: 212361	2014-07-04 22:44:18 +00:00
David Majnemer	d1bea693e2	IR: Fold away compares between GV GEPs and GVs A GEP of a non-weak global variable will not be equivalent to another non-weak global variable or a GEP of such a variable. Differential Revision: http://reviews.llvm.org/D4238 llvm-svn: 212360	2014-07-04 22:05:26 +00:00
Rafael Espindola	e6107799fa	Fix a bug in the conversion to ErrorOr. The regular end of the bitcode parsing is in the BitstreamEntry::EndBlock case. Should fix the LTO bootstrap on OS X (this function is only used by ld64). llvm-svn: 212357	2014-07-04 20:05:56 +00:00
Rafael Espindola	c75c4fad46	Revert "Convert a few std::strings to StringRef." This reverts commit r212342. We can get a StringRef into the current Record, but not one in the bitcode itself since the string is compressed in it. llvm-svn: 212356	2014-07-04 20:02:42 +00:00
Sanjay Patel	69bf48eeb1	fixed typos llvm-svn: 212355	2014-07-04 19:40:43 +00:00
Rafael Espindola	089a317c64	Ignore llvm specific symbols in the LTOModule. These are the llvm.* globals and functions. I don't think it is possible to test this directly since llvm-lto is not a full linker and will not report duplicated symbols, but this fixes bootstrap with gold and lto enabled. llvm-svn: 212354	2014-07-04 19:31:27 +00:00
Ehsan Akhgari	4103da6bfb	Add support for parsing the not operator in Microsoft inline assembly This fixes http://llvm.org/PR20202 llvm-svn: 212352	2014-07-04 19:13:05 +00:00
Rafael Espindola	2dc0d9bddb	Ignore llvm.* globals. It is not clear if llvm.global_ctors should or should not be in llvm.metadata, but in practice it is not and we need to ignore it for LTO. llvm-svn: 212351	2014-07-04 19:08:22 +00:00
Saleem Abdulrasool	4e63fc498c	TableGen: introduce support for MSBuiltin Add MSBuiltin which is similar in vein to GCCBuiltin. This allows for adding intrinsics for Microsoft compatibility to individual instructions. This is needed to permit the creation of ARM specific MSVC extensions. This is not currently in use, and requires an associated change in clang to enable use of the intrinsics defined by this new class. This merely sets the LLVM portion of the infrastructure in place to permit the use of this functionality. A separate set of changes will enable the new intrinsics. llvm-svn: 212350	2014-07-04 18:42:25 +00:00
Rafael Espindola	dddd1fd9f4	Implement LTOModule on top of IRObjectFile. IRObjectFile provides all the logic for producing mangled names and getting symbols from inline assembly. LTOModule then adds logic for linking specific tasks, like constructing llvm.compiler_user or extracting linker options from the bitcode. The rule of the thumb is that IRObjectFile has the functionality that is needed by both LTO and llvm-ar. llvm-svn: 212349	2014-07-04 18:40:36 +00:00
Rafael Espindola	0972d41c73	Avoid mangling names twice. No functionality change. llvm-svn: 212348	2014-07-04 16:37:02 +00:00
Rafael Espindola	3885090b86	Mark intrinsic functions as llvm-specific. llvm-svn: 212347	2014-07-04 15:58:00 +00:00
Daniel Sanders	950f48d3c7	[mips][mips64r6] Set ELF e_flags for MIPS32r6/MIPS64r6. Also do MIPS-I to MIPS-V Differential Revision: http://reviews.llvm.org/D4386 llvm-svn: 212346	2014-07-04 15:21:53 +00:00
Daniel Sanders	20c82ee4fa	[mips] Add tests for the 'ret', 'call', and 'indirectbr' LLVM IR instruction. Summary: The tests in this directory are intended to test a single IR instruction with as few dependencies on other instructions as possible. The aim is to be very confident that each LLVM-IR instruction is implemented correctly and with the optimal sequence of instructions, as well as to make it easy to tell what is tested, and make it easier to bring up new ISA revisions in the future. This gives us a good foundation on which to test bigger things. These particular tests will allow testing that MIPS32r6/MIPS64r6 generate the correct return instruction for returns, calls, and indirect branches. This will be a bit tricky since the assembly text is identical but the instruction is actually different. On MIPS32r6/MIPS64r6 'jr $rs' has been removed in favour of the equivalent 'jalr $zero, $rs'. 'jr $rs' remains as an alias for 'jalr $zero, $rs'. Differential Revision: http://reviews.llvm.org/D4266 llvm-svn: 212345	2014-07-04 15:16:14 +00:00
Rafael Espindola	b674c17deb	Don't include llvm.metadata variables in archive symbol tables. llvm-svn: 212344	2014-07-04 15:03:17 +00:00
Rafael Espindola	d749fb5129	Change LTOModule`s getTargetTriple and setTargetTriple to use c++ types. llvm-svn: 212343	2014-07-04 14:19:41 +00:00
Rafael Espindola	f98536a046	Convert a few std::strings to StringRef. llvm-svn: 212342	2014-07-04 14:12:46 +00:00
Rafael Espindola	d346cc8efc	Convert these functions to use ErrorOr. llvm-svn: 212341	2014-07-04 13:52:01 +00:00
Rafael Espindola	ce8a0d6cd8	Remove unused old-style error handling. If needed, an ErrorOr should be used. llvm-svn: 212340	2014-07-04 13:30:13 +00:00

... 5 6 7 8 9 ...

105858 Commits