llvm-project

Commit Graph

Author	SHA1	Message	Date
Shuxin Yang	a9df0db27d	Add function DominatorTree::getDescendants(). As its name suggests, this function will return all basic blocks dominated by a given block. llvm-svn: 191014	2013-09-19 17:18:35 +00:00
Reid Kleckner	58c16ee609	Include an LLVM-vs2012_xp toolset in the MSBuild integration Patch by Paul Hampson! llvm-svn: 191010	2013-09-19 16:50:40 +00:00
Evgeniy Stepanov	37b8645480	[msan] Wrap indirect functions. Adds a flag to the MemorySanitizer pass that enables runtime rewriting of indirect calls. This is part of MSanDR implementation and is needed to return control to the DynamiRio-based helper tool on transition between instrumented and non-instrumented modules. Disabled by default. llvm-svn: 191006	2013-09-19 15:22:35 +00:00
Benjamin Kramer	d443e4a080	DAGCombiner: Don't fold vector muls with constants that look like a splat of a power of 2 but differ in bit width. PR17283. llvm-svn: 191000	2013-09-19 13:28:20 +00:00
Justin Holewinski	a54daa4640	[NVPTX] Make constant vector test case endian-independent llvm-svn: 190998	2013-09-19 13:14:44 +00:00
Justin Holewinski	95564bdf5e	[NVPTX] Support constant vector globals llvm-svn: 190997	2013-09-19 12:51:46 +00:00
Amara Emerson	3308909508	[ARMv8] Add support for the v8 cryptography extensions. llvm-svn: 190996	2013-09-19 11:59:01 +00:00
Tim Northover	97347a81bc	X86: FrameIndex addressing modes do have a base register. When selecting the DAG (add (WrapperRIP ...), (FrameIndex ...)), X86 code had spotted the FrameIndex possibility and was working out whether it could fold the WrapperRIP into this. The test for forming a %rip version is notionally whether we already have a base or index register (%rip precludes both), but we were forgetting to account for the register that would be inserted later to access the frame. rdar://problem/15024520 llvm-svn: 190995	2013-09-19 11:33:53 +00:00
Andrew Trick	b5e1e6cc11	Revert "Encapsulate PassManager debug flags to avoid static init and cxa_exit." Working on a better solution to this. This reverts commit 7d4e9934e7ca83094c5cf41346966c8350179ff2. llvm-svn: 190990	2013-09-19 06:02:43 +00:00
Andrew Trick	f33d6df899	Encapsulate PassManager debug flags to avoid static init and cxa_exit. This puts all the global PassManager debugging flags, like -print-after-all and -time-passes, behind a managed static. This eliminates their static initializers and, more importantly, exit-time destructors. The only behavioral change I anticipate is that tools need to initialize the PassManager before parsing the command line in order to export these options, which makes sense. Tools that already initialize the standard passes (opt/llc) don't need to do anything new. llvm-svn: 190974	2013-09-18 23:31:16 +00:00
Andrew Trick	dc073addc5	whitespace llvm-svn: 190973	2013-09-18 23:31:10 +00:00
Reed Kotler	d6aadc797c	Fix two issues regarding Got pointer (GP) setup. 1) make sure that the first two instructions of the sequence cannot separate from each other. The linker requires that they be sequential. If they get separated, it can still work but it will not work in all cases because the first of the instructions mostly involves the hi part of the pc relative offset and that part changes slowly. You would have to be at the right boundary for this to matter. 2) make sure that this sequence begins on a longword boundary. There appears to be a bug in binutils which makes some of these calculations get messed up if the instruction sequence does not begin on a longword boundary. This is being investigated with the appropriate binutils folks. llvm-svn: 190966	2013-09-18 22:46:09 +00:00
Adrian Prantl	262bcf4584	Debug info: Get rid of the VLA indirection hack in FastISel. Use the DIVariable::isIndirect() flag set by the frontend instead of guessing whether to set the machine location's indirection bit. Paired commit with CFE. llvm-svn: 190961	2013-09-18 22:08:59 +00:00
Preston Gurd	dd9891f22d	Attempt to fix llvm-ppc64-linux2 buildbot failure by adding -march=x86 to SLM test. llvm-svn: 190958	2013-09-18 21:39:33 +00:00
Preston Gurd	457daddc9b	Verify that llvm can generate the prefetchw instruction when the CPU is Atom Silvermont. Patch by Sriram Murali. llvm-svn: 190957	2013-09-18 21:08:09 +00:00
Filip Pizlo	57093e88e0	Make DynamicLibrary use ManagedStatic. This is pretty simple and should just work as advertised - but it does have the caveat that calls to DynamicLibrary::AddSymbol will "reset" if you shutdown llvm and try to come back for seconds. This is a subtle behavior change, but I'm assuming that nobody is affected by it. llvm-svn: 190946	2013-09-18 16:40:14 +00:00
Chandler Carruth	dbf6589b56	More XCore TTI cleanup -- remove an unused private field flagged by -Wunused-private-field with Clang. llvm-svn: 190941	2013-09-18 14:11:11 +00:00
Chandler Carruth	b5a34963c8	Name the XCore target-specific subdirectories canonically. llvm-svn: 190940	2013-09-18 14:08:30 +00:00
Kostya Serebryany	f322382e22	[asan] call __asan_stack_malloc_N only if use-after-return detection is enabled with the run-time option llvm-svn: 190939	2013-09-18 14:07:14 +00:00
NAKAMURA Takumi	69ae1b9aa2	A couple of tests, in llvm/test/Transforms/*/xcore, are XCore-specific. They should be excluded when XCore is not built. llvm-svn: 190938	2013-09-18 13:56:16 +00:00
NAKAMURA Takumi	0b642ec13d	Target/XCore/CMakeLists.txt: Add XCoreTargetTransformInfo.cpp. llvm-svn: 190937	2013-09-18 12:59:41 +00:00
Robert Lytton	f637e2cb23	Prevent LoopVectorizer and SLPVectorizer running if the target has no vector registers. XCore target: Add XCoreTargetTransformInfo This is where getNumberOfRegisters() resides, which in turn returns the number of vector registers (=0). llvm-svn: 190936	2013-09-18 12:43:35 +00:00
Andrea Di Biagio	1f5d74d8ae	Re-add tests from r179291 which were accidentally removed by r181177. llvm-svn: 190934	2013-09-18 12:06:59 +00:00
Richard Sandiford	93183ee78c	[SystemZ] Add unsigned compare-and-branch instructions For some reason I never got around to adding these at the same time as the signed versions. No idea why. I'm not sure whether this SystemZII::BranchC* stuff is useful, or whether it should just be replaced with an "is normal" flag. I'll leave that for later though. There are some boundary conditions that can be tweaked, such as preferring unsigned comparisons for equality with [128, 256), and "<= 255" over "< 256", but again I'll leave those for a separate patch. llvm-svn: 190930	2013-09-18 09:56:40 +00:00
Joey Gouly	36b2e5de3c	'svn add' the test cases. llvm-svn: 190929	2013-09-18 09:46:49 +00:00
Joey Gouly	2f8890ed1c	[ARMv8] Add CRC instructions. Patch by Bradley Smith! llvm-svn: 190928	2013-09-18 09:45:55 +00:00
Filip Pizlo	591f15411a	Revert r190921. It broke Windows. I'll roll it back in when I have a chance to look at it in detail. llvm-svn: 190923	2013-09-18 06:37:55 +00:00
Filip Pizlo	4389ee380b	Make DynamicLibrary use ManagedStatic. This is pretty simple and should just work as advertised - but it does have the caveat that calls to DynamicLibrary::AddSymbol will "reset" if you shutdown llvm and try to come back for seconds. This is a subtle behavior change, but I'm assuming that nobody is affected by it. llvm-svn: 190921	2013-09-18 06:03:27 +00:00
Craig Topper	358c7989b1	Prevent extra calls to ToggleFeature for Feature64Bit and FeatureCMOV if they've already been enabled. The extra call ends up clearing the bit in FeatureBits since its a 'toggle'. Can't prove that anything was broken because of this since I don't think the FeatureBits for these are used. llvm-svn: 190920	2013-09-18 06:01:53 +00:00
Craig Topper	a8442344ed	Fix X86 subtarget to not overwrite the autodetected features by calling InitMCProcessorInfo right after detecting them. Instead add a new function that only updates the scheduling model and call that. llvm-svn: 190919	2013-09-18 05:54:09 +00:00
Craig Topper	be3e01e61f	Revert accidental commit I had to make to get the test case in PR17268 to still work correctly. llvm-svn: 190917	2013-09-18 04:10:17 +00:00
Craig Topper	98064b9f4d	Lift alignment restrictions for load/store folding on VINSERTF128/VEXTRACTF128. Fixes PR17268. llvm-svn: 190916	2013-09-18 03:55:53 +00:00
David Blaikie	eacc287b49	ifndef NDEBUG-out an asserts-only constant committed in r190863 llvm-svn: 190905	2013-09-18 00:11:27 +00:00
Matt Arsenault	d12e8020ec	Fix a constant folding address space place I missed. If address space 0 was smaller than the address space in a constant inttoptr/ptrtoint pair, the wrong mask size would be used. llvm-svn: 190899	2013-09-17 23:23:16 +00:00
Reid Kleckner	c1e7621e01	COFF: Ensure that objects produced by LLVM link with /safeseh Summary: We indicate that the object files are safe by emitting a @feat.00 absolute address symbol. The address is presumably interpreted as a bitfield of features that the compiler would like to enable. Bit 0 is documented in the PE COFF spec to opt in to "registered SEH", which is what /safeseh enables. LLVM's object files are safe by default because LLVM doesn't know how to produce SEH handlers. Reviewers: Bigcheese CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1691 llvm-svn: 190898	2013-09-17 23:18:05 +00:00
Matt Arsenault	ce3e4fc934	Missed using check type enum in one place llvm-svn: 190897	2013-09-17 23:15:35 +00:00
Matt Arsenault	c4d2d471ce	Use function's argument instead of the global flag. For now it happens the argument is always the same. llvm-svn: 190896	2013-09-17 22:45:57 +00:00
Matt Arsenault	38820972e9	FileCheck refactor: use enum instead of bunch of bools llvm-svn: 190893	2013-09-17 22:30:02 +00:00
Quentin Colombet	870b662779	Revert the load slicing done in r190870. To avoid regressions with bitfield optimizations, this slicing should take place later, like ISel time. llvm-svn: 190891	2013-09-17 22:01:26 +00:00
Reid Kleckner	3ea536fef4	COFF: Emit all MCSymbols rather than filtering out some of them In particular, this means we emit non-external symbols defined to variables, such as aliases or absolute addresses. This is needed to implement /safeseh, and it appears there was some confusion about what symbols to emit previously. llvm-svn: 190888	2013-09-17 21:24:44 +00:00
Reid Kleckner	50689eb917	COFF: Remove ExportSection, which has been dead since r114823 llvm-svn: 190887	2013-09-17 21:24:02 +00:00
Eric Christopher	e7af7bd8d0	Move variable into assert to avoid unused variable warning. llvm-svn: 190886	2013-09-17 21:13:57 +00:00
Matt Arsenault	e6952f28ca	Cleanup handling of constant function casts. Some of this code is no longer necessary since int<->ptr casts are no longer occur as of r187444. This also fixes handling vectors of pointers, and adds a bunch of new testcases for vectors and address spaces. llvm-svn: 190885	2013-09-17 21:10:14 +00:00
Bill Schmidt	bdae03f227	[PowerPC] Add a FIXME. Documenting a design choice to generate only medium model sequences for TLS addresses at this time. Small and large code models could be supported if necessary. llvm-svn: 190883	2013-09-17 20:22:05 +00:00
Bill Schmidt	bb381d7063	[PowerPC] Fix problems with large code model (PR17169). Large code model on PPC64 requires creating and referencing TOC entries when using the addis/ld form of addressing. This was not being done in all cases. The changes in this patch to PPCAsmPrinter::EmitInstruction() fix this. Two test cases are also modified to reflect this requirement. Fast-isel was not creating correct code for loading floating-point constants using large code model. This also requires the addis/ld form of addressing. Previously we were using the addis/lfd shortcut which is only applicable to medium code model. One test case is modified to reflect this requirement. llvm-svn: 190882	2013-09-17 20:03:25 +00:00
Arnold Schwaighofer	cae8735a54	Costmodel: Add support for horizontal vector reductions Upcoming SLP vectorization improvements will want to be able to estimate costs of horizontal reductions. Add infrastructure to support this. We model reductions as a series of (shufflevector,add) tuples ultimately followed by an extractelement. For example, for an add-reduction of <4 x float> we could generate the following sequence: (v0, v1, v2, v3) \ \ / / \ \ / + + (v0+v2, v1+v3, undef, undef) \ / ((v0+v2) + (v1+v3), undef, undef) %rdx.shuf = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 2, i32 3, i32 undef, i32 undef> %bin.rdx = fadd <4 x float> %rdx, %rdx.shuf %rdx.shuf7 = shufflevector <4 x float> %bin.rdx, <4 x float> undef, <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef> %bin.rdx8 = fadd <4 x float> %bin.rdx, %rdx.shuf7 %r = extractelement <4 x float> %bin.rdx8, i32 0 This commit adds a cost model interface "getReductionCost(Opcode, Ty, Pairwise)" that will allow clients to ask for the cost of such a reduction (as backends might generate more efficient code than the cost of the individual instructions summed up). This interface is excercised by the CostModel analysis pass which looks for reduction patterns like the one above - starting at extractelements - and if it sees a matching sequence will call the cost model interface. We will also support a second form of pairwise reduction that is well supported on common architectures (haddps, vpadd, faddp). (v0, v1, v2, v3) \ / \ / (v0+v1, v2+v3, undef, undef) \ / ((v0+v1)+(v2+v3), undef, undef, undef) %rdx.shuf.0.0 = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 0, i32 2 , i32 undef, i32 undef> %rdx.shuf.0.1 = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 1, i32 3, i32 undef, i32 undef> %bin.rdx.0 = fadd <4 x float> %rdx.shuf.0.0, %rdx.shuf.0.1 %rdx.shuf.1.0 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef, <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef> %rdx.shuf.1.1 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef, <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef> %bin.rdx.1 = fadd <4 x float> %rdx.shuf.1.0, %rdx.shuf.1.1 %r = extractelement <4 x float> %bin.rdx.1, i32 0 llvm-svn: 190876	2013-09-17 18:06:50 +00:00
Arnold Schwaighofer	4a3dcaa193	SLPVectorizer: Don't vectorize phi nodes that use invoke values We can't insert an insertelement after an invoke. We would have to split a critical edge. So when we see a phi node that uses an invoke we just give up. radar://14990770 llvm-svn: 190871	2013-09-17 17:03:29 +00:00
Quentin Colombet	b8d672ef5b	[InstCombiner] Slice a big load in two loads when the elements are next to each other in memory. The motivation was to get rid of truncate and shift right instructions that get in the way of paired load or floating point load. E.g., Consider the following example: struct Complex { float real; float imm; }; When accessing a complex, llvm was generating a 64-bits load and the imm field was obtained by a trunc(lshr) sequence, resulting in poor code generation, at least for x86. The idea is to declare that two load instructions is the canonical form for loading two arithmetic type, which are next to each other in memory. Two scalar loads at a constant offset from each other are pretty easy to detect for the sorts of passes that like to mess with loads. <rdar://problem/14477220> llvm-svn: 190870	2013-09-17 16:57:34 +00:00
Preston Gurd	ba6f9d1b7d	Remove unused code, which had been commented out. llvm-svn: 190869	2013-09-17 16:53:36 +00:00
Serge Pavlov	8ec39992c1	Added documentation to getMemsetStores. llvm-svn: 190866	2013-09-17 16:24:42 +00:00
Ben Langmuir	de39520f79	Add llvm.x86.* intrinsics for Intel SHA Extensions Add llvm.x86.* intrinsics for all of the Intel SHA Extensions instructions, as well as tests. Also remove mayLoad and hasSideEffects, which can be inferred from the instruction patterns. llvm-svn: 190864	2013-09-17 13:44:39 +00:00
Kostya Serebryany	bc86efb89d	[asan] inline the calls to __asan_stack_free_* with small sizes. Yet another 10%-20% speedup for use-after-return llvm-svn: 190863	2013-09-17 12:14:50 +00:00
Joey Gouly	830c27ab2d	[ARM] Fix the deprecation of MCR encodings that map to CP15{ISB,DSB,DMB}. llvm-svn: 190862	2013-09-17 09:54:57 +00:00
Stepan Dyatkovskiy	dc2c4b4462	Bugfix for PR17099: Wrong cast operation. MergeFunctions emits Bitcast instead of pointer-to-integer operation. Patch fixes MergeFunctions::writeThunk function. It replaces unconditional Bitcast creation with "Value* createCast(...)" method, that checks operand types and selects proper instruction. See unit-test as example. llvm-svn: 190859	2013-09-17 09:36:11 +00:00
Elena Demikhovsky	ac3e8eb9f0	AVX-512: Converted to Unix style llvm-svn: 190851	2013-09-17 07:34:34 +00:00
Craig Topper	514f02cc07	Add AES and SHA instructions to the load folding tables. llvm-svn: 190850	2013-09-17 06:50:11 +00:00
Craig Topper	684abc8236	Fix column alignment. No functional change. llvm-svn: 190849	2013-09-17 06:05:17 +00:00
Craig Topper	79d1bff2ad	Make a more clear AVX-512 section header that matches similar in the file. llvm-svn: 190843	2013-09-17 03:34:09 +00:00
Kevin Qin	36399e6b68	Implement 3 AArch64 neon instructions : umov smov ins. llvm-svn: 190839	2013-09-17 02:21:02 +00:00
Quentin Colombet	d30a9585b8	[SelectionDAG] Teach the vector scalarizer about TRUNCATE. When a truncate node defines a legal vector type but uses an illegal vector type, the legalization process was splitting the vector until <1 x vector> type, but then it was failing to scalarize the node because it did not know how to handle TRUNCATE. <rdar://problem/14989896> llvm-svn: 190830	2013-09-17 00:26:56 +00:00
Adrian Prantl	aa420d04b8	mention command line parameters llvm-svn: 190827	2013-09-17 00:15:36 +00:00
Adrian Prantl	35c885879a	simplify expression llvm-svn: 190826	2013-09-17 00:15:33 +00:00
Adrian Prantl	b2b2bb7445	Be sure we run ARM tests only when an ARM backend is present. llvm-svn: 190822	2013-09-16 23:48:45 +00:00
Adrian Prantl	db3e26d193	Debug info: Fix PR16736 and rdar://problem/14990587. A DBG_VALUE is register-indirect iff the first operand is a register _and_ the second operand is an immediate. llvm-svn: 190821	2013-09-16 23:29:03 +00:00
Matt Arsenault	899f7d2b00	MemCpyOptimizer: Use max legal int size instead of pointer size If there are no legal integers, assume 1 byte. This makes more sense than using the pointer size as a guess for the maximum GPR width. It is conceivable to want to use some 64-bit pointers on a target where 64-bit integers aren't legal. llvm-svn: 190817	2013-09-16 22:43:16 +00:00
Preston Gurd	f4f8d8acc8	Add Atom Silvermont (slm) tests - check that -mcpu=slm uses the call register indirect optimization - check that -mcpu=slm runs the scheduler - check that -mcpu=slm supports the movbe instruction llvm-svn: 190814	2013-09-16 22:22:07 +00:00
Jakub Staszak	ec2ffa92d8	Use reference instead of copy. llvm-svn: 190813	2013-09-16 22:03:38 +00:00
Jordan Rose	66ea0363e4	[CMake] Hack GetSVN.cmake to handle unusual terminals. I got a report of a hang in git's helper functions trying to figure out how to display results of "git svn info" when run inside ninja, even though the result is immediately piped to grep. This seems to avoid that. llvm-svn: 190808	2013-09-16 21:38:01 +00:00
Krzysztof Parzyszek	3c463aa5e7	Add testcase for r190631 llvm-svn: 190807	2013-09-16 21:24:30 +00:00
Tim Northover	9c30f7a4ff	TableGen: fix constness of new comparison function. libc++ didn't seem to like a non-const call operator. llvm-svn: 190797	2013-09-16 17:33:40 +00:00
Bill Schmidt	c763c22469	[PowerPC] Fix PR17155 - Ignore COPY_TO_REGCLASS during emit. Fast-isel generates a COPY_TO_REGCLASS for widening f32 to f64, which is a nop on PPC64. This is needed to keep the register class system happy, but on the fast-isel path it is not removed before emit as it is for DAG select. Ignore this op when emitting instructions. llvm-svn: 190795	2013-09-16 17:25:12 +00:00
Tim Northover	c74e691c27	TableGen: give asm match classes deterministic order. TableGen was sorting the entries in some of its internal data structures by pointer. This order filtered through to the final matching table and affected the diagnostics produced on bad assembly occasionally. It also turns out STL algorithms are ridiculously easy to misuse on containers with custom order methods. (No bugs before, or now that I know of, but plenty in the middle). This should fix the sanitizer bot, which ends up with weird pointers. llvm-svn: 190793	2013-09-16 16:43:19 +00:00
Tim Northover	f9aaafd97b	AsmMatcher: emit subtarget feature enum in deterministic order. llvm-svn: 190792	2013-09-16 16:43:16 +00:00
Arnold Schwaighofer	53e622cef4	Don't vectorize if there are outside loop users of the induction variable. We would have to compute the pre increment value, either by computing it on every loop iteration or by splitting the edge out of the loop and inserting a computation for it there. For now, just give up vectorizing such loops. Fixes PR17179. llvm-svn: 190790	2013-09-16 16:17:24 +00:00
Evgeniy Stepanov	604293fbb4	[msan] Check return value of main(). llvm-svn: 190782	2013-09-16 13:24:32 +00:00
Vladimir Medic	05bcde6d9a	This patch implements Mips load/store instructions from/to coprocessor 2. Test cases are added. llvm-svn: 190780	2013-09-16 10:29:42 +00:00
Benjamin Kramer	2ef689caf3	ARM: Deduplicate ConstantPoolValues. llvm-svn: 190779	2013-09-16 10:17:31 +00:00
Daniel Sanders	a6822ed91f	Fix the build for git repositories with multiple remotes. Summary: When a git repository had multiple remotes, ${repository} will be set to a multiline string. This causes compilation errors in SVNVersion.inc. Fix this by limiting the output of utils/GetRepositoryPath to the first remote (which is reasonably likely to be 'origin'). Reviewers: jordan_rose CC: llvm-commits, t.p.northover Differential Revision: http://llvm-reviews.chandlerc.com/D1659 llvm-svn: 190778	2013-09-16 09:25:49 +00:00
Richard Sandiford	109a7c6ff1	[SystemZ] Improve extload handling The port originally had special patterns for extload, mapping them to the same instructions as sextload. It seemed neater to have patterns that match "an extension that is allowed to be signed" and "an extension that is allowed to be unsigned". This was originally meant to be a clean-up, but it does improve the handling of promoted integers a little, as shown by args-06.ll. llvm-svn: 190777	2013-09-16 09:03:10 +00:00
Craig Topper	a6d204ec68	Make F16C feature flag imply AVX rather than just checking both at the patterns. llvm-svn: 190775	2013-09-16 04:29:58 +00:00
Peter Collingbourne	3fa50f9b05	Implement function prefix data as an IR feature. Previous discussion: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/063909.html Differential Revision: http://llvm-reviews.chandlerc.com/D1191 llvm-svn: 190773	2013-09-16 01:08:15 +00:00
Hal Finkel	40c34781b5	PPC: Don't restrict lvsl generation to after type legalization This is a re-commit of r190764, with an extra check to make sure that we're not performing the transformation on illegal types (a small test case has been added for this as well). Original commit message: The PPC backend uses a target-specific DAG combine to turn unaligned Altivec loads into a permutation-based sequence when possible. Unfortunately, the target-specific DAG combine is not always called on all loads of interest (sometimes the routines in DAGCombine call CombineTo such that the new node and users are not added to the worklist); allowing the combine to trigger early (before type legalization) mitigates this problem. Because the autovectorizers only create legal vector types, I don't expect a lot of cases where this optimization is enabled by type legalization in practice. llvm-svn: 190771	2013-09-15 22:09:58 +00:00
Benjamin Kramer	7d6052687e	Replace some unnecessary vector copies with references. llvm-svn: 190770	2013-09-15 22:04:42 +00:00
Benjamin Kramer	ac511cac77	ELF: Add support for the exclude section bit for gas compat. llvm-svn: 190769	2013-09-15 19:53:20 +00:00
David Majnemer	a4b521b7fc	MC: Add support for '?' flags in .section directives Summary: The '?' flag uses the last section group if the last had a section group. We treat combining an explicit section group and the '?' as a hard error. This fixes PR17198. Reviewers: rafael, bkramer Reviewed By: bkramer CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1686 llvm-svn: 190768	2013-09-15 19:24:16 +00:00
Kai Nacke	8539b4637b	Fix alignment of unwind data. For alignment purposes, the instruction array will always have an even number of entries, with the final entry potentially unused (in which case the array will be one longer than indicated by the count of unwind codes field). Reviewed by Anton Korobeynikov, Charles Davis and Nico Rieck. llvm-svn: 190767	2013-09-15 18:01:09 +00:00
Kai Nacke	74adc8a457	Generate IMAGE_REL_AMD64_ADDR32NB relocations for SEH data structures. The Win64 EH data structures must be of type IMAGE_REL_AMD64_ADDR32NB instead of IMAGE_REL_AMD64_ADDR32. This is easiely achieved by adding the VK_COFF_IMGREL32 modifier to the symbol reference. Change also references to start and end of the SEH range of a function as offsets to start of the function. Reviewed by Jim Grosbach, Charles Davis and Nico Rieck. llvm-svn: 190766	2013-09-15 17:46:46 +00:00
Hal Finkel	31025a6325	Revert r190764: PPC: Don't restrict lvsl generation to after type legalization This is causing test-suite failures. Original commit message: The PPC backend uses a target-specific DAG combine to turn unaligned Altivec loads into a permutation-based sequence when possible. Unfortunately, the target-specific DAG combine is not always called on all loads of interest (sometimes the routines in DAGCombine call CombineTo such that the new node and users are not added to the worklist); allowing the combine to trigger early (before type legalization) mitigates this problem. Because the autovectorizers only create legal vector types, I don't expect a lot of cases where this optimization is enabled by type legalization in practice. llvm-svn: 190765	2013-09-15 15:41:11 +00:00
Hal Finkel	2945d4e916	PPC: Don't restrict lvsl generation to after type legalization The PPC backend uses a target-specific DAG combine to turn unaligned Altivec loads into a permutation-based sequence when possible. Unfortunately, the target-specific DAG combine is not always called on all loads of interest (sometimes the routines in DAGCombine call CombineTo such that the new node and users are not added to the worklist); allowing the combine to trigger early (before type legalization) mitigates this problem. Because the autovectorizers only create legal vector types, I don't expect a lot of cases where this optimization is enabled by type legalization in practice. llvm-svn: 190764	2013-09-15 15:20:54 +00:00
Hal Finkel	31658834e6	Prevent assert in CombinerGlobalAA with null values DAGCombiner::isAlias can be called with SrcValue1 or SrcValue2 null, and we can't use AA in this case (if we try, then the casting code in AA will assert). llvm-svn: 190763	2013-09-15 02:19:49 +00:00
Reed Kotler	655531521e	Expand the mask capability for deciding which functions are mips16 and mips32 so it can be better used for general interoperability testing between mips32 and mips16. llvm-svn: 190762	2013-09-15 02:09:08 +00:00
Benjamin Kramer	43cc98a78d	Remove unused StringRef that no compiler warned about, I wonder why. llvm-svn: 190759	2013-09-14 22:55:54 +00:00
Ben Langmuir	8eb45a4ef6	Add the remaining Intel SHA instructions Also assembly/disassembly tests, and for sha256rnds2, aliases with an explicit xmm0 dependency. llvm-svn: 190754	2013-09-14 15:03:21 +00:00
Robert Wilhelm	042f10ce41	Fix spelling. llvm-svn: 190750	2013-09-14 09:34:59 +00:00
Robert Wilhelm	516be56fd9	Fix spelling. llvm-svn: 190749	2013-09-14 09:34:24 +00:00
Chandler Carruth	ebeac5cb89	Remove the long, long defunct IR block placement pass. This pass was based on the previous (essentially unused) profiling infrastructure and the assumption that by ordering the basic blocks at the IR level in a particular way, the correct layout would happen in the end. This sometimes worked, and mostly didn't. It also was a really naive implementation of the classical paper that dates from when branch predictors were primarily directional and when loop structure wasn't commonly available. It also didn't factor into the equation non-fallthrough branches and other machine level details. Anyways, for all of these reasons and more, I wrote MachineBlockPlacement, which completely supercedes this pass. It both uses modern profile information infrastructure, and actually works. =] llvm-svn: 190748	2013-09-14 09:28:14 +00:00
Zoran Jovanovic	fc26cfcde7	Fixed bug when generating Load Upper Immediate microMIPS instruction. llvm-svn: 190746	2013-09-14 07:35:41 +00:00
Zoran Jovanovic	3671a5441a	Support for microMIPS DIV instructions. llvm-svn: 190745	2013-09-14 07:15:21 +00:00
Zoran Jovanovic	ab85278137	Support for misc microMIPS instructions. llvm-svn: 190744	2013-09-14 06:49:25 +00:00
Matt Arsenault	2e5f5b2e78	Add missing CHECK-LABEL llvm-svn: 190740	2013-09-14 02:44:06 +00:00
Matt Arsenault	8e48a7f911	Add test for untested path in SimplifyCFG This case wasn't checked with a pointer condition. llvm-svn: 190739	2013-09-14 02:44:02 +00:00
Daniel Dunbar	cd625f4e54	[lit] Add an --output option, for writing results in a machine readable form. llvm-svn: 190738	2013-09-14 01:19:17 +00:00
Filip Pizlo	67d9709341	Make PrettyStackTraceEntry use ManagedStatic for its ThreadLocal. This was somewhat tricky because ~PrettyStackTraceEntry() may run after llvm_shutdown() has been called. This is rare and only happens for a common idiom used in the main() functions of command-line tools. This works around the idiom by skipping the stack clean-up if the PrettyStackTraceHead ManagedStatic is not constructed (i.e. llvm_shutdown() has been called). llvm-svn: 190730	2013-09-13 22:59:47 +00:00
Hal Finkel	c3cfbf8677	Add missing break statement in PPCISelLowering As it turns out, not a problem in practice, but it should be there. llvm-svn: 190720	2013-09-13 20:09:02 +00:00
Preston Gurd	3fe264d625	Adds support for Atom Silvermont (SLM) - -march=slm Implements Instruction scheduler latencies for Silvermont, using latencies from the Intel Silvermont Optimization Guide. Auto detects SLM. Turns on post RA scheduler when generating code for SLM. llvm-svn: 190717	2013-09-13 19:23:28 +00:00
Quentin Colombet	cf71c6320b	[Peephole] Rewrite copies to avoid cross register banks copies. By definition copies across register banks are not coalescable. Still, it may be possible to get rid of such a copy when the value is available in another register of the same register file. Consider the following example, where capital and lower letters denote different register file: b = copy A <-- cross-bank copy ... C = copy b <-- cross-bank copy This could have been optimized this way: b = copy A <-- cross-bank copy ... C = copy A <-- same-bank copy Note: b and C's definitions may be in different basic blocks. This patch adds a peephole optimization that looks through a chain of copies leading to a cross-bank copy and reuses a source that is on the same register file if available. This solution could also be used to get rid of some copies (e.g., A could have been used instead of C). However, we do not do so because: - It may over constrain the coloring of the source register for coalescing. - The register allocator may not be able to find a nice split point for the longer live-range, leading to more spill. <rdar://problem/14742333> llvm-svn: 190713	2013-09-13 18:26:31 +00:00
Benjamin Kramer	e35e7c982f	Add warn_unused_result to empty() on various containers. empty() doesn't actually empty out the container, making this a common typo. llvm-svn: 190708	2013-09-13 17:33:24 +00:00
Nuno Lopes	c38b2dabb0	typo fix: use BUILD_ARCHIVE to build .a libs and not ARCHIVE_LIBRARY llvm-svn: 190696	2013-09-13 15:01:54 +00:00
Amaury de la Vieuville	9867f8649b	Fix tests for hasFPARMv8 name change (r190692) Patch by Bradley Smith llvm-svn: 190694	2013-09-13 14:37:52 +00:00
Joey Gouly	ccd04894c4	[ARMv8] Change hasV8Fp to hasFPARMv8, and other command line options to be more consistent. llvm-svn: 190692	2013-09-13 13:46:57 +00:00
Evgeniy Stepanov	0435ecd18f	[msan] Add source file:line to stack origin reports. Compiler part. llvm-svn: 190689	2013-09-13 12:54:49 +00:00
Daniel Sanders	eb03a37cfa	Fix build failure reported by Tobias Markmann in bug 17203. svn 1.8.0 emits an additional line matching 'URL:' in its 'svn info' command ('Relative URL:'). Changed the grep to match only the intended line so that a valid SVNVersion.inc is generated. The problem doesnt occur with the svn version I'm using (1.7.5) but Tobias has confirmed that the change fixes the problem. See http://llvm.org/bugs/show_bug.cgi?id=17203 llvm-svn: 190685	2013-09-13 12:41:38 +00:00
Joey Gouly	3c0e5567a9	[ARMv8] Emit the proper .fpu directive. Patch by Bradley Smith! llvm-svn: 190683	2013-09-13 11:51:52 +00:00
Amaury de la Vieuville	9fd5e53f1d	Add "native" to config.available_features, to make it easier to disable non-x-compile-safe tests Patch by Artyom Skrobov! llvm-svn: 190679	2013-09-13 10:59:01 +00:00
Patrik Hagglund	57cb2bd0ef	Fix for executing AutoRegen.sh. Revert a part of r187209. Since r187209, which modified ltdl.m4, I was unable to execute AutoRegen.sh, getting: ../configure:10779: error: possibly undefined macro: AC_LTDL_FUNC_ARGZ This commit re-adds AC_LTDL_FUNC_ARGZ to ltdl.m4, as a quick fix. For me, this corresponds to the configure file currently checked in. (However, the ltdl library seems to be unused since r74924 in 2009, except for the use of the LTDL_SHLIB_EXT macro in bugpoint(?). Therefore, the right solution seems to try to get rid of the local ltdl.m4 file, specified by autoconf/README.TXT.) llvm-svn: 190677	2013-09-13 10:29:42 +00:00
Zoran Jovanovic	def5d3475f	Test commit to verify that commit access works. llvm-svn: 190676	2013-09-13 10:08:05 +00:00
Richard Sandiford	d816320809	[SystemZ] Use getTarget{Insert,Extract}Subreg rather than getMachineNode Just a clean-up, no behavioral change intended. llvm-svn: 190673	2013-09-13 09:12:44 +00:00
Richard Sandiford	030c165710	[SystemZ] Try to fold shifts into TMxx E.g. "SRL %r2, 2; TMLL %r2, 1" => "TMLL %r2, 4". llvm-svn: 190672	2013-09-13 09:09:50 +00:00
Duncan Sands	c9e95ad0db	Avoid a compiler warning about Found not being used when assertions are disabled. llvm-svn: 190668	2013-09-13 08:16:06 +00:00
Tim Northover	635a979038	AArch64: use RegisterOperand for NEON registers. Previously we modelled VPR128 and VPR64 as essentially identical register-classes containing V0-V31 (which had Q0-Q31 as "sub_alias" sub-registers). This model is starting to cause significant problems for code generation, particularly writing EXTRACT/INSERT_SUBREG patterns for converting between the two. The change here switches to classifying VPR64 & VPR128 as RegisterOperands, which are essentially aliases for RegisterClasses with different parsing and printing behaviour. This fits almost exactly with their real status (VPR128 == FPR128 printed strangely, VPR64 == FPR64 printed strangely). llvm-svn: 190665	2013-09-13 07:26:52 +00:00
Craig Topper	21a916b6db	Move operator to end of previous line to match coding standards. llvm-svn: 190659	2013-09-13 04:41:06 +00:00
Eric Christopher	dd1a01203d	Add initial support for handling gnu style pubnames accepted by some versions of gold. This support is designed to allow gold to produce gdb_index sections similar to the accelerator tables and consumable by gdb. llvm-svn: 190649	2013-09-13 00:35:05 +00:00
Eric Christopher	8b3737fbb0	Reformat and hoist section grabbing to top level. llvm-svn: 190648	2013-09-13 00:34:58 +00:00
Vincent Lejeune	0167a313da	R600: Move clamp handling code to R600IselLowering.cpp llvm-svn: 190645	2013-09-12 23:45:00 +00:00
Vincent Lejeune	9a248e5c2d	R600: Move code handling literal folding into R600ISelLowering. llvm-svn: 190644	2013-09-12 23:44:53 +00:00
Vincent Lejeune	ab3baf80a8	R600: Move fabs/fneg/sel folding logic into PostProcessIsel This move makes possible to correctly handle multiples instructions from a single pattern. llvm-svn: 190643	2013-09-12 23:44:44 +00:00
Chandler Carruth	51428e363f	Remove an unused variable, fixing -Werror build with latest Clang. llvm-svn: 190640	2013-09-12 23:30:48 +00:00
Hal Finkel	a5ebe426a5	Remove unnecessary TBAA metadata from r190636's test case llvm-svn: 190637	2013-09-12 23:23:12 +00:00
Hal Finkel	262a224712	Fix PPC ABI for ByVal structs with vector members When a structure is passed by value, and that structure contains a vector member, according to the PPC ABI, the structure will receive enhanced alignment (so that the vector within the structure will always be aligned). This should resolve PR16641. llvm-svn: 190636	2013-09-12 23:20:06 +00:00
Joe Abbey	1a6e77080f	Patch provide by Tom Roeder! Reviewed by Joe Abbey and Tobias Grosser Here is a patch that fixes decoding of CE_SELECT in BitcodeReader, along with a simple test case. The problem in the current code is that it generates but doesn't accept bitcode that uses vectors for the first element of a select in this context. llvm-svn: 190634	2013-09-12 22:02:31 +00:00
Krzysztof Parzyszek	de7485af55	In AliasSetTracker, do not change the alias set to "mod/ref" when adding a volatile load, or a volatile store. llvm-svn: 190631	2013-09-12 20:15:50 +00:00
Hal Finkel	1e2e3ea584	Make the PPC fast-math sqrt expansion safe at 0 In fast-math mode sqrt(x) is calculated using the fast expansion of the reciprocal of the reciprocal sqrt expansion. The reciprocal and reciprocal sqrt expansions use the associated estimate instructions along with some Newton iterations. Unfortunately, as a result, sqrt(0) was being calculated as NaN, which is not correct. Now we explicitly return a result of zero if the input is zero. llvm-svn: 190624	2013-09-12 19:04:12 +00:00
Roman Divacky	62cb63543b	Implement asm support for a few PowerPC bookIII that are needed for assembling FreeBSD kernel. llvm-svn: 190618	2013-09-12 17:50:54 +00:00
Filip Pizlo	f2189bf311	This switches CrashRecoveryContext to using ManagedStatic for its global Mutex and global ThreadLocals, thereby getting rid of the load-time initialization of those objects and also getting rid of their destruction unless the LLVM client calls llvm_shutdown. llvm-svn: 190617	2013-09-12 17:46:57 +00:00
Ben Langmuir	1650175de6	Partial support for Intel SHA Extensions (sha1rnds4) Add basic assembly/disassembly support for the first Intel SHA instruction 'sha1rnds4'. Also includes feature flag, and test cases. Support for the remaining instructions will follow in a separate patch. llvm-svn: 190611	2013-09-12 15:51:31 +00:00
Hal Finkel	0096dbd50d	Mark PPC MFTB and DST (and friends) as deprecated Use the new instruction deprecation feature to mark mftb (now replaced with mfspr) and dst (along with the other Altivec cache control instructions) as deprecated when targeting cores supporting at least ISA v2.03. llvm-svn: 190605	2013-09-12 14:40:06 +00:00
Joey Gouly	904d8806ce	Somehow this important part of the patch, where I actually check the Mask, got lost during my iterations of review. Thanks to Hal for spotting it! llvm-svn: 190604	2013-09-12 14:23:19 +00:00
Joey Gouly	db6144e3e3	[LTO] Fix the LTO tool, after my API breakage. Thanks to Zonr Chang! llvm-svn: 190602	2013-09-12 12:55:29 +00:00
Elena Demikhovsky	c2293fc7f2	LLVM interpreter: added a test for insert- extract- value llvm-svn: 190600	2013-09-12 10:52:03 +00:00
Elena Demikhovsky	8e97f0164d	LLVM Interpreter: implementation of "insertvalue" and "extractvalue"; undef constatnt for structure and test for these functions. done by Yuri Veselov (mailto:Yuri.Veselov@intel.com) llvm-svn: 190599	2013-09-12 10:48:23 +00:00
Joey Gouly	0e76fa7df5	Add an instruction deprecation feature to TableGen. The 'Deprecated' class allows you to specify a SubtargetFeature that the instruction is deprecated on. The 'ComplexDeprecationPredicate' class allows you to define a custom predicate that is called to check for deprecation. For example: ComplexDeprecationPredicate<"MCR"> would mean you would have to define the following function: bool getMCRDeprecationInfo(MCInst &MI, MCSubtargetInfo &STI, std::string &Info) Which returns 'false' for not deprecated, and 'true' for deprecated and store the warning message in 'Info'. The MCTargetAsmParser constructor was chaned to take an extra argument of the MCInstrInfo class, so out-of-tree targets will need to be changed. llvm-svn: 190598	2013-09-12 10:28:05 +00:00
Elena Demikhovsky	8952974e29	AVX-512: implemented extractelement with variable index. Added parsing of mask register and "zeroing" semantic, like {%k1} {z}. llvm-svn: 190595	2013-09-12 08:55:00 +00:00
Alexey Samsonov	9c0748a90c	Fixup for r190409: add dep on LZMA only if CMake is cross-compiling llvm-svn: 190591	2013-09-12 08:26:53 +00:00
Hal Finkel	7fe6a5390f	PPC: Enable aggressive anti-dependency breaking Aggressive anti-dependency breaking is enabled by default for all PPC cores. This provides a general speedup on the P7 and other platforms (among other factors, the instruction group formation for the non-embedded PPC cores is done during post-RA scheduling). In order to do this safely, the incompatibility between uses of the MFOCRF instruction and anti-dependency breaking are resolved by marking MFOCRF with hasExtraSrcRegAllocReq. As noted in the removed FIXME, the problem was that MFOCRF's output is sensitive to the identify of the source register, and always paired with a shift to undo this effect. Because anti-dependency breaking is unaware of this hidden dependency of the shift amount on the source register of the MFOCRF instruction, changing that register must be inhibited. Two test cases were adjusted: The SjLj test was made more insensitive to register choices and scheduling; the saveCR test disabled anti-dependency breaking because part of what it is testing is proper register reuse. llvm-svn: 190587	2013-09-12 05:24:49 +00:00
Hal Finkel	6f1ff8e1a8	Fix crash in AggressiveAntiDepBreaker with empty CriticalPathSet If no register classes are added to CriticalPathRCs, then the CriticalPathSet bitmask will be empty. In that case, ExcludeRegs must remain NULL or else this line will cause a segfault: } else if ((ExcludeRegs != NULL) && ExcludeRegs->test(AntiDepReg)) { I have no in-tree test case. llvm-svn: 190584	2013-09-12 04:22:31 +00:00
Tom Stellard	afcf12f33a	R600/SI: expose TBUFFER_STORE_FORMAT_* for OpenGL transform feedback For _XYZ, the type of VDATA is v4i32, because v3i32 doesn't exist. The ADDR64 bit is not exposed. A simpler intrinsic that doesn't take a resource descriptor might be nicer. The maximum number of input SGPRs is bumped to 17. Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 190575	2013-09-12 02:55:14 +00:00
Tom Stellard	7f6fa4c4c5	R600: Don't use trans slot for instructions that read LDS source registers This fixes some regressions in the piglit local memory store tests introduced by recent commits which made the scheduler aware of the trans slot. It's not possible to test this using lit, because there is no way to determine from the assembly dumps whether or not an instruction is in the trans slot. Even if this were possible, the test would be highly sensitive to changes in the scheduler and might generate confusing false negatives. Reviewed-by: Vincent Lejeune<vljn at ovi.com> llvm-svn: 190574	2013-09-12 02:55:06 +00:00
Rui Ueyama	539b1df7c3	Typo fixes. llvm-svn: 190569	2013-09-12 01:43:21 +00:00
Matt Arsenault	bed5bf2e90	Move variable under condition where it is used llvm-svn: 190567	2013-09-12 01:07:58 +00:00
Matt Arsenault	a9e7c7abdc	Fix comment to match what the assert actually enforces llvm-svn: 190566	2013-09-12 01:07:54 +00:00

1 2 3 4 5 ...

95951 Commits