llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	2829c1cf7b	[MachineInstrBuilder] Support for adding a ConstantPoolIndex MO with an additional offset. MachineInstrBuilder::addDisp can already add an immediate or global address MO with an adjusted offset, this patch adds support for constant pool indices as well. All remaining MO types still assert - there are a number of other types that could support adjusted offsets but I have no test cases at this time. Required to fix a regression in D13988 found by Mikael Holmén during stress testing (test case attached). Differential Revision: http://reviews.llvm.org/D14867 llvm-svn: 253795	2015-11-21 21:42:26 +00:00
Sanjay Patel	42afa272ed	move a single test case to where most other instcombine shuffle bug test cases exist llvm-svn: 253784	2015-11-21 16:12:58 +00:00
Simon Pilgrim	162b3f2125	[X86][SSE] Added SSE2 PSUBUS tests llvm-svn: 253783	2015-11-21 13:57:22 +00:00
Simon Pilgrim	b85acbadcc	[X86][SSE] Regenerate TRUNC-SEXT tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 253782	2015-11-21 13:32:29 +00:00
Simon Pilgrim	213f1eda5d	[X86][SSE] Regenerate MINMAX tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 253781	2015-11-21 13:29:42 +00:00
Simon Pilgrim	68288265cc	[X86][SSE] Regenerate PSUBUS tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 253780	2015-11-21 13:25:50 +00:00
Jonas Paulsson	8f0d2b7f1f	[DAGCombiner] Bugfix for lost chain dependency. When MergeConsecutiveStores() combines two loads and two stores into wider loads and stores, the chain users of both of the original loads must be transferred to the new load, because it may be that a chain user only depends on one of the loads. New test case: test/CodeGen/SystemZ/dag-combine-01.ll Reviewed by James Y Knight. Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=25310#c6 llvm-svn: 253779	2015-11-21 13:25:07 +00:00
Simon Pilgrim	967190894a	[X86][AVX] Regenerate AVX splat tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 253778	2015-11-21 13:23:14 +00:00
Simon Pilgrim	d5a154424b	[X86][AVX512] Added AVX512 VMOVLHPS/VMOVHLPS shuffle decode comments. llvm-svn: 253777	2015-11-21 13:04:42 +00:00
Simon Pilgrim	96cbce61b2	[X86][SSE] Legal XMM Register Class ordering for SSE1 It turns out we have a number of places that just grab the first type attached to a register class for various reasons. This is fine unless for some reason that type isn't legal on the current target, such as for SSE1 which doesn't support v16i8/v8i16/v4i32/v2i64 - all of which were included before v4f32 in the class. Given that this is such a rare situation I've just re-ordered the types and placed the float types first. Fix for PR16133 Differential Revision: http://reviews.llvm.org/D14787 llvm-svn: 253773	2015-11-21 12:38:34 +00:00
Teresa Johnson	5bc88be11e	llvm-link option and test for recent metadata mapping bug Summary: Add a -preserve-modules option to llvm-link that simulates LTO clients that don't destroy modules as they are linked. This enables reproduction of a recent bug introduced by a metadata linking change that was only caught when the modules weren't destroyed before writing bitcode (LTO on Windows). See http://llvm.org/viewvc/llvm-project?view=revision&revision=253170 for more details on the original bug and the fix. Confirmed the new test added here reproduces the failure using the new option when I suppress the fix. Reviewers: pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14818 llvm-svn: 253740	2015-11-21 00:35:38 +00:00
NAKAMURA Takumi	0b498c7af8	Move free-zext.ll to llvm/test/Transforms/CodeGenPrepare/AArch64/ llvm-svn: 253730	2015-11-20 22:55:34 +00:00
Owen Anderson	8e85130bb9	Fix another infinite loop in Reassociate caused by Constant::isZero(). Not all zero vectors are ConstantDataVector's. llvm-svn: 253723	2015-11-20 22:34:48 +00:00
Geoff Berry	5256fcada0	[CodeGenPrepare] Create more extloads and fewer ands Summary: Add 'and' instructions immediately after loads that only have their low bits used, assuming that the (and (load x) c) will be matched as an extload and the ands/truncs fed by the extload will be removed by isel. Reviewers: mcrosier, qcolombet, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14584 llvm-svn: 253722	2015-11-20 22:34:39 +00:00
Arnaud A. de Grandmaison	4e89e9f846	[ShrinkWrap] Teach ShrinkWrap to handle targets requiring a register scavenger. The included test only checks for a compiler crash for now. Several people are facing this issue, so we first resolve the crash, and will increase shrinkwrap's coverage later in a follow-up patch. llvm-svn: 253718	2015-11-20 21:54:27 +00:00
Diego Novillo	2dc683c3b9	SamplePGO - Tweak RUN command for a test. NFC. llvm-svn: 253717	2015-11-20 21:46:41 +00:00
Diego Novillo	5fb49e5c5f	SamplePGO - Do not count never-executed inlined functions when computing coverage. If a function was originally inlined but not actually hot at runtime, its samples will not be counted inside the parent function. This throws off the coverage calculation because it expects to find more used records than it should. Fixed by ignoring functions that will not be inlined into the parent. Currently, this is inlined functions with 0 samples. In subsequent patches, I'll change this to mean "cold" functions. llvm-svn: 253716	2015-11-20 21:46:38 +00:00
Jun Bum Lim	80ec0d3f5a	[AArch64]Merge narrow zero stores to a wider store This change merges adjacent zero stores into a wider single store. For example : strh wzr, [x0] strh wzr, [x0, #2] becomes str wzr, [x0] This will fix PR25410. llvm-svn: 253711	2015-11-20 21:14:07 +00:00
Eric Christopher	c180836722	Weak non-function symbols were being accessed directly, which is incorrect, as the chosen representative of the weak symbol may not live with the code in question. Always indirect the access through the TOC instead. Patch by Kyle Butt! llvm-svn: 253708	2015-11-20 20:51:31 +00:00
Bill Seurer	aea3d38d81	Fix test case label check Several (but not all) of the labels that are checked for in this test case are checked as strings instead of labels. This can cause an apparent test case failure if they are tested in an appropriately named directory. For example, one of them that fails: define zeroext i32 @test2(i32 %A.u, i32 %B.u) { ; A8: test2 ; A8: uxtab r0, r0, r1 Output that causes it to fail: . . . .file "/home/seurer/llvm/llvm-test2/test/CodeGen/Thumb2/thumb2-uxt_rot.ll" . . . .globl test2 .align 1 .type test2,%function .code 16 @ @test2 .thumb_func test2: .fnstart The "A8: test2" matches on the directory name instead of the label. llvm-svn: 253702	2015-11-20 20:24:49 +00:00
Artyom Skrobov	91f339ab3f	Handle ARMv6-J as an alias, instead of fake architecture Summary: This follows D14577 to treat ARMv6-J as an alias for ARMv6, instead of an architecture in its own right. The functional change is that the default CPU when targeting ARMv6-J changes from arm1136j-s to arm1136jf-s, which is currently used as the default CPU for ARMv6; both are, in fact, ARMv6-J CPUs. The J-bit (Jazelle support) is irrelevant to LLVM, and it doesn't affect code generation, attributes, optimizations, or anything else, apart from selecting the default CPU. Reviewers: rengolin, logan, compnerd Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14755 llvm-svn: 253675	2015-11-20 16:46:09 +00:00
Diego Novillo	df544a098a	SamplePGO - Add line offset and discriminator information to sample reports. While debugging some sampling coverage problems, I found this useful: When applying samples from a profile, it helps to also know what line offset and discriminator the sample belongs to. This makes it easy to correlate against the input profile. llvm-svn: 253670	2015-11-20 15:39:42 +00:00
Owen Anderson	630077ef55	Fix a pair of issues that caused an infinite loop in reassociate. Terrifyingly, one of them is a mishandling of floating point vectors in Constant::isZero(). How exactly this issue survived this long is beyond me. llvm-svn: 253655	2015-11-20 08:16:13 +00:00
Hrvoje Varga	b65518c15c	[mips][microMIPS] Implement MUL[_S].PH, MULEQ_S.W.PHL, MULEQ_S.W.PHR, MULEU_S.PH.QBL, MULEU_S.PH.QBR, MULQ_RS.PH, MULQ_RS.W, MULQ_S.PH and MULQ_S.W instructions Differential Revision: http://reviews.llvm.org/D14280 llvm-svn: 253651	2015-11-20 07:14:52 +00:00
Dan Gohman	bb7ce8e408	[WebAssembly] Rename SWITCH to TABLESWITCH to match the current wording in the spec. llvm-svn: 253642	2015-11-20 03:02:49 +00:00
Peter Collingbourne	c85f4ced4d	ScalarEvolution: do not set nuw when creating exprs of form <expr> + <all-ones>. The nuw constraint will not be satisfied unless <expr> == 0. This bug has been around since r102234 (in 2010!), but was uncovered by r251052, which introduced more aggressive optimization of nuw scev expressions. Differential Revision: http://reviews.llvm.org/D14850 llvm-svn: 253627	2015-11-20 01:26:13 +00:00
Tobias Edler von Koch	49c9a6e802	[LTO] Add options to llvm-lto to select output format and dump merged module This introduces two new options: - "llvm-lto -save-merged-module -o outfile" dumps the LTO Module to outfile.merged.bc prior to CodeGen and after LTO optimizations have been run. - "llvm-lto -filetype=asm -o outfile" makes llvm-lto emit assembly instead of object code in outfile. Both are intended for use in lit tests. llvm-svn: 253624	2015-11-20 00:13:05 +00:00
Reid Kleckner	cc2f6c35a3	[WinEH] Disable most forms of demotion Now that the register allocator knows about the barriers on funclet entry and exit, testing has shown that this is unnecessary. We still demote PHIs on unsplittable blocks due to the differences between the IR CFG and the Machine CFG. llvm-svn: 253619	2015-11-19 23:23:33 +00:00
Simon Pilgrim	a9912617c8	[X86][SSE4A] Fix issue with EXTRQI shuffles not starting at the correct start index. Found during stress testing. llvm-svn: 253611	2015-11-19 22:13:56 +00:00
Sanjay Patel	c4aa50414b	[InstCombine] add tests to show missing trunc optimizations llvm-svn: 253609	2015-11-19 22:11:52 +00:00
Sanjay Patel	f1c2370c48	[InstCombine] add tests to show missing bitcast optimizations llvm-svn: 253602	2015-11-19 21:32:25 +00:00
Dehao Chen	23e2278e27	Reimplement discriminator assignment algorithm. Summary: The new algorithm is more efficient (O(n), n is number of basic blocks). And it is guaranteed to cover all cases of multiple BB mapped to same line. Reviewers: dblaikie, davidxl, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14738 llvm-svn: 253594	2015-11-19 19:53:05 +00:00
James Molloy	1d695a09dd	[GlobalOpt] Localize some globals that have non-instruction users We currently bail out of global localization if the global has non-instruction users. However, often these can be simple bitcasts or constant-GEPs, which we can easily turn into instructions before localizing. Be a bit more aggressive. llvm-svn: 253584	2015-11-19 18:04:33 +00:00
Jun Bum Lim	4c35ccac91	[AArch64]Extend merging narrow loads into a wider load This change extends r251438 to handle more narrow load promotions including byte type, unscaled, and signed. For example, this change will convert : ldursh w1, [x0, #-2] ldurh w2, [x0, #-4] into ldur w2, [x0, #-4] asr w1, w2, #16 and w2, w2, #0xffff llvm-svn: 253577	2015-11-19 17:21:41 +00:00
Sanjay Patel	ae3680cbcd	this new test file was accidentally left out of r253573 llvm-svn: 253574	2015-11-19 16:39:00 +00:00
Sanjay Patel	4699b8ab6a	[CGP] despeculate expensive cttz/ctlz intrinsics This is another step towards allowing SimplifyCFG to speculate harder, but then have CGP clean things up if the target doesn't like it. Previous patches in this series: http://reviews.llvm.org/D12882 http://reviews.llvm.org/D13297 D13297 should catch most expensive ops, but speculation of cttz/ctlz requires special handling because of weirdness in the intrinsic definition for handling a zero input (that definition can probably be blamed on x86). For example, if we have the usual speculated-by-select expensive op pattern like this: %tobool = icmp eq i64 %A, 0 %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true %cond = select i1 %tobool, i64 64, i64 %0 ret i64 %cond There's an instcombine that will turn it into: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 false) ; is_zero_undef == false This CGP patch is looking for that case and despeculating it back into: entry: %tobool = icmp eq i64 %A, 0 br i1 %tobool, label %cond.end, label %cond.true cond.true: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true br label %cond.end cond.end: %cond = phi i64 [ %0, %cond.true ], [ 64, %entry ] ret i64 %cond This unfortunately may lead to poorer codegen (see the changes in the existing x86 test), but if we increase speculation in SimplifyCFG (the next step in this patch series), then we should avoid those kinds of cases in the first place. The need for this patch was originally mentioned here: http://reviews.llvm.org/D7506 with follow-up here: http://reviews.llvm.org/D7554 Differential Revision: http://reviews.llvm.org/D14630 llvm-svn: 253573	2015-11-19 16:37:10 +00:00
Hans Wennborg	dcc2500452	X86: More efficient legalization of wide integer compares In particular, this makes the code for 64-bit compares on 32-bit targets much more efficient. Example: define i32 @test_slt(i64 %a, i64 %b) { entry: %cmp = icmp slt i64 %a, %b br i1 %cmp, label %bb1, label %bb2 bb1: ret i32 1 bb2: ret i32 2 } Before this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax setae %al cmpl 16(%esp), %ecx setge %cl je .LBB2_2 movb %cl, %al .LBB2_2: testb %al, %al jne .LBB2_4 movl $1, %eax retl .LBB2_4: movl $2, %eax retl After this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax sbbl 16(%esp), %ecx jge .LBB1_2 movl $1, %eax retl .LBB1_2: movl $2, %eax retl Differential Revision: http://reviews.llvm.org/D14496 llvm-svn: 253572	2015-11-19 16:35:08 +00:00
Diego Novillo	ef548d2918	SamplePGO - Sort samples by source location when emitting as text. When dumping function samples or writing them out as text format, it helps if the samples are emitted sorted by source location. The sorting of the maps is a bit slow, so we only do it on demand. llvm-svn: 253568	2015-11-19 15:33:08 +00:00
Zoran Jovanovic	307f80eab1	[mips] Add tests for ROL and ROR macros expansion Author: obucina llvm-svn: 253567	2015-11-19 15:04:31 +00:00
Elena Demikhovsky	7c2c9fd243	AVX-512: Fixed COPY_TO_REGCLASS for mask registers Copying one mask register to another under BW should be done with kmovq instruction, otherwise we can lose some bits. Copying 8 bits under DQ may be done with kmovb. Differential Revision: http://reviews.llvm.org/D14812 llvm-svn: 253563	2015-11-19 13:13:00 +00:00
Artyom Skrobov	444d544e9d	Removing specific target from the generic test llvm-svn: 253562	2015-11-19 12:24:47 +00:00
Simon Pilgrim	846b64e17a	[X86][AVX] Fix lowering of X86ISD::VZEXT_MOVL for 128-bit -> 256-bit extension The lowering patterns for X86ISD::VZEXT_MOVL for 128-bit to 256-bit vectors were just copying the lower xmm instead of actually masking off the first scalar using a blend. Fix for PR25320. Differential Revision: http://reviews.llvm.org/D14151 llvm-svn: 253561	2015-11-19 12:18:37 +00:00
Alexey Bataev	b7b82bf33e	Alternative to long nops for X86 CPUs, by Andrey Turetsky Make X86AsmBackend generate smarter nops instead of a bunch of 0x90 for code alignment for CPUs which don't support long nop instructions. Differential Revision: http://reviews.llvm.org/D14178 llvm-svn: 253557	2015-11-19 11:44:35 +00:00
James Molloy	0ecdbe7d6b	[FunctionAttrs] Provide a mechanism for adding function attributes from the command line This provides a way to force a function to have certain attributes from the command line. This can be useful when debugging or doing workload exploration, where manually editing IR is tedious or not possible (due to build systems etc). The syntax is -force-attribute=function_name:attribute_name All function attributes are parsed except alignstack as it requires an argument. llvm-svn: 253550	2015-11-19 08:49:57 +00:00
Igor Breger	1f78296869	AVX512: Implemented encoding, intrinsics and DAG lowering for VMOVDDUP instructions. Differential Revision: http://reviews.llvm.org/D14702 llvm-svn: 253548	2015-11-19 08:26:56 +00:00
Igor Breger	4424aaa28e	AVX512: Implemented encoding for the vmovss.s and vmovsd.s instructions. Differential Revision: http://reviews.llvm.org/D14771 llvm-svn: 253547	2015-11-19 07:58:33 +00:00
Igor Breger	81b79de54c	AVX512: Implemented encoding for the following instructions: vmovapd.s, vmovaps.s, vmovdqa32.s, vmovdqa64.s, vmovdqu16.s, vmovdqu32.s, vmovdqu64.s, vmovdqu8.s, vmovupd.s, vmovups.s Differential Revision: http://reviews.llvm.org/D14768 llvm-svn: 253546	2015-11-19 07:43:43 +00:00
Elena Demikhovsky	1ca72e1846	Pointers in Masked Load, Store, Gather, Scatter intrinsics The masked intrinsics support all integer and floating point data types. I added the pointer type to this list. Added tests for CodeGen and for Loop Vectorizer. Updated the Language Reference. Differential Revision: http://reviews.llvm.org/D14150 llvm-svn: 253544	2015-11-19 07:17:16 +00:00
Pete Cooper	67cf9a723b	Revert "Change memcpy/memset/memmove to have dest and source alignments." This reverts commit r253511. This likely broke the bots in http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/builds/20202 http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/3787 llvm-svn: 253543	2015-11-19 05:56:52 +00:00
Weiming Zhao	b69babd01e	Fix bug 25440: GVN assertion after coercing loads Optimizations like LoadPRE in GVN will insert new instructions. If the insertion point is in an already processed BB, they should get a value number explicitly. If the insertion point is after current instruction, then just leave it. However, current GVN framework has no support for it. In this patch, we just bail out if a VN can't be found. Differential Revision: http://reviews.llvm.org/D14670 A test/Transforms/GVN/pr25440.ll M lib/Transforms/Scalar/GVN.cpp llvm-svn: 253536	2015-11-19 02:45:18 +00:00
Quentin Colombet	46d5c71135	[X86] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14156 rdar://problem/21118279 llvm-svn: 253528	2015-11-19 00:38:00 +00:00
Reid Kleckner	441d207e66	Disable Go bindings test with MSan, it has tons of linker errors llvm-svn: 253525	2015-11-19 00:05:20 +00:00
Davide Italiano	c5cedd195a	[SimplifyLibCalls] New trick: pow(x, 0.5) -> sqrt(x) under -ffast-math. Differential Revision: http://reviews.llvm.org/D14466 llvm-svn: 253521	2015-11-18 23:21:32 +00:00
Quentin Colombet	f6645cce91	[AArch64] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14360 rdar://problem/20820748 llvm-svn: 253520	2015-11-18 23:12:20 +00:00
Pete Cooper	72bc23ef02	Change memcpy/memset/memmove to have dest and source alignments. Note, this was reviewed (and more details are in) http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html These intrinsics currently have an explicit alignment argument which is required to be a constant integer. It represents the alignment of the source and dest, and so must be the minimum of those. This change allows source and dest to each have their own alignments by using the alignment attribute on their arguments. The alignment argument itself is removed. There are a few places in the code for which the code needs to be checked by an expert as to whether using only src/dest alignment is safe. For those places, they currently take the minimum of src/dest alignments which matches the current behaviour. For example, code which used to read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 500, i32 8, i1 false) will now read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 8 %dest, i8* align 8 %src, i32 500, i1 false) For out of tree owners, I was able to strip alignment from calls using sed by replacing: (call.*llvm\.memset.*)i32\ [0-9]*\,\ i1 false\) with: $1i1 false) and similarly for memmove and memcpy. I then added back in alignment to test cases which needed it. A similar commit will be made to clang which actually has many differences in alignment as now IRBuilder can generate different source/dest alignments on calls. In IRBuilder itself, a new argument was added. Instead of calling: CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, /* isVolatile */ false) you now call CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, SrcAlign, /* isVolatile */ false) There is a temporary class (IntegerAlignment) which takes the source alignment and rejects implicit conversion from bool. This is to prevent isVolatile here from passing its default parameter to the source alignment. Note, changes in future can now be made to codegen. I didn't change anything here, but this change should enable better memcpy code sequences. Reviewed by Hal Finkel. llvm-svn: 253511	2015-11-18 22:17:24 +00:00
Simon Pilgrim	c1a46b729b	[DAGCombiner] Vector constant folding for comparisons This patch adds support for vector constant folding of integer/float comparisons. This requires FoldConstantVectorArithmetic to support scalar constant operands (in this case ISD::CONDCODE). In future we should be able to support other scalar constant types as necessary (and possibly start calling FoldConstantVectorArithmetic for all node creations) Differential Revision: http://reviews.llvm.org/D14683 llvm-svn: 253504	2015-11-18 21:17:19 +00:00
Tim Northover	747ae9a7de	ARM: make sure backend is consistent about exception handling method. It turns out we decide whether to use SjLj exceptions or some alternative in two separate places in the backend, and they disagreed with each other. This led to inconsistent code and is generally a terrible idea. So make them consistent and add an assert that they do match (unfortunately MCAsmInfo isn't available in opt, so it can't be used to initialise the CodeGen version directly). llvm-svn: 253502	2015-11-18 21:10:39 +00:00
Mike Aizatsky	c7810baaa6	Disable gvn non-local speculative loads under asan. Summary: Fix for https://llvm.org/bugs/show_bug.cgi?id=25550 Differential Revision: http://reviews.llvm.org/D14763 llvm-svn: 253498	2015-11-18 20:43:00 +00:00
Betul Buyukkurt	6fac1741c9	[PGO] Value profiling support This change introduces an instrumentation intrinsic instruction for value profiling purposes, the lowering of the instrumentation intrinsic and raw reader updates. The raw profile data files for llvm-profdata testing are updated. llvm-svn: 253484	2015-11-18 18:14:55 +00:00
Artyom Skrobov	763c4ab32d	Removing specific target from the generic test llvm-svn: 253479	2015-11-18 17:50:47 +00:00
Dan Gohman	3d4a20662a	[WebAssembly] Make bogus inline asm strings in tests be comments. These tests aren't testing that the result is valid syntax; they're testing that the compiler emits the inline asm operands correctly. llvm-svn: 253469	2015-11-18 16:28:58 +00:00
Dan Gohman	4ba4816b97	[WebAssembly] Enable register coloring and register stackifying. This also takes the push/pop syntax another step forward, introducing stack slot numbers to make it easier to see how expressions are connected. For example, the value pushed in $push7 is popped in $pop7. And, this begins an experiment with making get_local and set_local implicit when an operation directly uses or defines a register. This greatly reduces clutter. If this experiment succeeds, it may make sense to do this for const instructions as well. And, this introduces more special code for ARGUMENTS; hopefully this code will soon be obviated by proper support for live-in virtual registers. llvm-svn: 253465	2015-11-18 16:12:01 +00:00
Manuel Klimek	272d3f17fc	Fix bug where WinCOFFObjectWriter would assume starting from an empty output. Starting on an input stream that is not at offset 0 would trigger the assert in WinCOFFObjectWriter.cpp:1065: assert(getStream().tell() <= (*i)->Header.PointerToRawData && "Section::PointerToRawData is insane!"); llvm-svn: 253464	2015-11-18 15:24:17 +00:00
Jonas Paulsson	af722f8287	[SelectionDAGBuilder] Make sure DemoteReg ends up in right reg-class. The virtual register containing the address for returned value on stack should in the DAG be represented with a CopyFromReg node and not a Register node. Otherwise, InstrEmitter will not make sure that it ends up in the right register class for the target instruction. SystemZ needs this, because the reg class for address registers is a subset of the general 64 bit register class. test/SystemZ/CodeGen/args-07.ll and args-04.ll updated to run with -verify-machineinstrs. Reviewed by Hal Finkel. llvm-svn: 253461	2015-11-18 14:59:00 +00:00
Igor Laevsky	7310c68e85	Revert "Revert "Strip metadata when speculatively hoisting instructions (r252604)" Failing clang test is now fixed by the r253458. llvm-svn: 253459	2015-11-18 14:50:18 +00:00
James Molloy	ea3bb626d4	[LTO] Appease buildbots take 3 This time I've found a linux box and checked it there. This test now passes. Because I'd introduced an undefined reference in @bar, gold now returns an error. This doesn't matter for the test itself, because it also emits the remarks the test is checking for. But it does cause LIT to notice a nonzero return code which it faults on. llvm-svn: 253454	2015-11-18 12:08:24 +00:00
James Molloy	ec9698a8c8	[LTO] Buildbot appeasing take 2 Let's try again. This time using the right function signature. It's a real pity I can't run this on a darwin machine... llvm-svn: 253453	2015-11-18 11:37:32 +00:00
James Molloy	784adffce1	[LTO] Fix up test/tools/gold/X86/remarks.ll It needs the same fixes as in test/LTO/X86/remarks.ll, but this test appears not to get run on my system (but does on the buildbot). Strange. llvm-svn: 253452	2015-11-18 11:32:14 +00:00
James Molloy	9ad4f22538	[LTO] Add an early run of functionattrs Because we internalize early, we can potentially mark a bunch of functions as norecurse. Do this before globalopt. llvm-svn: 253451	2015-11-18 11:24:42 +00:00
Asaf Badouh	0d957b8b09	[X86][AVX512CD] add mask broadcast intrinsics Differential Revision: http://reviews.llvm.org/D14573 llvm-svn: 253450	2015-11-18 09:42:45 +00:00
Simon Pilgrim	e896f9f8c3	[X86][AVX] Added 256-bit shuffle splat tests. llvm-svn: 253449	2015-11-18 09:39:38 +00:00
Igor Breger	5574730454	AVX512: Implemented encoding for vpextrw.s instruction. Differential Revision: http://reviews.llvm.org/D14766 llvm-svn: 253447	2015-11-18 08:46:16 +00:00
Hrvoje Varga	78409019d9	[mips][microMIPS] Implement DPS.W.PH, DPSQ_S.W.PH, DPSQ_SA.L.W, DPSQX_S.W.PH, DPSQX_SA.W.PH, DPSU.H.QBL, DPSU.H.QBR and DPSX.W.PH instructions Differential Revision: http://reviews.llvm.org/D14058 llvm-svn: 253443	2015-11-18 07:41:35 +00:00
Sanjoy Das	2d16145acf	Teach the inliner to track deoptimization state Summary: This change teaches LLVM's inliner to track and suitably adjust deoptimization state (tracked via deoptimization operand bundles) as it inlines through call sites. The operation is described in more detail in the LangRef changes. Reviewers: reames, majnemer, chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14552 llvm-svn: 253438	2015-11-18 06:23:38 +00:00
Rafael Espindola	449711cb36	Stop producing .data.rel sections. If a section is rw, it is irrelevant if the dynamic linker will write to it or not. It looks like llvm implemented this because gcc was doing it. It looks like gcc implemented this in the hope that it would put all the relocated items close together and speed up the dynamic linker. There are two problems with this: * It doesn't work. Both bfd and gold will map .data.rel to .data and concatenate the input sections in the order they are seen. * If we want a feature like that, it can be implemented directly in the linker since it knows where the dynamic relocations are. llvm-svn: 253436	2015-11-18 06:02:15 +00:00
David Majnemer	8447ff6357	Add a test for r253323 Forgot to do this simultaneously with committing the fix. llvm-svn: 253430	2015-11-18 02:50:39 +00:00
David Majnemer	fbb1c3a70b	[llvm-objdump] Use the COFF export table for additional symbols Most linked executables do not have a symbol table in COFF. However, it is pretty typical to have some export entries. Use those entries to inform the disassembler about potential function definitions and call targets. llvm-svn: 253429	2015-11-18 02:49:19 +00:00
Cong Hou	41cf1a5dfb	Improving edge probabilities computation when choosing the best successor in machine block placement. When looking for the best successor from the outer loop for a block belonging to an inner loop, the edge probability computation can be improved so that edges in the inner loop are ignored. For example, suppose we are building chains for the non-loop part of the following code, and looking for B1's best successor. Assume the true body is very hot, then B3 should be the best candidate. However, because of the existence of the back edge from B1 to B0, the probability from B1 to B3 can be very small, preventing B3 from being its successor. In this patch, when computing the probability of the edge from B1 to B3, the weight on the back edge B1->B0 is ignored, so that B1->B3 will have 100% probability. if (...) do { B0; ... // some branches B1; } while(...); else B2; B3; Differential revision: http://reviews.llvm.org/D10825 llvm-svn: 253414	2015-11-18 00:52:52 +00:00
Quentin Colombet	8cb95b8e51	[ARM] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14357 rdar://problem/21942589 llvm-svn: 253411	2015-11-18 00:40:54 +00:00
Simon Pilgrim	2da4178737	[X86][AVX512] Added AVX512 SHUFP/VPERMILP shuffle decode comments. llvm-svn: 253396	2015-11-17 23:29:49 +00:00
David Blaikie	35c2eebfe4	dwarfdump: support indexed string dumping in dwp based on the STR_OFFSETS component of the index llvm-svn: 253392	2015-11-17 22:39:23 +00:00
Simon Pilgrim	8483df6e24	[X86][AVX512] Added support for AVX512 UNPCK shuffle decode comments. llvm-svn: 253391	2015-11-17 22:35:45 +00:00
Nathan Slingerland	e6e30d5e88	[llvm-profdata] Improve error messaging when merging mismatched profile data Summary: This change tries to make the root cause of instrumented profile data merge failures clearer. Previous: $ llvm-profdata merge test_0.profraw test_1.profraw -o test_merged.profdata test_1.profraw: foo: Function count mismatch test_1.profraw: bar: Function count mismatch test_1.profraw: baz: Function count mismatch ... Changed: $ llvm-profdata merge test_0.profraw test_1.profraw -o test_merged.profdata test_1.profraw: foo: Function basic block count change detected (counter mismatch) Make sure that all profile data to be merged is generated from the same binary. test_1.profraw: bar: Function basic block count change detected (counter mismatch) test_1.profraw: baz: Function basic block count change detected (counter mismatch) ... Reviewers: dnovillo, davidxl, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14739 llvm-svn: 253384	2015-11-17 22:08:53 +00:00
Simon Pilgrim	6095410e09	[X86][SSE] Share AVX1/AVX2 shuffle tests with AVX512 where possible llvm-svn: 253379	2015-11-17 21:19:45 +00:00
Reid Kleckner	c20276d0b2	[WinEH] Move WinEHFuncInfo from MachineModuleInfo to MachineFunction Summary: Now that there is a one-to-one mapping from MachineFunction to WinEHFuncInfo, we don't need to use a DenseMap to select the right WinEHFuncInfo for the current funclet. The main challenge here is that X86WinEHStatePass is an IR pass that doesn't have access to the MachineFunction. I gave it its own WinEHFuncInfo object that it uses to calculate state numbers, which it then throws away. As long as nobody creates or removes EH pads between this pass and SDAG construction, we will get the same state numbers. The other thing X86WinEHStatePass does is to mark the EH registration node. Instead of communicating which alloca was the registration through WinEHFuncInfo, I added the llvm.x86.seh.ehregnode intrinsic. This intrinsic generates no code and simply marks the alloca in use. Reviewers: JCTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14668 llvm-svn: 253378	2015-11-17 21:10:25 +00:00
David Blaikie	c4e2bed738	dwarfdump: Reference the appropriate line table segment when dumping dwp files Also improves .dwo type unit dumping which didn't handle this either. llvm-svn: 253377	2015-11-17 21:08:05 +00:00
Andrew Kaylor	de642cef2c	[EH] Keep filter clauses for types that have been caught. The instruction combiner previously removed types from filter clauses in Landing Pad instructions if the type had previously been seen in a catch clause. This is incorrect and prevents unexpected exception handlers from rethrowing the caught type. Differential Revision: http://reviews.llvm.org/D14669 llvm-svn: 253370	2015-11-17 20:13:04 +00:00
Elena Demikhovsky	3ec9e15ad4	Vector of pointers in function attributes calculation While setting function attributes we check all instructions that may access memory. For a call instruction we check all arguments. The special check is required for pointers. I added vector-of-pointers to the call arguments types that should be checked. Differential Revision: http://reviews.llvm.org/D14693 llvm-svn: 253363	2015-11-17 19:30:51 +00:00
Mike Aizatsky	3ac8a5abbf	enabling sancov tests on linux x86_64 only Differential Revision: http://reviews.llvm.org/D14728 llvm-svn: 253354	2015-11-17 18:25:21 +00:00
Charlie Turner	7968b981bf	[ARM] Don't pessimize i32 vselect. The underlying issues surrounding codegen for 32-bit vselects have been resolved. The pessimistic costs for 64-bit vselects remain due to the bad scalarization that is still happening there. I tested this on A57 in T32, A32 and A64 modes. I saw no regressions, and some improvements. From my benchmarks, I saw these improvements in A57 (T32) spec.cpu2000.ref.177_mesa 5.95% lnt.SingleSource/Benchmarks/Shootout/strcat 12.93% lnt.MultiSource/Benchmarks/MiBench/telecomm-CRC32/telecomm-CRC32 11.89% I also measured A57 A32, A53 T32 and A9 T32 and found no performance regressions. I see much bigger wins in third-party benchmarks with this change Differential Revision: http://reviews.llvm.org/D14743 llvm-svn: 253349	2015-11-17 17:25:15 +00:00
Ahmed Bougacha	88ddeae8bd	[AArch64] Promote f16 SELECT_CC CC operands when op is legal. SELECT_CC has the nasty property of having operands with unrelated types. So if you do something like: f32 = select_cc f16, f16, f32, f32, cc You'd only look for the action for <select_cc, f32>, but never f16. If the types are all legal, but the op isn't (as for f16 on AArch64, or for f128 on x86_64/AArch64?), then you get into trouble. For f128, we have softenSetCCOperands to handle this case. Similarly, for f16, we can directly promote the CC operands. llvm-svn: 253344	2015-11-17 16:45:40 +00:00
Dan Gohman	6c1b0b9a88	Update DebugInfo tests for the change in DEBUG_VALUE output in r253338. llvm-svn: 253340	2015-11-17 16:15:11 +00:00
Pat Gavlin	c8ea157811	Lower statepoints with multi-def targets. Statepoint lowering currently expects that the target method of a statepoint only defines a single value. This precludes using statepoints with ABIs that return values in multiple registers (e.g. the SysV AMD64 ABI). This change adds support for lowering statepoints with multi-def targets. llvm-svn: 253339	2015-11-17 16:04:21 +00:00
Dan Gohman	7aa4abac24	Use TargetRegisterInfo for printing MachineOperand register comments Several places in AsmPrinter.cpp print comments describing MachineOperand registers using MCRegisterInfo, which uses MCOperand-oriented names. This doesn't work for targets that use virtual registers exclusively, as WebAssembly does, since virtual registers are represented and printed differently. This patch preserves what seems to be the spirit of r229978, avoiding the use of TM.getSubtargetImpl(), while still using MachineOperand-oriented printing for MachineOperands. Differential Revision: http://reviews.llvm.org/D14709 llvm-svn: 253338	2015-11-17 16:01:28 +00:00
Charlie Turner	b4613c6973	[ARM] Match VABDL from log2 shuffles. Differential Revision: http://reviews.llvm.org/D14664 llvm-svn: 253334	2015-11-17 13:21:35 +00:00
Zlatko Buljan	72a7f9c1f5	[mips][microMIPS] Implement EXTP, EXTPDP, EXTPDPV, EXTPV, EXTR[_RS].W, EXTR_S.H, EXTRV[_RS].W and EXTRV_S.H instructions Differential Revision: http://reviews.llvm.org/D14174 llvm-svn: 253332	2015-11-17 12:54:15 +00:00
Zlatko Buljan	246b21f66a	[mips][microMIPS] Implement SUBQ[_S].PH, SUBQ_S.W, SUBQH[_R].PH, SUBQH[_R].W, SUBU[_S].PH, SUBU[_S].QB and SUBUH[_R].QB instructions Differential Revision: http://reviews.llvm.org/D14114 llvm-svn: 253329	2015-11-17 10:11:22 +00:00
Oliver Stannard	9be59af3ab	[Assembler] Make fatal assembler errors non-fatal Currently, if the assembler encounters an error after parsing (such as an out-of-range fixup), it reports this as a fatal error, and so stops after the first error. However, for most of these there is an obvious way to recover after emitting the error, such as emitting the fixup with a value of zero. This means that we can report on all of the errors in a file, not just the first one. MCContext::reportError records the fact that an error was encountered, so we won't actually emit an object file with the incorrect contents. Differential Revision: http://reviews.llvm.org/D14717 llvm-svn: 253328	2015-11-17 10:00:43 +00:00
Oliver Stannard	07b43d39a8	[Assembler] Allow non-fatal errors after parsing This adds reportError to MCContext, which can be used as an alternative to reportFatalError when the assembler wants to try to continue processing the rest of the file after the error is reported, so that all of the errors in a file can be reported. It records the fact that an error was encountered, so we can avoid emitting an object file if any errors occurred. This patch doesn't add any uses of this function (a later patch will convert most uses of reportFatalError to use it), but there is a small functional change: we use the SourceManager to print the error message, even if we have a null SMLoc. This means that we get a SourceManager-style message, with the file and line information shown as <unknown>, rather than the "LLVM ERROR" style used by report_fatal_error. llvm-svn: 253327	2015-11-17 09:58:07 +00:00
Zlatko Buljan	3e0588d033	[mips][microMIPS] Implement PRECEQ.W.PHL, PRECEQ.W.PHR, PRECEQU.PH.QBL, PRECEQU.PH.QBLA, PRECEQU.PH.QBR, PRECEQU.PH.QBRA, PRECEU.PH.QBL, PRECEU.PH.QBLA, PRECEU.PH.QBR and PRECEU.PH.QBRA instructions Differential Revision: http://reviews.llvm.org/D14279 llvm-svn: 253326	2015-11-17 09:43:29 +00:00
Igor Breger	a8c9ec85ce	AVX512 : regenerate the test file against trunk. Differential Revision: http://reviews.llvm.org/D14742 llvm-svn: 253321	2015-11-17 08:03:43 +00:00
Zlatko Buljan	d1dea944b1	Added microMIPSDSPr1 assembler and disassembler tests to existing microMIPSDSPr2 test files. llvm-svn: 253320	2015-11-17 07:58:27 +00:00
Rafael Espindola	65e4902156	Drop prelink support. The way prelink used to work was * The compiler decides if a given section only has relocations that are known to point to the same DSO. If so, it names it .data.rel.ro.local<something>. * The static linker puts all of these together. * The prelinker program assigns addresses to each library and resolves the local relocations. There are many problems with this: * It is incompatible with address space randomization. * The information passed by the compiler is redundant. The linker knows if a given relocation is in the same DSO or not. It could sort by that if so desired. * There are newer ways of speeding up DSO (gnu hash for example). * Even if we want to implement this again in the compiler, the previous implementation is pretty broken. It talks about relocations that are "resolved by the static linker". If they are resolved, there are none left for the prelinker. What one needs to track is if an expression will require only dynamic relocations that point to the same DSO. At this point it looks like the prelinker is an historical curiosity. For example, fedora has retired it because it failed to build for two releases (http://pkgs.fedoraproject.org/cgit/prelink.git/commit/?id=eb43100a8331d91c801ee3dcdb0a0bb9babfdc1f) This patch removes support for it. That is, it stops printing the ".local" sections. llvm-svn: 253280	2015-11-17 00:51:23 +00:00
David Blaikie	82641be467	dwarfdump: Use the index to find the right abbrev offset in DWP files llvm-svn: 253277	2015-11-17 00:39:55 +00:00
Derek Schuff	71e8169ea8	[WebAssembly] Fix printing of global operands This was regressed in r252656 which wasn't quite NFC. Instead of using a custom instruction as before, use a pattern to select CONST_I32 for the global addrs. Differential Revision: http://reviews.llvm.org/D14587 llvm-svn: 253276	2015-11-17 00:20:44 +00:00
Philip Reames	b6e8fe3dac	[PRE] Preserve !invariant.load metadata Spotted via inspection. Test case included. llvm-svn: 253275	2015-11-17 00:15:09 +00:00
Derek Schuff	46e3316888	[WebAssembly] Fix function return type printing Summary: Previously return type information for a function was derived from return dag nodes. But this didn't work for dags with != return node. So instead compute it directly from the LLVM function as is done for imports. Differential Revision: http://reviews.llvm.org/D14593 llvm-svn: 253251	2015-11-16 21:12:41 +00:00
Derek Schuff	4ed4778419	[WebAssembly] Reverse the order of operands for br_if Summary: This is to match the new version in the spec Reviewers: sunfish Subscribers: jfb, llvm-commits, dschuff Differential Revision: http://reviews.llvm.org/D14519 llvm-svn: 253249	2015-11-16 21:04:51 +00:00
Kit Barton	9c432ae111	Find available scratch register to use in function prologue and epilogue as part of shrink wrapping. Phabricator: http://reviews.llvm.org/D13955 llvm-svn: 253247	2015-11-16 20:22:15 +00:00
Reid Kleckner	c397b26790	[WinEH] Don't let UnwindHelp alias the return address On top of that, don't bother allocating and initializing UnwindHelp if we don't have any funclets. Currently we always use RBP as our frame pointer when funclets are present, so this change makes it impossible to come here without any fixed stack objects. Fixes PR25533. llvm-svn: 253245	2015-11-16 18:47:25 +00:00
Owen Anderson	2de9f545aa	Add intermediate subtract instructions to reassociation worklist. We sometimes create intermediate subtract instructions during reassociation. Adding these to the worklist to revisit exposes many additional reassociation opportunities. Patch by Aditya Nandakumar. llvm-svn: 253240	2015-11-16 18:07:30 +00:00
David Majnemer	7378e7a333	[LoopStrengthReduce] Don't increment iterator past the end of the BB We tried to move the insertion point beyond instructions like landingpad and cleanuppad. However, we also tried to move past catchpad. This is problematic because catchpad is also a terminator. This fixes PR25541. llvm-svn: 253238	2015-11-16 17:37:58 +00:00
Vasileios Kalintiris	88faf6d697	[mips] Disable code generation through FastISel for MIPS32R6. Reviewers: dsanders Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D14708 llvm-svn: 253225	2015-11-16 17:05:01 +00:00
Oliver Stannard	9327a7575b	[ARM,AArch64] Store source location of asm constant pool entries Storing the source location of the expression that created a constant pool entry allows us to emit better error messages if we later discover that the expression cannot be represented by a relocation. Differential Revision: http://reviews.llvm.org/D14646 llvm-svn: 253220	2015-11-16 16:25:47 +00:00
Oliver Stannard	09be060606	[ARM,AArch64] Store source location for values in assembly files The MCValue class can store a SMLoc to allow better error messages to be emitted if an error is detected after parsing. The ARM and AArch64 assembly parsers were not setting this, so error messages did not have source information. Differential Revision: http://reviews.llvm.org/D14645 llvm-svn: 253219	2015-11-16 16:22:47 +00:00
Daniel Sanders	6b6679276c	[mips][ias] Remove spurious ';' from inline assembly test. IAS will not emit it. NFC at the moment but will prevent a test failure once IAS is enabled. llvm-svn: 253210	2015-11-16 14:19:32 +00:00
Daniel Sanders	7d0662cdac	[mips][ias] Accept $31 or $ra in hf16call32.ll. IAS prints the latter. NFC at the moment, but it will prevent a test failure once IAS is enabled. llvm-svn: 253209	2015-11-16 14:16:45 +00:00
Daniel Sanders	00a4aacecc	[mips][ias] Allow whitespace after commas in inlineasm*.ll tests. IAS always prints whitespace after a comma. NFC at the moment but this will prevent failures when IAS is enabled. llvm-svn: 253208	2015-11-16 14:14:59 +00:00
Artyom Skrobov	f187a65f99	Handle ARMv6KZ naming Summary: * ARMv6KZ is the "canonical" name, given in the ARMARM * ARMv6Z is an "official abbreviation" for it, mentioned in the ARMARM * ARMv6ZK is a popular misspelling, which we should support as an alias. The patch corrects the handling of the names. Functional changes: * ARMv6Z no longer treated as an architecture in its own right * ARMv6ZK renamed to ARMv6KZ, accepting ARMv6ZK as an alias * arm1176jz-s and arm1176jzf-s recognized as ARMv6ZK, instead of ARMv6K * default ARMv6K CPU changed to arm1176j-s Reviewers: rengolin, logan, compnerd Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14568 llvm-svn: 253206	2015-11-16 14:05:32 +00:00
James Molloy	2018091e87	Properly check if a CMPZ node is in fact comparing against zero This was left implicit and never ever checked, which means we could have a CMPZ against some non-zero value and we were carrying on with BFI conversion regardless. Caught by Oliver Stannard using csmith; regression test added. llvm-svn: 253195	2015-11-16 10:49:25 +00:00
Pavel Labath	978060ce2f	Don't generate discriminators for calls to debug intrinsics Summary: This fails a check in Verifier.cpp, which checks for location matches between the declared variable and the !dbg attachments. Reviewers: dnovillo, dblaikie, danielcdh Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14657 llvm-svn: 253194	2015-11-16 10:40:38 +00:00
Oliver Stannard	db9081bf89	[AArch64] ldr= pseudo-instruction silently ignored if register invalid The AArch64 assembler was silently ignoring instructions like this: ldr foo, =bar AArch64AsmParser::parseOperand was returning true as the parse failed, but was not calling AArch64AsmParser::Error to report this to the user, so the instruction was ignored without printing an error message. Differential Revision: http://reviews.llvm.org/D14651 llvm-svn: 253193	2015-11-16 10:25:19 +00:00
Keno Fischer	6c543c501d	Fix r253186 test case Referencing a DILocation whose scope is a different subprogram causes an assertion failure. llvm-svn: 253187	2015-11-16 08:25:14 +00:00
Keno Fischer	b011c63d19	[DIBuilder] Make createReferenceType take size and align Summary: Since we're passing references to dbg.value as pointers, we need to have the frontend properly declare their sizes and alignments (as it already does for regular pointers) in preparation for my upcoming patch to have the verifier check that the sizes agree. Also augment the backend logic that skips actually emitting this information into DWARF such that it also handles reference types. Reviewers: aprantl, dexonsmith, dblaikie Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D14275 llvm-svn: 253186	2015-11-16 07:57:32 +00:00
Igor Breger	24cab0fa06	AVX512: Implemented encoding and intrinsics for VMOVSHDUP/VMOVSLDUP instructions. Differential Revision: http://reviews.llvm.org/D14322 llvm-svn: 253185	2015-11-16 07:22:00 +00:00
Keno Fischer	86c95b5642	[Sink] Don't move landingpads Summary: Moving landingpads into successor basic blocks makes the verifier sad. Teach Sink that much like PHI nodes and terminator instructions, landingpads (and cleanuppads, etc.) may not be moved between basic blocks. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14475 llvm-svn: 253182	2015-11-16 04:47:58 +00:00
James Molloy	9c7d4d8855	[GlobalOpt] Demote globals to locals more aggressively Global to local demotion can speed up programs that use globals a lot. It is particularly useful with LTO, when the entire call graph is known and most functions have been internalized. For a global to be demoted, it must only be accessed by one function and that function: 1. Must never recurse directly or indirectly, else the GV would be clobbered. 2. Must never rely on the value in GV at the start of the function (apart from the initializer). GlobalOpt can already do this, but it is hamstrung and only ever tries to demote globals inside "main", because C++ gives extra guarantees about how main is called - once and only once. In LTO mode, we can often prove the first property (if the function is internal by this point, we know enough about the callgraph to determine if it could possibly recurse). FunctionAttrs now infers the "norecurse" attribute for this reason. The second property can be proven for a subset of functions by proving that all loads from GV are dominated by a store to GV. This is conservative in the name of compile time - this only requires a DominatorTree which is fairly cheap in the grand scheme of things. We could do more fancy stuff with MemoryDependenceAnalysis too to catch more cases but this appears to catch most of the useful ones in my testing. llvm-svn: 253168	2015-11-15 14:21:37 +00:00
Igor Breger	3ff8ef9eb7	Revert r253160. It introduced a layering violation. Reproducible with BUILD_SHARED_LIBS=ON. llvm-svn: 253163	2015-11-15 12:19:11 +00:00
Elena Demikhovsky	121d49b640	Fixed GEP visitor in the InstCombine pass. The current implementation of GEP visitor in InstCombine fails with assertion on Vector GEP with mix of scalar and vector types, like this: getelementptr double, double* %a, <8 x i32> %i (It fails to create a "sext" from <8 x i32> to <8 x i64>) I fixed it and added some tests. Differential Revision: http://reviews.llvm.org/D14485 llvm-svn: 253162	2015-11-15 08:19:35 +00:00
Igor Breger	aa40ddd3ba	AVX512: Implemented encoding and intrinsics for VMOVSHDUP/VMOVSLDUP instructions. Differential Revision: http://reviews.llvm.org/D14322 llvm-svn: 253160	2015-11-15 07:23:13 +00:00
Dan Gohman	19601fbc8a	[WebAssembly] Make indentation consistent with the other testcases. NFC. llvm-svn: 253149	2015-11-14 23:17:07 +00:00
Dan Gohman	8ad045c1d1	[WebAssembly] Support signext, zeroext, and several other function attributes. llvm-svn: 253148	2015-11-14 23:15:41 +00:00
Dan Gohman	c17e140b39	[WebAssembly] Change int_wasm_memory_size from IntrNoMem to IntrReadMem. llvm-svn: 253147	2015-11-14 23:02:31 +00:00
Simon Pilgrim	0de179b23b	[X86][SSE] Fixed arch/triple and regenerated results. Tidyup before diffs from new patch. llvm-svn: 253144	2015-11-14 20:42:01 +00:00
Simon Pilgrim	96d34d34b0	[X86][SSE] Added extra vector truncation tests Baseline comparison to D14588 llvm-svn: 253132	2015-11-14 15:23:59 +00:00
Michael Zolotukhin	8ef44f93ca	Don't recompute LCSSA after loop-unrolling when possible. Summary: Currently we always recompute LCSSA for outer loops after unrolling an inner loop. That leads to compile time problem when we have big loop nests, and we can solve it by avoiding unnecessary work. For instance, if we only do partial unrolling, we don't break LCSSA, so we don't need to rebuild it. Also, if all exits from the inner loop are inside the enclosing loop, then complete unrolling won't break LCSSA either. I replaced unconditional LCSSA recomputation with conditional recomputation + unconditional assert and added several tests, which were failing when I experimented with it. Soon I plan to follow up with a similar patch for recalculation of dominators tree. Reviewers: hfinkel, dexonsmith, bogner, joker.eph, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14526 llvm-svn: 253126	2015-11-14 05:51:41 +00:00
Quentin Colombet	2cdcfd23cd	[ShrinkWrapping] Disable the optimization for functions with sanitize like attribute. Even if the target supports shrink-wrapping, the prologue and epilogue must not move because a crash can happen anywhere and sanitizers need to be able to unwind from the PC of the crash. llvm-svn: 253116	2015-11-14 01:55:17 +00:00
Chad Rosier	cc299b627d	[LIR] Add support for creating memcpys from loops with a negative stride. This allows us to transform the below loop into a memcpy. void test(unsigned *__restrict__ a, unsigned *__restrict__ b) { for (int i = 2047; i >= 0; --i) { a[i] = b[i]; } } This is the memcpy version of r251518, which added support for memset with negative strided loops. llvm-svn: 253091	2015-11-13 21:51:02 +00:00
Reid Kleckner	75b4be9a11	[WinEH] Fix ESP management with 32-bit __CxxFrameHandler3 The C++ EH personality automatically restores ESP from the C++ EH registration node after a catchret. I mistakenly thought it was like SEH, which does not restore ESP. It makes sense for C++ EH to differ from SEH here because SEH does not use funclets for catches, and does not allow catching inside of finally. C++ EH may need to unwind through multiple catch funclets and eventually catchret to some outer funclet. Therefore, the runtime has to keep track of which ESP to use with catchret, rather than having the compiler reload it manually. llvm-svn: 253084	2015-11-13 21:27:00 +00:00
Evgeniy Stepanov	447bbdb171	[safestack] Rewrite isAllocaSafe using SCEV. Use ScalarEvolution to calculate memory access bounds. Handle function calls based on readnone/nocapture attributes. Handle memory intrinsics with constant size. This change improves both recall and precision of IsAllocaSafe. See the new tests (ex. BitCastWide) for the kind of code that was wrongly classified as safe. SCEV efficiency seems to be limited by the fact the SafeStack runs late (in CodeGenPrepare), and many loops are unrolled or otherwise not in LCSSA. llvm-svn: 253083	2015-11-13 21:21:42 +00:00
Diego Novillo	8e415a821f	SamplePGO - Add dump routines for LineLocation, SampleRecord and FunctionSamples llvm-svn: 253071	2015-11-13 20:24:28 +00:00
Cong Hou	ef4074bac2	[X86][SSE] Combine UNPCKL with vector_shuffle into UNPCKH to save one instruction for sext from v16i8 to v16i16 and v8i16 to v8i32. This patch is enabling combining UNPCKL with vector_shuffle that moves the upper half of a vector into the lower half, into a UNPCKH instruction. For example: t2: v16i8 = vector_shuffle<8,9,10,11,12,13,14,15,u,u,u,u,u,u,u,u> t1, undef:v16i8 t3: v16i8 = X86ISD::UNPCKL undef:v16i8, t2 will be combined to: t3: v16i8 = X86ISD::UNPCKH undef:v16i8, t1 Differential revision: http://reviews.llvm.org/D14399 llvm-svn: 253067	2015-11-13 19:47:43 +00:00
David Blaikie	8e8dd57e0b	dwarfdump: Add support for dumping the table contents of DWP indexes This is a recommit of 252842 which was reverted in 252859. The issue was using %s format specifier for a StringRef - used Format's left_justify(StringRef, int) instead. It'd be nice to have __attribute__((format(..))) on llvm::format, but apparently it's only implemented for c-style variadics, not C++ variadic templates. Perhaps we could fix that & conditionalize the attribute on such... llvm-svn: 253065	2015-11-13 19:18:49 +00:00
Reid Kleckner	82a6d4bf5c	Add missing triple to WinEH test case llvm-svn: 253062	2015-11-13 19:11:12 +00:00
Reid Kleckner	94b57065c6	[WinEH] Make UnwindHelp a fixed stack object allocated after XMM CSRs Now the offset of UnwindHelp in our EH tables and the offset that we store to in the prologue agree. llvm-svn: 253059	2015-11-13 19:06:01 +00:00
Tom Stellard	f9f5f12ce7	ELFYAML: Add support for parsing AMDGPU section attribute flags Reviewers: silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14444 llvm-svn: 253052	2015-11-13 17:06:29 +00:00
James Molloy	b564098c62	[ARM] Replace ARMISD::RBIT with ISD::BITREVERSE ISD::BITREVERSE matches "rbit" completely, so remove ARMISD::RBIT and mark ISD::BITREVERSE as legal, adding a test for lowering. llvm-svn: 253047	2015-11-13 16:05:22 +00:00
Zlatko Buljan	32fb5c40d2	[mips][microMIPS] Implement SHRA[_R].PH, SHRAV[_R].PH, SHRAV[_R].QB, SHRAV_R.W, SHRA_R.W, SHRL.PH, SHRL.QB, SHRLV.PH and SHRLV.QB instructions Differential Revision: http://reviews.llvm.org/D14010 llvm-svn: 253041	2015-11-13 13:14:25 +00:00
Daniel Sanders	dd0eb2bbdd	[mips][ias] Explicitly disable IAS on asm-large-immediate.ll. NFC at the moment but it will prevent a failure when IAS is enabled by default. llvm-svn: 253039	2015-11-13 13:02:31 +00:00
Daniel Sanders	05d81d8286	[mips][ias] Replace invalid assembly insn in test since IAS parses inline assembly. This is NFC at the moment but will prevent this test from failing when IAS is the default. llvm-svn: 253033	2015-11-13 11:44:00 +00:00
James Molloy	67ca6edbb1	[AArch64] Check the expansion of BITREVERSE in regression test Something I missed from Hal's review, rightly pointed out by Ben Kramer - we should make sure the expansion is properly checked as it can be easy for bugs to creep in. I've checked the scalar i8 expansion here and the vector i8 expansion in a previous commit. llvm-svn: 253024	2015-11-13 10:05:31 +00:00
James Molloy	bb1dbf530a	[SDAG] Fix expansion of BITREVERSE Richard Trieu noted that UBSan detected an overflowing shift, and the obvious fix caused a crash. What was happening was that the shiftee (1U) was indeed too small for the possible range of shifts it had to handle, but also we were using "VT.getSizeInBits()" to get the maximum type bitwidth, but we wanted "VT.getScalarSizeInBits()" to get the vector lane size instead of the entire vector size. Use an APInt for the shift and VT.getScalarSizeInBits(). llvm-svn: 253023	2015-11-13 10:02:36 +00:00
NAKAMURA Takumi	7706fe58d2	llvm/test/tools/llvm-profdata/text-format-errors.test: Use prepared version of the input file, instead of using echo. ...and s/\C9/\xC9/ llvm-svn: 253014	2015-11-13 06:06:58 +00:00
Nathan Slingerland	4f82366759	[llvm-profdata] Add check for text profile formats and improve error reporting (2nd try) Summary: This change addresses two possible instances of user error / confusion when merging sampled profile data. Previously any input that didn't match the raw or processed instrumented format would automatically be interpreted as instrumented profile text format data. No error would be reported during the merge. Example: If foo-sampled.profdata and bar-sampled.profdata are binary sampled profiles: Old behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -output foobar-sampled.profdata $ llvm-profdata show -sample foobar-sampled.profdata error: foobar-sampled.profdata:1: Expected 'mangled_name:NUM:NUM', found lprofi This change adds basic checks for valid input data when assuming text input. It also makes error messages related to file format validity more specific about the assumed profile data type. New behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -o foobar-sampled.profdata error: foo.profdata: Unrecognized instrumentation profile encoding format Perhaps you forgot to use the -sample option? Reviewers: bogner, davidxl, dnovillo Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14558 llvm-svn: 253009	2015-11-13 03:47:58 +00:00
Colin LeMahieu	fa5558307b	[Hexagon] NFC. Adding a number of packet correctness tests. llvm-svn: 253000	2015-11-13 01:46:06 +00:00
Dan Gohman	f19ed56288	[WebAssembly] Inline asm support. llvm-svn: 252997	2015-11-13 01:42:29 +00:00
Colin LeMahieu	8bb168b160	[Hexagon] Adding relaxation functionality to backend and test. llvm-svn: 252989	2015-11-13 01:12:25 +00:00
Joseph Tremoulet	149c433bcc	[WinEH] Find root frame correctly in CLR funclets Summary: The value that the CoreCLR personality passes to a funclet for the establisher frame may be the root function's frame or may be the parent funclet's (mostly empty) frame in the case of nested funclets. Each funclet stores a pointer to the root frame in its own (mostly empty) frame, as does the root function itself. All frames allocate this slot at the same offset, measured from the post-prolog stack pointer, so that the same sequence can accept any ancestor as an establisher frame parameter value, and so that a single offset can be reported to the GC, which also looks at this slot. This change allocates the slot when processing function entry, and records its frame index on the WinEHFuncInfo object, then inserts the code to set/copy it during prolog emission. Reviewers: majnemer, AndyAyers, pgavlin, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14614 llvm-svn: 252983	2015-11-13 00:39:23 +00:00
Dan Gohman	058fce5435	[WebAssembly] Introduce a new pseudo-operand for unused expression results. llvm-svn: 252975	2015-11-13 00:21:05 +00:00
Vyacheslav Klochkov	cbc56baae6	X86-FMA3: Implemented commute transformations for FMA_Int instructions. It made it possible to apply the memory folding optimization for the 2nd operand of FMA_Int instructions. Reviewer: Quentin Colombet Differential Revision: http://reviews.llvm.org/D14550 llvm-svn: 252973	2015-11-13 00:07:35 +00:00
Colin LeMahieu	7a92c6ecbb	[Hexagon] Adding checks for values out of operand range and correct new-value producer usage. llvm-svn: 252969	2015-11-12 23:28:01 +00:00
Colin LeMahieu	a1fa71ead9	[Hexagon] Adding test to make sure labels and register pairs are correctly parsed. llvm-svn: 252968	2015-11-12 22:54:14 +00:00
Sanjay Patel	fbaf5a9534	specify triple and tighten checks using update_llc_test_checks.py llvm-svn: 252962	2015-11-12 22:27:38 +00:00
Tom Stellard	0967c91e0c	Revert "Remove unnecessary call to getAllocatableRegClass" This reverts commit r252565. This also includes the revert of the commit mentioned below in order to avoid breaking tests in AMDGPU: Revert "AMDGPU: Set isAllocatable = 0 on VS_32/VS_64" This reverts commit r252674. llvm-svn: 252956	2015-11-12 21:43:25 +00:00
Mike Aizatsky	ba8a5b1f91	disabling sancov tests: too many failures on different platforms. Differential Revision: http://reviews.llvm.org/D14624 llvm-svn: 252945	2015-11-12 20:47:12 +00:00
Mike Aizatsky	67e3d651f5	sancov tests - platform independent separators llvm-svn: 252943	2015-11-12 20:17:49 +00:00
Tobias Grosser	8241795d20	Revert "Fix bug 25440: GVN assertion after coercing loads" This reverts 252919 which broke LNT: MultiSource/Applications/SPASS llvm-svn: 252936	2015-11-12 20:04:21 +00:00
Mike Aizatsky	14a06ac05e	sancov test suite Differential Revision: http://reviews.llvm.org/D14589 llvm-svn: 252933	2015-11-12 19:34:21 +00:00
Teresa Johnson	ba5d68dfff	[ThinLTO] Update test to be more tolerant of ordering changes Update the ThinLTO function importing test to use DAG forms of checks so that it is more tolerant of changes to relative ordering between imported decls/defs. This reduces the number of changes required by the comdat importing patch I am sending for review shortly. llvm-svn: 252932	2015-11-12 19:31:46 +00:00
Nathan Slingerland	911ced6bf3	reverting r252916 to investigate test failure llvm-svn: 252921	2015-11-12 18:39:26 +00:00
Weiming Zhao	eed0145dd2	Fix bug 25440: GVN assertion after coercing loads Summary: when coercing loads, it inserts some instructions, which have no GV assigned. https://llvm.org/bugs/show_bug.cgi?id=25440 Reviewers: hfinkel, dberlin Subscribers: dberlin, llvm-commits Differential Revision: http://reviews.llvm.org/D14479 llvm-svn: 252919	2015-11-12 18:19:59 +00:00
Quentin Colombet	94dc1e0d34	[ShrinkWrap] Make sure we do not mess up with EH funclet lowering. ShrinkWrapping does not understand exception handling constraints for now, so make sure we do not mess with them by aborting on functions that use EH funclets. llvm-svn: 252917	2015-11-12 18:13:42 +00:00
Nathan Slingerland	f0e107e38a	[llvm-profdata] Add check for text profile formats and improve error reporting Summary: This change addresses two possible instances of user error / confusion when merging sampled profile data. Previously any input that didn't match the raw or processed instrumented format would automatically be interpreted as instrumented profile text format data. No error would be reported during the merge. Example: If foo-sampled.profdata and bar-sampled.profdata are binary sampled profiles: Old behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -output foobar-sampled.profdata $ llvm-profdata show -sample foobar-sampled.profdata error: foobar-sampled.profdata:1: Expected 'mangled_name:NUM:NUM', found lprofi This change adds basic checks for valid input data when assuming text input. It also makes error messages related to file format validity more specific about the assumed profile data type. New behavior: $ llvm-profdata merge foo-sampled.profdata bar-sampled.profdata -o foobar-sampled.profdata error: foo.profdata: Unrecognized instrumentation profile encoding format Perhaps you forgot to use the -sample option? Reviewers: bogner, davidxl, dnovillo Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14558 llvm-svn: 252916	2015-11-12 18:06:18 +00:00
Dan Gohman	cf4748f180	[WebAssembly] Reapply r252858, with svn add for the new file. Switch to MC for instruction printing. This encompasses several changes which are all interconnected: - Use the MC framework for printing almost all instructions. - AsmStrings are now live. - This introduces an indirection between LLVM vregs and WebAssembly registers, and a new pass, WebAssemblyRegNumbering, for computing a basic mapping. This addresses some basic issues with argument registers and unused registers. - The way ARGUMENT instructions are handled no longer generates redundant get_local+set_local for every argument. This also changes the assembly syntax somewhat; most notably, MC's printing does not use sigils on label names, so those are no longer present, and push/pop now have a sigil to keep them unambiguous. The usage of set_local/get_local/$push/$pop will continue to evolve significantly. This patch is just one step of a larger change. llvm-svn: 252910	2015-11-12 17:04:33 +00:00
Michael Zuckerman	fd3fe9e45a	[x86] translating "fp" (floating point) instructions from {fadd,fdiv,fmul,fsub,fsubr,fdivr} to {faddp,fdivp,fmulp,fsubp,fsubrp,fdivrp} LLVM is missing the following instructions: fadd\fdiv\fmul\fsub\fsubr\fdivr. GAS and MS support these instructions and lower them into faddp\fdivp\fmulp\fsubp\fsubrp\fdivrp instructions. Differential Revision: http://reviews.llvm.org/D14217 llvm-svn: 252908	2015-11-12 16:58:51 +00:00
Hans Wennborg	7384a2de02	Revert r252858: "[WebAssembly] Switch to MC for instruction printing." It broke the CMake build: "Cannot find source file: WebAssemblyRegNumbering.cpp" llvm-svn: 252897	2015-11-12 14:37:56 +00:00
Vasileios Kalintiris	48e0256ed6	Re-apply "[mips] Use correct frame register for DWARF info when dynamically realigning the stack." r252219 reversed the direction of subprogram -> function edge. Fixed the IR to account for this. llvm-svn: 252895	2015-11-12 14:11:43 +00:00
James Molloy	8e99e97f2a	[ARM] CMOV->BFI combining: handle both senses of CMPZ I completely misunderstood what ARMISD::CMPZ means. It's not "compare equal to zero", it's "compare, only setting the zero/Z flag". It can either be equal-to-zero or not-equal-to-zero, and we weren't checking what sense it was. If it's equal-to-zero, we can swap the operands around and pretend like it is not-equal-to-zero, which is both a bug fix and lets us handle more cases. llvm-svn: 252891	2015-11-12 13:49:17 +00:00
Renato Golin	93064025bd	Revert "[ARM] Enable shrink-wrapping by default." This reverts commit r252825, as it broke ASAN on ARM. Investigating... llvm-svn: 252889	2015-11-12 13:34:50 +00:00
Daniel Sanders	9f6ad49740	Implement .reloc (constant offset only) with support for R_MIPS_NONE and R_MIPS_32. Summary: Support for R_MIPS_NONE allows us to parse MIPS16's usage of .reloc. R_MIPS_32 was included to be able to better test the directive. Targets can add their relocations by overriding MCAsmBackend::getFixupKind(). Subscribers: grosbach, rafael, majnemer, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D13659 llvm-svn: 252888	2015-11-12 13:33:00 +00:00
Zlatko Buljan	797c2aec6b	[mips][microMIPS] Implement LWM16, SB16, SH16, SW16, SWSP and SWM16 instructions Differential Revision: http://reviews.llvm.org/D11406 llvm-svn: 252885	2015-11-12 13:21:33 +00:00
Vasileios Kalintiris	d38860610d	Revert "[mips] Use correct frame register for DWARF info when dynamically realigning the stack." This reverts commit r252882. LLParser complains for invalid field 'function' in DISubprogram. llvm-svn: 252884	2015-11-12 13:19:11 +00:00
Vasileios Kalintiris	352eb55baf	[mips] Use correct frame register for DWARF info when dynamically realigning the stack. Summary: This patch overrides TargetFrameLowering::getFrameIndexReference() in order to specify the correct register when the function needs dynamic stack realignment. The values returned from this function are used in order to create DW_AT_locations for DWARF info. These locations would use the wrong registers as it's been reported in PR25028. Reviewers: dsanders Subscribers: dean, llvm-commits Differential Revision: http://reviews.llvm.org/D13511 llvm-svn: 252882	2015-11-12 13:04:16 +00:00
James Molloy	2d09c00b91	[InstCombine] Add trivial folding (bitreverse (bitreverse x)) -> x There are plenty more instcombines we could probably do with bitreverse, but this seems like a very obvious and trivial starting point and was brought up by Hal in his review. llvm-svn: 252879	2015-11-12 12:39:41 +00:00
James Molloy	90111f79f9	[SDAG] Introduce a new BITREVERSE node along with a corresponding LLVM intrinsic Several backends have instructions to reverse the order of bits in an integer. Conceptually matching such patterns is similar to @llvm.bswap, and it was mentioned in http://reviews.llvm.org/D14234 that it would be best if these patterns were matched in InstCombine instead of reimplemented in every different target. This patch introduces an intrinsic @llvm.bitreverse.i* that operates similarly to @llvm.bswap. For plumbing purposes there is also a new ISD node ISD::BITREVERSE, with simple expansion and promotion support. The intention is that InstCombine's BSWAP detection logic will be extended to support BITREVERSE too, and @llvm.bitreverse intrinsics emitted (if the backend supports lowering it efficiently). llvm-svn: 252878	2015-11-12 12:29:09 +00:00
James Molloy	7e9bdd5d01	Revert "Revert "[FunctionAttrs] Identify norecurse functions"" This reapplies this patch, with test fixes. llvm-svn: 252871	2015-11-12 10:55:20 +00:00
Kuba Brecka	de8332257b	[Object, MachO] Mark symbols from DATA and BSS sections as ST_Data In `MachOObjectFile::getSymbolType` we currently always return `SymbolRef::ST_Function` for symbols from any section. In order for llvm-symbolizer to correctly symbolize Mach-O globals, symbols from data and BSS sections should return `SymbolRef::ST_Data`. Differential Revision: http://reviews.llvm.org/D14576 llvm-svn: 252867	2015-11-12 09:40:29 +00:00
Amjad Aboud	e59cc3e540	dwarfdump: Added macro support to llvm-dwarfdump tool. Added "macro" option to "-debug-dump" flag, which triggers parsing and dumping of the ".debug_macinfo" section. Differential Revision: http://reviews.llvm.org/D14294 llvm-svn: 252866	2015-11-12 09:38:54 +00:00
James Molloy	9a32da74f7	Revert "[FunctionAttrs] Identify norecurse functions" This reverts commit r252862. This introduced test failures and I'm reverting while I investigate how this happened. llvm-svn: 252863	2015-11-12 09:05:43 +00:00
James Molloy	b14994e752	[FunctionAttrs] Identify norecurse functions A function can be marked as norecurse if: * The SCC to which it belongs has cardinality 1; and either a) It does not call any non-norecurse function. This includes self-recursion; or b) It only has one callsite and the function that callsite is within is marked norecurse. a) is best propagated bottom-up and b) is best propagated top-down. We build up the norecurse attributes bottom-up using the existing SCC pass, and mark functions with no obvious recursion (but not provably norecurse) to sweep later, top-down. llvm-svn: 252862	2015-11-12 08:53:04 +00:00
David Blaikie	6400fc146e	Mostly revert 252842 due to failures on some buildbots. I imagine there's some UB in here somewhere, though Valgrind doesn't seem to have picked it up (not sure if I have a working asan build right now to test there). GDB bot seems to be crashing: http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/26267/steps/check-all/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dwp.test Hexagon ELF bot is, presumably, just getting different output: http://lab.llvm.org:8011/builders/clang-hexagon-elf/builds/32927/steps/check-all/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dwp.test llvm-svn: 252859	2015-11-12 06:33:14 +00:00
Dan Gohman	9dd55a8065	[WebAssembly] Switch to MC for instruction printing. This encompasses several changes which are all interconnected: - Use the MC framework for printing almost all instructions. - AsmStrings are now live. - This introduces an indirection between LLVM vregs and WebAssembly registers, and a new pass, WebAssemblyRegNumbering, for computing a basic mapping. This addresses some basic issues with argument registers and unused registers. - The way ARGUMENT instructions are handled no longer generates redundant get_local+set_local for every argument. This also changes the assembly syntax somewhat; most notably, MC's printing does not use sigils on label names, so those are no longer present, and push/pop now have a sigil to keep them unambiguous. The usage of set_local/get_local/$push/$pop will continue to evolve significantly. This patch is just one step of a larger change. llvm-svn: 252858	2015-11-12 06:10:03 +00:00
David Blaikie	5b9bf49c6f	dwarfdump: Dump the contents of DWP indexes llvm-svn: 252842	2015-11-12 01:41:52 +00:00
Matthias Braun	b9610a6bc2	LegalizeDAG: Fix and improve FCOPYSIGN/FABS legalization - Factor out code to query and modify the sign bit of a floating-point value as an integer. This also works if none of the target's integer types is big enough to hold all bits of the floating-point value. - Legalize FABS(x) as FCOPYSIGN(x, 0.0) if FCOPYSIGN is available, otherwise perform bit manipulation on the sign bit. The previous code used "x >u 0 ? x : -x" which is incorrect for x being -0.0! It also takes 34 instructions on ARM Cortex-M4. With this patch we only require 5: vldr d0, LCPI0_0 vmov r2, r3, d0 lsrs r2, r3, #31 bfi r1, r2, #31, #1 bx lr (This could be further improved if the compiler would recognize that r2, r3 is zero). - Only lower FCOPYSIGN(x, y) = sign(x) ? -FABS(x) : FABS(x) if FABS is available otherwise perform bit manipulation on the sign bit. - Perform the sign(x) test by masking out the sign bit and comparing with 0 rather than shifting the sign bit to the highest position and testing for "<s 0". For x86 copysignl (on 80bit values) this gets us: testl $32768, %eax rather than: shlq $48, %rax sets %al testb %al, %al Differential Revision: http://reviews.llvm.org/D11172 llvm-svn: 252839	2015-11-12 01:02:47 +00:00
Manman Ren	3f2b9c18e2	[TLS on Darwin] use a different mask for tls calls on x86-64. Calls involved in thread-local variable lookup save more registers than normal calls. rdar://problem/23073171 llvm-svn: 252837	2015-11-12 00:54:04 +00:00
Quentin Colombet	10f9813528	[ARM] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14357 rdar://problem/21942589 llvm-svn: 252825	2015-11-11 23:31:46 +00:00
Reid Kleckner	b9204a584c	[WinEH] Don't forward branches across empty EH pad BBs For really simple SEH catchpads, we tried to forward the invoke unwind edge across the empty block. llvm-svn: 252822	2015-11-11 23:09:31 +00:00
David Majnemer	f0f224d12d	[IR] Add support for empty tokens When working with tokens, it is often the case that one has instructions which consume a token and produce a new token. Currently, we have no mechanism to represent an initial token state. Instead, we can create a notional "empty token" by inventing a new constant which captures the semantics we would like. This new constant is called ConstantTokenNone and is written textually as "token none". Differential Revision: http://reviews.llvm.org/D14581 llvm-svn: 252811	2015-11-11 21:57:16 +00:00
Sanjoy Das	cdafd8490a	Introduce deoptimization operand bundles Summary: This change introduces the notion of "deoptimization" operand bundles. LLVM can recognize and optimize these in more precise ways than it can generic "unknown" operand bundles. The current form of this special recognition / optimization is an enum entry in LLVMContext, a LangRef blurb and a verifier rule. Over time we will teach LLVM to do more aggressive optimization around deoptimization operand bundles, exploiting known facts about kinds of state deoptimization operand bundles are allowed to track. Reviewers: reames, majnemer, chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14551 llvm-svn: 252806	2015-11-11 21:38:02 +00:00
Hemant Kulkarni	bdce12a01b	[Symbolizer]: Add -pretty-print option Differential Revision: http://reviews.llvm.org/D13671 llvm-svn: 252798	2015-11-11 20:41:43 +00:00
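
Note on b9610a6bc2 (LegalizeDAG FCOPYSIGN/FABS): the message describes legalizing FABS and FCOPYSIGN by treating the floating-point value as an integer and manipulating its sign bit, and testing sign(x) by masking the sign bit and comparing with 0. The Python sketch below only illustrates that idea on IEEE-754 binary64 values; it is not the LegalizeDAG code, and the helper names (to_bits, from_bits, sign_is_set, fabs_bits, copysign_bits) are hypothetical.

```python
import struct

# Assumption for this sketch: values are IEEE-754 binary64, sign bit is bit 63.
SIGN_MASK = 1 << 63
MAGNITUDE_MASK = SIGN_MASK - 1


def to_bits(x: float) -> int:
    """Reinterpret a double as an unsigned 64-bit integer."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]


def from_bits(bits: int) -> float:
    """Reinterpret an unsigned 64-bit integer as a double."""
    return struct.unpack("<d", struct.pack("<Q", bits))[0]


def sign_is_set(x: float) -> bool:
    """Test the sign by masking out the sign bit and comparing with 0."""
    return (to_bits(x) & SIGN_MASK) != 0


def fabs_bits(x: float) -> float:
    """Absolute value by clearing the sign bit; no comparison on x itself."""
    return from_bits(to_bits(x) & MAGNITUDE_MASK)


def copysign_bits(x: float, y: float) -> float:
    """Magnitude of x combined with the sign bit of y."""
    return from_bits((to_bits(x) & MAGNITUDE_MASK) | (to_bits(y) & SIGN_MASK))
```

Because only the sign bit is touched, signed zeros and NaNs need no special casing here, which is the property a comparison-based select cannot offer for zeros.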

33214 Commits