llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Sanders	946dee3b5b	[mips] Range check vsplat_uimm[1234568]. Summary: Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18143 llvm-svn: 264053	2016-03-22 14:17:41 +00:00
Daniel Sanders	93fa4ce9b7	[mips] Range check uimm4_ptr, remove uimm6_ptr, and use correctly sized immediates in MSA copy/insert. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18142 llvm-svn: 264052	2016-03-22 13:58:53 +00:00
Zinovy Nis	07ac2bd4d0	[PATCH] Force LoopReroll to reset the loop trip count value after reroll. It's a bug fix. For rerolled loops SE trip count remains unchanged. It leads to incorrect work of the next passes. My patch just resets SE info for rerolled loop forcing SE to re-evaluate it next time it is requested. I also added a verifier call in the existing test to be sure no invalid SE data remain. Without my fix this test would fail with -verify-scev. Differential Revision: http://reviews.llvm.org/D18316 llvm-svn: 264051	2016-03-22 13:50:57 +00:00
Marina Yatsina	33ef7dad18	[ELF][gcc compatibility]: support section names with special characters (e.g. "/") Adding support for section names with special characters in them (e.g. "/"). GCC successfully compiles such section names. This also fixes PR24520. Differential Revision: http://reviews.llvm.org/D15678 llvm-svn: 264038	2016-03-22 11:23:15 +00:00
Mehdi Amini	844baa240a	Fix unittests: resize() -> reserve() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 264029	2016-03-22 07:35:51 +00:00
Mehdi Amini	c04fc7a60f	Rename DenseMap::resize() into DenseMap::reserve() (NFC) This is more coherent with usual containers. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 264026	2016-03-22 07:20:00 +00:00
Junmo Park	5ac1a47cad	Minor code cleanup. NFC. llvm-svn: 264024	2016-03-22 04:37:32 +00:00
Sanjoy Das	dd1d72ce92	Appease the windows buildbots The guess is that the stdout/stderr ordering may differ between windows / unix. llvm-svn: 264019	2016-03-22 02:11:57 +00:00
Sanjoy Das	38bfc22161	Add "first class" lowering for deopt operand bundles Summary: After this change, deopt operand bundles can be lowered directly by SelectionDAG into STATEPOINT instructions (which are then lowered to a call or sequence of nop, with an associated __llvm_stackmaps entry). This obviates the need to round-trip deoptimization state through gc.statepoint via RewriteStatepointsForGC. Reviewers: reames, atrick, majnemer, JosephTremoulet, pgavlin Subscribers: sanjoy, mcrosier, majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D18257 llvm-svn: 264015	2016-03-22 00:59:13 +00:00
Mike Aizatsky	602f79275d	[sancov] do not instrument nodes that are full pre-dominators Summary: Without tree pruning clang has 2,667,552 points. With only dominators pruning: 1,515,586. With both dominators & predominators pruning: 1,340,534. Resubmit of r262103. Differential Revision: http://reviews.llvm.org/D18341 llvm-svn: 264003	2016-03-21 23:08:16 +00:00
Justin Lebar	32835c82d5	[CUDA] Add documentation explaining how to detect clang vs nvcc. llvm-svn: 264002	2016-03-21 23:05:15 +00:00
Nicolai Haehnle	0a33abdfd2	AMDGPU: Fix dangling references introduced by r263982 Fixes Valgrind errors on the test cases that were reported as failing by buildbots. llvm-svn: 264000	2016-03-21 22:54:02 +00:00
Simon Pilgrim	b57b002253	[InstCombine] Ensure all undef operands are handled before binary instruction constant folding As noted in PR18355, this patch makes it clear that all cases with undef operands have been handled before further constant folding is attempted. Differential Revision: http://reviews.llvm.org/D18305 llvm-svn: 263994	2016-03-21 22:15:50 +00:00
Duncan P. N. Exon Smith	20be876a64	Fix -Wdocumentation warnings from r263853 Thanks to chapuni for catching this. llvm-svn: 263993	2016-03-21 22:13:44 +00:00
George Burgess IV	3887a41725	[MemorySSA] Consider def-only BBs for live-in calculations. If we have a BB with only MemoryDefs, live-in calculations will ignore it. This means we get results like this: define void @foo(i8* %p) { ; 1 = MemoryDef(liveOnEntry) store i8 0, i8* %p br i1 undef, label %if.then, label %if.end if.then: ; 2 = MemoryDef(1) store i8 1, i8* %p br label %if.end if.end: ; 3 = MemoryDef(1) store i8 2, i8* %p ret void } ...When there should be a MemoryPhi in the `if.end` BB. This patch fixes that behavior. llvm-svn: 263991	2016-03-21 21:25:39 +00:00
Krzysztof Parzyszek	67e6ae5e2a	Remove leftover options from multiline.ll I added -march=hexagon to force using Hexagon target when testing locally, and I forgot to take it out. llvm-svn: 263990	2016-03-21 21:25:01 +00:00
Rafael Espindola	7ff714c339	Add a testcase that would have found the bug in r263971. llvm-svn: 263988	2016-03-21 21:09:38 +00:00
Rafael Espindola	9219fe79b9	Revert "[llvm-objdump] Printing relocations in executable and shared object files. This partially reverts r215844 by removing test objdump-reloc-shared.test which stated GNU objdump doesn't print relocations, it does." This reverts commit r263971. It produces the wrong results for .rela.dyn. I will add a test. llvm-svn: 263987	2016-03-21 20:59:15 +00:00
Krzysztof Parzyszek	738c6277a6	Unxfail test/DebugInfo/Generic/multiline.ll on Hexagon llvm-svn: 263986	2016-03-21 20:55:59 +00:00
Nicolai Haehnle	a56e6b6a53	AMDGPU: Coding style fixes I meant to add these before committing r263982 as per the review, but I forgot to squash. llvm-svn: 263983	2016-03-21 20:39:24 +00:00
Nicolai Haehnle	213e87f2ee	AMDGPU: Add SIWholeQuadMode pass Summary: Whole quad mode is already enabled for pixel shaders that compute derivatives, but it must be suspended for instructions that cause a shader to have side effects (i.e. stores and atomics). This pass addresses the issue by storing the real (initial) live mask in a register, masking EXEC before instructions that require exact execution and (re-)enabling WQM where required. This pass is run before register coalescing so that we can use machine SSA for analysis. The changes in this patch expose a problem with the second machine scheduling pass: target independent instructions like COPY implicitly use EXEC when they operate on VGPRs, but this fact is not encoded in the MIR. This can lead to miscompilation because instructions are moved past changes to EXEC. This patch fixes the problem by adding use-implicit operands to target independent instructions. Some general codegen passes are relaxed to work with such implicit use operands. Reviewers: arsenm, tstellarAMD, mareko Subscribers: MatzeB, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18162 llvm-svn: 263982	2016-03-21 20:28:33 +00:00
Krzysztof Parzyszek	b14f4fd0de	[Hexagon] Add handling fixups and instruction relaxation llvm-svn: 263981	2016-03-21 20:27:17 +00:00
Krzysztof Parzyszek	c6f1e1a709	[Hexagon] Properly encode registers in duplex instructions llvm-svn: 263980	2016-03-21 20:13:33 +00:00
Krzysztof Parzyszek	6514a887f4	[Hexagon] Fix reserving emergency spill slots for register scavenger - R10 and R11 are not reserved registers. - Check for reserved registers when finding unused caller-saved registers. llvm-svn: 263977	2016-03-21 19:57:08 +00:00
Dan Gohman	c8d7f14506	[WebAssembly] Implement the eqz instructions. llvm-svn: 263976	2016-03-21 19:54:41 +00:00
Chad Rosier	2e5c526bb1	[SLP] Remove unnecessary member variables by using container APIs. This changes the debug output, but still retains its usefulness. Differential Revision: http://reviews.llvm.org/D18324 llvm-svn: 263975	2016-03-21 19:47:44 +00:00
Colin LeMahieu	cdaf644c48	[llvm-objdump] Printing relocations in executable and shared object files. This partially reverts r215844 by removing test objdump-reloc-shared.test which stated GNU objdump doesn't print relocations, it does. In executable and shared object ELF files, relocations in the file contain the final virtual address rather than section offset so this is adjusted to display section offset. Differential revision: http://reviews.llvm.org/D15965 llvm-svn: 263971	2016-03-21 19:14:50 +00:00
Tom Stellard	92339e888f	AMDGPU/SI: Fix threshold calculation for branching when exec is zero Summary: When control flow is implemented using the exec mask, the compiler will insert branch instructions to skip over the masked section when exec is zero if the section contains more than a certain number of instructions. The previous code would only count instructions in successor blocks, and this patch modifies the code to start counting instructions in all blocks between the start and end of the branch. Reviewers: nhaehnle, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18282 llvm-svn: 263969	2016-03-21 18:56:58 +00:00
Chad Rosier	cf173ffb46	[AArch64] Add a helpful assert. NFC. llvm-svn: 263965	2016-03-21 18:04:10 +00:00
Matt Arsenault	cb38a6bd35	AMDGPU: Remove SignBitIsZero for mubuf scratch offsets These instructions do not have the same negative base address problem that DS instructions do on SI. llvm-svn: 263964	2016-03-21 18:02:18 +00:00
Peter Collingbourne	86b9fbe980	ARM: Better codegen for 64-bit compares. This introduces a custom lowering for ISD::SETCCE (introduced in r253572) that allows us to emit a short code sequence for 64-bit compares. Before: push {r7, lr} cmp r0, r2 mov.w r0, #0 mov.w r12, #0 it hs movhs r0, #1 cmp r1, r3 it ge movge.w r12, #1 it eq moveq r12, r0 cmp.w r12, #0 bne .LBB1_2 @ BB#1: @ %bb1 bl f pop {r7, pc} .LBB1_2: @ %bb2 bl g pop {r7, pc} After: push {r7, lr} subs r0, r0, r2 sbcs.w r0, r1, r3 bge .LBB1_2 @ BB#1: @ %bb1 bl f pop {r7, pc} .LBB1_2: @ %bb2 bl g pop {r7, pc} Saves around 80KB in Chromium's libchrome.so. Some notes on this patch: - I don't much like the ARMISD::BRCOND and ARMISD::CMOV combines I introduced (nothing else needs them). However, they are necessary in order to avoid poor codegen, and they seem similar to existing combines in other backends (e.g. X86 combines (brcond (cmp (setcc Compare))) to (brcond Compare)). - No support for Thumb-1. This is in principle possible, but we'd need to implement ARMISD::SUBE for Thumb-1. Differential Revision: http://reviews.llvm.org/D15256 llvm-svn: 263962	2016-03-21 18:00:02 +00:00
Renato Golin	2b6b7ffd6c	[ARM] Add Cortex-A32 support Adding Cortex-A32 as an available target in the ARM backend. Patch by Sam Parker. llvm-svn: 263956	2016-03-21 17:29:01 +00:00
Hemant Kulkarni	a11fbe1cb1	[llvm-readobj] Impl GNU style symbols printing Implements "readelf -sW and readelf -DsW" Differential Revision: http://reviews.llvm.org/D18224 llvm-svn: 263952	2016-03-21 17:18:23 +00:00
Lang Hames	a258b01b12	[Orc] Switch RPC Procedure to take a function type, rather than an arg list. No functional change, just a little more readable. llvm-svn: 263951	2016-03-21 16:56:25 +00:00
Matt Arsenault	c25a71106c	APFloat: Add frexp llvm-svn: 263950	2016-03-21 16:49:16 +00:00
Matt Arsenault	b96b57347a	AMDGPU: Add frexp_mant intrinsic llvm-svn: 263948	2016-03-21 16:11:05 +00:00
Matt Arsenault	155dda9134	Implement constant folding for bitreverse llvm-svn: 263945	2016-03-21 15:00:35 +00:00
Chad Rosier	4aeab5fbf2	[AArch64] Fix a -Wdocumentation warning. NFC. llvm-svn: 263942	2016-03-21 13:43:58 +00:00
Silviu Baranga	f875e4fd92	[IndVars] Fix PR26974: make sure replaceCongruentIVs doesn't break LCSSA Summary: replaceCongruentIVs can break LCSSA when trying to replace IV increments since it tries to replace all uses of a phi node with another phi node while both of the phi nodes are not necessarily in the processed loop. This will cause an assert in IndVars. To fix this, we add a check to make sure that the replacement maintains LCSSA. Reviewers: sanjoy Subscribers: mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D18266 llvm-svn: 263941	2016-03-21 12:44:29 +00:00
Silviu Baranga	46030585b3	[DAGCombine] Catch the case where extract_vector_elt can cause an any_ext while processing AND SDNodes Summary: extract_vector_elt can cause an implicit any_ext if the types don't match. When processing the following pattern: (and (extract_vector_elt (load ([non_ext\|any_ext\|zero_ext] V))), c) DAGCombine was ignoring the possible extend, and sometimes removing the AND even though it was required to maintain some of the bits in the result to 0, resulting in a miscompile. This change fixes the issue by limiting the transformation only to cases where the extract_vector_elt doesn't perform the implicit extend. Reviewers: t.p.northover, jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18247 llvm-svn: 263935	2016-03-21 11:43:46 +00:00
Elena Demikhovsky	39a0020f2d	Fixed -mcpu flag "core-avx" does not exist; I changed to "nehalem" llvm-svn: 263932	2016-03-21 11:06:20 +00:00
Simon Pilgrim	4af44f3c13	[X86][SSE] Add vector integer division by constant tests Expanded tests and split into sdiv/srem and udiv/urem cases for 128 and 256 bit vectors. llvm-svn: 263917	2016-03-20 21:46:58 +00:00
Jingyue Wu	1375560bdb	[NVPTX] Adds a new address space inference pass. Summary: The old address space inference pass (NVPTXFavorNonGenericAddrSpaces) is unable to convert the address space of a pointer induction variable. This patch adds a new pass called NVPTXInferAddressSpaces that overcomes that limitation using a fixed-point data-flow analysis (see the file header comments for details). The new pass is experimental and not enabled by default. Users can turn it on by setting the -nvptx-use-infer-addrspace flag of llc. Reviewers: jholewinski, tra, jlebar Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D17965 llvm-svn: 263916	2016-03-20 20:59:20 +00:00
Davide Italiano	289a43ed0a	[gold] Emit a diagnostic in case we fail to remove a file. llvm-svn: 263914	2016-03-20 20:12:33 +00:00
Simon Pilgrim	fcc4532afa	[X86][SSE] Tidyup setTargetShuffleZeroElements to match computeZeroableShuffleElements Based on feedback for D14261 llvm-svn: 263911	2016-03-20 17:43:07 +00:00
Simon Pilgrim	c44472a5bc	[X86][SSE] Detect zeroable shuffle elements from different value types Improve computeZeroableShuffleElements to be able to peek through bitcasts to extract zero/undef values from BUILD_VECTOR nodes of different element sizes to the shuffle mask. Differential Revision: http://reviews.llvm.org/D14261 llvm-svn: 263906	2016-03-20 15:45:42 +00:00
Igor Breger	3ea8af5108	AVX512BW: Enable v32i1/v64i1 BUILD_VECTOR Differential Revision: http://reviews.llvm.org/D18211 llvm-svn: 263898	2016-03-20 13:09:43 +00:00
George Rimar	25a63b1bcc	[ELF] Update x86_64 relocations to 0.99.8 ABI Added: R_X86_64_GOTPCRELX, R_X86_64_REX_GOTPCRELX llvm-svn: 263894	2016-03-20 09:45:08 +00:00
Craig Topper	ea87eae4ca	Suppress a -Wunused-variable warning in release builds. llvm-svn: 263892	2016-03-20 01:17:54 +00:00
Michael Kuperstein	048cc3b7a8	Use a range-based for loop. NFC. llvm-svn: 263889	2016-03-20 00:16:13 +00:00
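
Note on commit 86b9fbe980 (ARM: Better codegen for 64-bit compares): the "After" sequence in that message does a signed 64-bit `>=` test with `subs` on the low words, `sbcs` on the high words, and a `bge` on the resulting flags. The Python sketch below is only an illustration of why that works, emulating the N and V flags on 32-bit halves. The function name `sge64_via_subs_sbcs` and the helper are hypothetical and are not taken from the patch.

```python
MASK32 = 0xFFFFFFFF


def _to_signed32(x):
    """Interpret a 32-bit pattern as a signed two's-complement value."""
    return x - (1 << 32) if x & 0x80000000 else x


def sge64_via_subs_sbcs(a, b):
    """Return True if a >= b for signed 64-bit a and b.

    Emulates the flag behaviour of:
        subs r0, r0, r2      ; low words, sets carry C (C = 1 means no borrow)
        sbcs r0, r1, r3      ; high words minus borrow, sets N and V
        bge  ...             ; taken when N == V
    """
    a_lo, a_hi = a & MASK32, (a >> 32) & MASK32
    b_lo, b_hi = b & MASK32, (b >> 32) & MASK32

    # subs: on ARM the carry flag is set when the unsigned subtract does not borrow.
    carry = 1 if a_lo >= b_lo else 0
    borrow = 1 - carry

    # sbcs: signed subtract of the high words, minus the borrow from the low words.
    exact = _to_signed32(a_hi) - _to_signed32(b_hi) - borrow
    result = exact & MASK32
    n_flag = result >> 31                                   # sign bit of the 32-bit result
    v_flag = 0 if -(1 << 31) <= exact < (1 << 31) else 1    # signed overflow

    # bge is taken when N == V, i.e. the exact (unwrapped) result is non-negative.
    return n_flag == v_flag
```

The low-word result is discarded; only its carry feeds the high-word subtract, which is why two instructions suffice before the branch.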

128879 Commits