llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	b21f9592be	AMDGPU: Move a flawed assert when spilling SGPRs It's possible to validly spill the frame offset register in a call sequence to a VGPR. There are definitely issues with SGPR spilling to memory, so move the assert later. llvm-svn: 330612	2018-04-23 16:13:30 +00:00
Simon Pilgrim	8cd01aaa0f	[X86] Replace x87 instregex with instrs if they only match one instruction llvm-svn: 330611	2018-04-23 16:10:50 +00:00
Adrian Prantl	bbe980dfe1	Fix computeSymbolSizes SEGFAULT on invalid file We use llvm-symbolizer in some production systems, and we run it against all possibly related files, including some that are not ELF. We noticed that for some of those invalid files, llvm-symbolizer would crash with SEGFAULT. Here is an example of such a file. This happens because a loop in computeSymbolSizes uses the condition for (unsigned I = 0, N = Addresses.size() - 1; I < N; ++I) { where, if Addresses.size() is 0, N wraps around to a very large unsigned value, causing the loop to access invalid memory. Instead of patching the loop condition, the commit makes the function return early if Addresses is empty. Validated by checking that llvm-symbolizer no longer crashes. Patch by Teng Qin! Differential Revision: https://reviews.llvm.org/D44285 llvm-svn: 330610	2018-04-23 16:08:01 +00:00
Matt Arsenault	adc59d7076	AMDGPU: Assign enum name to stack ID Also assert that it is correct for SGPRs. There is currently a bug where stack slot coloring replaces SGPR spill FIs with one with the default ID, which results in a more confusing assert later about a dead object. llvm-svn: 330607	2018-04-23 15:51:26 +00:00
Matt Arsenault	488476c6eb	StackSlotColoring: Fix missing skipFunction check llvm-svn: 330606	2018-04-23 15:51:21 +00:00
Daniel Neilson	9863b48d4e	[SelectionDAG] Refactor lowering of atomic memory intrinsics. Summary: This just refactors the lowering of the atomic memory intrinsics to more closely match the code patterns used in the lowering of the non-atomic memory intrinsics. Specifically, we encapsulate the lowering in SelectionDAG::getAtomicMem*() functions rather than embedding the code directly in the SelectionDAGBuilder code. llvm-svn: 330603	2018-04-23 15:40:37 +00:00
Robert Widmann	6978db7800	[LLVM-C] DIBuilderBindings for Subrange and Arrays Summary: Move Go bindings for subranges and DINode arrays. Reviewers: harlanhaskins, whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45933 llvm-svn: 330594	2018-04-23 14:29:33 +00:00
Nico Weber	514837cc6e	Sort a target list a bit better. llvm-svn: 330593	2018-04-23 14:28:49 +00:00
Alexey Bataev	6b2a5a6dd0	[DEBUGINFO, NVPTX] Add the test for the debug info of the local variables, NFC. llvm-svn: 330592	2018-04-23 14:00:53 +00:00
Robert Widmann	b02fe644d4	[LLVM-C] Finish Up Scope Bindings Summary: Adds bindings for Module and NameSpace scopes and LLVMDIBuilderCreateForwardDecl, a counterpart to LLVMDIBuilderCreateReplaceableCompositeType. Reviewers: harlanhaskins, whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45934 llvm-svn: 330591	2018-04-23 13:51:43 +00:00
Marianne Mailhot-Sarrasin	05cc8f66e2	[doc] Removed obsolete -count-aa from AliasAnalysis documentation Summary: This patch removes references to the AliasAnalysisCounter pass from the AliasAnalysis documentation. That pass was eliminated in 2015, at revision trunk@247167. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D45876 llvm-svn: 330590	2018-04-23 13:45:28 +00:00
Simon Pilgrim	455d0b2cfe	[X86] Remove instregex matching from CLAC/STAC. Note - noticed this as the STAC case as it was unintentionally matching against STACK pseudo instructions. llvm-svn: 330588	2018-04-23 13:24:17 +00:00
Nico Weber	77c5471d9f	List cpp file only once (was added in 147117 and 147117 as build fix each). llvm-svn: 330587	2018-04-23 13:11:51 +00:00
Nicolai Haehnle	cbebba4917	AMDGPU: Fix SDWA peephole for V_AND_B32 Summary: Found by inspection. We care about the operand that doesn't contain the immediate. I believe this is currently not hit because we fold 0xff / 0xffff immediates only later. Change-Id: Ic3cf8538bc7da5eff3200d96eccf9d339e6345a7 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45886 llvm-svn: 330586	2018-04-23 13:06:03 +00:00
Nicolai Haehnle	5a995664f0	AMDGPU: Fix a corner case crash in SIOptimizeExecMasking Summary: See the new test case; this is really unlikely to happen with real code, but I ran into this while attempting to bugpoint-reduce a different issue. Change-Id: I9ade1dc1aa8fd9c4d9fc83661d7b80e310b5c4a6 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45885 llvm-svn: 330585	2018-04-23 13:05:50 +00:00
Nico Weber	5d53aed419	Consistently sort add_subdirectory calls in lib/Target/*/CMakeLists.txt llvm-svn: 330584	2018-04-23 12:49:34 +00:00
Sander de Smalen	7893f722b2	[AArch64][SVE] Asm: Support for contiguous, non-faulting LDNF1 (scalar+imm) load instructions Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: rengolin Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45684 llvm-svn: 330583	2018-04-23 12:43:19 +00:00
Max Kazantsev	91f481665e	[LoopRotate] Fix incorrect SCEV invalidation in loop rotation LoopRotate only invalidates innermost loops, while the changes that it makes may also affect any of their parents. With patch rL329047, SCEV becomes much smarter about calculation of exit counts for outer loops, so we cannot assume that they are not affected. Differential Revision: https://reviews.llvm.org/D45945 llvm-svn: 330582	2018-04-23 12:33:31 +00:00
Simon Pilgrim	0a334a8668	[X86] Remove unnecessary MMX reg-mem InstRW scheduler overrides. llvm-svn: 330581	2018-04-23 11:57:15 +00:00
Max Kazantsev	acda4c0f18	[LoopUnroll] Fix potentially incorrect SCEV invalidation in UnrollRuntime Current runtime unrolling invalidates parent loop saying that it might have changed after the inner loop has changed, but it doesn't bother to do the same to its parents. With patch rL329047, SCEV becomes much smarter about calculation of exit counts for outer loops. We might need to invalidate not only the immediate parent, but also any of its parents as well. There is no clear evidence that there is some miscompile happening because of this (at least I don't have such test), but the common sense says that the current code is wrong. Differential Revision: https://reviews.llvm.org/D45940 Reviewed By: chandlerc llvm-svn: 330577	2018-04-23 10:39:38 +00:00
Max Kazantsev	b1137c42fa	[LoopSimplify] Fix incorrect SCEV invalidation In the function `simplifyOneLoop` we optimistically assume that changes in the inner loop only affect this very loop and have no impact on its parents. In fact, after rL329047 has been merged, we can now calculate exit counts for outer loops which may depend on inner loops. Thus, we need to invalidate all parents when we do something to a loop. There is evidence of incorrect behavior of `simplifyOneLoop`: when we insert an `SE->verify()` check at the end of this function, it fails on a number of existing tests, in particular: LLVM :: Transforms/LoopUnroll/peel-loop-not-forced.ll LLVM :: Transforms/LoopUnroll/peel-loop-pgo.ll LLVM :: Transforms/LoopUnroll/peel-loop.ll LLVM :: Transforms/LoopUnroll/peel-loop2.ll Note that previously we have fixed issues of this variety, see rL328483. This patch makes this function invalidate the outermost loop properly. Differential Revision: https://reviews.llvm.org/D45937 Reviewed By: chandlerc llvm-svn: 330576	2018-04-23 10:32:37 +00:00
Simon Tatham	047c1ab161	Fix BNF nits in TableGen language reference. Summary: In the course of writing an experimental ANTLR grammar based on this document, I found three errors in the documented BNF: SimpleValues of dag type are allowed to have no operands at all after the initial DagArg specifying the operator. For example, the value (outs) is extremely common in backends; an example in the test suite is test/TableGen/AsmVariant.td line 30. But the BNF doesn't allow DagArgList to expand to the empty string (it must contain at least one DagArg), and therefore the DagArgList specifying the operands in the dag-shaped production for SimpleValue should be optional. In the production for BodyItem with a 'let' and an optional RangeList, the RangeList should have braces around it if it's present, matching code such as "let E{7-0} = ..." on test/TableGen/BitsInit.td line 42. Those braces aren't included in the RangeList nonterminal itself, so instead they need to be part of the optional segment of the BodyItem production. Finally, the identifier after 'defm' should be optional. Again, this is very common in the real back end .td files; an example in the test suite is in test/TableGen/defmclass.td line 49. Reviewers: rengolin, nhaehnle, stoklund Reviewed By: nhaehnle Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45818 llvm-svn: 330570	2018-04-23 09:15:47 +00:00
Simon Tatham	e489e26d0e	Test commit access. Should be a harmless trimming of trailing whitespace from a documentation file. (There are other instances of trailing whitespace in this file alone. I've only fixed one of them, on the basis that that way the rest are still available for other people's commit-access tests :-) llvm-svn: 330567	2018-04-23 08:41:53 +00:00
Sander de Smalen	1b6d374422	[AArch64][SVE] Asm: Support for structured ST2, ST3 and ST4 (scalar+imm) store instructions. Reviewers: fhahn, rengolin, javed.absar, SjoerdMeijer, t.p.northover, echristo, evandro, huntergr Reviewed By: rengolin Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45681 llvm-svn: 330565	2018-04-23 07:50:35 +00:00
Chandler Carruth	bf7190a154	[PM/LoopUnswitch] Remove a buggy assert in the new loop unswitch. The condition this was asserting doesn't actually hold. I've added comments to explain why, removed the assert, and added a fun test case reduced out of 403.gcc. llvm-svn: 330564	2018-04-23 06:58:36 +00:00
Craig Topper	3f1d538165	[X86] Add VEX_WIG to VEX encoded version of VCMPPSY/VCMPPDY. llvm-svn: 330563	2018-04-23 04:50:01 +00:00
Chandler Carruth	b525424118	[PM/LoopUnswitch] Fix comment typo. NFC. llvm-svn: 330560	2018-04-23 00:48:42 +00:00
Simon Pilgrim	326594bc92	[X86][Znver1] Remove unnecessary BMI1 ANDN InstRW overrides. llvm-svn: 330558	2018-04-22 21:37:08 +00:00
Simon Pilgrim	87ba905fe9	[llvm-mca][X86] Add BMI/LZCNT/POPCNT resource tests to all relevant models The SandyBridge BMI tests are actually run on IvyBridge as that's the lowest CPU that actually supports the ISAs (but still uses the SandyBridge model). llvm-svn: 330556	2018-04-22 20:42:24 +00:00
Robert Widmann	12e367b6db	[LLVM-C] Add DIBuilder Bindings For Variable Creation Summary: Wrap LLVMDIBuilderCreateAutoVariable, LLVMDIBuilderCreateParameterVariable, LLVMDIBuilderCreateExpression, and move and correct LLVMDIBuilderInsertDeclareBefore and LLVMDIBuilderInsertDeclareAtEnd from the Go bindings to the C bindings. Reviewers: harlanhaskins, whitequark, deadalnix Reviewed By: harlanhaskins, whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45928 llvm-svn: 330555	2018-04-22 19:24:44 +00:00
Simon Pilgrim	06e16541ba	[X86] Remove unnecessary WriteFBlend/WriteBlend InstRW overrides. Fixed a lot of the default classes which were being completely overridden. llvm-svn: 330554	2018-04-22 18:35:53 +00:00
Simon Pilgrim	091680b6e7	[X86] Remove unnecessary WriteFMul/WriteFRcp/WriteFRsqrt InstRW overrides. llvm-svn: 330553	2018-04-22 18:09:50 +00:00
Simon Pilgrim	b362d02229	[X86] Remove unnecessary CVT instrw overrides. llvm-svn: 330552	2018-04-22 17:54:58 +00:00
Andres Freund	d7489a44de	Test commit access. Remove trailing whitespace. llvm-svn: 330551	2018-04-22 17:53:34 +00:00
Sanjay Patel	30be665e82	[PatternMatch] allow undef elements when matching a vector zero This is the last step in getting constant pattern matchers to allow undef elements in constant vectors. I'm adding a dedicated m_ZeroInt() function and building m_Zero() from that. In most cases, calling code can be updated to use m_ZeroInt() directly when there's no need to match pointers, but I'm leaving that efficiency optimization as a follow-up step because it's not always clear when that's ok. There are just enough icmp folds in InstSimplify that can be used for integer or pointer types, that we probably still want a generic m_Zero() for those cases. Otherwise, we could eliminate it (and possibly add a m_NullPtr() as an alias for isa<ConstantPointerNull>()). We're conservatively returning a full zero vector (zeroinitializer) in InstSimplify/InstCombine on some of these folds (see diffs in InstSimplify), but I'm not sure if that's actually necessary in all cases. We may be able to propagate an undef lane instead. One test where this happens is marked with 'TODO'. llvm-svn: 330550	2018-04-22 17:07:44 +00:00
Simon Pilgrim	c7f9b183c2	[X86][SkylakeServer] Remove unnecessary PMULLD instrw overrides. llvm-svn: 330549	2018-04-22 16:51:12 +00:00
Simon Pilgrim	3e8640a93a	[X86][Atom] Remove unnecessary scalar/vector load/move instrw overrides. llvm-svn: 330548	2018-04-22 16:49:35 +00:00
Sanjay Patel	c1265ab99e	[InstCombine] add vector test with undef elts; NFC llvm-svn: 330547	2018-04-22 15:59:14 +00:00
Simon Pilgrim	ef8d3ae4b5	[X86] Fix (completely overridden) WriteFHAdd/WritePHAdd classes to allow us to remove unnecessary instrw overrides. llvm-svn: 330546	2018-04-22 15:25:59 +00:00
Simon Pilgrim	2fd8269c6f	[X86][MMX][SSE] Tag missed PHADD/PHSUB instructions with WritePHAdd llvm-svn: 330545	2018-04-22 15:02:23 +00:00
Simon Pilgrim	96855ec39e	[X86] Remove unnecessary WriteFVarBlend/WriteVarBlend InstRW overrides. This also fixes some of the ReadAfterLd issues due to InstRW. llvm-svn: 330544	2018-04-22 14:43:12 +00:00
Sanjay Patel	e187cd3273	[InstSimplify, InstCombine] add vector tests with undef elts; NFC llvm-svn: 330543	2018-04-22 14:19:37 +00:00
Simon Pilgrim	a41ae2f005	[X86] Fix WriteMPSAD/WritePSADBW values to allow us to remove unnecessary instrw overrides. llvm-svn: 330542	2018-04-22 10:39:16 +00:00
Simon Pilgrim	523fd335b1	[X86][SandyBridge] Remove unnecessary WritePOPCNTLd overrides by fixing load latency. llvm-svn: 330541	2018-04-22 10:03:52 +00:00
Simon Pilgrim	5e9f1da0cd	[llvm-mca][X86] Add POPCNT resource test llvm-svn: 330540	2018-04-22 09:58:00 +00:00
Jonas Devlieghere	3eecf73b10	[test] Fix MC/ELF/nocompression.s Unbreak the linux build bots: http://lab.llvm.org:8011/builders/clang-lld-x86_64-2stage/builds/5165/ http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/28775 http://lab.llvm.org:8011/builders/clang-with-lto-ubuntu/builds/8227 llvm-svn: 330539	2018-04-22 08:46:27 +00:00
Jonas Devlieghere	7b5fa24bcd	[lli] Fix syntax error: missing ';' Fixes build issue on the windows bots: error C2143: syntax error: missing ';' llvm-svn: 330538	2018-04-22 08:35:00 +00:00
Jonas Devlieghere	4a2863ccbc	[lli] Make error handling more consistent. Makes error handling more consistent by using the helpers in support. llvm-svn: 330537	2018-04-22 08:02:11 +00:00
Jonas Devlieghere	c976aa7dc7	[llvm-mc] Make error handling more consistent. Makes error handling more consistent by using the helpers in support. llvm-svn: 330536	2018-04-22 08:01:35 +00:00
Jonas Devlieghere	578c049497	[Support] Fix prefix logic in WithColor. When a prefix is passed, we need to print a colon and a space after it, not just the prefix. llvm-svn: 330535	2018-04-22 08:01:01 +00:00
Craig Topper	9dcc50fcef	[X86] Remove an unnecessary HANDLE_OPTIONAL line from the disassembler operand processing. llvm-svn: 330534	2018-04-22 06:40:37 +00:00
Craig Topper	e958c7270e	[X86] Change TB to PS on LFENCE instruction. This matches the other FENCE instructions. llvm-svn: 330533	2018-04-22 03:15:02 +00:00
Craig Topper	2a28336f34	[X86] Remove OpSizeIgnore, it's not implemented any differently than OpSizeFixed. llvm-svn: 330532	2018-04-22 01:24:58 +00:00
Craig Topper	e33ed7d667	[X86] Remove DATA32_PREFIX. Hack the printing for DATA16_PREFIX to print 'data32' in 16-bit mode. Hack the asm parser to convert 'data32' to 'data16' in 16-bit mode. Improve the error messages to match GNU assembler. This also allows us to remove the hack from the disassembler table building. llvm-svn: 330531	2018-04-22 00:52:02 +00:00
Brian Gesiak	b13588982f	[bcanalyzer] Recognize more stream types Summary: `llvm-bcanalyzer` prints out the stream type of the file it is analyzing. If the file begins with the LLVM IR magic number, it reports a stream type of "LLVM IR". However, any other bitstream format is reported as "unknown". Add some checks for two other common bitstream formats: Clang AST files, which begin with 'CPCH', and Clang serialized diagnostics, which begin with 'DIAG'. Test Plan: `check-llvm` Reviewers: pcc, aprantl, mehdi_amini, davide, george.karpenkov, JDevlieghere Reviewed By: JDevlieghere Subscribers: JDevlieghere, bruno, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D41979 llvm-svn: 330529	2018-04-21 23:52:04 +00:00
Simon Pilgrim	37334ea67a	[X86] Strip unnecessary prefetch + vector move/load instrw overrides from scheduler models. llvm-svn: 330527	2018-04-21 21:59:36 +00:00
Jonas Devlieghere	0c1b29540c	[Support] Add optional prefix to convenience helpers in WithColor. Several tools prefix the error/warning/note output with the name of the tool. One such tool is LLD, for example. This commit adds an optional 'Prefix' argument to the convenience helpers. llvm-svn: 330526	2018-04-21 21:36:11 +00:00
Simon Pilgrim	920802cc50	[X86] Strip unnecessary WriteCvtF2I instrw overrides from scheduler models. llvm-svn: 330525	2018-04-21 21:16:44 +00:00
Jonas Devlieghere	2cd41eb058	[tools] Use WithColor for printing errors. Use convenience helpers in WithColor to print errors, warnings and notes in a few more tools. llvm-svn: 330524	2018-04-21 21:11:59 +00:00
Simon Pilgrim	825ead950e	[X86] Strip unnecessary broadcast/shuffle256 instrw overrides from scheduler models. llvm-svn: 330523	2018-04-21 20:45:12 +00:00
Simon Pilgrim	58ddaeabe2	[X86][AVX] VPERM2F128/VINSERTF128 should be a shuffle256 schedule like VPERM2I128/VINSERTI128 llvm-svn: 330522	2018-04-21 20:04:24 +00:00
Simon Pilgrim	74ccc6a303	[X86] Strip unnecessary vector integer math, shift-imm, extend, shuffle, pack/unpack instruction instrw overrides from scheduler models. llvm-svn: 330521	2018-04-21 19:11:55 +00:00
Craig Topper	fe59bea07b	[X86] Add DAG combine to turn (trunc (srl (mul ext, ext), 16)) into PMULHW/PMULHUW. Ultimately I want to use this to remove the intrinsics for these instructions. llvm-svn: 330520	2018-04-21 18:39:21 +00:00
Craig Topper	1b223e75da	[X86] Add test cases that show the current codegen for (trunc (srl (mul ext, ext), 16)). NFC A future patch will turn this into MULHU/MULHS. llvm-svn: 330519	2018-04-21 18:39:20 +00:00
Craig Topper	05242bf691	[X86] Add SchedWrites for LDMXCSR/STMXCSR. llvm-svn: 330517	2018-04-21 18:07:36 +00:00
Sanjay Patel	5f845732ed	[InstSimplify] move tests for shifts; NFC llvm-svn: 330516	2018-04-21 16:58:00 +00:00
Sanjay Patel	d0b27a1156	[InstSimplify] move/add/regenerate checks for tests; NFC llvm-svn: 330515	2018-04-21 16:23:47 +00:00
Simon Pilgrim	44278f6598	[X86][Haswell] Strip unnecessary WriteFAdd/WriteFHAdd instruction instrw overrides. llvm-svn: 330514	2018-04-21 16:20:28 +00:00
Simon Pilgrim	a80df0999f	[X86][Broadwell] Remove unnecessary VORPD/VORPS instrw override - missed in D45629 llvm-svn: 330513	2018-04-21 16:17:47 +00:00
Simon Pilgrim	e25aa02bc4	[llvm-mca][X86] Add AVX2 resource tests llvm-svn: 330512	2018-04-21 16:12:42 +00:00
Simon Pilgrim	93b102cd45	[X86] Strip unnecessary WriteFRcp/WriteFRsqrt instruction instrw overrides from scheduler models. This required the default skylake schedules to be updated - these were being completely overridden by the InstRW and the existing values not used at all. llvm-svn: 330510	2018-04-21 15:16:59 +00:00
Simon Pilgrim	2193524fb4	[X86] Strip unnecessary WriteFShuffle instruction instrw overrides from scheduler models. llvm-svn: 330508	2018-04-21 14:56:56 +00:00
Simon Pilgrim	d73bd154d9	[llvm-mca][X86] Add SSE resource tests to all models llvm-svn: 330506	2018-04-21 14:16:57 +00:00
Simon Pilgrim	f7f84a0ca3	[X86][SandyBridge] Strip unnecessary MOVQ/CVT instruction instrw overrides. llvm-svn: 330505	2018-04-21 14:03:40 +00:00
Simon Pilgrim	02fc375a22	[X86] Strip unnecessary MMX instruction instrw overrides from scheduler models. llvm-svn: 330503	2018-04-21 12:15:42 +00:00
Simon Pilgrim	26178d4336	[llvm-mca][X86] Add MMX resource tests llvm-svn: 330502	2018-04-21 11:28:59 +00:00
Simon Pilgrim	c0f654f18e	[X86] Strip unnecessary x87 instruction instrw overrides from scheduler models. llvm-svn: 330501	2018-04-21 11:25:02 +00:00
Simon Pilgrim	1264066cd7	[llvm-mca][X86] Add X87 resource tests llvm-svn: 330499	2018-04-21 10:36:19 +00:00
Simon Pilgrim	342cf58668	[X86][X87] Add missing fldlg2 schedule test llvm-svn: 330498	2018-04-21 10:35:04 +00:00
Hiroshi Inoue	33486787cb	[PowerPC] fix incorrect vectorization of abs() on POWER9 Vectorized loops with abs() return incorrect results on POWER9. This patch fixes it. For example, the following code returns a negative result if input values are negative, even though it sums up the absolute values of the inputs. int vpx_satd_c(const int16_t *coeff, int length) { int satd = 0; for (int i = 0; i < length; ++i) satd += abs(coeff[i]); return satd; } This problem causes test failures for libvpx. For vector absolute and vector absolute difference on POWER9, LLVM generates the VABSDUW (Vector Absolute Difference Unsigned Word) instruction or its variants. Since these instructions are for unsigned integers, we need an adjustment for signed integers. For abs(sub(a, b)), we generate VABSDUW(a+0x80000000, b+0x80000000). Otherwise, abs(sub(-1, 0)) returns 0xFFFFFFFF(=-1) instead of 1. For abs(a), we generate VABSDUW(a+0x80000000, 0x80000000). Differential Revision: https://reviews.llvm.org/D45522 llvm-svn: 330497	2018-04-21 09:32:17 +00:00
Eli Friedman	0644130612	[AArch64] Don't crash trying to resolve __stack_chk_guard. In certain cases, the compiler might try to merge __stack_chk_guard with another global variable. (Or someone could theoretically define __stack_chk_guard as an alias.) In that case, make sure we don't crash. Differential Revision: https://reviews.llvm.org/D45746 llvm-svn: 330495	2018-04-21 00:07:46 +00:00
Jessica Paquette	e5d279e6d6	Fix typo in test (verify-machine-instrs -> verify-machineinstrs) llvm-svn: 330494	2018-04-20 23:37:48 +00:00
Jessica Paquette	d442c3a632	[MachineOutliner] XFAIL machine-outliner-noredzone.ll The verifier began complaining about an undefined physical register in this test. XFAILing for the purposes of getting a bot up while I look into it. Failure: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-expensive/11385/ llvm-svn: 330493	2018-04-20 23:35:54 +00:00
Shoaib Meenai	106df7dd20	[ObjCARC] Take BlockColors by const reference. NFC llvm-svn: 330489	2018-04-20 22:14:45 +00:00
Shoaib Meenai	d64b83266b	[ObjCARC] Account for funclet token in storeStrong transform When creating a call to storeStrong in ObjCARCContract, ensure the call gets the correct funclet token, otherwise WinEHPrepare will turn the call (and all subsequent instructions) into unreachable. We already have logic to do this for the ARC autorelease elision marker; factor that out into a common function that's used for both. These are the only two places in this transform that create call instructions. Differential Revision: https://reviews.llvm.org/D45857 llvm-svn: 330487	2018-04-20 22:11:03 +00:00
Simon Pilgrim	1803bfb75f	[llvm-mca][X86] Add MMX/SSE/AES/CLMUL resource SandyBridge tests llvm-svn: 330486	2018-04-20 22:04:11 +00:00
Simon Pilgrim	d14d2e7b18	[X86] Add WriteFSign/WriteFLogic scheduler classes Split the fp and integer vector logical instruction scheduler classes - older CPUs especially often handled these on different pipes. This unearthed a couple of things that are also handled in this patch: (1) We were tagging avx512 fp logic ops as WriteFAdd, probably because of the lack of WriteFLogic (2) SandyBridge had integer logic ops only using Port5, when afaict they can use Ports015. (3) Cleaned up x86 FCHS/FABS scheduling as they are typically treated as fp logic ops. Differential Revision: https://reviews.llvm.org/D45629 llvm-svn: 330480	2018-04-20 21:16:05 +00:00
Alexander Shaposhnikov	52db4335b3	[llvm-objcopy] Fix sh_link This diff fixes sh_link for various types of sections (i.e. for SHT_ARM_EXIDX, SHT_HASH). In particular, this change enables us to use llvm-objcopy with clang -gsplit-dwarf for the target android-arm. Test plan: make check-all Differential revision: https://reviews.llvm.org/D45851 llvm-svn: 330478	2018-04-20 20:46:04 +00:00
Alex Shlyapnikov	99cf54baa6	[HWASan] Introduce non-zero based and dynamic shadow memory (LLVM). Summary: Support the dynamic shadow memory offset (the default case for user space now) and static non-zero shadow memory offset (-hwasan-mapping-offset option). Keeping the latter case around for functionality and performance comparison tests (and mostly for the -hwasan-mapping-offset=0 case). The implementation is a stripped-down version of the ASan one, picking only the relevant parts under the following assumptions: shadow scale is fixed, the shadow memory is dynamic, it is accessed via ifunc global, shadow memory address rematerialization is suppressed. Keep zero-based shadow memory for the kernel (-hwasan-kernel option) and the calls-instrumented case (-hwasan-instrument-with-calls option), which essentially means that the generated code is not changed in these cases. Reviewers: eugenis Subscribers: srhines, llvm-commits Differential Revision: https://reviews.llvm.org/D45840 llvm-svn: 330475	2018-04-20 20:04:04 +00:00
Sean Fertile	18f17333dd	[PartialInlining] Fix Crash from holding a reference to a destructed ORE. The callback used to create an ORE for the legacy PI pass caches the allocated object in a unique_ptr in the runOnModule function, and returns a reference to that object. Under certain circumstances we can end up holding onto that reference after the ORE's destruction. Rather than allowing the new and legacy passes to create the ORE object in different ways, create the ORE at the point of use. Differential Revision: https://reviews.llvm.org/D43219 llvm-svn: 330473	2018-04-20 19:56:26 +00:00
Krzysztof Parzyszek	5061b37e9c	[Hexagon] hexagon-autohvx was left on again llvm-svn: 330472	2018-04-20 19:45:49 +00:00
Krzysztof Parzyszek	41a24b7b13	[Hexagon] Improve HVX instruction selection (bitcast, vsplat) There was some unfortunate interaction between VSPLAT and BITCAST related to the selection of constant vectors (coming from selecting shuffles). Introduce VSPLATW that always splats a 32-bit word, and can have arbitrary result type (to avoid BITCASTs of VSPLAT). Clean up the previous selection of BITCAST/VSPLAT. llvm-svn: 330471	2018-04-20 19:38:37 +00:00
Eric Christopher	aadbabc070	Remove unused argument from emitModuleMetadata. NFCI. llvm-svn: 330470	2018-04-20 19:07:57 +00:00
Krzysztof Parzyszek	642120122c	[Hexagon] Skip fixed-stack indexes in HexagonConstExtenders Fixed slots have negative values, and TRI::stackSlot2Index and TRI::index2StackSlot do not handle negative numbers. llvm-svn: 330468	2018-04-20 19:06:46 +00:00
Craig Topper	173d59b62e	[X86][SandyBridge] Remove duplicate InstRWs from Sandy Bridge scheduler model. llvm-svn: 330465	2018-04-20 18:55:40 +00:00
Gabor Buella	31fa8025ba	[X86] WaitPKG instructions Three new instructions: umonitor - Sets up a linear address range to be monitored by hardware and activates the monitor. The address range should be a writeback memory caching type. umwait - A hint that allows the processor to stop instruction execution and enter an implementation-dependent optimized state until occurrence of a class of events. tpause - Directs the processor to enter an implementation-dependent optimized state until the TSC reaches the value in EDX:EAX. Also modifying the description of the mfence instruction, as the rep prefix (0xF3) was allowed before, which would conflict with umonitor during disassembly. Before: $ echo 0xf3,0x0f,0xae,0xf0 | llvm-mc -disassemble .text mfence After: $ echo 0xf3,0x0f,0xae,0xf0 | llvm-mc -disassemble .text umonitor %rax Reviewers: craig.topper, zvi Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D45253 llvm-svn: 330462	2018-04-20 18:42:47 +00:00
Jessica Paquette	2e5ada5c81	[MachineOutliner] Change B instruction for tail calls to TCRETURNdi First off, this is more correct than having the B. Second off, this was making a bot upset. This fixes that. Update the test to include -verify-machineinstrs as well to prevent stuff like this slipping by non debug/assert builds in the future. llvm-svn: 330459	2018-04-20 18:03:21 +00:00
Zachary Turner	194be871b9	[LLD/PDB] Emit first section contribution for DBI Module Descriptor. Part of the DBI stream is a list of variable length structures describing each module that contributes to the final executable. One member of this structure is a section contribution entry that describes the first section contribution in the output file for the given module. We have been leaving this structure unpopulated until now, so with this patch it is now filled out correctly. Differential Revision: https://reviews.llvm.org/D45832 llvm-svn: 330457	2018-04-20 18:00:46 +00:00
Nico Weber	3a1b697d6e	Remove llvm-build's --configure-target-def-file. It was added 6.5 years ago in r144345, but was never hooked up and has been unused since. If _you_ do use this, feel free to revert, but add a comment on where it's used. https://reviews.llvm.org/D45262 llvm-svn: 330455	2018-04-20 17:21:10 +00:00
Sanjay Patel	21d9c70b91	[utils] improve AArch64 asm parser If we don't mark the cfi line as optional, the script won't work with 'nounwind' code. Without that attr, there may be extra noise in the asm body that we don't want to see. llvm-svn: 330453	2018-04-20 17:16:23 +00:00
Nicholas Wilson	ef90ff36da	[WebAssembly] Distinguish debug/symbol names in the Wasm structs. NFC Differential Revision: https://reviews.llvm.org/D45021 llvm-svn: 330448	2018-04-20 17:07:24 +00:00
Michael Zolotukhin	e268304122	Revert r330431. There are still stage3/stage4 miscompares :( llvm-svn: 330446	2018-04-20 16:57:10 +00:00
Sanjay Patel	f04ab64b25	[x86] auto-generate checks; NFC There's a proposal to change/add to this file in D45653, so we should know exactly what those differences would be. llvm-svn: 330445	2018-04-20 16:46:58 +00:00
Florian Hahn	773872fd67	[NewGVN] Split OpPHI detection and creation. It also adds a check making sure PHIs for operands are all in the same block. Patch by Daniel Berlin <dberlin@dberlin.org> Reviewers: dberlin, davide Differential Revision: https://reviews.llvm.org/D43865 llvm-svn: 330444	2018-04-20 16:37:13 +00:00
Andrew Ng	7a2fa74ab0	[DebugInfo] Use WithColor for more debug line warnings Updated two more debug line related warnings to use WithColor. This was necessary to ensure consistent output order of the warnings on Windows for debug line tests. Differential Revision: https://reviews.llvm.org/D45871 llvm-svn: 330440	2018-04-20 15:29:47 +00:00
Simon Pilgrim	ab9798765c	[CostModel][X86] Add vector element insert/extract cost tests llvm-svn: 330439	2018-04-20 15:26:59 +00:00
Douglas Yung	51db3abac8	Fix test by allowing it to accept an upper or lower case letter as the first character. Windows for some reason uses a lower case letter, while linux uses upper case. llvm-svn: 330438	2018-04-20 15:23:57 +00:00
Sanjay Patel	3d453ad711	[DAGCombine] (float)((int) f) --> ftrunc (PR36617) This was originally committed at rL328921 and reverted at rL329920 to investigate failures in Chrome. This time I've added to the ReleaseNotes to warn users of the potential of exposing UB and let me repeat that here for more exposure: Optimization of floating-point casts is improved. This may cause surprising results for code that is relying on undefined behavior. Code sanitizers can be used to detect affected patterns such as this: int main() { float x = 4294967296.0f; x = (float)((int)x); printf("junk in the ftrunc: %f\n", x); return 0; } $ clang -O1 ftrunc.c -fsanitize=undefined ; ./a.out ftrunc.c:5:15: runtime error: 4.29497e+09 is outside the range of representable values of type 'int' junk in the ftrunc: 0.000000 Original commit message: fptosi / fptoui round towards zero, and that's the same behavior as ISD::FTRUNC, so replace a pair of casts with the equivalent node. We don't have to account for special cases (NaN, INF) because out-of-range casts are undefined. Differential Revision: https://reviews.llvm.org/D44909 llvm-svn: 330437	2018-04-20 15:07:55 +00:00
Simon Pilgrim	863ffeb750	[CostModel][X86] Add srem/urem constant cost tests llvm-svn: 330436	2018-04-20 15:01:03 +00:00
Simon Pilgrim	8a15d72550	[CostModel][X86] Add SLM/GLM/BtVer2 compare + division/remainder cost tests llvm-svn: 330435	2018-04-20 14:50:34 +00:00
Michael Zolotukhin	f79d15e432	Fix typo in a test. llvm-svn: 330434	2018-04-20 13:51:36 +00:00
Simon Pilgrim	cd9ccf8824	[CostModel][X86] Split off BtVer2 cost checks llvm-svn: 330433	2018-04-20 13:50:33 +00:00
Simon Pilgrim	25b7782975	[CostModel][X86] Add GoldmontPlus cost tests Just reuses goldmont costs atm llvm-svn: 330432	2018-04-20 13:42:53 +00:00
Michael Zolotukhin	a2c9af0209	Revert "Revert r330403 and r330413." Reapply the patches with a fix. Thanks Ilya and Hans for the reproducer! This reverts commit r330416. The issue was that removing predecessors invalidated uses that we stored for rewrite. The fix is to finish manipulating the CFG before we select uses for rewrite. llvm-svn: 330431	2018-04-20 13:34:32 +00:00
Simon Pilgrim	df8fa6d734	[X86][BtVer2] Cleanup some old FIXMEs from the model. NFCI. llvm-svn: 330428	2018-04-20 13:12:04 +00:00
Simon Pilgrim	2f522ef13d	[X86] Tag CLDEMOTE instruction with WriteLoad scheduling class Same as other cacheline instructions llvm-svn: 330424	2018-04-20 12:54:53 +00:00
Sander de Smalen	30f9f11d51	[AArch64][SVE] Asm: Support for contiguous LD1 (scalar+scalar) load instructions. This is patch [4/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D45690 llvm-svn: 330423	2018-04-20 12:52:01 +00:00
Jonas Devlieghere	5c709eda07	[ObjectYAML] Add ability for DWARFYAML to calculate DIE lengths This patch adds the ability for the ObjectYAML DWARFEmitter to calculate the lengths of DIEs. This is accomplished by creating a DIEFixupVisitor class which traverses the DWARF DIEs to calculate and fix up the lengths in the Compile Unit header. The DIEFixupVisitor can be extended in the future to enable more complex fix ups which will enable simplified YAML string representations. This is also very useful when using the YAML format in unit tests because you no longer need to know the length of the compile unit when writing the YAML string. Differential commandeered from Chris Bieneman (beanz) Differential revision: https://reviews.llvm.org/D30666 llvm-svn: 330421	2018-04-20 12:33:49 +00:00
Greg Bedwell	d22b35b48c	[UpdateTestChecks] Fix update_mca_test_checks.py slowness issue The script was using Python's difflib module to calculate the number of lines changed so that it could report it in its status output. It turns out this can be very very slow on large sets of lines (Python bug 6931). It's not worth the cost, so just remove the usage of difflib entirely. llvm-svn: 330419	2018-04-20 11:38:11 +00:00
Florian Hahn	3085cdc99e	Require asserts for stats-file-option tests. llvm-svn: 330417	2018-04-20 11:21:13 +00:00
Ilya Biryukov	afe822bd6d	Revert r330403 and r330413. Revert r330413: "[SSAUpdaterBulk] Use SmallVector instead of DenseMap for storing rewrites." Revert r330403 "Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading." one more time." r330403 commit seems to crash clang during our integrate while doing PGO build with the following stacktrace: #2 llvm::SSAUpdaterBulk::RewriteAllUses(llvm::DominatorTree, llvm::SmallVectorImpl<llvm::PHINode>) #3 llvm::JumpThreadingPass::ThreadEdge(llvm::BasicBlock, llvm::SmallVectorImpl<llvm::BasicBlock> const&, llvm::BasicBlock) #4 llvm::JumpThreadingPass::ProcessThreadableEdges(llvm::Value, llvm::BasicBlock, llvm::jumpthreading::ConstantPreference, llvm::Instruction) #5 llvm::JumpThreadingPass::ProcessBlock(llvm::BasicBlock) The crash happens while compiling 'lib/Analysis/CallGraph.cpp'. r330413 is reverted due to conflicting changes. llvm-svn: 330416	2018-04-20 10:52:54 +00:00
Roman Lebedev	f6934d725b	[NFC][InstCombine] Regenerate two tests that are affected by folding masked merge llvm-svn: 330415	2018-04-20 10:49:19 +00:00
Andrew Ng	a6763bfd6d	[DebugInfo] Fix for split dwarf test on Windows (NFC) On Windows, %llc_dwarf automatically adds -mtriple causing this test to error. Changed %llc_dwarf to llc. Differential Revision: https://reviews.llvm.org/D45869 llvm-svn: 330414	2018-04-20 10:44:42 +00:00
Michael Zolotukhin	9dea079315	[SSAUpdaterBulk] Use SmallVector instead of DenseMap for storing rewrites. llvm-svn: 330413	2018-04-20 10:31:06 +00:00
Ilya Biryukov	2bf7c51d0e	[Dockerfiles] Split checkout and build scripts into separate files. Summary: This is a small refactoring to extract the svn checkout code from the build script used inside the docker image. This would give more flexibility if more than a single invocation of cmake is needed inside the docker image. User-facing interface (build_docker_image.sh) hasn't changed, only the internal scripts running inside the build container are affected. Reviewers: ioeric Reviewed By: ioeric Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D45868 llvm-svn: 330412	2018-04-20 10:19:38 +00:00
Florian Hahn	d4332eb3b7	[LTO] Add stats-file option to LTO/Config.h. This patch adds a StatsFile option to LTO/Config.h and updates both LLVMGold and llvm-lto2 to set it. Reviewers: MatzeB, tejohnson, espindola Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D45531 llvm-svn: 330411	2018-04-20 10:18:36 +00:00
Andrea Di Biagio	4d79c580ce	CODE_OWNERS: Take code ownership of llvm-mca. llvm-svn: 330410	2018-04-20 10:16:31 +00:00
Dan Liew	872b8ea596	[lit] Fix a bug where UNRESOLVED tests were not handled in the XUnit XML printer. A test has been added that tries to comprehensively test emitting XUnit XML output for shell tests. Differential Revision: https://reviews.llvm.org/D45567 llvm-svn: 330409	2018-04-20 10:11:41 +00:00
Sander de Smalen	137efb231e	[AArch64][SVE] Fix diagnostic for SVE LD4 instructions: Diagnostic: 'index must be multiple of 3 in range [-32, 28]' Must be: 'index must be multiple of 4 in range [-32, 28]' llvm-svn: 330407	2018-04-20 09:45:50 +00:00
Sander de Smalen	367694b093	[AArch64][SVE] Added GPR64shifted and GPR64NoXZRshifted register classes. Summary: This is patch [3/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: SjoerdMeijer Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45689 llvm-svn: 330406	2018-04-20 08:54:49 +00:00
Michael Zolotukhin	79e4f7fadb	Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading." one more time. Hopefully, changing set to vector removes nondeterminism detected by some bots, or the new assert will catch something. This reverts commit r330180. llvm-svn: 330403	2018-04-20 08:01:08 +00:00
Michael Zolotukhin	26339b445a	[SSAUpdaterBulk] Add an assert. llvm-svn: 330402	2018-04-20 07:59:57 +00:00
Daniel Cederman	1c8fb18557	Add SPARC support to update_llc_test_checks.py Reviewers: spatel, jyknight Reviewed By: spatel Subscribers: fedor.sergeev, llvm-commits Differential Revision: https://reviews.llvm.org/D45809 llvm-svn: 330401	2018-04-20 07:59:13 +00:00
Michael Zolotukhin	0df1d48ca9	[SSAUpdaterBulk] Add * and & to auto. llvm-svn: 330400	2018-04-20 07:58:54 +00:00
Michael Zolotukhin	bc843211fd	[SSAUpdaterBulk] Use PredCache in ComputeLiveInBlocks. llvm-svn: 330399	2018-04-20 07:57:24 +00:00
Michael Zolotukhin	79cb54b2d9	[SSAUpdaterBulk] Use SmallVector instead of SmallPtrSet for uses. llvm-svn: 330398	2018-04-20 07:56:00 +00:00
Daniel Cederman	4557178061	Revert "This pass, fixing an erratum in some LEON 2 processors..." Summary: Reading Atmel's AT697E errata document this does not seem like a valid workaround. While the text only mentions SDIV, it says that the ICC flags can be wrong, and those are only generated by SDIVcc. Verification on hardware shows that simply replacing SDIV with SDIVcc does not avoid the bug with negative operands. This reverts r283727. Reviewers: lero_chris, jyknight Reviewed By: jyknight Subscribers: fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D45813 llvm-svn: 330397	2018-04-20 07:53:27 +00:00
Daniel Cederman	c67b3ffba7	[Sparc] Use synthetic instruction clr to zero register instead of sethi Using `clr reg`/`mov %g0, reg`/`or %g0, %g0, reg` to zero a register looks much better than `sethi 0, reg`. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: eraman, fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D45810 llvm-svn: 330396	2018-04-20 07:47:12 +00:00
Sander de Smalen	149916d29a	[AArch64][AsmParser] Extend RegOp with integrated 'shift/extend'. Summary: In some cases the shift/extend needs to be explicitly parsed together with the register, rather than as a separate operand. This is needed for addressing modes where the instruction as a whole dictates the scaling/extend, rather than specific bits in the instruction. By parsing them as a single operand, we avoid the need to pass an extra operand in all CodeGen patterns (because all operands need to have an associated value), and we avoid the need to update TableGen to accept operands that have no associated bits in the instruction. An added benefit of parsing them together is that the assembler can give a sensible diagnostic if the scaling is not correct. This is patch [2/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro Reviewed By: fhahn, SjoerdMeijer Subscribers: kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45688 llvm-svn: 330394	2018-04-20 07:24:20 +00:00
Nicolai Haehnle	7a87977fb2	AMDGPU: Legalize the operand of SI_INIT_M0 Summary: This fixes a case where the argument to a sendmsg intrinsic ends up in a VGPR, for whatever reason. The underlying performance issue is that a multiplication that can be an s_mul_i32 is instead needlessly generated as v_mul_u32_u24, but this is not addressed by this patch. Change-Id: I61fd4034314d5acdf6074632c30b65364dfa7328 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45826 llvm-svn: 330393	2018-04-20 07:14:25 +00:00
Daniel Cederman	793af3b9f0	[Sparc] Fix addressing mode when using 64-bit values in inline assembly Summary: If a 64-bit register is used as an operand in inline assembly together with a memory reference, the memory addressing will be wrong. The addressing will be a single reg, instead of reg+reg or reg+imm. This will generate a bad offset value or an exception in printMemOperand(). For example: ``` long long int val = 5; long long int mem; __asm__ volatile ("std %1, %0":"=m"(mem):"r"(val)); ``` becomes: ``` std %i0, [%i2+589833] ``` The problem is that SelectInlineAsmMemoryOperand() is never called for the memory references if one of the operands is a 64-bit register. By calling SelectInlineAsmMemoryOperands() in tryInlineAsm() the Sparc version of SelectInlineAsmMemoryOperand() gets called for each memory reference. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: eraman, fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D45761 llvm-svn: 330392	2018-04-20 06:57:49 +00:00
Vlad Tsyrklevich	5d15230c37	Fix build failures for r330387 on buildbots that don't build the X86 target llvm-svn: 330388	2018-04-20 02:26:12 +00:00
Vlad Tsyrklevich	230b256783	LowerTypeTests: Propagate symver directives Summary: This change fixes https://crbug.com/834474, a build failure caused by LowerTypeTests not preserving .symver symbol versioning directives for exported functions. Emit symver information to ThinLTO summary data and then propagate symver directives for exported functions to the merged module. Emitting symver information to the summaries increases the size of intermediate build artifacts for a Chromium build by less than 0.2%. Reviewers: pcc Reviewed By: pcc Subscribers: tejohnson, mehdi_amini, eraman, llvm-commits, eugenis, kcc Differential Revision: https://reviews.llvm.org/D45798 llvm-svn: 330387	2018-04-20 01:36:48 +00:00
Amara Emerson	6aacbf4d7c	Move a dump() implementation out of line. Fixes some link issues. llvm-svn: 330384	2018-04-20 00:42:46 +00:00
Jessica Paquette	1eca23bdd8	[MachineOutliner] NFC: Move EnableLinkOnceODROutlining into MachineOutliner.cpp This moves the EnableLinkOnceODROutlining flag from TargetPassConfig.cpp into MachineOutliner.cpp. It also removes OutlineFromLinkOnceODRs from the MachineOutliner constructor. This is now handled by the moved command-line flag. llvm-svn: 330373	2018-04-19 22:17:07 +00:00
Simon Pilgrim	0a6bfb1843	[llvm-mca][X86] Add prefetch instruction resource tests llvm-svn: 330371	2018-04-19 22:11:58 +00:00
Sam Clegg	f009da2448	[WebAssembly] Enabled -triple=wasm32-unknown-unknown-wasm path using ELF directive parser. This is a temporary solution until a proper WASM implementation of MCAsmParserExtension is in place, but at least for now will unblock this path. Added test to make sure this path works with the WASM Assembler. Patch By Wouter van Oortmerssen! Differential Revision: https://reviews.llvm.org/D45386 llvm-svn: 330370	2018-04-19 22:00:53 +00:00
Sanjay Patel	ad8976db16	[Reassociate] add baseline tests for binop swapping; NFC Similar to rL330086, I don't know if we want to do these transforms here, but we might as well have the tests here either way to show that this pass is missing potential functionality (intentionally or not). llvm-svn: 330368	2018-04-19 21:56:17 +00:00
Simon Pilgrim	7209117868	[llvm-mca][FMA] Add FMA resource tests llvm-svn: 330366	2018-04-19 21:32:22 +00:00
Stanislav Mekhanoshin	160f85794d	[AMDGPU] Use packed literals with zero either lower or hi part Differential Revision: https://reviews.llvm.org/D45790 llvm-svn: 330365	2018-04-19 21:16:50 +00:00
Gerolf Hoflehner	bf26d54047	[llvm-objdump] Issue error message when object file cannot be created llvm-svn: 330364	2018-04-19 20:48:35 +00:00
Craig Topper	6496d51284	[X86] Remove nonexistent instruction name from X86DisassemblerTables.cpp. This instruction was removed a long time ago, so we don't need to check for it here. llvm-svn: 330363	2018-04-19 20:44:15 +00:00
Jin Lin	585f2699cf	Refine the loop rotation's API Summary: The following changes address two issues. 1) The existing loop rotation pass contains both loop latch simplification and loop rotation. So one flag RotationOnly is added to be passed to the loop rotation pass. 2) The threshold value is initialized with MAX_UINT since the loop rotation utility should not have a threshold limit. Reviewers: dmgreen, efriedma Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D45582 llvm-svn: 330362	2018-04-19 20:29:43 +00:00
Lang Hames	ee68ec06a1	[ORC] Fix an assertion condition from r329934. Thanks to Alexander Ivchenko for finding the issue! llvm-svn: 330359	2018-04-19 19:30:35 +00:00
Craig Topper	bc895a3afc	[X86] Enable popcnt false dependency breaking on Silvermont and Goldmont. Silvermont and Goldmont have the same issue on popcnt as Sandy Bridge, Haswell, Broadwell, and Skylake. Believe it is fixed in Goldmont Plus. llvm-svn: 330358	2018-04-19 19:25:24 +00:00
Chandler Carruth	32e62f9c5b	[PM/LoopUnswitch] Detect irreducible control flow within loops and skip unswitching non-trivial edges. Summary: This fixes the bug pointed out in review with non-trivial unswitching. This also provides a basis that should make it pretty easy to finish fleshing out a routine to scan an entire function body for irreducible control flow, but this patch remains minimal for disabling loop unswitch. Reviewers: sanjoy, fedor.sergeev Subscribers: mcrosier, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D45754 llvm-svn: 330357	2018-04-19 18:44:25 +00:00
Lang Hames	9bbd653084	[ORC] Make VSO symbol resolution/finalization operations private. This forces these operations to be carried out via a MaterializationResponsibility instance, ensuring responsibility is explicitly tracked. llvm-svn: 330356	2018-04-19 18:42:49 +00:00
Simon Pilgrim	4a486c13fa	[llvm-mca][X86] Add resource test for every out-of-order scheduler model I've copied and regenerated a resource file from btver2 to every x86 scheduler model supported by llvm-mca so we have at least some basic coverage. For most this has been the avx1 tests, but for silvermont I've used sse42 as that's the latest it supports. More will be added later. llvm-svn: 330352	2018-04-19 18:08:10 +00:00
Simon Pilgrim	4ba057dbd1	[X86][SLM] Fix typo using SandyBridge resources. Luckily this was on instructions not supported on Silvermont.... llvm-svn: 330351	2018-04-19 18:01:52 +00:00
Craig Topper	b5f2659130	[X86] Correct the scheduling data for register forms of XCHG and XADD on Intel CPUs. The XCHG16rr/XCHG32rr/XCHG64rr instructions should be 3 uops just like XCHG8rr. I believe they're just implemented as 3 move uops with a temporary register. XADD is probably 2 moves and an add also using a temporary register. Change the latency for both from 2 cycles to 3 cycles. Only 2 of the uops are serialized in their execution, the move into the temporary and the move out of the temporary. The move from one GPR to the other should be able to go in parallel with this if there are ALU resources available. llvm-svn: 330349	2018-04-19 18:00:17 +00:00
Sanjay Patel	a201787fd7	[Reassociate] fix formatting; NFC llvm-svn: 330348	2018-04-19 17:56:36 +00:00
Simon Pilgrim	5e492d29a3	[X86] Merge some MMX instregex There's a lot more but I'd prefer focussing on removing unnecessary InstRWs first. llvm-svn: 330347	2018-04-19 17:32:10 +00:00
Krzysztof Parzyszek	fbee8574ab	[if-converter] Handle BBs that terminate in ret during diamond conversion This fixes https://llvm.org/PR36825. Original patch by Valentin Churavy (D45218). Differential Revision: https://reviews.llvm.org/D45731 llvm-svn: 330345	2018-04-19 17:26:46 +00:00
Krzysztof Parzyszek	2a9a83cd3f	[Hexagon] Use legal types when lowering CONCAT_VECTORS via BUILD_VECTOR llvm-svn: 330344	2018-04-19 17:11:58 +00:00
Francis Visoiu Mistrih	dca79d2867	[llvm-objdump] Remove test object file Forgot to remove it from the previous commit. llvm-svn: 330343	2018-04-19 17:05:03 +00:00
Francis Visoiu Mistrih	1834682b97	[llvm-objdump] Print "..." instead of random data for virtual sections When disassembling with -D, skip virtual sections by printing "..." for each symbol. This patch also implements `MachOObjectFile::isSectionVirtual`. Test case comes from: ``` .zerofill __DATA,__common,_data64unsigned,472,3 ``` Differential Revision: https://reviews.llvm.org/D45824 llvm-svn: 330342	2018-04-19 17:02:57 +00:00
Teresa Johnson	aa94393ec5	[gold/ThinLTO] Invoke llvm_shutdown when exiting after ThinLTO indexing Summary: Instead of manually invoking PrintStatistics, simply invoke llvm_shutdown which will take care of destroying managed statics, and as a side effect will destroy the StatisticInfo ManagedStatic, invoking PrintStatistics when needed. Reviewers: fhahn Subscribers: inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D45820 llvm-svn: 330341	2018-04-19 16:55:13 +00:00
Mark Searles	1bc6e71f32	[AMDGPU] Do not only rely on BB number when finding bottom loop We should also check that the "bottom" basic block of a loop is a successor of the "header" basic block, otherwise we don't propagate the information correctly when the CFG is complex. This fixes an important rendering problem with Wolfenstein 2, because one vector-memory wait was missing. Differential Revision: https://reviews.llvm.org/D43831 llvm-svn: 330337	2018-04-19 15:42:30 +00:00
Simon Pilgrim	f209321d61	[llvm-mca][X86] Add mmx instruction to btver2 resource tests Useful to see scheduler class deltas against xmm equivalents llvm-svn: 330335	2018-04-19 15:09:46 +00:00
Florian Hahn	b789165e6b	[NewGVN] Add ops as dependency if we cannot find a leader for ValueOp. If those operands change, we might find a leader for ValueOp, which could enable new phi-of-op creation. This fixes a case where we missed creating a phi-of-ops node. With D43865 and this patch, bootstrapping clang/llvm works with -enable-newgvn, whereas without it, the "value changed after iteration" assertion is triggered. Reviewers: dberlin, davide Reviewed By: dberlin Differential Revision: https://reviews.llvm.org/D42180 llvm-svn: 330334	2018-04-19 15:05:47 +00:00
Krzysztof Parzyszek	d92c37e090	[Hexagon] Generate code for vector bswap intrinsics llvm-svn: 330333	2018-04-19 14:46:44 +00:00
Simon Pilgrim	f21ace6cdd	[X86][BtVer2] Remove SSE4A EXTRQ/EXTRQI InstRW overrides. These are already handled identically by WriteALU. llvm-svn: 330332	2018-04-19 14:38:36 +00:00
Krzysztof Parzyszek	23bcf06a15	[Hexagon] Add/fix patterns for 32/64-bit vector compares and logical ops llvm-svn: 330330	2018-04-19 14:24:31 +00:00
Mikhail Maltsev	aeb6c48d29	[Unittests] Fix plugins test Summary: Currently the PluginsTests.LoadPlugin unit test is failing in LLVM configurations that have LLVM_EXPORT_SYMBOLS_FOR_PLUGINS enabled because the EnableABIBreakingChecks symbol is missing. This patch fixes the issue by linking some additional libraries to the test plugin if LLVM_EXPORT_SYMBOLS_FOR_PLUGINS is enabled. Reviewers: philip.pfaffe Reviewed By: philip.pfaffe Subscribers: mgorny, llvm-commits, rogfer01 Differential Revision: https://reviews.llvm.org/D45811 llvm-svn: 330329	2018-04-19 14:02:46 +00:00
Simon Dardis	5d61c8b225	[mips] Correct the definitions of the unaligned word memory operation instructions These instructions lacked the correct predicates, were not marked as loads and stores and lacked the proper instruction mapping information. In the case of microMIPS sw(l|r)e (EVA) these instructions were using the load EVA description. Reviewers: abeserminji, smaksimovic, atanasyan Differential Revision: https://reviews.llvm.org/D45626 llvm-svn: 330326	2018-04-19 13:33:51 +00:00
Roman Lebedev	d536de1e7b	[NFC][InstCombine] A few more tests for masked merge add/xor -> or with constant mask llvm-svn: 330325	2018-04-19 13:02:17 +00:00
Alexander Ivchenko	e8fed1546e	Lowering x86 adds/addus/subs/subus intrinsics (llvm part) This is the patch that lowers x86 intrinsics to native IR in order to enable optimizations. The patch also includes folding of previously missing saturation patterns so that IR emits the same machine instructions as the intrinsics. Patch by tkrupa Differential Revision: https://reviews.llvm.org/D44785 llvm-svn: 330322	2018-04-19 12:13:30 +00:00
Florian Hahn	9a175bc1bc	Remove file accidentally added in r330320. llvm-svn: 330321	2018-04-19 12:09:05 +00:00
Florian Hahn	2342533e1a	[IR/BasicBlockTest] Fix asan failure introduced in rL330316. The argument has to be deleted after the module containing the function gets deleted. llvm-svn: 330320	2018-04-19 12:06:26 +00:00
Simon Pilgrim	3c06617f0e	[X86][FMA] Remove FMA reg-reg InstRW scheduler overrides. These are all already handled identically by WriteFMA. llvm-svn: 330319	2018-04-19 11:37:26 +00:00
Simon Pilgrim	33dede9075	[X86][BtVer2] Remove 128-bit F16C InstRW overrides. These are already handled identically by WriteCvtF2F. llvm-svn: 330318	2018-04-19 11:16:33 +00:00
Simon Pilgrim	b04cd1b9f3	[llvm-exegesis] Fix PfmIssueCountersTable creation This patch ensures that the pfm issue counter tables are the correct size, accounting for the invalid resource entry at the beginning of the resource tables. It also fixes an issue with pfm failing to match event counters due to a trailing comma added to all the event names. I've also added a counter comment to each entry as it helps locate problems with the tables. Note: I don't have access to a SandyBridge test machine, which is the only model to make use of multiple event counters being mapped to a single resource. I don't know if pfm accepts a comma-separated list or not, but that is what it was doing. Differential Revision: https://reviews.llvm.org/D45787 llvm-svn: 330317	2018-04-19 10:59:49 +00:00
Florian Hahn	147fc016e3	[BasicBlock] Add instructionsWithoutDebug methods to skip debug insts. Reviewers: aprantl, vsk, mattd, chandlerc Reviewed By: aprantl, vsk Differential Revision: https://reviews.llvm.org/D45657 llvm-svn: 330316	2018-04-19 09:48:07 +00:00
Simon Dardis	fdc052686c	[mips] Guard some macro expansions properly Reviewers: atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D45565 llvm-svn: 330315	2018-04-19 09:45:04 +00:00
Sjoerd Meijer	a79ea80d7b	[ARM] Add some missing FP16 VSEL test cases Differential Revision: https://reviews.llvm.org/D45724 llvm-svn: 330313	2018-04-19 08:21:50 +00:00
Sander de Smalen	50d8702f26	[AArch64][AsmParser] NFC: Cleanup parsing of scalar registers. Summary: - Renamed tryParseRegister to tryParseScalarRegister, which now returns an OperandMatchResultTy. - Moved matching of certain aliases into matchRegisterNameAlias. - Changed type of most 'Reg' variables to 'unsigned'. This is patch [1/4] in a series to add assembler/disassembler support for SVE's contiguous LD1 (scalar+scalar) instructions: - Patch [1/4]: https://reviews.llvm.org/D45687 - Patch [2/4]: https://reviews.llvm.org/D45688 - Patch [3/4]: https://reviews.llvm.org/D45689 - Patch [4/4]: https://reviews.llvm.org/D45690 Reviewers: fhahn, rengolin, javed.absar, huntergr, SjoerdMeijer, t.p.northover, echristo, evandro, samparker Reviewed By: samparker Subscribers: samparker, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D45687 llvm-svn: 330311	2018-04-19 07:35:08 +00:00
Craig Topper	f846e2d1b1	[X86] Scrub scheduling information for MUL/IMUL on Intel CPUs. This removes a bunch of unnecessary InstRW overrides. It also cleans up the missing information from the Sandy Bridge model. Other fixes to other models. llvm-svn: 330308	2018-04-19 05:34:05 +00:00
Bob Haarman	cb80a3fce0	Fix data race in X86FloatingPoint.cpp ASSERT_SORTED Summary: ASSERT_SORTED checks if a table is sorted, and uses a boolean to prevent the check from being run again if it was earlier determined that the table is in fact sorted. Unsynchronized reads and writes of that boolean triggered ThreadSanitizer's data race detection. This change rewrites the code to use std::atomic<bool> instead. Fixes PR36922. Reviewers: rnk Reviewed By: rnk Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D45742 llvm-svn: 330301	2018-04-18 23:04:09 +00:00
Craig Topper	ebf52e80c1	[X86] Correct the Defs, Uses, hasSideEffects, mayLoad, mayStore for XCHG and XADD instructions. I don't think we emit any of these from codegen except for using XCHG16ar as 2 byte NOP. llvm-svn: 330298	2018-04-18 22:07:53 +00:00
Artem Belevich	0ae8590354	[NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma instructions. The new instructions were added for sm_70+ GPUs in CUDA-9.1. Differential Revision: https://reviews.llvm.org/D45068 llvm-svn: 330296	2018-04-18 21:51:48 +00:00
Simon Pilgrim	c310bfa193	[llvm-mca][X86] Add mmx versions of SSSE3 instructions Move PABS instructions incorrectly tested under SSE2 llvm-svn: 330295	2018-04-18 20:47:48 +00:00
Alex Bradbury	9891ba3476	[RISCV] Add test changes missed from rL330293 llvm-svn: 330294	2018-04-18 20:36:12 +00:00
Alex Bradbury	3ff2022bb9	[RISCV] Introduce pattern for materialising immediates with 0 for lower 12 bits These immediates can be materialised with just an lui, rather than an lui+addi pair. llvm-svn: 330293	2018-04-18 20:34:23 +00:00
Alex Bradbury	792547b348	[RISCV] Add imm-cse.ll test case This test case demonstrates that common subexpression elimination takes place between code sequences for materialising constants. In particular, it demonstrates that redundant lui aren't generated. This would capture a regression if applying a patch such as D41949. llvm-svn: 330291	2018-04-18 20:25:07 +00:00
Lei Huang	829cd8e263	[NFC] test case clean up 1. remove redundant tests 2. update XForm_tests to generated expected code gen llvm-svn: 330290	2018-04-18 20:22:26 +00:00
Alex Bradbury	c0464d9271	[RISCV] Expand codegen -> compression sanity checks and move to a single file The objdump tests interfere with update_llc_test_checks.py, which can't automatically update them. Put the sanity check for compression on the codegen codepath into a separate file, and expand it to also include tests of integer materialisation. This would catch changes such as those triggered by D41949. llvm-svn: 330288	2018-04-18 20:17:29 +00:00
Craig Topper	04244cbf45	[X86] Fix the Uses/Defs,mayLoad,mayStore,hasSideEffects flags for the CMPXCHG instructions. The compiler only emits the locked version of these which use different instruction definitions. The versions fixed here are only used by the assembler/disassembler. llvm-svn: 330287	2018-04-18 20:15:00 +00:00
Alex Bradbury	099c720426	Revert "[RISCV] implement li pseudo instruction" Reverts rL330224, while issues with the C extension and missed common subexpression elimination opportunities are addressed. Neither of these issues are visible in current RISC-V backend unit tests, which clearly need expanding. llvm-svn: 330281	2018-04-18 19:02:31 +00:00
Lei Huang	192c6ccf6d	[Power9] Legalize and emit code for converting Unsigned HWord/Char to Quad-Precision Legalize and emit code for converting unsigned HWord/Char to QP: xscvsdqp xscvudqp Only covering patterns for unsigned forms because we don't have part-word sign-extending integer loads into VSX registers. Differential Revision: https://reviews.llvm.org/D45494 llvm-svn: 330278	2018-04-18 17:41:46 +00:00
Amara Emerson	9de072f8ae	[AArch64] Add isel pattern for v8i8->v2f32 NVCASTs. rdar://39454635 llvm-svn: 330276	2018-04-18 17:10:19 +00:00

163329 Commits