llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	0e29d8d81f	[X86][SSE] Add extra trunc(shl) test cases The existing trunc_shl_17_v8i16_v8i32 test case should (but doesn't) fold to zero, I've added 2 new test cases: - trunc_shl_16_v8i16_v8i32 which folds to zero (this is actually testing the target faux shuffle combine) - trunc_shl_15_v8i16_v8i32 which should perform the full shl + truncate llvm-svn: 334188	2018-06-07 11:22:52 +00:00
Florian Hahn	0d6b01761c	[Mem2Reg] Avoid replacing load with itself in promoteSingleBlockAlloca. We do the same thing in rewriteSingleStoreAlloca. Fixes PR37632. Reviewers: chandlerc, davide, efriedma Reviewed By: davide Differential Revision: https://reviews.llvm.org/D47825 llvm-svn: 334187	2018-06-07 11:09:05 +00:00
Matt Arsenault	697300bd4f	AMDGPU: Use scalar operations for f16 fabs/fneg patterns Fixes unnecessary differences between subtargets. llvm-svn: 334184	2018-06-07 10:15:20 +00:00
Simon Pilgrim	cc92897be9	[X86] Regenerate rotate tests Add 32-bit tests to show missed SHLD/SHRD cases llvm-svn: 334183	2018-06-07 10:13:09 +00:00
Paul Semel	e57bc78324	[llvm-strip] Expose --strip-unneeded option Differential Revision: https://reviews.llvm.org/D47818 llvm-svn: 334182	2018-06-07 10:05:25 +00:00
Matt Arsenault	90083d3088	AMDGPU: Try a lot harder to emit scalar loads This has two main components. First, widen widen short constant loads in DAG when they have the correct alignment. This is already done a bit in AMDGPUCodeGenPrepare, since that has access to DivergenceAnalysis. This can't help kernarg loads created in the DAG. Start to use DAG divergence analysis to help this case. The second part is to avoid kernel argument lowering breaking the alignment of short vector elements because calling convention lowering wants to split everything into legal register types. When loading a split type, load the nearest 4-byte aligned segment and shift to get the desired bits. This extra load of the earlier argument piece ends up merging, and the bit extract hopefully folds out. There are a number of improvements and regressions with this, but I think as-is this is a better compromise between several of the worst parts of SelectionDAG. Particularly when i16 is legal, this produces worse code for i8 and i16 element vector kernel arguments. This is partially due to the very weak load merging the DAG does. It only looks for fairly specific combines between pairs of loads which no longer appear. In particular this causes v4i16 loads to be split into 2 components when previously the two halves were merged. Worse, because of the newly introduced shifts, there is a lot more unnecessary vector packing and unpacking code emitted. At least some of this is due to reporting false for isTypeDesirableForOp for i16 as a workaround for the lack of divergence information in the DAG. The cases where this happens it doesn't actually matter, but the relevant code in SimplifyDemandedBits doens't have the context to know to ignore this. The use of the scalar cache is probably more important than the mess of mostly scalar instructions doing this packing and unpacking. Future work can fix this, possibly by making better use of the new DAG divergence information for controlling promotion decisions, or adding another version of shift + trunc + shift combines that doesn't only know about the used types. llvm-svn: 334180	2018-06-07 09:54:49 +00:00
Clement Courbet	4281b1d3b5	[X86][NFC] Fix harmless typo in BtVer2 model. See D46356 for context. llvm-svn: 334178	2018-06-07 09:26:33 +00:00
Tomasz Krupa	f8c7637027	[X86] Block UndefRegUpdate Summary: Prevent folding of operations with memory loads when one of the sources has undefined register update. Reviewers: craig.topper Subscribers: llvm-commits, mike.dvoretsky, ashlykov Differential Revision: https://reviews.llvm.org/D47621 llvm-svn: 334175	2018-06-07 08:48:45 +00:00
Max Kazantsev	b4b2ccea6d	[NFC] Use variable instead of accessing pair many times llvm-svn: 334173	2018-06-07 08:47:19 +00:00
Tomasz Krupa	145825162a	Test commit access. Added a bunch of periods after comments. llvm-svn: 334171	2018-06-07 08:20:28 +00:00
Guillaume Chatelet	7b852cd814	[llvm-exegesis] Add a Configuration object for Benchmark. Summary: This is the first step to have the BenchmarkRunner create and measure many different configurations (different initial values for instance). Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D47826 llvm-svn: 334169	2018-06-07 08:11:54 +00:00
Guillaume Chatelet	8c91d4cb04	[llvm-exegesis] Improve error reporting. Summary: BenchmarkResult IO functions now return an Error or Expected so caller can deal take proper action. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D47868 llvm-svn: 334167	2018-06-07 07:51:16 +00:00
Guillaume Chatelet	083a0c1621	[llvm-exegesis] Serializes instruction's operand in BenchmarkResult's key. Summary: Follow up patch to https://reviews.llvm.org/D47764. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D47785 llvm-svn: 334165	2018-06-07 07:40:40 +00:00
Clement Courbet	9212ef0a0a	[X86][NFC] Fix harmless typos in BDW/ZnVer1 sched models. See D46356 for context. llvm-svn: 334164	2018-06-07 07:37:49 +00:00
Karl-Johan Karlsson	abb11f805f	[BranchFolding] Fix live-in's when hoisting code Summary: When the branch folder hoist code into a predecessor it adjust live-in's in the blocks it hoist code from. However it fail to handle hoisted code that contain a defed register that originally is live-in in the block through a super register. This is fixed by replacing the live-in handling code with calls to utility functions in LivePhysRegs. Reviewers: kparzysz, gberry, MatzeB, uweigand, aprantl Reviewed By: kparzysz Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47529 llvm-svn: 334163	2018-06-07 07:20:33 +00:00
Jonas Paulsson	e80d405760	[SystemZ] Build Load And Test from scratch in convertToLoadAndTest. This is needed to get CC operand in right place, as expected by the SchedModel. Review: Ulrich Weigand https://reviews.llvm.org/D47820 llvm-svn: 334161	2018-06-07 05:59:07 +00:00
Michael Zolotukhin	31800864dc	SpeculativeExecution Pass: Set PreserveCFG to avoid unnecessary analyses invalidation. The pass doesn't touch CFG in any way, only moves instructions between blocks. llvm-svn: 334150	2018-06-07 00:19:29 +00:00
Peter Collingbourne	ac0f5cf74b	Add definition for ELF dynamic tag DT_SYMTAB_SHNDX. DT_SYMTAB_SHNDX is defined in generic-abi: http://www.sco.com/developers/gabi/latest/ch5.dynamic.html Patch by Rahul Chaudhry! Differential Revision: https://reviews.llvm.org/D47803 llvm-svn: 334149	2018-06-07 00:06:41 +00:00
Peter Collingbourne	cf017ada68	llvm-readobj: fix printing number of relocations in Android packed format. With '-elf-output-style=GNU -relocations', a header containing the number of entries is printed before all the relocation entries in the section. For Android packed format, we need to perform the unpacking first before we can get the actual number of relocations in the section. Patch by Rahul Chaudhry! Differential Revision: https://reviews.llvm.org/D47800 llvm-svn: 334147	2018-06-07 00:02:07 +00:00
Stanislav Mekhanoshin	df61be70b2	[AMDGPU] Improve reciprocal handling When denormals are supported we are producing a full division for 1.0f / x. That still can be replaced by the faster version: bool c = fabs(x) > 0x1.0p+96f; float s = c ? 0x1.0p-32f : 1.0f; x = s; return s v_rcp_f32(x) in case if requested accuracy is 2.5ulp or less. The same version is used if denormals are not supported for non 1.0 numerators, where just v_rcp_f32 is then used for 1.0 numerator. The optimization of 1/x is extended to the case -1/x, which is the same except for the resulting sign bit. OpenCL conformance passed with both enabled and disabled denorms. Differential Revision: https://reviews.llvm.org/D47805 llvm-svn: 334142	2018-06-06 22:22:32 +00:00
Teresa Johnson	4ffc3e7834	[ThinLTO] Rename index IsAnalysis flag to HaveGVs (NFC) With the upcoming patch to add summary parsing support, IsAnalysis would be true in contexts where we are not performing module summary analysis. Rename to the more specific and approprate HaveGVs, which is essentially what this flag is indicating. llvm-svn: 334140	2018-06-06 22:22:01 +00:00
Sanjay Patel	3cd1aa88f9	[InstCombine] fold another shifty abs pattern to cmp+sel (PR36036) The bug report: https://bugs.llvm.org/show_bug.cgi?id=36036 ...requests a DAG change for this, but an IR canonicalization probably handles most cases. If we still want to match this pattern in the backend, there's a proposal for that too: D47831 Alive proofs including nsw/nuw cases that were first noted in: D46988 https://rise4fun.com/Alive/Kmp This patch is largely copied from the existing code that was initially added with: D40984 ...but I didn't see much gain from trying to share code. llvm-svn: 334137	2018-06-06 21:58:12 +00:00
Petr Hosek	0acc024d7a	[CMake] Pass additional CMake tools to external projects This is needed when the external projects try to use other tools besides just the compiler and the linker. Differential Revision: https://reviews.llvm.org/D47833 llvm-svn: 334136	2018-06-06 21:43:37 +00:00
Sanjay Patel	6fda6b1210	[InstCombine] add tests for another abs() pattern (PR36036); NFC llvm-svn: 334133	2018-06-06 21:32:42 +00:00
Matt Arsenault	e9524f1fb3	AMDGPU: Custom lower v2f16 fneg/fabs with illegal f16 Fixes terrible code on targets without f16 support. The legalization creates a mess that is difficult to recover from. Also should avoid randomly breaking these tests multiple times in sequence in future commits. Some regressions in cases where it happens to be better to pull the source modifier after the conversion. llvm-svn: 334132	2018-06-06 21:28:11 +00:00
Alexander Shaposhnikov	29407f3abe	[llvm-strip] Expose --discard-all option Expose objcopy's --discard-all option in llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47750 llvm-svn: 334131	2018-06-06 21:23:19 +00:00
Roman Lebedev	cbf8446359	[InstCombine] PR37603: low bit mask canonicalization Summary: This is [[ https://bugs.llvm.org/show_bug.cgi?id=37603 \| PR37603 ]]. https://godbolt.org/g/VCMNpS https://rise4fun.com/Alive/idM When doing bit manipulations, it is quite common to calculate some bit mask, and apply it to some value via `and`. The typical C code looks like: ``` int mask_signed_add(int nbits) { return (1 << nbits) - 1; } ``` which is translated into (with `-O3`) ``` define dso_local i32 @mask_signed_add(int)(i32) local_unnamed_addr #0 { %2 = shl i32 1, %0 %3 = add nsw i32 %2, -1 ret i32 %3 } ``` But there is a second, less readable variant: ``` int mask_signed_xor(int nbits) { return ~(-(1 << nbits)); } ``` which is translated into (with `-O3`) ``` define dso_local i32 @mask_signed_xor(int)(i32) local_unnamed_addr #0 { %2 = shl i32 -1, %0 %3 = xor i32 %2, -1 ret i32 %3 } ``` Since we created such a mask, it is quite likely that we will use it in `and` next. And then we may get rid of `not` op by folding into `andn`. But now that i have actually looked: https://godbolt.org/g/VTUDmU _some_ backend changes will be needed too. We clearly loose `bzhi` recognition. Reviewers: spatel, craig.topper, RKSimon Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47428 llvm-svn: 334127	2018-06-06 19:38:27 +00:00
Roman Lebedev	4771bc6c35	[InstCombine][NFC] PR37603: low bit mask canonicalization tests Differential Revision: https://reviews.llvm.org/D47427 llvm-svn: 334126	2018-06-06 19:38:21 +00:00
Roman Lebedev	488d28d4e5	[X86] Emit BZHI when mask is ~(-1 << nbits)) Summary: In D47428, i propose to choose the `~(-(1 << nbits))` as the canonical form of low-bit-mask formation. As it is seen from these tests, there is a reason for that. AArch64 currently better handles `~(-(1 << nbits))`, but not the more traditional `(1 << nbits) - 1` (sic!). The other way around for X86. It would be much better to canonicalize. This patch is completely monkey-typing. I don't really understand how this works :) I have based it on `// x & (-1 >> (32 - y))` pattern. Also, when we only have `BMI`, i wonder if we could use `BEXTR` with `start=0` ? Related links: https://bugs.llvm.org/show_bug.cgi?id=36419 https://bugs.llvm.org/show_bug.cgi?id=37603 https://bugs.llvm.org/show_bug.cgi?id=37610 https://rise4fun.com/Alive/idM Reviewers: craig.topper, spatel, RKSimon, javed.absar Reviewed By: craig.topper Subscribers: kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D47453 llvm-svn: 334125	2018-06-06 19:38:16 +00:00
Roman Lebedev	cb56f7a550	[NFC][X86][AArch64] Reorganize/cleanup BZHI test patterns Summary: In D47428, i propose to choose the `~(-(1 << nbits))` as the canonical form of low-bit-mask formation. As it is seen from these tests, there is a reason for that. AArch64 currently better handles `~(-(1 << nbits))`, but not the more traditional `(1 << nbits) - 1` (sic!). The other way around for X86. It would be much better to canonicalize. It would seem that there is too much tests, but this is most of all the auto-generated possible variants of C code that one would expect for BZHI to be formed, and then manually cleaned up a bit. So this should be pretty representable, which somewhat good coverage... Related links: https://bugs.llvm.org/show_bug.cgi?id=36419 https://bugs.llvm.org/show_bug.cgi?id=37603 https://bugs.llvm.org/show_bug.cgi?id=37610 https://rise4fun.com/Alive/idM Reviewers: javed.absar, craig.topper, RKSimon, spatel Reviewed By: RKSimon Subscribers: kristof.beyls, llvm-commits, RKSimon, craig.topper, spatel Differential Revision: https://reviews.llvm.org/D47452 llvm-svn: 334124	2018-06-06 19:38:10 +00:00
Krzysztof Parzyszek	c1e712baa5	[Hexagon] Implement vector-pair zero as V6_vsubw_dv llvm-svn: 334123	2018-06-06 19:34:40 +00:00
Craig Topper	ef813a5226	[X86] Properly disassemble gather/scatter instructions where xmm4/ymm4/zmm4 are used as the index. These encodings correspond to the cases in the normal encoding scheme where there is no index and our modrm reading code initially decodes it as such. The VSIB handling code tried to compensate for this, but failed to add the base needed to make later code do the right thing. Fixes PR37712. llvm-svn: 334121	2018-06-06 19:15:15 +00:00
Craig Topper	d04cc8e640	[X86] Rename vy512mem->vy512xmem and vz256xmem->vz256mem. The index size is represented by the letter after the 'v'. The number represents the memory size. If an 'x' appears after the number its means the index register can be from VR128X/VR256X instead of VR128/VR256. As vy512mem uses a VR256X index it should have an x. And vz256mem uses a VR512 index so it shouldn't have an x. I admit these names kind of suck and are confusing. llvm-svn: 334120	2018-06-06 19:15:12 +00:00
Simon Pilgrim	aef5bdbea1	[X86][BtVer2] Add support for all vector instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions), all these instructions are dependency breaking and zero the destination register. llvm-svn: 334119	2018-06-06 19:06:09 +00:00
Vedant Kumar	6d354ed72e	[Debugify] Move debug value intrinsics closer to their operand defs Before this patch, debugify would insert debug value intrinsics before the terminating instruction in a block. This had the advantage of being simple, but was a bit too simple/unrealistic. This patch teaches debugify to insert debug values immediately after their operand defs. This enables better testing of the compiler. For example, with this patch, `opt -debugify-each` is able to identify a vectorizer DI-invariance bug fixed in llvm.org/PR32761. In this bug, the vectorizer produced different output with/without debug info present. Reverting Davide's bugfix locally, I see: $ ~/scripts/opt-check-dbg-invar.sh ./bin/opt \ .../SLPVectorizer/AArch64/spillcost-di.ll -slp-vectorizer Comparing: -slp-vectorizer .../SLPVectorizer/AArch64/spillcost-di.ll Baseline: /var/folders/j8/t4w0bp8j6x1g6fpghkcb4sjm0000gp/T/tmp.iYYeL1kf With DI : /var/folders/j8/t4w0bp8j6x1g6fpghkcb4sjm0000gp/T/tmp.sQtQSeet 9,11c9,11 < %5 = getelementptr inbounds %0, %0* %2, i64 %0, i32 1 < %6 = bitcast i64* %4 to <2 x i64>* < %7 = load <2 x i64>, <2 x i64>* %6, align 8, !tbaa !0 --- > %5 = load i64, i64* %4, align 8, !tbaa !0 > %6 = getelementptr inbounds %0, %0* %2, i64 %0, i32 1 > %7 = load i64, i64* %6, align 8, !tbaa !5 12a13 > store i64 %5, i64* %8, align 8, !tbaa !0 14,15c15 < %10 = bitcast i64* %8 to <2 x i64>* < store <2 x i64> %7, <2 x i64>* %10, align 8, !tbaa !0 --- > store i64 %7, i64* %9, align 8, !tbaa !5 :: Found a test case ^ Running this over the *.ll files in tree, I found four additional examples which compile differently with/without DI present. I plan on filing bugs for these. llvm-svn: 334118	2018-06-06 19:05:42 +00:00
Vedant Kumar	a9e27312b8	[Debugify] Add a quiet mode to suppress warnings Suppressing warning output and module dumps significantly speeds up fuzzing with `opt -debugify-each`. llvm-svn: 334117	2018-06-06 19:05:41 +00:00
Evandro Menezes	b2c8244715	[AArch64, ARM] Add support for Samsung Exynos M4 Create a separate feature set for Exynos M4 and add test cases. llvm-svn: 334115	2018-06-06 18:56:00 +00:00
Han Shen	2c5d2ea8a6	Fix the test case that places intermediate in source directory. This causes "permission denied" error in some controlled test environment where source tree is read-only. Differential Revision: https://reviews.llvm.org/D47839 llvm-svn: 334114	2018-06-06 18:53:17 +00:00
Michael Berg	cc1c4b6912	guard fsqrt with fmf sub flags Summary: This change uses fmf subflags to guard optimizations as well as unsafe. These changes originated from D46483. It contains only context for fsqrt. Reviewers: spatel, hfinkel, arsenm Reviewed By: spatel Subscribers: hfinkel, wdng, andrew.w.kaylor, wristow, efriedma, nemanjai Differential Revision: https://reviews.llvm.org/D47749 llvm-svn: 334113	2018-06-06 18:47:55 +00:00
Teresa Johnson	9e46c6da69	[ThinLTO] Make ValueInfo operator!= consistent with operator== (NFC) Compare Ref pointers instead of GUID, to handle comparison with special empty/tombstone ValueInfo. This was already done for operator==, to support inserting ValueInfo into DenseMap, but I need the operator!= side change for upcoming AsmParser summary parsing support. llvm-svn: 334111	2018-06-06 18:32:16 +00:00
Simon Pilgrim	7a48bb6e44	[llvm-mca][x86] Fix all resources-x86_64.s tests to use different registers in reg-reg cases I noticed while working on zero-idiom + dependency-breaking support (PR36671) that most of our binary instruction tests were reusing the same src registers, which would cause the tests to fail once we enable scalar zero-idiom support on btver2. Fixed in all targets to keep them in sync. llvm-svn: 334110	2018-06-06 18:20:25 +00:00
Krzysztof Parzyszek	0da1fe3770	[Hexagon] Split CTPOP of vector pairs llvm-svn: 334109	2018-06-06 18:03:29 +00:00
Sanjay Patel	0e8b90da0c	[ConstProp] move tests for fp <--> int; NFC These were added for D5603 / rL219542, and there's a proposal to change one side in D47807. These are tests of constant propagation, so they shouldn't have ever been tested/housed under InstCombine. llvm-svn: 334107	2018-06-06 16:53:56 +00:00
Petar Jovanovic	8cb6a521be	Change TII isCopyInstr way of returning arguments(NFC) Make TII isCopyInstr() return MachineOperands through pointer to pointer instead via reference. Patch by Nikola Prica. Differential Revision: https://reviews.llvm.org/D47364 llvm-svn: 334105	2018-06-06 16:36:30 +00:00
Simon Pilgrim	64541ff297	[X86][BtVer2] Add tests for all vector instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions), all these instructions are dependency breaking and zero the destination register. TODO: Scalar instructions still need to be tested (need to check EFLAGS handling). llvm-svn: 334104	2018-06-06 16:14:37 +00:00
Hans Wennborg	c4b7e0125f	Relax shtest-run-at-line.py The test was failing on Windows machines which had bash.exe on PATH (but not in the so called lit tools dir, containing cmp.exe, grep.exe etc.). The problem was that the outer lit invocation would load LLVMConfig from utils/lit/lit/llvm/config.py, which looks up the tools path with getToolsPath(). That has a surprising side effect of also setting bashPath, in our case setting it to empty. The outer lit invocation would thus configure the pdbg0 and pdbg1 substitutions based on not running with bash. But the inner lit invocation would not load LLVMConfig, so bash would be found on PATH, that would be used as external shell, and so the output wouldn't match pdbg0 and pdbg1. It seems weird to me that getBashPath() will return different results depending on whether getToolsPath() has been called before, but I also don't know how to fix it properly. This commit just relaxes the test case, because there doesn't seem to be much point in testing for the exact syntax of the run file as long as it works. (See https://crbug.com/850023) llvm-svn: 334100	2018-06-06 14:53:03 +00:00
David Green	25312b2b6c	[GlobalMerge] Set the alignment on merged global structs If no alignment is set, the abi/preferred alignment of structs will be used which may be higher than required. This can lead to extra padding and in the end an increase in data size. Differential Revision: https://reviews.llvm.org/D47633 llvm-svn: 334099	2018-06-06 14:48:32 +00:00
Kristof Beyls	566c74cc98	Avoid UnicodeEncodeError on non-ascii reviewer names ... by using unicode instead of byte strings where non-ascii strings can be formatted in. llvm-svn: 334098	2018-06-06 14:19:58 +00:00
Simon Dardis	9b1182acf4	[mips] Add testcase for i64, i128 addition for the DSP ASE llvm-svn: 334094	2018-06-06 13:30:39 +00:00
Tim Northover	9b80060d7b	InstCombine: ignore debug instructions during fence combine We should never get different CodeGen based on whether the code is being compiled in debug mode so we must skip over @llvm.dbg.value (and similar) calls. Should fix at least the worst part of PR37690. llvm-svn: 334090	2018-06-06 12:46:02 +00:00
Greg Bedwell	86026bdaee	Update the project name in README.txt Per llvm.org: "The name "LLVM" itself is not an acronym; it is the full name of the project." Differential Revision: https://reviews.llvm.org/D47796 llvm-svn: 334087	2018-06-06 11:15:54 +00:00
Simon Pilgrim	f06ff16049	Fix MSVC '*/' found outside of comment warning. NFCI. llvm-svn: 334086	2018-06-06 11:10:11 +00:00
Ilya Biryukov	3c9c10649b	Fix compilation of WebAssembly and RISCV after r334078 llvm-svn: 334085	2018-06-06 10:57:50 +00:00
Simon Dardis	0bba0df896	[mips] Partially revert r334031 The test changes in r334031 give unstable pass/fail results on the llvm-clang-x86_64-expensive-checks-win buildbot. Revert the test changes to turn the bot green. llvm-svn: 334084	2018-06-06 10:54:30 +00:00
Simon Pilgrim	3d14158891	[X86][BMI][TBM] Only demand bottom 16-bits of the BEXTR control op (PR34042) Only the bottom 16-bits of BEXTR's control op are required (0:8 INDEX, 15:8 LENGTH). Differential Revision: https://reviews.llvm.org/D47690 llvm-svn: 334083	2018-06-06 10:52:10 +00:00
Pavel Labath	1b8bfd7e7d	[cmake] fix a typo in llvm_config macro Summary: The macro parses out the USE_SHARED option out of the argument list, but then ignores it and accesses the variable with the same name instead. It seems the intention here was to check the argument value. Technically, this is NFC, because the only in-tree usage (add_llvm_executable) of USE_SHARED sets both the variable and the argument when calling llvm_config, but it makes the usage of this macro for out-of-tree users more sensible. Reviewers: mgorny, beanz Reviewed By: mgorny Subscribers: foutrelis, llvm-commits Differential Revision: https://reviews.llvm.org/D44420 llvm-svn: 334082	2018-06-06 10:07:08 +00:00
Clement Courbet	62b34fa89a	[llvm-exegesis] move Mode from Key to BenchmarResult. Moves the Mode field out of the Key. The existing yaml benchmark results can be fixed with the following script: ``` readonly FILE=$1 readonly MODE=latency # Change to uops to fix a uops benchmark. cat $FILE \| \ sed "/^\ \+mode:\ \+$MODE$/d" \| \ sed "/^cpu_name.*$/i mode: $MODE" ``` Differential Revision: https://reviews.llvm.org/D47813 Authored by: Guillaume Chatelet llvm-svn: 334079	2018-06-06 09:42:36 +00:00
Peter Smith	57f661bd7d	[MC] Pass MCSubtargetInfo to fixupNeedsRelaxation and applyFixup On targets like Arm some relaxations may only be performed when certain architectural features are available. As functions can be compiled with differing levels of architectural support we must make a judgement on whether we can relax based on the MCSubtargetInfo for the function. This change passes through the MCSubtargetInfo for the function to fixupNeedsRelaxation so that the decision on whether to relax can be made per function. In this patch, only the ARM backend makes use of this information. We must also pass the MCSubtargetInfo to applyFixup because some fixups skip error checking on the assumption that relaxation has occurred, to prevent code-generation errors applyFixup must see the same MCSubtargetInfo as fixupNeedsRelaxation. Differential Revision: https://reviews.llvm.org/D44928 llvm-svn: 334078	2018-06-06 09:40:06 +00:00
Elena Demikhovsky	0ef2ce3667	Added documentation for Masked Vector Expanding Load and Compressing Store Intrinsics Differential Revision: https://reviews.llvm.org/D26743 llvm-svn: 334075	2018-06-06 09:11:46 +00:00
Petar Jovanovic	326ec32403	[MIPS GlobalISel] Add lowerCall Add minimal support to lower function calls. Support only functions with arguments/return that go through registers and have type i32. Patch by Petar Avramovic. Differential Revision: https://reviews.llvm.org/D45627 llvm-svn: 334071	2018-06-06 07:24:52 +00:00
Petr Hosek	fc9b29bd61	[Support] Use zx_cache_flush on Fuchsia to flush instruction cache Fuchsia doesn't use __clear_cache, instead it provide zx_cache_flush system call. Use it to flush instruction cache. Differential Revision: https://reviews.llvm.org/D47753 llvm-svn: 334068	2018-06-06 06:26:18 +00:00
Vlad Tsyrklevich	80a764bab1	[Analyzer] Fix the Z3 lit test config Summary: The '%analyze' extra_args config argument seems to have been erroneously deleted in r315627 disabling Z3 tests for the clang analyzer. Add the flag back. Reviewers: george.karpenkov, NoQ, ddcc Reviewed By: george.karpenkov Subscribers: xazax.hun, szepet, delcypher, a.sidorin, llvm-commits Differential Revision: https://reviews.llvm.org/D47722 llvm-svn: 334066	2018-06-06 06:25:37 +00:00
Sanjay Patel	59313be8d3	[CodeGen] assume max/default throughput for unspecified instructions This is a fix for the problem arising in D47374 (PR37678): https://bugs.llvm.org/show_bug.cgi?id=37678 We may not have throughput info because it's not specified in the model or it's not available with variant scheduling, so assume that those instructions can execute/complete at max-issue-width. Differential Revision: https://reviews.llvm.org/D47723 llvm-svn: 334055	2018-06-05 23:34:45 +00:00
Amaury Sechet	a79b6b3ef0	[Mips] Remove uneeded variants of ADDC/ADDE lowering Summary: As it turns out, the lowering for the Mips16* family of target is the exact same thing as what the ops expands to, so the code handling them can be removed and the ops only enabled for the MipsSE* family of targets. Reviewers: smaksimovic, atanasyan, abeserminji Subscribers: sdardis, arichardson, llvm-commits Differential Revision: https://reviews.llvm.org/D47703 llvm-svn: 334052	2018-06-05 22:13:56 +00:00
Guozhi Wei	c4c6b548c5	[CodeGenPrepare] Move Extension Instructions Through Logical And Shift Instructions CodeGenPrepare pass move extension instructions close to load instructions in different BB, so they can be combined later. But the extension instructions can't move through logical and shift instructions in current implementation. This patch enables this enhancement, so we can eliminate more extension instructions. Differential Revision: https://reviews.llvm.org/D45537 This is re-commit of r331783, which was reverted by r333305. The performance regression was caused by some unlucky alignment, not a code generation problem. llvm-svn: 334049	2018-06-05 21:03:52 +00:00
Zachary Turner	8ac1c38a72	[FileSystem] Remove OpenFlags param from several functions. There was only one place in the entire codebase where a non default value was being passed, and that place was already hidden in an implementation file. So we can delete the extra parameter and all existing clients continue to work as they always have, while making the interface a bit simpler. Differential Revision: https://reviews.llvm.org/D47789 llvm-svn: 334046	2018-06-05 19:58:26 +00:00
Matt Arsenault	57e541e87e	AMDGPU: Preserve metadata when widening loads Preserves the low bound of the !range. I don't think it's legal to do anything with the top half since it's theoretically reading garbage. llvm-svn: 334045	2018-06-05 19:52:56 +00:00
Matt Arsenault	9224c00d2b	AMDGPU: Use more custom insert/extract_vector_elt lowering Apply to i8 vectors. llvm-svn: 334044	2018-06-05 19:52:46 +00:00
Krzysztof Parzyszek	b984ffcc71	[Hexagon] Add pattern to generate 64-bit neg instruction llvm-svn: 334043	2018-06-05 19:52:39 +00:00
Krzysztof Parzyszek	d8b093efef	[Hexagon] Add more patterns for generating abs/absp instructions llvm-svn: 334038	2018-06-05 19:00:50 +00:00
Michael Berg	96925fe0df	guard fneg with fmf sub flags Summary: This change uses fmf subflags to guard optimizations as well as unsafe. These changes originated from D46483. Reviewers: spatel, hfinkel Reviewed By: spatel Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D47389 llvm-svn: 334037	2018-06-05 18:49:47 +00:00
Michael Berg	8f6d6c817d	NFC: adding baseline fneg case for fmf llvm-svn: 334035	2018-06-05 18:12:25 +00:00
Simon Dardis	0d95ff03f2	[mips] Fix the predicates for arithmetic operations Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47635 llvm-svn: 334031	2018-06-05 17:53:22 +00:00
Greg Bedwell	a9a6d54146	[UpdateTestChecks] Error if --llvm-mca-binary gets an empty string If the command line was mistyped like: ./update_mca_test_checks.py --llvm-mca-binary= /path/to/llvm-mca *.s ^-- extra whitespace then /path/to/llvm-mca would get treated by argparse as a test-path pattern and could actually be opened in write mode and overwritten. llvm-svn: 334029	2018-06-05 17:16:19 +00:00
Andrea Di Biagio	757600bccb	[llvm-mca] Correctly update the CyclesLeft of a register read in the presence of partial register updates. This patch fixe the logic in ReadState::cycleEvent(). That method was not correctly updating field `TotalCycles`. Added extra code comments in class ReadState to better describe each field. llvm-svn: 334028	2018-06-05 17:12:02 +00:00
Fangrui Song	c581e567b3	Remove a self-referencing #include llvm-svn: 334027	2018-06-05 16:59:40 +00:00
Simon Pilgrim	f2f043acbb	[X86][SSE] Use multiplication scale factors for v8i16 SHL on pre-AVX2 targets. Similar to v4i32 SHL, convert v8i16 shift amounts to scale factors instead to improve performance and reduce instruction count. We were already doing this for constant shifts, this adds variable shift support. Reduces the serial nature of the codegen, which relies on chains of plendvb/pand+pandn+por shifts. This is a step towards adding support for vXi16 vector rotates. Differential Revision: https://reviews.llvm.org/D47546 llvm-svn: 334023	2018-06-05 15:17:39 +00:00
Nirav Dave	05b589101e	[MC][X86] Allow assembler variable assignment to register name. Summary: Allow extended parsing of variable assembler assignment syntax and modify X86 to permit VAR = register assignment. As we emit these as .set directives when possible, we inline such expressions in output assembly. Fixes PR37425. Reviewers: rnk, void, echristo Reviewed By: rnk Subscribers: nickdesaulniers, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D47545 llvm-svn: 334022	2018-06-05 15:13:39 +00:00
Matt Arsenault	191bc71541	DAG: Stop dropping invariant/dereferencable When legalizing illegal FP load results, this was for some reason dropping the invariant and dereferencable memory flags. There doesn't seem to be any reason for this, and the equivalent isn't done for integer loads. Fixes an issue in a future AMDGPU commit where some identical loads fail to merge because one of the loads ends up dropping the flags. llvm-svn: 334020	2018-06-05 14:52:24 +00:00
John Brawn	e4ff0bd401	[InstCombine] Correct the cmp operand type used when canonicalizing abs/nabs When adjusting a cmp in order to canonicalize an abs/nabs select pattern we need to use the type of the existing operand when creating a new operand not the type of a select operand, as the two may be different. This fixes PR37686. llvm-svn: 334019	2018-06-05 14:10:55 +00:00
Gabor Buella	1181f94ae4	[X86] NFC Fix typo introduced in r328016 HSI->HDI llvm-svn: 334016	2018-06-05 12:55:12 +00:00
Krzysztof Parzyszek	aafb8c204c	[Hexagon] Minor cleanups in isel lowering llvm-svn: 334015	2018-06-05 12:49:19 +00:00
Hiroshi Inoue	955655f558	[PowerPC] reduce rotate in BitPermutationSelector BitPermutationSelector builds the output value by repeating rotate-and-mask instructions with input registers. Here, we may avoid one rotate instruction if we start building from an input register that does not require rotation. For example of the test case bitfieldinsert.ll, it first rotates left r4 by 8 bits and then inserts some bits from r5 without rotation. This can be executed by one rlwimi instruction, which rotates r4 by 8 bits and inserts its bits into r5. This patch adds a check for rotation amounts in the comparator used in sorting to process the input without rotation first. Differential Revision: https://reviews.llvm.org/D47765 llvm-svn: 334011	2018-06-05 11:58:01 +00:00
Clement Courbet	53d35d2dc4	[llvm-exegesis] Add instructions to BenchmarkResult Key. We want llvm-exegesis to explore instructions (effect of initial register values, effect of operand selection). To enable this a BenchmarkResult muststore all the relevant data in its key. This patch starts adding such data. Here we simply allow to store the generated instructions, following patches will add operands and initial values for registers. https://reviews.llvm.org/D47764 Authored by: Guilluame Chatelet llvm-svn: 334008	2018-06-05 10:56:19 +00:00
Simon Pilgrim	fef9b6eea6	[X86][SSE] Add target shuffle support to X86TargetLowering::computeKnownBitsForTargetNode Ideally we'd use resolveTargetShuffleInputs to handle faux shuffles as well but: (a) that code path doesn't handle general/pre-legalized ops/types very well. (b) I'm concerned about the compute time as they recurse to calls to computeKnownBits/ComputeNumSignBits which would need depth limiting somehow. llvm-svn: 334007	2018-06-05 10:52:29 +00:00
Gabor Buella	349ffcee87	[X86] NFC Refactor some code in InstPrinters Summary: Bringing some come duplicated in the AT&T and the Intel printers into a common parent class. Reviewers: craig.topper Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D47682 llvm-svn: 334005	2018-06-05 10:41:39 +00:00
Peter Smith	ef945b2240	[MC][ARM] Add range checking for Thumb2 resolved fixups. When the branch target of a Thumb2 unconditional or conditonal branch is resolved at assembly time, no range checking is performed on the result leading to incorrect immediates. This change adds a range check: +- 16 Megabytes for unconditional branches, +- 1 Megabyte for the conditional branch. Differential Revision: https://reviews.llvm.org/D46306 llvm-svn: 333997	2018-06-05 10:00:56 +00:00
Simon Pilgrim	7bbe7a2920	[X86][SSE] Add basic PACKUS support to X86TargetLowering::computeKnownBitsForTargetNode Helps improve analysis of saturation ops llvm-svn: 333995	2018-06-05 09:45:03 +00:00
Peter Smith	0aafe0cee5	[MC][ARM] Correct Thumb BL instruction range The Thumb BL range is + or - either 16 Megabytes or 4 Megabytes depending on whether the CPU supports Thumb2 or the v8-m baseline ops. The existing check for BL range is incorrectly set at +- 32 Megabytes. This change corrects the higher range and uses the lower range if the featurebits don't have the necessary support for it. Differential Revision: https://reviews.llvm.org/D46305 llvm-svn: 333991	2018-06-05 09:32:28 +00:00
Alexander Ivchenko	964b27fa21	[X86][CET] Shadow stack fix for setjmp/longjmp This is the new version of D46181, allowing setjmp/longjmp to work correctly with the Intel CET shadow stack by storing SSP on setjmp and fixing it on longjmp. The patch has been updated to use the cf-protection-return module flag instead of HasSHSTK, and the bug that caused D46181 to be reverted has been fixed with the test expanded to track that fix. patch by mike.dvoretsky Differential Revision: https://reviews.llvm.org/D47311 llvm-svn: 333990	2018-06-05 09:22:30 +00:00
Craig Topper	f17b33d6c6	[X86] Make all instructions that operate on MMX types, but were added after the initial MMX support via one of the SSE features flags make them require the MMX feature as well. Passing -mattr=-mmx needs to disable these instructions since the MMX register class won't have been set up. But we don't want -mattr=-mmx to disable SSE so we have to do it separately. llvm-svn: 333984	2018-06-05 06:20:06 +00:00
Nirav Dave	e5eb99668c	[RegAllocGreedy] Use simpler map class for EvicteeInfo. NFCI. RegAlloc keeps a insertion-time ordered map of evictee information, but we only use membership. Replace MapVector with contextually equivalent DenseMap which is smaller and faster. llvm-svn: 333981	2018-06-05 03:16:28 +00:00
Vedant Kumar	b6ed992de0	[opt] Introduce -strip-named-metadata This renames and generalizes -strip-module-flags to erase all named metadata from a module. This makes it easier to diff IR. llvm-svn: 333977	2018-06-05 00:56:08 +00:00
Vedant Kumar	800255f9f1	[Debugify] Don't insert debug values after terminating deopts As is the case with musttail calls, the IR does not allow for instructions inserted after a terminating deopt. llvm-svn: 333976	2018-06-05 00:56:07 +00:00
Vedant Kumar	ab112b8e99	Apply clang-format on a file, NFC llvm-svn: 333975	2018-06-05 00:56:07 +00:00
Francis Visoiu Mistrih	2c0ef67327	Use MF instead of Fn for MachineFunction references. NFC llvm-svn: 333973	2018-06-05 00:27:28 +00:00
Francis Visoiu Mistrih	ca69b3bf6d	[ShrinkWrap] Add optimization remarks to the shrink-wrapping pass Start by emitting remarks for very basic unsupported cases such as irreducible CFGs and EHFunclets. The end goal is to be able to cover all the cases where we give up with an explanation. llvm-svn: 333972	2018-06-05 00:27:24 +00:00
Amara Emerson	d496cc8ffb	[MIRParser] Add parser support for 'true' and 'false' i1s. We already output true and false in the printer, but the parser isn't able to read it. Differential Revision: https://reviews.llvm.org/D47424 llvm-svn: 333970	2018-06-05 00:17:13 +00:00
Reid Kleckner	adcaddb6da	Fix -Wcovered-switch-default warning and clang-format it llvm-svn: 333967	2018-06-04 23:47:29 +00:00
David Blaikie	10d25ffe7d	Move Compiler.h from Demangle back to Support Code review feedback from r328123 prefers copying the few feature test macros used by Demangle into there, rather than sinking the header into an odd corner like Demangle. llvm-svn: 333965	2018-06-04 22:53:38 +00:00
Derek Schuff	72f19241d6	Simplified WebAssemblyAsmBackend by removing explicit ELF variant. The ELF version was broken (does not deal with wasm specific fixups), and now is slightly less broken. It will be removed in its entirety in the future which this change makes slightly easier (just remove the IsELF bool). Differential Revision: https://reviews.llvm.org/D47745 Patch by Wouter van Oortmerssen llvm-svn: 333964	2018-06-04 22:53:36 +00:00
Sanjay Patel	dcb8d304c3	[InstCombine] refine UB-handling in shuffle-binop transform As noted in rL333782, we can be both better for optimization and safer with this transform: BinOp (shuffle V1, Mask), C --> shuffle (BinOp V1, NewC), Mask The only potentially unsafe-to-speculate binops are integer div/rem. All other binops are always safe (although I don't see a way to assert that in code here). For opcodes like shifts that can produce poison, it can't matter here because we know the lanes with undef are dropped by the subsequent shuffle. Differential Revision: https://reviews.llvm.org/D47686 llvm-svn: 333962	2018-06-04 22:26:45 +00:00
Amaury Sechet	800ac42573	Remove various use of undef in the X86 test suite as patern involving undef can collapse them. NFC llvm-svn: 333961	2018-06-04 22:09:26 +00:00
Amaury Sechet	e2729faf52	Revert "Regenerate expected test results for test/CodeGen/X86/pr23103.ll . NFC" This reverts commit cf25dfc503c861845947f3e6a9d308811ebb9da3. llvm-svn: 333960	2018-06-04 21:49:23 +00:00
Vedant Kumar	fb7c768a3b	[Debugify] Preserve analyses in -check-debugify The -check-debugify pass should preserve all analyses. Otherwise, it may invalidate an optional analysis and inadvertently alter codegen. The test case is reduced from deopt-bundle.ll. The result of `opt -O1` on this file would differ when -debugify-each was toggled. That happened because CheckDebugify failed to preserve GlobalsAA. Thanks to Davide Italiano for his help chasing this down! llvm-svn: 333959	2018-06-04 21:43:28 +00:00
David Blaikie	36df9d8514	Add missing header llvm-svn: 333957	2018-06-04 21:33:56 +00:00
David Blaikie	31b98d2e99	Move Analysis/Utils/Local.h back to Transforms Review feedback from r328165. Split out just the one function from the file that's used by Analysis. (As chandlerc pointed out, the original change only moved the header and not the implementation anyway - which was fine for the one function that was used (since it's a template/inlined in the header) but not in general) llvm-svn: 333954	2018-06-04 21:23:21 +00:00
Amaury Sechet	f5db3a15bf	Revert "Remove various use of undef in the X86 test suite as patern involving undef can collapse them. NFC" This reverts commit f0e85c194ae5e87476bc767304470dec85b6774f. llvm-svn: 333953	2018-06-04 21:20:45 +00:00
Jessica Paquette	aa087327ce	[MachineOutliner] NFC - Move intermediate data structures to MachineOutliner.h This is setting up to fix bug 37573 cleanly. This moves data structures that are technically both used in some way by the target and the general-purpose outlining algorithm into MachineOutliner.h. In particular, the `Candidate` class is of importance. Before, the outliner passed the locations of `Candidates` to the target, which would then make some decisions about the prospective outlined function. This change allows us to just pass `Candidates` along to the target. This will allow the target to discard `Candidates` that would be considered unsafe before cost calculation. Thus, we will be able to remove the unsafe candidates described in the bug without resorting to torching the entire prospective function. Also, as a side-effect, it makes the outliner a bit cleaner. https://bugs.llvm.org/show_bug.cgi?id=37573 llvm-svn: 333952	2018-06-04 21:14:16 +00:00
Alexander Ivchenko	2f038c4094	[X86][ELF][CET] Adding the .note.gnu.property ELF section in X86 In preparation for the proposed linker ABI changes (https://github.com/hjl-tools/linux-abi/wiki/linux-abi-draft.pdf, https://github.com/hjl-tools/x86-psABI/wiki/x86-64-psABI-cet.pdf), this patch enables emission of the .note.gnu.property section to ELF object files when building CET-enabled modules. patch by mike.dvoretsky Differential Revision: https://reviews.llvm.org/D47145 llvm-svn: 333951	2018-06-04 21:07:35 +00:00
Amaury Sechet	87f1a240ba	Remove various use of undef in the X86 test suite as patern involving undef can collapse them. NFC llvm-svn: 333950	2018-06-04 20:57:27 +00:00
Amaury Sechet	1910090328	Regenerate expected test results for test/CodeGen/X86/pr23103.ll . NFC llvm-svn: 333949	2018-06-04 20:47:00 +00:00
Scott Linder	ba81d7f1eb	[CodeGen] Always update divergence in SelectionDAG::UpdateNodeOperands Some overloads failed to update divergence. Differential Revision: https://reviews.llvm.org/D47148 llvm-svn: 333947	2018-06-04 20:19:45 +00:00
Zachary Turner	63db25ba0d	[Support] Add functions that operate on native file handles on Windows. Windows' CRT has a limit of 512 open file descriptors, and fds which are generated by converting a HANDLE via _get_osfhandle count towards this limit as well. Regardless, often you find yourself marshalling back and forth between native HANDLE objects and fds anyway. If we know from the getgo that we're going to need to work directly with the handle, we can cut out the marshalling layer while also not contributing to filling up the CRT's very limited handle table. On Unix these functions just delegate directly to the existing set of functions since an fd is the native file type. It would be nice, very long term, if we could convert most uses of fds to file_t. Differential Revision: https://reviews.llvm.org/D47688 llvm-svn: 333945	2018-06-04 19:38:11 +00:00
Amaury Sechet	da661e9236	[DAGcombine] Teach the combiner about -a = ~a + 1 Summary: This include variant for add, uaddo and addcarry. usubo and subcarry require the carry to be flipped to preserve semantic, but we chose to do the transform anyway in that case as to push the transform down the carry chain. Reviewers: efriedma, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46505 llvm-svn: 333943	2018-06-04 19:23:22 +00:00
Teresa Johnson	0cff935036	Fix for llvm-dis/llvm-bcanalyzer overflows Summary: These tools failed for a very large bitcode file produced by LTO due to 64-bit values being assigned to 32-bit types. For the BitstreamReader.h fix, the value initially fit into the 32-bit unsigned, but there was an overflow when multiplying by 32 furter below to compute the bit offset. No test case in the patch as this requires a huge bitcode file. Reviewers: pcc, george.karpenkov Subscribers: mehdi_amini, a.sidorin, llvm-commits Differential Revision: https://reviews.llvm.org/D47731 llvm-svn: 333942	2018-06-04 19:20:02 +00:00
Alexander Shaposhnikov	d7eaf27654	[llvm-strip] Add missing aliases for --strip-debug Add missing aliases for --strip-debug: -g, -S, -d. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47674 llvm-svn: 333940	2018-06-04 18:55:41 +00:00
Amaury Sechet	93a7d2aa3c	Get rid of SETCCE Summary: It has been deprecated in favor of SETCCCARRY for a year now and isn't used by any in tree backend. Reviewers: efriedma, craig.topper, dblaikie, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47685 llvm-svn: 333939	2018-06-04 18:36:22 +00:00
Dmitry Mikulin	4539487650	In thin and full LTO + CFI, direct function calls may go through jump table entries to reach the target. Since these calls don't require type checks, we can short-circuit them to their real targets, except in cases when they can be pre-empted. Differential Revision: https://reviews.llvm.org/D46326 llvm-svn: 333937	2018-06-04 18:18:12 +00:00
Craig Topper	1f956e2c5f	[X86] Don't pass ParitySrc array into isAddSubOrSubAddMask. Instead use a bool output parameter to get the real piece of info we care about. NFC The ParitySrc array is more of an implementation detail. A single bool to get the final parity is sufficient. llvm-svn: 333935	2018-06-04 17:58:45 +00:00
Stanislav Mekhanoshin	838c07c531	[AMDGPU] Small refactoring in the scheduler After last changes some code can be simplified. Differential Revision: https://reviews.llvm.org/D47661 llvm-svn: 333934	2018-06-04 17:57:40 +00:00
Stanislav Mekhanoshin	28624f94d5	[AMDGPU] Factored out common part of GCNRPTracker::reset() Differential Revision: https://reviews.llvm.org/D47664 llvm-svn: 333931	2018-06-04 17:21:54 +00:00
Sam Clegg	675a51750a	[MachO] Add out-of-bounds check to MachOObjectFile.cpp This is a followup to rL333496. Differential Revision: https://reviews.llvm.org/D47544 llvm-svn: 333929	2018-06-04 17:01:20 +00:00
Sam Clegg	537afe6f0e	[WebAssembly] Fix .td files after rL333900 Differential Revision: https://reviews.llvm.org/D47727 llvm-svn: 333928	2018-06-04 16:59:26 +00:00
John Brawn	c5a6392be3	[ValueTracking] Match select abs pattern when there's an sext involved When checking a select to see if it matches an abs, allow the true/false values to be a sign-extension of the comparison value instead of requiring that they're directly the comparison value, as all the comparison cares about is the sign of the value. This fixes a regression due to r333702, where we were no longer generating ctlz due to isKnownNonNegative failing to match such a pattern. Differential Revision: https://reviews.llvm.org/D47631 llvm-svn: 333927	2018-06-04 16:53:57 +00:00
Mark Searles	f0b93f1e9e	[AMDGPU][Waitcnt] Fix handling of flat instrs On GFX9 and earlier, flat memory ops may decrement VMCNT out-of-order as well as LGKMCNT out-of-order. Differential Revision: https://reviews.llvm.org/D46616 llvm-svn: 333926	2018-06-04 16:51:59 +00:00
Simon Pilgrim	7c000d4267	[X86] Only accept const SelectionDAG to resolveTargetShuffleInputs/getFauxShuffleMask These methods should only be using SelectionDAG for analysis (known/sign bits etc), not node creation. llvm-svn: 333925	2018-06-04 16:48:13 +00:00
Benjamin Kramer	f663eba561	[NVPTX] Delete dead code from the AsmPrinter. llvm-svn: 333924	2018-06-04 16:12:33 +00:00
Andrea Di Biagio	39e5a5695f	[RFC][patch 3/3] Add support for variant scheduling classes in llvm-mca. This patch is the last of a sequence of three patches related to LLVM-dev RFC "MC support for variant scheduling classes". http://lists.llvm.org/pipermail/llvm-dev/2018-May/123181.html This fixes PR36672. The main goal of this patch is to teach llvm-mca how to solve variant scheduling classes. This patch does that, plus it adds new variant scheduling classes to the BtVer2 scheduling model to identify so-called zero-idioms (i.e. so-called dependency breaking instructions that are known to generate zero, and that are optimized out in hardware at register renaming stage). Without the BtVer2 change, this patch would not have had any meaningful tests. This patch is effectively the union of two changes: 1) a change that teaches llvm-mca how to resolve variant scheduling classes. 2) a change to the BtVer2 scheduling model that allows us to special-case packed XOR zero-idioms (this partially fixes PR36671). Differential Revision: https://reviews.llvm.org/D47374 llvm-svn: 333909	2018-06-04 15:43:09 +00:00
Alexander Ivchenko	ab60a2823f	[llvm-readobj] Support GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property Resubmit of r333424. This version contains the fix for fails found by buildbots on some targets. This patch allows parsing GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property sections. These notes indicate that the object file is built to support Intel CET. patch by mike.dvoretsky Differential Revision: https://reviews.llvm.org/D47473 llvm-svn: 333908	2018-06-04 15:14:18 +00:00
Krzysztof Parzyszek	623eb54361	[SelectionDAG] Add missing closing parentheses in comments, NFC llvm-svn: 333907	2018-06-04 14:54:53 +00:00
Nicolai Haehnle	59198ed040	AMDGPU: Make various NamedOperands upper case Summary: Avoid name clashes with the corresponding bit fields in the instruction encoding. Change-Id: Id1644e703e976e78f7af93788d9f44cb48c3251f Reviewers: arsenm, rampitec, kzhuravl Subscribers: wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D47433 llvm-svn: 333905	2018-06-04 14:45:20 +00:00
Nicolai Haehnle	ab390f0c41	TableGen/DAGPatterns: Allow bit constants in addition to int constants Summary: Implicit casting is a simple quality of life improvement. Change-Id: I3d2b31b8b8f12cbb1e84f691e359fa713a9c4b42 Reviewers: tra, simon_tatham, craig.topper, MartinO, arsenm Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D47432 llvm-svn: 333904	2018-06-04 14:45:12 +00:00
Nicola Zaghen	771e3beea6	[ReleaseNotes] Formatting fixes. llvm-svn: 333902	2018-06-04 14:40:34 +00:00
Nicolai Haehnle	e2ef7560bc	TableGen: some LangRef doc fixes Summary: Change-Id: I1442e2daa09cab727a01d8c31893b50e644a5cd3 Reviewers: tra, simon_tatham, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47530 Change-Id: I397655dd18b7ff978c1affa3174740d9c1a82594 llvm-svn: 333901	2018-06-04 14:26:12 +00:00
Nicolai Haehnle	01d261f18d	TableGen: Streamline the semantics of NAME Summary: The new rules are straightforward. The main rules to keep in mind are: 1. NAME is an implicit template argument of class and multiclass, and will be substituted by the name of the instantiating def/defm. 2. The name of a def/defm in a multiclass must contain a reference to NAME. If such a reference is not present, it is automatically prepended. And for some additional subtleties, consider these: 3. defm with no name generates a unique name but has no special behavior otherwise. 4. def with no name generates an anonymous record, whose name is unique but undefined. In particular, the name won't contain a reference to NAME. Keeping rules 1&2 in mind should allow a predictable behavior of name resolution that is simple to follow. The old "rules" were rather surprising: sometimes (but not always), NAME would correspond to the name of the toplevel defm. They were also plain bonkers when you pushed them to their limits, as the old version of the TableGen test case shows. Having NAME correspond to the name of the toplevel defm introduces "spooky action at a distance" and breaks composability: refactoring the upper layers of a hierarchy of nested multiclass instantiations can cause unexpected breakage by changing the value of NAME at a lower level of the hierarchy. The new rules don't suffer from this problem. Some existing .td files have to be adjusted because they ended up depending on the details of the old implementation. Change-Id: I694095231565b30f563e6fd0417b41ee01a12589 Reviewers: tra, simon_tatham, craig.topper, MartinO, arsenm, javed.absar Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D47430 llvm-svn: 333900	2018-06-04 14:26:05 +00:00
Nicola Zaghen	9438b15946	[ReleaseNotes] Add release note for the new LLVM_DEBUG macro. This is to provide a way to migrate from the old DEBUG macro to the new one. Differential Revision: https://reviews.llvm.org/D47528 llvm-svn: 333898	2018-06-04 13:55:09 +00:00
Simon Dardis	fb4dde1142	[mips] Restore the availablity of trap for microMIPS Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47584 llvm-svn: 333895	2018-06-04 12:50:32 +00:00
Greg Bedwell	96f51f09d4	[llvm-mca][UpdateTestChecks] Prevent an IndexError being raised when given empty input llvm-svn: 333894	2018-06-04 12:30:10 +00:00
Greg Bedwell	bbe64af0a0	[llvm-mca] Regenerate a test to remove a double newline Command used: py update_mca_test_checks.py ..\test\tools\llvm-mca\\.s ..\test\tools\llvm-mca\\\*.s llvm-svn: 333893	2018-06-04 12:30:03 +00:00
Andrea Di Biagio	2008c7c8fd	[llvm-mca] Track cycles contributed by resources that are in a 'Super' relationship. This is required if we want to correctly match the behavior of method SubtargetEmitter::ExpandProcResource() in Tablegen. When computing the set of "consumed" processor resources and resource cycles, the logic in ExpandProcResource() doesn't update the number of resource cycles contributed by a "Super" resource to a group. We need to take this into account when a model declares a processor resource which is part of a 'processor resource group', and it is also used as the "Super" of other resources. llvm-svn: 333892	2018-06-04 12:23:07 +00:00
Roman Lebedev	7b53d1454f	[llvm-mca] Make sure not to end the test files with an empty line. Summary: It's super irritating. [properly configured] git client then complains about that double-newline, and you have to use `--force` to ignore the warning, since even if you fix it manually, it will be reintroduced the very next runtime :/ Reviewers: RKSimon, andreadb, courbet, craig.topper, javed.absar, gbedwell Reviewed By: gbedwell Subscribers: javed.absar, tschuett, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47697 llvm-svn: 333887	2018-06-04 11:48:46 +00:00
Clement Courbet	2cb97b95a2	[llvm-exegesis][NFC] Use an enum instead of a string for benchmark mode. Summary: YAML encoding is backwards-compatible. Reviewers: gchatelet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D47705 llvm-svn: 333886	2018-06-04 11:43:40 +00:00
Clement Courbet	7228721b30	[llvm-exegesis] Analysis: Show inconsistencies between checked-in and measured data. Summary: We now highlight any sched classes whose measurements do not match the LLVM SchedModel. "bad" clusters are marked in red. Screenshot in phabricator diff. Reviewers: gchatelet Subscribers: tschuett, mgrang, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D47639 llvm-svn: 333884	2018-06-04 11:11:55 +00:00
Luke Geeson	43e4367961	[AArch64] Audit on rL333634 to fix FP16 Disasm BitPatterns llvm-svn: 333879	2018-06-04 09:41:32 +00:00
Sander de Smalen	d0a6f6a502	[AArch64][SVE] Fix range for DUP immediates (16bit elts) For immediates used in DUP instructions that have the range -128 to 127, or a multiple of 256 in the range -32768 to 32512, one could argue that when the result element size is 16bits (.h), the value can be considered both signed and unsigned. Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47619 llvm-svn: 333873	2018-06-04 07:24:23 +00:00
Sander de Smalen	fd54a781f6	[AArch64][SVE] Asm: Print indexed element 0 as FPR. Print the first indexed element as a FP register, for example: mov z0.d, z1.d[0] Is now printed as: mov z0.d, d1 Next to printing, this patch also adds aliases to parse 'mov z0.d, d1'. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47571 llvm-svn: 333872	2018-06-04 07:07:35 +00:00
Sander de Smalen	c33d668ab7	[AArch64][SVE] Asm: Support for indexed DUP instructions. Unpredicated copy of indexed SVE element to SVE vector, along with MOV-aliases. For example: dup z0.h, z1.h[0] duplicates the first 16-bit element from z1 to all elements in the result vector z0. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47570 llvm-svn: 333871	2018-06-04 06:40:55 +00:00
Sander de Smalen	367a53b059	[AArch64][SVE] Asm: Support for FCPY immediate instructions. Predicated copy of floating-point immediate value to SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: javed.absar Differential Revision: https://reviews.llvm.org/D47518 llvm-svn: 333869	2018-06-04 05:58:06 +00:00
Sander de Smalen	512d57f1a5	[AArch64][SVE] Asm: Support for CPY immediate instructions Predicated copy of possibly shifted immediate value into SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47517 llvm-svn: 333868	2018-06-04 05:40:46 +00:00

1 2 3 4 5 ...

165140 Commits