llvm-project

Commit Graph

Author	SHA1	Message	Date
Kevin P. Neal	472e5dda11	Teach the IRBuilder about fadd and friends. The IRBuilder has calls to create floating point instructions like fadd. It does not have calls to create constrained versions of them. This patch adds support for constrained creation of fadd, fsub, fmul, fdiv, and frem. Reviewed by: John McCall, Sanjay Patel Approved by: John McCall Differential Revision: https://reviews.llvm.org/D53157 llvm-svn: 365339	2019-07-08 16:18:18 +00:00
Brian Homerding	b4b21d807e	Add, and infer, a nofree function attribute This patch adds a function attribute, nofree, to indicate that a function does not, directly or indirectly, call a memory-deallocation function (e.g., free, C++'s operator delete). Reviewers: jdoerfert Differential Revision: https://reviews.llvm.org/D49165 llvm-svn: 365336	2019-07-08 15:57:56 +00:00
James Henderson	b52a0c0cc8	[docs][llvm-readobj][llvm-readelf] Improve wording llvm-svn: 365335	2019-07-08 15:46:26 +00:00
Sanjay Patel	320a28200f	[InstCombine] fix typo in test; NFC I added this test in rL365325, but didn't mean to create an undef insert. llvm-svn: 365333	2019-07-08 15:38:03 +00:00
Jinsong Ji	41a414cdaa	[PowerPC][NFC]Update testcases using script. llvm-svn: 365330	2019-07-08 15:24:32 +00:00
Simon Pilgrim	e1a9b49d6b	[X86] ISD::INSERT_SUBVECTOR - use uint64_t index. NFCI. Keep the uint64_t type from getConstantOperandVal to stop truncation/extension overflow warnings in MSVC in subvector index math. llvm-svn: 365328	2019-07-08 14:52:56 +00:00
Alex Bradbury	c5f552d705	[Triple] Add isRISCV function This matches isARM, isThumb, isAArch64 and similar helpers. Future commits will clean up code that currently checks for Triple::riscv32 || Triple::riscv64. Differential Revision: https://reviews.llvm.org/D54215 Patch by Simon Cook. Test case added by Alex Bradbury. llvm-svn: 365327	2019-07-08 14:52:36 +00:00
Sanjay Patel	74cbaa37b6	[InstCombine] add tests for splat shuffles; NFC llvm-svn: 365325	2019-07-08 14:49:21 +00:00
Cameron McInally	771769be90	[Float2Int] Add support for unary FNeg to Float2Int Differential Revision: https://reviews.llvm.org/D63941 llvm-svn: 365324	2019-07-08 14:46:07 +00:00
Petar Avramovic	aa699b20a0	[MIPS GlobalISel] Register bank select for G_LOAD. Select i64 load Select gprb or fprb when loaded value is used by either: copy to physical register or instruction with only one mapping available for that use operand. Load of integer s64 is handled with narrowScalar when mapping is applied, produced artifacts are combined away. Manually set gprb to all register operands of instructions created during narrowScalar. Differential Revision: https://reviews.llvm.org/D64269 llvm-svn: 365323	2019-07-08 14:45:52 +00:00
Petar Avramovic	ec575f6e3e	[MIPS GlobalISel] Register bank select for G_STORE. Select i64 store Select gprb or fprb when stored value is defined by either: copy from physical register or instruction with only one mapping available for that def operand. Store of integer s64 is handled with narrowScalar when mapping is applied, produced artifacts are combined away. Manually set gprb to all register operands of instructions created during narrowScalar. Differential Revision: https://reviews.llvm.org/D64268 llvm-svn: 365322	2019-07-08 14:36:36 +00:00
Dmitry Preobrazhensky	2eff0318c6	[AMDGPU][MC] Corrected parsing of FLAT offset modifier Summary of changes: - simplified handling of FLAT offset: offset_s13 and offset_u12 have been replaced with flat_offset; - provided information about error position for pre-gfx9 targets; - improved errors handling. Reviewers: artem.tamazov, arsenm, rampitec Differential Revision: https://reviews.llvm.org/D64244 llvm-svn: 365321	2019-07-08 14:27:37 +00:00
Matt Arsenault	bd791b57f8	GlobalISel: widenScalar for G_BUILD_VECTOR llvm-svn: 365320	2019-07-08 13:48:06 +00:00
Matt Arsenault	214f67ece5	GlobalISel: Check address space when looking up iPTR size Fixes AMDGPU patterns for 32-bit address spaces always failing. Tests will be included in future patches when additional issues are solved. llvm-svn: 365319	2019-07-08 13:48:04 +00:00
Owen Reynolds	1159fe3f6e	Reapply [llvm-ar][test] Increase llvm-ar test coverage This change adds tests to cover existing llvm-ar functionality. Differential Revision: https://reviews.llvm.org/D63935 llvm-svn: 365318	2019-07-08 13:41:52 +00:00
Owen Reynolds	0c5fe7a440	Revert [llvm-ar][test] Increase llvm-ar test coverage Reverted due to test failures Differential Revision: https://reviews.llvm.org/D63935 llvm-svn: 365317	2019-07-08 13:08:25 +00:00
Owen Reynolds	870d9c61b0	[llvm-ar][test] Increase llvm-ar test coverage This change adds tests to cover existing llvm-ar functionality. Differential Revision: https://reviews.llvm.org/D63935 llvm-svn: 365316	2019-07-08 12:54:24 +00:00
Joel E. Denny	1602058c04	[lit] Parse command-line options from LIT_OPTS Similar to `FILECHECK_OPTS` for FileCheck, `LIT_OPTS` makes it easy to adjust lit behavior when running the test suite via ninja. For example: ``` $ LIT_OPTS='--time-tests -vv --filter=threadprivate' \ ninja check-clang-openmp ``` Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D64135 llvm-svn: 365313	2019-07-08 12:18:40 +00:00
James Henderson	8cf99a1170	[docs][llvm-objcopy] Add description of binary input/output to doc We briefly referred to being able to specify --target=binary without explaining what binary input/output meant. This change adds a section on this. Reviewed by: MaskRay, abrachet Differential Revision: https://reviews.llvm.org/D64245 llvm-svn: 365312	2019-07-08 11:41:54 +00:00
Simon Pilgrim	9285bf0fb9	[TargetLowering] SimplifyDemandedBits - just call computeKnownBits for BUILD_VECTOR cases. Don't do this locally, computeKnownBits does this better (and can handle non-constant cases as well). A next step would be to actually simplify non-constant elements - building on what we already do in SimplifyDemandedVectorElts. llvm-svn: 365309	2019-07-08 11:00:39 +00:00
Mikhail Maltsev	ee81051fc9	[ARM] Relax constraints on operands of VQxDMLxDH instructions Summary: According to a recently updated Armv8-M spec (https://static.docs.arm.com/ddi0553/bh/DDI0553B_h_armv8m_arm.pdf) the 32-bit width versions of the following instructions: * VQDMLADH * VQDMLADHX * VQRDMLADH * VQRDMLADHX * VQDMLSDH * VQDMLSDHX * VQRDMLSDH * VQRDMLSDHX are no longer unpredictable when their output register is the same as one of the input registers. This patch updates the assembler parser and the corresponding tests and also removes @earlyclobber from the instruction constraints. Reviewers: simon_tatham, ostannard, dmgreen, SjoerdMeijer, samparker Reviewed By: simon_tatham Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64250 llvm-svn: 365306	2019-07-08 09:44:52 +00:00
Alex Bradbury	0b9addb8c0	[RISCV] Specify registers used in DWARF exception handling Defines RISCV registers for getExceptionPointerRegister() and getExceptionSelectorRegister(). Differential Revision: https://reviews.llvm.org/D63411 Patch by Edward Jones. Modified by Alex Bradbury to add CHECK lines to exception-pointer-register.ll. llvm-svn: 365301	2019-07-08 09:16:47 +00:00
Fangrui Song	7d63be09b6	[ARM] Fix null pointer dereference in CodeGen/ARM/Windows/stack-protector-msvc.ll.test after D64292/r365283 CLI.CS may not be set. llvm-svn: 365299	2019-07-08 08:43:31 +00:00
Alex Bradbury	1606a86464	[UpdateTestChecks] Skip over .Lfunc_begin for RISC-V This mirrors the change made for X86 in rL336987. Without this patch, update_llc_test_checks will completely skip functions with personality functions. llvm-svn: 365297	2019-07-08 08:34:16 +00:00
Jay Foad	38902350ef	[AMDGPU] Use a named predicate instead of a magic number. Reviewers: arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64201 llvm-svn: 365294	2019-07-08 07:04:58 +00:00
Craig Topper	1deca50ab1	[X86] Allow execution domain fixing to turn SHUFPD into SHUFPS. This can help with code size on SSE targets where SHUFPD requires a 0x66 prefix and SHUFPS doesn't. llvm-svn: 365293	2019-07-08 06:52:49 +00:00
Craig Topper	d8261f0288	[X86] Make movsd commutable to shufpd with a 0x02 immediate on pre-SSE4.1 targets. This can help avoid a copy or enable load folding. On SSE4.1 targets we can commute it to blendi instead. I had to make shufpd with a 0x02 immediate commutable as well since we expect commuting to be reversible. llvm-svn: 365292	2019-07-08 06:52:43 +00:00
Alex Bradbury	e1e036a33b	[RISCV] Support z and i operand modifiers Differential Revision: https://reviews.llvm.org/D57792 Patch by James Clarke. llvm-svn: 365291	2019-07-08 05:00:26 +00:00
Craig Topper	46f2b583a2	[X86] Add MOVSDrr->MOVLPDrm entry to load folding table. Add custom handling to turn UNPCKLPDrr->MOVHPDrm when load is under aligned. If the load is aligned we can turn UNPCKLPDrr into UNPCKLPDrm. llvm-svn: 365287	2019-07-08 02:10:20 +00:00
Francis Visoiu Mistrih	4cdb68ebbd	[llvm-bcanalyzer] Refactor and move to libLLVMBitReader This allows us to use the analyzer from unit tests. * Refactor the interface to use proper error handling for most functions after JF's work. * Move everything into a BitstreamAnalyzer class. * Move that to Bitcode/BitcodeAnalyzer.h. Differential Revision: https://reviews.llvm.org/D64116 llvm-svn: 365286	2019-07-08 02:06:34 +00:00
Kang Zhang	638b1a82d8	[NFC][PowerPC] Add the test add_cmp.ll llvm-svn: 365285	2019-07-08 01:51:29 +00:00
Petr Hosek	e28fca29fe	Revert "[IRBuilder] Fold consistently for or/and whether constant is LHS or RHS" This reverts commit r365260 which broke the following tests: Clang :: CodeGenCXX/cfi-mfcall.cpp Clang :: CodeGenObjC/ubsan-nullability.m LLVM :: Transforms/LoopVectorize/AArch64/pr36032.ll llvm-svn: 365284	2019-07-07 22:12:01 +00:00
Martin Storsjo	8d9d290d4c	[ARM] Add support for MSVC stack cookie checking Heavily based on the same for AArch64, from SVN r346469. Differential Revision: https://reviews.llvm.org/D64292 llvm-svn: 365283	2019-07-07 18:57:31 +00:00
Nico Weber	8721001558	gn build: Merge r355546 Found by inspection; the sync script doesn't sync .ipp files. llvm-svn: 365280	2019-07-07 16:49:44 +00:00
Nico Weber	18d138e6ac	gn build: Merge r365273 llvm-svn: 365279	2019-07-07 16:44:07 +00:00
Nico Weber	7e06362061	gn build: Sort sync script output llvm-svn: 365278	2019-07-07 16:40:29 +00:00
Nico Weber	d22fd0e4ec	gn build: Sort cxx_sources in libcxx build file Since these are in a custom list, `gn format` doesn't automatically sort them. Now their order matches the CMake build. No additions or deletions, this just sorts the files that are there. llvm-svn: 365277	2019-07-07 16:36:11 +00:00
Nico Weber	3e9ef48440	gn build: Merge r365258 and follow-ups r365263, r365264 llvm-svn: 365276	2019-07-07 15:43:32 +00:00
Craig Topper	ac744d5a86	[X86] Make sure load isn't volatile before shrinking it in MOVDDUP isel patterns. llvm-svn: 365275	2019-07-07 05:33:20 +00:00
David Majnemer	617df204b5	[CodeGen] Add larger vector types for i32 and f32 Some out-of-tree backends require larger vector types. Since maintaining the changes out of tree is difficult due to the many manual changes needed when adding a new type, we are adding them even if no backend currently uses them. Differential Revision: https://reviews.llvm.org/D64141 Patch by Thomas Raoux! llvm-svn: 365274	2019-07-07 04:47:37 +00:00
Simon Pilgrim	a7145c45a7	[X86] SimplifyDemandedVectorEltsForTargetNode - fix shadow variable warning. NFCI. Fixes cppcheck warning. llvm-svn: 365271	2019-07-06 18:46:09 +00:00
Simon Pilgrim	01f1bad618	[X86] LowerBuildVectorv16i8 - pull out repeated getOperand() call. NFCI. llvm-svn: 365270	2019-07-06 18:33:29 +00:00
Simon Pilgrim	9c68aa33e3	[DAGCombine] convertBuildVecZextToZext - remove duplicate getOpcode() call. NFCI. llvm-svn: 365269	2019-07-06 18:32:15 +00:00
Craig Topper	e753247b06	[X86] Add PS<->PD domain changing support for MOVH/MOVL load instructions and MOVH store instructions. These instructions don't have an integer domain equivalent, but we can at least change between the two floating point domains. This allows a smaller encoding on SSE targets if we can turn PD into PS. llvm-svn: 365268	2019-07-06 17:59:57 +00:00
Craig Topper	317d6093df	[X86] Remove patterns from MOVLPSmr and MOVHPSmr instructions. These patterns are the same as the MOVLPDmr and MOVHPDmr patterns, but with a bitcast at the end. We can just select the PD instruction and let execution domain fixing switch to PS. llvm-svn: 365267	2019-07-06 17:59:51 +00:00
Craig Topper	913105ca42	[X86] Add patterns to select MOVLPDrm from MOVSD+load and MOVHPD from UNPCKL+load. These narrow the load so we can only do it if the load isn't volatile. There are also tests in vector-shuffle-128-v4.ll that this should support, but we don't seem to fold bitcast+load on pre-sse4.2 targets due to the slow unaligned mem 16 flag. llvm-svn: 365266	2019-07-06 17:59:45 +00:00
Craig Topper	8c036bf784	[X86] Copy some test cases from vector-shuffle-sse1.ll to vector-shuffle-128-v4.ll and v8 where sse1 did better load folding. NFC llvm-svn: 365265	2019-07-06 17:59:41 +00:00
Nikita Popov	a01502f1ba	[LFTR] Regenerate test checks; NFC llvm-svn: 365262	2019-07-06 08:54:15 +00:00
Philip Reames	9812668d77	[IRBuilder] Fold consistently for or/and whether constant is LHS or RHS Without this, we have the unfortunate property that tests are dependent on the order of operands passed to the CreateOr and CreateAnd functions. In actual usage, we'd promptly optimize them away, but it made tests slightly more verbose than they should have been. llvm-svn: 365260	2019-07-06 04:28:00 +00:00
Philip Reames	9e62c86408	[IRBuilder] Introduce helpers for and/or of multiple values at once We had versions of this code scattered around, so consolidate into one location. Not strictly NFC since the order of intermediate results may change in some places, but since these operations are associative, should not change results. llvm-svn: 365259	2019-07-06 03:46:18 +00:00
Quentin Colombet	0ffe0db6fa	[RegisterCoalescer] Fix an overzealous assert Although removeCopyByCommutingDef deals with full copies, it is still possible to copy undef lanes and thus, we wouldn't have any value number for these lanes. This fixes PR40215. llvm-svn: 365256	2019-07-06 00:34:54 +00:00
Matt Arsenault	705e46f449	RegUsageInfoCollector: Skip AMDGPU entry point functions I'm not sure if it's worth it or not to add a hook to disable the pass for an arbitrary function. This pass is taking up to 5% of compile time in tiny programs by iterating through all of the physical registers in every register class. This pass should be rewritten in terms of regunits. For now, skip doing anything for entry point functions. The vast majority of functions in the real world aren't callable, so just not running this will give the majority of the benefit. llvm-svn: 365255	2019-07-05 23:33:43 +00:00
Michael Liao	88b0d20edf	Revert "[FileCheck] Simplify numeric variable interface" This reverts commit `096600a4b0`. llvm-svn: 365251	2019-07-05 22:23:27 +00:00
Thomas Preud'homme	096600a4b0	[FileCheck] Simplify numeric variable interface Summary: This patch simplifies 2 aspects in the FileCheckNumericVariable code. First, setValue() method is turned into a void function since being called only on undefined variable is an invariant and is now asserted rather than returned. This removes the assert from the callers. Second, clearValue() method is also turned into a void function since the only caller does not check its return value since it may be trying to clear the value of a variable that is already cleared without this being noteworthy. Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: JonChesterfield, rogfer01, hfinkel, kristina, rnk, tra, arichardson, grimar, dblaikie, probinson, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D64231 llvm-svn: 365249	2019-07-05 21:49:59 +00:00
Matt Arsenault	5e9610a3f5	AMDGPU: Fix assert in clang test llvm-svn: 365245	2019-07-05 21:09:53 +00:00
Nikita Popov	a2a09cb606	[SystemZ] Fix addcarry of usubo (PR42512) Only custom lower uaddo+addcarry or usubo+subcarry chains and leave mixtures like usubo+addcarry or uaddo+subcarry to the generic legalizer. Otherwise we run into issues because SystemZ uses different CC values for carries and borrows. Fixes https://bugs.llvm.org/show_bug.cgi?id=42512. Differential Revision: https://reviews.llvm.org/D64213 llvm-svn: 365242	2019-07-05 20:35:11 +00:00
Matt Arsenault	e7e23e3e91	AMDGPU: Make AMDGPUPerfHintAnalysis an SCC pass Add a string attribute instead of directly setting MachineFunctionInfo. This avoids trying to get the analysis in the MachineFunctionInfo in a way that doesn't work with the new pass manager. This will also avoid re-visiting the call graph for every single function. llvm-svn: 365241	2019-07-05 20:26:13 +00:00
Michael Liao	8d6ea2d48c	[CodeGen] Enhance `MachineInstrSpan` to allow the end of MBB to be used. Summary: - Explicitly specify the parent MBB to allow the end iterator to be used. Reviewers: aprantl, MatzeB, craig.topper, qcolombet Subscribers: arsenm, jvesely, nhaehnle, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64261 llvm-svn: 365240	2019-07-05 20:23:59 +00:00
Benjamin Kramer	05eebaa949	[PowerPC] Fold another unused variable into assertion. NFC. llvm-svn: 365237	2019-07-05 19:58:39 +00:00
Benjamin Kramer	31f6b13e83	[PowerPC] Fold variable into assert. NFC. Avoids a warning in Release builds. llvm-svn: 365236	2019-07-05 19:46:48 +00:00
Benjamin Kramer	049230b4d2	[PowerPC] Remove unused variable. NFC. llvm-svn: 365235	2019-07-05 19:28:02 +00:00
Craig Topper	d22b2d01ca	[X86] Correct the size check in foldMemoryOperandCustom. The Size either needs to be 0 meaning we aren't folding a stack reload. Or the stack slot needs to be at least 16 bytes. I've also added a paranoia check to ensure the RCSize is at least 16 bytes as well. This avoids any FR32/FR64 surprises, but I think we already filtered those earlier. All of our test cases have Size as either 0 or 16 and RCSize == 16. So the Size <= 16 check worked for those cases. llvm-svn: 365234	2019-07-05 18:54:00 +00:00
Nemanja Ivanovic	6c9a392c8e	[PowerPC] Move TOC save to prologue when profitable The indirect call sequence on PPC requires that the TOC base register be saved prior to the indirect call and restored after the call since the indirect call may branch to a global entry point in another DSO which will update the TOC base. Over the last couple of years, we have improved this to: - be able to hoist TOC saves from loops (with changes to MachineLICM) - avoid multiple saves when one dominates the other[s] However, it is still possible to have multiple TOC saves dynamically in the execution path if there is no dominance relationship between them. This patch moves the TOC save to the prologue when one of the TOC saves is in a block that post-dominates entry (i.e. it cannot be avoided) or if it is in a block that is hotter than entry. Differential revision: https://reviews.llvm.org/D63803 llvm-svn: 365232	2019-07-05 18:38:09 +00:00
Jonas Devlieghere	7626e1e504	Add lldb-mi deprecation to the release notes Differential revision: https://reviews.llvm.org/D64254 llvm-svn: 365231	2019-07-05 18:23:52 +00:00
Jonas Devlieghere	bb65a38b56	Add LLDB section to the release notes llvm-svn: 365228	2019-07-05 17:58:30 +00:00
Sanjay Patel	f3481b8c9a	[InferFunctionAttrs] add tests for 'dereferenceable' argument attribute; NFC llvm-svn: 365227	2019-07-05 17:49:53 +00:00
Craig Topper	6e6d229e5e	[X86] Update SSE1 MOVLPSrm and MOVHPSrm isel patterns to ensure loads are non-volatile before folding. These patterns use 128-bit loads, but the instructions only load 64-bits. We shouldn't narrow the load if it's volatile. Fixes another variant of PR42079 llvm-svn: 365225	2019-07-05 17:31:29 +00:00
Craig Topper	8a93952a5c	[X86] Remove unnecessary isel pattern for MOVLPSmr. This was identical to a pattern for MOVPQI2QImr with a bitcast as an input. But we should be able to turn MOVPQI2QImr into MOVLPSmr in the execution domain fixup pass so we shouldn't need this. llvm-svn: 365224	2019-07-05 17:31:25 +00:00
Christudasan Devadasan	652ad423bb	[NFC] A test commit to check the access permission. Removed a blank line. llvm-svn: 365223	2019-07-05 17:07:42 +00:00
James Henderson	9e38f510b3	[docs][llvm-readobj] Add a note to options that do nothing in GNU output --section-data, --section-relocations and --section-symbols have no effect for GNU style output. This patch changes the docs to point this out, as it has caught me out on a couple of occasions. See also https://bugs.llvm.org/show_bug.cgi?id=42522. llvm-svn: 365221	2019-07-05 16:38:52 +00:00
Thomas Preud'homme	56f6308b2d	[FileCheck] Share variable instance among uses Summary: This patch changes expression support to use one instance of FileCheckNumericVariable per numeric variable rather than one per variable and per definition. The current system was only necessary for the last patch of the numeric expression support patch series in order to handle a line using a variable defined earlier on the same line from the input text. However this can be dealt more efficiently. Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: JonChesterfield, rogfer01, hfinkel, kristina, rnk, tra, arichardson, grimar, dblaikie, probinson, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D64229 llvm-svn: 365220	2019-07-05 16:25:46 +00:00
Thomas Preud'homme	fe7ac170a7	[FileCheck] Don't diagnose undef vars at parse time Summary: Diagnosing use of undefined variables takes place in parseNumericVariableUse() and printSubstitutions() for numeric variables but only takes place in printSubstitutions() for string variables. The reason for the split location of diagnostics is that parsing is not aware of the clearing of variables due to --enable-var-scope and thus use of variables cleared in this way can only be caught by printSubstitutions(). Beyond the code level inconsistency, there is also a user facing inconsistency since diagnostics look different between the two functions. While the diagnostic in printSubstitutions is more verbose, doing the diagnostic there allows diagnosing all undefined variables rather than just the first one and erroring out. This patch creates a dummy variable definition when encountering a use of an undefined variable so that parsing can proceed and the use can be diagnosed by printSubstitutions() later. Tests that were testing whether parsing fails in such cases are thus modified accordingly. Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: JonChesterfield, rogfer01, hfinkel, kristina, rnk, tra, arichardson, grimar, dblaikie, probinson, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D64228 llvm-svn: 365219	2019-07-05 16:25:33 +00:00
Yaxun Liu	a62413526d	[AMDGPU] Added a new metadata for multi grid sync implicit argument Patch by Christudasan Devadasan. Differential Revision: https://reviews.llvm.org/D63886 llvm-svn: 365217	2019-07-05 16:05:17 +00:00
Matt Arsenault	27a6985d90	ScheduleDAG: Fix incorrectly killing registers in bundles When looking for uses/defs to add kill flags, the iterator was double incremented, skipping the first instruction in the bundle. The use register in the first bundle instruction was then incorrectly killed. The "First" instruction should be the BUNDLE itself as the proper reverse iterator endpoint. llvm-svn: 365216	2019-07-05 15:32:28 +00:00
Eugene Leviant	3aef35288b	[ThinLTO] Attempt to recommit r365188 after alignment fix llvm-svn: 365215	2019-07-05 15:25:05 +00:00
David Green	47afdaa487	[ARM] MVE patterns for VMVN, VORR and VBIC This adds simple Q register forms of bitwise not instructions. Differential Revision: https://reviews.llvm.org/D63983 llvm-svn: 365214	2019-07-05 15:21:29 +00:00
Nico Weber	df173bf9bc	gn build: Merge r365203 llvm-svn: 365213	2019-07-05 15:14:06 +00:00
Jay Foad	7e0c10b55f	[AMDGPU] DPP combiner: recognize identities for more opcodes Summary: This allows the DPP combiner to kick in more often. For example the exclusive scan generated by the atomic optimizer for a divergent atomic add used to look like this: v_mov_b32_e32 v3, v1 v_mov_b32_e32 v5, v1 v_mov_b32_e32 v6, v1 v_mov_b32_dpp v3, v2 wave_shr:1 row_mask:0xf bank_mask:0xf s_nop 1 v_add_u32_dpp v4, v3, v3 row_shr:1 row_mask:0xf bank_mask:0xf bound_ctrl:0 v_mov_b32_dpp v5, v3 row_shr:2 row_mask:0xf bank_mask:0xf v_mov_b32_dpp v6, v3 row_shr:3 row_mask:0xf bank_mask:0xf v_add3_u32 v3, v4, v5, v6 v_mov_b32_e32 v4, v1 s_nop 1 v_mov_b32_dpp v4, v3 row_shr:4 row_mask:0xf bank_mask:0xe v_add_u32_e32 v3, v3, v4 v_mov_b32_e32 v4, v1 s_nop 1 v_mov_b32_dpp v4, v3 row_shr:8 row_mask:0xf bank_mask:0xc v_add_u32_e32 v3, v3, v4 v_mov_b32_e32 v4, v1 s_nop 1 v_mov_b32_dpp v4, v3 row_bcast:15 row_mask:0xa bank_mask:0xf v_add_u32_e32 v3, v3, v4 s_nop 1 v_mov_b32_dpp v1, v3 row_bcast:31 row_mask:0xc bank_mask:0xf v_add_u32_e32 v1, v3, v1 v_add_u32_e32 v1, v2, v1 v_readlane_b32 s0, v1, 63 But now most of the dpp movs are combined into adds: v_mov_b32_e32 v3, v1 v_mov_b32_e32 v5, v1 s_nop 0 v_mov_b32_dpp v3, v2 wave_shr:1 row_mask:0xf bank_mask:0xf s_nop 1 v_add_u32_dpp v4, v3, v3 row_shr:1 row_mask:0xf bank_mask:0xf bound_ctrl:0 v_mov_b32_dpp v5, v3 row_shr:2 row_mask:0xf bank_mask:0xf v_mov_b32_dpp v1, v3 row_shr:3 row_mask:0xf bank_mask:0xf v_add3_u32 v1, v4, v5, v1 s_nop 1 v_add_u32_dpp v1, v1, v1 row_shr:4 row_mask:0xf bank_mask:0xe s_nop 1 v_add_u32_dpp v1, v1, v1 row_shr:8 row_mask:0xf bank_mask:0xc s_nop 1 v_add_u32_dpp v1, v1, v1 row_bcast:15 row_mask:0xa bank_mask:0xf s_nop 1 v_add_u32_dpp v1, v1, v1 row_bcast:31 row_mask:0xc bank_mask:0xf v_add_u32_e32 v1, v2, v1 v_readlane_b32 s0, v1, 63 Reviewers: arsenm, vpykhtin Subscribers: kzhuravl, nemanjai, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kbarton, MaskRay, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64207 llvm-svn: 365211	2019-07-05 14:52:48 +00:00
Eugene Leviant	e91f86f0ac	Reverted r365188 due to alignment problems on i686-android llvm-svn: 365206	2019-07-05 13:26:05 +00:00
Graham Hunter	957c40db6a	Scalable Vector IR Type with further LTO fixes Reintroduces the scalable vector IR type from D32530, after it was reverted a couple of times due to increasing chromium LTO build times. This latest incarnation removes the walk over aggregate types from the verifier entirely, in favor of rejecting scalable vectors in the isValidElementType methods in ArrayType and StructType. This removes the 70% degradation observed with the second repro tarball from PR42210. Reviewers: thakis, hans, rengolin, sdesmalen Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D64079 llvm-svn: 365203	2019-07-05 12:48:16 +00:00
Robert Lougher	9dcfbbae76	This reverts r365061 and r365062 (test update) Revision r365061 changed a skip of debug instructions for a skip of meta instructions. This is not safe, as IMPLICIT_DEF is classed as a meta instruction. llvm-svn: 365202	2019-07-05 12:42:06 +00:00
Sam Elliott	b2c9eed0d7	[RISCV] Support @llvm.readcyclecounter() Intrinsic On RISC-V, the `cycle` CSR holds a 64-bit count of the number of clock cycles executed by the core, from an arbitrary point in the past. This matches the intended semantics of `@llvm.readcyclecounter()`, which we currently leave to the default lowering (to the constant 0). With this patch, we will now correctly lower this intrinsic to the intended semantics, using the user-space instruction `rdcycle`. On 64-bit targets, we can directly lower to this instruction. On 32-bit targets, we need to do more, as `rdcycle` only returns the low 32-bits of the `cycle` CSR. In this case, we perform a custom lowering, based on the PowerPC lowering, using `rdcycleh` to obtain the high 32-bits of the `cycle` CSR. This custom lowering inserts a new basic block which detects overflow in the high 32-bits of the `cycle` CSR during reading (because multiple instructions are required to read). The emitted assembly matches the suggested assembly in the RISC-V specification. Differential Revision: https://reviews.llvm.org/D64125 llvm-svn: 365201	2019-07-05 12:35:21 +00:00
Nico Weber	a780276301	lld, llvm-dlltool, llvm-lib: Use getAsString() instead of getSpelling() for printing unknown args Since OPT_UNKNOWN args never have any values and consist only of spelling (and are never aliased), this doesn't make any difference in practice, but it's more consistent with Arg's guidance to use getAsString() for diagnostics, and it matches what clang does. Also tweak two tests to use an unknown option that contains '=' for additional coverage while here. (The new tests pass fine with the old code too though.) llvm-svn: 365200	2019-07-05 12:31:32 +00:00
Robert Lougher	2478b62098	Revert r365198 as this accidentally committed something that should not have been added. llvm-svn: 365199	2019-07-05 12:30:45 +00:00
Robert Lougher	3bea2b15f5	This reverts r365061 and r365062 (test update) Revision r365061 changed a skip of debug instructions for a skip of meta instructions. This is not safe, as IMPLICIT_DEF is classed as a meta instruction. llvm-svn: 365198	2019-07-05 12:20:21 +00:00
Sam Elliott	6884d5e040	[RISCV][NFC] Replace hard-coded CSR duplication with symbolic references Reviewers: asb, lenary Reviewed By: asb, lenary Subscribers: MaskRay, hiraditya, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, kito-cheng, shiva0217, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, rkruppe, PkmX, jocewei, psnobl, benna, Jim, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64139 Patch by James Clarke (jrtc27) llvm-svn: 365195	2019-07-05 12:16:40 +00:00
Simon Pilgrim	6af0891fe7	Fix MSVC/cppcheck Use::Next isn't initialized warning. NFCI. llvm-svn: 365194	2019-07-05 12:12:23 +00:00
Eugene Leviant	194f16b354	[llvm-objcopy] Allow strip symtab from executables and DSOs Differential revision: https://reviews.llvm.org/D61672 llvm-svn: 365193	2019-07-05 12:10:44 +00:00
Thomas Preud'homme	41f2bea60c	[FileCheck] Fix comment in parseNumericVariableUse Summary: Comment explaining the interaction between parsing of numeric variable definition and uses in parseNumericVariableUse is stale since it suggests both use and definition parsing is done in the same function. This was the case in a previous version of the patch committed as `71d3f227a7` but is no longer the case. This patch updates the comment accordingly. Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: JonChesterfield, rogfer01, hfinkel, kristina, rnk, tra, arichardson, grimar, dblaikie, probinson, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D64227 llvm-svn: 365192	2019-07-05 12:01:12 +00:00
Thomas Preud'homme	28196a5da8	[FileCheck] Factor some parsing checks out Summary: Both callers of parseNumericVariableDefinition() perform the same extra check that no character is found after the variable name. This patch factors out this check into parseNumericVariableDefinition(). Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: JonChesterfield, rogfer01, hfinkel, kristina, rnk, tra, arichardson, grimar, dblaikie, probinson, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D64226 llvm-svn: 365191	2019-07-05 12:01:06 +00:00
Thomas Preud'homme	a188ad2653	[FileCheck] Add missing final dot in comment llvm-svn: 365190	2019-07-05 12:00:56 +00:00
Eugene Leviant	820cc01d1e	[ThinLTO] Attempt to recommit r365040 after caching fix It's possible that some function can load and store the same variable using the same constant expression: store %Derived* @foo, %Derived bitcast (%Base @bar to %Derived*) %42 = load %Derived, %Derived bitcast (%Base @bar to %Derived**) The bitcast expression was mistakenly cached while processing loads, and never examined later when processing store. This caused @bar to be mistakenly treated as read-only variable. See load-store-caching.ll. llvm-svn: 365188	2019-07-05 12:00:10 +00:00
James Henderson	1a517a4630	[docs][llvm-objcopy] Improve some wording. llvm-svn: 365187	2019-07-05 11:57:07 +00:00
Nico Weber	cf1a11ded2	Make joined instances of JoinedOrSeparate flags point to the unaliased args, like all other arg types do This fixes an 8-year-old regression. r105763 made it so that aliases always refer to the unaliased option – but it missed the "joined" branch of JoinedOrSeparate flags. (r162231 then made the Args classes non-virtual, and r169344 moved them from clang to llvm.) Back then, there was no JoinedOrSeparate flag that was an alias, so it wasn't observable. Now /U in CLCompatOptions is a JoinedOrSeparate alias in clang, and warn_slash_u_filename incorrectly used the aliased arg id (using the unaliased one isn't really a regression since that warning checks if the undefined macro contains slash or backslash and only then emits the warning – and no valid use will pass "-Ufoo/bar" or similar). Also, lld has many JoinedOrSeparate aliases, and due to this bug it had to explicitly call `getUnaliasedOption()` in a bunch of places, even though that shouldn't be necessary by design. After this fix in Option, these calls really don't have an effect any more, so remove them. No intended behavior change. (I accidentally fixed this bug while working on PR29106 but then wondered why the warn_slash_u_filename broke. When I figured it out, I thought it would make sense to land this in a separate commit.) Differential Revision: https://reviews.llvm.org/D64156 llvm-svn: 365186	2019-07-05 11:45:24 +00:00
Nico Weber	1126f1a6de	gn build: Merge r365179 llvm-svn: 365185	2019-07-05 11:34:48 +00:00
George Rimar	d0921a4696	[Object/ELF.h] - Improve error reporting. The errors coming from ELF.h are usually not very useful because they are uninformative. This patch is a first step to improve the situation. I tested this patch with a run of check-llvm and found that a few messages are untested. In this patch, I did not add more tests but marked all such cases with a "TODO" comment. For all tested messages I extended the error text to provide more details (see test cases changed). Differential revision: https://reviews.llvm.org/D64014 llvm-svn: 365183	2019-07-05 11:28:49 +00:00
Nico Weber	fdef18b42d	lld-link: Make /debugtype: option work better - The code tried to pass false to split()'s KeepEmpty parameter, but instead passed it to MaxSplit. As a result, it would never split on commas. This has been broken since the flag was added in r278056. - The code used getSpelling() for getting the argument's values, but getSpelling() always returns the `/debugtype:` prefix without any values. So if any /debugtype: flag was passed, it always resulted in an "unknown option:" warning. (The warning code then used the correct getValue() for printing the invalid option, so the warning looked kind of like it made sense.) This regressed in r342894. Slightly improve the test coverage of this feature (but since I don't know what this flag actually does, there's still no test for the correct semantics), and add a comment to getSpelling() explaining what it does. llvm-svn: 365182	2019-07-05 11:28:31 +00:00
Simon Pilgrim	8b25d9bf01	[X86][SSE] LowerINSERT_VECTOR_ELT - early out for out of range indices Fixes OSS-Fuzz #15662 llvm-svn: 365180	2019-07-05 10:34:53 +00:00
David Green	25cf705097	[ARM] MVE VMOV immediate handling This adds some handling for VMOVimm, using the same method that NEON uses. We create VMOVIMM/VMVNIMM/VMOVFPIMM nodes based on the immediate, and select them using the now renamed ARMvmovImm/etc. There is also an extra 64bit immediate mode that I have not yet added here. Code by David Sherwood Differential Revision: https://reviews.llvm.org/D63884 llvm-svn: 365178	2019-07-05 10:02:43 +00:00
David Green	bb7e97d783	[ARM] MVE fp to int conversions This adds the patterns needed for fptosi and sitofp. Differential Revision: https://reviews.llvm.org/D63729 llvm-svn: 365176	2019-07-05 09:34:30 +00:00
Fangrui Song	6fa850c4fe	[RISCV] Delete a ctor that is commented out. NFC llvm-svn: 365175	2019-07-05 08:25:14 +00:00
Seiya Nuta	ecb60b7e5c	[llvm-objcopy][NFC] Refactor output target parsing v2 Summary: Use an enum instead of string to hold the output file format in Config.InputFormat and Config.OutputFormat. This is essential for supporting output file formats other than ELF. This patch was originally submitted as D63239. However, it had a use-of-uninitialized-value bug and was reverted in r364379 (git commit `4ee933c`). This patch includes the fix for the bug by setting Config.InputFormat/Config.OutputFormat in parseStripOptions. Reviewers: espindola, alexshap, rupprecht, jhenderson Reviewed By: jhenderson Subscribers: emaste, arichardson, jakehehrlich, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64170 llvm-svn: 365173	2019-07-05 05:28:38 +00:00
Fangrui Song	3c47d38f61	[llvm-objcopy][test] Fix respect-umask.test after D62718/r365162 llvm-svn: 365172	2019-07-05 05:10:28 +00:00
Alex Brachet	de7da3f694	Fix patch not passing test cases llvm-svn: 365170	2019-07-05 01:28:41 +00:00
Alex Brachet	f0e5aa035e	Temporarily stop failing test case llvm-svn: 365168	2019-07-05 01:13:09 +00:00
Peter Collingbourne	8da0286c9c	gn build: Merge r365130. llvm-svn: 365167	2019-07-05 01:11:20 +00:00
Peter Collingbourne	08525c53b3	gn build: Merge r365103. llvm-svn: 365166	2019-07-05 01:11:18 +00:00
Peter Collingbourne	dbd9ca48a1	gn build: Merge r365007. llvm-svn: 365165	2019-07-05 01:11:16 +00:00
Peter Collingbourne	53b272b352	gn build: Merge r365091. llvm-svn: 365164	2019-07-05 01:11:14 +00:00
Craig Topper	171732aeb3	[X86] Add custom isel to select ADD/SUB/OR/XOR/AND to their non-immediate forms under optsize when the immediate has additional users. Summary: We attempt to prevent folding immediates with multiple users under optsize. But we only do this from store nodes and X86ISD::ADD/SUB/XOR/OR/AND patterns. We don't do it for ISD::ADD/SUB/XOR/OR/AND even though we count them as users when deciding whether to fold into other nodes. This leads to situations where we block folding to a compare for example, but still fold into an AND or OR as seen in PR27202. Unfortunately touching the isel patterns in tablegen for the ISD::ADD/SUB/XOR/OR/AND opcodes will cause the patterns to be unusable for fast isel. And we don't have a way to make a fast isel only pattern. To work around this, this patch adds custom isel in front of the isel table that will select the non-immediate forms if the immediate has additional users. This may create some issues for ANDN and NOT matching. And there's room for improvement with unsigned 32 immediates on 64-bit AND. This patch needs more thorough test cases, but I wanted to get feedback on the direction. Please send me any other test cases you've seen in the wild. I think we probably have the same issue with the immediate matching when we fold RMW from X86ISD::ADD/SUB/XOR/OR/AND. And our TEST immediate shrinking logic. Our cost modeling for immediates that can fit in a sign extended 8-bit immediate on a 16/32/64 bit operation is completely wrong. I also wonder if we should update the ConstantHoisting cost model and block folding for "opaque" constants. But of course constants can still be created by DAG combine and lowering optimizations. Fixes PR27202 Reviewers: spatel, RKSimon, andreadb Reviewed By: RKSimon Subscribers: jsji, hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59909 llvm-svn: 365163	2019-07-04 22:53:57 +00:00
Alex Brachet	7439a9a921	[llvm-objcopy] Change handling of output file permissions Summary: Address bug 42082 (https://bugs.llvm.org/show_bug.cgi?id=42082), where files were always written with 0775 permissions. Now, the output file is given 0666, or 0777 if the object is executable. Reviewers: espindola, alexshap, rupprecht, jhenderson, jakehehrlich, MaskRay Reviewed By: rupprecht, jhenderson, jakehehrlich, MaskRay Subscribers: emaste, arichardson, jakehehrlich, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62718 llvm-svn: 365162	2019-07-04 22:45:27 +00:00
Simon Atanasyan	1e9c00308b	[mips] Refactor expandSeq and expandSeqI methods. NFC llvm-svn: 365161	2019-07-04 22:45:07 +00:00
Hubert Tong	53a83b7e3c	[NFC] Make some ObjectFormatType switches covering Summary: This patch removes the `default` case from some switches on `llvm::Triple::ObjectFormatType`, and cases for the missing enumerators are then added. For `UnknownObjectFormat`, the action (`llvm_unreachable`) for the `default` case is kept. For the other unhandled cases, `report_fatal_error` is used instead. Reviewers: sfertile, jasonliu, daltenty Reviewed By: sfertile Subscribers: wuzish, aheejin, jsji, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D63767 llvm-svn: 365160	2019-07-04 21:40:28 +00:00
Alex Brachet	09a066b3aa	[docs] [tools] Fix see also links Summary: Changes "see also" links to use :manpage: instead of plain text or the form `name|name` which was being treated literally, not as a link. Reviewers: jhenderson, rupprecht Reviewed By: jhenderson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63970 llvm-svn: 365159	2019-07-04 21:19:05 +00:00
Craig Topper	e9aed963ce	[DAGCombiner] Don't combine (addcarry (uaddo X, Y), 0, Carry) -> (addcarry X, Y, Carry) if the Carry comes from the uaddo. Summary: The uaddo won't be removed and the addcarry will still be dependent on the uaddo. So we'll just increase the use count of X and Y and potentially require a COPY. Reviewers: spatel, RKSimon, deadalnix Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64190 llvm-svn: 365149	2019-07-04 18:18:46 +00:00
Tim Renouf	5816889c74	[AMDGPU] Custom lower INSERT_SUBVECTOR v3, v4, v5, v8 Summary: Since the changes to introduce vec3 and vec5, INSERT_VECTOR for these sizes has been marked "expand", which made LegalizeDAG lower it to loads and stores via a stack slot. The code got optimized a bit later, but the now-unused stack slot was never deleted. This commit avoids that problem by custom lowering INSERT_SUBVECTOR into an EXTRACT_VECTOR_ELT and INSERT_VECTOR_ELT for each element in the subvector to insert. V2: Addressed review comments re test. Differential Revision: https://reviews.llvm.org/D63160 Change-Id: I9e3c13e36f68cfa3431bb9814851cc1f673274e1 llvm-svn: 365148	2019-07-04 17:38:24 +00:00
Sanjay Patel	75b5edf6a1	[InstCombine] allow undef elements when forming splat from chain of insertelements We allow forming a splat (broadcast) shuffle, but we were conservatively limiting that to cases where all elements of the vector are specified. It should be safe from a codegen perspective to allow undefined lanes of the vector because the expansion of a splat shuffle would become the chain of inserts again. Forming splat shuffles can reduce IR and help enable further IR transforms. Motivating bugs: https://bugs.llvm.org/show_bug.cgi?id=42174 https://bugs.llvm.org/show_bug.cgi?id=16739 Differential Revision: https://reviews.llvm.org/D63848 llvm-svn: 365147	2019-07-04 16:45:34 +00:00
Jay Foad	0cd50b2a95	Fix typos in comments and debug output. llvm-svn: 365146	2019-07-04 15:04:29 +00:00
Simon Pilgrim	e602f70de1	[X86][SSE] Add partial dereferenceable vector load test inspired by PR21780 llvm-svn: 365145	2019-07-04 15:00:04 +00:00
Andus Yu	7aff852810	llvm-c-test avoid calling malloc(0) Summary: As explained in D63668, malloc(0) could return a null pointer. llvm-c-test does not handle this case correctly. Instead of calling malloc(0), avoid the operation altogether. Authored By: andusy Reviewers: hubert.reinterpretcast, xingxue, jasonliu, daltenty, cebowleratibm Reviewed By: hubert.reinterpretcast Subscribers: mehdi_amini, dexonsmith, jsji, llvm-commits Tags: LLVM Differential Revision: https://reviews.llvm.org/D63788 llvm-svn: 365144	2019-07-04 14:36:34 +00:00
Serge Guelton	85fc597f26	Document legacy pass manager extension points Differential Revision: https://reviews.llvm.org/D64093 llvm-svn: 365142	2019-07-04 14:03:11 +00:00
David Bolvansky	5f73e37af8	[NFC] Added tests for D64099 llvm-svn: 365141	2019-07-04 13:48:32 +00:00
Simon Pilgrim	146f1f2e5e	[X86][SSE] Add some partial dereferenceable vector load tests inspired by PR16739 llvm-svn: 365138	2019-07-04 13:31:49 +00:00
Michael Liao	7a9ad430fe	[AMDGPU] Correct the setting of `FlatScratchInit`. Summary: - That flag setting should skip spilling stack slot. Reviewers: arsenm, rampitec Subscribers: qcolombet, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64143 llvm-svn: 365137	2019-07-04 13:29:45 +00:00
Simon Pilgrim	8351c32764	[X86] Regenerate load fold peephole test. llvm-svn: 365136	2019-07-04 12:33:37 +00:00
Simon Pilgrim	555d743fcf	Fix -Wdocumentation param warning. Don't put the full stop at the end of a param name - it confuses the doxygen parser llvm-svn: 365128	2019-07-04 10:35:31 +00:00
Simon Pilgrim	9a8f3ecfdb	Fix -Wdocumentation warning. llvm-svn: 365127	2019-07-04 10:33:21 +00:00
Simon Pilgrim	fde766de4b	[X86][AVX1] Combine concat_vectors(pshufd(x,c),pshufd(y,c)) -> vpermilps(concat_vectors(x,y),c) Bitcast v4i32 to v8f32 and back again - it might be worth adding isel patterns for X86PShufd v8i32 on AVX1 targets like we did for X86Blendi to avoid the bitcasts? llvm-svn: 365125	2019-07-04 10:17:10 +00:00
Simon Pilgrim	8177673fb4	Fix MSVC "not all control paths return a value" warnings. NFCI. llvm-svn: 365119	2019-07-04 09:46:06 +00:00
Mikael Holmen	67dd39f86e	[Remarks] Silence gcc warning by catching unhandled values in switches Without this fix gcc (7.4) complains with ../lib/Remarks/RemarkParser.cpp: In function 'std::unique_ptr<llvm::remarks::ParserImpl> formatToParserImpl(llvm::remarks::ParserFormat, llvm::StringRef)': ../lib/Remarks/RemarkParser.cpp:29:1: error: control reaches end of non-void function [-Werror=return-type] } ^ ../lib/Remarks/RemarkParser.cpp: In function 'std::unique_ptr<llvm::remarks::ParserImpl> formatToParserImpl(llvm::remarks::ParserFormat, llvm::StringRef, const llvm::remarks::ParsedStringTable&)': ../lib/Remarks/RemarkParser.cpp:38:1: error: control reaches end of non-void function [-Werror=return-type] } ^ The Format enum currently only contains the value YAML which is indeed already handled in the switches, but gcc complains anyway. Adding a default case with an llvm_unreachable silences gcc. llvm-svn: 365118	2019-07-04 09:29:18 +00:00
David Green	2b20ee4110	[ARM] Favour PL/MI over GE/LT when possible The arm condition codes for GE is N==V (and for LT is N!=V). If the source of flags cannot set V (overflow), such as a cmp against #0, then we can use the simpler PL and MI conditions that only check N. As these PL/MI conditions are simpler than GE/LT, other passes like the peephole optimiser can have a better time optimising away the redundant CMPs. The exception is the VSEL instruction, which cannot take the PL code, so there the transform favours GE. Differential Revision: https://reviews.llvm.org/D64160 llvm-svn: 365117	2019-07-04 08:58:58 +00:00
David Green	147547ee80	[ARM] Added testing for D64160. NFC Adds some extra vsel testing and regenerates long shift and saturation bitop tests. llvm-svn: 365116	2019-07-04 08:49:32 +00:00
James Henderson	38d9903826	[docs][llvm-dwarfdump] Remove unnecessary reference to --show-children The --show-children option description describes what it does, and references the =<offset> parameter of section dump switches. I don't think it needs to be explained again in the documentation of the section dump switches too. Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D64132 llvm-svn: 365115	2019-07-04 08:49:04 +00:00
Simon Tatham	c74322a11b	[TableGen] Allow DAG isel patterns to override default operands. When a Tablegen instruction description uses `OperandWithDefaultOps`, isel patterns for that instruction don't have to fill in the default value for the operand in question. But the flip side is that they actually //can't// override the defaults even if they want to. This will be very inconvenient for the Arm backend, when we start wanting to write isel patterns that generate the many MVE predicated vector instructions, in the form with predication actually enabled. So this small Tablegen fix makes it possible to write an isel pattern either with or without values for a defaulted operand, and have the default values filled in only if they are not overridden. If all the defaulted operands come at the end of the instruction's operand list, there's a natural way to match them up to the arguments supplied in the pattern: consume pattern arguments until you run out, then fill in any missing instruction operands with their default values. But if defaulted and non-defaulted operands are interleaved, it's less clear what to do. This does happen in existing targets (the first example I came across was KILLGT, in the AMDGPU/R600 backend), and of course they expect the previous behaviour (that the default for those operands is used and a pattern argument is not consumed), so for backwards compatibility I've stuck with that. Reviewers: nhaehnle, hfinkel, dmgreen Subscribers: mehdi_amini, javed.absar, tpr, kristof.beyls, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63814 llvm-svn: 365114	2019-07-04 08:43:20 +00:00
David Green	d2a9ec29d0	[ARM] MVE bitwise instruction patterns This adds patterns for the simpler VAND, VORR and VEOR bitwise vector instructions. It also adjusts the top16Zero PatLeaf to not match on vector instructions, which can otherwise cause problems. Code written by David Sherwood. Differential Revision: https://reviews.llvm.org/D63867 llvm-svn: 365113	2019-07-04 08:41:23 +00:00
QingShan Zhang	63e62006cf	[NFC][PowerPC] Make the PowerPC scheduling strategy feature only control the strategy instead of the scheduler. llvm-svn: 365110	2019-07-04 07:43:51 +00:00
Craig Topper	163b8bb3f5	[X86] Use pointer sized indices instead of i32 for EXTRACT_VECTOR_ELT and INSERT_VECTOR_ELT in a couple places. Most places already did this. llvm-svn: 365109	2019-07-04 06:21:54 +00:00
Dylan McKay	80728d1d75	[AVR] Fix tests after r363757 r363757 renamed ExpandISelPseudo to FinalizeISel, so the RUN line in select-must-add-unconditional-jump.mir needed updating to refer to finalize-isel. llvm-svn: 365108	2019-07-04 06:12:47 +00:00
Serguei Katkov	6d8813a391	[LoopPeel] Some small comment update. NFC. Follow-up change of comment after https://reviews.llvm.org/D63917 is landed. llvm-svn: 365107	2019-07-04 05:10:14 +00:00
Fangrui Song	1f333562de	[PowerPC] Support constraint code "ww" Summary: "ww" and "ws" are both constraint codes for VSX vector registers that hold scalar double data. "ww" is preferred for float while "ws" is preferred for double. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D64119 llvm-svn: 365106	2019-07-04 04:44:42 +00:00
Alex Brachet	fa9d232e43	[docs] [NFC] Removed excess spacing Summary: Removed excess new lines from the documentation. As far as I can tell, reStructuredText is agnostic to new lines; their use was inconsistent and had no effect on how the files were displayed. Reviewers: jhenderson, rupprecht, JDevlieghere Reviewed By: jhenderson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63971 llvm-svn: 365105	2019-07-04 04:41:06 +00:00
Chen Zheng	469f30abab	[PowerPC] Hardware Loop branch instruction's condition may not be icmp. This fixes pr42492. Differential Revision: https://reviews.llvm.org/D64124 llvm-svn: 365104	2019-07-04 01:51:47 +00:00
Francis Visoiu Mistrih	312f1d7d7c	[Remarks] Require an explicit format to the parser Make the parser require an explicit format. This allows new formats to be easily added by following YAML as an example. llvm-svn: 365102	2019-07-04 00:31:03 +00:00
Francis Visoiu Mistrih	e6ba313a86	[Remarks][NFC] Move the string table parsing out of the parser constructor Make the parser take an already-parsed string table. llvm-svn: 365101	2019-07-04 00:30:58 +00:00
Derek Schuff	51d3c4dfcd	[WebAssembly] Update test failure explanations llvm-svn: 365100	2019-07-04 00:24:35 +00:00
Shoaib Meenai	995798d2d5	[MachO] Add valid architecture function Added array of valid architectures and function returning array. Modified llvm-lipo to include list of valid architectures in error message for invalid arch. Patch by Anusha Basana <anusha.basana@gmail.com> Differential Revision: https://reviews.llvm.org/D63735 llvm-svn: 365099	2019-07-04 00:17:02 +00:00
Lang Hames	f5a885fddd	[JITLink][ORC] Add EHFrameRegistrar interface, use in EHFrameRegistrationPlugin. Replaces direct calls to eh-frame registration with calls to methods on an EHFrameRegistrar instance. This allows clients to substitute a registrar that registers frames in a remote process via IPC/RPC. llvm-svn: 365098	2019-07-04 00:05:12 +00:00
Reid Kleckner	f7e52fbdb5	Revert [ThinLTO] Optimize writeonly globals out This reverts r365040 (git commit `5cacb91475`) Speculatively reverting, since this appears to have broken check-lld on Linux. Partial analysis in https://crbug.com/981168. llvm-svn: 365097	2019-07-04 00:03:30 +00:00
Derek Schuff	ec4be57655	[WebAssembly] Enable IndirectBrExpandPass Wasm doesn't have a direct way to lower indirectbr, so hook up the IndirectBrExpandPass to lower indirectbr into a switch. Fixes PR42498 Reviewers: aheejin Differential Revision: https://reviews.llvm.org/D64161 llvm-svn: 365096	2019-07-03 23:54:06 +00:00
Matt Arsenault	5b0922fe1f	AMDGPU: Add pass to lower SGPR spills This is split out from my patches to split register allocation into a separate SGPR and VGPR phase, and has some parts that aren't yet used (like maintaining LiveIntervals). This simplifies making the frame pointer register callee saved. As it is now, the code to determine callee saves needs to predict all the possible SGPR spills and how many callee saved VGPRs are needed. By handling this before PrologEpilogInserter, it's possible to just check the spill objects that already exist. Change-Id: I29e6df4034afcf949e06f8ef44206acb94696f04 llvm-svn: 365095	2019-07-03 23:32:29 +00:00
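
Note on 2b20ee4110 ([ARM] Favour PL/MI over GE/LT when possible): the commit message states that GE is N==V and LT is N!=V, and that PL/MI check only N, so the two pairs agree whenever the flag-setting instruction cannot set V. The Python sketch below is an illustration of that reasoning only, not code from the patch. The helper names (cmp_flags, ge, lt, pl, mi) are hypothetical, and the flag model is a simplified 32-bit subtract that assumes the usual signed-overflow rule for V.

```python
MASK = 0xFFFFFFFF  # model 32-bit registers


def cmp_flags(lhs, rhs):
    """Return (N, V) for a 32-bit `cmp lhs, rhs`, which computes lhs - rhs.

    Hypothetical helper for illustration; only N and V are modelled.
    """
    lhs &= MASK
    rhs &= MASK
    result = (lhs - rhs) & MASK
    n = bool(result >> 31)
    # Signed overflow on subtraction: the operands have different signs
    # and the sign of the result differs from the sign of lhs.
    v = bool(((lhs ^ rhs) & (lhs ^ result)) >> 31)
    return n, v


def ge(n, v):  # GE: N == V
    return n == v


def lt(n, v):  # LT: N != V
    return n != v


def pl(n, v):  # PL: N clear
    return not n


def mi(n, v):  # MI: N set
    return n


# Against #0 the subtraction cannot overflow, so V is always clear and
# GE/LT collapse to PL/MI.
for x in (0, 1, 42, 0x7FFFFFFF, 0x80000000, 0xFFFFFFFF):
    n, v = cmp_flags(x, 0)
    assert not v
    assert ge(n, v) == pl(n, v)
    assert lt(n, v) == mi(n, v)

# With a non-zero operand V can be set, and the conditions then differ.
n, v = cmp_flags(0x80000000, 1)  # INT_MIN compared with 1
assert v and ge(n, v) != pl(n, v)
```

The last check shows why the substitution is limited to flag sources that cannot set V: for INT_MIN compared with 1 the subtraction overflows, GE is false but PL is true.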

181563 Commits