llvm-project

Commit Graph

Author	SHA1	Message	Date
Aaron Ballman	90cfbb8167	Add LLVM_VALUE_FUNCTION to Optional::map(); NFC This is for future-proofing when compiling with MSVC once we drop support for 2017.	2020-01-22 14:21:08 -05:00
Nico Weber	8f21acbf52	[gn build] reformat all build files again Run `git ls-files '.gn' '.gni' \| xargs llvm/utils/gn/gn.py format` after recent fixes to formatting of comments after single-element lists.	2020-01-22 14:04:20 -05:00
Saar Raz	c2a250e1c4	[Concepts] Fix bug when referencing function parameters in instantiated function template requires clause Fixes bug #44613 - incorrect instantiated parameters were being added when checking instantiated function constraints	2020-01-22 20:43:38 +02:00
Aaron Ballman	1e4764e103	Add a comment about when we can remove this construct; NFC.	2020-01-22 13:17:38 -05:00
Petr Hosek	635cbef505	[CMake][Fuchsia] Disable in-process cc1 The in-process cc1 currently breaks Goma client and therefore prevents the use of tip-of-tree Clang with Goma, so we disable the in-process cc1 in our toolchain until this issue is resolved.	2020-01-22 10:11:54 -08:00
Simon Pilgrim	5340434c94	[X86][SSE] combineExtractWithShuffle - extract(bitcast(broadcast(x))) --> x Removes some unnecessary gpr<-->fpu traffic	2020-01-22 18:02:58 +00:00
David Green	58991ba773	[ARM] Mark MVE loads/store as not having side effects The hasSideEffect parameter is usually automatically inferred from instruction patterns. For some of our MVE instructions, we do not have patterns though, such as for the pre/post inc loads and stores. This instead specifies the flag manually on the base MVE_VLDRSTR_base tablegen class, making sure we get this correct. This can help with scheduling multiple loads more optimally. Here I've added a unittest as a more direct form of testing. Differential Revision: https://reviews.llvm.org/D73117	2020-01-22 17:56:55 +00:00
Nico Weber	cd470717d1	Revert "[DA][TTI][AMDGPU] Add option to select GPUDA with TTI" This reverts commit `a90a6502ab`. Broke tests on Windows: http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/13808	2020-01-22 12:56:19 -05:00
Nico Weber	bfcfa53c3d	Revert "[gn build] [win] produce symbolized stack frames in release builds too" This reverts commit `fd98eccf98`. Seems to have no effect, need to try it locally for a bit first.	2020-01-22 12:54:19 -05:00
Mehdi Amini	5114d55dde	[Flang] add some cmake code to allow for out-of-tree building of MLIR and LLVM Differential Revision: https://reviews.llvm.org/D72418	2020-01-22 17:43:15 +00:00
Kostya Kortchinsky	990acd450c	[scudo][standalone] Reduce Android region sizes to 128MB Summary: Unity is making irresponsible assumptions as to how clumped up memory should be. With larger regions, we break those, resulting in errors like: "Using memoryadresses from more that 16GB of memory" This is unfortunately one of those situations where we have to bend to existing code because we doubt it's going to change any time soon. 128MB should be enough, but we could be flirting with OOMs in the higher class sizes. Reviewers: cferris, eugenis, hctim, morehouse, pcc Subscribers: #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D73143	2020-01-22 09:42:07 -08:00
Aaron Ballman	dfe9f130e0	Revert "Unconditionally enable lvalue function designators; NFC" This reverts commit `968561bcdc`	2020-01-22 12:40:39 -05:00
Nico Weber	fd98eccf98	[gn build] [win] produce symbolized stack frames in release builds too	2020-01-22 12:36:38 -05:00
Florian Hahn	300997c41a	[AArch64] Don't rename registers with pseudo defs in Ld/St opt. If the root def of for renaming is a noop-pseudo instruction like kill, we would end up without a correct def for the renamed register, causing miscompiles. This patch conservatively bails out on any pseudo instruction. This fixes https://bugs.chromium.org/p/chromium/issues/detail?id=1037912#c70	2020-01-22 09:26:25 -08:00
Matt Arsenault	1192d7b254	AMDGPU/GlobalISel: Handle 16-bank LDS llvm.amdgcn.interp.p1.f16 The pattern is also mishandled by the generated matcher, so workaround this as in the DAG path. The existing DAG tests aren't particularly targeted to just this one intrinsic. These also end up differing in scheduling from SGPR->VGPR operand constraint copies.	2020-01-22 12:10:59 -05:00
David Tenty	45a4aaea7f	[NFC][XCOFF] Refactor Csect creation into TargetLoweringObjectFile Summary: We create a number of standard types of control sections in multiple places for things like the function descriptors, external references and the TOC anchor among others, so it is possible for their properties to be defined inconsistently in different places. This refactor moves their creation and properties into functions in the TargetLoweringObjectFile class hierarchy, where functions for retrieving various special types of sections typically seem to reside. Note: There is one case in PPCISelLowering which is specific to function entry points which we don't address since we don't have access to the TLOF there. Reviewers: DiggerLin, jasonliu, hubert.reinterpretcast Reviewed By: jasonliu, hubert.reinterpretcast Subscribers: wuzish, nemanjai, hiraditya, kbarton, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72347	2020-01-22 12:09:11 -05:00
Stanislav Mekhanoshin	2d0fcf786c	Precommit NFC part of DAGCombiner change. NFC. This is NFC part of DAGCombiner::visitEXTRACT_SUBVECTOR() change in the D73132.	2020-01-22 09:01:22 -08:00
Stanislav Mekhanoshin	fb8a3d1834	Regenerate test/CodeGen/ARM/vext.ll. NFC. This is to pre-commit whitespace only changes before D73132.	2020-01-22 08:56:08 -08:00
Matt Arsenault	c05f23e409	AMDGPU/GlobalISel: Select llvm.amdgcn.mov.dpp This is deprecated, but easy to support.	2020-01-22 11:43:53 -05:00
Matt Arsenault	dd09ec1208	AMDGPU/GlobalISel: Select llvm.amdgcn.mov.dpp8	2020-01-22 11:43:40 -05:00
Hiroshi Yamauchi	ddbc728828	[PGO][PGSO] Update BFI in CodeGenPrepare::optimizeSelectInst. Summary: Without the BFI update, some hot blocks are incorrectly treated as cold code. This fixes a FDO perf regression in the TSVC benchmark from D71288. Reviewers: davidxl Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73146	2020-01-22 08:36:54 -08:00
Pablo Barrio	a8ff6c0b09	[AArch64] Add test for DWARF return address signing Summary: Patch by LukeCheeseman and pbarrio Reviewers: samparker, chill Subscribers: kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72835	2020-01-22 16:36:21 +00:00
Matt Arsenault	0bf434ccd5	AMDGPU: Fix element size assertion The GlobalISel usage called this with bits, but the DAG usage was incorrectly using bytes.	2020-01-22 11:18:45 -05:00
Matt Arsenault	bb562d1af0	AMDGPU/GlobalISel: Keep G_BITCAST out of waterfall loop The waterfall utility function blindly inserts a phi for every def in the loop. We don't need this one to be preserved for every iteration. Saves an extra phi and copy inside the loop body.	2020-01-22 11:16:19 -05:00
Zakk Chen	0cb274de39	[RISCV] Support ABI checking with per function target-features 1. if users don't specific -mattr, the default target-feature come from IR attribute. 2. fixed bug and re-land this patch Reviewers: lenary, asb Reviewed By: lenary Tags: #llvm Differential Revision: https://reviews.llvm.org/D70837	2020-01-22 08:12:28 -08:00
Simon Pilgrim	a14aa7dabd	[X86][SSE] combineExtractWithShuffle - extract(bictcast(scalar_to_vector(x))) --> x Removes some unnecessary gpr<-->fpu traffic	2020-01-22 16:11:08 +00:00
Matt Arsenault	52ec7379ad	AMDGPU/GlobalISel: Fold add of constant into G_INSERT_VECTOR_ELT Move the subregister base like in the extract case.	2020-01-22 11:09:15 -05:00
Nico Weber	349f6bb873	[gn build] (manually) port `a174f0da62`	2020-01-22 11:08:34 -05:00
Michał Górny	3215f7c7a8	[compiler-rt] [builtins] Fix clear_cache_test to work with MPROTECT Fix clear_cache_test to work on NetBSD with PaX MPROTECT enabled, that is when creating W+X mmaps is prohibited. Use the recommended solution: create two mappings for the same memory area, make one of them RW, while the other RX. Copy the function into the RW area but run it from the RX area. In order to implement this, I've split the pointer variables to 'write_buffer' and 'execution_buffer'. Both are separate pointers on NetBSD, while they have the same value on other systems. I've also split the memcpy_f() into two: new memcpy_f() that only takes care of copying memory and discards the (known) result of memcpy(), and realign_f() that applies ARM realignment to the given pointer. Again, there should be no difference on non-NetBSD systems but on NetBSD copying is done on write_buffer, while realignment on pointer to the execution_buffer. I have tested this change on NetBSD and Linux. Differential Revision: https://reviews.llvm.org/D72578	2020-01-22 17:04:54 +01:00
Matt Arsenault	d1dbb5e471	AMDGPU/GlobalISel: Select G_INSERT_VECTOR_ELT	2020-01-22 11:00:49 -05:00
Matt Arsenault	3524d4412c	AMDGPU/GlobalISel: Fix RegBankSelect for G_INSERT_VECTOR_ELT The result and source vector are going to be tied, so these need to be the same bank. The inserted value also needs to be broken down based on the result bank, not the inserted value itself.	2020-01-22 10:57:50 -05:00
Marcello Maggioni	04a151710e	[mlir] Swap use of to_vector() with lookupValues() in LLVMIRIntrinsicGen Summary: llvm::to_vector() accepts a Range value and not the pair of arguments we are currently passing. Also we probably want the lowered LLVM values in the vector, while operand_begin()/operand_end() on MLIR ops returns MLIR types. lookupValues() seems the correct way to collect such values. Reviewers: rriddle, andydavis1, antiagainst, nicolasvasilache, ftynse Subscribers: jdoerfert, mehdi_amini, jpienaar, burmako, shauheen, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73137	2020-01-22 07:56:24 -08:00
Matt Arsenault	e3d352c541	AMDGPU/GlobalISel: Fold constant offset vector extract indexes Handle dynamic vector extracts that use an index that's an add of a constant offset into moving the base subregister of the indexing operation. Force the add into the loop in regbankselect, which will be recognized when selected.	2020-01-22 10:50:59 -05:00
Med Ismail Bennani	9dc9f7ca14	[lldb/Target] Sort CMakeLists (NFC) Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2020-01-22 16:38:37 +01:00
Kazushi (Jam) Marukawa	83b67526d5	[VE] select and selectcc patterns Summary: select and selectcc isel patterns and tests for i32/i64 and fp32/fp64. Includes optimized selectcc patterns for fmin/fmax/maxs/mins. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73195	2020-01-22 16:30:38 +01:00
Matt Arsenault	e93e1b621c	AMDGPU: Fix typo	2020-01-22 10:17:46 -05:00
Matt Arsenault	2fe500ab5b	AMDGPU: Look through casted selects to constant fold bin ops The promotion of the uniform select to i32 interfered with this fold.	2020-01-22 10:16:39 -05:00
Matt Arsenault	bcd91778fe	AMDGPU: Do binop of select of constant fold in AMDGPUCodeGenPrepare DAGCombiner does this, but divisions expanded here miss this optimization. Since `67aa18f165`, divisions have been expanded here and missed out on this optimization. Avoids test regressions in a future patch.	2020-01-22 10:16:39 -05:00
Matt Arsenault	a174f0da62	AMDGPU/GlobalISel: Add pre-legalize combiner pass Just copy the AArch64 pass as-is for now, except for removing the memcpy handling.	2020-01-22 10:16:39 -05:00
Aaron Ballman	968561bcdc	Unconditionally enable lvalue function designators; NFC We previously had to guard against older MSVC and GCC versions which had rvalue references but not support for marking functions with ref qualifiers. However, having bumped our minimum required version to MSVC 2017 and GCC 5.1 mean we can unconditionally enable this feature. Rather than keeping the macro around, this replaces use of the macro with the actual ref qualifier.	2020-01-22 09:54:34 -05:00
Sanjay Patel	0ade2abdb0	[InstCombine] fneg(X + C) --> -C - X This is 1 of the potential folds uncovered by extending D72521. We don't seem to do this in the backend either (unless I'm not seeing some target-specific transform). icc and gcc (appears to be target-specific) do this transform. Differential Revision: https://reviews.llvm.org/D73057	2020-01-22 09:48:43 -05:00
Kazushi (Jam) Marukawa	dc69265eea	[VE] setcc isel patterns Summary: SETCC isel patterns and tests for i32/64 and fp32/64 comparison Reviewers: arsenm, rengolin, craig.topper, k-ishizaka Reviewed By: arsenm Subscribers: merge_guards_bot, wdng, hiraditya, llvm-commits Tags: #ve, #llvm Differential Revision: https://reviews.llvm.org/D73171	2020-01-22 15:45:57 +01:00
David Green	e9c198278e	[ARM] Basic gather scatter cost model This is a very basic MVE gather/scatter cost model, based roughly on the code that we will currently produce. It does not handle truncating scatters or extending gathers correctly yet, as it is difficult to tell that they are going to be correctly extended/truncated from the limited information in the cost function. This can be improved as we extend support for these in the future. Based on code originally written by David Sherwood. Differential Revision: https://reviews.llvm.org/D73021	2020-01-22 14:41:38 +00:00
David Green	0b83e14804	[ARM] MVE Gather Scatter cost model tests. NFC	2020-01-22 14:41:38 +00:00
Sander de Smalen	4cf16efe49	[AArch64][SVE] Add patterns for unpredicated load/store to frame-indices. This patch also fixes up a number of cases in DAGCombine and SelectionDAGBuilder where the size of a scalable vector is used in a fixed-width context (thus triggering an assertion failure). Reviewers: efriedma, c-rhodes, rovka, cameron.mcinally Reviewed By: efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D71215	2020-01-22 14:32:27 +00:00
Jay Foad	e0f0d0e55c	[MachineScheduler] Allow clustering mem ops with complex addresses The generic BaseMemOpClusterMutation calls into TargetInstrInfo to analyze the address of each load/store instruction, and again to decide whether two instructions should be clustered. Previously this had to represent each address as a single base operand plus a constant byte offset. This patch extends it to support any number of base operands. The old target hook getMemOperandWithOffset is now a convenience function for callers that are only prepared to handle a single base operand. It calls the new more general target hook getMemOperandsWithOffset. The only requirements for the base operands returned by getMemOperandsWithOffset are: - they can be sorted by MemOpInfo::Compare, such that clusterable ops get sorted next to each other, and - shouldClusterMemOps knows what they mean. One simple follow-on is to enable clustering of AMDGPU FLAT instructions with both vaddr and saddr (base register + offset register). I've left a FIXME in the code for this case. Differential Revision: https://reviews.llvm.org/D71655	2020-01-22 14:28:24 +00:00
Matt Arsenault	70096ca111	AMDGPU/GlobalISel: Fix RegbankSelect for llvm.amdgcn.fmul.legacy	2020-01-22 09:26:17 -05:00
Matt Arsenault	a722cbf77c	AMDGPU/GlobalISel: Handle atomic_inc/atomic_dec The intermediate instruction drops the extra volatile argument. We are missing an atomic ordering on these.	2020-01-22 09:26:17 -05:00
Matt Arsenault	9c928649a0	AMDGPU: Fix interaction of tfe and d16 This using the wrong result register, and dropping the result entirely for v2f16. This would fail to select on the scalar case. I believe it was also mishandling packed/unpacked subtargets.	2020-01-22 09:26:17 -05:00
Jaroslav Sevcik	4481eefbe8	[ASTImporter] Properly delete decls from SavedImportPaths Summary: We see a significant regression (~40% slower on large codebases) in expression evaluation after https://reviews.llvm.org/rL364771. A sampling profile shows the extra time is spent in SavedImportPathsTy::operator[] when called from ASTImporter::Import. I believe this is because ASTImporter::Import adds an element to the SavedImportPaths map for each decl unconditionally (see `7b81c3f879/clang/lib/AST/ASTImporter.cpp (L8256)`). To fix this, we call SavedImportPathsTy::erase on the declaration rather than clearing its value vector. That way we do not accidentally introduce new empty elements. (With this patch the performance is restored, and we do not see SavedImportPathsTy::operator[] in the profile anymore.) Reviewers: martong, teemperor, a.sidorin, shafik Reviewed By: martong Subscribers: rnkovacs, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73166	2020-01-22 15:20:06 +01:00

1 2 3 4 5 ...

340156 Commits All Branches Search

340156 Commits

All Branches