llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	d3bf06bc81	[DAGCombiner] Add combine for (not (strict_fsetcc)) to create a strict_fsetcc with the opposite condition. Unlike the existing code that I modified here, I only handle the case where the strict_fsetcc has a single use. Not sure exactly how to handle multiples uses. Testing this on X86 is hard because we already have a other combines that get rid of lowered version of the integer setcc that this xor will eventually become. So this combine really just saves a bunch of extra nodes being created. Not sure about other targets. Differential Revision: https://reviews.llvm.org/D71816	2020-01-24 14:15:36 -08:00
Evandro Menezes	9fa0f236d5	[PATCH] [Target] Test commit Modify comment to reflect the current users of `Regisgter.CostPerUse`.	2020-01-24 15:56:08 -06:00
LLVM GN Syncbot	da8bada938	[gn build] Port `8a81daaa8b`	2020-01-24 21:42:43 +00:00
Craig Topper	b1f3a0f972	Revert `a107f86` "[GlobalsAA] Add back a check to intrinsic_addresstaken.ll to see if the AVX and AVX512 bots still fail for it." It still fails some buildbots which is what I was trying to test.	2020-01-24 13:15:23 -08:00
Matt Arsenault	3b93945587	AMDGPU/GlobalISel: Select wqm, softwqm and wwm intrinsics	2020-01-24 13:06:44 -08:00
Matt Arsenault	87c46a3129	AMDGPU: Don't error on ds.ordered intrinsic in function These should be assumed to be called from a compute context. Also don't use a 2 entry switch over constants.	2020-01-24 13:06:44 -08:00
LLVM GN Syncbot	36b887ed15	[gn build] Port `3f8b100e94`	2020-01-24 21:02:26 +00:00
Stanislav Mekhanoshin	be8e38cbd9	Correct NumLoads in clustering Scheduler sends NumLoads argument into shouldClusterMemOps() one less the actual cluster length. So for 2 instructions it will pass just 1. Correct this number. This is NFC for in tree targets. Differential Revision: https://reviews.llvm.org/D73292	2020-01-24 12:45:28 -08:00
Matt Arsenault	9c346464c1	TableGen/GlobalISel: Handle non-leaf EXTRACT_SUBREG This previously only handled EXTRACT_SUBREGs from leafs, such as operands directly in the original output. Handle extracting from a result instruction.	2020-01-24 12:15:10 -08:00
Matt Arsenault	d8328c0b62	TableGen: Work around assert on Mips register definitions This would hit the "Biggest class wasn't first" assert in getMatchingSubClassWithSubRegs in a future patch for EXTRACT_SUBREG handling. Mips defines 4 identical register classes (MSA128B, MSA128H, MSA128BW, MSA128D). These have the same set of registers, and only differ by the isel type. I believe this is an ill formed way of defining registers, that probably is just to work around the inconvenience of mixing different types in a single register class in DAG patterns. Since these all have the same size, they would all sort to the beginning, but you would not necessarily get the same super register at the front as the assert enforces. Breaking the ambiguity by also sorting by name doesn't work, since each of these register classes all want to be first. Force sorting of the original register class if the size is the same.	2020-01-24 12:15:10 -08:00
Matt Arsenault	84e035d8f1	AMDGPU: Don't check constant address space for atomic stores We define a separate list for storable address spaces. This saves entry in the matcher table address space list.	2020-01-24 12:15:09 -08:00
Matt Arsenault	4fdae24733	AMDGPU/GlobalISel: Add selection tests for G_ATOMICRMW_ADD	2020-01-24 12:15:09 -08:00
Craig Topper	a107f86417	[GlobalsAA] Add back a check to intrinsic_addresstaken.ll to see if the AVX and AVX512 bots still fail for it. These bots failed for this several months ago and as a result, this check was removed. If they still fail I'm going to try to see if I can figure out why.	2020-01-24 11:54:23 -08:00
LLVM GN Syncbot	6530136fe3	[gn build] Port `555d8f4ef5`	2020-01-24 19:37:54 +00:00
Stanislav Mekhanoshin	555d8f4ef5	[AMDGPU] Bundle loads before post-RA scheduler We are relying on atrificial DAG edges inserted by the MemOpClusterMutation to keep loads and stores together in the post-RA scheduler. This does not work all the time since it allows to schedule a completely independent instruction in the middle of the cluster. Removed the DAG mutation and added pass to bundle already clustered instructions. These bundles are unpacked before the memory legalizer because it does not work with bundles but also because it allows to insert waitcounts in the middle of a store cluster. Removing artificial edges also allows a more relaxed scheduling. Differential Revision: https://reviews.llvm.org/D72737	2020-01-24 11:33:38 -08:00
Andy Kaylor	b35b7da460	[PGO] Attach appropriate funclet operand bundles to value profiling instrumentation calls Patch by Chris Chrulski When generating value profiling instrumentation, ensure the call gets the correct funclet token, otherwise WinEHPrepare will turn the call (and all subsequent instructions) into unreachable. Differential Revision: https://reviews.llvm.org/D73221	2020-01-24 11:20:53 -08:00
Stanislav Mekhanoshin	44b865fa7f	[AMDGPU] Allow narrowing muti-dword loads Currently BE allows only a little load narrowing because of the fear it will produce sub-dword ext loads. However, we can always allow narrowing if we are shrinking one multi-dword load to another multi-dword load. In particular we were unable to reduce s_load_dwordx8 into s_load_dwordx4 if identity shuffle was used to extract low 4 dwords. Differential Revision: https://reviews.llvm.org/D73133	2020-01-24 11:03:41 -08:00
Stanislav Mekhanoshin	7a94d4f4ee	Allow combining of extract_subvector to extract element Differential Revision: https://reviews.llvm.org/D73132	2020-01-24 10:50:26 -08:00
Austin Kerbow	c226646337	Resubmit: [DA][TTI][AMDGPU] Add option to select GPUDA with TTI Summary: Enable the new diveregence analysis by default for AMDGPU. Resubmit with test updates since GPUDA was causing failures on Windows. Reviewers: rampitec, nhaehnle, arsenm, thakis Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73315	2020-01-24 10:39:40 -08:00
Yuta Saito	c5bd3d0726	Support Swift calling convention for WebAssembly targets This adds basic support for the Swift calling convention with WebAssembly targets. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D71823	2020-01-24 10:30:46 -08:00
Austin Kerbow	37aa16ebb7	[DA] Don't propagate from unreachable blocks Summary: Fixes crash that could occur when a divergent terminator has an unreachable parent. Reviewers: rampitec, nhaehnle, arsenm Subscribers: jvesely, wdng, hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73323	2020-01-24 10:28:11 -08:00
Simon Pilgrim	abd1927d44	Fix some comment typos. NFC.	2020-01-24 18:18:42 +00:00
Alina Sbirlea	0d90d2457c	[LoopStrengthReduce] Teach LoopStrengthReduce to preserve MemorySSA is available.	2020-01-24 10:13:52 -08:00
Andy Kaylor	a33accde95	[PGO] Early detection regarding whether pgo counter promotion is possible Patch by Chris Chrulski This fixes a problem with the current behavior when assertions are enabled. A loop that exits to a catchswitch instruction is skipped for the counter promotion, however this check was being done after the PGOCounterPromoter tried to collect an insertion point for the exit block. A call to getFirstInsertionPt() on a block that begins with a catchswitch instruction triggers an assertion. This change performs a check whether the counter promotion is possible prior to collecting the ExitBlocks and InsertPts. Differential Revision: https://reviews.llvm.org/D73222	2020-01-24 09:55:41 -08:00
Fangrui Song	50a3ff30e1	[PatchableFunction] Allow empty entry MachineBasicBlock Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D73301	2020-01-24 09:42:48 -08:00
David Green	b535aa405a	[ARM] Use reduction intrinsics for larger than legal reductions The codegen for splitting a llvm.vector.reduction intrinsic into parts will be better than the codegen for the generic reductions. This will only directly effect when vectorization factors are specified by the user. Also added tests to make sure the codegen for larger reductions is OK. Differential Revision: https://reviews.llvm.org/D72257	2020-01-24 17:07:24 +00:00
Fangrui Song	f1dab29908	[ELF][PowerPC] Support R_PPC_COPY and R_PPC64_COPY Reviewed By: Bdragon28, jhenderson, grimar, sfertile Differential Revision: https://reviews.llvm.org/D73255	2020-01-24 09:06:20 -08:00
Kazushi (Jam) Marukawa	0fca35c652	[VE] global variable isel patterns Summary: Asm expr fixups, isel patterns and tests for global variables addresses. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73355	2020-01-24 17:35:14 +01:00
Tom Weaver	f5147765ba	[DebugInfo][LiveDebugValues] Teach Live Debug Values About Meta Instructions Previously LiveDebugValues pass would consider meta instructions that 'fiddle' with liveness of registers as register definitions when transfering register defs. This would mean that, for example, a KILL instruction would cause LiveDebugValues to terminate the range of an earlier DBG_VALUE instruction resulting in the none propogation of said DBG_VALUE instructions into later blocks. This patch adds the check and a helpful comment, fixes a test that previously tested for the broken behaviour by coincidence and adds a test specifically for this. reviewers: vsk, dstenb, djtodoro Differential Revision: https://reviews.llvm.org/D73210	2020-01-24 16:29:05 +00:00
Simon Pilgrim	3fd5d1c6e7	[X86][SSE] combineTargetShuffle - permilps(shufps(load(),x)) --> permilps(shufps(x,load())) Moves lowerShuffleWithSHUFPS commutation code from rG30fcd29fe479 to catch cases during combine	2020-01-24 15:23:20 +00:00
Sergey Dmitriev	f69eba0772	[llvm-objcopy][COFF] Add support for --set-section-flags Reviewers: jhenderson, MaskRay, alexshap, rupprecht, mstorsjo Reviewed By: jhenderson Subscribers: abrachet, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73107	2020-01-24 07:12:55 -08:00
Kazushi (Jam) Marukawa	08ebd8c79e	[VE] aligned load/store isel patterns Summary: Aligned load/store isel patterns and tests for i1/i8/16/32/64 (including extension and truncation) and fp32/64. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73276	2020-01-24 15:16:54 +01:00
Thomas Preud'homme	8e96697c7d	FileCheck [9/12]: Add support for matching formats Summary: This patch is part of a patch series to add support for FileCheck numeric expressions. This specific patch adds support for selecting a matching format to match a numeric value against (ie. decimal, hex lower case letters or hex upper case letters). This commit allows to select what format a numeric value should be matched against. The following formats are supported: decimal value, lower case hex value and upper case hex value. Matching formats impact both the format of numeric value to be matched as well as the format of accepted numbers in a definition with empty numeric expression constraint. Default for absence of format is decimal value unless the numeric expression constraint is non null and use a variable in which case the format is the one used to define that variable. Conclict of format in case of several variable being used is diagnosed and forces the user to select a matching format explicitely. This commit also enables immediates in numeric expressions to be in any radix known to StringRef's GetAsInteger method, except for legacy numeric expressions (ie [[@LINE+<offset>]] which only support decimal immediates. Copyright: - Linaro (changes up to diff 183612 of revision D55940) - GraphCore (changes in later versions of revision D55940 and in new revision created off D55940) Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson Reviewed By: jhenderson, arichardson Subscribers: daltenty, MaskRay, hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, kristina, hfinkel, rogfer01, JonChesterfield Tags: #llvm Differential Revision: https://reviews.llvm.org/D60389	2020-01-24 14:15:28 +00:00
Victor Huang	5cee34013c	[PowerPC][Future] Add prefixed instruction paddi to future CPU Future CPU will include support for prefixed instructions. These prefixed instructions are formed by a 4 byte prefix immediately followed by a 4 byte instruction effectively making an 8 byte instruction. The new instruction paddi is a prefixed form of addi. This patch adds paddi and all of the support required for that instruction. The majority of the patch deals with supporting the new prefixed instructions. The addition of paddi is mainly to allow for testing. Differential Revision: https://reviews.llvm.org/D72569	2020-01-24 07:27:25 -06:00
Simon Pilgrim	5e62e162cd	[X86][SSE] Add another shufps+shufps test for fold through commutation	2020-01-24 12:04:11 +00:00
Simon Pilgrim	30fcd29fe4	[X86][SSE] lowerShuffleWithSHUFPS - commute '2V1+2V2 elements' mask if it allows a loaded fold As mentioned on D73023.	2020-01-24 12:04:10 +00:00
Guillaume Chatelet	805c157e8a	[Alignment][NFC] Deprecate Align::None() Summary: This is a follow up on https://reviews.llvm.org/D71473#inline-647262. There's a caveat here that `Align(1)` relies on the compiler understanding of `Log2_64` implementation to produce good code. One could use `Align()` as a replacement but I believe it is less clear that the alignment is one in that case. Reviewers: xbolva00, courbet, bollu Subscribers: arsenm, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, jrtc27, atanasyan, jsji, Jim, kerbowa, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D73099	2020-01-24 12:53:58 +01:00
Clement Courbet	2ee218f365	[llvm-exegesis][NFC] Simplify code. Summary: What we're redoing already exists in the X86 backend, it's called `X86II::getOperandBias`. Reviewers: gchatelet Subscribers: tschuett, mstojanovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73340	2020-01-24 12:45:20 +01:00
Georgii Rymar	1af6209d64	[llvm-readelf] - Improve dumping of objects without a section header string table. We have a test/Object/no-section-header-string-table.test which checks what happens when an object does not have a section header string table. It does not check the full output though. Currently our output is different from GNU readelf, because the latter prints "<no-strings>" instead of a section name, while we print nothing. This patch fixes this, adds a proper test case and removes the one from test/Object, as it is not a right folder for llvm-readelf tests. Differential revision: https://reviews.llvm.org/D73193	2020-01-24 14:30:03 +03:00
Sam McCall	7d20e80225	[clangd] Show background index status using LSP 3.15 work-done progress notifications Summary: It simply shows the completed/total items on the background queue, e.g. indexing: 233/1000 The denominator is reset to zero every time the queue goes idle. The protocol is fairly complicated here (requires creating a remote "progress" resource before sending updates). We implement the full protocol, but I've added an extension allowing it to be skipped to reduce the burden on clients - in particular the lit test takes this shortcut. The addition of background index progress to DiagnosticConsumer seems ridiculous at first glance, but I believe that interface is trending in the direction of "ClangdServer callbacks" anyway. It's due for a rename, but otherwise actually fits. Reviewers: kadircet, usaxena95 Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, jfb, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D73218	2020-01-24 12:21:08 +01:00
Simon Pilgrim	e37cdbeeab	[X86][SSE] Add shufps+shufps test for fold through commutation As mentioned on D73023, lowerShuffleWithSHUFPS should be able to commute the shufps inputs to fold the second arg as it will then permute the shufps result anyway.	2020-01-24 11:16:44 +00:00
Sam Parker	0ae13766ff	[NFC][ARM] Add test	2020-01-24 11:00:18 +00:00
Kerry McLaughlin	4c4861b577	[AArch64][SVE] Add intrinsics for FFR manipulation Summary: Implements the following intrinsics: - llvm.aarch64.sve.setffr - llvm.aarch64.sve.rdffr - llvm.aarch64.sve.rdffr.z - llvm.aarch64.sve.wrffr Reviewers: sdesmalen, efriedma, dancgr, rengolin Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cameron.mcinally, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73097	2020-01-24 10:58:12 +00:00
Simon Pilgrim	0b45c2264a	[SelectionDAG] rot(x, y) --> x iff ComputeNumSignBits(x) == BitWidth(x) Rotating an 0/-1 value by any amount will always result in the same 0/-1 value	2020-01-24 10:35:57 +00:00
James Henderson	0298a87511	[doc][llvm-objcopy] Remove redundant COFF-specific sub-heading The sub-heading used to contain the --only-keep-debug switch as that switch wasn't implemented for ELF at one point. Since the switch is now in the generic options section, and there are no other options in this sub-heading, it is pointless and can be deleted.	2020-01-24 10:25:39 +00:00
Sam Parker	ddbc077895	[NFC][ARM] Make some params members instead. Add MachineLoopInfo and ReachingDefAnalysis as members of LowOverheadLoop instead of passing them several times to different methods.	2020-01-24 10:19:17 +00:00
Simon Pilgrim	7e12325586	[X86] Add test showing failure to remove vector rotate of allsignbits Rotating an 0/-1 value by any amount will always result in the same 0/-1 value	2020-01-24 10:12:51 +00:00
Djordje Todorovic	035c106f37	[DWARF][test] Test all the call-site realted attrs Adding the test for the call site encoding in DWARF5 vs GNU extensions. Some of the attributes were not covered by any test. Differential Revision: https://reviews.llvm.org/D73266	2020-01-24 10:53:52 +01:00
Fangrui Song	daabc9a028	[WholeProgramDevirt][test] Fix test after D73094	2020-01-24 00:46:18 -08:00
Evgeny Leviant	8973fae195	[WPD] Allow load/save bitcoded index when running opt -wholeprogramdevirt Differential revision: https://reviews.llvm.org/D73094	2020-01-24 00:31:39 -08:00

1 2 3 4 5 ...

190660 Commits