llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	e0110a4740	[RISCV] Add intrinsics for vfmv.v.f Also include a special case pattern to use vmv.v.x vd, zero when the argument is 0.0. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D93672	2020-12-23 10:50:48 -08:00
Evgeniy Brevnov	9fb074e7bb	[BPI] Improve static heuristics for "cold" paths. Current approach doesn't work well in cases when multiple paths are predicted to be "cold". By "cold" paths I mean those containing "unreachable" instruction, call marked with 'cold' attribute and 'unwind' handler of 'invoke' instruction. The issue is that heuristics are applied one by one until the first match and essentially ignores relative hotness/coldness of other paths. New approach unifies processing of "cold" paths by assigning predefined absolute weight to each block estimated to be "cold". Then we propagate these weights up/down IR similarly to existing approach. And finally set up edge probabilities based on estimated block weights. One important difference is how we propagate weight up. Existing approach propagates the same weight to all blocks that are post-dominated by a block with some "known" weight. This is useless at least because it always gives 50\50 distribution which is assumed by default anyway. Worse, it causes the algorithm to skip further heuristics and can miss setting more accurate probability. New algorithm propagates the weight up only to the blocks that dominates and post-dominated by a block with some "known" weight. In other words, those blocks that are either always executed or not executed together. In addition new approach processes loops in an uniform way as well. Essentially loop exit edges are estimated as "cold" paths relative to back edges and should be considered uniformly with other coldness/hotness markers. Reviewed By: yrouban Differential Revision: https://reviews.llvm.org/D79485	2020-12-23 22:47:36 +07:00
David Penry	a9f14cdc62	[ARM] Add bank conflict hazarding Adds ARMBankConflictHazardRecognizer. This hazard recognizer looks for a few situations where the same base pointer is used and then checks whether the offsets lead to a bank conflict. Two parameters are also added to permit overriding of the target assumptions: arm-data-bank-mask=<int> - Mask of bits which are to be checked for conflicts. If all these bits are equal in the offsets, there is a conflict. arm-assume-itcm-bankconflict=<bool> - Assume that there will be bank conflicts on any loads to a constant pool. This hazard recognizer is enabled for Cortex-M7, where the Technical Reference Manual states that there are two DTCM banks banked using bit 2 and one ITCM bank. Differential Revision: https://reviews.llvm.org/D93054	2020-12-23 14:00:59 +00:00
Simon Moll	c3acda0798	[VE] Vector 'and' isel and tests Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D93709	2020-12-23 13:29:29 +01:00
Sebastian Neubauer	221fdedc69	[AMDGPU][GlobalISel] Fold flat vgpr + constant addresses Use getPtrBaseWithConstantOffset in selectFlatOffsetImpl to fold more vgpr+constant addresses. Differential Revision: https://reviews.llvm.org/D93692	2020-12-23 10:40:30 +01:00
ShihPo Hung	6301871d06	[RISCV] Add intrinsics for vfwmacc, vfwnmacc, vfwmsac, vfwnmsac instructions This patch defines vfwmacc, vfwnmacc, vfwmsc, vfwnmsac intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93693	2020-12-23 00:42:04 -08:00
Zakk Chen	032600b9ae	[RISCV] Define vmerge/vfmerge intrinsics. Define vmerge/vfmerge intrinsics and lower to V instructions. Include support for vector-vector vfmerge by vmerge.vvm. We work with @rogfer01 from BSC to come out this patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93674	2020-12-23 00:07:09 -08:00
Evandro Menezes	4d47944393	[RISCV] Define the vfmin, vfmax RVV intrinsics Define the vfmin, vfmax IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93673	2020-12-23 00:27:38 -06:00
Matt Arsenault	bac54639c7	AMDGPU: Add spilled CSR SGPRs to entry block live ins	2020-12-22 21:55:59 -05:00
ShihPo Hung	ad0a7ad950	[RISCV] Add intrinsics for vf[n]macc/vf[n]msac/vf[n]madd/vf[n]msub instructions This patch defines vfmadd/vfnmacc, vfmsac/vfnmsac, vfmadd/vfnmadd, and vfmsub/vfnmsub lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93691	2020-12-22 18:34:00 -08:00
ShihPo Hung	4268783998	[RISCV] Add intrinsics for vwmacc[u\|su\|us] instructions This patch defines vwmacc[u\|su\|us] intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93675	2020-12-22 18:17:39 -08:00
ShihPo Hung	c8874464b5	[RISCV] Add intrinsics for vslide1up/down, vfslide1up/down instruction This patch adds intrinsics for vslide1up, vslide1down, vfslide1up, vfslide1down. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93608	2020-12-22 18:14:22 -08:00
Matt Arsenault	29ed846d67	AMDGPU: Fix assert when checking for implicit operand legality	2020-12-22 20:56:24 -05:00
Stanislav Mekhanoshin	d15119a02d	[AMDGPU][GlobalISel] GlobalISel for flat scratch It does not seem to fold offsets but this is not specific to the flat scratch as getPtrBaseWithConstantOffset() does not return the split for these tests unlike its SDag counterpart. Differential Revision: https://reviews.llvm.org/D93670	2020-12-22 16:33:06 -08:00
Stanislav Mekhanoshin	ca4bf58e4e	[AMDGPU] Support unaligned flat scratch in TLI Adjust SITargetLowering::allowsMisalignedMemoryAccessesImpl for unaligned flat scratch support. Mostly needed for global isel. Differential Revision: https://reviews.llvm.org/D93669	2020-12-22 16:12:31 -08:00
Thomas Lively	a781a706b9	[WebAssembly][SIMD] Rename shuffle, swizzle, and load_splats These instructions previously used prefixes like v8x16 to signify that they were agnostic between float and int interpretations. We renamed these instructions to remove this form of prefix in https://github.com/WebAssembly/simd/issues/297 and https://github.com/WebAssembly/simd/issues/316 and this commit brings the names in LLVM up to date. Differential Revision: https://reviews.llvm.org/D93722	2020-12-22 14:29:06 -08:00
Stanislav Mekhanoshin	ae8f4b2178	[AMDGPU] Folding of FI operand with flat scratch Differential Revision: https://reviews.llvm.org/D93501	2020-12-22 10:48:04 -08:00
Paul Walker	be85b3e432	Fix some misnamed variables in sve-fixed-length-int-minmax.ll.	2020-12-22 17:11:23 +00:00
Kamau Bridgeman	8a58f21f5b	[PowerPC][Power10] Exploit store rightmost vector element instructions Using the store rightmost vector element instructions to do vector element extraction and store. The rightmost vector element on little endian is the zeroth vector element, with these patterns that element can be extracted and stored in one instruction for all vector types. Differential Revision: https://reviews.llvm.org/D89195	2020-12-22 12:06:43 -05:00
Paul Walker	8eec7294fe	[SVE] Lower vector BITREVERSE and BSWAP operations. These operations are lowered to RBIT and REVB instructions respectively. In the case of fixed-length support using SVE we also lower BITREVERSE operating on NEON sized vectors as this results in fewer instructions. Differential Revision: https://reviews.llvm.org/D93606	2020-12-22 16:49:50 +00:00
Nandor Licker	0586f048d7	[RISCV] Basic jump table lowering This patch enables jump table lowering in the RISC-V backend. In addition to the test case included, the new lowering was tested by compiling the OCaml runtime and running it under qemu. Differential Revision: https://reviews.llvm.org/D92097	2020-12-22 15:05:54 +00:00
Sjoerd Meijer	b9b62c2867	[AArch64] Add a test for MachineLICM SinkIntoLoop. NFC.	2020-12-22 12:22:24 +00:00
Nico Weber	00065d5cbd	Revert "-fstack-clash-protection: Return an actual error when used on unsupported OS" This reverts commit `4d59c8fdb9`. Breaks tens of thousands of tests, and had pending review comments, see comments on https://reviews.llvm.org/D92245 (and e.g. http://lab.llvm.org:8011/#/builders/109/builds/5236 for failures).	2020-12-22 06:51:19 -05:00
Nemanja Ivanovic	ba1202a1e4	[PowerPC] Restore stack ptr from base ptr when available On subtargets that have a red zone, we will copy the stack pointer to the base pointer in the prologue prior to updating the stack pointer. There are no other updates to the base pointer after that. This suggests that we should be able to restore the stack pointer from the base pointer rather than loading it from the back chain or adding the frame size back to either the stack pointer or the frame pointer. This came about because functions that call setjmp need to restore the SP from the FP because the back chain might have been clobbered (see https://reviews.llvm.org/D92906). However, if the stack is realigned, the restored SP might be incorrect (which is what caused the failures in the two ASan test cases). This patch was tested quite extensivelly both with sanitizer runtimes and general code. Differential revision: https://reviews.llvm.org/D93327	2020-12-22 05:44:03 -06:00
Sylvestre Ledru	4d59c8fdb9	-fstack-clash-protection: Return an actual error when used on unsupported OS $ clang-12: error: -fstack-clash-protection is not supported on Windows or Mac OS X Differential Revision: https://reviews.llvm.org/D92245	2020-12-22 12:06:08 +01:00
Hsiangkai Wang	9a8ef927df	[RISCV] Define vector compare intrinsics. Define vector compare intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93368	2020-12-22 14:08:18 +08:00
Zakk Chen	7a2c8be641	[RISCV] Define vleff intrinsics. Define vleff intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93516	2020-12-21 22:05:38 -08:00
Bing1 Yu	e8ade4569b	[LegalizeType] When LegalizeType procedure widens a masked_gather, set MemoryType's EltNum equal to Result's EltNum When LegalizeType procedure widens a masked_gather, set MemoryType's EltNum equal to Result's EltNum. As I mentioned in https://reviews.llvm.org/D91092, in previous code, If we have a v17i32's masked_gather in avx512, we widen it to a v32i32's masked_gather with a v17i32's MemoryType. When the SplitVecRes_MGATHER process this v32i32's masked_gather, GetSplitDestVTs will assert fail since what you are going to split is v17i32. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93610	2020-12-22 13:27:38 +08:00
ShihPo Hung	b15ba2cf6f	[RISCV] Add intrinsics for vmacc/vnmsac/vmadd/vnmsub instructions This defines vmadd, vmacc, vnmsub, and vnmsac intrinsics and lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93632	2020-12-21 17:37:20 -08:00
Craig Topper	704981b437	[RISCV] Update vmv.v.v-rv32.ll and vmv.v.v-rv64.ll to test the correct intrinsics. These were accidentally identical to the vmv.v.x tests. I must have fumbled when I copied them from our downstream repo.	2020-12-21 16:27:09 -08:00
Evandro Menezes	ed73a78924	[RISCV] Define the vand, vor and vxor RVV intrinsics Define the `vand`, `vor` and `vxor` IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93574	2020-12-21 16:20:26 -06:00
Fangrui Song	d33abc337c	Migrate MCContext::createTempSymbol call sites to AlwaysAddSuffix=true Most call sites set AlwaysAddSuffix to true. The two use cases do not really need false and can be more consistent with other temporary symbol usage.	2020-12-21 14:04:13 -08:00
Kazushi (Jam) Marukawa	8c2ad9e85f	[VE] Correct VMP allocation in calling conv VE used to allocate VM1, VM2, VMP2 (VM4+VM5), and VM3. This patch corrects to allocate VM1, VM2, VMP2 (VM4+VM5), and VM6. Also add a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93570	2020-12-21 22:42:24 +09:00
Denis Antrushin	6f45049fb6	[Statepoints] Disable VReg lowering for values used on exception path of invoke. Currently we lower invokes the same way as usual calls, e.g.: V1 = STATEPOINT ... V (tied-def 0) But this is incorrect is V1 is used on exceptional path. By LLVM rules V1 neither dominates its uses in landing pad, nor its live range is live on entry to landing pad. So compiler is allowed to do various weird transformations like splitting live range after statepoint and use split LR in catch block. Until (and if) we find better solution to this problem, let's use old lowering (spilling) for those values which are used on exceptional path and allow VReg lowering for values used only on normal path. Differential Revision: https://reviews.llvm.org/D93449	2020-12-21 20:27:05 +07:00
Kazushi (Jam) Marukawa	a3a896d1cd	[VE] Optimize LEA combinations Change to optimize references of elements of aggregate data. Also add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93627	2020-12-21 22:21:10 +09:00
Kazushi (Jam) Marukawa	5e273b845b	[VE] Support STACKSAVE and STACKRESTORE Change to use default expanded code. Add regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93539	2020-12-21 20:15:50 +09:00
Kazushi (Jam) Marukawa	d99e4a4840	[VE] Support RETURNADDR Implement RETURNADDR for VE. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93545	2020-12-21 20:06:03 +09:00
Fangrui Song	1635dea266	[AsmPrinter] Replace a reachable report_fatal_error with MCContext::reportError	2020-12-20 23:45:49 -08:00
Fangrui Song	8ffda237a6	MCContext::reportError: don't call report_fatal_error Errors from MCAssembler, MCObjectStreamer and *ObjectWriter typically cause a crash: ``` % cat c.c int bar; extern int foo __attribute__((alias("bar"))); % clang -c -fcommon c.c fatal error: error in backend: Common symbol 'bar' cannot be used in assignment expr PLEASE submit a bug report to ... Stack dump: ... ``` `LLVMTargetMachine::addPassesToEmitFile` constructs `MachineModuleInfoWrapperPass` which creates a MCContext without SourceMgr. `MCContext::reportError` calls `report_fatal_error` which gets captured by Clang `LLVMErrorHandler` and gets translated to the output above. Since `MCContext::reportError` errors indicate user errors, such a crashing style error is inappropriate. So this patch changes `report_fatal_error` to `SourceMgr().PrintMessage`. ``` % clang -c -fcommon c.c <unknown>:0: error: Common symbol 'bar' cannot be used in assignment expr ``` Ideally we should at least recover the original filename (the line information is generally lost). That requires general improvement to MC diagnostics, because currently in many cases SMLoc information is lost.	2020-12-20 23:23:12 -08:00
Pushpinder Singh	e2303a448e	[FastRA] Fix handling of bundled MIs Fast register allocator skips bundled MIs, as the main assignment loop uses MachineBasicBlock::iterator (= MachineInstrBundleIterator) This was causing SIInsertWaitcnts to crash which expects all instructions to have registers assigned. This patch makes sure to set everything inside bundle to the same assignments done on BUNDLE header. Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D90369	2020-12-21 02:10:55 -05:00
Monk Chiang	3183add534	[RISCV] Define the remaining vector fixed-point arithmetic intrinsics. This patch base on D93366, and define vector fixed-point intrinsics. 1. vaaddu/vaadd/vasubu/vasub 2. vsmul 3. vssrl/vssra 4. vnclipu/vnclip We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93508	2020-12-20 22:57:07 -08:00
Esme-Yi	29eb3dcfe6	[PowerPC] Materialize i64 constants by enumerated patterns. Summary: Some constants can be handled with less instructions than our current results. And it seems our original approach is not very easy to extend. Therefore this patch proposes to materialize all 64-bit constants by enumerated patterns. I traversed almost all constants to verified the functionality of these pattens. A traversed comparison of the number of instructions used by the original method and the new method has also been completed, where no degradation was caused by this patch. This patch also passed Bootstrap test and SPEC test. Improvements of this patch are shown in llvm/test/CodeGen/PowerPC/constants-i64.ll Reviewed By: steven.zhang, stefanp Differential Revision: https://reviews.llvm.org/D92089	2020-12-21 05:21:07 +00:00
Ben Shi	9e4b682baf	[RISCV][NFC] Add tests for multiplication with constant Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93589	2020-12-21 10:20:36 +08:00
Chen Zheng	564066524a	[PowerPC] add has side effect for SAT bit clobber intrinsics/instructions This patch does two things: 1: fix the typo that intrinsic mfvscr should be with no readmem property 2: since VSCR is not modeled yet, add has side effect for SAT bit clobber intrinsics/instructions. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D90807	2020-12-20 19:48:26 -05:00
Chen Zheng	4dce7c2e20	[MachineLICM] delete dead flag if the duplicated def outside of loop is dead. Fixup dead flags for CSE-ed instructions. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D92557	2020-12-20 19:26:22 -05:00
David Green	f47bac5dd2	[ARM] Extra vecreduce tests with smaller than legal types. NFC	2020-12-20 21:20:39 +00:00
ShihPo Hung	d86a00d8fe	[RISCV] Define vslideup/vslidedown intrinsics Differential Revision: https://reviews.llvm.org/D93286	2020-12-20 05:08:15 -08:00
Hsiangkai Wang	41ab45d662	[RISCV] Define vector vfwmul intrinsics. Define vector vfwmul intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93584	2020-12-20 17:39:20 +08:00
Hsiangkai Wang	f86e61d886	[RISCV] Define vector vfwadd/vfwsub intrinsics. Define vector vfwadd/vfwsub intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93583	2020-12-20 17:39:13 +08:00
Hsiangkai Wang	bd576ac8d4	[RISCV] Define vector vfsgnj/vfsgnjn/vfsgnjx intrinsics. Define vector vfsgnj/vfsgnjn/vfsgnjx intrinsics and lower them to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93581	2020-12-20 17:39:04 +08:00

1 2 3 4 5 ...

36964 Commits