llvm-project

Commit Graph

Author	SHA1	Message	Date
David Green	eecba95067	[ARM] Replace arm vendor with none. NFC	2020-04-22 18:19:35 +01:00
Victor Huang	02141a17ae	[PowerPC][Future] Remove redundant r2 save and restore for indirect call Currently an indirect call produces the following sequence on PCRelative mode: extern void function( ); extern void (ptrfunc) ( ); void g() { ptrfunc=function; } void f() { (ptrfunc) ( ); } Producing paddi 3, 0, .LC0@PCREL, 1 ld 3, 0(3) std 2, 24(1) ld 12, 0(3) mtctr 12 bctrl ld 2, 24(1) Though the caller does not use or preserve r2, it is still saved and restored across a function call. This patch is added to remove these redundant save and restores for indirect calls. Differential Revision: https://reviews.llvm.org/D77749	2020-04-22 12:05:51 -05:00
Johannes Doerfert	68a27587c2	[OpenMP][FIX] Do not use InaccessibleMemOrArgMemOnly for barrier and flush This was reported as PR45635, committed first as `72a9e7c926`, reverted by `188f5cde96`, and now recommitted with the test change.	2020-04-22 11:10:54 -05:00
Mark Murray	3df8135286	[ARM][MC][Thumb] Recommit: Revert relocation for some pc-relative fixups. Summary: This commit recommits the reversion of https://reviews.llvm.org/D75039. Concensus appears to be in favour of assembly-time resolution of these ADR and LDR relocations, in line with GNU. The previous backout broke many lld tests, now fixed by Peter Smith in `61bccda9d9`. Reviewers: psmith Subscribers: kristof.beyls, hiraditya, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78301	2020-04-22 16:54:26 +01:00
Victor Huang	43abef06f4	[PowerPC][Future] Initial support for PCRel addressing for jump tables. Add initial support for PC Relative addressing to get jump table base address instead of using TOC. Differential Revision: https://reviews.llvm.org/D75931	2020-04-22 10:45:01 -05:00
Dmitry Vyukov	5a2c31116f	[TSAN] Add optional support for distinguishing volatiles Add support to optionally emit different instrumentation for accesses to volatile variables. While the default TSAN runtime likely will never require this feature, other runtimes for different environments that have subtly different memory models or assumptions may require distinguishing volatiles. One such environment are OS kernels, where volatile is still used in various places for various reasons, and often declare volatile to be "safe enough" even in multi-threaded contexts. One such example is the Linux kernel, which implements various synchronization primitives using volatile (READ_ONCE(), WRITE_ONCE()). Here the Kernel Concurrency Sanitizer (KCSAN) [1], is a runtime that uses TSAN instrumentation but otherwise implements a very different approach to race detection from TSAN. While in the Linux kernel it is generally discouraged to use volatiles explicitly, the topic will likely come up again, and we will eventually need to distinguish volatile accesses [2]. The other use-case is ignoring data races on specially marked variables in the kernel, for example bit-flags (here we may hide 'volatile' behind a different name such as 'no_data_race'). [1] https://github.com/google/ktsan/wiki/KCSAN [2] https://lkml.kernel.org/r/CANpmjNOfXNE-Zh3MNP=-gmnhvKbsfUfTtWkyg_=VqTxS4nnptQ@mail.gmail.com Author: melver (Marco Elver) Reviewed-in: https://reviews.llvm.org/D78554	2020-04-22 17:27:09 +02:00
Roman Lebedev	a70d2ab323	[NFC][InstCombine] Tests for negation of sign-/zero- extensions * sext of non-positive can be negated. * zext of non-negative can be negated.	2020-04-22 17:37:42 +03:00
jasonliu	bcca6ae3cd	[llvm-objdump][XCOFF] Print more symbol info in relocation Summary: Print more symbol info in relocation printing when --symbol-description is specified. Differential Revision: https://reviews.llvm.org/D78499	2020-04-22 13:52:08 +00:00
John Brawn	8211cfb7c8	[ARM] Don't shrink STM if it would cause an unknown base register store If a 16-bit thumb STM with writeback stores the base register but it isn't the first register in the list, then an unknown value is stored. The load/store optimizer knows this and generates a 32-bit STM without writeback instead, but thumb2 size reduction converts it into a 16-bit STM. Fix this by having thumb2 size reduction notice such STMs and leave them as they are. Differential Revision: https://reviews.llvm.org/D78493	2020-04-22 14:50:42 +01:00
Jay Foad	1f32e7367c	[AMDGPU] Fix test failures caused by `dbdffe3ee9`.	2020-04-22 14:19:21 +01:00
David Green	892af45c86	[ARM] Distribute MVE post-increments This adds some extra processing into the Pre-RA ARM load/store optimizer to detect and merge MVE loads/stores and adds of the same base. This we don't always turn into a post-inc during ISel, and due to the nature of it being a graph we don't always know an order to use for the nodes, not knowing which nodes to make post-inc and which to use the new post-inc of. After ISel, we have an order that we can use to post-inc the following instructions. So this looks for a loads/store with a starting offset of 0, and an add/sub from the same base, plus a number of other loads/stores. We then do some checks and convert the zero offset load/store into a postinc variant. Any loads/stores after it have the offset subtracted from their immediates. For example: LDR #4 LDR #4 LDR #0 LDR_POSTINC #16 LDR #8 LDR #-8 LDR #12 LDR #-4 ADD #16 It only handles MVE loads/stores at the moment. Normal loads/store will be added in a followup patch, they just have some extra details to ensure that we keep generating LDRD/LDM successfully. Differential Revision: https://reviews.llvm.org/D77813	2020-04-22 14:16:51 +01:00
Sanjay Patel	6f19f0fb9a	[InstCombine] add tests for min/max FP intrinsics with FMF (PR45478); NFC https://bugs.llvm.org/show_bug.cgi?id=45478	2020-04-22 08:43:40 -04:00
David Green	48ac4e6938	[ARM] MVE FMA loop tests. NFC	2020-04-22 13:27:40 +01:00
Roman Lebedev	67266d879c	[InstCombine] Negator: shufflevector is negatible All these folds are correct as per alive-tv	2020-04-22 15:14:23 +03:00
Roman Lebedev	4d44ce7437	[NFC][InstCombine] Add shuffle negation tests	2020-04-22 15:14:23 +03:00
Jay Foad	dbdffe3ee9	[AMDGPU] Add 192-bit register classes Differential Revision: https://reviews.llvm.org/D78312	2020-04-22 13:10:37 +01:00
James Henderson	e9aac2c3ef	[llvm-objdump] Look in all viable sections for call/branch targets Prior to this patch, llvm-objdump would only look in the last section (according to the section header table order) that matched an address for a symbol when identifying the target symbol of a call or branch operation. If there are multiple sections with the same address, due to some of them being empty, it did not look in those, even if the symbol couldn't be found in the first section looked in. This patch causes llvm-objdump to look in all sections for possible candidate symbols. If there are multiple possible symbols, it picks one from a non-empty section, if possible (as that is more likely to be the "real" symbol since functions can't really be in emptiy sections), before falling back to those in empty sections. If all else fails, it falls back to absolute symbols as it did before. Differential Revision: https://reviews.llvm.org/D78549 Reviewed by: grimar, Higuoxing	2020-04-22 12:28:30 +01:00
Lucas Prates	727e6fb84a	[NFC][llvm][X86] Adding missing -mtiple to X86 test. The modified test was missing the specification of the intended triple in its run line, assuming X86 is the default.	2020-04-22 11:55:57 +01:00
Kerry McLaughlin	17f6e18acf	[AArch64][SVE] Add SVE intrinsic for LD1RQ Summary: Adds the following intrinsic for contiguous load & replicate: - @llvm.aarch64.sve.ld1rq The LD1RQ intrinsic only needs the SImmS16XForm added by this patch. The others (SImmS2XForm, SImmS3XForm & SImmS4XForm) were added for consistency. Reviewers: andwar, sdesmalen, efriedma, cameron.mcinally, dancgr, rengolin Reviewed By: sdesmalen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76929	2020-04-22 11:29:27 +01:00
Georgii Rymar	2bf5674317	[yaml2obj] - Program headers: add an additional check for `Offset` The `Offset` field is used to set the file offset of a program header. In a normal object it should not be greater than the minimal offset of sections included into segment. This patch adds a check for that and adds tests. Differential revision: https://reviews.llvm.org/D78304	2020-04-22 12:49:05 +03:00
Georgii Rymar	317c4913c6	[obj2yaml] - Fix the issue with dumping empty sections when dumping program headers. Imagine we have: ``` ProgramHeaders: - Type: PT_LOAD Flags: [ PF_W, PF_R ] Sections: - Section: .bar VAddr: 0x2000 Sections: - Name: .foo Type: SHT_PROGBITS Flags: [ SHF_ALLOC, SHF_EXECINSTR ] Address: 0x1000 - Name: .bar Type: SHT_PROGBITS Flags: [ SHF_ALLOC, SHF_EXECINSTR ] Address: 0x2000 ``` Both `.foo` and `.bar` share the same starting file offset, but `VA(.foo)` < `VA(PT_LOAD)`, we should not include it into segment. This patch fixes the issue. Differential revision: https://reviews.llvm.org/D77652	2020-04-22 12:36:00 +03:00
Sam Parker	04ef154124	[NFC] Test changes Add some more targets for the ARM cost model tests and add some tests for icmps and bitcasts.	2020-04-22 08:28:52 +01:00
aartbik	5397f29087	[llvm] [X86] Make test more robust against different builds Summary: Rationale: Using the --debug-only flag requires a debug build. Also, the debug output is not always consistent over different builds. This change avoids all problems by just testing the generated assembly for AVX. Reviewers: craig.topper, mehdi_amini, nicolasvasilache Reviewed By: craig.topper Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78609	2020-04-22 00:23:46 -07:00
Qiu Chaofan	c12722cde8	[PowerPC] Exploit RLDIMI for OR with large immediates This patch exploits rldimi instruction for patterns like `or %a, 0b000011110000`, which saves number of instructions when the operand has only one use, compared with `li-ori-sldi-or`. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D77850	2020-04-22 14:16:52 +08:00
Sameer Sahasrabuddhe	5a7a6382bc	FixIrreducible: don't crash when moving a child loop Summary: When an irreducible SCC is converted into a new natural loop, existing loops included in that SCC now become children of the new loop. The logic that moves these loops from the parent loop to the new loop invoked undefined behaviour when it modified the container that it was iterating over. Fixed this by first extracting all the loops that are to be removed from the parent. Fixes bug 45623. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D78544	2020-04-22 07:47:30 +05:30
Johannes Doerfert	46b7ed0e6f	[Attributor] Remove dependence edges eagerly If we have a dependence between an abstract attribute A to an abstract attribute B such hat changes in A should trigger an update of B, we do not need to keep the dependence around once the update was triggered. If the dependence is still required the update will reinsert it into the dependence map, if it is not we avoid triggering B in the future. This replaces the "recompute interval" mechanism we used before to prune stale dependences. Number of required iterations is generally down, compile time for the module pass (not really the CGSCC pass) is down quite a bit. There is one test change which looks like an artifact in the undefined behavior AA that needs to be looked at.	2020-04-21 15:22:10 -05:00
Johannes Doerfert	e2b53a4c05	[Attributor][NFC] Remove obsolete option from tests Since D76871 it is sufficient to run `opt -atributor` or `-attributor-cgscc`.	2020-04-21 15:22:10 -05:00
Eli Friedman	704293b168	[ARM] Fix MIR tests with invalid live-ins. A register can't be live if it isn't defined; fix issues in various testcases. Differential Revision: https://reviews.llvm.org/D78529	2020-04-21 12:13:35 -07:00
Eli Friedman	b4b9faa120	[AArch64] Fix MIR tests with invalid live-ins. A register can't be live if it isn't defined; fix issues in various testcases. Differential Revision: https://reviews.llvm.org/D78531	2020-04-21 12:13:32 -07:00
Fangrui Song	c5d38924dc	[XRay] xray_fn_idx: set SHF_WRITE to avoid text relocations In a future change we should properly fix xray_fn_idx to use PC-relative addresses as well, but for now let's keep absolute addresses until sled addresses are all fixed.	2020-04-21 12:02:29 -07:00
Roman Lebedev	352fef3f11	[InstCombine] Negator - sink sinkable negations Summary: As we have discussed previously (e.g. in D63992 / D64090 / [[ https://bugs.llvm.org/show_bug.cgi?id=42457 \| PR42457 ]]), `sub` instruction can almost be considered non-canonical. While we do convert `sub %x, C` -> `add %x, -C`, we sparsely do that for non-constants. But we should. Here, i propose to interpret `sub %x, %y` as `add (sub 0, %y), %x` IFF the negation can be sinked into the `%y` This has some potential to cause endless combine loops (either around PHI's, or if there are some opposite transforms). For former there's `-instcombine-negator-max-depth` option to mitigate it, should this expose any such issues For latter, if there are still any such opposing folds, we'd need to remove the colliding fold. In any case, reproducers welcomed! Reviewers: spatel, nikic, efriedma, xbolva00 Reviewed By: spatel Subscribers: xbolva00, mgorny, hiraditya, reames, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68408	2020-04-21 22:00:23 +03:00
Sanjay Patel	cf30aafa2d	[Analysis] recognize the 'null' pointer constant as not poison Differential Revision: https://reviews.llvm.org/D78575	2020-04-21 14:23:06 -04:00
Sanjay Patel	b349098d22	[InstCombine] add tests for logic-of-icmps; NFC	2020-04-21 14:23:05 -04:00
aartbik	8387bee94d	[llvm] [X86] Fixed type bug in vselect for AVX masked load Summary: Bugzilla issue 45563 https://bugs.llvm.org/show_bug.cgi?id=45563 Reviewers: nicolasvasilache, mehdi_amini, craig.topper Reviewed By: craig.topper Subscribers: RKSimon, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78527	2020-04-21 11:11:35 -07:00
Pavel Iliin	be881e2831	[AArch64] FMLA/FMLS patterns improvement. FMLA/FMLS f16 indexed patterns added. Fixes https://bugs.llvm.org/show_bug.cgi?id=45467 Removed redundant v2f32 vector_extract indexed pattern since Instruction Selection is able to match v4f32 instead.	2020-04-21 18:23:21 +01:00
Roman Lebedev	1f9c169990	[NFC][InstCombine] sub-of-negatible.ll: some more test cases	2020-04-21 20:14:09 +03:00
Fangrui Song	5771c98562	[XRay] Change xray_instr_map sled addresses from absolute to PC relative for x86-64 xray_instr_map contains absolute addresses of sleds, which are relocated by `R_*_RELATIVE` when linked in -pie or -shared mode. By making these addresses relative to PC, we can avoid the dynamic relocations and remove the SHF_WRITE flag from xray_instr_map. We can thus save VM pages containg xray_instr_map (because they are not modified). This patch changes x86-64 and bumps the sled version to 2. Subsequent changes will change powerpc64le and AArch64. Reviewed By: dberris, ianlevesque Differential Revision: https://reviews.llvm.org/D78082	2020-04-21 09:36:09 -07:00
Sanjay Patel	44a8c5410e	[InstCombine] add tests for logic-of-icmps; NFC These are mostly replicated from D78430 (instsimplify). If we implement more general transforms for instcombine, then we probably don't need to add that complexity to instsimplify.	2020-04-21 12:26:45 -04:00
Stefan Pintilie	a92ee77d85	[PowerPC][Future] Add offsets to PC Relative relocations. This is an optimization that applies to global addresses and allows for the following transformation: Convert this: paddi r3, 0, symbol@PCREL, 1 ld r4, 8(r3) To this: pld r4, symbol@PCREL+8(0), 1 An instruction is saved and the linker can do the addition when the symbol is resolved. Differential Revision: https://reviews.llvm.org/D76160	2020-04-21 11:08:19 -05:00
Kang Zhang	e477915bfe	[PowerPC] Add a new test case expand-isel-liveness.mir	2020-04-21 16:00:34 +00:00
Sean Fertile	cd8e9e8fcd	[PowerPC][AIX][NFC] Fix use of FileCheck variable in lit test.	2020-04-21 10:56:46 -04:00
Pavel Labath	c475856d05	[DWARFDebugLine] Check for errors when parsing v2 file/dir lists Summary: Without this we could silently accept an invalid prologue because the default DataExtractor behavior is to return an empty string when reaching the end of file. And empty string is also used to terminate these lists. This makes the parsing code slightly more complicated, but this complexity will go away once the parser starts working with truncating data extractors. The reason I am doing it this way is because without this, the truncation would regress the quality of error messages (right now, we produce bad error messages only near EOF, but truncation would make everything behave as if it was near EOF). Reviewers: dblaikie, probinson, jhenderson Subscribers: hiraditya, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77555	2020-04-21 16:55:36 +02:00
Pavel Iliin	c2dd38f1cb	[AArch64][NFC] One more intrinsic test.	2020-04-21 15:20:07 +01:00
Georgii Rymar	3471ae9dad	[yaml2obj] - Verify that sections are sorted by their file offsets when creating segments. This validates that sections listed for a segment in the YAML declaration are ordered by their file offsets. It might help to simplify the file size computation, but also is useful by itself as helps to avoid issues in test cases and to maintain their readability. Differential revision: https://reviews.llvm.org/D78361	2020-04-21 15:50:42 +03:00
Kerry McLaughlin	0df40d6ef8	[AArch64][SVE] Add addressing mode for contiguous loads & stores Summary: This patch adds the register + register addressing mode for SVE contiguous load and store intrinsics (LD1 & ST1) Reviewers: sdesmalen, fpetrogalli, efriedma, rengolin Reviewed By: fpetrogalli Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78509	2020-04-21 12:04:43 +01:00
Sam Parker	27d19101e9	[ARM][ParallelDSP] Handle squaring multiplies The logic in ARMParallelDSP is setup to merge two 16-bits loads into a 32-bit load and feed them into the smlads. This requires that four loads are combined for the four inputs, but there wasn't actually a check for this. Differential Revision: https://reviews.llvm.org/D78492	2020-04-21 08:39:56 +01:00
Johannes Doerfert	dc3b5b00fe	[OpenMPOpt] Make the combination of `ident_t` deterministic Before we kept the first applicable `ident_t` during deduplication of runtime calls. The problem is that "first" is dependent on the iteration order of a DenseMap. Since the proper solution, which is to combine the information from all `ident_t`, should be deterministic on its own, we will not try to make the iteration order deterministic. Instead, we will create a fresh `ident_t` if there is not a unique existing `ident_t*` to pick.	2020-04-20 23:27:08 -05:00
Fangrui Song	b14e9e3c0c	Reland D76675 [llvm-objcopy] Match GNU behaviour regarding file symbols Don't error on Config.KeepFileSymbols for COFF and Mach-O. Original description: GNU objcopy removes STT_FILE symbols for strip-debug operations, and keeps them for --discard-all operation. Match their behaviour for llvm-objcopy. Bug: https://github.com/android/ndk/issues/1212 Differential Revision: https://reviews.llvm.org/D76675	2020-04-20 21:18:48 -07:00
Yi Kong	37a1c2eda5	Revert "[llvm-objcopy] Match GNU behaviour regarding file symbols" This reverts commit `7c65e88d0b`. Broke non ELF targets.	2020-04-21 12:04:01 +08:00
Yi Kong	7c65e88d0b	[llvm-objcopy] Match GNU behaviour regarding file symbols GNU objcopy removes STT_FILE symbols for strip-debug operations, and keeps them for --discard-all operation. Match their behaviour for llvm-objcopy. Bug: https://github.com/android/ndk/issues/1212 Differential Revision: https://reviews.llvm.org/D76675	2020-04-21 11:30:04 +08:00

1 2 3 4 5 ...

70662 Commits