llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	4878aa36d4	[ValueLattice] Add new state for undef constants. This patch adds a new undef lattice state, which is used to represent UndefValue constants or instructions producing undef. The main difference to the unknown state is that merging undef values with constants (or single element constant ranges) produces the constant/constant range, assuming all uses of the merge result will be replaced by the found constant. Contrary, merging non-single element ranges with undef needs to go to overdefined. Using unknown for UndefValues currently causes mis-compiles in CVP/LVI (PR44949) and will become problematic once we use ValueLatticeElement for SCCP. Reviewers: efriedma, reames, davide, nikic Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D75120	2020-03-14 17:19:59 +00:00
Georgii Rymar	b236b4cb43	[yaml2obj] - Set a default value for `PAddr` property of a program header to a value of `VAddr` `PAddr` corresponds to `p_paddr` of a program header, which is the segment's physical address for systems in which physical addressing is relevant. `p_paddr` is often equal to `p_vaddr`, which is the virtual address of a segment. This patch changes the default for `PAddr` from 0 to a value of `VAddr`. Differential revision: https://reviews.llvm.org/D76131	2020-03-14 17:44:57 +03:00
Martin Storsjö	97c7be9028	[llvm-dlltool] Add a testcase to show the kind of weak external used for import library aliases. NFC.	2020-03-14 14:00:36 +02:00
Shengchen Kan	e6f1dd40bd	[X86] Disable nop padding before instruction following a prefix Reviewers: reames, MaskRay, craig.topper, LuoYuanke, jyknight Reviewed By: LuoYuanke Subscribers: hiraditya, llvm-commits, annita.zhang Tags: #llvm Differential Revision: https://reviews.llvm.org/D76052	2020-03-14 13:15:30 +08:00
Diogo Sampaio	83cdb654e4	[AArch64][Fix] LdSt optimization generate premature stack-popping Summary: When moving add and sub to memory operand instructions, aarch64-ldst-opt would prematurally pop the stack pointer, before memory instructions that do access the stack using indirect loads. e.g. ``` int foo(int offset){ int local[4] = {0}; return local[offset]; } ``` would generate: ``` sub sp, sp, #16 ; Push the stack mov x8, sp ; Save stack in register stp xzr, xzr, [sp], #16 ; Zero initialize stack, and post-increment, making it invalid ------ If an exception goes here, the stack value might be corrupted ldr w0, [x8, w0, sxtw #2] ; Access correct position, but it is not guarded by SP ``` Reviewers: fhahn, foad, thegameg, eli.friedman, efriedma Reviewed By: efriedma Subscribers: efriedma, kristof.beyls, hiraditya, danielkiss, llvm-commits, simon_tatham Tags: #llvm Differential Revision: https://reviews.llvm.org/D75755	2020-03-14 02:03:10 +00:00
Craig Topper	755e00876c	[X86] Remove isel patterns for X86VBroadcast+trunc+extload. Replace with DAG combines. This is a little more complicated than I'd like it to be. We have to manually match a trunc+srl+load pattern that generic DAG combine won't do for us due to isTypeDesirableForOp.	2020-03-13 18:12:16 -07:00
Eli Friedman	65fc706ddf	[SCEV] Add support for GEPs over scalable vectors. Because we have to use a ConstantExpr at some point, the canonical form isn't set in stone, but this seems reasonable. The pretty sizeof(<vscale x 4 x i32>) dumping is a relic of ancient LLVM; I didn't have to touch that code. :) Differential Revision: https://reviews.llvm.org/D75887	2020-03-13 16:12:45 -07:00
Akira Hatanaka	c6f1713c46	[ObjC][ARC] Don't remove autoreleaseRV/retainRV pairs if the call isn't a tail call This reapplies the patch in https://reviews.llvm.org/rG1f5b471b8bf4, which was reverted because it was causing crashes. https://bugs.chromium.org/p/chromium/issues/detail?id=1061289#c2 Check that HasSafePathToCall is true before checking the call is a tail call. Original commit message: Previosly ARC optimizer removed the autoreleaseRV/retainRV pair in the following code, which caused the object returned by @something to be placed in the autorelease pool because the call to @something isn't a tail call: ``` %call = call i8* @something(...) %2 = call i8* @objc_retainAutoreleasedReturnValue(i8* %call) %3 = call i8* @objc_autoreleaseReturnValue(i8* %2) ret i8* %3 ``` Fix the bug by checking whether @something is a tail call. rdar://problem/59275894	2020-03-13 13:52:14 -07:00
Stanislav Mekhanoshin	c262b69dcc	[AMDGPU] Fix endcf collapse Only collapse inner endcf if the outer one belongs to SI_IF. If it does belong to SI_ELSE then mask being restored in fact a partial inverse of what we need. Differential Revision: https://reviews.llvm.org/D76154	2020-03-13 13:50:21 -07:00
Martin Storsjö	8f540dad61	[COFF] Assign unique names to autogenerated .weak.<name>.default symbols These symbols need to be external (MSVC tools error out if a weak external points at a symbol that isn't external; this was tried before but had to be reverted in `bc5b7217dc`, and this was originally explicitly fixed in `732eeaf2a9`). If multiple object files have weak symbols with defaults, their defaults could cause linker errors due to duplicate definitions, unless the names of the defaults are unique. GNU binutils handles this by appending the name of another symbol from the same object file to the name of the default symbol. Try to implement something similar; before writing the object file, locate a symbol that should have a unique name and use the name of that one for making the weak defaults unique. Differential Revision: https://reviews.llvm.org/D75989	2020-03-13 22:44:55 +02:00
Matt Arsenault	015b640be4	AMDGPU: Add flag to used fixed function ABI Pass all arguments to every function, rather than only passing the minimum set of inputs needed for the call graph.	2020-03-13 13:27:05 -07:00
Alexey Zhikhartsev	f71abec661	[LoopInterchange] Fix interchanging contents of preheader BBs Summary: Previously LCSSA was getting broken by placing instructions into the (newly) inner header instead of the preheader. Fixes PR43474 Reviewers: fhahn Reviewed By: fhahn Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75943	2020-03-13 15:59:37 -04:00
Matt Arsenault	bb8622094d	AMDGPU: Don't handle kernarg.segment.ptr in functions Just lower this to null. Pass implicitarg.ptr in its place in the argument list.	2020-03-13 12:51:12 -07:00
Nico Weber	f82b32a51e	Revert "Reland "[DebugInfo] Enable the debug entry values feature by default"" This reverts commit `5aa5c943f7`. Causes clang to assert, see https://bugs.chromium.org/p/chromium/issues/detail?id=1061533#c4 for a repro.	2020-03-13 15:37:44 -04:00
Stanislav Mekhanoshin	32e90cbcd1	[AMDGPU] Disable endcf collapse There are some functional regressions and I suspect our scopes are not as perfectly enclosed as I expected. Disable it for now. Differential Revision: https://reviews.llvm.org/D76148	2020-03-13 12:33:22 -07:00
Reid Kleckner	478b06e687	Revert "[ObjC][ARC] Check the basic block size before calling DominatorTree::dominate" This reverts commit `5c3117b0a9` This should not be necessary after `7593a480db`, and Florian Hahn has confirmed that the problem no longer reproduces with this patch. I happened to notice this code because the FIXME talks about OrderedBasicBlock. Reviewed By: fhahn, dexonsmith Differential Revision: https://reviews.llvm.org/D76075	2020-03-13 11:57:55 -07:00
Simon Pilgrim	05c0d34918	[X86][SSE] Prefer trunc(movd(x)) to pextrb(x,0) If we're extracting the 0'th index of a v16i8 vector we're better off using MOVD than PEXTRB, unless we're storing the value or we require the implicit zero extension of PEXTRB. The biggest perf diff is on SLM targets where MOVD (uops=1, lat=3 tp=1) is notably faster than PEXTRB (uops=2, lat=5, tp=4). This matches what we already do for PEXTRW. Differential Revision: https://reviews.llvm.org/D76138	2020-03-13 18:43:04 +00:00
Sanjay Patel	89b19e8959	[SimplifyCFG] add test for chain of empty block conditional branches; NFC	2020-03-13 14:39:31 -04:00
Huihui Zhang	fc1f205745	[SLPVectorizer][SVE] Bail out early for scalable vector. Summary: SLPVectorizer try to vectorize list of scalar instructions of the same type, instructions already vectorized are rejected through isValidElementType(). Without this patch, tryToVectorizeList() will first try to determine vectorization factor of a list of Instructions before checking whether each instruction has unsupported type or not. For instructions already vectorized for SVE, it will crash at getVectorElementSize(), where it try to return a fixed size. This patch make sure invalid element types are rejected before trying to get vectorization factor. This make sure we are not trying to vectorize instructions already vectorized. Reviewers: sdesmalen, efriedma, spatel, RKSimon, ABataev, apazos, rengolin Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76017	2020-03-13 11:23:31 -07:00
Sanjay Patel	afc4dcee83	[SimplifyCFG] regenerate complete test checks; NFC	2020-03-13 14:12:28 -04:00
Sanjay Patel	7fe0e70ecc	[SimplifyCFG] regenerate test checks; NFC	2020-03-13 14:12:28 -04:00
Florian Hahn	e30c257811	[CVP,SCCP] Precommit test for D75055. Test case for PR44949.	2020-03-13 17:53:39 +00:00
Philip Reames	1b86ad27a7	Use 15 byte long nops on modern Intel processors Back in D42616, we switched our default nop length from 15 to 10 bytes because some platforms have painful decode stalls when encountering multiple instruction prefixes. (10 byte long nops come from the fact that prefixes are used to pad after 8 bytes, and some platforms have issues w/more than two prefixes.) Based on Agner's guides, it appears to be the case that modern Intel (SandyBridge and later) can decode an arbitrary number of prefixes without issue. Intel's guide only provides up to 9 bytes; I read that as providing a safe default for all their chips. Older chips and Atom series have serious decode stalls. I can't find a conclusive reference beyond those two. Differential Revision: https://reviews.llvm.org/D75945	2020-03-13 10:51:09 -07:00
Simon Cook	a26bd4ec16	[TableGen] Support combining AssemblerPredicates with ORs For context, the proposed RISC-V bit manipulation extension has a subset of instructions which require one of two SubtargetFeatures to be enabled, 'zbb' or 'zbp', and there is no defined feature which both of these can imply to use as a constraint either (see comments in D65649). AssemblerPredicates allow multiple SubtargetFeatures to be declared in the "AssemblerCondString" field, separated by commas, and this means that the two features must both be enabled. There is no equivalent to say that _either_ feature X or feature Y must be enabled, short of creating a dummy SubtargetFeature for this purpose and having features X and Y imply the new feature. To solve the case where X or Y is needed without adding a new feature, and to better match a typical TableGen style, this replaces the existing "AssemblerCondString" with a dag "AssemblerCondDag" which represents the same information. Two operators are defined for use with AssemblerCondDag, "all_of", which matches the current behaviour, and "any_of", which adds the new proposed ORing features functionality. This was originally proposed in the RFC at http://lists.llvm.org/pipermail/llvm-dev/2020-February/139138.html Changes to all current backends are mechanical to support the replaced functionality, and are NFCI. At this stage, it is illegal to combine features with ands and ors in a single AssemblerCondDag. I suspect this case is sufficiently rare that adding more complex changes to support it are unnecessary. Differential Revision: https://reviews.llvm.org/D74338	2020-03-13 17:13:51 +00:00
Florian Hahn	0c5b6e2ea5	Recommit "[SCCP] Use ValueLatticeElement instead of LatticeVal (NFCI)" This patch should fix the cause of the stage2 failures and PR45185. This reverts the revert commit `c52f839e72`.	2020-03-13 17:03:22 +00:00
Simon Pilgrim	a2db388dce	[CostModel][X86] Improve ISD::CTTZ costs accounting for BSF/TZCNT implementations	2020-03-13 16:51:13 +00:00
Simon Pilgrim	ec3218dbee	[X86] Add cttz/ctlz tests for i686 with CMOV target	2020-03-13 16:51:13 +00:00
Tyker	2543567c41	[AssumeBundles] filter usefull attriutes to preserve Summary: This patch will filter attributes to only preserve those that are usefull. In the case of NoAlias it is filtered out not because it isn't usefull but because it is incorrect to preserve it as it is only valdi for the duration of the function. Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: jdoerfert, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75828	2020-03-13 17:35:47 +01:00
Tyker	69375fd0a3	[AssumeBundles] Preserve Information in the inliner Summary: during inling Create and insert an llvm.assume with attributes to preserve them. to prevent any changes for now generation of llvm.assume is under a flag disabled by default. Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75825	2020-03-13 17:35:47 +01:00
omarahmed1111	b285b333dc	[Attributor] Detect possibly unbounded cycles in functions This patch add mayContainUnboundedCycle helper function which checks whether a function has any cycle which we don't know if it is bounded or not. Loops with maximum trip count are considered bounded, any other cycle not. It also contains some fixed tests and some added tests contain bounded and unbounded loops and non-loop cycles. Reviewed By: jdoerfert, uenoku, baziotis Differential Revision: https://reviews.llvm.org/D74691	2020-03-13 11:17:33 -05:00
Pankaj Gode	bf990530ae	[Attributor] Improve noalias preservation using reachability Resolution for below fixme: (ii) Check whether the value is captured in the scope using AANoCapture. FIXME: This is conservative though, it is better to look at CFG and check only uses possibly executed before this callsite. Propagates caller argument's noalias attribute to callee. Reviewed by: jdoerfert, uenoku Reviewers: jdoerfert, sstefan1, uenoku Subscribers: uenoku, sstefan1, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D71617	2020-03-13 21:09:08 +05:30
Fangrui Song	7b74b0d4e5	[llvm-objdump] --syms: print 'u' for STB_GNU_UNIQUE GCC when configured with --enable-gnu-unique (default on glibc>=2.11) emits STB_GNU_UNIQUE for certain objects which are otherwise emitted as STT_OBJECT, such as an inline function's static local variable or its guard variable, and a static data member of a template. Clang does not implement -fgnu-unique. Implementing it as a binding is strange and the feature itself is considered by some as a misfeature. Reviewed By: grimar, jhenderson Differential Revision: https://reviews.llvm.org/D75797	2020-03-13 08:04:09 -07:00
Fangrui Song	e799405e53	[llvm-objdump] --syms: print 'i' for STT_GNU_IFUNC Reviewed By: grimar, Higuoxing, jhenderson Differential Revision: https://reviews.llvm.org/D75793	2020-03-13 08:02:36 -07:00
Fangrui Song	0bd3da5bfa	[llvm-objdump][test] Reorganize ELF --syms tests Merge symbol-table-elf.test and common-symbol-elf.test, and add some more tests (invalid st_type, STT_COMMON, STT_GNU_IFUNC, STT_HIOS, STT_LOPROC, SHN_UNDEF, SHN_ABS, SHN_COMMON, STB_GNU_UNIQUE, invalid binding, etc) to test/llvm-objdump/ELF/symbol-table.test The naming follows test/llvm-{readobj,objcopy}/ELF . Some discrepancy from GNU objdump: * STT_COMMON: can be produced with `ld.bfd -r -z common`, but it almost never exists in practice * STT_GNU_IFUNC: will be fixed by D75793 * STB_GNU_UNIQUE: will be fixed by D75797 * STT_TLS: GNU objdump does not print 'O' * unknown binding: GNU objdump does not print 'g'. This probably does not matter. * A reserved symbol index is displayed as ABS in GNU objdump. It is not clear what we should print. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D75796	2020-03-13 08:00:59 -07:00
Nico Weber	86eb2c3991	Revert "[ObjC][ARC] Don't remove autoreleaseRV/retainRV pairs if the call isn't" This reverts commit `1f5b471b8b`. Causes asserts when building code with arc. See https://bugs.chromium.org/p/chromium/issues/detail?id=1061289#c2 for a full repro. Will post a creduced repro once creduce is done running.	2020-03-13 10:16:02 -04:00
Clement Courbet	ffe3515aa7	[ExpandMemCmp][NFC] Add more tests.	2020-03-13 15:06:52 +01:00
Andrzej Warzynski	a0c15ed460	[AArch64][SVE] Add the @llvm.aarch64.sve.dup.x intrinsic Summary: This intrinsic implements the unpredicated duplication of scalar values and is mapped to (through ISD::SPLAT_VECTOR): * DUP <Zd>.<T>, #<imm> * DUP <Zd>.<T>, <R><n\|SP> Reviewed by: sdesmalen Differential Revision: https://reviews.llvm.org/D75900	2020-03-13 12:40:22 +00:00
David Green	2c6c169dbd	[ARM] Optimise ASRL/LSRL to smaller shifts using demand bits. The ASRL/LSRL long shifts are generated from 64bit shifts. Once we have them, it might turn out that enough of the 64bit result was not required that we can use a smaller shift to perform the same result. As the smaller shift can in general be folded in more way, such as into add instructions in one of the test cases here, we can use the demand bit analysis to prefer the smaller shifts where we can. Differential Revision: https://reviews.llvm.org/D75371	2020-03-13 10:09:03 +00:00
Georgii Rymar	6f3de2e53d	[yaml2obj][obj2yaml][test] - Add base tests for relocation addends. We had no test for `Addend` field of a relocation. Though the current behavior is not ideal and might need to be fixed. This patch adds 2 test cases to document the current behavior and add a few FIXMEs. These FIXME are fixed in the follow-up: https://reviews.llvm.org/D75527 Differential revision: https://reviews.llvm.org/D75528	2020-03-13 13:07:46 +03:00
David Green	f67d93dc23	[ARM] Constant long shift combines This changes the way that asrl and lsrl intrinsics are lowered, going via a the ISEL ASRL and LSLL nodes instead of straight to machine nodes. On top of that, it adds some constant folds for long shifts, in case it turns out that the shift amount was either constant or 0. Differential Revision: https://reviews.llvm.org/D75553	2020-03-13 08:54:59 +00:00
Juneyoung Lee	c39cb1c0dd	[CodeGenPrepare] Expand freeze conversion to support fcmp and icmp with null Summary: This is a simple patch that expands https://reviews.llvm.org/D75859 to pointer comparison and fcmp Checked with Alive2 Reviewers: reames, jdoerfert Reviewed By: jdoerfert Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76048	2020-03-13 17:21:33 +09:00
Juneyoung Lee	48b901b0e1	Add tests to Transforms/CodeGenPrepare/X86/freeze-cmp.ll before commiting D76048	2020-03-13 17:18:42 +09:00
Craig Topper	09c8f38924	[X86] Add isel patterns for X86VBroadcast with i16 truncates from i16->i64 zextload/extload. We can form vpbroadcastw with a folded load. We had patterns for i16->i32 zextload/extload, but nothing prevents i64 from occuring. I'd like to move this all to DAG combine to fix more cases, but this is trivial fix to minimize test diffs when moving to a combine.	2020-03-13 00:10:48 -07:00
Craig Topper	51a4c6125c	[X86] Add test cases for failures to form vbroadcastw due to isTypeDesirableForOp preventing load shrinking to i16. These are based on existing test cases but use i64 instead of i32. Some of these end up with i64 zextload/extloads from i16 that we don't have isel patterns for. Some of the other cases fail because isTypeDesirableForOp prevents shrinking the (trunc (i64 (srl (load)))) directly. So we try to shrink based on the (i64 (srl (load))) but we need 64 - shift_amount to be a power of 2 to do that shrink.	2020-03-12 23:20:05 -07:00
Johannes Doerfert	a198adb490	[Attributor] IPO across definition boundary of a function marked alwaysinline Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D75590	2020-03-13 01:06:12 -05:00
Johannes Doerfert	40815a4957	Revert "[Attributor] Enable test with update check lines" This reverts commit `13def55b3f`. This broke a buildbot, will investigate.	2020-03-13 00:59:47 -05:00
Johannes Doerfert	1c9c23d60e	[OpenMP][Opt][NFC] Add test case for known runtime function attributes This test somehow did not make it in before.	2020-03-13 00:28:14 -05:00
Johannes Doerfert	13def55b3f	[Attributor] Enable test with update check lines The test disabled in `528a6a1d4c` is enabled again with the check lines for `9708279c72`.	2020-03-12 23:24:15 -05:00
Arlo Siemsen	1478ed69d3	Add support for SHA256 source file checksums in debug info LLVM currently supports CSK_MD5 and CSK_SHA1 source file checksums in debug info. This change adds support for CSK_SHA256 checksums. The SHA256 checksums are supported by the CodeView debug format. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D75785	2020-03-12 16:32:05 -07:00
Huihui Zhang	f4f2706572	[ConstantFold][SVE] Fix constant folding for scalable vector compare instruction. Summary: Do not iterate on scalable vector. Also do not return constant scalable vector from ConstantInt::get(). Fix result type by using getElementCount() instead of getNumElements(). Reviewers: sdesmalen, efriedma, apazos, huntergr, willlovett Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73753	2020-03-12 16:15:38 -07:00

1 2 3 4 5 ...

69615 Commits