Summary: A couple of tests to showcase what will be done and what to expect with ICV tracking.
Reviewers: jdoerfert, JonChesterfield
Subscribers: yaxunl, guansong, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D81114
Note that .eh_frame sections are generated in the 32-bit format even
when debug sections are 64-bit, for compatibility reasons. They use
relative references between entries, so they hardly benefit from the
64-bit format.
Differential Revision: https://reviews.llvm.org/D81149
DW_FORM_sec_offset was introduced in DWARFv4, so, for 64-bit DWARFv3,
DW_FORM_data8 should be used instead.
Differential Revision: https://reviews.llvm.org/D81148
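As a rough illustration of the form choice above (a minimal sketch; `chooseOffsetForm` is a made-up helper, not the actual LLVM emitter code):
```
#include "llvm/BinaryFormat/Dwarf.h"
using namespace llvm;

// Section offsets are 4 bytes in 32-bit DWARF and 8 bytes in 64-bit DWARF;
// DW_FORM_sec_offset only exists from DWARFv4 onward, so older versions fall
// back to fixed-size data forms.
static dwarf::Form chooseOffsetForm(bool IsDWARF64, unsigned DwarfVersion) {
  if (DwarfVersion >= 4)
    return dwarf::DW_FORM_sec_offset;      // size follows the DWARF format
  return IsDWARF64 ? dwarf::DW_FORM_data8  // 64-bit DWARFv3
                   : dwarf::DW_FORM_data4; // 32-bit DWARFv2/v3
}
```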
The patch enables producing DWARF64 compilation units and fixes
generating references to .debug_abbrev and .debug_line sections.
A similar change for .debug_ranges/.debug_rnglists will be added
in a forthcoming patch.
Differential Revision: https://reviews.llvm.org/D81145
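For context, a minimal sketch of how the 64-bit format changes a unit's initial length field, per the DWARF spec (the `Emitter` type below is a stand-in defined only for this sketch, not the actual streamer interface):
```
#include <cstdint>
#include <vector>

// Hypothetical little-endian byte emitter, defined only for this sketch.
struct Emitter {
  std::vector<uint8_t> Bytes;
  void emit(uint64_t V, unsigned N) {
    for (unsigned I = 0; I != N; ++I)
      Bytes.push_back(uint8_t(V >> (8 * I)));
  }
};

// DWARF64 unit headers start with the 0xffffffff escape followed by a 64-bit
// length; DWARF32 uses a plain 32-bit length. Offsets into .debug_abbrev and
// .debug_line widen from 4 to 8 bytes in the same way.
void emitInitialLength(Emitter &E, bool IsDWARF64, uint64_t Length) {
  if (IsDWARF64) {
    E.emit(0xffffffffu, 4); // escape value marking the 64-bit format
    E.emit(Length, 8);
  } else {
    E.emit(Length, 4);      // must fit in 32 bits
  }
}
```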
The patch adds an option `--dwarf64` to instruct a tool to generate
debug information in the 64-bit DWARF format. There is no real
implementation yet, only a few compatibility checks.
Differential Revision: https://reviews.llvm.org/D81143
This patch extends MatchVectorAllZeroTest to handle OR vector reduction patterns where the result is compared against zero.
Fixes PR45378
Differential Revision: https://reviews.llvm.org/D81547
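A small C-level illustration of the kind of pattern involved (assumed shape; the combine itself matches the corresponding vector-reduction DAG nodes, not C source):
```
#include <stddef.h>
#include <stdint.h>

// Vectorizers typically turn this loop into a vector OR reduction whose
// result is then compared against zero, which is the shape
// MatchVectorAllZeroTest can now handle.
int all_zero(const uint32_t *p, size_t n) {
  uint32_t acc = 0;
  for (size_t i = 0; i < n; ++i)
    acc |= p[i];
  return acc == 0;
}
```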
Only functions with a floating-point return type accept fast-math flags.
When adding such flags to a function returning an integer, we'll see a crash,
because there's still an undeleted value referencing the argument. This
patch manually removes the temporary instruction when an error occurs.
Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D78355
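A minimal sketch of the constraint being enforced (not the parser fix itself; `mayHaveFastMathFlags` is a made-up helper): LLVM only treats instructions with a floating-point result as FPMathOperators, so an integer-returning call must not carry fast-math flags.
```
#include "llvm/IR/Instruction.h"
#include "llvm/IR/Operator.h"
using namespace llvm;

// An integer-returning call is not an FPMathOperator, so attaching fast-math
// flags to it is invalid and has to be rejected (and cleaned up) by the parser.
static bool mayHaveFastMathFlags(const Instruction &I) {
  return isa<FPMathOperator>(&I);
}
```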
This diff adds support for copying binaries
containing an LC_CODE_SIGNATURE load command.
Test plan: make check-all
Differential revision: https://reviews.llvm.org/D81768
Summary:
Normally, the Origin is passed over TLS, which seems like it introduces unnecessary overhead. It's in the (extremely) cold path though, so the only overhead is in code size.
But with eager-checks, calls to __msan_warning functions are extremely common, so this becomes a useful optimization.
This can save ~5% code size.
Reviewers: eugenis, vitalybuka
Reviewed By: eugenis, vitalybuka
Subscribers: hiraditya, #sanitizers, llvm-commits
Tags: #sanitizers, #llvm
Differential Revision: https://reviews.llvm.org/D81700
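A rough sketch of the difference in the report path (the identifiers below are illustrative stand-ins, not the exact MSan runtime symbols):
```
extern __thread unsigned tls_origin_slot;     /* stand-in for the TLS channel    */
extern void warning_fn(void);                 /* stand-in for a warning callback */
extern void warning_fn_with_origin(unsigned); /* stand-in for the new variant    */

/* Before: each report site stores the origin to TLS, then calls the runtime. */
void report_via_tls(unsigned origin) {
  tls_origin_slot = origin;
  warning_fn();
}

/* After: the origin travels as a plain argument, saving the TLS store that
 * eager checks would otherwise emit at every (very common) call site. */
void report_via_arg(unsigned origin) {
  warning_fn_with_origin(origin);
}
```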
It's possible to end up with a zext or something in the way of a G_CONSTANT,
even pre-legalization. This can happen with memsets.
e.g.
https://godbolt.org/z/Bjc8cw
To make sure we can catch these cases, use `getConstantVRegValWithLookThrough`
instead of `mi_match`.
Differential Revision: https://reviews.llvm.org/D81875
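A minimal sketch of the lookup style (simplified; the wrapper name is made up and the surrounding combine code is omitted): `getConstantVRegValWithLookThrough` walks through copies and extensions such as G_ZEXT to find the underlying G_CONSTANT, which a plain `mi_match` on G_CONSTANT would miss.
```
#include "llvm/CodeGen/GlobalISel/Utils.h"
using namespace llvm;

static bool matchConstantThroughExt(Register Val,
                                    const MachineRegisterInfo &MRI,
                                    int64_t &Out) {
  // Looks through G_ZEXT/COPY and friends to find a G_CONSTANT feeding Val.
  if (auto ValAndVReg = getConstantVRegValWithLookThrough(Val, MRI)) {
    Out = ValAndVReg->Value;
    return true;
  }
  return false;
}
```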
Summary:
L is meant to support the second word used by 32b calling conventions for 64b arguments.
This is required to build 32b PowerPC Linux kernels after upstream
commit 334710b1496a ("powerpc/uaccess: Implement unsafe_put_user() using 'asm goto'")
Thanks for the report from @nathanchance, and reference to GCC's
implementation from @segher.
Fixes: pr/46186
Fixes: https://github.com/ClangBuiltLinux/linux/issues/1044
Reviewers: echristo, hfinkel, MaskRay
Reviewed By: MaskRay
Subscribers: MaskRay, wuzish, nemanjai, hiraditya, kbarton, steven.zhang, llvm-commits, segher, nathanchance, srhines
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D81767
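A small illustration of the modifier on a 32-bit PowerPC target, where a 64-bit operand occupies a register pair: `%1` refers to the first register of the pair and `%L1` to the second (illustrative only; the kernel's unsafe_put_user() uses this together with `asm goto`).
```
static inline void store_u64(unsigned long long v, unsigned long long *p) {
  __asm__("stw %1, 0(%0)\n\t"
          "stw %L1, 4(%0)"
          : : "b"(p), "r"(v) : "memory");
}
```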
The promotion machinery in CGP moves instructions retaining
debug locations. When the transformation is local, this is mostly
correct, but when instructions are moved cross-BBs, this is not
always true and causes jumpiness in line tables. This is the first
of a series of commits. sext(s) and zext(s) need to be treated
similarly.
Differential Revision: https://reviews.llvm.org/D81879
As suggested in D81472, the load/store intrinsics' pointer arguments can
be marked as nocapture and all matrix intrinsics as nosync.
This also re-flows the intrinsic definitions, to make them a little more
concise.
Add selection support for ext via a new opcode, G_EXT, and a post-legalizer
combine which matches it.
Add an `applyEXT` function, because the AArch64ext patterns require a register
for the immediate. So, we have to create a G_CONSTANT to get these without
writing new patterns or modifying the existing ones.
Tests are the same as arm64-ext.ll.
Also prevent ext from firing on the zip test. It has higher priority, so we
don't want it potentially getting in the way of mask tests.
Also fix up the shuffle-splat test, because ext is now selected there. The
test was incorrectly regbank selected before, which could cause a verifier
failure when you emit copies.
Differential Revision: https://reviews.llvm.org/D81436
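A hedged sketch of the applyEXT point above (simplified and not the actual combine code; `buildExt` is a made-up wrapper): since the AArch64ext selection patterns expect the immediate in a register, the combine materializes it with a G_CONSTANT instead of encoding it as an immediate operand.
```
#include "llvm/CodeGen/GlobalISel/MachineIRBuilder.h"
using namespace llvm;

static void buildExt(MachineIRBuilder &MIB, unsigned ExtOpc, Register Dst,
                     Register Src1, Register Src2, uint64_t Imm) {
  // Materialize the lane offset as a G_CONSTANT so the existing patterns,
  // which take the immediate from a register, still apply.
  auto Cst = MIB.buildConstant(LLT::scalar(32), Imm);
  MIB.buildInstr(ExtOpc, {Dst}, {Src1, Src2, Cst});
}
```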
Port partial constant store merging logic to MemorySSA backed DSE. The
heavy lifting is done by the existing helper function. It is used in a
context where we have already ensured that the later instruction can
eliminate the earlier one if it is a complete overwrite.
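A small C illustration of the situation the helper handles (assumed shape; the transform itself works on IR-level constant stores):
```
#include <stdint.h>
#include <string.h>

struct Pair { uint16_t lo, hi; };

void init(struct Pair *p) {
  memset(p, 0, sizeof *p); /* earlier store: 4 constant bytes          */
  p->hi = 0x1234;          /* later store: overwrites 2 of those bytes */
  /* Instead of keeping both, the later constant can be merged into the
   * earlier store, leaving a single 4-byte constant store. */
}
```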
If a symbol name begins with the linker private global prefix (as
described by the DataLayout) then it should be treated as non-exported,
regardless of its LLVM IR visibility value.
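A minimal sketch of the rule, assuming the module's DataLayout is at hand (`isLinkerPrivate` is a made-up helper, not the actual patch code): names starting with the linker private global prefix, e.g. "l" on Mach-O, are treated as non-exported regardless of their IR visibility.
```
#include "llvm/IR/DataLayout.h"
using namespace llvm;

static bool isLinkerPrivate(StringRef Name, const DataLayout &DL) {
  StringRef Prefix = DL.getLinkerPrivateGlobalPrefix();
  return !Prefix.empty() && Name.startswith(Prefix);
}
```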
This adds 4 new reloc types.
A lot of code that previously assumed any memory or offset values could be contained in a uint32_t (and often truncated results from functions returning 64-bit values) has been upgraded to uint64_t. This is not comprehensive: it is only the values that come in contact with the new relocation values and their dependents.
A new tablegen mapping was added to automatically upgrade loads/stores in the assembler, which otherwise has no way to select for these instructions (since they are identical other than for the offset immediate). It follows a similar technique to https://reviews.llvm.org/D53307
Differential Revision: https://reviews.llvm.org/D81704
This implements the following combines:
((0-A) + B) -> B-A
(A + (0-B)) -> A-B
Porting over the basic algebraic combines from the DAGCombiner. There are
several combines which fold adds away into subtracts. This is just the simplest
one.
I noticed that add combines are some of the most commonly hit across CTMark
(via print statements when they fire), so I'm porting over some of the obvious
ones.
This gives some minor code size improvements on CTMark at -O3 on AArch64.
Differential Revision: https://reviews.llvm.org/D77453
These are legal since we can do a 96-bit load on some subtargets, but
this is only for vector loads. If we can't widen the load, it needs to
be broken down once it is known to be scalar. For 16-byte alignment, widen to a
128-bit load.
Context: https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Overview.md
This is just a first step, adding the new instruction variants while keeping the existing 32-bit functionality working.
Some of the basic load/store tests have new wasm64 versions that show that the basics of the target are working.
Further features need implementation, but these will be added in followups to keep things reviewable.
Differential Revision: https://reviews.llvm.org/D80769
We should not be adding the relocation addend to the instruction encoding.
This patch removes that and sets those bits to zero.
Differential Revision: https://reviews.llvm.org/D81082
Reduce by splitting the vector until we reach the target size for PTEST/MOVMSK_PCMPEQ. There might be some cases where AVX512 can perform this with 512-bit vectors but so far I haven't encountered any such pattern that reaches LowerVectorAllZeroTest.
Prep work for D81547
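An intrinsics-level illustration of the splitting idea (not the SelectionDAG lowering itself): OR the halves together until the value fits a width PTEST can handle, then test that against zero.
```
#include <immintrin.h>

/* Requires AVX2: reduce a 256-bit value to 128 bits by OR-ing the halves,
 * then use PTEST to check whether all bits are zero. */
int is_all_zero_256(__m256i v) {
  __m128i lo = _mm256_castsi256_si128(v);
  __m128i hi = _mm256_extracti128_si256(v, 1);
  __m128i m = _mm_or_si128(lo, hi);
  return _mm_testz_si128(m, m);
}
```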
> relocImm was a complexPattern that handled both ConstantSDNode
> and X86Wrapper. But it was only applied selectively because using
> it would cause patterns to be not importable into FastISel or
> GlobalISel. So it only got applied to flag setting instructions,
> stores, RMW arithmetic instructions, and rotates.
>
> Most of the test changes are a result of making patterns available
> to GlobalISel or FastISel. The absolute-cmp.ll change is due to
> this fixing a pattern ordering issue to make an absolute symbol
> match to an 8-bit immediate before trying a 32-bit immediate.
>
> I tried to use PatFrags to reduce the repetition, but I was getting
> errors from TableGen.
This caused "Invalid EmitNode" assertions, see the llvm-commits thread for
discussion.
In preparation for a patch that will enforce new rules for the usage of
the strictfp attribute, this patch introduces auto-upgrade behavior that
will replace the strictfp attribute on callsites with nobuiltin if the
enclosing function declaration doesn't also have the strictfp attribute.
This auto-upgrade isn't being performed on .ll files because that would
prevent us from writing a test for the forthcoming verifier behavior.
Differential Revision: https://reviews.llvm.org/D70096
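A hedged sketch of the upgrade rule (simplified; the real logic lives in the auto-upgrade machinery and its API details may differ): a call site marked strictfp inside a function that is not strictfp gets the attribute replaced with nobuiltin.
```
#include "llvm/IR/Function.h"
#include "llvm/IR/InstIterator.h"
#include "llvm/IR/Instructions.h"
using namespace llvm;

static void upgradeStrictFPCallSites(Function &F) {
  if (F.hasFnAttribute(Attribute::StrictFP))
    return; // enclosing function is strictfp: call sites may keep the attribute
  for (Instruction &I : instructions(F))
    if (auto *CB = dyn_cast<CallBase>(&I))
      if (CB->hasFnAttr(Attribute::StrictFP)) {
        CB->removeAttribute(AttributeList::FunctionIndex, Attribute::StrictFP);
        CB->addAttribute(AttributeList::FunctionIndex, Attribute::NoBuiltin);
      }
}
```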
Outline chunks of code which need to save and restore the link register
when a spare register can be used to do it.
Differential Revision: https://reviews.llvm.org/D80127
Summary:
SCTLR_EL1.BT[01] controls the PACI[AB]SP compatibility with BTYPE 11
(see [1]).
This bit will be set to zero so PACI[AB]SP are equivalent to the BTI C
instruction only.
[1] https://developer.arm.com/docs/ddi0595/b/aarch64-system-registers/sctlr_el1
Reviewers: chill, tamas.petz, pbarrio, ostannard
Reviewed By: tamas.petz, ostannard
Subscribers: kristof.beyls, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D81746
The logic is written for what loads/stores should be selectable. There
are a set of cases that should be selectable, but due to missing MVTs
and/or selection patterns, will fail to select. I think eventually
load/store select patterns should ignore the type and only look at the
value size, but until that happens, bitcast these to equivalent i32
vectors.
We have an issue currently. The following YAML piece just ignores the `Excluded` key.
```
SectionHeaderTable:
Sections: []
Excluded:
- Name: .foo
```
Currently the meaning is: exclude the whole table.
The code only checks that the `Sections` key is empty and doesn't catch/check
invalid/duplicated/missing `Excluded` entries.
Also there is no way to exclude all sections except the first null section,
because `Sections: []` currently just excludes the whole section header table.
To fix it, I suggest a change of the behavior.
1) A new `NoHeaders` key is added. It provides an explicit syntax to drop the whole table.
2) The meaning of the following is changed:
```
SectionHeaderTable:
Sections: []
Excluded:
- Name: .foo
```
Assuming there are 2 sections in the object (a null section and `.foo`), with this patch it
means: exclude the `.foo` section, keep the null section. The null section is an implicit
section, and I think it is reasonable to make `Sections: []` mean it is implicitly added.
It will be consistent with the global `Sections` tag that is used to describe sections.
3) `SectionHeaderTable->Sections` is now optional. No `Sections` is the same as
`Sections: []` (I think it avoids a confusion).
4) Using `NoHeaders` together with `Sections`/`Excluded` is not allowed.
5) It is possible to use the `Excluded` key without the `Sections` key now (in this case
`Excluded` must contain all sections).
6) `SectionHeaderTable:` or `SectionHeaderTable: []` is not allowed.
7) When the `SectionHeaderTable` key is present, we still require all sections to be
present in the `Sections` and `Excluded` lists. No changes here; we are still strict.
Differential revision: https://reviews.llvm.org/D81655