llvm-project

Commit Graph

Author	SHA1	Message	Date
Vedant Kumar	8359511c62	[CodeExtractor] Remove stale llvm.assume calls from extracted region During extraction, stale llvm.assume handles may be retained in the original function. The setup is: 1) CodeExtractor unregisters assumptions in the blocks that are to be extracted. 2) Extraction happens. There are now two functions: f1 and f1.extracted. 3) Leftover assumptions in f1 (/not/ removed as they were not in the set of blocks to be extracted) now have affected-value llvm.assume handles in f1.extracted. When assumptions for a value used in f1 are looked up, ValueTracking can assert as some of the handles are in the wrong function. To fix this, simply erase the llvm.assume calls in the extracted function. Alternatives include flushing the assumption cache in the original function, or walking all values used in the original function to prune stale affected-value handles. Both seem more expensive. Testing: check-llvm, LNT run with -mllvm -hot-cold-split enabled rdar://58460728	2020-01-28 17:18:01 -08:00
Derek Schuff	d966bf830f	[WebAssembly] Preserve debug frame base information through register coloring 2 fixes: Register coloring can re-assign virtual registers. When the frame base register is colored, update the DwarfFrameBase accordingly When the frame base register is stackified, do not attempt to encode DW_AT_frame_base as a local In the future we will presumably want to handle this case better but for now we can emit worse debug info rather than crashing. Differential Revision: https://reviews.llvm.org/D73581	2020-01-28 16:58:15 -08:00
Craig Topper	92ecc306af	[X86] Add test case for llvm.flt.rounds	2020-01-28 16:27:59 -08:00
Danilo Carvalho Grael	1f85dfb2af	[AArch64][SVE] Add SVE2 mla indexed intrinsics. Summary: Add SVE2 mla indexed intrinsics: - smlalb, smlalt, umlalb, umlalt, smlslb, smlslt, umlslb, umlslt. Reviewers: efriedma, sdesmalen, dancgr, cameron.mcinally, c-rhodes, rengolin Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, arphaman, psnobl, llvm-commits, amehsan Tags: #llvm Differential Revision: https://reviews.llvm.org/D73576	2020-01-28 17:15:33 -05:00
Jessica Paquette	dba29f7c3b	[AArch64][GlobalISel] Fold G_AND into G_BRCOND When the G_BRCOND is fed by a eq or ne G_ICMP, it may be possible to fold a G_AND into the branch by producing a tbnz/tbz instead. This happens when 1. We have a ne/eq G_ICMP feeding into the G_BRCOND 2. The G_ICMP is a comparison against 0 3. One of the operands of the G_AND is a power of 2 constant This is very similar to the code in AArch64TargetLowering::LowerBR_CC. Add opt-and-tbnz-tbz to test this. Differential Revision: https://reviews.llvm.org/D73573	2020-01-28 14:00:31 -08:00
Michael Spang	a2fb2c0ddc	[GlobalMerge] Preserve symbol visibility when merging globals Symbols created for merged external global variables have default visibility. This can break programs when compiling with -Oz -fvisibility=hidden as symbols that should be hidden will be exported at link time. Differential Revision: https://reviews.llvm.org/D73235	2020-01-28 13:26:18 -08:00
Petr Hosek	127d3abf25	[Instrumentation] Set hidden visibility for the bias variable We have to avoid using a GOT relocation to access the bias variable, setting the hidden visibility achieves that. Differential Revision: https://reviews.llvm.org/D73529	2020-01-28 12:07:03 -08:00
Roland McGrath	2b0e6fe2e2	[Fuchsia] Remove aarch64-fuchsia target-specific -mcmodel=kernel Under --target=aarch64-fuchsia, -mcmodel=kernel has the effect of (the default) -mcmodel=small plus -mtp=el1 (which did not exist when this behavior was added). Fuchsia's kernel now uses -mtp=el1 directly instead of -mcmodel=kernel, so remove this special support. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D73409	2020-01-28 11:32:08 -08:00
David Blaikie	b96e6859c9	llvm-symbolizer test: Add a bit of extra detail on how to compile/reproduce this The details are also in the .test file, but doesn't hurt to make it a bit clearer.	2020-01-28 11:07:47 -08:00
Kristina Bessonova	4b0a7fe008	[llvm-dwarfdump][Statistics] Make calculations of vars in global scope more accurate It isn't known how many times we've seen the same variable or member in the global scope (unlike in functions), but there still can be some duplicates among different CUs. So, this patch proposes to count variables in the global scope just as a sum of the number of vars, constant members and artificial entities. Reviewed by: aprantl Differential Revision: https://reviews.llvm.org/D73004	2020-01-28 20:52:20 +02:00
Kristina Bessonova	5499e2f455	[llvm-dwarfdump][Statistics] Distinguish parameters with same name or w/o a name A few DW_TAG_formal_parameter's of the same function may have the same name (e.g. variadic (template) functions) or don't have a name at all (if the parameter isn't used inside the function body), but we still need to be able to distinguish between them to get correct number of 'total vars' and 'availability' metric. Reviewed by: aprantl Differential Revision: https://reviews.llvm.org/D73003	2020-01-28 20:52:20 +02:00
Kristina Bessonova	57839e5178	[llvm-dwarfdump][Statistics] Count more than one concrete out-of-line instance of a function There may be more than one out-of-line instance of the same function among different CUs. All of them should be accounted for to get an accurate total number of variables/parameters. Reviewed by: aprantl Differential Revision: https://reviews.llvm.org/D73002	2020-01-28 20:52:19 +02:00
Sanjay Patel	276a6b8889	[InstCombine] add tests for cmp with splat operand and splat constant; NFC See PR44588: https://bugs.llvm.org/show_bug.cgi?id=44588	2020-01-28 13:42:20 -05:00
Amara Emerson	14c2cf8e18	[AArch64][GlobalISel] Don't bail out of the select(cmp(a, b)) -> csel optimization with multiple users. It can still be beneficial to do the optimization if the result of the compare is used by another select. Differential Revision: https://reviews.llvm.org/D73511	2020-01-28 10:09:03 -08:00
Derek Schuff	da6a896e6b	[WebAssembly] Add WebAssembly support to llvm-symbolizer The only thing missing for basic llvm-symbolizer support is the ability on lib/Object to get a wasm symbol's section ID, which allows sorting and computation of the symbols' sizes. Also, when the WasmAsmParser switches sections on new functions, also add the section to the list of Dwarf sections if Dwarf is being generated for assembly; this allows writing of simple tests. Reviewers: sbc100, jhenderson, aardappel Differential Revision: https://reviews.llvm.org/D73246	2020-01-28 09:55:38 -08:00
Kristina Bessonova	2e5d20bd47	[llvm-dwarfdump][Statistics] Ignore declarations of global variables Reviewed by: djtodoro Differential Revision: https://reviews.llvm.org/D73001	2020-01-28 19:50:46 +02:00
Kristina Bessonova	e76106e01c	[llvm-dwarfdump][Statistics] Ignore DW_TAG_subroutine_type in statistics DW_TAG_subroutine_type is not really useful for statistics purposes, as it never has location information. But it may contain DW_TAG_formal_parameter children that generate number of parameters w/o location and decrease 'availability' metric significantly. Reviewed by: djtodoro Differential Revision: https://reviews.llvm.org/D72983	2020-01-28 19:50:46 +02:00
Kristina Bessonova	9806b39dae	[llvm-dwarfdump][Statistics] Distinguish functions/variables with same name across different CUs Different variables and functions might have the same name in different CUs. To calculate the 'Availability' metric more accurately (i.e. to avoid getting availability above 100%), we need some additional logic to distinguish between them. The patch introduces a DIE identifier that consists of a function/variable name and declaration information: a filename and a line number. This allows distinguishing different functions/variables (different means declared in different files/lines) with the same name, keeping duplicates counted as duplicates. Reviewed by: aprantl, djtodoro Differential Revision: https://reviews.llvm.org/D72797	2020-01-28 19:50:46 +02:00
Derek Schuff	a928d127a5	[llvm-objcopy] Initial support for wasm in llvm-objcopy Currently only supports simple copying, other operations to follow. Reviewers: sbc100, alexshap, jhenderson Differential Revision: https://reviews.llvm.org/D70930	2020-01-28 09:47:16 -08:00
Florian Hahn	5d0ffbeb4d	[Matrix] Mark expressions shared between multiple remarks. This patch adds support for explicitly highlighting sub-expressions shared by multiple leaf nodes. For example consider the following code %shared.load = tail call <8 x double> @llvm.matrix.columnwise.load.v8f64.p0f64(double* %arg1, i32 %stride, i32 2, i32 4), !dbg !10, !noalias !10 %trans = tail call <8 x double> @llvm.matrix.transpose.v8f64(<8 x double> %shared.load, i32 2, i32 4), !dbg !10 tail call void @llvm.matrix.columnwise.store.v8f64.p0f64(<8 x double> %trans, double* %arg3, i32 10, i32 4, i32 2), !dbg !10 %load.2 = tail call <30 x double> @llvm.matrix.columnwise.load.v30f64.p0f64(double* %arg3, i32 %stride, i32 2, i32 15), !dbg !10, !noalias !10 %mult = tail call <60 x double> @llvm.matrix.multiply.v60f64.v8f64.v30f64(<8 x double> %trans, <30 x double> %load.2, i32 4, i32 2, i32 15), !dbg !11 tail call void @llvm.matrix.columnwise.store.v60f64.p0f64(<60 x double> %mult, double* %arg2, i32 10, i32 4, i32 15), !dbg !11 We have two leaf nodes (the 2 stores) and the first store stores %trans which is also used by the matrix multiply %mult. We generate separate remarks for each leaf (stores). To denote that parts are shared, the shared expressions are marked as shared (), with a reference to the other remark that shares it. The operation summary also denotes the shared operations separately. Reviewers: anemet, Gerolf, thegameg, hfinkel, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D72526	2020-01-28 09:27:55 -08:00
Jonathan Roelofs	7f93ff58e1	[llvm] Fix broken cases of 'CHECK[^:]*$' in tests	2020-01-28 09:52:59 -07:00
Florian Hahn	a911fef3dd	[LV] Do not try to sink dead instructions. Dead instructions do not need to be sunk. Currently we try to record the recipes for them, but there are no recipes emitted for them and there's nothing to sink. They can be removed from SinkAfter while marking them for recording. Fixes PR44634. Reviewers: rengolin, hsaito, fhahn, Ayal, gilr Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D73423	2020-01-28 08:28:03 -08:00
Victor Huang	4b414d9ade	[PowerPC][Future] Add pld and pstd to future CPU Add the prefixed instructions pld and pstd to future CPU. These are load and store instructions that require new operand types that are 34 bits. This patch adds the two instructions as well as the operand types required. Note that this patch also makes a minor change to tablegen to account for the fact that some instructions are going to require shifts greater than 31 bits for the new 34 bit instructions. Differential Revision: https://reviews.llvm.org/D72574	2020-01-28 08:23:29 -06:00
Wang, Pengfei	3239b5034e	[FPEnv] Add pragma FP_CONTRACT support under strict FP. Summary: Support pragma FP_CONTRACT under strict FP. Reviewers: craig.topper, andrew.w.kaylor, uweigand, RKSimon, LiuChen3 Subscribers: hiraditya, jdoerfert, cfe-commits, llvm-commits, LuoYuanke Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72820	2020-01-28 20:43:43 +08:00
Wang, Pengfei	3d1f0ce3b9	[X86] Add combination for fma and fneg on X86 under strict FP. Summary: X86 has instructions to calculate fma and fneg at the same time. But we combine the fneg and fma only when fneg is the source operand under strict FP. Reviewers: craig.topper, andrew.w.kaylor, uweigand, RKSimon, LiuChen3 Subscribers: LuoYuanke, llvm-commits, cfe-commits, jdoerfert, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D72824	2020-01-28 20:09:56 +08:00
James Henderson	5c05165984	Revert "[DebugInfo] Make most debug line prologue errors non-fatal to parsing" This reverts commit `b94191fecd`. The change broke both an LLD test and the LLDB build.	2020-01-28 11:49:30 +00:00
James Henderson	b94191fecd	[DebugInfo] Make most debug line prologue errors non-fatal to parsing Many of the debug line prologue errors are not inherently fatal. In most cases, we can make reasonable assumptions and carry on. This patch does exactly that. In the case of length problems, the approach of "the claimed length is correct" is taken to be consistent with other instances such as the SectionParser, which ignores the read length. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D72158	2020-01-28 11:29:50 +00:00
Jay Foad	4a331beadc	[AMDGPU] Fix vccz after v_readlane/v_readfirstlane to vcc_lo/hi Summary: Up to gfx9, writes to vcc_lo and vcc_hi by instructions like v_readlane and v_readfirstlane do not update vccz to reflect the new value of vcc. Fix it by reusing part of the existing vccz bug handling code, which inserts an "s_mov_b64 vcc, vcc" instruction to restore vccz just before an instruction that needs the correct value. Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69661	2020-01-28 10:52:17 +00:00
Kazushi (Jam) Marukawa	92600c2ec8	[VE] call isel with stack passing Summary: Function calls and stack-passing of function arguments. Custom lowering, isel patterns and tests. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73461	2020-01-28 10:55:47 +01:00
Georgii Rymar	cff7c149de	[llvm-readobj][test] - Remove --symbols --dyn-syms part from Object/readobj-shared-object.test. The intention of Object/readobj-shared-object.test was to check the general output for shared object. I've added a case for testing dynamic objects to ELF/symbols.test. Also we already test dynamic symbols printing in ELF/dyn-symbols.test + I've added a case for `--dyn-syms` alias in D73164. Hence we can remove this piece from Object/readobj-shared-object.test. Differential revision: https://reviews.llvm.org/D73175	2020-01-28 12:36:29 +03:00
Guillaume Chatelet	d9bff3be99	Update tests for @llvm.memcpy.inline intrinsics	2020-01-28 10:32:43 +01:00
Guillaume Chatelet	5f87510c37	Fix failing bot	2020-01-28 10:20:55 +01:00
Kazushi (Jam) Marukawa	422dfea577	[VE] enable unaligned load/store isel Summary: Enable unaligned load/store isel for iN and fp32/64 and tests. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73448	2020-01-28 09:53:37 +01:00
Guillaume Chatelet	879c825cb8	[intrinsics] Add @llvm.memcpy.inline intrinsics Summary: This is a follow up on D61634. It adds an LLVM IR intrinsic to allow better implementation of memcpy from C++. A follow up CL will add the intrinsics in Clang. Reviewers: courbet, theraven, t.p.northover, jdoerfert, tejohnson Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71710	2020-01-28 09:42:01 +01:00
Florian Hahn	6f07f304a2	[Matrix] Mark remarks test as AArch64 specific.	2020-01-27 18:00:43 -08:00
Florian Hahn	62e228f8fd	[Matrix] Add info about number of operations to remarks. This patch updates the remark to also include a summary of the number of vector operations generated for each matrix expression. Reviewers: anemet, Gerolf, thegameg, hfinkel, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D72480	2020-01-27 17:43:39 -08:00
Wei Mi	f60671f049	[LV] Remove nondeterminacy by changing LoopVectorizationLegality::Reductions from DenseMap to MapVector The iteration order of LoopVectorizationLegality::Reductions matters for the final code generation, so we better use MapVector instead of DenseMap for it to remove the nondeterminacy. reduction-order.ll in the patch is an example reduced from the case we saw. In the output of opt command, the order of the select instructions in the vector.body block keeps changing from run to run currently. Differential Revision: https://reviews.llvm.org/D73490	2020-01-27 16:53:20 -08:00
Florian Hahn	949294f396	[Matrix] Add optimization remarks for matrix expression. Generate remarks for matrix operations in a function. To generate remarks for matrix expressions, the following approach is used: 1. Collect leafs of matrix expressions (done in RemarkGenerator::getExpressionLeafs). Leafs are lowered matrix instructions without other matrix users (like stores). 2. For each leaf, create a remark containing a linearized version of the matrix expression. The following improvements will be submitted as follow-ups: * Summarize number of vector instructions generated for each expression. * Account for shared sub-expressions. * Propagate matrix remarks up the inlining chain. The information provided by the matrix remarks helps users to spot cases where matrix expressions got split up, e.g. due to inlining not happening. The remarks allow users to address those issues, ensuring best performance. Reviewers: anemet, Gerolf, thegameg, hfinkel, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D72453	2020-01-27 16:39:29 -08:00
Fangrui Song	c7c5da6df3	Reland "[StackColoring] Remap PseudoSourceValue frame indices via MachineFunction::getPSVManager()"" Reland `7a8b0b1595`, with a fix that checks `!E.value().empty()` to avoid inserting a zero to SlotRemap. Debugged by rnk@ in https://bugs.chromium.org/p/chromium/issues/detail?id=1045650#c33 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D73510	2020-01-27 15:58:49 -08:00
Reid Kleckner	9521c18438	[IR] Keep a double break between functions when printing a module This behavior appears to have changed unintentionally in `b0e979724f`. Instead of printing the leading newline in printFunction, print it when printing a module. This ensures that `OS << *Func` starts printing immediately on the current line, but whole modules are printed nicely. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D73505	2020-01-27 15:31:09 -08:00
Reid Kleckner	c7feb6b36a	[WinEH] Re-run stack coloring test for i686 This would've caught https://crbug.com/1045650, which resulted in the revert of `7a8b0b1595`.	2020-01-27 15:26:03 -08:00
Evgenii Stepanov	34ab56904e	Support zero size types in StackSafetyAnalysis. Reviewers: vitalybuka Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73395	2020-01-27 15:22:59 -08:00
Evgenii Stepanov	c3b80adcee	Fix StackSafetyAnalysis crash with scalable vector types. Summary: Treat scalable allocas as if they have storage size of 0, and scalable-typed memory accesses as if their range is unlimited. This is not a proper support of scalable vector types in the analysis - we can do better, but not today. Reviewers: vitalybuka Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73394	2020-01-27 15:22:59 -08:00
Florian Hahn	8e3f59b45a	[AArch64] Add option to enable/disable load-store renaming. This patch adds a new option to enable/disable register renaming in the load-store optimizer. Defaults to disabled, as there is a potential mis-compile caused by this.	2020-01-27 15:15:50 -08:00
Matt Arsenault	d2a9739274	AMDGPU/GlobalISel: Eliminate SelectVOP3Mods_f32 Trivial type predicates should be moved into the tablegen pattern itself, and not checked inside complex patterns. This eliminates a redundant complex pattern, and fixes select source modifiers for GlobalISel. I have further patches which fully handle select in tablegen and remove all of the C++ selection, although it requires the ugliness to support the entire range of legal register types.	2020-01-27 17:53:54 -05:00
Sanjay Patel	747242af8d	[InstCombine] allow more narrowing of casted select D47163 created a rule that we should not change the casted type of a select when we have matching types in its compare condition. That was intended to help vector codegen, but it also could create situations where we miss subsequent folds as shown in PR44545: https://bugs.llvm.org/show_bug.cgi?id=44545 By using shouldChangeType(), we can continue to get the vector folds (because we always return false for vector types). But we also solve the motivating bug because it's ok to narrow the scalar select in that example. Our canonicalization rules around select are a mess, but AFAICT, this will not induce any infinite looping from the reverse transform (but we'll need to watch for that possibility if committed). Side note: there's a similar use of shouldChangeType() for phi ops just below this diff, and the source and destination types appear to be reversed. Differential Revision: https://reviews.llvm.org/D72733	2020-01-27 16:35:50 -05:00
Simon Pilgrim	e7e043724e	[DAG] Enable ISD::EXTRACT_SUBVECTOR SimplifyMultipleUseDemandedBits handling This allows SimplifyDemandedBits to call SimplifyMultipleUseDemandedBits to create a simpler ISD::EXTRACT_SUBVECTOR, which is particularly useful for cases where we're splitting into subvectors anyhow. Differential Revision: This allows SimplifyDemandedBits to call SimplifyMultipleUseDemandedBits to create a simpler ISD::EXTRACT_SUBVECTOR, which is particularly useful for cases where we're splitting into subvectors anyhow.	2020-01-27 21:17:47 +00:00
Adrian Prantl	a095d149c2	Fix an assertion failure in DwarfExpression's subregister composition This patch fixes an assertion failure in DwarfExpression that is triggered when a complex fragment has exactly the size of a subregister of the register the DBG_VALUE points to and there is no DWARF encoding for the super-register. I took the opportunity to replace/document some magic values with static constructor functions to make this code less confusing to read. rdar://problem/58489125 Differential Revision: https://reviews.llvm.org/D72938	2020-01-27 12:44:37 -08:00
Roman Lebedev	7bca4a28f5	[NFC][LoopVectorize] Autogenerate tests affected by isHighCostExpansionHelper() cost modelling (PR44668)	2020-01-27 23:34:30 +03:00
Roman Lebedev	9c801c48ee	[NFC][IndVarSimplify] Autogenerate tests affected by isHighCostExpansionHelper() cost modelling (PR44668)	2020-01-27 23:34:29 +03:00
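
Note on dba29f7c3b ([AArch64][GlobalISel] Fold G_AND into G_BRCOND): the three conditions listed in that commit message can be illustrated with a small standalone sketch. This is not the LLVM implementation; the names Pred, TestBranch, Fold and foldAndIntoTestBranch are hypothetical and exist only for illustration. The sketch assumes the compare is of the form `icmp pred (and x, mask), rhs` and that the branch is taken when the compare is true.

```cpp
#include <cstdint>
#include <optional>

// Hypothetical stand-ins for the compare predicate and the branch opcode.
enum class Pred { EQ, NE, Other };
enum class TestBranch { TBZ, TBNZ };

struct Fold {
  TestBranch opcode; // test-bit-and-branch flavour to emit
  unsigned bit;      // index of the single bit to test
};

// Decide whether `br (icmp pred (and x, andMask), cmpRhs)` can become a
// single test-bit branch. Returns std::nullopt when any condition fails.
std::optional<Fold> foldAndIntoTestBranch(Pred pred, uint64_t cmpRhs,
                                          uint64_t andMask) {
  // 1. The compare feeding the branch must be eq or ne.
  if (pred != Pred::EQ && pred != Pred::NE)
    return std::nullopt;
  // 2. The compare must be against 0.
  if (cmpRhs != 0)
    return std::nullopt;
  // 3. The AND mask must be a power of two (exactly one bit set).
  if (andMask == 0 || (andMask & (andMask - 1)) != 0)
    return std::nullopt;

  // Find the index of the single set bit.
  unsigned bit = 0;
  while ((andMask >> bit) != 1)
    ++bit;

  // (x & mask) != 0 branches when the bit is set (tbnz);
  // (x & mask) == 0 branches when the bit is clear (tbz).
  return Fold{pred == Pred::NE ? TestBranch::TBNZ : TestBranch::TBZ, bit};
}
```

For example, under these assumptions a `ne` compare of `x & 8` against 0 maps to a tbnz on bit 3, while a mask such as 6 (two bits set) is rejected.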


68237 Commits