llvm-project

Commit Graph

Author	SHA1	Message	Date
Vitaly Buka	4666953ce2	[StackSafety] Add info into function summary Summary: This patch adds optional field into function summary, implements asm and bitcode serialization. YAML serialization is omitted and can be added later if needed. This patch includes this information into summary only if module contains at least one sanitize_memtag function. In a near future MTE is the user of the analysis. Later if needed we can provede more direct control on when information is included into summary. Reviewers: eugenis Subscribers: hiraditya, steven_wu, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80908	2020-06-10 02:43:28 -07:00
Sam Parker	09d30cb977	[CostModel] Unify Shuffle and InsertElement Costs Extract the existing code from getInstructionThroughput into TTImpl::getUserCost. The duplicated code in the AMDGPU backend has also been removed. Differential Revision: https://reviews.llvm.org/D81448	2020-06-10 09:13:34 +01:00
Sam Parker	fa8bff0cd1	[CostModel] Unify getArithmeticInstrCost Add the remaining arithmetic opcodes into the generic implementation of getUserCost and then call this from getInstructionThroughput. Most of the backends have been modified to return the base implementation for cost kinds other RecipThroughput. The outlier here is AMDGPU which already uses getArithmeticInstrCost for all the cost kinds. This change means that most of the opcodes can be removed from that backends implementation of getUserCost. Differential Revision: https://reviews.llvm.org/D80992	2020-06-10 09:08:45 +01:00
Craig Topper	641d5ac4d1	[X86] Assign a feature to tremont, goldmont, goldmont-plus, icelake-client, and icelake for target multiversioning priority. Without this these CPUs all caused the compiler to assert when used for multiversioning.	2020-06-09 16:39:41 -07:00
diggerlin	edd819c757	[AIX] supporting the visibility attribute for aix assembly SUMMARY: in the aix assembly , it do not have .hidden and .protected directive. in current llvm. if a function or a variable which has visibility attribute, it will generate something like the .hidden or .protected , it can not recognize by aix as. in aix assembly, the visibility attribute are support in the pseudo-op like .extern Name [ , Visibility ] .globl Name [, Visibility ] .weak Name [, Visibility ] in this patch, we implement the visibility attribute for the global variable, function or extern function . for example. extern __attribute__ ((visibility ("hidden"))) int bar(int* ip); __attribute__ ((visibility ("hidden"))) int b = 0; __attribute__ ((visibility ("hidden"))) int foo(int* ip){ return (*ip)++; } the visibility of .comm linkage do not support , we will have a separate patch for it. we have the unsupported cases ("default" and "internal") , we will implement them in a a separate patch for it. Reviewers: Jason Liu ,hubert.reinterpretcast,James Henderson Differential Revision: https://reviews.llvm.org/D75866	2020-06-09 16:15:06 -04:00
Craig Topper	d5c28c4094	[X86] Move CPUKind enum from clang to llvm/lib/Support. NFCI Similar to what some other targets have done. This information could be reused by other frontends so doesn't make sense to live in clang. -Rename CK_Generic to CK_None to better reflect its illegalness. -Move function for translating from string to enum into llvm. -Call checkCPUKind directly from the string to enum translation and update CPU kind to CK_None accordinly. Caller will use CK_None as sentinel for bad CPU. I'm planning to move all the CPU to feature mapping out next. As part of that I want to devise a better way to express CPUs inheriting features from an earlier CPU. Allowing this to be expressed in a less rigid way than just falling through a switch. Or using gotos as we've had to do lately. Differential Revision: https://reviews.llvm.org/D81439	2020-06-09 12:52:41 -07:00
Matt Arsenault	b94c9e3b55	GlobalISel: Improve MachineIRBuilder construction The current relationship between LegalizerHelper and MachineIRBuilder confuses me, because the LegalizerHelper modifies the MachineIRBuilder which it does not own. Constructing a LegalizerHelper destroys the insert point, since the constructor calls setMF, which clears all the fields. Try to separate these functions, so it's possible to construct a LegalizerHelper from an existing MachineIRBuilder without losing the insert point/debug loc.	2020-06-09 15:05:04 -04:00
Matt Arsenault	babbf4441b	GlobalISel: Move some trivial MIRBuilder methods into the header The construction APIs for MachineIRBuilder don't make much sense, and it's been annoying to sort through it with these trivial functions separate from the declaration.	2020-06-09 15:04:48 -04:00
Thomas Lively	b7d369280b	[WebAssembly] Implement prototype SIMD rounding instructions Summary: As specified in https://github.com/WebAssembly/simd/pull/232. These instructions are implemented as LLVM intrinsics for now rather than normal ISel patterns to make these instructions opt-in. Once the instructions are merged to the spec proposal, the intrinsics will be replaced with proper ISel patterns. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D81222	2020-06-09 10:14:14 -07:00
Aaron Puchert	55c365d247	Add LLVM_ATTRIBUTE_NORETURN to report_bad_alloc_error Summary: The attribute just means that there will be no regular return, it still leaves room for exceptions to be thrown. It is easily verified: there are no direct returns and the last statement is either a throw or a call to abort. Having the annotation helps static analyzers with this code from Support/MemAlloc.h (slightly simplified): LLVM_ATTRIBUTE_RETURNS_NONNULL inline void safe_malloc(size_t Sz) { void Result = std::malloc(Sz); if (Result == nullptr) report_bad_alloc_error("Allocation failed"); return Result; } Were report_bad_alloc_error to return regularly, the function would return nullptr, contradicting the attribute. Reviewers: rnk, sepavloff, dblaikie, aaron.ballman Reviewed By: dblaikie, aaron.ballman Differential Revision: https://reviews.llvm.org/D81318	2020-06-09 17:45:12 +02:00
Florian Hahn	0ab4edd02e	[PatternMatch] Support matching intrinsics with 6 arguments. I couldn't find a generic intrinsic with 6 arguments in tree for a unit test, but soon there will be one.	2020-06-09 15:59:26 +01:00
James Henderson	dbd26fe0b6	[DebugInfo] Print non-verbose output at some point as verbose output Verbose and non-verbose parsing of .debug_line produced their output at different points in the program. The most obvious impact of this was that error messages were produced at different times, but it also potentially reduced what clients could do by customising the stream or warning/error handlers. This change makes the two variants consistent by printing non-verbose output inline, the same as verbose output. Testing of the error messages has been modified to check the messages always appear in the same location to illustrate the behaviour. Reviewed by: JDevlieghere, dblaikie, MaskRay, labath Differential Revision: https://reviews.llvm.org/D80989	2020-06-09 14:24:53 +01:00
serge-sans-paille	5b08bd0eb4	Fix MemCpyOptimizer return status Differential Revision: https://reviews.llvm.org/D81229	2020-06-09 14:24:33 +02:00
James Henderson	1ce831912c	[Support] Add stream tie function and use it for errs() errs() is now tied to outs() so that if something prints to errs(), outs() will be flushed before the printing occurs. This avoids interleaving output between the two and is consistent with standard cout and cerr behaviour. Reviewed by: labath, JDevlieghere, MaskRay Differential Revision: https://reviews.llvm.org/D81156	2020-06-09 12:51:02 +01:00
Simon Pilgrim	70d77b5f8b	Magic.h - reduce includes to forward declarations. NFC.	2020-06-09 11:54:51 +01:00
Xing GUO	e4344e1466	[DWARFYAML][debug_ranges] Emit an error message for invalid offset. This patch helps make yaml2obj emit an error message when we try to assign an invalid offset to the entry of the 'debug_ranges' section. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D81357	2020-06-09 18:53:38 +08:00
Guillaume Chatelet	800e100588	Revert "[Alignment][NFC] Migrate TargetLowering::allowsMemoryAccess" This reverts commit `f21c52667e`.	2020-06-09 10:43:59 +00:00
Guillaume Chatelet	3b6196c9b3	[Alignment][NFC] TargetLowering::allowsMisalignedMemoryAccesses Summary: Note to downstream target maintainers: this might silently change the semantics of your code if you override `TargetLowering::allowsMisalignedMemoryAccesses` without marking it override. This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81374	2020-06-09 10:17:42 +00:00
Guillaume Chatelet	f21c52667e	[Alignment][NFC] Migrate TargetLowering::allowsMemoryAccess Summary: Note to downstream target maintainers: this might silently change the semantics of your code if you override `TargetLowering::allowsMemoryAccess` without marking it override. This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81379	2020-06-09 10:11:07 +00:00
Cullen Rhodes	b82be5db71	[AArch64][SVE] Implement structured load intrinsics Summary: This patch adds initial support for the following instrinsics: * llvm.aarch64.sve.ld2 * llvm.aarch64.sve.ld3 * llvm.aarch64.sve.ld4 For loading two, three and four vectors worth of data. Basic codegen is implemented with reg+reg and reg+imm addressing modes being addressed in a later patch. The types returned by these intrinsics have a number of elements that is a multiple of the elements in a 128-bit vector for a given type and N, where N is the number of vectors being loaded, i.e. 2, 3 or 4. Thus, for 32-bit elements the types are: LD2 : <vscale x 8 x i32> LD3 : <vscale x 12 x i32> LD4 : <vscale x 16 x i32> This is implemented with target-specific intrinsics for each variant that take the same operands as the IR intrinsic but return N values, where the type of each value is a full vector, i.e. <vscale x 4 x i32> in the above example. These values are then concatenated using the standard concat_vector intrinsic to maintain type legality with the IR. These intrinsics are intended for use in the Arm C Language Extension (ACLE). Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D75751	2020-06-09 08:51:58 +00:00
Guillaume Chatelet	49dd8e7991	[Alignment] Fix deprecation message	2020-06-09 08:05:13 +00:00
Kang Zhang	1b6602275d	[MachineVerifier] Add TiedOpsRewritten flag to fix verify two-address error Summary: Currently, MachineVerifier will attempt to verify that tied operands satisfy register constraints as soon as the function is no longer in SSA form. However, PHIElimination will take the function out of SSA form while TwoAddressInstructionPass will actually rewrite tied operands to match the constraints. PHIElimination runs first in the pipeline. Therefore, whenever the MachineVerifier is run after PHIElimination, it will encounter verification errors on any tied operands. This patch adds a function property called TiedOpsRewritten that will be set by TwoAddressInstructionPass and will control when the verifier checks tied operands. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D80538	2020-06-09 07:39:42 +00:00
David Sherwood	cc8872400c	[CodeGen] Ensure callers of CreateStackTemporary use sensible alignments In two instances of CreateStackTemporary we are sometimes promoting alignments beyond the stack alignment. I have introduced a new function called getReducedAlign that will return the alignment for the broken down parts of illegal vector types. For example, on NEON a <32 x i8> type is made up of two <16 x i8> types - in this case the sensible alignment is 16 bytes, not 32. In the legalization code wherever we create stack temporaries I have started using the reduced alignments instead for illegal vector types. I added a test to CodeGen/AArch64/build-one-lane.ll that tries to insert an element into an illegal fixed vector type that involves creating a temporary stack object. Differential Revision: https://reviews.llvm.org/D80370	2020-06-09 08:10:17 +01:00
Sam Parker	37289615c0	[NFCI][CostModel] Unify getCmpSelInstrCost Add cases for icmp, fcmp and select into the switch statement of the generic getUserCost implementation with getInstructionThroughput then calling into it. The BasicTTI and backend implementations have be set to return a default value (1) when a cost other than throughput is being queried. Differential Revision: https://reviews.llvm.org/D80550	2020-06-09 07:41:22 +01:00
Chen Zheng	8aa52b19a7	[APInt] set all bits for getBitsSetWithWrap if loBit == hiBit differentiate getBitsSetWithWrap & getBitsSet when loBit == hiBit getBitsSetWithWrap sets all bits; getBitsSet does nothing. Reviewed By: lkail, RKSimon, lebedev.ri Differential Revision: https://reviews.llvm.org/D81325	2020-06-08 22:55:24 -04:00
jasonliu	775ef44514	[XCOFF][AIX] report_fatal_error when an overflow section is needed If there are more than 65534 relocation entries in a single section, we should generate an overflow section. Since we don't support overflow section for now, we should generate an error. Differential revision: https://reviews.llvm.org/D81104	2020-06-08 19:59:04 +00:00
Andrew Litteken	bb677cacc8	[SuffixTree][MachOpt] Factoring out Suffix Tree and adding Unit Tests This moves the SuffixTree test used in the Machine Outliner and moves it into Support for use in other outliners elsewhere in the compilation pipeline. Differential Revision: https://reviews.llvm.org/D80586	2020-06-08 12:44:18 -07:00
Chris Jackson	c6c65164af	[DebugInfo] Reduce SalvageDebugInfo() functions - Now all SalvageDebugInfo() calls will mark undef if the salvage attempt fails. Reviewed by: vsk, Orlando Differential Revision: https://reviews.llvm.org/D78369	2020-06-08 19:28:18 +01:00
Jonas Devlieghere	550b599523	[Support] Replace 'DisableColors' boolean with 'ColorMode' enum Replace the DisableColors with a ColorMode which can be set to Auto, Enabled and Disabled. The purpose of this change is to make it possible to ignore the command line option not only for disabling colors, but also for enabling them. Differential revision: https://reviews.llvm.org/D81056	2020-06-08 09:48:47 -07:00
Jonas Devlieghere	8744d7f25b	[Support] Move color handling from raw_fd_ostream to raw_ostream Move the color handling code from raw_fd_ostream to raw_ostream. This makes it possible to use colors with any ostream when enabled. The existing behavior where only raw_fd_ostream supports colors by default remains unchanged. Differential revision: https://reviews.llvm.org/D81110	2020-06-08 09:03:32 -07:00
Aaron Puchert	31eeee1d8e	Fix build after removing llvm/CodeGen/GlobalISel/Types.h Removing the include was probably just forgotten after `f13ba22227`.	2020-06-08 16:58:12 +02:00
Guillaume Chatelet	54076610dc	[Alignment][NFC] Deprecate dead code from CallingConvLower.h Summary: This is a followup on D81196. Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81362	2020-06-08 14:49:39 +00:00
Xing GUO	9939f231e6	[ObjectYAML] Add support for error handling in DWARFYAML. NFC. This patch intends to be an NFC-patch. Test cases will be added in a follow-up patch. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D81356	2020-06-08 22:49:29 +08:00
Matt Arsenault	6c431fcf6e	GlobalISel: Remove dead include	2020-06-08 10:29:18 -04:00
Matt Arsenault	f74523f986	GlobalISel: Remove deprecated methods These have very few users to begin with, and have been marked deprecated for 2 months which should be long enough for out of tree targets.	2020-06-08 10:22:35 -04:00
Matt Arsenault	5f7e38d8f4	GlobalISel: Use Register	2020-06-08 10:15:53 -04:00
Matt Arsenault	f13ba22227	GlobalISel: Remove unused header	2020-06-08 10:15:53 -04:00
Matt Arsenault	1aca589c06	GlobalISel: Add dump method to LLT	2020-06-08 10:15:53 -04:00
Pavel Labath	9456bbdd08	[BinaryFormat] Add formatv support for DW_OP constants The functionality will be used from lldb.	2020-06-08 15:27:44 +02:00
Simon Pilgrim	aa67af9abd	DbgEntityHistoryCalculator.h - reduce DebugInfoMetadata.h include to forward declarations. NFC.	2020-06-08 13:59:05 +01:00
Cullen Rhodes	3ebbe35363	[AArch64][SVE] Implement vector tuple intrinsics Summary: This patch adds the following intrinsics for creating two-tuple, three-tuple and four-tuple scalable vectors: * llvm.aarch64.sve.tuple.create2 * llvm.aarch64.sve.tuple.create3 * llvm.aarch64.sve.tuple.create4 As well as: * llvm.aarch64.sve.tuple.get * llvm.aarch64.sve.tuple.set For extracting and inserting scalable vectors from vector tuples. These intrinsics are intended to be used by the ACLE functions svcreate<n>, svget and svset. This patch also includes calling convention support for passing and returning tuples of scalable vectors to/from functions. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D75674	2020-06-08 11:09:55 +00:00
Guillaume Chatelet	94b0c32a0b	[Alignment][NFC] Migrate HandleByVal to Align Summary: Note to downstream target maintainers: this might silently change the semantics of your code if you override `TargetLowering::HandleByVal` without marking it `override`. This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: sdardis, hiraditya, jrtc27, atanasyan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81365	2020-06-08 10:50:27 +00:00
Simon Pilgrim	1e7cd8c3ac	VersionTuple.h - reduce includes to forward declarations. NFC.	2020-06-08 11:08:44 +01:00
Dineshkumar Bhaskaran	cb6885b295	[ELF] Adding accessor method for getting Note Desc as StringRef Summary: One more way to access note desc. Reviewers: arsenm, scott.linder, saiislam Reviewed By: scott.linder Subscribers: wdng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81185	2020-06-08 09:44:00 +00:00
Xing GUO	67b4afc41a	[DWARFYAML] Rename function names to match the coding style. NFC.	2020-06-08 17:31:02 +08:00
Marco Elver	c6ec352a6b	Revert "[KernelAddressSanitizer] Make globals constructors compatible with kernel" This reverts commit `866ee2353f`. Building the kernel results in modpost failures due to modpost relying on debug info and inspecting kernel modules' globals: https://github.com/ClangBuiltLinux/linux/issues/1045#issuecomment-640381783	2020-06-08 10:34:03 +02:00
Sam Parker	772349de88	[PPC] Try to fix builbots Attempt to handle unsupported types, such as structs, in getMemoryOpCost. The backend now checks for a supported type and calls into BasicTTI as a fallback. BasicTTI will now also perform the same check and will default to an expensive cost of 4 for 'Other' MVTs. Differential Revision: https://reviews.llvm.org/D80984	2020-06-08 09:13:37 +01:00
QingShan Zhang	f8eabd6d01	[Power9] Add addi post-ra scheduling heuristic The instruction addi is usually used to post increase the loop indvar, which looks like this: label_X: load x, base(i) ... y = op x ... i = addi i, 1 goto label_X However, for PowerPC, if there are too many vsx instructions that between y = op x and i = addi i, 1, it will use all the hw resource that block the execution of i = addi, i, 1, which result in the stall of the load instruction in next iteration. So, a heuristic is added to move the addi as early as possible to have the load hide the latency of vsx instructions, if other heuristic didn't apply to avoid the starve. Reviewed By: jji Differential Revision: https://reviews.llvm.org/D80269	2020-06-08 01:31:07 +00:00
Simon Pilgrim	175fc4023a	CFG.h - reduce includes to forward declarations. NFC.	2020-06-07 17:25:35 +01:00
Benjamin Kramer	98626f78ae	Unbreak the build	2020-06-07 18:17:21 +02:00
Simon Pilgrim	f6cb987d50	DomTreeUpdater.h - refine includes. NFC. We don't need any of its defs or many of its includes inside PostDominators.h - so split it and reduce the frontend load.	2020-06-07 16:57:48 +01:00
Fangrui Song	dc52ce424b	[llvm-cov] Fix gcov version detection on big-endian	2020-06-07 08:07:32 -07:00
Simon Pilgrim	1c2d2c88b4	AlignmentFromAssumptions.h - reduce includes to forward declarations. NFC.	2020-06-07 13:51:48 +01:00
Simon Pilgrim	6602e4ca4b	MemorySSAUpdater.h - reduce includes to forward declarations. NFC.	2020-06-07 13:16:31 +01:00
Simon Pilgrim	3642d38823	DependenceAnalysis.h - reduce AliasAnalysis.h include to forward declaration. NFC. This requires the replacement of legacy class AliasAnalysis usages with AAResults (which it typedefs to anyhow)	2020-06-07 12:47:37 +01:00
Simon Pilgrim	b296fd2024	MustExecute.h - remove unnecessary Instruction.h include. NFC. We already have the Instruction forward declaration.	2020-06-07 12:10:50 +01:00
Simon Pilgrim	91591ec424	ObjCARCAnalysisUtils.h - remove unused LLVMContext.h include. NFC.	2020-06-07 11:48:46 +01:00
Simon Pilgrim	1e9d2f908e	OrderedInstructions.h - reduce includes to forward declarations. NFC.	2020-06-07 11:44:43 +01:00
Florian Hahn	4affc444b4	[Matrix] Implement * binary operator for MatrixType. This patch implements the * binary operator for values of MatrixType. It adds support for matrix * matrix, scalar * matrix and matrix * scalar. For the matrix, matrix case, the number of columns of the first operand must match the number of rows of the second. For the scalar,matrix variants, the element type of the matrix must match the scalar type. Reviewers: rjmccall, anemet, Bigcheese, rsmith, martong Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D76794	2020-06-07 11:11:27 +01:00
Fangrui Song	e664d0543f	[gcov] Improve tests and lower the minimum supported version to gcov 3.4 global-ctor.ll no longer checks what it intended to check (@_GLOBAL__sub_I_global-ctor.ll needs a !dbg to work). Rewrite it. gcov 3.4 and gcov 4.2 use the same format, thus we can lower the version requirement to 3.4	2020-06-06 23:11:32 -07:00
James Y Knight	1978309db1	MachineBasicBlock::updateTerminator now requires an explicit layout successor. Previously, it tried to infer the correct destination block from the successor list, but this is a rather tricky propspect, given the existence of successors that occur mid-block, such as invoke, and potentially in the future, callbr/INLINEASM_BR. (INLINEASM_BR, in particular would be problematic, because its successor blocks are not distinct from "normal" successors, as EHPads are.) Instead, require the caller to pass in the expected fallthrough successor explicitly. In most callers, the correct block is immediately clear. But, in MachineBlockPlacement, we do need to record the original ordering, before starting to reorder blocks. Unfortunately, the goal of decoupling the behavior of end-of-block jumps from the successor list has not been fully accomplished in this patch, as there is currently no other way to determine whether a block is intended to fall-through, or end as unreachable. Further work is needed there. Differential Revision: https://reviews.llvm.org/D79605	2020-06-06 22:30:51 -04:00
Jonas Paulsson	515bfc66ea	[SystemZ] Implement -fstack-clash-protection Probing of allocated stack space is now done when this option is passed. The purpose is to protect against the stack clash attack (see https://www.qualys.com/2017/06/19/stack-clash/stack-clash.txt). Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D78717	2020-06-06 18:38:36 +02:00
Simon Pilgrim	f14d4c9c54	EHPersonalities.h - reduce Triple.h include to forward declaration. NFC. Move implicit include dependencies down to source files.	2020-06-06 15:48:31 +01:00
Simon Pilgrim	eaf1ea109c	CFG.h - add missing GraphTraits.h include. NFC. MSVC doesn't care that this isn't declared for default template args but gcc (sometimes) does.	2020-06-06 15:18:25 +01:00
Simon Pilgrim	e5e33f23c7	CFG.h - reduce includes to forward declarations. NFC. Remove unnecessary includes from CFG.cpp. Fix implicit include dependency in X86WinEHState.cpp.	2020-06-06 15:06:42 +01:00
Matt Arsenault	bc20bdb9f9	AMDGPU/GlobalISel: Start rewriting load/store legality rules The current set is an incomprehensible mess riddled with ordering hacks for various limitations in the legalizer at the time of writing, many of which have been fixed. This takes a very small step in correcting this. The core first change is to start checking for fully legal cases first, rather than trying to figure out all of the actions that could need to be performed. It's recommended to check the legal cases first for faster legality checks in the common case. This still has a table listing some common cases, but it needs measuring whether this really helps or not. More significantly, stop trying to allow any arbitrary type with a legal bitwidth as a legal memory type, and start using the bitcast legalize action for them. Allowing loads of these weird vector types produced new burdens we don't need for handling all of the legalization artifacts. Unlike the SelectionDAG handling, this is still not casting 64 or 16-bit element vectors to 32-bit vectors. These cases should still be handled by increasing/decreasing the number of 16-bit elements. This is primarily to fix 8-bit element vectors. Another change is to stop trying to handle the load-widening based on a higher alignment. We should still do this, but the way it was handled wasn't really correct. We really need to modify the MMO's size at the same time, and not just increase the result type. The LegalizerHelper does not do this, and I think this would really require a separate WidenMemory action (or to add a memory action payload to the LegalizeMutation). These will now fail to legalize. The structure of the legalizer rules makes writing concise rules here difficult. It would be easier if the same function could answer the query the query, and report the action to perform at the same time. Instead these two are split into distinct predicate and action functions. This is mostly tolerable for other cases, but the load/store rules get pretty complicated so it's difficult to keep two versions of these functions in sync.	2020-06-06 09:59:46 -04:00
dfukalov	c94d32a6b3	[AMDGPU] Increase max iterations count to analyze complete unroll Summary: In some cases inner loops may not get boosts so try to analyze them deeper. Reviewers: rampitec, mzolotukhin Reviewed By: rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, zzheng, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81204	2020-06-06 16:32:45 +03:00
Simon Pilgrim	72e8f07c65	LoopPassManager.h - add missing MemorySSA.h include Fix buildbot failure due to rG5006e551d310 - oddly I can't reproduce this locally on my msvc expensive checks build.	2020-06-06 14:23:08 +01:00
Simon Pilgrim	5006e551d3	LoopAnalysisManager.h - reduce includes to forward declarations. NFC. Move implicit include dependencies down to header/source files.	2020-06-06 14:06:46 +01:00
Simon Pilgrim	eda13c2420	LegacyDivergenceAnalysis.h - reduce DivergenceAnalysis.h include to forward declaration. NFC. Move implicit include dependencies down to source file.	2020-06-06 13:30:00 +01:00
Simon Pilgrim	d43603c32c	LoopInfoImpl.h - remove unused SetVector.h include. NFC.	2020-06-06 12:59:22 +01:00
Nikita Popov	ff1210edb6	[NewGVN] Remove alignment from LoadExpression (NFC) The alignment is not actually used.	2020-06-06 11:49:20 +02:00
Nico Weber	101fbc0138	Revert "Migrate the rest of COFFObjectFile to Error" This reverts commit `b5289656b8`. __attribute__((optnone)) doesn't build with msvc, see http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/16326	2020-06-05 21:20:11 -04:00
Reid Kleckner	b5289656b8	Migrate the rest of COFFObjectFile to Error	2020-06-05 16:29:05 -07:00
Reid Kleckner	e03a135be8	Re-land "Migrate Binary::checkOffset from error_code to Error, NFC" This reverts commit `38f3ba591e`. Fix the XCOFF error handling. Unlike std::error_code, Error must be consumed or handled.	2020-06-05 16:27:31 -07:00
Nico Weber	38f3ba591e	Revert "Migrate Binary::checkOffset from error_code to Error, NFC" This reverts commit `74bd98829d`. Breaks LLVM::section-headers.test everywhere, see e.g. http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/29940/steps/test-check-all/logs/FAIL%3A%20LLVM%3A%3Asection-headers.test	2020-06-05 17:00:20 -04:00
Reid Kleckner	74bd98829d	Migrate Binary::checkOffset from error_code to Error, NFC In my use case, this saved 100ms of time doing one-time-initialization for std::error_code().	2020-06-05 13:38:56 -07:00
Matt Arsenault	eaa8af9322	GlobalISel: Add helper for constructing load from offset	2020-06-05 15:06:03 -04:00
Matt Arsenault	45e1a22a92	GlobalISel: Make known bits/alignment API more consistent Just computing the alignment makes sense without caring about the general known bits, such as for non-integral pointers. Separate the two and start calling into the TargetLowering hooks for frame indexes. Start calling the TargetLowering implementation for FrameIndexes, which improves the AMDGPU matching for stack addressing modes. Also introduce a new hook for returning known alignment of target instructions. For AMDGPU, it would be useful to report the known alignment implied by certain intrinsic calls. Also stop using MaybeAlign.	2020-06-05 14:57:22 -04:00
Matt Arsenault	6c570f789d	GlobalISel: Add G_EXTRACT/G_INSERT offset to legalize info Immediate legalize fields were added for G_SEXT_INREG. Simiarly, these are likely not legal except for certain offsets.	2020-06-05 14:54:40 -04:00
Matt Arsenault	a080e345e4	AMDGPU: Fix missing immarg on buffer.atomic.fadd intrinsic	2020-06-05 14:34:07 -04:00
Marco Elver	866ee2353f	[KernelAddressSanitizer] Make globals constructors compatible with kernel Summary: This makes -fsanitize=kernel-address emit the correct globals constructors for the kernel. We had to do the following: - Disable generation of constructors that rely on linker features such as dead-global elimination. - Only emit constructors for globals not in explicit sections. The kernel uses sections for special globals, which we should not touch. Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=203493 Tested: 1. With 'clang/test/CodeGen/asan-globals.cpp'. 2. With test_kasan.ko, we can see: BUG: KASAN: global-out-of-bounds in kasan_global_oob+0xb3/0xba [test_kasan] Reviewers: glider, andreyknvl Reviewed By: glider Subscribers: cfe-commits, nickdesaulniers, hiraditya, llvm-commits Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D80805	2020-06-05 20:20:46 +02:00
Matt Arsenault	3b5d4aa258	GlobalISel: Infer nofpexcept flag during selection for non-strict ops Match SelectionDAG's behavior of adding nofpexcept to out instructions that may raise fp exceptions that are selected from instructions that do not.	2020-06-05 13:59:46 -04:00
Sander de Smalen	937cb7a8c7	Reland D80640: [CodeGen][SVE] Calculate correct type legalization for scalable vectors. This reverts commit `9bcef270d7`.	2020-06-05 18:09:31 +01:00
Simon Pilgrim	a3597ecae9	ScalarEvolutionNormalization.h - reduce ScalarEvolutionExpressions.h include to forward declaration. NFC.	2020-06-05 17:40:33 +01:00
Simon Pilgrim	ea0880ddef	TypeMetadataUtils.h - reduce Instructions.h include to forward declaration. NFC. Move implicit include dependencies down to source file.	2020-06-05 17:40:33 +01:00
Sander de Smalen	9bcef270d7	Revert "[CodeGen][SVE] Calculate correct type legalization for scalable vectors." Seems to break some buildbots, reverting the patch for now. This reverts commit `164f4b9d26`.	2020-06-05 16:03:52 +01:00
Sander de Smalen	164f4b9d26	[CodeGen][SVE] Calculate correct type legalization for scalable vectors. This patch updates TargetLoweringBase::computeRegisterProperties and TargetLoweringBase::getTypeConversion to support scalable vectors, and make the right calls on how to legalise them. These changes are required to legalise both MVTs and EVTs. Reviewers: efriedma, david-arm, ctetreau Reviewed By: efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D80640	2020-06-05 15:20:34 +01:00
Simon Pilgrim	39ff53984d	SyncDependenceAnalysis.h - remove orphan method declarations. NFCI. These have been there since the header was added by D51491 but afaict without an implementation, all functionality is actually in DivergenceAnalysis	2020-06-05 14:35:31 +01:00
Simon Pilgrim	06fd973c85	TargetLibraryInfo.h - reduce Triple.h include to forward declaration. NFC. Move implicit include dependencies down to source files.	2020-06-05 14:35:30 +01:00
Simon Pilgrim	607e2a1fa9	ScopedNoAliasAA.h - remove unnecessary InstrTypes.h include. NFC.	2020-06-05 14:35:30 +01:00
Guillaume Chatelet	80845db6a5	[Alignment][NFC] Migrate CallingConv tablegen code Summary: We first migrate the generated code, more patches to come. This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81196	2020-06-05 13:33:34 +00:00
Dineshkumar Bhaskaran	f1b2be416d	[MsgPack] Added a convenience operator Summary: Added "not equal to" operator for DocNode comparison Reviewers: arsenm, scott.linder, saiislam Reviewed By: saiislam Subscribers: wdng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81250	2020-06-05 12:44:51 +00:00
Kerry McLaughlin	89fc0166f5	[CodeGen][SVE] Legalisation of extends with scalable types Summary: This patch adds legalisation of extensions where the operand of the extend is a legal scalable type but the result is not. EXTRACT_SUBVECTOR is used to split the result, before being replaced by target-specific [S\|U]UNPK[HI\|LO] operations. For example: ``` zext <vscale x 16 x i8> %a to <vscale x 16 x i16> ``` should emit: ``` uunpklo z2.h, z0.b uunpkhi z1.h, z0.b ``` Reviewers: sdesmalen, efriedma, david-arm Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, huihuiz, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79587	2020-06-05 12:08:42 +01:00
Simon Pilgrim	44d86982d2	MemorySSAUpdater.h - reduce unnecessary includes to forward declarations. NFC. Remove unnecessary MemoryAccess forward declaration as its already included from MemorySSA.h Move implicit include dependencies down to source files.	2020-06-05 10:45:59 +01:00
Sam Parker	9303546b42	[CostModel] Unify getMemoryOpCost Use getMemoryOpCost from the generic implementation of getUserCost and have getInstructionThroughput return the result of that for loads and stores. This also means that the X86 implementation of getUserCost can be removed with the functionality folded into its getMemoryOpCost. Differential Revision: https://reviews.llvm.org/D80984	2020-06-05 10:13:38 +01:00
Xing GUO	929edd8bd2	[DWARFYAML][debug_aranges] Replace InitialLength with Format and Length. This patch addresses the comment in [D80972](https://reviews.llvm.org/D80972#inline-744217). Before this patch, the initial length field of .debug_aranges section should be declared as: ``` ## 32-bit DWARF debug_aranges: - Length: TotalLength: 0x20 Version: 2 ... ## 64-bit DWARF debug_aranges: - Length: TotalLength: 0xffffffff TotalLength64: 0x20 Version: 2 ... ``` After this patch: ``` ## 32-bit DWARF debug_aranges: - [[Format: DWARF32]] ## Optional Length: 0x20 Version: 2 ... ## 64-bit DWARF debug_aranges: - Format: DWARF64 Length: 0x20 Version: 2 ``` Current implementation of generating DWARF64 .debug_aranges section is buggy. A follow-up patch will improve it and add test cases for DWARF64. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D81063	2020-06-05 12:16:44 +08:00
Vedant Kumar	198762680e	[LiveDebugValues] Cache LexicalScopes::getMachineBasicBlocks, NFCI Summary: Cache the results from getMachineBasicBlocks in LexicalScopes to speed up UserValueScopes::dominates queries. This replaces the caching done in UserValueScopes. Compared to the old caching method, this reduces memory traffic when a VarLoc is copied (e.g. when a VarLocMap grows), and enables caching across basic blocks. When compiling sqlite 3.5.7 (CTMark version), this patch reduces the number of calls to getMachineBasicBlocks from 10,207 to 1,093. I also measured a small compile-time reduction (~ 0.1% of total wall time, on average, on my machine). As a drive-by, I made the DebugLoc in UserValueScopes a const reference to cut down on MetadataTracking traffic. Reviewers: jmorse, Orlando, aprantl, nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80957	2020-06-04 16:58:45 -07:00
Jan Korous	5f5d972d83	[docs] Fix self-contradictory description of llvm_unreachable Just two paragraphs above it says: "If the compiler does not support this [skipping code generation for a particular branch], it will fall back to the "abort" implementation." And that actually correctly describes llvm_unreachable implementation. Differential Revision: https://reviews.llvm.org/D81130	2020-06-04 11:15:20 -07:00
Fangrui Song	9be3567df2	[llvm-dwarfdump] Add a table header for -debug-line -verbose output Like non-verbose output, so that it is easy to recognize the `Line,Column,File,ISA,Discriminator` column values. Reviewed By: JDevlieghere, jhenderson Differential Revision: https://reviews.llvm.org/D80874	2020-06-04 08:56:17 -07:00

1 2 3 4 5 ...

41171 Commits