llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	9d86bf825c	[X86] Move hasOneUse check after opcode check. NFC Checking opcode is cheap. hasOneUse might not be if the node has multiple results. By checking the opcode we can rule out nodes with multiple results we aren't interested in.	2022-04-15 17:20:57 -07:00
Stella Stamenova	353f0a8e43	Revert "[mlir] Refactor LICM into a utility" This reverts commit `3131f80824`. This commit broke the Windows mlir bot: https://lab.llvm.org/buildbot/#/builders/13/builds/19745	2022-04-15 17:09:16 -07:00
Craig Topper	c6dc229a6d	[DAGCombiner] Move call to hasOneUse after opcode checks. NFC Checking the opcode is cheap, counting the number of uses is not.	2022-04-15 17:02:16 -07:00
Chris Bieneman	f2526c1a5c	Add DXIL Bitcode Writer and DXIL testing This change is a big blob of code that isn't easy to break up. It either comes in all together as a blob, works and has tests, or it doesn't do anything. Logically you can think of this patch as three things: (1) Adding virtual interfaces so the bitcode writer can be overridden (2) Adding a new bitcode writer implementation for DXIL (3) Adding some (optional) crazy CMake goop to build the DirectXShaderCompiler's llvm-dis as dxil-dis for testing Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D122082	2022-04-15 18:50:26 -05:00
Craig Topper	a7b9d75e7a	[DAGCombiner] Move or/xor/and opcode check in ReduceLoadOpStoreWidth before hasOneUse check. hasOneUse is not cheap on nodes with chain results that might have many uses. By checking the opcode first, we can avoid a costly walk of the use list on nodes we aren't interested in. Found by investigating calls to hasNUsesOfValue from the example provided in D123857.	2022-04-15 16:38:27 -07:00
Johannes Doerfert	81143b69dd	[Attributor][FIX] Use AttributorConfig in the unit tests too	2022-04-15 18:36:38 -05:00
Richard Smith	fc30901096	Extend support for std::move etc to also cover std::as_const and std::addressof, plus the libstdc++-specific std::__addressof. This brings us to parity with the corresponding GCC behavior. Remove STDBUILTIN macro that ended up not being used.	2022-04-15 16:31:39 -07:00
Peter Klausler	9e7eef9989	[flang] Handle parameter-dependent types in PDT initializers For parameterized derived type component initializers whose expressions' types depend on parameter values, f18's current scheme of analyzing the initialization expression once during name resolution fails. For example, type :: pdt(k) integer, kind :: k real :: component = real(0.0, kind=k) end type To handle such cases, it is necessary to re-analyze the parse trees of these initialization expressions once for each distinct initialization of the type. This patch adds code to wipe an expression parse tree of its typed expressions, and update those of its symbol table pointers that reference type parameters, and then re-analyze that parse tree to generate the properly typed component initializers. Differential Revision: https://reviews.llvm.org/D123728	2022-04-15 16:20:41 -07:00
Johannes Doerfert	3be3b40188	[Attributor][NFCI] Introduce AttributorConfig to bundle all options Instead of lengthy constructors we can now set the members of a read-only struct before the Attributor is created. Should make it clearer what is configurable and also help introducing new options in the future. This actually added IsModulePass and avoids deduction through the Function set size. No functional change was intended.	2022-04-15 18:17:19 -05:00
Bill Wendling	2a404cdfd8	[randstruct] Force errors for all platforms	2022-04-15 15:17:07 -07:00
Mogball	3131f80824	[mlir] Refactor LICM into a utility LICM is refactored into a utility that is applicable to any region. The implementation is moved to Transform/Utils.	2022-04-15 22:07:01 +00:00
Richard Smith	a571f82a50	Update test to handle opaque pointers flag flip.	2022-04-15 14:51:30 -07:00
Pavel Kosov	a5b7ea0783	[llvm-objdump] Implemented PrintBranchImmAsAddress for MIPS Updated MipsInstPrinter to print absolute hex offsets for branch instructions. It is necessary to make the llvm-objdump output close to the gnu objdump output. This implementation is based on the implementation for RISC-V. OS Laboratory. Huawei Russian Research Institute. Saint-Petersburg Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D123764	2022-04-15 23:48:38 +02:00
Vitaly Buka	eb4d22917e	[msan] Set poison_in_dtor=1 by default It's still disabled by default at compile time. Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D123875	2022-04-15 14:40:23 -07:00
Peter Klausler	7e225423d3	[flang] Finer control over error recovery with GetExpr() Prior to this patch, the semantics utility GetExpr() will crash unconditionally if it encounters a typed expression in the parse tree that has not been set by expression semantics. This is the right behavior when called from lowering, by which time it is known that the program had no fatal user errors, since it signifies a fatal internal error. However, prior to lowering, in the statement semantics checking code, a more nuanced test should be used before crashing -- specifically, we should not crash in the face of a missing typed expression when in error recovery mode. Getting this right requires GetExpr() and its helper class to have access to the semantics context, so that it can check AnyFatalErrors() before crashing. So this patch touches nearly all of its call sites. Differential Revision: https://reviews.llvm.org/D123873	2022-04-15 14:25:41 -07:00
Richard Smith	64c045e25b	Treat `std::move`, `forward`, and `move_if_noexcept` as builtins. We still require these functions to be declared before they can be used, but don't instantiate their definitions unless their addresses are taken. Instead, code generation, constant evaluation, and static analysis are given direct knowledge of their effect. This change aims to reduce various costs associated with these functions -- per-instantiation memory costs, compile time and memory costs due to creating out-of-line copies and inlining them, code size at -O0, and so on -- so that they are not substantially more expensive than a cast. Most of these improvements are very small, but I measured a 3% decrease in -O0 object file size for a simple C++ source file using the standard library after this change. We now automatically infer the `const` and `nothrow` attributes on these now-builtin functions, in particular meaning that we get a warning for an unused call to one of these functions. In C++20 onwards, we disallow taking the addresses of these functions, per the C++20 "addressable function" rule. In earlier language modes, a compatibility warning is produced but the address can still be taken. The same infrastructure is extended to the existing MSVC builtin `__GetExceptionInfo`, which is now only recognized in namespace `std` like it always should have been. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D123345	2022-04-15 14:09:45 -07:00
Florian Hahn	73f5d7d0d6	[VPlan] Handle equal address and store ops in onlyFirstLaneDemanded. With opaque pointers, the stored value and address can be the same. Previously the code in VPWidenMemoryInstructionRecipe::onlyFirstLaneDemanded incorrectly considered stores with matching store and pointer operands as only demanding the first lane, causing a crash.	2022-04-15 22:53:33 +02:00
Chih-Ping Chen	eab6e94f91	[DebugInfo] Add a TargetFuncName field in DISubprogram for specifying DW_AT_trampoline as a string. Also update the signature of DIBuilder::createFunction to reflect this addition. Differential Revision: https://reviews.llvm.org/D123697	2022-04-15 16:38:23 -04:00
Johannes Doerfert	39a68cc016	Revert "[Attributor] CGSCC pass should not recompute results outside the SCC" This reverts commit `0d7f81e313`, it caused the AMDGPU tests that use the Attributor to fail.	2022-04-15 15:29:51 -05:00
Lang Hames	0d11351bd7	[JITLink] Add missing moves from `43acef48d3`.	2022-04-15 12:58:22 -07:00
River Riddle	ac860240ad	[mlir][NFC] Cleanup the TestClone pass Fix variable naming convention and cleanup a clang-tidy warning.	2022-04-15 12:57:07 -07:00
River Riddle	31c88660ab	[mlir] Remove the use of FilterTypes for template metaprogramming This technique results in an explosion in compile time, resulting from a huge number of std::tuple/concat instantiations. This technique is replaced by simpler metaprogramming and results in a significant reduction in compile time. A local debug/asan build saw a 4x speed up in the processing of ArithmeticOps.h.inc, and given the nature of this change every dialect should see similar reductions in compile time. Differential Revision: https://reviews.llvm.org/D123360	2022-04-15 12:57:07 -07:00
Johannes Doerfert	04f3a224bc	[Attributor][NFC] Introduce a flag to distinguish the scope of a query	2022-04-15 14:56:10 -05:00
Johannes Doerfert	0d7f81e313	[Attributor] CGSCC pass should not recompute results outside the SCC When we run the CGSCC pass we should only invest time on the SCC. We can initialize AAs with information from the module slice but we should not update those AAs.	2022-04-15 14:56:09 -05:00
Johannes Doerfert	bd72acf4d8	[Attributor][NFC] Code cleanup to minimize follow up changes	2022-04-15 14:56:09 -05:00
Johannes Doerfert	2d8e7834b0	[Attributor][NFC] Rename AAPotentialValues to AAPotentialConstantValues	2022-04-15 14:56:09 -05:00
Lang Hames	43acef48d3	[JITLink] Refactor and expand DWARF pointer encoding support. Adds support for pointer encodings commonly used in large/static models, including non-pcrel, sdata/udata8, indirect, and omit. Also refactors pointer-encoding handling to consolidate error generation inside common functions, rather than callees of those functions.	2022-04-15 12:51:46 -07:00
Arthur Eubanks	4d85859ff4	[test][LoopDeletion] Precommit test	2022-04-15 12:40:12 -07:00
Arjun P	ef8b2a7cea	[MLIR][Presburger] addSymbolicCut: fix the integral symbols heuristic to match the docs Previously this checked if the entire symbolic numerator was divisible by the denominator, which is never the case when this function is called. Fixed this to check only the non-const coefficients in the numerator, which was what was intended and documented. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123592	2022-04-15 20:34:06 +01:00
Bill Wendling	aed923b124	[randstruct] Enforce using a designated init for a randomized struct A randomized structure needs to use a designated or default initializer. Using a non-designated initializer will result in values being assigned to the wrong fields. Differential Revision: https://reviews.llvm.org/D123763	2022-04-15 12:29:32 -07:00
LLVM GN Syncbot	73110f1306	[gn build] Port `721651be24`	2022-04-15 19:23:18 +00:00
Thomas Raoux	b4bcef05b7	[mlir][vector] Fix bug in extractFromBroadcast folding extract was incorrectly folded when the source was coming from a broadcast that was both adding new rank and broadcasting the inner dimension. Differential Revision: https://reviews.llvm.org/D123867	2022-04-15 19:21:45 +00:00
Alexandre Ganea	64969446bc	[Support][cmake] Fix snmalloc integration. NFC. When using LLVM_INTEGRATED_CRT_ALLOC, fix compiling with the latest snmalloc at ToT (https://github.com/microsoft/snmalloc).	2022-04-15 15:19:38 -04:00
Xiang Li	721651be24	[HLSL][clang][Driver] Support target profile command line option. The target profile option (/T) determines the shader model when compiling HLSL. The format is shaderKind_major_minor, like ps_6_1. The shader model is saved as an llvm::Triple in clang/llvm, like dxil-unknown-shadermodel6.1-hull. The main job in supporting the option is translating ps_6_1 into shadermodel6.1-pixel. That is done inside tryParseProfile in HLSL.cpp. To integrate the option into the clang Driver, a new DriverMode, DxcMode, is created. When DxcMode is enabled, the OSType for TargetTriple is forced to Triple::ShaderModel, and a new ToolChain, HLSLToolChain, is created when the OSType is Triple::ShaderModel. In HLSLToolChain, ComputeEffectiveClangTriple is overridden to call tryParseProfile when the targetProfile option is set. To make the tests work, the Fo option is added and the .hlsl extension is added to activate -xhlsl. Reviewed By: beanz Differential Revision: https://reviews.llvm.org/D122865 Patch by: Xiang Li <python3kgae@outlook.com>	2022-04-15 14:18:18 -05:00
Arjun P	69c1a35488	[MLIR][Presburger][Simplex] moveRowUnknownToColumn: support the row sample value being zero When the sample value is zero, everything is the same except that failure to pivot does not imply emptiness. So, leave it to the user to mark as empty if necessary, if they know the sample value is strictly negative. This is needed for an upcoming symbolic lexmin heuristic. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D123604	2022-04-15 20:15:21 +01:00
William S. Moses	0df963e817	[MLIR][ClonePass] Attempt fix for anonymous pass name	2022-04-15 15:14:20 -04:00
Eli Friedman	4802edd1ac	Fix size of flexible array initializers, and re-enable assertions. In D123649, I got the formula for getFlexibleArrayInitChars slightly wrong: the flexible array elements can be contained in the tail padding of the struct. Fix the formula to account for that. With the fixed formula, we run into another issue: in some cases, we were emitting extra padding for flexible array initializers. Fix CGExprConstant so it uses a packed struct when necessary, to avoid this extra padding. Differential Revision: https://reviews.llvm.org/D123826	2022-04-15 12:09:57 -07:00
Zequan Wu	dc100ebfda	[LLDB][NativePDB] Followup `c50817d1be`	2022-04-15 12:08:44 -07:00
rdzhabarov	3ef4099a61	[mlir] Fix BUILD issues and dependencies. Differential Revision: https://reviews.llvm.org/D123868	2022-04-15 19:05:02 +00:00
Zequan Wu	c50817d1be	[LLDB][NativePDB] Don't create inlined function parameters when it's malformed.	2022-04-15 11:59:11 -07:00
Johannes Doerfert	3f7a6ce0de	[DWARF][FIX] Handle the use of multiple registers gracefully Certain applications crashed for us with the AMDGPU backend. While this is not a proper fix it allows us to compile the code for now. I left a TODO for someone that understands DWARF. Differential Revision: https://reviews.llvm.org/D123717	2022-04-15 13:43:50 -05:00
Johannes Doerfert	1fb415fee9	[AMDGPU][FIX] Proper load-store-vectorizer result with opaque pointers The original code relied on the fact that we needed a bitcast instruction (for non constant base objects). With opaque pointers there might not be a bitcast. Always check if reordering is required instead. Fixes: https://github.com/llvm/llvm-project/issues/54896 Differential Revision: https://reviews.llvm.org/D123694	2022-04-15 13:42:46 -05:00
William S. Moses	9a8bb4bc63	[NFC] Update comments	2022-04-15 14:33:13 -04:00
Aaron Ballman	8fd3b5de3f	Fix an edge case in determining if a function has a prototype Given the declaration: typedef void func_t(unsigned); __attribute__((noreturn)) func_t func; we would incorrectly determine that `func` had no prototype because the `noreturn` attribute would convert the underlying type directly into a FunctionProtoType, but the declarator for `func` itself was not one for a function with a prototype. This adds an additional check for when the declarator is a type representation for a function with a prototype.	2022-04-15 14:04:07 -04:00
Zequan Wu	2f78f9455f	[LLDB][NativePDB] Fix subfield_register_simple_type.s test	2022-04-15 10:36:25 -07:00
Mogball	3430ae1e7b	[mlir] Update LICM to support Graph Regions Changes the algorithm of LICM to support graph regions (no guarantee of topologically sorted order). Also fixes an issue where ops with recursive side effects and regions would not be hoisted if any nested ops used operands that were defined within the nested region. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122465	2022-04-15 17:30:27 +00:00
Fangrui Song	04e094a336	[PGO] Remove legacy PM passes Legacy PM for optimization pipeline was deprecated in 13.0.0 and Clang dropped legacy PM support in D123609. This change removes legacy PM passes for PGO so that downstream projects won't be able to use it. It seems appropriate to start removing such "add-on" features like instrumentations, before we remove more stuff after 15.x is branched. I have checked many LLVM users and only ldc[1] uses the legacy PGO pass. [1]: https://github.com/ldc-developers/ldc/issues/3961 Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D123834	2022-04-15 10:26:43 -07:00
William S. Moses	ed499ddcda	[MLIR] Fix operation clone Operation clone is currently faulty. Suppose you have a block as follows: `(%x0 : i32) { %x1 = f(%x0) return %x1 }` The test case we have is that we want to "unroll" this, in which we want to change this to compute `f(f(x0))` instead of just `f(x0)`. We do so by making a copy of the body at the end of the block and setting the uses of the argument in the copied operations to the value returned from the original block. This is implemented as follows: 1) map the block arguments to the returned value (`map[x0] = x1`). 2) clone the body. Now for this small example, this works as intended and we get the following: `(%x0 : i32) { %x1 = f(%x0) %x2 = f(%x1) return %x2 }` This is because the current logic to clone `x1 = f(x0)` first looks up the arguments in the map (which finds `x0` maps to `x1` from the initialization), and then sets the map of the result to the cloned result (`map[x1] = x2`). However, this fails if `x0` is not an argument to the op, but instead used inside the region, like below: `(%x0 : i32) { %x1 = f() { yield %x0 } return %x1 }` This is because cloning an op currently first looks up the args (none), sets the map of the result (`map[%x1] = %x2`), and then clones the regions. This results in the following, which is clearly illegal: `(%x0 : i32) { %x1 = f() { yield %x0 } %x2 = f() { yield %x2 } return %x2 }` Diving deeper, this is partially due to the ordering (how this PR fixes it), as well as how region cloning works. Namely it will first clone with the mapping, and then it will remap all operands. Since the ordering above now has a map of `x0 -> x1` and `x1 -> x2`, we end up with the incorrect behavior here. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D122531	2022-04-15 13:09:13 -04:00
Peter Klausler	ca2be81e34	[flang] Fix Symbol::Rank for ProcEntityDetails When a procedure pointer or procedure dummy argument has a defined interface, the rank of the pointer (or dummy) is the rank of the interface. Also tweak code discovered in shape analysis when investigating this problem so that it returns a vector of emptied extents rather than std::nullopt when the extents are not scope-invariant, so that the rank can at least be known. Differential Revision: https://reviews.llvm.org/D123727	2022-04-15 09:54:48 -07:00
jfurtek	bed8212157	[mlir][ods][NFC] Move enum attribute definitions from OpBase.td to EnumAttr.td This diff moves `EnumAttr` tablegen definitions (specifically, `IntEnumAttr` and `BitEnumAttr`-related classes) from `OpBase.td` to `EnumAttr.td`. No functionality is changed. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123551	2022-04-15 16:51:14 +00:00
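
Commit 721651be24 above describes translating a target profile such as ps_6_1 into a triple component such as shadermodel6.1-pixel. The following is a rough Python sketch of that string translation only, not the actual tryParseProfile code in HLSL.cpp. The function names and the STAGE_NAMES table are hypothetical; the commit message itself only shows the ps_6_1 to shadermodel6.1-pixel example and a triple ending in -hull, so the other prefix mappings are assumptions.

```python
import re

# Hypothetical prefix-to-stage table. Only "ps" -> "pixel" is stated in the
# commit message; the remaining entries are assumptions for illustration.
STAGE_NAMES = {
    "ps": "pixel",
    "vs": "vertex",
    "gs": "geometry",
    "hs": "hull",
    "ds": "domain",
    "cs": "compute",
}

# Profiles have the shape shaderKind_major_minor, e.g. "ps_6_1".
PROFILE_RE = re.compile(r"([a-z]+)_(\d+)_(\d+)")


def profile_to_environment(profile):
    """Translate e.g. 'ps_6_1' into 'shadermodel6.1-pixel'.

    Returns None when the profile is malformed or the kind is unknown.
    """
    match = PROFILE_RE.fullmatch(profile)
    if match is None:
        return None
    kind, major, minor = match.groups()
    stage = STAGE_NAMES.get(kind)
    if stage is None:
        return None
    return f"shadermodel{major}.{minor}-{stage}"


def profile_to_triple(profile):
    """Build a full triple string in the dxil-unknown-... form, or None."""
    environment = profile_to_environment(profile)
    if environment is None:
        return None
    return f"dxil-unknown-{environment}"
```

Returning None for an unrecognized profile keeps the sketch small; a real driver would presumably report a diagnostic instead.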

421427 Commits

All Branches