The index type does not have a bitwidth, and hence the size of the corresponding allocations cannot be computed. Instead, the promotion pass now has an explicit option to specify the size of the index type.
Differential Revision: https://reviews.llvm.org/D91360
This exposes a hook to configure legality of operations such that only
`scf.parallel` operations that have mapping attributes are marked as
illegal. Consequently, the transformation can now also be applied to
mixed forms.
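A minimal sketch (mine, not from the patch) of what such a legality hook looks like with MLIR's dialect conversion framework; the "mapping" attribute name is an assumption here:
```
// Only scf.parallel ops that carry a mapping attribute are illegal and
// must be converted; unmapped loops in mixed forms stay legal.
ConversionTarget target(*context);
target.addDynamicallyLegalOp<scf::ParallelOp>([](scf::ParallelOp op) {
  return !op.getOperation()->hasAttr("mapping");
});
```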
Differential Revision: https://reviews.llvm.org/D91340
A recent refactoring removed the need to interleave verifier passes, opting to verify during the normal execution of passes instead. As such, the old verify pass is no longer necessary and can be removed.
Differential Revision: https://reviews.llvm.org/D91212
This revision adds support in the parser/printer for "deferrable" aliases, i.e. those that can be resolved after printing has finished. This allows for printing aliases for operation locations after the module instead of before, i.e. this is now supported:
```
"foo.op"() : () -> () loc(#loc)
#loc = loc("some_location")
```
Differential Revision: https://reviews.llvm.org/D91227
On x86_64, when you hit a __builtin_debugtrap instruction, you
can continue past this in the debugger. This patch has debugserver
recognize the specific instruction used for __builtin_debugtrap
and advance the pc past it, so that the user can continue execution
once they've hit one of these.
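For reference, a minimal example of the builtin in question (example mine, not from the patch):
```
#include <cstdio>

int main() {
  printf("before trap\n");
  __builtin_debugtrap(); // debugserver now advances the pc past this
  printf("after trap\n"); // reached after "continue" in the debugger
  return 0;
}
```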
In the patch discussion, we were in agreement that it would be better
to have this knowledge up in lldb instead of depending on each
stub rewriting the pc behind the debugger's back, but that's a
larger scale change for another day.
<rdar://problem/65521634>
Differential revision: https://reviews.llvm.org/D91238
The new option `-fproc-stat-info=<file>` can be used to generate a report
about the memory usage and execution time of each stage of compilation.
Documentation for this option can be found in `UserManual.rst`. The
option can be used in parallel builds.
Differential Revision: https://reviews.llvm.org/D78903
This removes the need to have an explicit `cast<>`, given that we always know the value is an instance of the interface.
Differential Revision: https://reviews.llvm.org/D91304
Some users have native C++ data types that correspond to floating point types stored within a DenseElementsAttr for which there is no builtin C++ data type (e.g. bfloat16/half/etc.). This revision allows such users to use those native types directly, and removes the need to go through APFloat when the much faster native-value path is available.
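A hedged sketch of the intended use; the `bfloat16` type, its constructor, and `process` are stand-ins for a user's own definitions, and the trait hookup that registers the type with the attribute is elided:
```
// Assumes `bfloat16` is a user-defined 16-bit float type registered as
// a valid native type for BF16 elements.
SmallVector<bfloat16, 8> data(8, bfloat16(1.0f));
auto type = RankedTensorType::get({8}, builder.getBF16Type());
auto attr = DenseElementsAttr::get(type, llvm::makeArrayRef(data));
for (bfloat16 v : attr.getValues<bfloat16>()) // native path, no APFloat
  process(v);
```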
Differential Revision: https://reviews.llvm.org/D91402
This patch aims to improve support for out-of-process JITing using OrcV2. It
introduces two new class templates, OrcRPCTargetProcessControlBase and
OrcRPCTPCServer, which together implement the TargetProcessControl API by
forwarding operations to an execution process via an Orc-RPC Endpoint. These
utilities are used to implement out-of-process JITing from llvm-jitlink to
a new llvm-jitlink-executor tool.
This patch also breaks the OrcJIT library into three parts:
-- OrcTargetProcess: Contains code needed by the JIT execution process.
-- OrcShared: Contains code needed by the JIT execution and compiler
processes.
-- OrcJIT: Everything else.
This break-up allows JIT executor processes to link against OrcTargetProcess
and OrcShared only, without having to link in all of OrcJIT. Clients executing
JIT'd code in-process should start linking against OrcTargetProcess as well as
OrcJIT.
In the near future these changes will enable:
-- Removal of the OrcRemoteTargetClient/OrcRemoteTargetServer class templates
which provided similar functionality in OrcV1.
-- Restoration of Chapter 5 of the Building-A-JIT tutorial series, which will
serve as a simple usage example for these APIs.
-- Implementation of lazy, cross-target compilation in lli's -jit-kind=orc-lazy
mode.
This option was in a rather convoluted place, causing global parameters
to be set in awkward and undesirable ways to try to account for it
indirectly. Add tests for the -disable-debug-info option and ensure we
don't print unintended markers from unintended places.
Reviewed By: dstenb
Differential Revision: https://reviews.llvm.org/D91083
This was a mistake introduced in D91294. I'm not sure how to
exercise this with the existing code, but I hit it while trying
some follow up experiments.
We can't store garbage in the unused bits. It is possible that something like a zextload from i1/i2/i4 is created to read the memory. Those zextloads would be legalized assuming the extra bits are 0.
I'm not sure that the code in lowerStore is executed for the v1i1/v2i1/v4i1 case. It looks like the DAG combine in combineStore may have converted them to v8i1 first. And I think we're missing some cases to avoid going to the stack in the first place. But I don't have time to investigate those things at the moment so I wanted to focus on the correctness issue.
Should fix PR48147.
Reviewed By: RKSimon
Differential Revision: https://reviews.llvm.org/D91294
If we cannot prove that the check is trivially true, but can prove that it either
fails on the 1st iteration or never fails, we can replace it with a first-iteration check.
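As an illustration (example mine, not from the patch), consider a check that is monotone in the induction variable:
```
#include <cassert>

void writeZeros(unsigned *a, unsigned n, unsigned lower) {
  for (unsigned i = 0; i < n; ++i) {
    // With i counting up, `i >= lower` can only fail when i == 0; if it
    // holds on the 1st iteration, it holds on every later one.
    assert(i >= lower && "range check");
    a[i] = 0;
  }
  // After the transform (conceptually), the check runs once:
  //   if (n != 0) assert(0u >= lower && "range check");
}
```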
Differential Revision: https://reviews.llvm.org/D88527
Reviewed By: skatkov
07f1047f41 changed the CMake detection to use find_package(Python3 ...)
but didn't update the lit configuration to use the expected Python3_EXECUTABLE
CMake variable to point at the interpreter path.
This resulted in an empty path on macOS.
Currently the affinity format string has an initial (default) value. When users set
the format via OMP_AFFINITY_FORMAT, it overwrites the format string. However,
when copying the format, the trailing null is missing. As a result, if the user's
format string is shorter than the default value, the remaining part of the default
value still takes effect. This bug was not exposed because the test case doesn't
check the end of the string; it only checks whether the given output "contains" the
check string.
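A minimal sketch of the bug class (code mine, not the runtime's):
```
#include <cstdio>
#include <cstring>

int main() {
  // Stands in for the built-in default affinity format string.
  char fmt[64] = "default-format-%P-%i";
  const char *user = "%i"; // shorter user-provided OMP_AFFINITY_FORMAT
  // Bug pattern: copying the characters but not the trailing null
  // leaves the tail of the longer default string in place.
  memcpy(fmt, user, strlen(user));
  printf("%s\n", fmt); // prints "%ifault-format-%P-%i", not "%i"
  return 0;
}
```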
Reviewed By: AndreyChurbanov
Differential Revision: https://reviews.llvm.org/D91309
- If an aggregate argument is indirectly accessed within kernels, direct
passing results in an unpromotable `alloca`, which degrades performance
significantly. The InferAddrSpace pass is enhanced in
[D91121](https://reviews.llvm.org/D91121) to assume that generic
pointers loaded from constant memory can be regarded as global ones.
This mitigates the need for coercion on aggregate arguments.
Differential Revision: https://reviews.llvm.org/D89980
There might be some demanded/known bits way to generalize this,
but I'm not seeing it right now.
This came up as a regression when I was looking at a different
demanded bits improvement.
https://rise4fun.com/Alive/5fl
```
Name: general
Pre: ((-1 << countTrailingZeros(C1)) & C2) == 0
%a1 = add i8 %x, C1
%a2 = and i8 %x, C2
%r = sub i8 %a1, %a2
=>
%r = and i8 %a1, ~C2

Name: test 1
%a1 = add i8 %x, 192
%a2 = and i8 %x, 10
%r = sub i8 %a1, %a2
=>
%r = and i8 %a1, -11

Name: test 2
%a1 = add i8 %x, -108
%a2 = and i8 %x, 3
%r = sub i8 %a1, %a2
=>
%r = and i8 %a1, -4
```
- Move isSupportedMemRefType() to ConvertToLLVMPatterns and check if the
memref element type is supported there.
Differential Revision: https://reviews.llvm.org/D91374
Display null pointers as `nullptr`, `nil` and `NULL` for C++,
Objective-C/Objective-C++ and C respectively. The original motivation
for this patch was to display a null std::string pointer as nullptr
instead of "", but the fix seemed generic enough to be done for all
summary providers.
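For instance (illustration mine, based on the motivation above):
```
#include <string>

int main() {
  std::string *s = nullptr;
  (void)s;
  // Before: the summary for s showed "".
  // Now: a C++ frame displays the pointer as nullptr (nil for
  // Objective-C/Objective-C++ frames, NULL for C frames).
  return 0;
}
```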
Differential revision: https://reviews.llvm.org/D77153
The previous code defined it as allocating a new memref for its result.
However, this is not how the dialect conversion framework treats it:
the framework does the equivalent of inserting and folding it away
internally (even independent of any canonicalization patterns that we
have defined).
The semantics as they were previously written were also very
constraining: Nontrivial analysis is needed to prove that the new
allocation isn't needed for correctness (e.g. to avoid aliasing).
By removing those semantics, we avoid losing that information.
Differential Revision: https://reviews.llvm.org/D91382
We lower them to a std.global_memref (uniqued by constant value) + a
std.get_global_memref to produce the corresponding memref value.
This allows removing Linalg's somewhat hacky lowering of tensor
constants, now that std properly supports this.
Differential Revision: https://reviews.llvm.org/D91306
The bufferization of subtensor_insert was incorrect in the presence of
a tensor argument with multiple uses: it was writing into a converted
memref operand, but there is no guarantee that the converted memref for
that operand is safe to write into. In this case, the same converted
memref is written to in-place by the subtensor_insert bufferization,
violating the tensor-level semantics.
I left some comments in a TODO about ways forward on this. I will be
working actively on this problem in the coming days.
Differential Revision: https://reviews.llvm.org/D91371
Select the following:
- G_SELECT cc, 0, 1 -> CSINC zreg, zreg, cc
- G_SELECT cc, 0, -1 -> CSINV zreg, zreg, cc
- G_SELECT cc, 1, f -> CSINC f, zreg, inv_cc
- G_SELECT cc, -1, f -> CSINV f, zreg, inv_cc
- G_SELECT cc, t, 1 -> CSINC t, zreg, cc
- G_SELECT cc, t, -1 -> CSINV t, zreg, cc
(IR example: https://godbolt.org/z/YfPna9)
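For reading the table above, the relevant instruction semantics are (reference sketch mine):
```
#include <cstdint>

// CSINC d, a, b, cc  :  d = cc ? a : b + 1
// CSINV d, a, b, cc  :  d = cc ? a : ~b
// With b tied to the zero register (zreg), b + 1 == 1 and ~b == -1,
// which is why G_SELECT with constant 1/-1 operands maps onto these.
uint32_t csinc(bool cc, uint32_t a, uint32_t b) { return cc ? a : b + 1; }
uint32_t csinv(bool cc, uint32_t a, uint32_t b) { return cc ? a : ~b; }
```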
These correspond to a bunch of the AArch64csel patterns in AArch64InstrInfo.td.
Unfortunately, it doesn't seem like we can import patterns that use NZCV like
those ones do. E.g.
```
def : Pat<(AArch64csel GPR32:$tval, (i32 1), (i32 imm:$cc), NZCV),
          (CSINCWr GPR32:$tval, WZR, (i32 imm:$cc))>;
```
So we have to manually select these for now.
This replaces `selectSelectOpc` with an `emitSelect` function, which performs
these optimizations.
Differential Revision: https://reviews.llvm.org/D90701
When parsing DWARF and laying out bit-fields, we don't properly take into account that, when they are in a union, they all have a zero offset.
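For example (illustration mine):
```
// In a union, every bit-field starts at bit offset 0 and overlaps the
// others, unlike struct bit-fields, whose offsets accumulate.
union U {
  unsigned a : 3; // bit offset 0
  unsigned b : 5; // also bit offset 0, overlapping a
};
```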
Differential Revision: https://reviews.llvm.org/D91118