llvm-project

Commit Graph

Author	SHA1	Message	Date
Amara Emerson	1ccebb18ef	[GlobalISel] Micro-optimize the conditional branch optimization. Convert a check into an assert and pass an MI instead of recomputing in the apply function.	2021-05-07 00:03:09 -07:00
KareemErgawy-TomTom	e4dee7e730	[MLIR][SPIRV] Properly (de-)serialize BranchConditionalOp. Implements proper (de-)serialization logic for BranchConditionalOp when such ops have true/false target operands. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D101602	2021-05-07 09:00:50 +02:00
Chen Zheng	a95473c563	[XCOFF] handle string constants generation for AIX This follows https://www.ibm.com/docs/en/aix/7.2?topic=constants-string Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D101280	2021-05-07 06:43:36 +00:00
Tobias Gysi	26e916334e	[mlir][linalg] Add IndexedGenericOp to GenericOp canonicalization. Replace all `linalg.indexed_generic` ops by `linalg.generic` ops that access the iteration indices using the `linalg.index` op. Differential Revision: https://reviews.llvm.org/D101612	2021-05-07 06:00:16 +00:00
Qiu Chaofan	f7294ac809	[PowerPC] Remove extra swap for extract+vperm on LE This is a simple fix on LE. On BE, vector shuffles are categorized into different ops. We may need more work to eliminate these in tablegen/pre-isel. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D101605	2021-05-07 13:48:08 +08:00
Yonghong Song	605c811d2b	BPF: fix FIELD_EXISTS relocation with array subscripts Lorenz Bauer reported an issue in bpf mailing list ([1]) where for FIELD_EXISTS relocation, if the object is an array subscript, the patched immediate is the object offset from the base address, instead of 1. Currently in BPF AbstractMemberAccess pass, the final offset from the base address is the patched offset except FIELD_EXISTS which is 1 unconditionally. In this particular case, the last data structure access is not a field (struct/union offset) so it didn't hit the place to set patched immediate to be 1. This patch fixed the issue by checking the relocation type. If the type is FIELD_EXISTS, just set to 1. Tested by modifying some bpf selftests, libbpf is okay with such types with FIELD_EXISTS relocation. [1] https://lore.kernel.org/bpf/CACAyw99n-cMEtVst7aK-3BfHb99GMEChmRLCvhrjsRpHhPrtvA@mail.gmail.com/ Differential Revision: https://reviews.llvm.org/D102036	2021-05-06 22:37:02 -07:00
Coelacanthus	e6cf3d6441	[TableGen] Use range-based for loops (NFC) Use range-based for loops in TableGen. Reviewed By: Paul-C-Anagnostopoulos Differential Revision: https://reviews.llvm.org/D101994	2021-05-07 13:34:03 +08:00
qixingxue	e388b9399b	[IR] Fix typo in comment of Intrinsics.td (NFC)	2021-05-07 13:21:58 +08:00
Bruno Cardoso Lopes	819e0d105e	[CGAtomic] Lift strong requirement for remaining compare_exchange combinations Follow up on `431e3138a` and complete the other possible combinations. Besides enforcing the new behavior, it also mitigates TSAN false positives when combining orders that used to be stronger.	2021-05-06 21:05:20 -07:00
MaheshRavishankar	05a89312d8	[mlir][Linalg] Allow folding to rank-zero tensor when using rank-reducing subtensors. The pattern to convert subtensor ops to their rank-reduced versions (by dropping unit-dims in the result) can also convert to a zero-rank tensor. Handle that case. This also fixes a OOB access bug in the existing pattern for such cases. Differential Revision: https://reviews.llvm.org/D101949	2021-05-06 19:03:55 -07:00
Jianzhou Zhao	87a6325fbe	[dfsan] Rename and fix an internal test issue for mmap+calloc The linker suggests using -Wl,-z,notext. Replaced assert by exit also fixed this. After renaming, interceptor.c would be used to test interceptors in general by D101204. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D101649	2021-05-07 00:57:21 +00:00
Cyndy Ishida	c4ed142e69	[llvm][TextAPI] add mapping from OS string to Platform * add utility for matching target triple OS value strings to PlatformKind This was reviewed offline by ributzka, steven_wu	2021-05-06 16:25:56 -07:00
Stanislav Mekhanoshin	c714d03785	[AMDGPU] Expose __builtin_amdgcn_perm for v_perm_b32 Differential Revision: https://reviews.llvm.org/D102022	2021-05-06 16:17:33 -07:00
Rob Suderman	d3e987c389	[mlir][tosa] Added div op, variadic concat. Removed placeholder. Spec v0.22 alignment. Nearly complete alignment to spec v0.22 - Adds Div op - Concat inputs now variadic - Removes Placeholder op Note: TF side PR https://github.com/tensorflow/tensorflow/pull/48921 deletes Concat legalizations to avoid breaking TensorFlow CI. This must be merged only after the TF PR has merged. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D101958	2021-05-06 15:55:58 -07:00
Jon Chesterfield	44ee974e2f	[libomptarget][nfc] Refactor amdgpu partial barrier to simplify adding a second one [libomptarget][nfc] Refactor amdgpu partial barrier to simplify adding a second one D101976 would require a second barrier instance. This NFC to amdgpu makes it simpler to add one (an extra global, one more line in init). Also renames the current barrier to L0. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102016	2021-05-06 23:52:19 +01:00
Amy Zhuang	5dc1ed3f62	[mlir] Update dstNode after DenseMap insertion in loop fusion pass. Reviewed By: vinayaka-polymage Differential Revision: https://reviews.llvm.org/D101794	2021-05-06 15:23:59 -07:00
Malhar Jajoo	9ff38e2d9d	[ARM] Transforming memcpy to Tail predicated Loop This patch converts llvm.memcpy intrinsic into Tail Predicated Hardware loops for a target that supports the Arm M-profile Vector Extension (MVE). From an implementation point of view, the patch - adds an ARM specific SDAG Node (to which the llvm.memcpy intrinsic is lowered to, during first phase of ISel) - adds a corresponding TableGen entry to generate a pseudo instruction, with a custom inserter, on matching the above node. - Adds a custom inserter function that expands the pseudo instruction into MIR suitable to be (by later passes) into a WLSTP loop. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D99723	2021-05-06 23:21:28 +01:00
Jon Chesterfield	7e9351b9de	[libomptarget][amdgpu][nfc] Remove dead code from amdgpu plugin [libomptarget][amdgpu][nfc] Remove dead code from amdgpu plugin Drops an enum that was identical to a HSA one, localises some functions where they were only called from one TU. Covers everything internalize + adce can identify as dead, except for msgpack::dump which is useful when debugging. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102014	2021-05-06 23:16:32 +01:00
Lei Zhang	41bc54cc56	[mlir][spirv] NFC: Replace OwningSPIRVModuleRef with OwningOpRef Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D102009	2021-05-06 17:17:44 -04:00
Jim Ingham	72ba78c29e	When SendContinuePacketAndWaitForResponse returns eStateInvalid, don't fetch more packets. This looks like just an oversight in the AsyncThread function. It gets a result of eStateInvalid, and then marks the process as exited, but doesn't set "done" to true, so we go to fetch another event. That is not safe, since you don't know when that extra packet is going to arrive. If it arrives while you are tearing down the process, the internal-state-thread might try to handle it when the process in not in a good state. Rather than put more effort into checking all the shutdown paths to make sure this extra packet doesn't cause problems, just don't fetch it. We weren't going to do anything useful with it anyway. The main part of the patch is setting "done = true" when we get the eStateInvalid. I also added a check at the beginning of the while(done) loop to prevent another error from getting us to fetch packets for an exited process. I added a test case to ensure that if an Interrupt fails, we call the process exited. I can't test exactly the error I'm fixing, there's no good way to know that the stop reply for the failed interrupt wasn't fetched. But at least this asserts that the overall behavior is correct. Differential Revision: https://reviews.llvm.org/D101933	2021-05-06 14:11:42 -07:00
Aaron Puchert	d21e1b79ff	Thread safety analysis: Eliminate parameter from intersectAndWarn (NFC) We were modifying precisely when intersecting the lock sets of multiple predecessors without back edge. That's no coincidence: we can't modify on back edges, it doesn't make sense to modify at the end of a function, and otherwise we always want to intersect on forward edges, because we can build a new lock set for those. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D101755	2021-05-06 23:07:42 +02:00
LLVM GN Syncbot	fca10c8808	[gn build] Port `83af66e18e`	2021-05-06 21:03:05 +00:00
Frank Derry Wanye	83af66e18e	new altera ID dependent backward branch check This lint check is a part of the FLOCL (FPGA Linters for OpenCL) project out of the Synergy Lab at Virginia Tech. FLOCL is a set of lint checks aimed at FPGA developers who write code in OpenCL. The altera ID dependent backward branch lint check finds ID dependent variables and fields used within loops, and warns of their usage. Using these variables in loops can lead to performance degradation.	2021-05-06 17:01:39 -04:00
Alex Hoppen	a3a8a1a15b	[Index] Ignore nullptr decls for indexing We can end up with a call to `indexTopLevelDecl(D)` with `D == nullptr` in non-assert builds e.g. when indexing a module in `indexModule` and - `ASTReader::GetDecl` returns `nullptr` if `Index >= DeclsLoaded.size()`, thus returning `nullptr` => `ModuleDeclIterator::operator*` returns `nullptr` => we call `IndexCtx.indexTopLevelDecl` with `nullptr` Be resilient and just ignore the `nullptr` decls during indexing. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D102001	2021-05-06 13:12:26 -07:00
River Riddle	6304c0836a	[mlir] Store the flag for dynamic operand storage in the low bits It is currently stored in the high bits, which is disallowed on certain platforms (e.g. android). This revision switches the representation to use the low bits instead, fixing crashes/breakages on those platforms. Differential Revision: https://reviews.llvm.org/D101969	2021-05-06 12:45:35 -07:00
Sanjay Patel	fefcb1f878	[PassManager] add helper function to hold set of vector passes This is no-functional-change-intended (NFC) and split off from D102002 (which proposes to eliminate the LTO-based differences).	2021-05-06 15:36:15 -04:00
Mircea Trofin	97ab068034	[NPM] Do not run function simplification pipeline unnecessarily The CGSCC pass manager interplay with the FunctionAnalysisManagerCGSCCProxy is 'special' in the sense that the former will rerun the latter if there are changes to a SCC structure; that being said, some of the functions in the SCC may be unchanged. In that case, the function simplification pipeline will be re-run, which impacts compile time[1]. This patch allows the function simplification pipeline be skipped if it was already run and the function was not modified since. The behavior is currently disabled by default. This is because, currently, the rerunning of the function simplification pipeline on an unchanged function may still result in changes. The patch simplifies investigating and fixing those cases where repeated function pass runs do actually positively impact code quality, while offering an easy workaround for those impacted negatively by compile time regressions, and not impacting mainline scenarios. [1] A [[ http://llvm-compile-time-tracker.com/compare.php?from=eb37d3546cd0c6e67798496634c45e501f7806f1&to=ac722d1190dc7bbdd17e977ef7ec95e69eefc91e&stat=instructions \| compile time tracker ]] run with the option enabled. Differential Revision: https://reviews.llvm.org/D98103	2021-05-06 12:24:33 -07:00
Craig Topper	191ffda3f7	[RISCV] Remove unused ComplexPatterns. NFC	2021-05-06 12:17:41 -07:00
Arnamoy Bhattacharyya	a40b609958	[flang][OpenMP] Add semantic check for occurrence of constructs nested inside a SIMD region Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D99757	2021-05-06 15:09:51 -04:00
Petr Hosek	8cb191b724	[Fuchsia][CMake] Update OSX deployment target Use correct spelling of CMAKE_OSX_DEPLOYMENT_TARGET and bump the minimum version to 10.13 which matches what we use for host tools in Fuchsia. Differential Revision: https://reviews.llvm.org/D102013	2021-05-06 12:06:16 -07:00
Xing Xue	8408d3f2d8	[libunwind] NFC: Use macros to accommodate differences in representation of PowerPC assemblers Summary: This NFC patch replaces the representation of registers and the left shift operator in the PowerPC assembly code to allow it to be consumed by the GNU flavored assembler and the AIX assembler. * Registers - change the representation of PowperPC registers from %rn, %fn, %vsn, and %vrn to the register number alone, e.g., n. The GNU flavored assembler and the AIX assembler are able to determine the register kind based on the context of the instruction in which the register is used. * Left shift operator - use macro PPC_LEFT_SHIFT to represent the left shift operator. The left shift operator in the AIX assembly language is < instead of << Reviewed by: sfertile, MaskRay, compnerd Differential Revision: https://reviews.llvm.org/D101179	2021-05-06 14:33:38 -04:00
Craig Topper	a577d59db2	[RISCV] Minor vector instruction tablegen cleanup. NFC Use result_type for the IMPLICIT_DEF in masked vector patterns. This doesn't matter today because result_type and op_type are always the same. Use multiclass inheritance to reduce repeated code.	2021-05-06 11:23:59 -07:00
peter klausler	6a1c3efa05	[flang] Implement NAMELIST I/O in the runtime Add InputNamelist and OutputNamelist as I/O data transfer APIs to be used with internal & external list-directed I/O; delete the needless original namelist-specific Begin... calls. Implement NAMELIST output and input; add basic tests. Differential Revision: https://reviews.llvm.org/D101931	2021-05-06 11:18:36 -07:00
Fangrui Song	306370be0b	[AArch64] Fix namespace issue. NFC	2021-05-06 11:16:07 -07:00
peter klausler	4f41994c13	[flang] Fix race condition in runtime The code that initializes the default units 5 & 6 had a race condition that would allow threads access to the unit map before it had been populated. Also add some missing calls to va_end() that will never be called (they're in program abort situations) but might elicit warnings if absent. Differential Revision: https://reviews.llvm.org/D101928	2021-05-06 11:09:30 -07:00
Matthew Voss	22aece57be	Allow llvm-dis to disassemble multiple files Differential Revision: https://reviews.llvm.org/D101110	2021-05-06 11:08:55 -07:00
peter klausler	199a623ebf	[flang] Runtime must defer formatted/unformatted determination What the Fortran standard calls "preconnected" external I/O units might not be known to be connected to unformatted or formatted files until the first I/O data transfer statement is executed. Support this deferred determination by representing the flag as a tri-state Boolean and adapting its points of use. Differential Revision: https://reviews.llvm.org/D101929	2021-05-06 11:06:43 -07:00
Arthur Eubanks	642df18f14	[gn build] Support compiler-rt/profile on Windows Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D101961	2021-05-06 10:20:52 -07:00
thomasraoux	71eb32d97e	[mlir][vector] Fix typo	2021-05-06 10:12:31 -07:00
thomasraoux	52525cb20f	[mlir][linalg][NFC] Make reshape folding control more fine grain This expose a lambda control instead of just a boolean to control unit dimension folding. This however gives more control to user to pick a good heuristic. Folding reshapes helps fusion opportunities but may generate sub-optimal generic ops. Differential Revision: https://reviews.llvm.org/D101917	2021-05-06 10:11:39 -07:00
Thomas Lively	b198b9b897	[WebAssembly] Fix argument types in SIMD narrowing intrinsics The builtins were updated to take signed parameters in `627a526955`, but the intrinsics that use those builtins were not updated as well. The intrinsic test did not catch this sign mismatch because it is only reported as an error under -fno-lax-vector-conversions. This commit fixes the type mismatch and adds -fno-lax-vector-conversions to the test to catch similar problems in the future. Differential Revision: https://reviews.llvm.org/D101979	2021-05-06 10:07:45 -07:00
Stefan Pintilie	f0adf3a24c	[PowerPC][LLD] Make sure that the correct Thunks are used. This fixes an issue where mixed TOC / NOTOC calls can call the incorrect thunks if a previous thunk already exists. The issue appears when a TOC funciton calls a NOTOC callee and then a different NOTOC function calls the same NOTOC callee. In this case the linker would sometimes incorrectly call the same thunk for both cases. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101837	2021-05-06 12:00:04 -05:00
Craig Topper	6660319cef	[RISCV] Remove unused RISCV::VLEFF and VLEFF_MASK. NFC Looks like these got left behind when vleff isel was moved to X86ISelDAGToDAG.cpp	2021-05-06 09:41:29 -07:00
Hubert Tong	e2d774a3db	[AIX][Test][ORC] Skip unsupported ORC C API tests on AIX As mentioned before in D78813, currently the XCOFF backend does not support writing 64-bit object files, which the ORC JIT tests will try to exercise if we are on AIX. This patch disables the tests on AIX for now. This is consistent with what's been done, for example, regarding `armv7`. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D101971	2021-05-06 12:36:56 -04:00
Denys Shabalin	1f109f9d9c	Fix array attribute in bindings for linalg.init_tensor Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D101998	2021-05-06 18:25:59 +02:00
Jonas Paulsson	1c4cb510b4	[SystemZ] Don't use libcall for 128 bit shifts. Expand 128 bit shifts instead of using a libcall. This patch removes the 128 bit shift libcalls and thereby causes ExpandShiftWithUnknownAmountBit() to be called. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D101993	2021-05-06 18:14:41 +02:00
Craig Topper	58323be415	[RISCV] Cleanup instruction formats used for B extension ternary operations. Rename RVInstR4 as used by F/D/Zfh extensions to RVInstR4Frm. Introduce new RVInstR4 that takes funct3 as a parameter. Add new format classes for FSRI and FSRIW instead of trying to bend RVInstR4 to use a shamt overlayed on rs2 and funct2. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D100427	2021-05-06 08:59:05 -07:00
Fraser Cormack	2e0ee68dc8	[LangRef][VP] Fix typos in VP sdiv/udiv examples	2021-05-06 16:37:18 +01:00
David Goldman	159dd447fe	[clangd][ObjC] Highlight Objc Ivar refs Treat them just like we do for properties - as a `property` semantic token although ideally we could differentiate the two. Differential Revision: https://reviews.llvm.org/D101785	2021-05-06 11:41:49 -04:00
Stanislav Mekhanoshin	28f1d018b1	[AMDGPU] Fix 64 bit DPP validation AMDGPUAsmParser::isSupportedDPPCtrl() was failing to correctly find a DPP register operand, regadless of the position it is always src0. Moved this check into a new validateDPP() method where we have full instruction already. In particular it was failing to reject this case: v_cvt_u32_f64 v5, v[0:1] quad_perm:[0,2,1,1] row_mask:0xf bank_mask:0xf Essentially it was broken for any case where size of dst and src0 differ. It also improves the diagnostics with a proper error message. The check in the InstPrinter also drops verification of the dst register as it does not have anything to do with the dpp operand. Differential Revision: https://reviews.llvm.org/D101930	2021-05-06 08:40:26 -07:00

1 2 3 4 5 ...

387752 Commits All Branches Search

387752 Commits

All Branches