llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Springer	ebf35370ff	[mlir][tensor] Insert explicit tensor.cast ops for insert_slice src If additional static type information can be deduced from an insert_slice's size operands, insert an explicit cast of the op's source operand. This enables other canonicalization patterns that are matching for tensor_cast ops such as `ForOpTensorCastFolder` in SCF. Differential Revision: https://reviews.llvm.org/D108617	2021-08-24 19:45:04 +09:00
Matthias Springer	0c36082963	[mlir][SCF] Use symbols in loop peeling rewrite Use symbols in the affine map instead of dims. Dims should not be divided. Differential Revision: https://reviews.llvm.org/D108431	2021-08-24 19:39:19 +09:00
Krasimir Georgiev	befb9dc369	[clang-format] keep TypeScript argument decorators in line As a follow-up from https://reviews.llvm.org/D108538, ensure TypeScript argument decorators are kept in line with the argument. Reviewed By: MyDeveloperDay Differential Revision: https://reviews.llvm.org/D108620	2021-08-24 12:37:08 +02:00
Dawid Jurczak	2e8534beb2	[NFC][LoopIdiom] Add reproducer of wrong memmove transformation That's precommit test for D107964. Differential Revision: https://reviews.llvm.org/D108537	2021-08-24 12:00:24 +02:00
Ricky Taylor	47f52f989b	[M68k][AsmParser] Support parsing register masks & fix printing them Fixes PR51580. Register masks will now be printed as 'movem.l (%sp), %a0-%a5/%d5' for example and can now be parsed in the same format. Previously the printed syntax was 'movem.l (%sp), %a0-%a5,%d', which didn't match prior art and was too ambiguous to easily parse. Differential Revision: https://reviews.llvm.org/D108597	2021-08-24 10:40:02 +01:00
Jeremy Morse	992e21eeee	[DebugInfo][InstrRef] Fix over-droppage of locations in X86FloatingPoint Over in D105657, we started dropping instruction numbers (that become variable locations) from call instructions, as we can't correctly represent the x87 FP stack. Unfortunately, it turns out that the "special FP instructions" that this pass transforms includes "every call instruction" [0]. Thus, we've ended up dropping all return values from all calls. Ouch. This patch adds a filter: only drop instruction numbers from calls if they return something on the FP stack. Seeing how LLVM only allows a single return value, this should drop instruction numbers on anything that returns a float, and nothing else. Rather than writing a new test, I've modified the original one to have a positive and negative case: drop instruction number on a call with an FP-stack modification, keep it on a plain call. Differential Revision: https://reviews.llvm.org/D108580	2021-08-24 10:24:07 +01:00
Petar Avramovic	2bf4eeeeb6	[GlobalISel] Avoid creating COPY in LegalizationArtifactCombiner When Src and Dst used in buildAnyExtOrTrunc or buildSExtOrTrunc have the same type (creates COPY) use Src register directly or use replaceRegOrBuildCopy instead. Differential Revision: https://reviews.llvm.org/D108306	2021-08-24 11:09:56 +02:00
Jingu Kang	b52171629f	[GVN] Execute performLoopLoadPRE ahead of PerformLoadPRE Differential Revision: https://reviews.llvm.org/D108204	2021-08-24 09:50:27 +01:00
Cullen Rhodes	e9c8973f1c	[AArch64][SME] Fix v8.6a bf16 NEON instruction predication In streaming mode on SME targets only the scalar BFCVT armv8.6-a instruction is legal, predicate the illegal instructions on NEON to disable them in streaming mode (see D107902). BFCVT is predicated on HasNEONorStreamingSVE. The reference can be found here: https://developer.arm.com/documentation/ddi0602/2021-06/SIMD-FP-Instructions Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D108279	2021-08-24 08:13:57 +00:00
Michael Kruse	cdbc86dd22	[Polly] Don't redundantly link libPolly into unittests. With LLVM_LINK_LLVM_DYLIB and LLVM_POLLY_LINK_INTO_TOOLS, Polly is already linked into libLLVM.so, linking libPolly.a as well into unittests results in duplicate command line registration errors.	2021-08-24 03:07:30 -05:00
Martin Storsjö	039b469b85	[ARM] Allow using ';' as asm statement separator in MSVC mode This does the same as D96259, but for ARM, just like AArch64, using the same comment char as for ELF and MinGW mode. As the assembly input/output of LLVM is GAS style, trying to match what MS armasm.exe does isn't needed (because the comment char used is the least concern when it comes to that; all directives differ too). If a separate armasm compatible mode is implemented, it can use its own comment style (just like llvm-ml implements MS ml.exe compatible assembly parsing). This fixes building compiler-rt assembly files for ARM in MSVC mode. The updated testcase literals-comments.s was only intended to make sure that '#' isn't interpreted as a comment char. Differential Revision: https://reviews.llvm.org/D107251	2021-08-24 11:01:49 +03:00
Pushpinder Singh	9b8b7c1180	[AMDGPU][Libomptarget] Delete g_atl_machine global With uses of g_atl_machine gone, a significant portion of dead code has been removed. This patch depends on D104691 and D104695. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D104696	2021-08-24 07:59:40 +00:00
Zhouyi Zhou	1f8602e16e	[clang] NFC: remove superfluous braces In commit `9bb33f572f`, a pair of superfluous braces was introduced into the function Sema::BuildDeclarationNameExpr. This patch removes the superfluous braces. Also use clang-format to further tidy the above function. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D108609	2021-08-24 15:42:50 +08:00
Anton Afanasyev	bed587631f	[AggressiveInstCombine] Add arithmetic shift right instr to `TruncInstCombine` DAG Add `ashr` instruction to the DAG post-dominated by `trunc`, allowing `TruncInstCombine` to reduce bitwidth of expressions containing these instructions. We should be shifting by less than the target bitwidth. Also it is sufficient to require that all truncated bits of the value-to-be-shifted are sign bits (all zeros or ones) and one sign bit is left untruncated: https://alive2.llvm.org/ce/z/Ajo2__ Part of https://reviews.llvm.org/D107766 Differential Revision: https://reviews.llvm.org/D108355	2021-08-24 10:41:16 +03:00
Lang Hames	8614cb9f99	[ORC-RT] Add non-const WrapperFunctionResult data access, simplify allocate. WrapperFunctionResult no longer supports wrapping constant data, so this patch provides direct non-const access to the wrapped data. Since wrapped data can now be written, the WrapperFunctionResult::allocate method can be simplified to return a WrapperFunctionResult. This is essentially the same change (and with the same motivation) as LLVM commit `8b117830b1`, but applied to the ORC runtime's WrapperFunctionResult code.	2021-08-24 17:34:59 +10:00
Anton Afanasyev	280a0b735f	[Test][AggressiveInstCombine] Modify shift tests Add `sext` for `ashr`, remove unrelated tests	2021-08-24 10:30:27 +03:00
Lang Hames	8b117830b1	[ORC] Add non-const WrapperFunctionResult::data method, simplify allocate. WrapperFunctionResult no longer supports wrapping constant data, so this patch adds a non-const data method. Since data can now be written through the data method, the allocate method can be simplified to return a WrapperFunctionResult.	2021-08-24 17:15:37 +10:00
Pushpinder Singh	07e85823aa	[OpenMP][AMDGCN] Enable complex functions This patch enables basic complex functionality using the ocml builtins. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D108552	2021-08-24 12:40:41 +05:30
Liu, Chen3	b7795eb646	[X86] Building constant vector whose element type is half will cause assertion failure. Fix the assertion failure when building a constant vector whose element type is half. Differential Revision: https://reviews.llvm.org/D108612	2021-08-24 14:34:30 +08:00
Omar Emara	292f013395	[LLDB][GUI] Handle extra navigation keys in forms This patch handles the up and down keys if they weren't handled by the selected field. Moreover, it makes sure the form always absorbs the key to take full control until the form is canceled or submitted. Differential Revision: https://reviews.llvm.org/D108414	2021-08-23 22:42:57 -07:00
Kazu Hirata	cb661df58f	[IPO] Remove DisableTailCalls (NFC) The last use was removed on Jun 9, 2015 in commit `d9699bc7bd`.	2021-08-23 22:20:23 -07:00
Omar Emara	945cde8b6a	[LLDB][GUI] Add submit form key combination This patch adds a new Alt+Enter key combination to form windows. Once invoked, the first action is executed without having to navigate to its button. Field exit callbacks are now also invoked on validation to support this key combination. One concern for this key combination is its potential use by the window manager of the host. I am not sure if this will be a problem, but it is worth taking into consideration. Differential Revision: https://reviews.llvm.org/D108410	2021-08-23 22:07:04 -07:00
Petr Hosek	8111f2f7ee	[profile] Update counter offset to account for binary ids In the raw profile, binary ids immediately follow the header so when computing counters offset we need to account for the new section. Differential Revision: https://reviews.llvm.org/D108608	2021-08-23 21:43:30 -07:00
Omar Emara	d95d2a8e4a	[LLDB][GUI] Add extra keys to text field This patch adds many new keys to the text field and implements new behaviors as follows: ``` case KEY_HOME: case KEY_CTRL_A: MoveCursorToStart(); case KEY_END: case KEY_CTRL_E: MoveCursorToEnd(); case KEY_RIGHT: case KEY_SF: MoveCursorRight(); case KEY_LEFT: case KEY_SR: MoveCursorLeft(); case KEY_BACKSPACE: case KEY_DELETE: RemovePreviousChar(); case KEY_DC: RemoveNextChar(); case KEY_EOL: case KEY_CTRL_K: ClearToEnd(); case KEY_DL: case KEY_CLEAR: Clear(); ``` This patch also refactors scrolling to be dynamic at draw time for easier handling. Differential Revision: https://reviews.llvm.org/D108385	2021-08-23 21:16:48 -07:00
Wang, Pengfei	c728bd5bba	[X86] AVX512FP16 instructions enabling 5/6 Enable FP16 FMA instructions. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105268	2021-08-24 09:07:19 +08:00
Philip Reames	96ef794fd0	[SCEV] Add a hasFlags utility to improve readability [NFC]	2021-08-23 17:36:52 -07:00
Jessica Paquette	2ec2b25fba	[AArch64][GlobalISel] Select @llvm.aarch64.neon.ld2.* This is pretty similar to the ST2 selection code in `AArch64InstructionSelector::selectIntrinsicWithSideEffects`. This is a GISel equivalent of the ld2 case in `AArch64DAGToDAGISel::Select`. There's some weirdness there that appears here too (e.g. using ld1 for scalar cases, which are 1-element vectors in SDAG.) It's a little gross that we have to create the copy and then select it right after, but I think we'd need to refactor the existing copy selection code quite a bit to do better. This was falling back while building llvm-project with GISel for AArch64. Differential Revision: https://reviews.llvm.org/D108590	2021-08-23 17:15:53 -07:00
Haowei Wu	253cb50c60	[lit] Add the option to output test results in resultdb json format This change adds the option --resultdb-output=path to allow llvm-lit to generate LuCI ResultDB JSON output for the test results, which can be better integrated with certain CI/CQ frameworks. Differential Revision: https://reviews.llvm.org/D108238	2021-08-23 17:00:50 -07:00
MaheshRavishankar	b546f4347b	[mlir][Linalg] Allow controlling fusion of linalg.generic -> linalg.tensor_expand_shape. Differential Revision: https://reviews.llvm.org/D108565	2021-08-23 16:28:10 -07:00
Reid Kleckner	e42ce422a9	[dllexport] Instantiate default ctor default args Fixes https://bugs.llvm.org/show_bug.cgi?id=51414. Differential Revision: https://reviews.llvm.org/D108021	2021-08-23 15:56:29 -07:00
Greg Clayton	a58c2e4af0	Fix DWARFDie::getDeclFile(...) to work with DW_AT_specification. DWARFDie::getDeclFile(...) previously only supported getting the DW_AT_decl_file if the DIE itself contained the DW_AT_decl_file attribute, or if the DIE had a DW_AT_abstract_origin that pointed to another DIE that had a DW_AT_decl_file. This patch allows the function to get the right attribute value if there is a DW_AT_specification that points to another DIE. We also test that if a DW_AT_abstract_origin or DW_AT_specification points to a DIE in another CU with a DW_FORM_ref_addr, that the right line table is used to extract the file index. Full tests were added for the following cases: - DIE has a DW_AT_decl_file attribute - DIE has a DW_AT_abstract_origin that points to another die in the same CU - DIE has a DW_AT_abstract_origin that points to another die in another CU - DIE has a DW_AT_specification that points to another die in the same CU - DIE has a DW_AT_specification that points to another die in another CU Differential Revision: https://reviews.llvm.org/D108480	2021-08-23 15:43:18 -07:00
Jon Chesterfield	d26000e4cc	[openmp][devicertl] Freestanding nvptx via stub printf Compiled nvptx devicertl as freestanding, breaking the dependency on host glibc and gcc-multilibs. Thus build it by default. Comes at the cost of #defining out printf. Tried mapping it onto __builtin_printf but that gets transformed back to printf instead of hitting the cuda/openmp lowering transform. Printf could be preserved by one of: - dropping all the standard headers and ffreestanding - providing a header only printf implementation - changing the compiler handling of printf Reviewed By: grokos Differential Revision: https://reviews.llvm.org/D108349	2021-08-23 23:07:47 +01:00
Mircea Trofin	1055c5e1d3	[MLGO] Make sure inliner logs when deleting callees When using final reward (which is now the default), we were skipping logging decisions that were leading to callee deletion. This fixes that. Differential Revision: https://reviews.llvm.org/D108587	2021-08-23 14:54:46 -07:00
Azharuddin Mohammed	d898693f72	[ExecutionEngine] Use the libunwind __register_frame on Darwin This was already the case, but the recent change (`957334382c`) altered the behavior on some of our bots where __unw_add_dynamic_fde is not found. This restores the prior behavior on Darwin while also retaining the new behavior from that change.	2021-08-23 14:51:14 -07:00
Aart Bik	236a90802d	[mlir][sparse] replace support lib conversion with actual MLIR codegen Rationale: Passing in a pointer to the memref data in order to implement the dense to sparse conversion was a bit too low-level. This revision improves upon that approach with a cleaner solution of generating a loop nest in MLIR code itself that prepares the COO object before passing it to our "swiss army knife" setup. This is much more intuitive and now also allows for dynamic shapes. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108491	2021-08-23 14:26:05 -07:00
Michael Jones	eff11176c5	[libc] Enable string to integer conversion functions in the default build Adds atoi, atol, atoll, strtol, strtoll, strtoul, and strtoull to the list of entrypoints for Windows and aarch64 linux, as well as moving them out of the LLVM_LIBC_FULL_BUILD condition for x86_64 linux. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D108477	2021-08-23 21:18:14 +00:00
Sanjay Patel	cc9c545fb4	[InstCombine] generalize subtract with 'not' operands; 2nd try This is a re-try of `3aa009cc87` which was reverted at `9577fac0fd` because it caused an infinite loop. For the extra test case, either re-ordering the transforms or adding the extra clause to avoid sub-of-sub is enough to prevent the infinite compile, but I'm doing both to be safer. Original commit message: The motivation was to get min/max intrinsics to parity with cmp+select idioms, but this unlocks a few more folds because isFreeToInvert recognizes add/sub with constants too. In the min/max example, we have too many extra uses for smaller folds to improve things, but this fold is able to eliminate uses even though we can't reduce the number of instructions.	2021-08-23 17:06:51 -04:00
Fangrui Song	ba6e15d8cc	[TargetMachine] Move COFF special case for ExternalSymbolSDNode from shouldAssumeDSOLocal to X86Subtarget Intended to be NFC. ARM/AArch64 don't appear to need adjustment. TargetMachine::shouldAssumeDSOLocal is expected to be very simple, ideally matching isDSOLocal(). The IR producers are expected to set dso_local correctly. (While some may think this function can make producers' work easier, the function is really not in a good position to set dso_local. See the various special cases we duplicate from clang CodeGenModule.cpp.) Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D108514	2021-08-23 13:54:40 -07:00
Artem Belevich	4c40c03b39	Fixed doc build.	2021-08-23 13:45:36 -07:00
River Riddle	4e103a12d9	[mlir] Add support for VariadicOfVariadic operands This revision adds native ODS support for VariadicOfVariadic operand groups. An example of this is the SwitchOp, which has a variadic number of nested operand ranges for each of the case statements, where the number of case statements is variadic. Builtin ODS support allows for generating proper accessors for the nested operand ranges, builder support, and declarative format support. VariadicOfVariadic operands are supported by providing a segment attribute to use to store the operand groups, mapping similarly to the AttrSizedOperand trait (but with a user defined attribute name). `build` methods for VariadicOfVariadic operand expect inputs of the form `ArrayRef<ValueRange>`. Accessors for the variadic ranges return a new `OperandRangeRange` type, which represents a contiguous range of `OperandRange`. In the declarative assembly format, VariadicOfVariadic operands and types are by default formatted as a comma delimited list of value lists: `(<value>, <value>), (), (<value>)`. Differential Revision: https://reviews.llvm.org/D107774	2021-08-23 20:32:31 +00:00
Artem Belevich	ce4545db1d	[CUDA] Bump the latest supported CUDA version to 11.4. This should reduce the amount of noise issued by clang for the recent-ish CUDA versions. Clang still does not support all the features offered by NVCC, but is expected to handle CUDA headers and produce binaries for all GPUs supported by NVCC. Differential Revision: https://reviews.llvm.org/D108248	2021-08-23 13:24:49 -07:00
Artem Belevich	3db8e486e5	[CUDA] Improve CUDA version detection and diagnostics. Always use cuda.h to detect CUDA version. It's a more universal approach compared to version.txt which is no longer present in recent CUDA versions. Split the 'unknown CUDA version' warning in two: * when detected CUDA version is partially supported by clang. It's expected to work in general, at feature parity with the latest supported CUDA version, and may be missing support for the new features/instructions/GPU variants. Clang will issue a warning. * when detected version is new. Recent CUDA versions have been working with clang reasonably well, and will likely work similarly to the partially supported ones above. Or it may not work at all. Clang will issue a warning and proceed as if the latest known CUDA version was detected. Differential Revision: https://reviews.llvm.org/D108247	2021-08-23 13:24:48 -07:00
Artem Belevich	49d982d8cb	[CUDA] Add support for CUDA-11.4 Differential Revision: https://reviews.llvm.org/D108239	2021-08-23 13:24:46 -07:00
Artem Belevich	0060fffc82	[CUDA] Bump default GPU architecture to sm_35. It's the oldest GPU architecture currently supported by all CUDA versions clang can use. Differential Revision: https://reviews.llvm.org/D108235	2021-08-23 13:24:45 -07:00
Simon Pilgrim	10c982e0b3	Revert rG1c9bec727ab5c53fa060560dc8d346a911142170 : [InstCombine] Fold (gep (oneuse(gep Ptr, Idx0)), Idx1) -> (gep Ptr, (add Idx0, Idx1)) (PR51069) Reverted (manually due to merge conflicts) while regressions reported on PR51540 are investigated As noticed on D106352, after we've folded "(select C, (gep Ptr, Idx), Ptr) -> (gep Ptr, (select C, Idx, 0))" if the inner Ptr was also a (now one use) gep we could then merge the geps, using the sum of the indices instead. I've limited this to basic 2-op geps - a more general case further down InstCombinerImpl.visitGetElementPtrInst doesn't have the one-use limitation but only creates the add if it can be created via SimplifyAddInst. https://alive2.llvm.org/ce/z/f8pLfD (Thanks Roman!) Differential Revision: https://reviews.llvm.org/D106450	2021-08-23 21:09:26 +01:00
David Green	50f4ae58eb	[AArch64] Correct store ReadAdrBase operand It appears that the Read operand for stores was being placed on the first operand (the stored value) not the address base. This adds a ReadST for the stored value operand, allowing the ReadAdrBase to correctly act upon the address. Differential Revision: https://reviews.llvm.org/D108287	2021-08-23 21:07:55 +01:00
David Green	955c9437fd	[AArch64] Add Scheduling tests for Load/Store ReadAdv operands.	2021-08-23 21:07:55 +01:00
MaheshRavishankar	4aeeb91a92	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-23 13:06:34 -07:00
Nikita Popov	19dc02e99f	[MergeICmps] Allow sinking past non-load/store This is a followup to D106591. MergeICmps currently only allows sinking the loads past either instructions that don't write to memory at all, or simple loads/stores that don't modify the memory the loads access. The "simple loads/stores" part of this check doesn't seem necessary to me -- AA isModRef() already accurately models any operation that may clobber the memory. For example, in the adjusted test case the transform is still fine if the call to @foo() isn't readonly, but inaccessiblememonly -- in both cases, the call cannot modify the loaded memory. Differential Revision: https://reviews.llvm.org/D108517	2021-08-23 22:03:49 +02:00
River Riddle	da12d88b1c	[mlir][NFC] Add inlineRegion overloads that take a block iterator insert position This allows for inlining into an empty block or to the beginning of a block. NFC as the existing implementations now forward to this overload. Differential Revision: https://reviews.llvm.org/D108572	2021-08-23 19:49:53 +00:00
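
The bitwidth condition described in commit bed587631f (`ashr` under `trunc`) can be checked by brute force for one concrete case. The following is only an illustrative sketch, not code from the patch: it assumes a 16-bit to 8-bit truncation, the helper names `fitsInI8` and `countMismatches` are hypothetical, and it relies on `>>` being an arithmetic shift for negative signed values (guaranteed since C++20, and the behaviour of mainstream compilers before that).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: true when every bit that an i16 -> i8 truncation
// drops is a copy of the sign bit and one sign bit remains untruncated,
// i.e. the value is representable in 8 bits.
bool fitsInI8(int16_t x) { return x >= INT8_MIN && x <= INT8_MAX; }

// Counts the inputs for which trunc(ashr(x, c)) differs from
// ashr(trunc(x), c), restricted to the case the commit message describes:
// shift amounts below the target bitwidth (8) and values that satisfy
// the sign-bit condition above.
int countMismatches() {
  int mismatches = 0;
  for (int v = INT16_MIN; v <= INT16_MAX; ++v) {
    int16_t x = static_cast<int16_t>(v);
    if (!fitsInI8(x))
      continue;
    for (int c = 0; c < 8; ++c) {
      // Shift in the wide type, then truncate.
      int8_t wide = static_cast<int8_t>(x >> c);
      // Truncate first, then shift in the narrow type.
      int8_t narrow = static_cast<int8_t>(static_cast<int8_t>(x) >> c);
      if (wide != narrow)
        ++mismatches;
    }
  }
  return mismatches;
}
```

Calling `countMismatches()` from a small driver and asserting that it returns zero confirms that, for this bitwidth pair, both orders of truncation and shifting agree whenever the stated condition holds. The Alive2 link in the commit message is the authoritative proof for the general case.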

397260 Commits
