SemaBuiltinConstantArg has an early exit for that case that doesn't
produce an error and doesn't update the APInt. We need to detect that
case and not use the APInt value.
While there, delete the overload of CheckX86BuiltinTileArgumentsRange
that takes a single argument index to check. There's another version
that takes an ArrayRef, and a single value is convertible to an ArrayRef.
It's annoying to have to maintain multiple, nearly identical chains of if
statements which all set the same attributes.
Add a helper function, `addFlagsUsingAttrFn`, which performs the attribute
setting.
Then, use wrappers for that function in `lowerCall` and `setArgFlags`.
(Note that the flag-setting code in `setArgFlags` was missing the `returned`
attribute. There's no selection for this yet, so no test. It's an example of
the kind of thing this lets us avoid, though.)
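For illustration, here is a minimal sketch of the shape such a helper and one
of its wrappers can take. The specific flags and the wrapper shown here are
assumptions for the example, not the patch's exact code:
```
#include "llvm/CodeGen/TargetCallingConv.h"
#include "llvm/IR/Function.h"
#include <functional>

using namespace llvm;

// Sketch only: forward each attribute query through a callable, so every
// caller shares a single chain of checks instead of duplicating it.
static void addFlagsUsingAttrFn(
    ISD::ArgFlagsTy &Flags,
    const std::function<bool(Attribute::AttrKind)> &AttrFn) {
  if (AttrFn(Attribute::SExt))
    Flags.setSExt();
  if (AttrFn(Attribute::ZExt))
    Flags.setZExt();
  if (AttrFn(Attribute::InReg))
    Flags.setInReg();
  if (AttrFn(Attribute::ByVal))
    Flags.setByVal();
}

// A wrapper then only has to say where the attributes come from, e.g. a
// function's formal parameter.
static void setArgFlagsSketch(ISD::ArgFlagsTy &Flags, const Function &F,
                              unsigned OpIdx) {
  addFlagsUsingAttrFn(Flags, [&](Attribute::AttrKind Attr) {
    return F.getArg(OpIdx)->hasAttribute(Attr);
  });
}
```
The point is that the wrappers in `lowerCall` and `setArgFlags` only differ in
where the attributes come from, so the flag-setting chain lives in one place.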
Differential Revision: https://reviews.llvm.org/D86159
Similar to this commit:
faf8065a99
The testcase is pretty much the same as
test/CodeGen/AArch64/tailcall-explicit-sret.ll,
except it uses i64 (since we don't handle i1024 return values yet), and
doesn't have indirect tail call testcases (because we can't translate those
yet).
Differential Revision: https://reviews.llvm.org/D86148
Parsing DWARFv5 debug_loclist offsets when a CU is parsed is weighing
down memory usage of symbolizers that don't need to parse this data at
all. There's not much benefit to caching these anyway, since they are
O(1) to look up and read once you know where the offset list starts (and
the offset list size allows bounds checking too).
In general, I think it might be time to start paying down some of the
technical debt of loc/loclist/range/rnglist parsing to try to unify it a
bit more.
eg:
* Currently DWARFUnit has: RangeSection, RangeSectionBase, LocSection,
LocSectionBase, LocTable, RngListTable, LoclistTableHeader (it'd be nice if
these were all wrapped up in two variables - one for loclists, one for
rnglists)
* rnglists and loclists are handled differently (see:
LoclistTableHeader, but no RnglistTableHeader)
* maybe all these types could be less stateful - lazily parse what they
need to, even reparsing rather than caching because it doesn't seem
too expensive, for instance. (Though admittedly, so long as it's a
constant cost/overhead per compilation, that's probably adequate.)
* Maybe implementing and using a DWARFDataExtractor that can be
sub-ranged (so we could slice it up to just the single contribution) -
though maybe that's not so useful because loc/ranges need to refer to
it by absolute, not contribution-relative mechanisms
Differential Revision: https://reviews.llvm.org/D86110
When a procedure name was used on the RHS of an assignment, we were not
reporting the error. When one was used in an expression, the error
message wasn't very good (e.g. "Operands of + must be numeric; have
INTEGER(4) and untyped").
Detect these cases in ArgumentAnalyzer and emit better messages,
depending on whether the named procedure is a function or subroutine.
Procedure names may appear as actual arguments to function and
subroutine calls, so don't report errors in those cases. That is the same
case in which assumed-type arguments are allowed, so rename `isAssumedType_`
to `isProcedureCall_` and use that to decide if it is an error.
Differential Revision: https://reviews.llvm.org/D86107
This is restricted to single-use loads; if we fold them to sextloads, we can
find more optimal addressing modes on AArch64.
This also fixes an overload of the MachineFunction::getMachineMemOperand() method
which was incorrectly using the MF alignment instead of the MMO alignment.
Differential Revision: https://reviews.llvm.org/D85966
By detecting this sign extend pattern early, we can uncover opportunities for
more optimizations.
Differential Revision: https://reviews.llvm.org/D85965
There should be an equivalent std.floor op to std.ceil. This includes
matching lowerings for SPIRV, NVVM, ROCDL, and LLVM.
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D85940
Create a reduction pass that accepts an optimization pass as argument
and only replaces the golden module in the pipeline if the output of the
optimization pass is smaller than the input and still exhibits the
interesting behavior.
Add a -test-pass option to test individual passes in the MLIR Reduce
tool.
Reviewed By: jpienaar
Differential Revision: https://reviews.llvm.org/D84783
We're (temporarily) disabling ExtInt for the '__atomic' builtins so we can better design their behavior later. The idea is until we do an audit/design for the way atomic builtins are supposed to work with _ExtInt, we should leave them restricted so they don't limit our future options, such as by binding us to a sub-optimal implementation via ABI.
Example after this change:
$ cat test.c
void f(_ExtInt(64) *ptr) {
  __atomic_fetch_add(ptr, 1, 0);
}
$ clang -c test.c
test.c:2:22: error: argument to atomic builtin of type '_ExtInt' is not supported
  __atomic_fetch_add(ptr, 1, 0);
                     ^
1 error generated.
Differential Revision: https://reviews.llvm.org/D84049
VLD2/4 instructions cannot be predicated, so we cannot tail predicate
them from autovec. From intrinsics though, they should be valid as they
will just end up loading extra values into off vector lanes, not
affecting the on lanes. The same is true for loads in general: so
long as we are not using the other vector lanes, an unpredicated load
can be converted to a predicated one.
This marks VLD2 and VLD4 instructions as validForTailPredication and
allows any unpredicated load in a tail predicated loop, which seems to be
valid given the other checks we have.
Differential Revision: https://reviews.llvm.org/D86022
D85968 started to use `split-file`, and while buildbots run fine, when
doing `make check-lldb` by hand I get:
.../llvm-monorepo-clangassert/tools/lldb/test/SymbolFile/DWARF/Output/DW_AT_declaration-with-children.s.script: line 2: split-file: command not found
failed:
lldb-shell :: SymbolFile/DWARF/DW_AT_declaration-with-children.s
Differential Revision: https://reviews.llvm.org/D86144
There are some cases where the instruction that sets up the iteration
count for a tail predicated loop cannot be moved before the dlstp,
stopping tail predication entirely. This patch checks if the mov operand
can be used and if so, uses that instead.
Differential Revision: https://reviews.llvm.org/D86087
This patch adds more op/type conversion support
necessary for `spirv-runner`:
- EntryPoint/ExecutionMode: currently removed since we assume
there is only one kernel function in the kernel module.
- StorageBuffer storage class is now supported. We are not
concerned with multithreading so this is fine for now.
- Type conversion enhanced, now regular offsets and strides
for structs and arrays are supported (based on
`VulkanLayoutUtils`).
- Support for `spv.AccessChain`, which is modelled with the GEP op
in the LLVM dialect.
Reviewed By: mravishankar
Differential Revision: https://reviews.llvm.org/D86109
The CrossOver mutator is meant to cross over two given buffers (referred to as
the first/second buffer henceforth). Previously InsertPartOf/CopyPartOf calls
used in the CrossOver mutator incorrectly inserted/copied part of the second
buffer into a "scratch buffer" (MutateInPlaceHere of the size
CurrentMaxMutationLen), rather than the first buffer. This is not the intended
behavior, because the scratch buffer does not always (i) contain the content of
the first buffer, and (ii) have the same size as the first buffer;
CurrentMaxMutationLen is typically a lot larger than the size of the first
buffer. This patch fixes the issue by using the first buffer instead of the
scratch buffer in InsertPartOf/CopyPartOf calls.
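To make the intended behavior concrete, here is a small standalone sketch of
inserting part of the second buffer into the first buffer. This is not
libFuzzer's actual MutationDispatcher code; the function name and signature
are purely illustrative:
```
#include <cstdint>
#include <vector>

// Sketch: build a mutant by inserting Second[From, From+Len) into First at
// offset At. Preconditions: At <= First.size(), From + Len <= Second.size().
static std::vector<uint8_t> insertPartOf(const std::vector<uint8_t> &First,
                                         const std::vector<uint8_t> &Second,
                                         size_t From, size_t Len, size_t At) {
  std::vector<uint8_t> Out(First.begin(), First.begin() + At);
  Out.insert(Out.end(), Second.begin() + From, Second.begin() + From + Len);
  Out.insert(Out.end(), First.begin() + At, First.end());
  return Out;
}
```
The old code effectively applied this to the scratch buffer in place of the
first buffer, which is what this patch corrects.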
A FuzzBench experiment was run to make sure that this change does not
inadvertently degrade the performance. The performance is largely the same; more
details can be found at:
https://storage.googleapis.com/fuzzer-test-suite-public/fixcrossover-report/index.html
This patch also adds two new tests, namely "cross_over_insert" and
"cross_over_copy", which specifically target InsertPartOf and CopyPartOf,
respectively.
- cross_over_insert.test checks if the fuzzer can use InsertPartOf to trigger
the crash.
- cross_over_copy.test checks if the fuzzer can use CopyPartOf to trigger the
crash.
These newly added tests were designed to pass with the current patch, but not
without it (with 790878f291 these tests do not
pass). To achieve this, -max_len was intentionally given a high value. Without
this patch, InsertPartOf/CopyPartOf will generate larger inputs, possibly with
unpredictable data in it, thereby failing to trigger the crash.
The test pass condition for these new tests is narrowed down by (i) limiting
mutation depth to 1 (i.e., a single CrossOver mutation should be able to trigger
the crash) and (ii) checking whether the mutation sequence of "CrossOver-" leads
to the crash.
Also note that these newly added tests and an existing test (cross_over.test)
all use "-reduce_inputs=0" flags to prevent reducing inputs; it's easier to
force the fuzzer to keep original input string this way than tweaking
cov-instrumented basic blocks in the source code of the fuzzer executable.
Differential Revision: https://reviews.llvm.org/D85554
There is an untested but useful case: `this` (even if not written) is counted as a
source variable.
Reviewed By: dblaikie
Differential Revision: https://reviews.llvm.org/D86044
This is a non-functional change that generalizes the printIR routines so that
the output can be saved and manipulated rather than being directly output
to dbgs(). This is a prerequisite change for many upcoming changes that
allow new ways of examining changes made to the IR in the new pass manager.
Reviewed By: aeubanks (Arthur Eubanks)
Differential Revision: https://reviews.llvm.org/D85999
* GNU ld places non-SHF_ALLOC sections after SHF_ALLOC sections. This has the
advantage that the file offsets of a non-SHF_ALLOC section cannot be contained in
a PT_LOAD. This patch matches that behavior.
* For non-SHF_ALLOC non-orphan sections, GNU ld may assign non-zero sh_addr and
treat them similar to SHT_NOBITS (not advance location counter). This
is an alternative approach to what we have done in D85100.
By placing non-SHF_ALLOC sections at the end, we can drop special
cases in createSection and findOrphanPos added by D85100.
Unlike GNU ld, we set sh_addr to 0 for non-SHF_ALLOC sections. 0 is
arguably better because non-SHF_ALLOC sections don't appear in the memory
image.
ELF spec says:
> sh_addr - If the section will appear in the memory image of a process, this
> member gives the address at which the section's first byte should
> reside. Otherwise, the member contains 0.
D85100 appeared to take a detour. If we take a combined view of D85100 and this
patch, the overall complexity slightly increases (one more 3-line loop) and
compatibility with GNU ld improves.
The behavior we don't want to match is the special treatment of .symtab
.shstrtab .strtab: they can be matched in LLD but not in GNU ld.
Reviewed By: jhenderson, psmith
Differential Revision: https://reviews.llvm.org/D85867
We weren't looking through the parameters on calls at all.
E.g., say you had
```
declare i32 @zext(i32 zeroext %x)
...
%y = call i32 @zext(i32 %something)
...
```
At the point of the call, we wouldn't know that %something should have the
zeroext attribute.
This sets flags in about the same way as
TargetLoweringBase::ArgListEntry::setAttributes.
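As a rough sketch (illustrative only, not the exact code from this patch; the
helper name is made up), the idea is to query the call site's parameter
attributes when building the argument flags:
```
#include "llvm/CodeGen/TargetCallingConv.h"
#include "llvm/IR/InstrTypes.h"

using namespace llvm;

// Hypothetical helper: mirror a few of the call site's parameter attributes
// into the GlobalISel argument flags for argument ArgIdx.
static void addCallSiteArgFlags(ISD::ArgFlagsTy &Flags, const CallBase &CB,
                                unsigned ArgIdx) {
  if (CB.paramHasAttr(ArgIdx, Attribute::ZExt))
    Flags.setZExt();
  if (CB.paramHasAttr(ArgIdx, Attribute::SExt))
    Flags.setSExt();
  if (CB.paramHasAttr(ArgIdx, Attribute::InReg))
    Flags.setInReg();
}
```
Because `CallBase::paramHasAttr` also consults the callee's attributes, the
zeroext on `@zext`'s parameter in the example above is visible at the call
site even though the call instruction itself doesn't repeat it.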
Differential Revision: https://reviews.llvm.org/D86125
Summary:
This is a follow-up for D82481. For the .lcomm directive, although it's
not necessary to have .rename emitted, it's still desirable to do
so, so that we do not see the internal 'Rename..' getting printed out in the
symbol table. It also gives consistent naming between the TC entry
and .lcomm, and consistent naming between the IR and the final
object file.
Reviewed By: hubert.reinterpretcast
Differential Revision: https://reviews.llvm.org/D86075
When the operand to the linalg.tensor_reshape op is a splat constant,
the result can be replaced with a splat constant of the same value but
different type.
Differential Revision: https://reviews.llvm.org/D86117
Allow non-VLX targets to use 512-bit VPERMV/VPERMV3 for 128/256-bit shuffles.
TBH I'm not sure these targets actually exist in the wild, but we're testing for them and it's good test coverage for shuffle lowering/combines across different subvector widths.
Previously, it would successfully select and assert if not HSA or PAL
when expanding the pseudoinstruction. We don't need the
pseudoinstruction anymore since we know the total size after
legalization.
The code to determine the value size was overcomplicated and only
correct in the case where the result register already had a register
class assigned. We can always take the size directly from the
register's type.
This uses the modern `split-file` tool to merge 5 `packed-relocs-error*.s` tests into a
new `packed-relocs-errors.s` and adds testing for the GNU style.
Differential revision: https://reviews.llvm.org/D85835
We currently call `llvm_unreachable` for the following YAML:
```
--- !ELF
FileHeader:
Class: ELFCLASS32
Data: ELFDATA2LSB
Type: ET_REL
Machine: EM_NONE
Flags: [ ]
```
It happens because the `Flags` key is present even though `EM_NONE` is a
machine type that has no known `EF_*` values, and we call `llvm_unreachable` by mistake.
Differential revision: https://reviews.llvm.org/D86138
If the declaration is used in the reduction clause, it is captured by
reference by default. But if the declaration is a pointer and it is a
base for array-like reduction, this declaration can be captured by
value, since the pointee is reduced but not the original declaration.
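For illustration (this example is not from the patch itself), here is a
pointer used as the base of an array-section reduction, where only the
pointed-to elements are reduced:
```
// 'p' is a pointer that serves as the base of an array-section reduction.
// The reduction operates on the pointed-to elements, so capturing 'p' itself
// by value is sufficient.
void accumulate(int *p, int n, const int *vals, int m) {
  #pragma omp parallel for reduction(+ : p[0:n])
  for (int i = 0; i < m; ++i)
    p[i % n] += vals[i];
}
```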
Differential Revision: https://reviews.llvm.org/D85321
In this process we also create some other tests, in order not to lose
coverage when focusing on the annotated code.
Differential Revision: https://reviews.llvm.org/D85962
We add the method `SyntaxTreeTest::treeDumpEqualOnAnnotations`, which
allows us to compare the treeDump of only the annotated code. This will reduce a
lot of noise from our `BuildTreeTest` and make the tests shorter and easier to
read.
The AMDGPU ISA isn't backwards compatible, and hence -mcpu must always be specified during disassembly.
However, the AMDGPU target CPU is stored in e_flags in the ELF object.
This patch allows targets to implement CPU string detection, and also implements it for AMDGPU by looking at e_flags.
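A hedged sketch of the idea (not the patch's actual implementation; the
function name is made up and only a subset of machine values is shown),
deriving a CPU string from the AMDGPU machine field of e_flags:
```
#include "llvm/ADT/StringRef.h"
#include "llvm/BinaryFormat/ELF.h"

using namespace llvm;

// Illustrative only: map the AMDGPU machine field of e_flags to an -mcpu name.
static StringRef cpuFromAMDGPUEFlags(unsigned EFlags) {
  switch (EFlags & ELF::EF_AMDGPU_MACH) {
  case ELF::EF_AMDGPU_MACH_AMDGCN_GFX900:
    return "gfx900";
  case ELF::EF_AMDGPU_MACH_AMDGCN_GFX906:
    return "gfx906";
  default:
    return ""; // Unknown or unhandled machine; caller falls back to -mcpu.
  }
}
```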
Reviewed By: scott.linder
Differential Revision: https://reviews.llvm.org/D84519