llvm-project

Commit Graph

Author	SHA1	Message	Date
Eli Friedman	664a1fd9f0	[AArch64] Use the CMP_SWAP_128 variants added in `843c6140`. Accidentally forgot to flip the opcode... and I didn't notice because it was working fine for the GlobalISel.	2021-07-20 13:23:27 -07:00
Alex Lorenz	05a6d74c48	[clang] NFC, move DarwinSDKInfo to lib/Basic This is a preparation commit for https://reviews.llvm.org/D105958	2021-07-20 13:22:48 -07:00
Fangrui Song	db5e078690	[LTO] Add SelectionKind to IRSymtab and use it in ld.lld/LLVMgold In PGO, a C++ external linkage function `foo` has a private counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. A `__attribute__((weak))` function `foo` has a weak hidden counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. In `ld.lld a.o b.o`, say a.o defines an external linkage `foo` and b.o defines a weak `foo`. Currently we treat `comdat nodeduplicate` as `comdat any`, ld.lld will incorrectly consider `b.o:__profc_foo` non-prevailing. In the worst case when `b.o:__profd_foo` is retained and `b.o:__profc_foo` isn't, there will be dangling reference causing an `undefined hidden symbol` error. Add SelectionKind to `Comdat` in IRSymtab and let linkers ignore nodeduplicate comdat. Differential Revision: https://reviews.llvm.org/D106228	2021-07-20 13:22:00 -07:00
Melanie Blower	ce8024e8ff	[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly The Intel compiler ICC supports the option "-fp-model=(source\|double\|extended)" which causes the compiler to use a wider type for intermediate floating point calculations. Also supported is a way to embed this effect in the source program with #pragma float_control(source\|double\|extended). This patch extends pragma float_control syntax, and also adds support for a new floating point option "-ffp-eval-method=(source\|double\|extended)". source: intermediate results use source precision double: intermediate results use double precision extended: intermediate results use extended precision Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D93769	2021-07-20 16:02:09 -04:00
Rob Suderman	6bf0f6a4f7	[mlir][tosa] Add quantized lowering for matmul and fully_connected Added the named op variants for quantized matmul and quantized batch matmul with the necessary lowerings/tests from tosa's matmul/fully connected ops. Current version does not use the contraction op interface as its verifiers are not compatible with scalar operations. Differential Revision: https://reviews.llvm.org/D105063	2021-07-20 12:58:02 -07:00
Alex Lorenz	a8262a383b	[clang][darwin] add support for Mac Catalyst availability This commit adds support for Mac Catalyst availability attribute, as supported by the Apple clang compiler. A follow-up commit will provide additional support for inferring Mac Catalyst availability from macOS availability using the mapping in the SDKSettings.json. Differential Revision: https://reviews.llvm.org/D105052	2021-07-20 12:51:57 -07:00
Fangrui Song	0c0549fbb3	[AArch64] Delete unused Opcode after D106039	2021-07-20 12:51:44 -07:00
Fangrui Song	3924877932	[IR] Rename `comdat noduplicates` to `comdat nodeduplicate` In the textual format, `noduplicates` means no COMDAT/section group deduplication is performed. Therefore, if both sets of sections are retained, and they happen to define strong external symbols with the same names, there will be a duplicate definition linker error. In PE/COFF, the selection kind lowers to `IMAGE_COMDAT_SELECT_NODUPLICATES`. The name describes the corollary instead of the immediate semantics. The name can cause confusion to other binary formats (ELF, wasm) which have implemented/ want to implement the "no deduplication" selection kind. Rename it to be clearer. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D106319	2021-07-20 12:47:10 -07:00
Shilei Tian	55c65884a4	[OpenMP][deviceRTLs] Update return type of function __kmpc_parallel_level In `deviceRTLs`, the parallel level is stored in a shared variable of type `uint8_t`. `__kmpc_parallel_level` currently returns a 16-bit interger. This patch first changes the return type of the function to `uint8_t`, same as the shared variable, and then corrects function type which was updated in D105955. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106384	2021-07-20 15:45:43 -04:00
Shilei Tian	02dff78983	[NFC][OpenMP] Fix an issue that no CHECK in test cases This fixes the complaint from FileCheck. Reviewed By: abhinavgaba, jdoerfert Differential Revision: https://reviews.llvm.org/D106387	2021-07-20 15:39:18 -04:00
Eli Friedman	843c614058	[AArch64] Fix i128 cmpxchg using ldxp/stxp. Basically two parts to this fix: 1. Stop using AtomicExpand to expand cmpxchg i128 2. Fix AArch64ExpandPseudoInsts to use a correct expansion. From ARM architecture reference: To atomically load two 64-bit quantities, perform a Load-Exclusive pair/Store-Exclusive pair sequence of reading and writing the same value for which the Store-Exclusive pair succeeds, and use the read values from the Load-Exclusive pair. Fixes https://bugs.llvm.org/show_bug.cgi?id=51102 Differential Revision: https://reviews.llvm.org/D106039	2021-07-20 12:38:12 -07:00
Sam Clegg	d51f74acdf	[lld][WebAssembly] Error on import of TLS symbols in shared libraries In https://reviews.llvm.org/D102044 we made exporting a TLS symbol into an error, but we also want to error on import. See https://github.com/emscripten-core/emscripten/issues/14461 Differential Revision: https://reviews.llvm.org/D106385	2021-07-20 12:36:03 -07:00
Nikita Popov	a465f07cf9	[AttrBuilder] Assert correct attribute kind Make sure that addAttribute() is only used with simple enum attributes. Integer and type attributes need to provide an additional value/type.	2021-07-20 21:16:23 +02:00
Albion Fung	2a7711f33a	[PowerPC] Extra test case for LDARX An extra test case added for the builtin __LDARX. Differential revision: https://reviews.llvm.org/D105926	2021-07-20 14:15:15 -05:00
Sam Clegg	f428693de0	Reland "[lld][WebAssembly] Cleanup duplicate fields in Symbols.h. NFC" This avoids duplication and simplifies the code in several places without increasing the size of the symbol union (at least not above the assert'd limit of 120 bytes). Originally commit: `9b965b37c7` Reverted in: `16aac493e5`. Differential Revision: https://reviews.llvm.org/D106026	2021-07-20 12:13:08 -07:00
Nikita Popov	6312a75dba	[BitcodeReader] Handle type attributes more explicitly (NFCI) For attributes in legacy bitcode that are now typed, explicitly create a type attribute with nullptr type, the same as we do for the attribute group representation. This is so we can assert use of the correct constructor in the future.	2021-07-20 21:08:06 +02:00
Nikita Popov	a7f183afe7	[Orc] Fix sret/byval attributes in test (NFC) This was placing sret/byval attributes without type argument on non-pointer arguments. Make this valid IR by using pointer arguments and passing the corresponding attribute type argument.	2021-07-20 20:47:15 +02:00
peter klausler	4e92962127	[flang] Fix legitimate warning from latest GCC A rank-0 static descriptor needs to be a vector; it's for "v-list" values in defined derived type formatted I/O. (Pushed without pre-review due to high confidence and an unwell buildbot.)	2021-07-20 11:40:34 -07:00
Graham Yiu	a4ac34bfb0	[NFC] Update code owners file - Replace Pete with Mark as owner of ARC backend - Re-order Philip to be sorted by first name	2021-07-20 11:29:10 -07:00
Nikita Popov	0c794abff1	[ThinTLOBitcodeWriter] Fix unused variable warning (NFC)	2021-07-20 20:19:47 +02:00
Nikita Popov	1f8d3fd42b	[Verifier] Check byval/etc type when comparing ABI attributes For musttail calls, ABI attributes between the function and the musttail call must match. The current check discards the type of type attributes like byval, which means that it will consider byval(i32) and byval(i64) (or similar) as compatible. I assume this is a leftover from before these attributes had a type argument. Ran into this while trying to tighten an assertion in AttrBuilder. Differential Revision: https://reviews.llvm.org/D105841	2021-07-20 20:19:47 +02:00
Alex Lorenz	c68f247275	[clang-scan-deps] ignore top-level module dependencies that aren't actually imported Whenever -fmodule-name=top_level_module name is parsed, and clang actually tries to import top_level_module, the headers are imported textually and the module isn't actually built. However, the dependency scanner could still record it as a potential dependency if the module was reimported and thus recorded by the preprocessor callbacks. This change avoids collecting this kind of module as a dependency by verifying that we don't collect top level modules without actual PCM files. Differential Revision: https://reviews.llvm.org/D106100	2021-07-20 11:11:28 -07:00
Victor Huang	1a762f93f8	[PowerPC] Add PowerPC cmpb builtin and emit target indepedent code for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch add the builtin and emit target independent code for __cmpb. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D105194	2021-07-20 13:06:22 -05:00
Jacob Hegna	1f3e90e128	Fix Threshold overwrite bug in the Oz inlining model features. Differential Revision: https://reviews.llvm.org/D106336	2021-07-20 18:05:06 +00:00
Zequan Wu	8773822c57	[Utils] Add -compilation-dir flag to prepare-code-coverage-artifact.py Differential Revision: https://reviews.llvm.org/D106314	2021-07-20 10:55:49 -07:00
Nikita Popov	ea014c5bbf	[Inline] Fix noalias addition on simplified instructions (PR50589) When adding noalias/alias.scope metadata, we analyze the instructions of the original callee, and then place metadata on the corresponding inlined instructions in the caller as provided by VMap. However, this assumes that this actually a clone of the instruction, rather than the result of simplification. If simplification occurred, the instruction that VMap points to may not have any relationship as far as ModRef behavior is concerned. Fix this by tracking simplified instructions during cloning and then only processing instructions that have not been simplified. This is done with an additional map form original to cloned instruction, into which we only insert if no simplification is performed. The mapping in VMap can then be compared to this map. If they're the same, the instruction hasn't been simplified. (I originally wanted to only track a set of simplified instructions, but that wouldn't work if the instruction only gets simplified afterwards, e.g. based on rewritten phis.) Fixes https://bugs.llvm.org/show_bug.cgi?id=50589. Differential Revision: https://reviews.llvm.org/D106242	2021-07-20 19:52:41 +02:00
Fangrui Song	e8bc871ca2	[PowerPC][test] Don't write to srcdir	2021-07-20 10:50:11 -07:00
Mark de Wever	a08554bcdd	[libc++][doc] Fixes a broken link.	2021-07-20 19:49:45 +02:00
Jacques Pienaar	4b897de5fa	[mlir][ods] Add nested OpTrait Allows for grouping OpTraits with list of OpTrait to make it easier to group OpTraits together without needing to use list concats (e.g., enable using `[Traits, ..., UsefulGroupOfTraits, Others, ...]` instead of `[Traits, ...] # UsefulGroupOfTraits # [Others, ...]`). Flatten in construction of Operation. This recurses here as the expectation is that these aren't expected to be deeply nested (most likely only 1 level of nesting). Differential Revision: https://reviews.llvm.org/D106223	2021-07-20 10:44:48 -07:00
Sami Tolvanen	700d07f8ce	ThinLTO: Fix inline assembly references to static functions with CFI Create an internal alias with the original name for static functions that are renamed in promoteInternals to avoid breaking inline assembly references to them. This version uses module inline assembly to avoid issues with LowerTypeTestsModule. Relands commmit `8e3b5cb39e` with arch specific tests fixed. Link: https://github.com/ClangBuiltLinux/linux/issues/1354 Reviewed By: nickdesaulniers, pcc Differential Revision: https://reviews.llvm.org/D104058	2021-07-20 10:30:02 -07:00
Arthur Eubanks	6144fc2da1	[NewPM] Print pre-transformation IR name in --print-after-all Sometimes a transformation can change the name of some IR (e.g. an SCC with functions added/removed). This can be confusing when debug logging doesn't match the post-transformation name. The specific example I came across was that --print-after-all said the inliner was working on an SCC that only contained one function, but calls in multiple functions were getting inlined. After all inlining, the current SCC only contained one function. Piggyback off of the existing logic to handle invalidated IR + --print-module-scope. Simply always store the IR description and use that. Reviewed By: jamieschmeiser Differential Revision: https://reviews.llvm.org/D106290	2021-07-20 10:20:10 -07:00
Nancy Wang	7704fedfff	[SystemZ][z/OS][libcxx]: fix libcxx test cases related to codecvt class UTF8 This PR to fix a few test cases related to class https://en.cppreference.com/w/cpp/locale/codecvt , as mentioned in document, class is converting UTF16 and UTF8 or UTF32 and UTF8, character type is deprecated in c++20 and it needs explicitly specify it is UTF8 string literal. Current test cases assume 1 byte character is ASCII or Unicode character which is not true on z/OS platform. UTF8/16/32 information can be found in https://naveenr.net/unicode-character-set-and-utf-8-utf-16-utf-32-encoding/ and EBCDIC and ASCII character value can be found in http://www.simotime.com/asc2ebc1.htm Differential Revision: https://reviews.llvm.org/D106153	2021-07-20 13:02:59 -04:00
Nancy Wang	f3cb8d6e25	[SystemZ][z/OS][libcxx]: fix libcxx test cases related to codecvt class UTF16/32 This PR is to fix a few UTF16 and UTF32 related test cases that are testing member functions for https://en.cppreference.com/w/cpp/locale/codecvt class , functions are converting from UTF16, UTF32 to UTF8 or vise visa. Test cases need to explicitly specify it is UNICODE character for UTF16/32 type in order to be valid tests to match type in documentation. it assumes it will be ASCII or UTF8 type for 1 byte character ( value range from 1 to 127 ), which is not true on z/OS in EBCDIC mode. For information related to UTF16/32 , please see https://naveenr.net/unicode-character-set-and-utf-8-utf-16-utf-32-encoding/ , and EBCDIC/ASCII character value can be found in http://www.simotime.com/asc2ebc1.htm Differential Revision: https://reviews.llvm.org/D106151	2021-07-20 12:57:32 -04:00
Giorgis Georgakoudis	e8439ec893	[OpenMP] Set RequiresFullRuntime false in SPMDization SPMDization in D102307 does not change the RequiresFullRuntime argument of kmpc_target_init/deinit calls. However, the constraints of SPMDization detection for converting a target region to SPMD mode should guarantee that the region does not require full runtime support. Hence, this patch sets RequiresFullRuntime to false for improved execution performance. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105556	2021-07-20 09:54:51 -07:00
Joseph Huber	b917a1d713	[OpenMP] Change AMDGCN to AMDGPU in the Cmake Module Summary: Change the name for targeting AMD offloading.	2021-07-20 12:52:53 -04:00
Rahul Joshi	0cc2346cbf	[MLIR][NFC] Minor cleanup for BufferDeallocation pass. - Change walkReturnOperations() to be a non-template and look at block terminator for ReturnLike trait. - Clarify description of validateSupportedControlFlow - Eliminate unused argument in Backedges::recurse. - Eliminate repeated calls to getFunction() - Fix wording for non-SCF loop failure Differential Revision: https://reviews.llvm.org/D106373	2021-07-20 09:43:22 -07:00
Joseph Huber	6242f9b966	[OpenMP][Documentation] Fix hyperlink location Fixes the documentation hyperlinks not showing the header. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106374	2021-07-20 12:42:32 -04:00
Fangrui Song	2f75fda5dc	[test] Avoid llvm-symbolizer/llvm-addr2line one-dash long options	2021-07-20 09:34:35 -07:00
Shimin Cui	0b043bb39b	This patch extends the OptimizeGlobalAddressOfMalloc to handle the null check of global pointer variables. It is disabled with https://reviews.llvm.org/rGb7cd291c1542aee12c9e9fde6c411314a163a8ea . This PR is to reenable it while fixing the original problem reported. The fix is to set the store value correctly when creating store for the new created global init bool symbol. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D102711	2021-07-20 12:27:26 -04:00
Craig Topper	81efb82570	[RISCV] Teach RISCVMatInt about cases where it can use LUI+SLLI to replace LUI+ADDI+SLLI for large constants. If we need to shift left anyway we might be able to take advantage of LUI implicitly shifting its immediate left by 12 to cover part of the shift. This allows us to use more bits of the LUI immediate to avoid an ADDI. isDesirableToCommuteWithShift now considers compressed instruction opportunities when deciding if commuting should be allowed. I believe this is the same or similar to one of the optimizations from D79492. Reviewed By: luismarques, arcbbb Differential Revision: https://reviews.llvm.org/D105417	2021-07-20 09:22:06 -07:00
Craig Topper	2ad2c5d457	[RISCV] Add -mattr=+c command lines to add-before-shl.ll to prepare for D105417. NFC	2021-07-20 09:22:06 -07:00
Fangrui Song	5b899c22f3	[Driver] Detect libstdc++ include paths for native gcc on 32-bit non-Debian Linux Fixes https://bugs.llvm.org/show_bug.cgi?id=50303 Differential Revision: https://reviews.llvm.org/D106119	2021-07-20 09:18:24 -07:00
Quinn Pham	59d2ba2a3d	[PowerPC] Semachecking for XL compat builtin icbt This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds semachecking for an already implemented builtin, `__icbt`. `__icbt` is only valid for Power8 and up. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D105834	2021-07-20 11:05:22 -05:00
Craig Topper	98d4adc2d1	[RISCV] Add custom isel to select (and (srl X, C1), C2) and (and (shl X, C1), C2) Replace some existing isel patterns that are covered by the new code. SLLIUWPat has been removed in favor of folding its root case into the new code. The other uses in isel patterns for shXadd.uw have been switched to using hardcoded AND masks. This is based on the original version of D49585 from ARM. The final version of that was made a DAG combine, but I've chosen to keep it as custom isel. I'm not convinced DAG combine is as good with shift pairs as it is with and+shift. I saw some issues optimizing the shifts created by vscale lowering if an and isn't created for from a shift pair. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D106230	2021-07-20 08:53:55 -07:00
Stefan Pintilie	1a6dc92be7	[PowerPC] Inefficient register allocation of ACC registers results in many copies. ACC registers are a combination of four consecutive vector registers. If the vector registers are assigned first this often forces a number of copies to appear just before the ACC register is created. If the ACC register is assigned first then fewer copies are generated when the vector registers are assigned. This patch tries to force the register allocator to assign the ACC registers first and then the UACC registers and then the vector pair registers. It does this by changing the priority of the register classes. This patch also adds hints to help the register allocator assign UACC registers from known ACC registers and vector pair registers from known UACC registers. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105854	2021-07-20 10:53:40 -05:00
Sterling Augustine	bbbc4f110e	Avoid keeping internal string_views in Twine. This is a follow-up to https://reviews.llvm.org/D103935 A Twine's internal layout should not depend on which version of the C++ standard is in use. Dynamically linking binaries compiled with two different layouts (eg, --std=c++14 vs --std=c++17) ends up problematic. This change avoids that issue by immediately converting a string_view to a pointer-and-length at the cost of an extra eight-bytes in Twine. Differential Revision: https://reviews.llvm.org/D106186	2021-07-20 08:46:53 -07:00
Craig Topper	84877a098a	[RISCV] Use unordered indexed loads for MGATHER. I don't think the semantics of the llvm masked gather intrinsic care about the order the elements are loaded. For example, type legalization by splitting will chain them in parallel. This is different than scatter which we do chain in order. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D106025	2021-07-20 08:46:02 -07:00
David Green	4272e64acd	[LV] Change interface of getReductionPatternCost to return Optional Currently the Instruction cost of getReductionPatternCost returns an Invalid cost to specify "did not find the pattern". This changes that to return an Optional with None specifying not found, allowing Invalid to mean an infinite cost as is used elsewhere. Differential Revision: https://reviews.llvm.org/D106140	2021-07-20 16:44:50 +01:00
Geoffrey Martin-Noble	57de4ac121	[Bazel] Update for `bc1a2979fc` Update Bazel build configuration for https://github.com/llvm/llvm-project/commit/bc1a2979fc70 by adding missing dep to clang tooling unit tests.	2021-07-20 08:36:03 -07:00
Joel E. Denny	5b0a948a81	[UpdateCCTestChecks] Implement --global-hex-value-regex For example, in OpenMP offload codegen tests, global variables like `.offload_maptypes*` are much easier to read in hex. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D104743	2021-07-20 11:23:20 -04:00

1 2 3 4 5 ...

394219 Commits All Branches Search

394219 Commits

All Branches