llvm-project

Commit Graph

Author	SHA1	Message	Date
Wang, Pengfei	e9c11c1934	[X86] Zero AMX config buffer for non AVX512 cases. Zero AMX config buffer for non AVX512 cases. Differential Revision: https://reviews.llvm.org/D96927	2021-02-18 13:26:09 +08:00
Fangrui Song	da59c2e4dc	[GWP-ASan] Change sys/cdefs.h to features.h sys/cdefs.h is a glibc internal header which is not supposed to be included by applications. (Some libc implementations provide this file for compatibility.) Android features.h includes sys/cdefs.h, so we can include features.h instead. This change makes `ninja gwp_asan` build on musl.	2021-02-17 20:03:16 -08:00
Mehdi Chinoune	8cfe9c02a0	[Flang] Fix compilation on MinGW-w64 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D94707	2021-02-17 21:53:48 -06:00
Igor Kudrin	a0c9ec1f5e	[Driver] Honor "-gdwarf-N" at any position for assembler sources This fixes an issue when "-gdwarf-N" switch was ignored if it was given before another debug option. Differential Revision: https://reviews.llvm.org/D96865	2021-02-18 10:36:42 +07:00
Wang, Pengfei	9dcfb95ba2	[X86] Add AVX2/SSE2 checks for AMX config buffer zeroing. NFC	2021-02-18 11:30:12 +08:00
Craig Topper	016eca8f90	[RISCV] Guard LowerINSERT_VECTOR_ELT against fixed vectors. The type legalizer can call this code based on the scalar type so we need to verify the vector type is a scalable vector. I think due to how type legalization visits nodes, the vector type will have already been legalized so we don't have an issue with using MVT here like we did for EXTRACT_VECTOR_ELT. I've added a test just in case.	2021-02-17 19:27:08 -08:00
Fangrui Song	58ecfccd0d	[profile] Add __attribute__((used)) to zero size dummy sections D14468 added these dummy sections. This patch adds `__attribute__((used))` so that when compiled by GCC>=11 or (expected, D96838) Clang>=13 on some ELF platforms, these sections will get SHF_GNU_RETAIN to make sure they will not be discarded by ld --gc-sections. We are trying to get rid of LLD's "__start_/__stop_ references retain C identifier name sections" rule. If LLD drops the rule in the future (we will retain compatibility for `__llvm_prf_` for a while), `__llvm_prf_` will need to have the SHF_GNU_RETAIN flag, otherwise: ``` // __llvm_prf_cnts/__llvm_prf_data usually exist, but {names,vnds} may not exist. // Such diagnostics will happen with {cnts,data} as well if no input object file is instrumented. % clang++ -fprofile-generate a.cc -fuse-ld=lld -Wl,--gc-sections ld.lld: error: undefined hidden symbol: __start___llvm_prf_names >>> referenced by InstrProfilingPlatformLinux.c >>> InstrProfilingPlatformLinux.c.o:(__llvm_profile_begin_names) in archive /tmp/RelA/lib/clang/13.0.0/lib/linux/libclang_rt.profile-x86_64.a ... ``` Differential Revision: https://reviews.llvm.org/D96902	2021-02-17 19:22:25 -08:00
Joseph Huber	c3a3d20093	[LV] Add analysis remark for mixed precision conversions Floating point conversions inside vectorized loops have performance implications but are very subtle. The user could specify a floating point constant, or call a function without realizing that it will force a change in the vector width. An example of this behaviour is seen in https://godbolt.org/z/M3nT6c . The vectorizer should indicate when this happens becuase it is most likely unintended behaviour. This patch adds a simple check for this behaviour by following floating point stores in the original loop and checking if a floating point conversion operation occurs. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D95539	2021-02-17 21:37:08 -05:00
Craig Topper	00c4e0a8f6	[RISCV] Guard the ISD::EXTRACT_VECTOR_ELT handling in ReplaceNodeResults against fixed vectors and non-MVT types. The type legalizer is calling this code based on the scalar type so we need to verify the input type is a scalable vector. The vector type has also not been legalized yet when this is called so we need to use EVT for it.	2021-02-17 18:25:38 -08:00
Aart Bik	ff6c84b803	[mlir][sparse] generalize sparse storage format to many more types Rationale: Narrower types for overhead storage yield a smaller memory footprint for sparse tensors and thus needs to be supported. Also, more value types need to be supported to deal with all kinds of kernels. Since the "one-size-fits-all" sparse storage scheme implementation is used instead of actual codegen, the library needs to be able to support all combinations of desired types. With some crafty templating and overloading, the actual code for this is kept reasonably sized though. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D96819	2021-02-17 18:20:23 -08:00
Hsiangkai Wang	766ee1096f	[Clang][RISCV] Define RISC-V V builtin types Add the types for the RISC-V V extension builtins. These types will be used by the RISC-V V intrinsics which require types of the form <vscale x 1 x i64>(LMUL=1 element size=64) or <vscale x 4 x i32>(LMUL=2 element size=32), etc. The vector_size attribute does not work for us as it doesn't create a scalable vector type. We want these types to be opaque and have no operators defined for them. We want them to be sizeless. This makes them similar to the ARM SVE builtin types. But we will have quite a bit more types. This patch adds around 60. Later patches will add another 230 or so types representing tuples of these types similar to the x2/x3/x4 types in ARM SVE. But with extra complexity that these types are combined with the LMUL concept that is unique to RISCV. For more background see this RFC http://lists.llvm.org/pipermail/llvm-dev/2020-October/145850.html Authored-by: Roger Ferrer Ibanez <roger.ferrer@bsc.es> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D92715	2021-02-18 10:17:31 +08:00
Stanislav Mekhanoshin	75997e8407	[AMDGPU] Fixed msan build LoadStoreOptimizer was using uninitialized SCC value for instructions where it is unsupported.	2021-02-17 18:01:23 -08:00
Eric Schweitz	fd3297dc32	[flang][fir][NFC] clang-tidy change. Add include. Differential Revision: https://reviews.llvm.org/D96912	2021-02-17 17:52:04 -08:00
Chen Zheng	5517923b1c	[XCOFF][NFC] make csect properties optional for getXCOFFSection We are going to support debug sections for XCOFF. So the csect properties are not necessary. This patch makes these properties optional. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D95931	2021-02-17 20:51:42 -05:00
Eric Schweitz	930150781d	[flang][fir][NFC] Merge tablegen files. Differential Revision: https://reviews.llvm.org/D96908	2021-02-17 17:51:14 -08:00
Marco Vanotti	78eabcaa48	[libunwind] Add support for PC reg column in arm64 This change adds support for the dwarf PC register column in arm64, allowing CFI directives to make use of it. As of the last revision of the DWARF for ARM 64-bit architecture[0], the pc register has been added as a valir register, with number 32. This allows libunwinder to restore both pc and lr, which is useful for stack switches and signal contexts. [0]: `f52e1ad3f8/aadwarf64/aadwarf64.rst` Reviewed By: phosek, #libunwind Differential Revision: https://reviews.llvm.org/D96901	2021-02-17 17:42:19 -08:00
Joerg Sonnenberger	2628e91461	[NetBSD] Use cortex-a8 as default CPU for ARMv7 This matches the platform default for GCC. It primarily matters when the integrated assembler is not used as there is no default CPU defined for ARMv7-A and GNU as is upset with -mcpu=generic.	2021-02-18 01:53:04 +01:00
Stanislav Mekhanoshin	48d2e04152	[AMDGPU] Mark SMRD atomics We did not have atomic flags on SMRD, did not copy TSFlags to real instructions, and did not have ret/noret atomic map. At the moment it is NFC, but needed for D96469. Differential Revision: https://reviews.llvm.org/D96823	2021-02-17 16:47:02 -08:00
Teresa Johnson	d55d46f43b	[WPD] Add an optional checking mode for debugging devirtualization This adds an internal option -wholeprogramdevirt-check which if enabled will guard each devirtualization with a runtime check against the expected target, and an invocation of a debug trap if the check fails. This is useful for debugging WPD failures involving undefined behavior (e.g. casting to another class type not in the inheritance chain). Differential Revision: https://reviews.llvm.org/D95969	2021-02-17 16:46:15 -08:00
Nico Weber	2f0f67afb2	[gn build] add a comment to the goma_dir arg	2021-02-17 19:36:36 -05:00
Heejin Ahn	0b5d2b0efd	[WebAssembly] Remove dependency of reference types from EH The new spec does not have `exnref` so EH does not have dependency of the reference types proposal anymore. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D96903	2021-02-17 16:10:59 -08:00
Stanislav Mekhanoshin	a8d9d50762	[AMDGPU] gfx90a support Differential Revision: https://reviews.llvm.org/D96906	2021-02-17 16:01:32 -08:00
Rahman Lavaee	0252e6ead1	[obj2yaml,yaml2obj] Add NumBlocks to the BBAddrMapEntry yaml field. As discussed in D95511, this allows us to encode invalid BBAddrMap sections to be used in more rigorous testing. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D96831	2021-02-17 15:45:13 -08:00
Eric Schweitz	0d4534237d	[flang][fir][NFC] clang-tidy change Differential Revision: https://reviews.llvm.org/D96911	2021-02-17 15:41:20 -08:00
LLVM GN Syncbot	ebcf921e4a	[gn build] Port `7397905ab0`	2021-02-17 23:33:31 +00:00
Rong Xu	7397905ab0	[SampleFDO] Third Try: Refactor SampleProfile.cpp Apply the patch for the third time after fixing buildbot failures. Refactor SampleProfile.cpp to use the core code in CodeGen. The main changes are: (1) Move SampleProfileLoaderBaseImpl class to a header file. (2) Split SampleCoverageTracker to a head file and a cpp file. (3) Move the common codes (common options and callsiteIsHot()) to the common cpp file. (4) Add inline keyword to avoid duplicated symbols -- they will be removed later when the class is changed to a template. Differential Revision: https://reviews.llvm.org/D96455	2021-02-17 15:31:50 -08:00
Teresa Johnson	50ac3b1d78	[gold] Match lld WPD behavior for shared library symbols and add test lld already marks shared library defs as ExportDynamic, which prevents potentially unsafe devirtualization of symbols defined in shared libraries. Match that behavior in the gold plugin, and add the same test. Depends on D96721. Differential Revision: https://reviews.llvm.org/D96722	2021-02-17 15:28:49 -08:00
AndreyChurbanov	dab5d6c2eb	[OpenMP] fix race condition in test	2021-02-18 02:27:49 +03:00
Jon Chesterfield	53d7fd3762	[libomptarget][amdgcn] Remove lookup of .language msgpack field	2021-02-17 23:02:16 +00:00
Rob Suderman	55756f32f7	[MLIR][TOSA] Expand Tosa int types to I8 and I16 Tosa integers should include I8 and I16 values. Differential Revision: https://reviews.llvm.org/D96900	2021-02-17 14:18:38 -08:00
Patrick Oppenlander	26a0aeba61	[libc++abi] Add builtins to dynamic library link Otherwise libc++abi.so fails to link on arm with undefined references to some __aeabi_ builtins. Differential Revision: https://reviews.llvm.org/D96574	2021-02-17 17:05:59 -05:00
Jessica Paquette	e6064a6418	[GlobalISel] Implement computeKnownBits for G_ASSERT_SEXT Implementation is the same as G_SEXT_INREG. Differential Revision: https://reviews.llvm.org/D96899	2021-02-17 14:00:36 -08:00
Jessica Paquette	26fb036559	[GlobalISel] Implement computeNumSignBits for G_ASSERT_SEXT Same implementation as G_SEXT_INREG. Add a testcase to combine-sext-inreg for a concrete example, and a testcase to KnownBitsTest. Differential Revision: https://reviews.llvm.org/D96897	2021-02-17 13:53:17 -08:00
Fangrui Song	0c2bb6b446	[Driver] Clean up some Separate form options Drop the `Separate` form of `-fmodule-name X`, `-fprofile-remapping-file X`, and `-frewrite-map-file X`. To the best of my knowledge they are not used. Their conventional Joined forms (`-fFOO=`) should be used instead. `-fdebug-compilation-dir X` is used in several places, e.g. chromium/infra/goma. It is also advertised in http://blog.llvm.org/2019/11/deterministic-builds-with-clang-and-lld.html So we keep it but make the EQ form canonical and the Separate form an alias. Differential Revision: https://reviews.llvm.org/D96886	2021-02-17 13:49:41 -08:00
AndreyChurbanov	cf1ddae7e3	[OpenMP][NFC] replaced 'dependencies' with 'dependences' in comments and debug prints	2021-02-18 00:38:18 +03:00
peter klausler	b82a8c3f23	[flang] Warn about useless explicit typing of intrinsics Fortran 2018 explicitly permits an ignored type declaration for the result of a generic intrinsic function. See the comment added to Semantics/expression.cpp for an explanation of why this is somewhat dangerous and worthy of a warning. Differential Revision: https://reviews.llvm.org/D96879	2021-02-17 13:13:59 -08:00
Yusra Syeda	8b624a3164	[SystemZ] Separate LoZ ELF specifics in tablegen. Separate the LoZ ELF calling convention in tablegen. This will make it easier to add the z/OS ABI in future patches. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D96867	2021-02-17 16:11:58 -05:00
Jessica Paquette	60aa646441	[GlobalISel] Add G_ASSERT_SEXT This adds a G_ASSERT_SEXT opcode, similar to G_ASSERT_ZEXT. This instruction signifies that an operation was already sign extended from a smaller type. This is useful for functions with sign-extended parameters. E.g. ``` define void @foo(i16 signext %x) { ... } ``` This adds verifier, regbankselect, and instruction selection support for G_ASSERT_SEXT equivalent to G_ASSERT_ZEXT. Differential Revision: https://reviews.llvm.org/D96890	2021-02-17 13:10:34 -08:00
Aaron Green	10993bf072	Bugfix for collecting features from very small DSOs. During unit tests, it was observed that crafting an artificially small DSO could cause OOB memory to be accessed. This change fixes that (but again, the affected DSOs are unlikely to ever occur outside unit tests). Reviewed By: morehouse, charco Differential Revision: https://reviews.llvm.org/D94507	2021-02-17 13:04:49 -08:00
Teresa Johnson	3c4c205060	[WPD][lld] Test handling of vtable definition from shared libraries Adds a lld test for a case that the handling added for dynamically exported symbols in `1487747e99` already fixes. Because isExportDynamic returns true when the symbol is SharedKind with default visibility, it will treat as dynamically exported and block devirtualization when the definition of a vtable comes from a shared library. This is desireable as it is dangerous to devirtualize in that case, since there could be hidden overrides in the shared library. Typically that happens when the shared library header contains available externally definitions, which applications can override. An example is std::error_category, which is overridden in LLVM and causing failures after a self build with WPD enabled, because libstdc++ contains hidden overrides of the virtual base class methods. The regular LTO case in the new test already worked, but there are 2 fixes in this patch needed for the index-only case and the hybrid LTO case. For the index-only case, WPD should not simply ignore available externally vtables. A follow on fix will be made to clang to emit type metadata for those vtables, which the new test is modeling. For the hybrid case, we need to ensure when the module is split that any llvm.*used globals are cloned to the regular LTO split module so available externally vtable definitions are not prematurely deleted. Another follow on fix will add the equivalent gold test, which requires a small fix to the plugin to treat symbols in dynamic libraries the same way lld already is. Differential Revision: https://reviews.llvm.org/D96721	2021-02-17 12:49:24 -08:00
Sriraman Tallam	e741916330	Basic block sections should enable not function sections implicitly. Basic block sections enables function sections implicitly, this is not needed and is inefficient with "=list" option. We had basic block sections enable function sections implicitly in clang. This is particularly inefficient with "=list" option as it places functions that do not have any basic block sections in separate sections. This causes unnecessary object file overhead for large applications. This patch disables this implicit behavior. It only creates function sections for those functions that require basic block sections. This patch is the second of two patches and this patch removes the implicit enabling of function sections with basic block sections in clang. Differential Revision: https://reviews.llvm.org/D93876	2021-02-17 12:37:50 -08:00
Nico Weber	279c5dc2f3	fix comment typo to cycle bots	2021-02-17 15:29:39 -05:00
Heejin Ahn	da01a9db8b	[WebAssemblly] Fix EHPadStack update in fixCallUnwindMismatches Updating `EHPadStack` with respect to `TRY` and `CATCH` instructions have to be done after checking all other conditions, not before. Because we did this before checking other conditions, when we encounter `TRY` and we want to record the current mismatching range, we already have popped up the entry from `EHPadStack`, which we need to access to record the range. The `baz` call in the added test needs try-delegate because the previous TRY marker placement for `quux` was placed before `baz`, because `baz`'s return value was stackified in RegStackify. If this wasn't stackified this try-delegate is not strictly necessary, but at the moment it is not easy to identify cases like this. I plan to transfer `nounwind` attributes from the LLVM IR to prevent cases like this. The call in the test does not have `unwind` attribute in order to test this bug, but in many cases of this pattern the previous call has `nounwind` attribute. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D96711	2021-02-17 12:14:11 -08:00
Mircea Trofin	3a030c2f2f	[NFC][RegAlloc] InlineSpiller::Original is a Register	2021-02-17 12:07:59 -08:00
Nico Weber	0dd2ffb392	[gn build] make WindowsManifestMerger.cpp build fine with sysroot This already works in the cmake build. Differential Revision: https://reviews.llvm.org/D96889	2021-02-17 15:03:46 -05:00
Zixu Wang	e320cf23f0	[NFC][clang] Bump up DIAG_SIZE_SEMA for downstream diagnostics Bump DIAG_SIZE_SEMA up by 500 to accommodate extra downstream diagnostics Differential Revision: https://reviews.llvm.org/D96888	2021-02-17 11:54:43 -08:00
Craig Topper	3bdd02735b	[RISCV] Localize RISCVZvlssegTable to RISCVISelDAGToDAG.cpp, the only place it is used.	2021-02-17 11:37:28 -08:00
peter klausler	452d7ebc09	[flang] Ensure that intrinsic procedures are PURE &/or ELEMENTAL The intrinsic procedure table properly classify the various intrinsics, but the PURE and ELEMENTAL attributes that these classifications imply don't always make it to the utility predicates that test symbols for them, leading to spurious error messages in some contexts. So set those attribute flags as appropriate in name resolution, using a new function to isolate the tests. An alternate solution, in which the predicates would query the intrinsic procedure table for these attributes on demand, was something I also tried, so that this information could come directly from an authoritative source; but it would have required references to the intrinsic table to be passed along on too many seemingly unrelated APIs and ended up looking messy. Several symbol table tests needed to have their expected outputs augmented with the PURE and ELEMENTAL flags. Some bogus messages that were flagged as such in test/Semantics/doconcurrent01.f90 were removed, since they are now correctly not emitted. Differential Revision: https://reviews.llvm.org/D96878	2021-02-17 11:31:33 -08:00
Derek Schuff	1f9e551a81	[WebAssembly] Do not use EHCatchret symbols with wasm EH D94835 added support for WinEH to export public symbols pointing to basic blocks which are catchret targets for use with Windows CET. Wasm currently doesn't support public symbols to non-function code addresses (they get treated like new functions in asm but then don't lower to object files correctly). It created them unconditionally for all catchret targets. This change disables those symbols unless the exceptionHandlingType is WinEH (since they aren't used with ExceptionHandling::Wasm) Differential Revision: https://reviews.llvm.org/D96824	2021-02-17 11:22:48 -08:00
Craig Topper	799f7865c8	[RISCV] Use bits<7> instead of bits<11> for the EEW field size in the RISCVZvlsseg searchable table. NFCI We only support 8, 16, 32, and 64 for EEW. These only need 7 bits to represent.	2021-02-17 11:12:36 -08:00

1 2 3 4 5 ...

380261 Commits All Branches Search

380261 Commits

All Branches