llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Bieneman	4f710515fb	Fix memory leak I introduced in `2d66ed370a` This should fix the asan issue identified on the Linux asan bot.	2022-01-31 19:44:47 -06:00
David Blaikie	275c56226d	Disable -Wmissing-prototypes for internal linkage functions that aren't explicitly marked "static" Some functions can end up non-externally visible despite not being declared "static" or in an unnamed namespace in C++ - such as by having parameters that are of non-external types. Such functions aren't mistakenly intended to be defining some function that needs a declaration. They could be maybe more legible (except for the `operator new` example) with an explicit static, but that's a stylistic thing outside what should be addressed by a warning.	2022-01-31 17:34:51 -08:00
Jonas Devlieghere	d329dfd0c8	[lldb] Use the build's python interpreter in the shell tests Make sure that the shell tests use the same python interpreter as the rest of the build instead of picking up `python` from the PATH. It would be nice if we could use the _disallow helper, but that triggers on invocations that specify python as the scripting language.	2022-01-31 16:53:42 -08:00
Fangrui Song	7aaf024dac	[BitcodeWriter] Fix cases of some functions `WriteIndexToFile` is used by external projects so I do not touch it.	2022-01-31 16:46:11 -08:00
Fangrui Song	85dfe19b36	[ModuleUtils] Move EmbedBufferInModule to LLVMTransformsUtils D116542 adds EmbedBufferInModule which introduces a layer violation (https://llvm.org/docs/CodingStandards.html#library-layering). See `2d5f857a1e` for detail. EmbedBufferInModule does not use BitcodeWriter functionality and should be moved LLVMTransformsUtils. While here, change the function case to the prevailing convention. It seems that EmbedBufferInModule just follows the steps of EmbedBitcodeInModule. EmbedBitcodeInModule calls WriteBitcodeToFile but has IR update operations which ideally should be refactored to another library. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D118666	2022-01-31 16:33:57 -08:00
Joseph Huber	4a780aa13e	[LLVM] Resolve layer violation in BitcodeWriter Summary: The changes introduced in D116542 added a dependency on TransformUtils to use the `appendToCompilerUsed` method. This created a circular dependency. This patch simply copies the needed function locally to remove the dependency.	2022-01-31 19:01:08 -05:00
Keith Smiley	7a836ba3c5	[llvm-objcopy][MachO] Ignore LC_LINKER_OPTION when redefining symbols Previously you would get this error: ``` error: unsupported load command (cmd=0x2d) ``` If the binary you were redefining the symbols of contained a LC_LINKER_OPTION load command. This command does not need to be changed when redefining symbols so we can ignore it like many others. Differential Revision: https://reviews.llvm.org/D118526	2022-01-31 15:49:06 -08:00
Fangrui Song	2d5f857a1e	[Bazel] Add include/llvm/Transforms/Utils/ModuleUtils.h to work around layer violation after D116542 There is a layer violation and can break clang -fmodule-name=X -fmodules-strict-decluse builds: * LLVMTransformUtils has `#include "llvm/Bitcode/BitcodeWriterPass.h"` * LLVMBitWriter depends on LLVMTransformUtils after D116542 Temporarily work around the issue.	2022-01-31 15:41:45 -08:00
Michael Kruse	8a9e4f245b	[Clang][OpenMPIRBuilder] Fix off-by-one error when dividing by stepsize. When the stepsize does not evenly divide the range's end, round-up to ensure that that last multiple of the stepsize before the reaching the upper boud is reached. For instance, the trip count of for (int i = 0; i < 7; i+=5) is two (i=0 and i=5), not (7-0)/5 == 1. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D118542	2022-01-31 17:28:52 -06:00
Peter Klausler	c7f4c333af	[flang] Make NEWUNIT= use a range suitable for INTEGER(KIND=1) and recycle unit numbers Use a bit-set to manage runtime-generated I/O unit numbers, recycle them after they're closed, and use a range of values that fits in a minimal-sized integer. Differential Revision: https://reviews.llvm.org/D118651	2022-01-31 15:20:21 -08:00
Mircea Trofin	9aa2c914b9	[mlgo][regalloc] Factor live interval feature calculation Factoring it out so we can subsequently cache it. This should be a NFC, however, for the float quantities, we see small errors in the least significant digits. This is because, before, we were summing up one by one. Now, we sum up results of sums. This shouldn't matter for ML, and will require rework when we do quantization (avoiding floats altogether), but meanwhile, it did require an update to the reference file used for testing. The patch also bumps the precision of the variables involved in this, to reduce the error (note they are casted back to float at the end by the SET macro, since we only work with float and not double in TF) Differential Revision: https://reviews.llvm.org/D118659	2022-01-31 15:19:15 -08:00
Snehasish Kumar	186dcd4aab	[instrprof][NFC] Refactor out the common logic for getProfileKind. The logic for getProfileKind for RawInstrProfReader and InstrProfReaderIndex is similar. To avoid duplication, move the logic from the header to InstrProfReader.cpp and introduce a static method which implements the common code. Differential Revision: https://reviews.llvm.org/D118656	2022-01-31 15:04:42 -08:00
Snehasish Kumar	8306968b59	[memprof] Move the meminfo block struct to MemProfData.inc. The definition of the MemInfoBlock is shared between the memprof compiler-rt runtime and llvm/lib/ProfileData/. This change removes the memprof_meminfoblock header and moves the struct to the shared include file. To enable this sharing, the Print method is moved to the memprof_allocator (the only place it is used) and the remaining uses are updated to refer to the MemInfoBlock defined in the MemProfData.inc file. Also a couple of other minor changes which improve usability of the types in MemProfData.inc. * Update the PACKED macro to handle commas. * Add constructors and equality operators. * Don't initialize the buildid field. Differential Revision: https://reviews.llvm.org/D116780	2022-01-31 15:04:41 -08:00
Peter Klausler	702c0cfa07	[flang] runtime perf: larger I/O buffer growth increments When reallocating an I/O buffer to accommodate a large record, ensure that the amount of growth is at least as large as the minimum initial record size (64KiB). The previous policy was causing input buffer reallocation for each byte after the minimum buffer size when scanning input data for record termination newlines. Differential Revision: https://reviews.llvm.org/D118649	2022-01-31 14:53:15 -08:00
Dávid Bolvanský	623b66eded	[Clang][NFC] Added testcase from #49549 The issue is fixed in trunk, so add testcase to avoid regression in the future.	2022-01-31 23:45:56 +01:00
Konstantin Varlamov	85dae4c11e	[libc++][ranges][NFC] Fix formatting on newly-added links on the Ranges status page.	2022-01-31 14:45:21 -08:00
Sam Clegg	45ad3467b7	[clang][WebAssembly] Imply -fno-threadsafe-static when threading is disabled When we don't enable atomics we completely disabled threading in which case there is no point in generating thread safe code for static initialization. This should always be safe because, in WebAssembly, it is not possible to link object compiled without the atomics feature into a mutli-threaded program. See https://github.com/emscripten-core/emscripten/pull/16152 Differential Revision: https://reviews.llvm.org/D118571	2022-01-31 14:26:09 -08:00
Chris Bieneman	2d66ed370a	[NFC] Skip PassBuilderCTests if no default triple This fixes the unit tests so that it is skipped if there is no default target triple set. Unset default target triple is a supported build configuration for LLVM.	2022-01-31 16:24:42 -06:00
Mircea Trofin	d46305e22d	[NFC][regalloc] Move evict advisor initialization before VRAI This is because a subsequent patch will propose obtaining the VRAI from the advisor, which will enable feature caching for the ML advisor, for better compile time. Making this change first as it's both innocuous and keeps the future patch to be reviewed small.	2022-01-31 14:04:59 -08:00
Joachim Protze	0fd5f6964d	[OpenMP][tests][NFC] Pin debug info to DWARF v4 for libarcher tests Temporary solution for #53467, since debian test machines do not support DWARF v5.	2022-01-31 22:55:29 +01:00
Kirill Stoimenov	a5dd6c7419	[ASan] Fixed null pointer bug introduced in D112098. Also added some more test to cover the "else if" part. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D118645	2022-01-31 21:50:10 +00:00
Joseph Huber	b79e2a1ccd	[OpenMP] Remove hard-coded triple in new driver test Summary: Previously this test used a hard-coded triple value in the check lines wihch failed on other architectures. This patch changes that to accept any host triple.	2022-01-31 16:46:51 -05:00
Itay Bookstein	2a868802a3	[clang][CodeGen][NFC] Remove unused CodeGenModule fields Signed-off-by: Itay Bookstein <ibookstein@gmail.com> Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D118619	2022-01-31 23:45:53 +02:00
Mircea Trofin	bc3b372161	[nfc][mlgo] De-const a parameter We plan to pass the MachineFunction& to APIs that expect it non-const (for legitimate reasons). The advisor still holds the ref as a const ref, though, so we keep most of the maintainability value of that.	2022-01-31 13:44:33 -08:00
Peter Klausler	52a1346b78	[flang] Distinguish intrinsic from non-intrinsic modules For "USE, INTRINSIC", search only for intrinsic modules; for "USE, NON_INTRINSIC", do not recognize intrinsic modules. Allow modules of both kinds with the same name to be used in the same source file (but not in the same scoping unit, a constraint of the standard that is now enforced). The symbol table's scope tree now has a single instance of a scope with a new kind, IntrinsicModules, whose children are the USE'd intrinsic modules (explicit or not). This separate "top-level" scope is a child of the single global scope and it allows both intrinsic and non-intrinsic modules of the same name to exist in the symbol table. Intrinsic modules' scopes' symbols now have the INTRINSIC attribute set. The search path directories need to make a distinction between regular directories and the one(s) that point(s) to intrinsic modules. I allow for multiple intrinsic module directories in the second search path, although only one is needed today. Differential Revision: https://reviews.llvm.org/D118631	2022-01-31 13:31:27 -08:00
William S. Moses	8cb9c73609	[LoopIdiom] Keep TBAA when creating memcpy/memmove When upgrading a loop of load/store to a memcpy, the existing pass does not keep existing aliasing information. This patch allows existing aliasing information to be kept. Reviewed By: jeroen.dobbelaere Differential Revision: https://reviews.llvm.org/D108221	2022-01-31 16:28:13 -05:00
Nico Weber	d1daa54f36	[gn build] (manually) port `551b177452`	2022-01-31 16:15:48 -05:00
Martin Storsjö	eb9645819c	[libcxx] [Windows] Pick a unique bit for __regex_word The old `__regex_word` aliased the mask for `xdigit`, causing stray test failures. The diff may look surprising, as if the previous faulty value had been set specifically for Windows - but this is due to a restructuring in `411c630bae`. Prior to that, there were OS specific settings for some OSes, and one fallback used for the rest (which turns out to not work for Windows). Differential Revision: https://reviews.llvm.org/D118188	2022-01-31 23:13:58 +02:00
David Greene	ecd46edd61	[UpdateTestChecks] Re-add --filter and --filter-out options Re-add filtering options with fixes for failed tests. We were not passing the is_filtered argument in all check generator calls in update_cc_test_checks.py Enhance the various update_*_test_checks.py tools to allow filtering the tool output with regular expressions. The --filter option will emit only tool output lines matching the given regular expression while the --filter-out option will emit only tools output lines not matching the given regular expression. Filters are applied in order of appearance on the command line (or in UTC_ARGS) and the first matching filter terminates the search. This allows test authors to create more focused tests by removing irrelevant tool output and checking only the pieces of output necessary to test the desired functionality. Differential Revision: https://reviews.llvm.org/D117694	2022-01-31 13:11:40 -08:00
tyb0807	e21f90dba2	[AArch64] Removing redundant PAuth flag This removes `HasPAUTH` from `AArch64SubTarget`, as it seems to be a redundant, unused copy of `HasPAuth`. Differential Revision: https://reviews.llvm.org/D117782	2022-01-31 21:00:30 +00:00
tyb0807	5aa08bf708	[AArch64][SelectionDAG] CodeGen for Armv8.8/9.3 MOPS New target SDNodes are added: AArch64ISD::MOPS_MEMSET, etc. Each intrinsic is translated to one of these in SelectionDAGBuilder via EmitTargetCodeForMOPS. A custom lowering routine for INTRINSIC_W_CHAIN is added to handle llvm.aarch64.mops.memset.tag. This takes a separate path from the common intrinsics but ultimately ends up in the same EmitMOPS(). This is part 4/4 of a series of patches split from https://reviews.llvm.org/D117405 to facilitate reviewing. Patch by Tomas Matheson, Lucas Prates and Son Tuan Vu. Differential Revision: https://reviews.llvm.org/D117764	2022-01-31 20:56:27 +00:00
Joseph Huber	95c8f74640	[Clang] Introduce Clang Linker Wrapper Tool This patch introduces a linker wrapper tool that allows us to preprocess files before they are sent to the linker. This adds a dummy action and job to the driver stage that builds the linker command as usual and then replaces the command line with the wrapper tool. Depends on D116543 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116544	2022-01-31 15:56:04 -05:00
Joseph Huber	12ae095bbb	[OpenMP] Embed device files into the host IR This patch adds support for embedding the device object files into the host IR to create a fat binary. Each offloading file will be inserted into a section with the following naming format `.llvm.offloading.<triple>.<arch>.<filename>`. Depends on D116542 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116543	2022-01-31 15:56:02 -05:00
Joseph Huber	551b177452	[OpenMP] Add a flag for embedding a file into the module This patch adds support for a flag `-fembed-offload-binary` to embed a file as an ELF section in the output by placing it in a global variable. This can be used to bundle offloading files with the host binary so it can be accessed by the linker. The section is named using the `-fembed-offload-section` option. Depends on D116541 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116542	2022-01-31 15:56:00 -05:00
Joseph Huber	2f9ace9e9a	[OpenMP] Introduce new flag to change offloading driver pipeline This patch introduces the `-fopenmp-new-driver` option which instructs the compiler to use a new driver scheme for producing offloading code. In this scheme we create a complete offloading object file and then pass it as input to the host compilation phase. This will allow us to embed the object code in the backend phase. This is the start of a series of commits to rework the OpenMP offloading driver pipeline. The goal of this is to simplify the steps required for creating an offloading program. This patch changes the driver's configuration to simply pass the device file back to the host as an input so it can be embedded as an LLVM IR global during the backend, then simply passes that object file to the linker. This driver implementation will currently create the following phases, ``` $ clang input.c -fopenmp -fopenmp-targets=nvptx64 -fopenmp-new-driver -ccc-print-phases +- 0: input, "input.c", c, (host-openmp) +- 1: preprocessor, {0}, cpp-output, (host-openmp) +- 2: compiler, {1}, ir, (host-openmp) \| \| +- 3: input, "input.c", c, (device-openmp) \| \| +- 4: preprocessor, {3}, cpp-output, (device-openmp) \| \|- 5: compiler, {4}, ir, (device-openmp) \| +- 6: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {5}, ir \| +- 7: backend, {6}, assembler, (device-openmp) \|- 8: assembler, {7}, object, (device-openmp) +- 9: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {8}, ir +- 10: backend, {9}, assembler, (host-openmp) +- 11: assembler, {10}, object, (host-openmp) 12: clang-linker-wrapper, {11}, image, (host-openmp) ``` Which will map to the following bindings ``` # "x86_64-unknown-linux-gnu" - "clang", inputs: ["input.c"], output: "/tmp/input-bae62e.bc" # "nvptx64" - "clang", inputs: ["input.c", "/tmp/input-bae62e.bc"], output: "/tmp/input-76784e.s" # "nvptx64" - "NVPTX::Assembler", inputs: ["/tmp/input-76784e.s"], output: "/tmp/input-8f29db.o" # "x86_64-unknown-linux-gnu" - "clang", inputs: ["/tmp/input-bae62e.bc", "/tmp/input-8f29db.o"], output: "/tmp/input-545450.o" # "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["/tmp/input-545450.o"], output: "a.out" ``` Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D116541	2022-01-31 15:55:58 -05:00
tyb0807	78fd413cf7	[AArch64][GlobalISel] CodeGen for Armv8.8/9.3 MOPS This implements codegen for Armv8.8/9.3 Memory Operations extension (MOPS). Any memcpy/memset/memmov intrinsics will always be emitted as a series of three consecutive instructions P, M and E which perform the operation. The SelectionDAG implementation is split into a separate patch. AArch64LegalizerInfo will now consider the following generic opcodes if +mops is available, instead of legalising by expanding them to libcalls: G_BZERO, G_MEMCPY_INLINE, G_MEMCPY, G_MEMMOVE, G_MEMSET The s8 value of memset is legalised to s64 to match the pseudos. AArch64O0PreLegalizerCombinerInfo will still be able to combine G_MEMCPY_INLINE even if +mops is present, as it is unclear whether it is better to generate fixed length copies or MOPS instructions for the inline code of small or zero-sized memory operations, so we choose to be conservative for now. AArch64InstructionSelector will select the above as new pseudo instructions: AArch64::MOPSMemory{Copy/Move/Set/SetTagging} These are each expanded to a series of three instructions (e.g. SETP/SETM/SETE) which must be emitted together during code emission to avoid scheduler reordering. This is part 3/4 of a series of patches split from https://reviews.llvm.org/D117405 to facilitate reviewing. Patch by Tomas Matheson and Son Tuan Vu Differential Revision: https://reviews.llvm.org/D117763	2022-01-31 20:54:41 +00:00
River Riddle	6ec9fd20bd	[mlir:Standard][NFC] Remove the dead Arithmetic op classes from Ops.td These were dead after the arithmetic operations moved from Standard to the Arithmetic dialect.	2022-01-31 12:53:29 -08:00
tyb0807	13660715e6	[AArch64] Modeling NZCV read/write for MOPS instructions According to the specification, MOPS instructions define/use NZCV flags as part of their semantics (see discussion in https://reviews.llvm.org/D116157). More specifically, the specification of the MOPS extension states that each memcpy/memset/memmov operation will be performed by a series of three MOPS instructions P, M and E. The P instruction writes to the NZCV flags, while the others (M and E) reads from the NZCV flags. This is part 2/4 of a series of patches split from https://reviews.llvm.org/D117405 to facilitate reviewing. Differential Revision: https://reviews.llvm.org/D117757	2022-01-31 20:50:16 +00:00
tyb0807	51e188d079	[AArch64] Support for memset tagged intrinsic This introduces a new ACLE intrinsic for memset tagged (https://github.com/ARM-software/acle/blob/next-release/main/acle.md#memcpy-family-of-operations-intrinsics---mops). void __builtin_arm_mops_memset_tag(void , int, size_t) A corresponding LLVM intrinsic is introduced: i8* llvm.aarch64.mops.memset.tag(i8*, i8, i64) The types match llvm.memset but the return type is not void. This is part 1/4 of a series of patches split from https://reviews.llvm.org/D117405 to facilitate reviewing. Patch by Tomas Matheson Differential Revision: https://reviews.llvm.org/D117753	2022-01-31 20:49:34 +00:00
Peter Klausler	f651bbea06	[flang] Correct interpretation of RECL= When RECL= is set on OPEN(), ensure that it: 1) enforces a max output record payload size (not including header+footer or newline), and 2) causes padding of short output records only for ACCESS='DIRECT' The previous code was causing some false overrun errors and applying padding to sequential/stream output files. Differential Revision: https://reviews.llvm.org/D118630	2022-01-31 12:48:17 -08:00
Mircea Trofin	afbc7bdf98	[mlgo][regalloc][test] Add comprehensive log output testing	2022-01-31 12:46:18 -08:00
Sanjoy Das	18e33ba9c9	Remove `mutable` and stray comment The `mutable` was added back when `scope` was a `DataLayoutOpInterface`. Differential Revision: https://reviews.llvm.org/D118643	2022-01-31 12:33:20 -08:00
Martin Storsjö	1a5388ca67	[libcxx] [Windows] Use the standard vsnprintf instead of _vsnprintf In ancient Microsoft C runtimes, there might only have been a nonstandard `_vsnprintf` instead of the standard `vsnprintf`, but in modern versions (the only ones relevant for libc++), both are available. In MinGW configurations built with `__USE_MINGW_ANSI_STDIO=1` (as it is built in CI), `vsnprintf` provides a more standards compliant behaviour than what Microsoft's CRT provides, while `_vsnprintf` retains the Microsoft C runtime specific quirks. Differential Revision: https://reviews.llvm.org/D118187	2022-01-31 22:29:12 +02:00
Daniel McIntosh	0ee7a2c304	[docs] Update Prolog/Epilog Code Insertion docs to show it's still incomplete Compact Unwind is a subsection, but that was lost in rGff9feeb520a32d076c3095468208ae116c428285 Reviewed By: void Differential Revision: https://reviews.llvm.org/D118499	2022-01-31 15:25:46 -05:00
Sam Clegg	3e230d15eb	Revert "[WebAssembly] Refactor and fix emission of external IR global decls" This reverts commit `00bf4755e9`. This change broke the emscripten builder (among other things): https://ci.chromium.org/ui/p/emscripten-releases/builders/try/linux/b8823500584349280721/overview Sample failure: ``` test_unistd_unlink (test_core.core0) ... wasm-ld: error: symbol type mismatch: __stdio_write >>> defined as WASM_SYMBOL_TYPE_FUNCTION in /usr/local/google/home/sbc/dev/wasm/emscripten/cache/sysroot/lib/wasm32-emscripten/libc-debug.a(__stdio_write.o) >>> defined as WASM_SYMBOL_TYPE_DATA in /usr/local/google/home/sbc/dev/wasm/emscripten/cache/sysroot/lib/wasm32-emscripten/libc-debug.a(stderr.o) ```	2022-01-31 12:20:56 -08:00
Joseph Huber	ad0a306a38	[OpenMP][NFC] Change error message on offloading failure to mention documentation This patch changes the error message to instead mention the documentation page for the debugging options provided by libomptarget and the bitcode runtimes. Add some extra information to the documentation to help users more quickly identify debugging resources. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D118626	2022-01-31 15:19:52 -05:00
Joseph Huber	fd5853dae6	[Libomptarget] Reduce shared memory stack size to 512 and a message when it is exceeded Reduces the shared memory size used for globalization to 512 bytes from 2048 to reduce the pressure on shared memory. This patch ado adds a debug mesage to indicate when the shared memory was insufficient. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D118625	2022-01-31 15:19:48 -05:00
Sanjay Patel	06fd721fe7	[x86] add tests for binop of select with identity constant; NFC	2022-01-31 15:08:00 -05:00
bakhtiyar	149311b405	[async] Get the number of worker threads from the runtime. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D117751	2022-01-31 12:06:01 -08:00
Adrian Prantl	2afc8be2fa	Work around a Clang modules build issue. See: https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/40636/consoleFull#-39956214149ba4694-19c4-4d7e-bec5-911270d8a58c ``` llvm/lib/Support/Valgrind.cpp:37:63: error: missing '#include <stddef.h>'; 'size_t' must be declared before it is used void llvm::sys::ValgrindDiscardTranslations(const void *Addr, size_t Len) { ^ /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/clang/13.0.0/include/stddef.h:46:23: note: declaration here is not visible typedef __SIZE_TYPE__ size_t; ^ 1 error generated. ``` rdar://88049280	2022-01-31 12:03:00 -08:00

1 2 3 4 5 ...

413153 Commits All Branches Search

413153 Commits

All Branches