llvm-project

Commit Graph

Author	SHA1	Message	Date
Lang Hames	824a73bbfa	[docs][ORC] Reword "How to Add Process and Library Symbols to the JITDylibs". Now opens with advice on what to do, rather than what not to do.	2022-03-26 13:02:01 -07:00
Alisamar Husain	bcf1978a87	[intelpt] Refactoring instruction decoding for flexibility Now the decoded thread has Append methods that provide more flexibility in terms of the underlying data structure that represents the instructions. In this case, we are able to represent the sporadic errors as map and thus reduce the size of each instruction. Differential Revision: https://reviews.llvm.org/D122293	2022-03-26 11:34:47 -07:00
Iain Sandoe	f8846229c4	[C++20][Modules][HU 3/5] Emit module macros for header units. For header units we build the top level module directly from the header that it represents and macros defined in this TU need to be emitted (when such a definition is live at the end of the TU). Differential Revision: https://reviews.llvm.org/D121097	2022-03-26 16:30:40 +00:00
LLVM GN Syncbot	139416cb5e	[gn build] Port `555214cbcc`	2022-03-26 16:10:19 +00:00
Shengchen Kan	3e41917984	[X86][tablgen] Remove useless check in X86FoldTablesEmitter.cpp. NFC Any `X86Inst` has a name.	2022-03-27 00:09:29 +08:00
Mark de Wever	555214cbcc	[libc++][format][2/6] Adds a __output_iterator. Instead of using a temporary `string` in `__vformat_to_wrapped` use a new generic iterator. This aids to reduce the number of template instantions and avoids using a `string` to buffer the entire formatted output. This changes the type of `format_context` and `wformat_context`, this can still be done since the code isn't ABI stable yet. Several approaches have been evaluated: - Using a __output_buffer base class with: - a put function to store the buffer in its internal buffer - a virtual flush function to copy the internal buffer to the output - Using a `function` to forward the output operation to the output buffer, much like the next method. - Using a type erased function point to store the data in the buffer. The last version resulted in the best performance. For some cases there's still a loss of speed over the original method. This loss many becomes apparent when large strings are copied to a pointer like iterator, before the compiler optimized this using `memcpy`. Reviewed By: ldionne, vitaut, #libc Differential Revision: https://reviews.llvm.org/D110495	2022-03-26 16:48:01 +01:00
Shengchen Kan	a86cd3be1c	[X86][tablgen] Rename some fields for RecognizableInstrBase to align with fields in TD file. NFC The comment for `HasVEX_L` is updated.	2022-03-26 23:32:50 +08:00
Shengchen Kan	dc68ca3eff	[X86][tablgen] Rename field hasREX_WPrefix to hasREX_W for X86Inst. NFC To make it more like hasVEX_L and hasEVEX_K, etc.	2022-03-26 23:14:08 +08:00
Shengchen Kan	271e8d2495	[X86][tablgen] Refine the class RecognizableInstr. NFCI 1. Add comments to explain why we set `isAsmParserOnly` for XACQUIRE and XRELEASE 2. Check `X86Inst` in the constructor of `RecognizableInstrBase` so that we can avoid the case where one of it's field is not initialized but accessed by user. (e.g. in X86EVEX2VEXTablesEmitter.cpp) 3. Move `Rec` from `RecognizableInstrBase` to `RecognizableInstr` to reduce size of `RecognizableInstrBase` 4. Remove out-of-date comments for shouldBeEmitted() (filter() was removed) 5. Add a basic field `IsAsmParserOnly` and remove the field `ShouldBeEmitted` b/c we can deduce it w/ little overhead	2022-03-26 22:41:49 +08:00
Aaron Ballman	bfa2f25d35	[C11] Correct the resulting type for an assignment expression In C, assignment expressions result in an rvalue whose type is the type of the lhs of the assignment after it undergoes lvalue to rvalue conversion. lvalue to rvalue conversion in C strips all qualifiers including _Atomic. We used getUnqualifiedType() which does not strip the _Atomic qualifier when we should have used getAtomicUnqualifiedType(). This corrects the usage and adds some comments to getUnqualifiedType() to make it more clear that it does not strip _Atomic and that's on purpose (see C11 6.2.5p27). This addresses Issue 48742.	2022-03-26 08:03:11 -04:00
Mark de Wever	c3b672a34c	[Clang][doc] Fix __builtin_assume wording. D117296 removed wording for __builtin_assume, D120205 restored the wording, but the last sentence was only partly restored. This restores the rest of the last sentence. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D122423	2022-03-26 13:02:40 +01:00
xndcn	c0ccb69228	[mlir][spirv] Convert func.call to spv.FunctionCall Differential Revision: https://reviews.llvm.org/D122368	2022-03-26 19:21:23 +08:00
zhongyunde	758be63ac6	[test][AArch64] Add a test case for D121180 NFC Now, perform last active true vector combine only where we're extracting from a flag-setting operation. But in fact, the last active extracting will output LASTB + WHILELS, and the WHILELS itself is a flag-setting operation, so precommit this case to test the potentially further optimization. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D122453	2022-03-26 19:12:16 +08:00
Shengchen Kan	c8ea732937	[X86][tablgen] Set ShouldBeEmitted to false when isAsmParserOnly is true. NFCI In fact, an instruction can not be emitted to disassemble table when `isAsmParserOnly` is true, so `isAsmParserOnly=true` implies `ShouldBeEmitted=false`. We check `isAsmParserOnly` in X86FoldTablesEmitter.cpp at a early stage b/c none of them is foldable.	2022-03-26 19:10:58 +08:00
Iain Sandoe	0687578728	[C++20][Modules][HU 2/5] Support searching Header Units in user or system search paths. This is support for the user-facing options to create importable header units from headers in the user or system search paths (or to be given an absolute path). This means that an incomplete header path will be passed by the driver and the lookup carried out using the search paths present when the front end is run. To support this, we introduce file fypes for c++-{user,system,header-unit}-header. These terms are the same as the ones used by GCC, to minimise the differences for tooling (and users). The preprocessor checks for headers before issuing a warning for "#pragma once" in a header build. We ensure that the importable header units are recognised as headers in order to avoid such warnings. Differential Revision: https://reviews.llvm.org/D121096	2022-03-26 10:17:17 +00:00
Shengchen Kan	5f543cb0ef	[X86][tablgen] Use initializer list for some fields of RecognizableInstr*. NFC Also, some code in constructor of `RecognizableInstrBase` is formatted.	2022-03-26 18:03:13 +08:00
Shengchen Kan	7a94fa58c4	[X86][tablgen] Move fields Name, Is64Bit, Is32Bit, Operands from RecognizableInstrBase to RecognizableInstr, NFCI These four fields are not used by any user of `RecognizableInstrBase`, so we can move them to `RecognizableInstr` to avoid unnecessary construction.	2022-03-26 16:43:18 +08:00
Fangrui Song	02f20a09c3	[Option] Remove the error-prone default argument true from 4-argument hasFlag	2022-03-26 01:09:18 -07:00
Fangrui Song	522712e2d2	[Option] Remove the error-prone default argument true from 3-argument hasFlag	2022-03-26 00:58:39 -07:00
Fangrui Song	c37accf0a2	[Option] Avoid using the default argument for the 3-argument hasFlag. NFC The default argument true is error-prone: I think many would think the default is false.	2022-03-26 00:57:06 -07:00
Fangrui Song	da62a5c661	[Driver][test] Clean up riscv* tests See `D119309` for the guideline (-target, -no-canonical-prefixes, unneeded -o with -###).	2022-03-25 23:59:31 -07:00
Ben Shi	bce2e208e0	[AVR] Optimize int16 airthmetic right shift for shift amount 7/14/15 Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D115618	2022-03-26 06:53:27 +00:00
Fangrui Song	88436afe30	[LoongArch] Fix several Clang warnings. NFC	2022-03-25 22:15:35 -07:00
Shengchen Kan	bf11ed293a	[X86][tablgen] Add class RecognizableInstrBase to simplify X86 code, NFCI	2022-03-26 13:03:06 +08:00
Joseph Huber	392bb8cf1f	[OpenMP] Fix AMDGPU globals test	2022-03-25 23:05:41 -04:00
Shilei Tian	545fcc3d84	[OpenMP][CUDA] Fix potential program crash caused by double free resources As we mentioned in the code comments for function `ResourcePoolTy::release`, at some point there could be two identical resources on the two sides of `Next` mark. It is usually not an issue, unless the following case: 1. Some resources are not returned. 2. We need to iterate the pool and free the element. That will cause double free, which is the case for event pool. Since we don't release events hold by the data map, it can happen that the `Next` mark is not reset, and we have two identical items in the pool. When the pool is destroyed, we will call `cuEventDestroy` twice on the same event. In the best case, we can only observe CUDA errors. In the worst case, it can cause internal failures in CUDART and further crash. This patch fixes the issue by tracking all resources that have been given using an `unordered_set`. We don't remove it when a resource is returned. When the pool is destroyed, we merge the pool (a `vector`) and the set. In this way, we can make sure that the set contains all resources allocated from the device. We just need to iterate the set and free the resource accordingly. For now, only event pool is set to use it. Stream pool is not because we can make sure all streams are returned when the plugin is destroyed. Someone might be wondering, why don't we release all events hold in the data map. That is because, plugins are determined to be destroyed before `libomptarget`. If we can somehow make the plugin outlast `libomptarget`, life will be much easier. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122014	2022-03-25 22:49:32 -04:00
Joseph Huber	9d3550c517	[OpenMP] Add AMDGPU calling convention to ctor / dtor functions This patch adds the necessary AMDGPU calling convention to the ctor / dtor kernels. These are fundamentally device kenels called by the host on image load. Without this calling convention information the AMDGPU plugin is unable to identify them. Depends on D122504 Fixes #54091 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122515	2022-03-25 22:44:20 -04:00
Joseph Huber	3c6d32ec6c	[OpenMP] Make Ctor / Dtor functions have external visibility The default construction of constructor functions by LLVM tends to make them have internal linkage. When we call a ctor / dtor function in the target region we are actually creating a kernel that is called at registration. Because the ctor is a kernel we need to make sure it's externally visible so we can actually call it. This prevented AMDGPU from correctly using constructors while NVPTX could use them simply because it ignored internal visibility. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D122504	2022-03-25 22:44:17 -04:00
Shengchen Kan	e13faa40cf	[X86][tablgen] Add interface getMnemonic to namespace X86Disassembler, NFCI Address comments in D122477 b/c `getMnemonic` is common to X86 and may be used in more than one place.	2022-03-26 09:55:54 +08:00
Maksim Panchenko	4ae9745af1	[Disassember][NFCI] Use strong type for instruction decoder All LLVM backends use MCDisassembler as a base class for their instruction decoders. Use "const MCDisassembler " for the decoder instead of "const void ". Remove unnecessary static casts. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D122245	2022-03-25 18:53:59 -07:00
Peter Klausler	435641bc3d	[flang] Catch bad OPEN(STATUS=) cases STATUS='NEW' and 'REPLACE' require FILE= to be present. STATUS='SCRATCH' may not appear with FILE=. These errors are caught at compilation time when constant character strings are used in an OPEN statement, but the runtime needs to enforce them as well to catch errors in OPEN statements with character variables and expressions. Differential Revision: https://reviews.llvm.org/D122509	2022-03-25 18:24:50 -07:00
Uday Bondhugula	5576579c86	Update affine.load folding hook to fold global splat constant loads Enhance affine.load folding hook to fold loads on global splat constant memrefs. Differential Revision: https://reviews.llvm.org/D122292	2022-03-26 06:44:03 +05:30
Fred Riss	3427eddd9a	Adopt new dyld SPIs to introspect the shared cache. With the shared cache getting split into multiple files, the current way we created ObjectFileMachO objects for shared cache dylib images will break. This patch conditionally adopts new SPIs which will do the right thing in the new world of multi-file caches.	2022-03-25 18:02:15 -07:00
Gulfem Savrun Yeniceri	ead8586645	[InstrProfiling] Add comments for no runtime hook This patch adds comments about `c7f91e227a`, and follows LLVM style guideline about nested if statements.	2022-03-26 00:26:43 +00:00
David Blaikie	34b9b1ea48	Disable -Wmissing-prototypes for internal linkage functions that aren't explicitly marked "static" Some functions can end up non-externally visible despite not being declared "static" or in an unnamed namespace in C++ - such as by having parameters that are of non-external types. Such functions aren't mistakenly intended to be defining some function that needs a declaration. They could be maybe more legible (except for the operator new example) with an explicit static, but that's a stylistic thing outside what should be addressed by a warning. This reapplies `275c56226d` - once we figure out what to do about the change in behavior for -Wnon-c-typedef-for-linkage (this reverts the revert commit `85ee1d3ca1`) Differential Revision: https://reviews.llvm.org/D121328	2022-03-25 23:53:19 +00:00
David Blaikie	a5032b2633	DebugInfo: Don't allow type units to references types in the CU We could only do this in limited ways (since we emit the TUs first, we can't use ref_addr (& we can't use that in Split DWARF either) - so we had to synthesize declarations into the TUs) and they were ambiguous in some cases (if the CU type had internal linkage, parsing the TU would require knowing which CU was referencing the TU to know which type the declaration was for, which seems not-ideal). So to avoid all that, let's just not reference types defined in the CU from TUs - instead moving the TU type into the CU (recursively). This does increase debug info size (by pulling more things out of type units, into the compile unit) - about 2% of uncompressed dwp file size for clang -O0 -g -gsplit-dwarf. (5% .debug_info.dwo section size increase in the .dwp)	2022-03-25 23:49:03 +00:00
Haojian Wu	16eaa5240e	[pseudo] Fix the wrong rule ids in ForestTest.	2022-03-26 00:05:37 +01:00
Haojian Wu	41e69fb245	[pseudo] Add missing header guard for Forest.h	2022-03-25 23:51:19 +01:00
Peter Klausler	5c116d50e4	[flang] Mark C_ASSOCIATED specific procedures as PURE The interfaces to C_ASSOCIATED()'s specific procedures must be PURE so that they are accepted for use in specification expressions. Differential Revision: https://reviews.llvm.org/D122438	2022-03-25 15:04:26 -07:00
Med Ismail Bennani	150db43e41	[lldb/Plugin] Sort the ScriptedProcess' thread list before creating threads With Scripted Processes, in order to create scripted threads, the blueprint provides a dictionary that have each thread index as the key with the respective thread instance as the pair value. In Python, this is fine because a dictionary key can be of any type including integer types: ``` >>> {1: "one", 2: "two", 10: "ten"} {1: 'one', 2: 'two', 10: 'ten'} ``` However, when the python dictionary gets bridged to C++ we convert it to a `StructuredData::Dictionary` that uses a `std::map<ConstString, ObjectSP>` for storage. Because `std::map` is an ordered container and ours uses the `ConstString` type for keys, the thread indices gets converted to strings which makes the dictionary sorted alphabetically, instead of numerically. If the ScriptedProcess has 10 threads or more, it causes thread “10” (and higher) to be after thread “1”, but before thread “2”. In order to solve this, this sorts the thread info dictionary keys numerically, before iterating over them to create ScriptedThreads. rdar://90327854 Differential Revision: https://reviews.llvm.org/D122429 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2022-03-25 14:59:50 -07:00
Med Ismail Bennani	29f363611d	[lldb/Utility] Make StructuredData::Dictionary::GetKeys return an Array This patch changes `StructuredData::Dictionary::GetKeys` return type from an `StructuredData::ObjectSP` to a `StructuredData::ArraySP`. The function already stored the keys in an array but implicitely upcasted it to an `ObjectSP`, which required the user to convert it again to a Array object to access each element. Since we know the keys should be held by an iterable container, it makes more sense to return the allocated ArraySP as-is. Differential Revision: https://reviews.llvm.org/D122426 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2022-03-25 14:59:50 -07:00
Med Ismail Bennani	12301d616f	[lldb/crashlog] Parse thread fields and pass it to crashlog scripted process Previously, the ScriptedThread used the thread index as the thread id. This patch parses the crashlog json to extract the actual thread "id" value, and passes this information to the Crashlog ScriptedProcess blueprint, to create a higher fidelity ScriptedThreaad. It also updates the blueprint to show the thread name and thread queue. Finally, this patch updates the interactive crashlog test to reflect these changes. rdar://90327854 Differential Revision: https://reviews.llvm.org/D122422 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2022-03-25 14:59:50 -07:00
Fangrui Song	afaefb671f	[Driver][Linux] Remove D.Dir+"/../lib" from default search paths for LLVM_ENABLE_RUNTIMES builds The rule was added in 2014 to support -stdlib=libc++ and -lc++ without specifying -L, when D.Dir is not a well-known system library directory like /usr/lib /usr/lib64. This rule turns out to get in the way with (-m32 for 64-bit clang) or (-m64 for 32-bit clang) for Gentoo : https://github.com/llvm/llvm-project/issues/54515 Nowadays LLVM_ENABLE_RUNTIMES is the only recommended way building libc++ and LLVM_ENABLE_PROJECTS=libc++ is deprecated. LLVM_ENABLE_RUNTIMES builds libc++ in D.Dir+"/../lib/${triple}/". The rule is unneeded. Also reverts D108286. Gentoo uses a modified LLVM_ENABLE_RUNTIMES that installs libc++.so in well-known paths like /usr/lib64 and /usr/lib which are already covered by nearby search paths. Implication: if a downstream package needs something like -lLLVM-15git and uses libLLVM-15git.so not in a well-known path, it needs to supply -L D.Dir+"/../lib" explicitly (e.g. via LLVMConfig.cmake), instead of relying on the previous default search path. Reviewed By: mgorny Differential Revision: https://reviews.llvm.org/D122444	2022-03-25 14:56:18 -07:00
Johannes Doerfert	6c2be885ff	Revert "[OpenMP][NFC] Add missing virtual destructor to silence warning" This reverts commit `b9fd8f34ae` as it accidentally contained a unit test change that is not finished (and unrelated).	2022-03-25 16:07:11 -05:00
Florian Hahn	bb9bdef4df	[Clang] Use pattern to match profile metadata in test. Make the test more robust to slightly different metadata numbering by using a pattern instead of hard coding the ids.	2022-03-25 21:05:58 +00:00
Johannes Doerfert	7dfad948f1	[OpenMP][FIX] Repair ExclusiveAccess move semantic snafu	2022-03-25 16:00:53 -05:00
Johannes Doerfert	b9fd8f34ae	[OpenMP][NFC] Add missing virtual destructor to silence warning	2022-03-25 16:00:53 -05:00
William S. Moses	89525cbf28	[Clang] Add helper method to determine if a nonvirtual base has an entry in the LLVM struct This patch adds a helper method to determine if a nonvirtual base has an entry in the LLVM struct. Such a base may not have an entry if the base does not have any fields/bases itself that would change the size of the struct. This utility method is useful for other frontends (Polygeist) that use Clang as an API to generate code. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D122502	2022-03-25 16:32:12 -04:00
Paul Robinson	6aa0397758	Remove dead code in driver parsing -gsimple-template-names= options While -g[no-]simple-template-names is a driver option, the fancier -gsimple-template-names={simple,mangled} option is cc1-only, so code to handle it in the driver is dead. Differential Revision: https://reviews.llvm.org/D122503	2022-03-25 13:23:24 -07:00
Peter Klausler	2ab9990c9e	[flang] Add & use a better visit() Adds flang/include/flang/Common/visit.h, which defines a Fortran::common::visit() template function that is a drop-in replacement for std::visit(). Modifies most use sites in the front-end and runtime to use common::visit(). The C++ standard mandates that std::visit() have O(1) execution time, which forces implementations to build dispatch tables. This new common::visit() is O(log2 N) in the number of alternatives in a variant<>, but that N tends to be small and so this change produces a fairly significant improvement in compiler build memory requirements, a 5-10% improvement in compiler build time, and a small improvement in compiler execution time. Building with -DFLANG_USE_STD_VISIT causes common::visit() to be an alias for std::visit(). Calls to common::visit() with multiple variant arguments are referred to std::visit(), pending further work. Differential Revision: https://reviews.llvm.org/D122441	2022-03-25 13:15:20 -07:00

1 2 3 4 5 ...

419278 Commits All Branches Search

419278 Commits

All Branches