llvm-project

Commit Graph

Author	SHA1	Message	Date
Sean Silva	57211fd239	[mlir] Use dynamic_tensor_from_elements in shape.broadcast conversion Now, convert-shape-to-std doesn't internally create memrefs, which was previously a bit of a layering violation. The conversion to memrefs should logically happen as part of bufferization. Differential Revision: https://reviews.llvm.org/D89669	2020-10-19 15:51:46 -07:00
Sean Silva	7885bf8b78	[mlir][DialectConversion] Fix recursive `clone` calls. The framework was not tracking ops created in any regions of the cloned op. Differential Revision: https://reviews.llvm.org/D89668	2020-10-19 15:51:46 -07:00
Sean Silva	f4abd3ed6d	[mlir] Add std.dynamic_tensor_from_elements bufferization. It's unfortunate that this requires adding a dependency on scf dialect to std bufferization (and hence all of std transforms). This is a bit perilous. We might want a lib/Transforms/Bufferize/ with a separate bufferization library per dialect? Differential Revision: https://reviews.llvm.org/D89667	2020-10-19 15:51:45 -07:00
Sean Silva	e3f5073a96	[mlir] Add some more std bufferize patterns. Add bufferizations for extract_element and tensor_from_elements. Differential Revision: https://reviews.llvm.org/D89594	2020-10-19 15:51:45 -07:00
Volodymyr Sapsai	4000c9ee18	Reland "[Modules] Add stats to measure performance of building and loading modules." Measure amount of high-level or fixed-cost operations performed during building/loading modules and during header search. High-level operations like building a module or processing a .pcm file are motivated by previous issues where clang was re-building modules or re-reading .pcm files unnecessarily. Fixed-cost operations like `stat` calls are tracked because clang cannot change how long each operation takes but it can perform fewer of such operations to improve the compile time. Also tracking such stats over time can help us detect compile-time regressions. Added stats are more stable than the actual measured compilation time, so expect the detected regressions to be less noisy. On relanding drop stats in MemoryBuffer.cpp as their value is pretty low but affects a lot of clients and many of those aren't interested in modules and header search. rdar://problem/55715134 Reviewed By: aprantl, bruno Differential Revision: https://reviews.llvm.org/D86895	2020-10-19 15:44:11 -07:00
Stanislav Mekhanoshin	6ddadf9901	[AMDGPU] flat scratch ST addressing mode on gfx10 GFX10 enables third addressing mode for flat scratch instructions, an ST mode. In that mode both register operands are omitted and only swizzled offset is used in addition to flat_scratch base. Differential Revision: https://reviews.llvm.org/D89501	2020-10-19 15:29:52 -07:00
Walter Erquinigo	8a203bb22d	[trace] rename ThreadIntelPT into TraceTrace Renamed ThreadIntelPT to TreaceThread, making it a top-level class. I noticed that this class can and shuld work for any trace plugin and there's nothing intel-pt specific in it. With that TraceThread change, I was able to move most of the json file parsing logic to the base class TraceSessionFileParser, which makes adding new plug-ins easier. This originally was part of https://reviews.llvm.org/D89283 Differential Revision: https://reviews.llvm.org/D89408	2020-10-19 15:15:02 -07:00
Jordan Rupprecht	8a377f1e3c	[NFC] Inline assertion-only variable	2020-10-19 15:11:37 -07:00
Sergei Trofimovich	1eb812e06d	[VE] Fix initializer visibility Before the change attempt to link libLTO.so against shared LLVM library failed as: ``` [ 76%] Linking CXX shared library ../../lib/libLTO.so ... /usr/bin/cmake -E cmake_link_script CMakeFiles/LTO.dir/link.txt --verbose=1 c++ -o ...libLTO.so.12git ...ibLLVM-12git.so ld: CMakeFiles/LTO.dir/lto.cpp.o: in function `llvm::InitializeAllTargetInfos()': include/llvm/Config/Targets.def:31: undefined reference to `LLVMInitializeVETargetInfo' ``` It happens because on linux llvm build system sets default symbol visibility to "hidden". The fix is to set visibility back to "default" for exported APIs with LLVM_EXTERNAL_VISIBILITY. Bug: https://bugs.llvm.org/show_bug.cgi?id=47847 Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89633	2020-10-19 22:54:41 +01:00
Yaxun (Sam) Liu	52bcd691cb	Recommit "[CUDA][HIP] Defer overloading resolution diagnostics for host device functions" This recommits `7f1f89ec8d` and `40df06cdaf` with bug fixes for memory sanitizer failure and Tensile build failure.	2020-10-19 17:48:04 -04:00
Yaxun (Sam) Liu	7e561b62d2	[NFC] Refactor DiagnosticBuilder and PartialDiagnostic PartialDiagnostic misses some functions compared to DiagnosticBuilder. This patch refactors DiagnosticBuilder and PartialDiagnostic, extracts the common functionality so that the streaming << operators are shared. Differential Revision: https://reviews.llvm.org/D84362	2020-10-19 17:48:04 -04:00
Amy Huang	ea693a1627	[NPM] Port module-debuginfo pass to the new pass manager Port pass to NPM and update tests in DebugInfo/Generic. Differential Revision: https://reviews.llvm.org/D89730	2020-10-19 14:31:17 -07:00
Roman Lebedev	e0567582b8	[NFCI][SCEV] Always refer to enum SCEVTypes as enum, not integer The main tricky thing here is forward-declaring the enum: we have to specify it's underlying data type. In particular, this avoids the danger of switching over the SCEVTypes, but actually switching over an integer, and not being notified when some case is not handled. I have updated most of such switches to be exaustive and not have a default case, where it's pretty obvious to be the intent, however not all of them.	2020-10-20 00:10:22 +03:00
Roman Lebedev	d4b0aa9773	[NFC][SCEV] BuildConstantFromSCEV(): reformat, NFC Makes diff in next commit more readable	2020-10-20 00:10:22 +03:00
Roman Lebedev	3355284b2d	[NFC][SCEVExpander] isHighCostExpansionHelper(): rewrite as a switch If we switch over an enum, compiler can easily issue a diagnostic if some case is not handled. However with an if cascade that isn't so. Experimental evidence suggests new behavior to be superior.	2020-10-20 00:10:22 +03:00
Dávid Bolvanský	d605a11993	[Intrinsics] Added writeonly attribute to the first arg of llvm.memmove D18714 introduced writeonly attribute: "Also start using the attribute for memset, memcpy, and memmove intrinsics, and remove their special-casing in BasicAliasAnalysis." But actually, writeonly was not attached to memmove - oversight, it seems. So let's add it. As we can see, this helps DSE to eliminate redundant stores. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D89724	2020-10-19 23:09:41 +02:00
Martin Storsjö	93671fffb5	[libcxx] [test] Use _putenv instead of setenv/unsetenv on windows Move the functions to the helper header and keep the arch specific logic there. Differential Revision: https://reviews.llvm.org/D89681	2020-10-20 00:07:02 +03:00
Martin Storsjö	81db3c31aa	[libcxx] [test] Fix all remaining issues with fs::path::string_type being wstring Use fs::path as variable type instead of std::string, when the input potentially is a path, as they can't be implicitly converted back to string. Differential Revision: https://reviews.llvm.org/D89674	2020-10-20 00:07:02 +03:00
Martin Storsjö	5c39eebc12	[libcxx] [test] Fix filesystem_test_helper.h to compile for windows Use .string() instead of .native() in places where we want to combine paths with std::string. Convert some methods to take a fs::path as parameter instead of std::string, for cases where they are called with paths as parameters (which can't be implicitly converted to std::string if the path's string_type is wstring). Differential Revision: https://reviews.llvm.org/D89530	2020-10-20 00:07:02 +03:00
Martin Storsjö	afe40b305d	[libcxx] [test] Mark tests that require specific allocation behaviours as libcpp only This fixes/silences a few failures on libstdc++ on linux. Differential Revision: https://reviews.llvm.org/D89676	2020-10-20 00:07:01 +03:00
Martin Storsjö	fa88f61ef5	[libcxx] [test] Exclude domain socket tests on windows, like bsd/darwin Differential Revision: https://reviews.llvm.org/D89673	2020-10-20 00:07:01 +03:00
Martin Storsjö	cf9831b843	[libcxx] [test] Add LIBCPP_ONLY() around another test for an implementation detail Differential Revision: https://reviews.llvm.org/D89675	2020-10-20 00:07:01 +03:00
Martin Storsjö	41c5070888	[libcxx] [test] Don't require fs::path::operator(string_type&&) to be noexcept Mark this as a libcpp specific test; the standard doesn't say that this method should be noexcept. Differential Revision: https://reviews.llvm.org/D89677	2020-10-20 00:07:01 +03:00
Martin Storsjö	e2ddd515ab	[libcxx] [test] Allow fs::permissions(path, perms, perm_options, error_code) to be noexcept The standard doesn't declare this overload as noexcept, but doesn't either say that it strictly cannot be noexcept either. The function doesn't throw on errors that are signaled via error_code, but the standard says that it may throw a bad_alloc. This fixes an error with libstdc++ on linux. Differential Revision: https://reviews.llvm.org/D89678	2020-10-20 00:07:01 +03:00
Martin Storsjö	c61c7ba595	[libcxx] [test] Do error printfs to stderr in filesystems tests This makes them more readable in llvm-lit's output on failures. This only applies the change on the filesystem test subdir. Differential Revision: https://reviews.llvm.org/D89680	2020-10-20 00:07:01 +03:00
Martin Storsjö	5eece137bc	[clang] Automatically link against oldnames just as linking against libcmt Differential Revision: https://reviews.llvm.org/D89702	2020-10-20 00:07:00 +03:00
Duncan P. N. Exon Smith	0ddf4bd47c	clang/{Format,Rewrite}: Stop using SourceManager::getBuffer, NFC Update clang/lib/Format and clang/lib/Rewrite to use a `MemoryBufferRef` from `getBufferOrFake` instead of `MemoryBuffer*` from `getBuffer`. No functionality change here, since the call sites weren't checking if the buffer was valid. Differential Revision: https://reviews.llvm.org/D89406	2020-10-19 17:02:59 -04:00
Evgenii Stepanov	188a7d6710	Add alloca size threshold for StackTagging initializer merging. Summary: Initializer merging generates pretty inefficient code for large allocas that also happens to trigger an exponential algorithm somewhere in Machine Instruction Scheduler. See https://bugs.llvm.org/show_bug.cgi?id=47867. This change adds an upper limit for the alloca size. The default limit is selected such that worst case size of memtag-generated code is similar to non-memtag (but because of the ISA quirks, this case is realized at the different value of alloca size, ex. memset inlining triggers at sizes below 512, but stack tagging instructions are 2x shorter, so limit is approx. 256). We could try harder to emit more compact code with initializer merging, but that would only affect large, sparsely initialized allocas, and those are doing fine already. Reviewers: vitalybuka, pcc Subscribers: llvm-commits	2020-10-19 13:44:07 -07:00
Arthur Eubanks	c76968d8b6	[test][NPM] Fix already-vectorized.ll under NPM The NPM runs SpeculateAroundPHIs which breaks critical edges, causing a branch we check for to not directly jump back to the same block.	2020-10-19 13:11:13 -07:00
Craig Topper	edd0cb11bd	[SelectionDAG][X86] Enable SimplifySetCC CTPOP transforms for vector splats This enables these transforms for vectors: (ctpop x) u< 2 -> (x & x-1) == 0 (ctpop x) u> 1 -> (x & x-1) != 0 (ctpop x) == 1 --> (x != 0) && ((x & x-1) == 0) (ctpop x) != 1 --> (x == 0) \|\| ((x & x-1) != 0) All enabled if CTPOP isn't Legal. This differs from the scalar behavior where the first two are done unconditionally and the last two are done if CTPOP isn't Legal or Custom. The Legal check produced better results for vectors based on X86's custom handling. Might be worth re-visiting scalars here. I disabled the looking through truncate for vectors. The code that creates new setcc can use the same result VT as the original setcc even if we truncated the input. That may work work for most scalars, but definitely wouldn't work for vectors unless it was a vector of i1. Fixes or at least improves PR47825 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D89346	2020-10-19 12:56:59 -07:00
Craig Topper	e28376ec28	[X86] Add i32->float and i64->double bitcast pseudo instructions to store folding table. We have pseudo instructions we use for bitcasts between these types. We have them in the load folding table, but not the store folding table. This adds them there so they can be used for stack spills. I added an exact size check so that we don't fold when the stack slot is larger than the GPR. Otherwise the upper bits in the stack slot would be garbage. That would be fine for Eli's test case in PR47874, but I'm not sure its safe in general. A step towards fixing PR47874. Next steps are to change the ADDSSrr_Int pseudo instructions to use FR32 as the second source register class instead of VR128. That will keep the coalescer from promoting the register class of the bitcast instruction which will make the stack slot 4 bytes instead of 16 bytes. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D89656	2020-10-19 12:53:14 -07:00
Matt Arsenault	ae3625d752	Fix typo	2020-10-19 15:37:05 -04:00
Lang Hames	9898d9d885	[ORC] Fix a missing include.	2020-10-19 12:13:55 -07:00
Arthur Eubanks	fce64578bc	[NPM][test] Fix some LoopVectorize tests under NPM	2020-10-19 12:05:37 -07:00
Nikita Popov	ddd0f08318	[BatchAA] Add test for incorrect phi cycle result (NFC) AA computes the correct result for phi/a1 aliasing, while BatchAA produces an incorrect result depening on which queries have been performed beforehand.	2020-10-19 20:53:11 +02:00
Tony	6be9c7d2dc	[AMDGPU] Correct comment typo in SIMemoryLegaliizer.cpp	2020-10-19 18:50:28 +00:00
Arthur Eubanks	65e5006962	[NPM][opt] Run -O# after other passes in legacy PM compatibility mode Generally tests run -O# before other passes, not after.	2020-10-19 11:48:44 -07:00
Valentin Clement	340181f29a	[flang][openacc] Switch to use TODO from D88909 Use the Todo.h header file introduce in D88909 to marke part of the lowering that are not done yet. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D88915	2020-10-19 14:47:37 -04:00
Alexandre Ganea	89b72209ad	[LLDB][TestSuite] Improve skipIfWindowsAndNonEnglish in decorators.py Differential Revision: https://reviews.llvm.org/D89716	2020-10-19 14:28:08 -04:00
Cameron McInally	629d1d117a	[SVE] Update vector reduction intrinsics in new tests. Remove `experimental` from the intrinsic names.	2020-10-19 13:27:46 -05:00
Michael Jones	ba24ba7e9c	[libc] Add LLVM libc specific functions to llvm_libc_ext.td. Also moved most of the common type definitions from libc/spec/stdc.td to libc/spec/spec.td so that they can be used to list functions in llvm_libc_ext.td. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D89436	2020-10-19 18:21:25 +00:00
Jay Foad	56f6bf1a8d	[AMDGPU] Remove MUL_LOHI_U24/MUL_LOHI_I24 These were introduced in r279902 on the grounds that using separate MUL_U24/MUL_I24 and MULHI_U24/MULHI_I24 nodes would introduce multiple uses of the operands, which would prevent SimplifyDemandedBits from simplifying the operands. This has since been fixed by D24672 "AMDGPU/SI: Use new SimplifyDemandedBits helper for multi-use operations" No functional change intended. At least it has no effect on lit tests. Differential Revision: https://reviews.llvm.org/D89706	2020-10-19 19:15:34 +01:00
Joseph Huber	24df30efda	[OpenMP] Fixing OpenMP/driver.c failing on 32-bit hosts The changes made in D88594 caused the test OpenMP/driver.c to fail on a 32-bit host becuase it was offloading to a 64-bit architecture by default. The offloading test was moved to a new file and a feature was added to the lit config to check for a 64-bit host. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D89696	2020-10-19 13:41:53 -04:00
Atmn Patel	1e55cf77f3	[LangRef] Define mustprogress attribute LLVM IR currently assumes some form of forward progress. This form is not explicitly defined anywhere, and is the cause of miscompilations in most languages that are not C++11 or later. This implicit forward progress guarantee can not be opted out of on a function level nor on a loop level. Languages such as C (C11 and later), C++ (pre-C++11), and Rust have different forward progress requirements and this needs to be evident in the IR. Specifically, C11 and onwards (6.8.5, Paragraph 6) states that "An iteration statement whose controlling expression is not a constant expression, that performs no input/output operations, does not access volatile objects, and performs no synchronization or atomic operations in its body, controlling expression, or (in the case of for statement) its expression-3, may be assumed by the implementation to terminate." C++11 and onwards does not have this assumption, and instead assumes that every thread must make progress as defined in [intro.progress] when it comes to scheduling. This was initially brought up in [0] as a bug, a solution was presented in [1] which is the current workaround, and the predecessor to this change was [2]. After defining a notion of forward progress for IR, there are two options to address this: 1) Set the default to assuming Forward Progress and provide an opt-out for functions and an opt-in for loops. 2) Set the default to not assuming Forward Progress and provide an opt-in for functions, and an opt-in for loops. Option 2) has been selected because only C++11 and onwards have a forward progress requirement and it makes sense for them to opt-into it via the defined `mustprogress` function attribute. The `mustprogress` function attribute indicates that the function is required to make forward progress as defined. This is sharply in contrast to the status quo where this is implicitly assumed. In addition, `willreturn` implies `mustprogress`. The background for why this definition was chosen is in [3] and for why the option was chosen is in [4] and the corresponding thread(s). The implementation is in D85393, the clang patch is in D86841, the LoopDeletion patch is in D86844, the Inliner patches are in D87180 and D87262, and there will be more incoming. [0] https://bugs.llvm.org/show_bug.cgi?id=965#c25 [1] https://lists.llvm.org/pipermail/llvm-dev/2017-October/118558.html [2] https://reviews.llvm.org/D65718 [3] https://lists.llvm.org/pipermail/llvm-dev/2020-September/144919.html [4] https://lists.llvm.org/pipermail/llvm-dev/2020-September/145023.html Reviewed By: jdoerfert, efriedma, nikic Differential Revision: https://reviews.llvm.org/D86233	2020-10-19 13:34:27 -04:00
Mikhail Maltsev	a3c16039b3	[clang] Use SourceLocation as key in std::map, NFCI SourceLocation implements `operator<`, so `SourceLocation`-s can be used as keys in `std::map` directly, there is no need to extract the internal representation. Since the `operator<` simply compares the internal representations of its operands, this patch does not introduce any functional changes. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D89705	2020-10-19 18:31:05 +01:00
Florian Hahn	3cbdae22b9	[SCEV] Add tests where assumes can be used to improve tripe multiple. This patch adds a set of tests where information from assumes can be used to improve the trip multiple. See PR47904.	2020-10-19 18:26:09 +01:00
Louis Dionne	ec0dc70efc	[libc++] Add more tests for operator<< on std::complex	2020-10-19 13:23:59 -04:00
Amy Kwan	6a946fd06f	[DAGCombiner][PowerPC] Remove isMulhCheaperThanMulShift TLI hook, Use isOperationLegalOrCustom directly instead. MULH is often expanded on targets. This patch removes the isMulhCheaperThanMulShift hook and uses isOperationLegalOrCustom instead. Differential Revision: https://reviews.llvm.org/D80485	2020-10-19 12:23:04 -05:00
Jonas Devlieghere	97b8e2c1f0	[llvm] Make obj2yaml and yaml2obj LLVM utilities instead of tools For testing purposes I need a way to build and install FileCheck and yaml2obj. I had to choose between making FileCheck an LLVM tool and making obj2yaml and yaml2obj utilities. I think the distinction is rather arbitrary but my understanding is that tools are things meant for the toolchain while utilities are more used for things like testing, which is the case here. The functional difference is that these tools now end up in the ${LLVM_UTILS_INSTALL_DIR}, which defaults to the ${LLVM_TOOLS_INSTALL_DIR}. Unless you specified a different value or you added obj2yaml and yaml2obj to ${LLVM_TOOLCHAIN_TOOLS}, this patch shouldn't change anything. Differential revision: https://reviews.llvm.org/D89357	2020-10-19 10:21:21 -07:00
Tony	151e297034	[AMDGPU] Simplify cumode handling in SIMemoryLegalizer Differential Revision: https://reviews.llvm.org/D89663	2020-10-19 17:13:45 +00:00

1 2 3 4 5 ...

369405 Commits All Branches Search

369405 Commits

All Branches