llvm-project

Commit Graph

Author	SHA1	Message	Date
David Green	646c872f9d	[ARM] Teach getIntImmCostInst about the cost of saturating fp converts Given a min(max(fptosi, INT_MIN), INT_MAX) with the correct constants, we can now generate a fptosi.sat. But in the arm backend, the constant can be treated as high cost, pulling it out of the basic block in a way that the DAG combine can no longer see it. This teaches it again that it is a low cost constant, not worth hoisting out. Recommitted from `0e98659ea1` with a fix for APInt comparison. Differential Revision: https://reviews.llvm.org/D114380	2021-12-02 07:56:27 +00:00
Lang Hames	758d54b462	[ORC] Add support for removing JITDylibs. This allows JITDylibs to be removed from the ExecutionSession. Calling ExecutionSession::removeJITDylib will disconnect the JITDylib from the ExecutionSession and clear it (removing all trackers associated with it). The JITDylib object will then be destroyed as soon as the last JITDylibSP pointing at it is destroyed.	2021-12-02 17:36:32 +11:00
Lang Hames	9eb591f0ac	[ORC] Only use JITDylib::GeneratorsMutex while running generators. GeneratorsMutex should prevent lookups from proceeding through the generators of a single JITDylib concurrently (since this could result in redundant attempts to generate definitions). Mutation of the generators list itself should be done under the session lock.	2021-12-02 17:36:32 +11:00
Lang Hames	9355d11597	[ORC] Hold ResourceTracker in MaterializationResponsibility. This keeps the tracker alive for the lifetime of the MR. This is needed so that we can check whether the tracker has become defunct before posting results (or failure) for the MR.	2021-12-02 17:36:31 +11:00
Austin Kerbow	da067ed569	[AMDGPU] Set most sched model resource's BufferSize to one Using a BufferSize of one for memory ProcResources will result in better ILP since it more accurately models the dependencies between memory ops and their consumers on an in-order processor. After this change, the scheduler will treat the data edges from loads as blocking so that stalls are guaranteed when waiting for data to be retreaved from memory. Since we don't actually track waitcnt here, this should do a better job at modeling their behavior. Practically, this means that the scheduler will trigger the 'STALL' heuristic more often. This type of change needs to be evaluated experimentally. Preliminary results are positive. Fixes: SWDEV-282962 Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D114777	2021-12-01 22:31:28 -08:00
skc7	16b781e6d1	[AMDGPU][clang] Fix __builtin_nontemporal_store() failure on AMDGPU Reviewed By: yaxunl, sameerds Differential Revision: https://reviews.llvm.org/D114849	2021-12-02 05:53:25 +00:00
Phoebe Wang	f13b43d570	[X86][FP16] Only generate approximate rsqrt when Reciprocal is true for half type We have reasonable fast sqrt and accurate rsqrt for half type due to the limited fractions. So neither do we need multi steps refinement for rsqrt nor replace sqrt by rsqrt. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D114844	2021-12-02 13:52:45 +08:00
Phoebe Wang	4756a2f157	[X86] Insert FMUL for estimated non reciprocal SQRT when `RefinementSteps` = 0 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D114843	2021-12-02 13:52:45 +08:00
Jonas Devlieghere	fcd2d85cc9	[lldb] Skip test_launch_scripted_process_stack_frames with ASan This test is failing on the sanitized bot because of a heap-use-after-free. Disabling the test to turn the bot green again. rdar://85954489.	2021-12-01 21:35:00 -08:00
Igor Kudrin	b0ac68ccb7	[ELF] Prevent internalizing used comdat symbol When a comdat symbol is defined in both bitcode and regular object files, which are contained in the same archive, the linker could lose the flag that the symbol is used in the regular object file and allow LTO to internalize it, which led to "error: undefined symbol". The issue was introduced in D79300. Differential Revision: https://reviews.llvm.org/D114801	2021-12-02 12:10:06 +07:00
Jacques Pienaar	86eb57b728	[mlir][drr] Simple heuristic to reduce chance of accidental nullptr dereference When an attribute is optional & is given an additional constraint in rewrite pattern that could lead to dereferencing null Attribute. Avoid cases where the constraints checks attribute but has no check if null. This should be improved to be more uniformly guarded.	2021-12-01 20:45:08 -08:00
Christudasan Devadasan	399b7de0ea	[AMDGPU] Add a regclass flag for scalar registers Along with vector RC flags, this scalar flag will make various regclass queries like `isVGPR` more accurate. Regclasses other than vectors are currently set with the new flag even though certain unallocatable classes aren't truly scalars. It would be ok as long as they remain unallocatable. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D110053	2021-12-01 23:31:07 -05:00
Joe Loser	c16b13ebf9	[libc++] Implement P1989R2: range constructor for string_view Implement P1989R2 which adds a range constructor for `string_view`. Adjust `operator/=` in `path` to avoid atomic constraints caching issue getting provoked from this PR. Add defaulted template argument to `string_view`'s "sufficient overloads" to avoid mangling issues in `clang-cl` builds. It is a MSVC mangling bug that this works around. Differential Revision: https://reviews.llvm.org/D113161	2021-12-01 23:16:36 -05:00
Vitaly Buka	ae234a7545	[NFC][sanitizer] Fix "not used" warning in test	2021-12-01 20:16:25 -08:00
Jonas Devlieghere	8176768b46	[lldb] Fix DYLD_INSERT_LIBRARIES on AS Don't make DYLD_INSERT_LIBRARIES conditional on the host triple containing x86.	2021-12-01 20:03:05 -08:00
Philip Reames	3dfa76b695	[tests] Precommit tests for writeonly argument attribute inference	2021-12-01 19:48:29 -08:00
Matthias Springer	4479138de8	[mlir][linalg][bufferize] Bufferization of tensor.insert This is a lightweight operation, useful for writing unit tests. It will be utilized for testing in subsequent commits. Differential Revision: https://reviews.llvm.org/D114693	2021-12-02 11:58:01 +09:00
Kevin Athey	27c9e8b45b	Revert "[VE] Make VE official" Breaks fast buildbot. This reverts commit `a9d1d00b86`.	2021-12-01 17:32:21 -08:00
Daniel Sanders	54e21df973	[unroll] Fix a functional change in an NFC patch `5c77aa2b91` [unroll] Use early return in shouldFullUnroll [nfc] wasn't quite NFC since !(x <= y) is x > y rather than x >= y Credit to Justin Bogner for spotting the bug Reviewed By: reames Differential Revision: https://reviews.llvm.org/D114894	2021-12-01 17:28:12 -08:00
Steven Wan	f9d585d0dd	Revert "[sanitizer] Add compress_stack_depot flag" This is failing on clang-s390x-linux, https://lab.llvm.org/buildbot/#/builders/94/builds/6748. This reverts commit `bf18253b0e`.	2021-12-01 20:21:52 -05:00
Julian Lettner	863b117411	[TSan][Darwin] Prevent inlining of functions in tests Prevent inlining of functions so we can FileCheck the generated stack traces.	2021-12-01 17:00:52 -08:00
Jonas Devlieghere	8f329cee42	[lldb] Split TestCxxChar8_t Split TestCxxChar8_t into two parts: one that check reading variables without a process and another part with. This allows us to skip the former on Apple Silicon, where lack of support for chained fix-ups causes the test to fail. Differential revision: https://reviews.llvm.org/D114819	2021-12-01 16:58:43 -08:00
Fabian Wolff	987a21522f	[clang-tidy] Use `hasCanonicalType()` matcher in `bugprone-unused-raii` check Fixes PR#52217. Reviewed By: simon.giesecke Differential Revision: https://reviews.llvm.org/D113429	2021-12-02 01:53:12 +01:00
LLVM GN Syncbot	ab112c2964	[gn build] Port `170783f991`	2021-12-02 00:48:10 +00:00
Noah Shutty	170783f991	[llvm] [Support] Add HTTP Client Support library. This patch implements a small HTTP client library consisting primarily of the `HTTPRequest`, `HTTPResponseHandler`, and `BufferedHTTPResponseHandler` classes. Unit tests of the `HTTPResponseHandler` and `BufferedHTTPResponseHandler` are included. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D112751	2021-12-01 23:54:38 +00:00
Julian Lettner	6703fe25b7	[TSan][Darwin] Mark test unsupported	2021-12-01 15:50:10 -08:00
Arthur Eubanks	7cbb6e9a8f	[llvm-reduce] Assert that the number of chunks does not change with reductions Followup to D113537. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113816	2021-12-01 15:40:05 -08:00
Arthur Eubanks	512534bc16	[Cloning] Clone metadata on function declarations Previously we missed cloning metadata on function declarations because we don't call CloneFunctionInto() on declarations in CloneModule(). Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D113812	2021-12-01 15:40:05 -08:00
LLVM GN Syncbot	1b7150c8f8	[gn build] Port `7cc2493daa`	2021-12-01 23:31:02 +00:00
spupyrev	7cc2493daa	profi - a flow-based profile inference algorithm: Part I (out of 3) The benefits of sampling-based PGO crucially depends on the quality of profile data. This diff implements a flow-based algorithm, called profi, that helps to overcome the inaccuracies in a profile after it is collected. Profi is an extended and significantly re-engineered classic MCMF (min-cost max-flow) approach suggested by Levin, Newman, and Haber [2008, Complementing missing and inaccurate profiling using a minimum cost circulation algorithm]. It models profile inference as an optimization problem on a control-flow graph with the objectives and constraints capturing the desired properties of profile data. Three important challenges that are being solved by profi: - "fixing" errors in profiles caused by sampling; - converting basic block counts to edge frequencies (branch probabilities); - dealing with "dangling" blocks having no samples in the profile. The main implementation (and required docs) are in SampleProfileInference.cpp. The worst-time complexity is quadratic in the number of blocks in a function, O(\|V\|^2). However a careful engineering and extensive evaluation shows that the running time is (slightly) super-linear. In particular, instances with 1000 blocks are solved within 0.1 second. The algorithm has been extensively tested internally on prod workloads, significantly improving the quality of generated profile data and providing speedups in the range from 0% to 5%. For "smaller" benchmarks (SPEC06/17), it generally improves the performance (with a few outliers) but extra work in the compiler might be needed to re-tune existing optimization passes relying on profile counts. UPD Dec 1st 2021: - synced the declaration and definition of the option `SampleProfileUseProfi ` to use type `cl::opt<bool`; - added `inline` for `SampleProfileInference<BT>::findUnlikelyJumps` and `SampleProfileInference<BT>::isExit` to avoid linking problems on windows. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D109860	2021-12-01 15:30:38 -08:00
Konstantin Boyarinov	8c6b24899e	[libcxx][test][NFC] Various tests for std::vector Add missing tests for std::vector funcionality to improve code coverage: - Rewrote access tests to check modification of the container using the reference returned by the non-const overload - Added tests for reverse iterators: rbegin, rend, etc. - Added exception test for vector::reserve - Extended test cases for vector copy assignment - Fixed insert_iter_value.pass.cpp to use insert overload with const value_type& (not with value_type&& which is tested in iter_rvalue.pass.cpp test) Reviewed By: Quuxplusone, rarutyun, #libc Differential Revision: https://reviews.llvm.org/D112438	2021-12-02 02:11:45 +03:00
Vitaly Buka	e599aa80c0	[sanitizer] Implement MprotectReadOnly and MprotectNoAccess MprotectReadOnly for Win and Fuchsia MprotectNoAccess for Fuchsia	2021-12-01 14:50:50 -08:00
Nikolas Klauser	6146e4cf89	[libc++] Make __wrap_iter constexpr `__wrap_iter` is currently only constexpr if it's not a debug built, but it isn't used in a constexpr context currently. Making it always constexpr and disabling the debugging utilities at constant evaluation is more usful since it has to be always constexpr to be used in a constexpr context. Reviewed By: ldionne, #libc Spies: libcxx-commits Differential Revision: https://reviews.llvm.org/D114733	2021-12-01 23:29:22 +01:00
Vitaly Buka	86f48fbb1c	[NFC][sanitizer] constexpr in sanitizer_dense_map_info	2021-12-01 13:45:42 -08:00
Kazu Hirata	afe43e0713	[mlir] Remove extractVectorTypeFromShapedValue This patch fixes the build by removing extractVectorTypeFromShapedValue. The last use was removed Dec 1, 2021 in commit extractVectorTypeFromShapedValue.	2021-12-01 13:43:17 -08:00
Peter Klausler	3f6dbf1a75	[flang] Don't close stderr in runtime (fixes STOP output) STOP statement output was sometimes failing to appear because the runtime flushes and shuts down open Fortran units beforehand. But when file descriptor 2 was closed, the STOP statement output was suppressed. The fix is to not actually close file descriptors 0-2 if they are connected to Fortran units being closed. This was already the policy when an OPEN statement was (re-)opening such a unit, so that logic has been pulled out into a member function and shared with CLOSE processing. Differential Revision: https://reviews.llvm.org/D114897	2021-12-01 13:25:18 -08:00
Gabor Marton	20f8733d4b	[Analyzer][solver] Simplification: Do a fixpoint iteration before the eq class merge This reverts commit `f02c5f3478` and addresses the issue mentioned in D114619 differently. Repeating the issue here: Currently, during symbol simplification we remove the original member symbol from the equivalence class (`ClassMembers` trait). However, we keep the reverse link (`ClassMap` trait), in order to be able the query the related constraints even for the old member. This asymmetry can lead to a problem when we merge equivalence classes: ``` ClassA: [a, b] // ClassMembers trait, a->a, b->a // ClassMap trait, a is the representative symbol ``` Now let,s delete `a`: ``` ClassA: [b] a->a, b->a ``` Let's merge ClassA into the trivial class `c`: ``` ClassA: [c, b] c->c, b->c, a->a ``` Now, after the merge operation, `c` and `a` are actually in different equivalence classes, which is inconsistent. This issue manifests in a test case (added in D103317): ``` void recurring_symbol(int b) { if (b * b != b) if ((b * b) * b * b != (b * b) * b) if (b * b == 1) } ``` Before the simplification we have these equivalence classes: ``` trivial EQ1: [b * b != b] trivial EQ2: [(b * b) * b * b != (b * b) * b] ``` During the simplification with `b * b == 1`, EQ1 is merged with `1 != b` `EQ1: [b * b != b, 1 != b]` and we remove the complex symbol, so `EQ1: [1 != b]` Then we start to simplify the only symbol in EQ2: `(b * b) * b * b != (b * b) * b --> 1 * b * b != 1 * b --> b * b != b` But `b * b != b` is such a symbol that had been removed previously from EQ1, thus we reach the above mentioned inconsistency. This patch addresses the issue by making it impossible to synthesise a symbol that had been simplified before. We achieve this by simplifying the given symbol to the absolute simplest form. Differential Revision: https://reviews.llvm.org/D114887	2021-12-01 22:23:41 +01:00
Florian Hahn	ad88a37cea	[TLI] Add memset_pattern4, memset_pattern8 lib functions. Similar to memset_pattern16, memset_pattern4, memset_pattern8 are available on Darwin platforms. https://developer.apple.com/library/archive/documentation/System/Conceptual/ManPages_iPhoneOS/man3/memset_pattern4.3.html Reviewed By: ab Differential Revision: https://reviews.llvm.org/D114881	2021-12-01 21:18:19 +00:00
LLVM GN Syncbot	bab21a4628	[gn build] Port `a0efb17500`	2021-12-01 20:41:34 +00:00
Christopher Di Bella	a0efb17500	[libcxx][modularisation] modularises <numeric> header Differential Revision: https://reviews.llvm.org/D114836	2021-12-01 20:40:21 +00:00
Petar Avramovic	641906da8d	AMDGPU/GlobalISel: Fix constant bus restriction errors for med3 Detected on targets older then gfx10 (e.g. gfx9) for constants that are too large to be inlined (constant are sgpr by default). In med3 combine it is expected that regbankselect maps all operands of min/max we try to match to vgpr. However constants are mapped to sgpr and there will be a sgpr-to-vgpr copy. Matchers look through sgpr-to-vgpr copies and return sgpr and these break constant bus restriction. Build med3 with all vgpr operands. Use existing sgpr-to-vgpr copies for matched sgprs. If there is no such copy (not expected) build one. Differential Revision: https://reviews.llvm.org/D114700	2021-12-01 21:36:37 +01:00
Paul Robinson	66071f440c	[TLI checker] Update for post-commit review comments Ignore undefined symbols; other minor code cleanup. Replace test objects and their asm source with a yaml equivalent. Differential Revision: https://reviews.llvm.org/D114478	2021-12-01 12:33:54 -08:00
Florian Hahn	5fe151f98f	[DSE] Add libcall tests for functions only available on Darwin. Add a set of tests for memset_pattern{4,8,16} variants.	2021-12-01 20:30:15 +00:00
Fabian Wolff	844a8d3cec	Fix false positives in `fuchsia-trailing-return` check involving deduction guides Fixes PR#47614. Deduction guides, implicit or user-defined, look like function declarations in the AST. They aren't really functions, though, and they always have a trailing return type, so it doesn't make sense to issue this warning for them.	2021-12-01 15:28:01 -05:00
Nikita Popov	8d1759c404	[GlobalOpt] Simplify CleanupConstantGlobalUsers() This bases the CleanupConstantGlobalUsers() implementation around the ConstantFoldLoadFromConst() API. The general approach is that we discover all users while looking through casts, and then constant fold loads and drop stores and memintrinsics. This avoids special cases and limitations in the previous implementation, which is also incompatible with opaque pointers. The result is a bit more powerful than before, because we now use more general load folding logic which can for example look through pointer bitcasts between different sizes. This is where the test changes come from, as we now fold more loads and can thus remove more globals. Differential Revision: https://reviews.llvm.org/D114889	2021-12-01 21:06:25 +01:00
Arthur O'Dwyer	0efd9a03fa	[libc++] [test] Refactor string_view comparison tests for comprehensiveness. Differential Revision: https://reviews.llvm.org/D114658	2021-12-01 15:04:33 -05:00
Arthur O'Dwyer	b4a13e4c98	[libc++] [test] C++14/17-friendly `TEST_IS_CONSTANT_EVALUATED` macro. Reviewed as part of D114658. Ultimately this will probably have to be flipped around and renamed `TEST_IS_RUNTIME`, and extended with `TEST_IS_RUNTIME_OR_CXX20` (once constexpr std::string support is added) and so on for every new C++ version. But we don't need that flexibility yet, so we're not adding it.	2021-12-01 15:02:54 -05:00
Arthur O'Dwyer	a0b50c56d1	[libc++] [test] C++03-friendly MAKE_STRING macro. Reviewed as part of D114658.	2021-12-01 15:02:53 -05:00
Konstantin Varlamov	7da4ee6f23	[libcxx][NFC] Make sequence containers slightly more SFINAE-friendly during CTAD. Disable the constructors taking `(size_type, const value_type&, allocator_type)` if `allocator_type` is not a valid allocator. Otherwise, these constructors are considered when resolving e.g. `(int, int, NotAnAllocator())`, leading to a hard error during instantiation. A hard error makes the Standard's requirement to not consider deduction guides of the form `(Iterator, Iterator, BadAllocator)` during overload resolution essentially non-functional. The previous approach was to SFINAE away `allocator_traits`. This patch SFINAEs away the specific constructors instead, for consistency with `basic_string` -- see [LWG3076](wg21.link/lwg3076) which describes a very similar problem for strings (note, however, that unlike LWG3076, no valid constructor call is affected by the bad instantiation). Differential Revision: https://reviews.llvm.org/D114311	2021-12-01 11:56:51 -08:00
Ellis Hoag	9e647806f3	[InstrProf][NFC] Refactor ProfileDataMap usage Instead of using `DenseMap::find()` and `DenseMap::insert()`, use `DenseMap::operator[]` to get a reference to the profile data and update the reference. This simplifies the changes in D114565. Reviewed By: kyulee Differential Revision: https://reviews.llvm.org/D114828	2021-12-01 11:47:14 -08:00

1 2 3 4 5 ...

406238 Commits All Branches Search

406238 Commits

All Branches