llvm-project

Commit Graph

Author	SHA1	Message	Date
Jacques Pienaar	2fa76d4769	[mlir][ods] Fix incorrectly generated attribute name. In prefixed accessor on OpAdaptor.	2021-10-29 14:06:33 -07:00
Zarko Todorovski	8659b241ae	[clang][NFC] Inclusive terms: Replace uses of whitelist in clang/lib/StaticAnalyzer Replace variable and functions names, as well as comments that contain whitelist with more inclusive terms. Reviewed By: aaron.ballman, martong Differential Revision: https://reviews.llvm.org/D112642	2021-10-29 16:51:36 -04:00
Joseph Huber	2c6a4e5678	[OpenMP] Use the assertion formatting from assert.h This patch changes the `assert_assume` function used for internal assumptions in the device runtime to use a more standard formatting for the assumption message. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112842	2021-10-29 16:44:01 -04:00
Nikita Popov	cdf45f98ca	[BasicAA] Extract linear expression multiplication (NFC) Extract a common method for multiplying a linear expression by a factor.	2021-10-29 22:41:40 +02:00
Alex Lorenz	a43d1aa852	[clang] Make 'align-mismatch' warning work without an associated function declaration This change fixes a crash where a NULL fd was used to emit a diagnostic. Instead of crashing, just avoid printing the declaration name when there's no associated function declaration. Differential Revision: https://reviews.llvm.org/D109402	2021-10-29 13:39:16 -07:00
Sam Clegg	3b039c68f2	Revert "[WebAssembly] Fix debug locations for ExplicitLocals pass" This reverts commit `a66451ebbe`. This caused a failure when integrated with emscripten: https://ci.chromium.org/ui/p/emscripten-releases/builders/try/linux/b8832019855439718609/overview	2021-10-29 13:34:18 -07:00
Aart Bik	0121c96f37	[mlir][sparse] refine the mixed width sparse conversion test Added a type with different pointer/index bit width. Also added some sanity CHECKs on the stored indices. Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D112778	2021-10-29 13:31:04 -07:00
Nikita Popov	7cf7378a9d	[BasicAA] Don't treat non-inbounds GEP as nsw The scale multiplication is only guaranteed to be nsw if the GEP is inbounds (or the multiplication is trivial). Previously we were only considering explicit muls in GEP indices.	2021-10-29 22:30:44 +02:00
Jacques Pienaar	dde96363fc	[mlir] Flip accessors to prefixed form (NFC) Change these missed during/added after the last update.	2021-10-29 13:29:48 -07:00
Sam Clegg	182b72aa48	[lld][WebAssembly] Generate TLS relocation code also when linking statically Previously relocations were only generated for PIC output, but relocations for TLS GOT entries are always needed when shared memory is enabled, not just in PIC mode. This means that the `__wasm_apply_global_tls_relocs` is now generated even for statically linked (non-PIC) output. Without this the globals that hold the addresses of TLS symbols are not set correctly. Differential Revision: https://reviews.llvm.org/D112833	2021-10-29 13:26:35 -07:00
Guillaume Chatelet	fe953b15cf	Revert "[libc] Add more robust compile time architecture detection" This reverts commit `a72e249986`.	2021-10-29 20:25:55 +00:00
Richard Smith	68ffcd5213	Properly determine the end location of an ObjCObjectPointerType. After rGa9db0a804a53, we correctly determined the end for pointer types like `id` that are spelled without a ``, but incorrectly determined the end for pointer types spelled with a ``.	2021-10-29 13:15:53 -07:00
Arthur O'Dwyer	0412c007e3	[libc++] Implement LWG3369, tweak CTAD for std::span. The original bug doesn't reproduce on Clang, allegedly because of https://bugs.llvm.org/show_bug.cgi?id=44484 We already test STL's exact test case, in "span.cons/deduct.pass.cpp", which I'm touching just for the heck of it. Differential Revision: https://reviews.llvm.org/D111838	2021-10-29 14:15:41 -06:00
Arthur O'Dwyer	d6b826ebb2	[libc++] [doc] Mark LWG3398 as complete. This was done in D108054.	2021-10-29 14:15:41 -06:00
Guillaume Chatelet	a72e249986	[libc] Add more robust compile time architecture detection We may want to restrict the detected platforms to only `x86_64` and `aarch64`. There is still custom detection in api.td, but I don't think we can handle these: - config/linux/api.td:205 - config/linux/api.td:199 Differential Revision: https://reviews.llvm.org/D112818	2021-10-29 20:15:12 +00:00
Nick Desaulniers	39e5dd113f	[SparcISelLowering] avoid emitting libcalls to __muloti4 and __mulodi4 These compiler-rt-only symbols aren't available in libgcc. Similar to D108842, D108844, and D108926. Fixes: pr/52043 Reviewed By: craig.topper, rengolin Differential Revision: https://reviews.llvm.org/D112750	2021-10-29 13:14:09 -07:00
Miguel Raz Guzmán Macedo	03eddbc714	[doc] Typo fix in NewPassManager.rst Simple typo fix. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D112780	2021-10-29 13:11:11 -07:00
wren romano	30a64c9aa5	[mlir][sparse] Renaming CPP macros for clarity Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D112771	2021-10-29 13:07:48 -07:00
Sanjay Patel	285b8abce4	[x86] limit vector increment fold to allow load folding The tests are based on the example from: https://llvm.org/PR52032 I suspect that it looks worse than it actually is. :) That is, llvm-mca says there's no uop/timing difference with the load folding and pcmpeq vs. broadcast on Haswell (and probably other targets). The load-folding definitely makes the code smaller, so it's good for that at least. So this requires carving a narrow hole in the transform to get just this case without changing others that look good as-is (in other words, the transform still seems good for most examples). Differential Revision: https://reviews.llvm.org/D112464	2021-10-29 15:48:35 -04:00
Sanjay Patel	837518d6a0	[x86] make mayFold* helpers visible to more files; NFC The first function is needed for D112464, but we might as well keep these together in case the others can be used someday.	2021-10-29 15:48:35 -04:00
Sanjay Patel	8f786b4618	[InstCombine] fix comments to match code; NFC	2021-10-29 15:48:35 -04:00
Guillaume Chatelet	d7cc760f3b	[libc][NFC] Fix typo and unused variable Differential Revision: https://reviews.llvm.org/D112823	2021-10-29 19:42:51 +00:00
Michał Górny	16a816a19e	[lldb] [gdb-remote] Fix processing generic regnums Fix regression in processing generic regnums that was introduced in `fa456505b8` ("[lldb] [gdb-remote] Refactor getting remote regs to use local vector"). Since then, the "generic" field was wrongly interpreted as integer rather than string constant. Thanks to Ted Woodward for noticing and providing the correct code.	2021-10-29 21:37:46 +02:00
modimo	5caad9b5d3	[InlineAdvisor] Add fallback/format switches and negative remark processing to Replay Inliner Adds the following switches: 1. --sample-profile-inline-replay-fallback/--cgscc-inline-replay-fallback: controls what the replay advisor does for inline sites that are not present in the replay. Options are: 1. Original: defers to original advisor 2. AlwaysInline: inline all sites not in replay 3. NeverInline: inline no sites not in replay 2. --sample-profile-inline-replay-format/--cgscc-inline-replay-format: controls what format should be generated to match against the replay remarks. Options are: 1. Line 2. LineColumn 3. LineDiscriminator 4. LineColumnDiscriminator Adds support for negative inlining decisions. These are denoted by "will not be inlined into" as compared to the positive "inlined into" in the remarks. All of these together with the previous `--sample-profile-inline-replay-scope/--cgscc-inline-replay-scope` allow tweaking in how to apply replay. In my testing, I'm using: 1. --sample-profile-inline-replay-scope/--cgscc-inline-replay-scope = Function to only replay on a function 2. --sample-profile-inline-replay-fallback/--cgscc-inline-replay-fallback = NeverInline since I'm feeding in only positive remarks to the replay system 3. --sample-profile-inline-replay-format/--cgscc-inline-replay-format = Line since I'm generating the remarks from DWARF information from GCC which can conflict quite heavily in column number compared to Clang An alternative configuration could be to do Function, AlwaysInline, Line fallback with negative remarks which closer matches the final call-sites. Note that this can lead to unbounded inlining if a negative remark doesn't match/exist for one reason or another. Updated various tests to cover the new switches and negative remarks Testing: ninja check-all Reviewed By: wenlei, mtrofin Differential Revision: https://reviews.llvm.org/D112040	2021-10-29 12:32:03 -07:00
Duncan P. N. Exon Smith	9902362701	Support: Use sys::path::is_style_{posix,windows}() in a few places Use the new sys::path::is_style_posix() and is_style_windows() in a few places that need to detect the system's native path style. In llvm/lib/Support/Path.cpp, this patch removes most uses of the private `real_style()`, where is_style_posix() and is_style_windows() are just a little tidier. Elsewhere, this removes `_WIN32` macro checks. Added a FIXME to a FileManagerTest that seemed fishy, but maintained the existing behaviour. Differential Revision: https://reviews.llvm.org/D112289	2021-10-29 12:09:41 -07:00
modimo	51ce567b38	[SampleProfile] Add all callsites to AllCandidates if InlineReplay is in effect Replay in sample profiling needs to be asked on candidates that may not have counts or below the threshold. If replay is in effect for a function make sure these are captured and also imported during thinLTO. Testing: ninja check-all Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D112033	2021-10-29 12:04:52 -07:00
Roman Lebedev	0ae7bf124a	[NFC][LoopDeletion] Count the number of broken backedges Those don't contribute to the number of deleted loops.	2021-10-29 21:58:16 +03:00
Joseph Huber	35f42340a2	[OpenMP][Docs] Add documentation for device RTL debugging Add documentation for the debugging features in the OpenMP device runtime library. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D112010	2021-10-29 14:57:14 -04:00
Joseph Huber	6dd791bca8	[OpenMP] Check output of malloc in the device for debug A common problem is the device running out of global heap memory and crashing due to a nullptr dereference when using the data sharing stack. This explicitly checks that a nullptr was not returned by malloc when debugging field 1 is enabled. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112005	2021-10-29 14:57:12 -04:00
Joseph Huber	74f91741b6	[OpenMP] Use function tracing RAII for runtime functions. This patch adds support for using function tracing features to track the execution of runtime functions in the device runtime library. This is enabled by first compiling the new runtime with `-fopenmp-target-debug=3` and running with `LIBOMPTARGET_DEVICE_RTL_DEBUG=3`. The output only tracks team 0 and thread 0 so there isn't much output when using a generic region. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112002	2021-10-29 14:57:11 -04:00
Amara Emerson	5dd9e019dd	[AArch64][GlobalISel] Fix a crash in RBS due to a new regclass being added. rdar://84674985	2021-10-29 11:47:00 -07:00
Duncan P. N. Exon Smith	4e4883e1f3	Support: Expose sys::path::is_style_{posix,windows,native}() Expose three helpers in namespace llvm::sys::path to detect the path rules followed by sys::path::Style. - is_style_posix() - is_style_windows() - is_style_native() These are constexpr functions that will allow a bunch of path-related code to stop checking `_WIN32`. Originally I looked at adding system_style(), analogous to sys::endian::system_endianness(), but future patches (from others) will add more Windows style variants for slash preferences. These helpers should be resilient to that change, allowing callers to detect basic path rules. Differential Revision: https://reviews.llvm.org/D112288	2021-10-29 11:46:44 -07:00
Sanjay Patel	d0e9879d96	[InstCombine] allow vector splat matching for bitwise logic folds These transforms are also likely missing a one-use check, but that's another patch.	2021-10-29 14:22:50 -04:00
Sanjay Patel	ae8984111d	[InstCombine] add tests for bitwise logic folds; NFC	2021-10-29 14:22:49 -04:00
Roman Lebedev	e5df0a5a6f	[NFC][PhaseOrdering] Add additional loop deletion tests Test thanks to Michael Kuklinski from #llvm, originally inspired by Daniel Lemire's https://lemire.me/blog/2021/10/26/in-c-is-empty-faster-than-comparing-the-size-with-zero/	2021-10-29 21:10:36 +03:00
peter klausler	f70343d926	[flang] Fix combined folding of FINDLOC/MAXLOC/MINLOC The tests for folding these intrinsics neglected to name the logical scalars with a leading "test_", so test failures caused by recent work to implement a combined constant folding facility for these intrinsics wasn't catching some bugs. This patch fixes the tests and the bugs. Differential Revision: https://reviews.llvm.org/D112741	2021-10-29 11:00:11 -07:00
Stanislav Mekhanoshin	a905c54b76	[InstCombine] Fold `(~(a | b) & c) | ~(a | c)` into `~((b & c) | a)` ``` ---------------------------------------- define i4 @src(i4 %a, i4 %b, i4 %c) { %or1 = or i4 %b, %a %not1 = xor i4 %or1, -1 %or2 = or i4 %a, %c %not2 = xor i4 %or2, -1 %and = and i4 %not2, %b %or3 = or i4 %and, %not1 ret i4 %or3 } define i4 @tgt(i4 %a, i4 %b, i4 %c) { %and = and i4 %c, %b %or = or i4 %and, %a %or3 = xor i4 %or, -1 ret i4 %or3 } Transformation seems to be correct! ``` Differential Revision: https://reviews.llvm.org/D112338	2021-10-29 10:58:09 -07:00
peter klausler	d0ca0595b9	[flang] Fix crash on "call system_clock(count_max=j)" An erroneous entry in the intrinsics table causes semantics to crash on a call to system_clock if the optional "count_max=" argument appears and "count=" does not. Differential Revision: https://reviews.llvm.org/D112738	2021-10-29 10:51:28 -07:00
Michael Jones	62c187cb55	[libc] add fast path to string to float conversion Add the fast path first described by Clinger [1] with additions by Gay [2]. This speeds up conversion by about 10% by handling numbers with fewer digits more efficiently. [1] Clinger WD. How to Read Floating Point Numbers Accurately. SIGPLAN Not 1990 Jun;25(6):92–101. https://doi.org/10.1145/93548.93557. [2] Gay DM, Correctly rounded binary-decimal and decimal-binary conversions; 1990. AT&T Bell Laboratories Numerical Analysis Manuscript 90-10. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D112580	2021-10-29 10:50:03 -07:00
Sam Clegg	fad05465c1	[lld][WebAssembly] Handle TLS variables in Symbol::getVA. NFC In the shared memory case we can always assume that TLS addresses are relative to __tls_base. In the non-shared memory case TLS variables are absolute, just like normal data addresses. This simplifies the code in calcNewValue so that TLS relocations no longer need special handling. Differential Revision: https://reviews.llvm.org/D112831	2021-10-29 10:45:30 -07:00
Matt Morehouse	33cc0cfd46	[X86] Don't affect jump tables under +tagged-globals. `classifyLocalReference(nullptr)` is called to get the appropriate relocation type for jump tables. We should not use @GOTPCREL for this case. The new test cases trigger assertions without this patch. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D112832	2021-10-29 10:37:43 -07:00
wlei	f5537643b8	[llvm-profgen] Update total samples by accumulating all its body samples As in the probe-based profile, the total sample count is the sum of all body samples. This patch fixes it with a post-processing update for the line-number-based profile. Tested on our internal services; results showed no performance change. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D112672	2021-10-29 10:36:57 -07:00
Fraser Cormack	8314a04ede	[SelectionDAG] Allow FindMemType to fail when widening loads & stores This patch removes an internal failure found in FindMemType and "bubbles it up" to the users of that method: GenWidenVectorLoads and GenWidenVectorStores. FindMemType -- renamed findMemType -- now returns an optional value, returning None if no such type is found. Each of the aforementioned users now pre-calculates the list of types it will use to widen the memory access. If the type breakdown is not possible they will signal a failure, at which point the compiler will crash as it does currently. This patch is preparing the ground for alternative legalization strategies for vector loads and stores, such as using vector-predication versions of loads or stores. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D112000	2021-10-29 18:27:31 +01:00
Kazu Hirata	3b285ff517	[llvm-profgen] Fix a set-but-unused warning This patch fixes: llvm/tools/llvm-profgen/ProfiledBinary.cpp:357:12: error: variable 'EndOffset' set but not used [-Werror,-Wunused-but-set-variable] The last use of the variable was removed on Oct 26 in commit `40ca411251`.	2021-10-29 10:19:44 -07:00
Zarko Todorovski	c001775a3a	[clang] Inclusive language: change error message to use allowlist Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D112627	2021-10-29 13:12:46 -04:00
Keith Smiley	bd8a9507ef	[clang][driver] Fix multiarch output name with -Wl arg Previously if you passed a `-Wl,-foo` _before_ the source filename, the first `InputInfos`, which is used for the base input name would be an `InputArg` kind, which would never have a base input name. Now we use that by default, but pick the first `InputInfo` that is of kind `Filename` to get the name from if there is one. Differential Revision: https://reviews.llvm.org/D112767	2021-10-29 10:09:38 -07:00
Dwight Guth	2f16173627	[llvm-reduce] optimize extractFromModule functions The extractBasicBlocksFromModule, extractInstrFromModule, and other similar functions previously performed very poorly when the number of such elements in the program to reduce was very high. Previously, we were creating the set which caches elements to keep by looping through all elements in the module and adding them to the set. However, since std::set is an ordered set, this introduces a massive amount of rebalancing if the order of elements in the program and the order of their pointers in memory are not the same. The solution is straightforward: first put all the elements to be kept in a vector, then use the constructor for std::set which takes a pair of iterators over a collection. This constructor is optimized to avoid doing unnecessary work when initializing large sets. Also in this change, we pass BBsToKeep set to functions replaceBranchTerminator and removeUninterestingBBsFromSwitch as a const reference rather than passing it by value. This ought to prevent the need to copy the collection each time these functions are called, which is expensive if the collection is large. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D112757	2021-10-29 10:06:26 -07:00
Nikita Popov	4dd540d9c8	[BasicAA] Add missing inbounds to tests (NFC) Add missing inbounds to tests that are not correct without it due to possibility of offset overflow. inbounds: https://alive2.llvm.org/ce/z/LC8G9_ w/o inbounds: https://alive2.llvm.org/ce/z/ErrJVW	2021-10-29 19:05:39 +02:00
wlei	2f8196db92	[llvm-profgen] Fix bug of populating profile symbol list The previous implementation of populating the profile symbol list was wrong: it only included the profiled symbols. It should use all symbols, so this switches to the symbols from debug info. Also turned the flag off by default. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D111824	2021-10-29 09:59:12 -07:00
wlei	40ca411251	[llvm-profgen] Switch to DWARF-based symbol and ranges A bug was found where some callsite names in the profile were not real functions. It turned out that the ELF text section contains some non-function symbols, e.g. globally accessible branch labels, and that one function can be split into multiple ranges. We shouldn't count samples for symbols that are not the entry of a real function. This change fixes the issue by switching to the names and ranges from DWARF-based debug info, whose ranges guarantee a real function start. For split functions, we assume that the real entry function's DWARF name always matches the symbol table name. The switch is also consistent with the body samples' symbols, which come from DWARF. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D112282	2021-10-29 09:59:12 -07:00

403496 Commits
