llvm-project

Commit Graph

Author	SHA1	Message	Date
Björn Schäpers	bcc1dee600	[clang-format] Add StatementAttributeLikeMacros option This allows to ignore for example Qts emit when AlignConsecutiveDeclarations is set, otherwise it is parsed as a type and it results in some misformating: unsigned char MyChar = 'x'; emit signal(MyChar); Differential Revision: https://reviews.llvm.org/D93776	2021-01-18 06:54:31 +01:00
Chen Zheng	26a396c4ef	[PowerPC] support register pressure reduction in machine combiner. Reassociating some patterns to generate more fma instructions to reduce register pressure. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D92071	2021-01-17 23:56:13 -05:00
Lang Hames	e5619065b8	[JITLink][ELF] New ELF skip-debug-sections test requires asserts. This should fix the failures on Release mode testers.	2021-01-18 15:41:53 +11:00
Philip Reames	8356610f8d	[test] pre commit a couple more tests for vectorizing multiple exit loops	2021-01-17 20:29:13 -08:00
Philip Reames	7011086dc1	[test] Autogen a loop vectorizer test to make future changes visible	2021-01-17 20:03:22 -08:00
Qiu Chaofan	f776d8b12f	[Legalizer] Promote result type in expanding FP_TO_XINT This patch promotes result integer type of FP_TO_XINT in expanding. So crash in conversion from ppc_fp128 to i1 will be fixed. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D92473	2021-01-18 11:56:11 +08:00
Qiu Chaofan	2d9890775f	[PowerPC] [NFC] Add AIX triple to some regression tests As part of the effort to improve AIX support, regression test coverage misses quite a lot for AIX subtarget. This patch adds AIX triple to those don't need extra change, and we can cover more cases in following commits. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D94159	2021-01-18 11:44:00 +08:00
Juneyoung Lee	2639c162b7	[InstCombine] more tests for D94861 (NFC)	2021-01-18 11:12:52 +09:00
Lang Hames	a817f46d50	[JITLink][ELF] Skip DWARF sections in ELF objects. This matches current JITLink/MachO behavior and avoids processing currently unsupported relocations.	2021-01-18 12:42:48 +11:00
Fangrui Song	b74ae43c44	Makefile.rules: Make HOST_OS/OS simply expanded variable to avoid excess uname -s invocations This decreases the number of runs from 18 to 1.	2021-01-17 17:19:29 -08:00
Chen Zheng	3bdf4507b6	[NFC] [TargetRegisterInfo] add one use check to lookThruCopyLike. add one use check to lookThruCopyLike. The root node is safe to be deleted if we are sure that every definition in the copy chain only has one use. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D92069	2021-01-17 19:56:42 -05:00
Chandler Carruth	f855751c12	Fix openmp CMake build on non-Linux AArch64 systems. This just checks for `/proc/cpuinfo` existing before reading it. Tested on an ARM macOS machine.	2021-01-17 16:18:31 -08:00
Fangrui Song	95d146182f	Makefile.rules: Delete GCC 4.6 workaround 5.1 is the minimum supported version.	2021-01-17 13:16:38 -08:00
Pavel Labath	a89242d874	[lldb] Skip TestPlatformProcessConnect on windows and darwin The test fails (for different reasons) on these platforms. Skip for now.	2021-01-17 20:18:55 +01:00
Nikita Popov	4229b87ed3	[ValueTracking] Fix isSafeToSpeculativelyExecute for sdiv (PR48778) The != -1 check does not work correctly for all bitwidths. Use isAllOnesValue() instead.	2021-01-17 20:06:17 +01:00
Nikita Popov	1cc477f030	[SimplifyCFG] Add test for PR48778 (NFC) The sdiv is incorrectly speculated.	2021-01-17 20:06:17 +01:00
Stephen Kelly	9a7fb08487	NFC: Minor cleanup of function calls	2021-01-17 18:47:17 +00:00
Kazu Hirata	50be8e4471	[TableGen] Drop redundant const from return types (NFC) Identified with readability-const-return-type.	2021-01-17 10:39:49 -08:00
Kazu Hirata	a59126115e	[IRBuilder] "Zero"-initialize SmallVector (NFC)	2021-01-17 10:39:47 -08:00
Kazu Hirata	352fcfc697	[llvm] Use llvm::sort (NFC)	2021-01-17 10:39:45 -08:00
Raphael Isemann	7e9e6ac526	[lldb][docs] Fix some RST formatting errors related to code examples. Mostly just making sure the indentation is right (SBDebugger had 0 spaces as it was still plain text, the others had too much indentation or other minor issues).	2021-01-17 17:41:05 +01:00
Dávid Bolvanský	ed396212da	[InstCombine] Transform abs pattern using multiplication to abs intrinsic (PR45691) ``` unsigned r(int v) { return (1 \| -(v < 0)) * v; } `r` is equivalent to `abs(v)`. ``` ``` define <4 x i8> @src(<4 x i8> %0) { %1: %2 = ashr <4 x i8> %0, { 31, undef, 31, 31 } %3 = or <4 x i8> %2, { 1, 1, 1, undef } %4 = mul nsw <4 x i8> %3, %0 ret <4 x i8> %4 } => define <4 x i8> @tgt(<4 x i8> %0) { %1: %2 = icmp slt <4 x i8> %0, { 0, 0, 0, 0 } %3 = sub nsw <4 x i8> { 0, 0, 0, 0 }, %0 %4 = select <4 x i1> %2, <4 x i8> %3, <4 x i8> %0 ret <4 x i8> %4 } Transformation seems to be correct! ``` Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D94874	2021-01-17 17:06:14 +01:00
Dávid Bolvanský	469ceaf538	[Tests] Add test for PR45691	2021-01-17 17:04:49 +01:00
Raphael Isemann	acdc745689	[lldb][docs] Cleanup the Python doc strings for SB API classes The first line of the doc string ends up on the SB API class summary at the root page of the Python API web page of LLDB. Currently many of the descriptions are missing or are several lines which makes the table really hard to read. This just adds the missing docstrings where possible and fixes the formatting where necessary.	2021-01-17 16:51:07 +01:00
Nikita Popov	a13c0f62c3	[InstSimplify] Fold xC1/C2 <= x (PR48744) We can fold xC1/C2 <= x to true if C1 <= C2. This is valid even if the multiplication is not nuw: https://alive2.llvm.org/ce/z/vULors The multiplication or division can be replaced by shifts. We don't handle the case where both are shifts, as that should get folded away by InstCombine.	2021-01-17 16:02:55 +01:00
Nikita Popov	4bfbfb9bcb	[InstSimplify] Add tests for x*C1/C2<=x (NFC) Tests for PR48744.	2021-01-17 16:02:55 +01:00
Utkarsh Saxena	9abbc05097	[clangd] Use !empty() instead of size()>0	2021-01-17 15:26:40 +01:00
Utkarsh Saxena	0f9908a7c9	[clangd] Use empty() instead of size()>0	2021-01-17 15:13:01 +01:00
mydeveloperday	00dc97f167	[clang-format] PR48594 BraceWrapping: SplitEmptyRecord ignored for templates https://bugs.llvm.org/show_bug.cgi?id=48594 Empty or small templates were not being treated the same way as small classes especially when SplitEmptyRecord was set to true This revision aims to help this by identifying a case when we should try not to merge the lines together Reviewed By: curdeius, JohelEGP Differential Revision: https://reviews.llvm.org/D93839	2021-01-17 11:14:33 +00:00
Raphael Isemann	e7bc6c594b	Reland [lldb][docs] Use sphinx instead of epydoc to generate LLDB's Python reference The build server should now have the missing dependencies. Original summary: Currently LLDB uses epydoc to generate the Python API reference for the website. epydoc however is unmaintained since more than a decade and no longer works with Python 3. Also whatever setup we had once for generating the documentation on the website server no longer seems to work, so the current website documentation has been stale since more than a year. This patch replaces epydoc with sphinx and its automodapi plugin that can generate Python API references. LLVM already uses sphinx for the rest of the documentation, so this way we are more consistent with the rest of LLVM. The only new dependency is the automodapi plugin for sphinx. This patch effectively does the following things: * Remove the epydoc code. * Make a new dummy Python API page in our website that just calls the Sphinx command for generated the API documentation. * Add a mock _lldb module that is only used when generating the Python API. This way we don't have to build all of LLDB to generate the API reference. Some notes: * The long list of skips is necessary due to boilerplate functions that SWIG is generating. Sadly automodapi is not really scriptable from what I can see, so we have to blacklist this stuff manually. * The .gitignore change because automodapi wants a subfolder of our documentation directory to place generated documentation files there. The path is also what is used on the website, so we can't really workaround this (without copying the whole `docs` dir somewhere else when we build). * We have to use environment variables to pass our build path to our sphinx configuration. Sphinx doesn't support passing variables onto that script. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D94489	2021-01-17 12:13:01 +01:00
mydeveloperday	9af03864df	[clang-format] Revert `e9e6e3b34a` Reverting {D92753} due to issues with #pragma indentation in #ifdef/endif structure	2021-01-17 11:07:31 +00:00
Nikita Popov	0b84afa5fc	Reapply [BasicAA] Handle recursive queries more efficiently There are no changes relative to the original commit. However, an issue this exposed in BasicAA assumption tracking has been fixed in the previous commit. ----- An alias query currently works out roughly like this: * Look up location pair in cache. * Perform BasicAA logic (including cache lookup and insertion...) * Perform a recursive query using BestAAResults. * Look up location pair in cache (and thus do not recurse into BasicAA) * Query all the other AA providers. * Query all the other AA providers. This is a lot of unnecessary work, all ultimately caused by the BestAAResults query at the end of aliasCheck(). The reason we perform it, is that aliasCheck() is getting called recursively, and we of course want those recursive queries to also make use of other AA providers, not just BasicAA. We can solve this by making the recursive queries directly use BestAAResults (which will check both BasicAA and other providers), rather than recursing into aliasCheck(). There are some tradeoffs: * We can no longer pass through the precomputed underlying object to aliasCheck(). This is not a major concern, because nowadays getUnderlyingObject() is quite cheap. * Results from other AA providers are no longer cached inside BasicAA. The way this worked was already a bit iffy, in that a result could be cached, but if it was MayAlias, we'd still end up re-querying other providers anyway. If we want to cache non-BasicAA results, we should do that in a more principled manner. In any case, despite those tradeoffs, this works out to be a decent compile-time improvment. I think it also simplifies the mental model of how BasicAA works. It took me quite a while to fully understand how these things interact. Differential Revision: https://reviews.llvm.org/D90094	2021-01-17 10:34:35 +01:00
Nikita Popov	b1c2f1282a	[BasicAA] Move assumption tracking into AAQI D91936 placed the tracking for the assumptions into BasicAA. However, when recursing over phis, we may use fresh AAQI instances. In this case AssumptionBasedResults from an inner AAQI can reesult in a removal of an element from the outer AAQI. To avoid this, move the tracking into AAQI. This generally makes more sense, as the NoAlias assumptions themselves are also stored in AAQI. The test case only produces an assertion failure with D90094 reapplied. I think the issue exists independently of that change as well, but I wasn't able to come up with a reproducer.	2021-01-17 10:34:35 +01:00
Fangrui Song	3809f4ebab	[ELF] Support R_PPC_ADDR24 (ba foo; bla foo)	2021-01-17 00:02:13 -08:00
Kazushi (Jam) Marukawa	3cbd476c54	[VE] Support VE in libunwind Modify libunwind to support SjLj exception handling routines for VE. In order to do that, we need to implement not only SjLj exception handling routines but also a Registers_ve class. This implementation of Registers_ve is incomplete. We will work on it later when we need backtrace in libunwind. Reviewed By: #libunwind, compnerd Differential Revision: https://reviews.llvm.org/D94591	2021-01-17 15:35:02 +09:00
Craig Topper	061f681c0d	[RISCV] Remove an extra map lookup from RISCVCompressInstEmitter. NFC When we looked up the map to see if the entry already existed, this created the new entry for us. So save a reference to it so we can use it to update the entry instead of looking it up again. Also remove unnecessary StringRef constructors around string literals on calls to this function.	2021-01-16 21:20:53 -08:00
Craig Topper	1327c730bb	[RISCV] Few more minor cleanups to RISCVCompressInstEmitter. NFC -Use StringRef instead of std::string. -Const correct a parameter. -Don't call StringRef::data() before printing. Just pass the StringRef.	2021-01-16 21:09:43 -08:00
Craig Topper	2b6a92625f	[RISCV] Simplify mergeCondAndCode in RISCVCompressInstEmitter.cpp. NFC Instead forming a std::string and returning it to pass into another raw_ostream, just pass the raw_ostream as a parameter. Take StringRef as arguments instead raw_string_ostream references making the caller responsible for converting to strings. Use StringRef operations instead of std::string::substr.a	2021-01-16 20:59:48 -08:00
Craig Topper	97f7e4e8c9	[RISC] Replace dyn_casts that are only checked by an assert with a cast. NFC	2021-01-16 20:23:48 -08:00
Craig Topper	633c5afccf	[RISCV] Remove unneeded StringRef to std::string conversions in RISCVCompressInstEmitter. NFC Stop concatenating std::string before streaming into a raw_ostream. Just stream the pieces. Remove some new lines from asserts. Remove std::string concatenation from an assert. assert strings aren't really evaluated like this at runtime. An assertion failure will just print exactly what's between the parentheses in the source.	2021-01-16 20:09:45 -08:00
Fangrui Song	a048ce13e3	[X86] Default to -x86-pad-for-align=false to drop assembler difference with or w/o -g Fix PR48742: the D75203 assembler optimization locates MCRelaxableFragment's within two MCSymbol's and relaxes some MCRelaxableFragment's to reduce the size of a MCAlignFragment. A -g build has more MCSymbol's and therefore may have different assembler output (e.g. a MCRelaxableFragment (jmp) may have 5 bytes with -O1 while 2 bytes with -O1 -g). `.p2align 4, 0x90` is common due to loops. For a larger program, with a lot of temporary labels, the assembly output difference is somewhat destined. The cost seems to overweigh the benefits so we default to -x86-pad-for-align=false until the heuristic is improved. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D94542	2021-01-16 16:39:54 -08:00
Nikita Popov	5238e7b302	[InstCombine] Replace one-use select operand based on condition InstCombine already performs a fold where X == Y ? f(X) : Z is transformed to X == Y ? f(Y) : Z if f(Y) simplifies. However, if f(X) only has one use, then we can always directly replace the use inside the instruction. To actually be profitable, limit it to the case where Y is a non-expr constant. This could be further extended to replace uses further up a one-use instruction chain, but for now this only looks one level up. Among other things, this also subsumes D94860. Differential Revision: https://reviews.llvm.org/D94862	2021-01-16 23:25:02 +01:00
Roman Lebedev	32fc32317a	[SimplifyCFG] markAliveBlocks(): catchswitch: preserve PostDomTree When removing catchpad's from catchswitch, if that removes a successor, we need to record that in DomTreeUpdater. This fixes PostDomTree preservation failure in an existing test. This appears to be the single issue that i see in my current test coverage.	2021-01-17 01:21:05 +03:00
David Green	1454724215	[ARM] Align blocks that are not fallthough targets If the previous block in a function does not fallthough, adding nop's to align it will never be executed. This means we can freely (except for codesize) align more branches. This happens in constantislandspass (as it cannot happen later) and only happens at aggressive optimization levels as it does increase codesize. Differential Revision: https://reviews.llvm.org/D94394	2021-01-16 22:19:35 +00:00
David Green	2a5b576e3e	[ARM] Test for aligned blocks. NFC	2021-01-16 22:04:48 +00:00
Dávid Bolvanský	bfd75bdf3f	[NFC] Removed extra text in comments	2021-01-16 22:48:56 +01:00
Aart Bik	d8fc27301d	[mlir][sparse] improved sparse runtime support library Added the ability to read (an extended version of) the FROSTT file format, so that we can now read in sparse tensors of arbitrary rank. Generalized the API to deal with more than two dimensions. Also added the ability to sort the indices of sparse tensors lexicographically. This is an important step towards supporting auto gen of initialization code, since sparse storage formats are easier to initialize if the indices are sorted. Since most external formats don't enforce such properties, it is convenient to have this ability in our runtime support library. Lastly, the re-entrant problem of the original implementation is fixed by passing an opaque object around (rather than having a single static variable, ugh!). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D94852	2021-01-16 12:16:10 -08:00
Shilei Tian	ed939f853d	[OpenMP] Added the support for hidden helper task in RTL The basic design is to create an outer-most parallel team. It is not a regular team because it is only created when the first hidden helper task is encountered, and is only responsible for the execution of hidden helper tasks. We first use `pthread_create` to create a new thread, let's call it the initial and also the main thread of the hidden helper team. This initial thread then initializes a new root, just like what RTL does in initialization. After that, it directly calls `__kmpc_fork_call`. It is like the initial thread encounters a parallel region. The wrapped function for this team is, for main thread, which is the initial thread that we create via `pthread_create` on Linux, waits on a condition variable. The condition variable can only be signaled when RTL is being destroyed. For other work threads, they just do nothing. The reason that main thread needs to wait there is, in current implementation, once the main thread finishes the wrapped function of this team, it starts to free the team which is not what we want. Two environment variables, `LIBOMP_NUM_HIDDEN_HELPER_THREADS` and `LIBOMP_USE_HIDDEN_HELPER_TASK`, are also set to configure the number of threads and enable/disable this feature. By default, the number of hidden helper threads is 8. Here are some open issues to be discussed: 1. The main thread goes to sleeping when the initialization is finished. As Andrey mentioned, we might need it to be awaken from time to time to do some stuffs. What kind of update/check should be put here? Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D77609	2021-01-16 14:13:35 -05:00
Sanjay Patel	49b96cd9ef	[SLP] remove opcode field from reduction data class This is NFC-intended and another step towards supporting intrinsics as reduction candidates. The remaining bits of the OperationData class do not make much sense as-is, so I will try to improve that, but I'm trying to take minimal steps because it's still not clear how this was intended to work.	2021-01-16 13:55:52 -05:00
Sanjay Patel	fcfcc3cc6b	[SLP] fix typos; NFC	2021-01-16 13:55:52 -05:00

1 2 3 4 5 ...

377263 Commits All Branches Search

377263 Commits

All Branches