llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	dd030036f0	Put back initializers that were dropped in `0ab5b5b858` Found by msan.	2020-03-28 22:06:12 +01:00
Benjamin Kramer	b578f130a7	[COFF] Stabilize sort Found by llvm::sort's expensive checks.	2020-03-28 21:38:50 +01:00
Benjamin Kramer	ba2e72c54e	[MDBuilder] Don't use stable sort for sorting integers.	2020-03-28 21:19:46 +01:00
Nikita Popov	2215dcf1d7	[InstCombine] Remove unreachable blocks before DCE Dropping unreachable code may reduce use counts on other instructions, so it's better to do this earlier rather than later. NFC-ish, may only impact worklist order.	2020-03-28 21:19:16 +01:00
Nikita Popov	97cc1275c7	[InstCombine] Merge two functions; NFC Merge AddReachableCodeToWorklist() into prepareICWorklistFromFunction(). It's one logical step, and this makes it easier to move code.	2020-03-28 21:19:16 +01:00
Benjamin Kramer	d3b6e1f1f9	[ADT] Automatically forward llvm::sort to array_pod_sort if safe This is safe if the iterator type is a pointer and the comparator is stateless. The enable_if pattern I'm adding here only uses array_pod_sort for the default comparator (std::less). Using array_pod_sort has a potential performance impact, but I didn't notice anything when testing clang. Sorting doesn't seem to be on the hot path anywhere in LLVM. Shrinks Release+Asserts clang by 73k.	2020-03-28 20:20:14 +01:00
Benjamin Kramer	2d24d74b85	[AMDGPU] Stabilize sort order Found by the expensive checks in llvm::sort.	2020-03-28 20:20:14 +01:00
Yonghong Song	ced0d1f42b	[BPF] support 128bit int explicitly in layout spec Currently, bpf does not specify 128bit alignment in its layout spec. So for a structure like struct ipv6_key_t { unsigned pid; unsigned __int128 saddr; unsigned short lport; }; clang will generate IR type %struct.ipv6_key_t = type { i32, [12 x i8], i128, i16, [14 x i8] } Additional padding is to ensure later IR->MIR can generate correct stack layout with target layout spec. But it is common practice for a tracing program to be first compiled with target flag (e.g., x86_64 or aarch64) through clang to generate IR and then go through llc to generate bpf byte code. Tracing program often refers to kernel internal data structures which needs to be compiled with non-bpf target. But such a compilation model may cause a problem on aarch64. The bcc issue https://github.com/iovisor/bcc/issues/2827 reported such a problem. For the above structure, since aarch64 has "i128:128" in its layout string, the generated IR will have %struct.ipv6_key_t = type { i32, i128, i16 } Since bpf does not have "i128:128" in its spec string, the selectionDAG assumes alignment 8 for i128 and computes the stack storage size for the above is 32 bytes, which leads incorrect code later. The x86_64 does not have this issue as it does not have "i128:128" in its layout spec as it does permits i128 to be alignmented at 8 bytes at stack. Its IR type looks like %struct.ipv6_key_t = type { i32, [12 x i8], i128, i16, [14 x i8] } The fix here is add i128 support in layout spec, the same as aarch64. The only downside is we may have less optimal stack allocation in certain cases since we require 16byte alignment for i128 instead of 8. But this is probably fine as i128 is not used widely and in most cases users should already have proper alignment. Differential Revision: https://reviews.llvm.org/D76587	2020-03-28 11:46:29 -07:00
Benjamin Kramer	4065e92195	Upgrade some instances of std::sort to llvm::sort. NFC.	2020-03-28 19:23:29 +01:00
Benjamin Kramer	347e31c052	Remove constexpr that MSVC doesn't like	2020-03-28 19:23:29 +01:00
Reid Kleckner	e5bf5037d8	[CodeGen] Fix sinking local values in lpads with phis There was already a test case for landingpads to handle this case, but I had forgotten to consider PHI instructions preceding the EH_LABEL in the landingpad. PR45261	2020-03-28 11:10:33 -07:00
Nikita Popov	30d712103f	[InstCombine] Use replaceOperand() API in GEP transforms To make sure that replaced operands get DCEd. This drops one iteration from gepphigep.ll, which is still not optimal. This was the last test case performing more than 3 iterations. NFC-ish, only worklist order should change.	2020-03-28 19:07:25 +01:00
Nikita Popov	b1f78baeaa	[InstCombine] Reduce code duplication in GEP of PHI transform; NFC The `NewGEP->setOperand(DI, NewPN)` call was duplicated, and the insertion of NewGEP is the same in both if/else, so we can extract it.	2020-03-28 19:07:25 +01:00
Benjamin Kramer	e8743c0f38	Const-initialize ParsedAttrInfos Gets rid of a 150k static initializer (Release clang)	2020-03-28 19:04:53 +01:00
Alexandre Ganea	3ab3f3c5d5	After `09158252f7`, fix build when -DLLVM_ENABLE_THREADS=OFF Tested on Linux with Clang 9, and on Windows with Visual Studio 2019 16.5.1 with -DLLVM_ENABLE_THREADS=ON and OFF.	2020-03-28 13:54:58 -04:00
Nikita Popov	672e8bfbfc	[InstCombine] Fix worklist management in foldXorOfICmps() Because this code does not use the IC-aware replaceInstUsesWith() helper, we need to manually push users to the worklist. This is NFC-ish, in that it may only change worklist order.	2020-03-28 18:25:21 +01:00
Nikita Popov	337b671b0d	[InstCombine] Change limit-max-iterations test case; NFC This particular case will stop needing multiple iterations in a followup change.	2020-03-28 18:25:20 +01:00
Matt Schulte	fdc41aa22c	[lld][ELF] Mark empty NOLOAD output sections SHT_NOBITS instead of SHT_PROGBITS This fixes PR# 45336. Output sections described in a linker script as NOLOAD with no input sections would be marked as SHT_PROGBITS. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D76981	2020-03-28 10:07:58 -07:00
Enna1	03bc311a16	[CorrelatedValuePropagation] Remove redundant if statement in processSelect() This statement if (ReplaceWith == S) ReplaceWith = UndefValue::get(S->getType()); is introduced in https://reviews.llvm.org/rG35609d97ae89b8e13f40f4e6b9b056954f8baa83 to fix a case where unreachable code can cause select instruction simplification to fail. In https://reviews.llvm.org/rGd10480657527ffb44ea213460fb3676a6b1300aa, we begin to perform a depth-first walk of basic blocks. This means we will not visit unreachable blocks. So we do not need this the special check any more. Differential Revision: https://reviews.llvm.org/D76753	2020-03-28 18:01:17 +01:00
Martin Storsjö	e6112a56dd	[AsmPrinter] Emit .weak directive for weak linkage on COFF for symbols without a comdat MC already knows how to emulate the .weak directive (with its ELF semantics; i.e., an undefined weak symbol resolves to 0, and a defined weak symbol has lower link precedence than a strong symbol of the same name) using COFF weak externals. Plumb this through the ASM printer too, so that definitions marked with __attribute__((weak)) at the language level (which gets translated to weak linkage at the IR level) have the corresponding .weak directive emitted. Note that declarations marked with __attribute__((weak)) at the language level (which translates to extern_weak at the IR level) already have .weak directives emitted. Weak/linkonce symbols without an associated comdat (in particular, ones generated with __attribute__((weak)) in C/C++) were earlier emitted as normal unique globals, as the comdat is required to provide the linkonce semantics. This change makes sure they are emitted as .weak instead, allowing other symbols to override them. Rename the existing coff-weak.ll test to coff-linkonce.ll. I'm not quite sure what that test covers, since the behavior being tested in it (the emission of a one_only section) is just a result of passing -function-sections to llc; the linkonce_odr makes no difference. Add a new coff-weak.ll which tests the new directive emission. Based on an previous patch by Shoaib Meenai. Differential Revision: https://reviews.llvm.org/D44543	2020-03-28 18:48:58 +02:00
Alex Brachet	6a4f8423ae	[libc] Only use __has_builtin on clang The preprocessor reads the whole line even if the first condition of an and is false so this broke when compiling on older gcc versions which don't recognize `__has_builtin`	2020-03-28 12:28:43 -04:00
Florian Hahn	81f173ed0e	[SCCP] Remove LatticeVal alias now that transition is done (NFC). The LatticeVal alias was introduced to reduce the diff size for the transition to ValueLatticeElement, which is done now. This patch removes the unnecessary alias and updates some very verbose type uses with auto.	2020-03-28 15:40:24 +00:00
Florian Hahn	a44bf59c93	[SCCP] Remove unused toLatticeValue helper (NFC). LatticeVal is an alias for ValueLatticeElement and the function is not used any longer.	2020-03-28 15:40:24 +00:00
Kadir Cetinkaya	9619c2cc9a	[clang][Syntax] Handle macro arguments in spelledForExpanded Reviewers: sammccall Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75446	2020-03-28 16:35:46 +01:00
Raphael Isemann	14db82c929	[lldb][NFC] Fix typo in TestInvalidArgsLog	2020-03-28 16:16:08 +01:00
Michael Liao	cb6389360b	Fix GCC warning on enum class bitfield. NFC.	2020-03-28 10:20:34 -04:00
Michael Liao	d2dd0fac48	Fix `-Wsign-compare` warning. NFC.	2020-03-28 10:20:27 -04:00
Martin Storsjö	8330dcadb8	[llvm-rc] Allow -1 for menu item IDs This seems to be used in some resource files, e.g. `f3217573d7/include/wx/msw/wx.rc (L28)`. MSVC rc.exe and GNU windres both allow any value here, and silently just truncate to uint16_t range. This just explicitly allows the -1 value and errors out on others - the same was done for control IDs in dialogs in `c1a67857ba`. Differential Revision: https://reviews.llvm.org/D76951	2020-03-28 14:32:08 +02:00
Simon Pilgrim	8c1dbd5c1e	[X86][SSE] Add testnzc(~X,Y) -> testnzc(X,Y) test cases	2020-03-28 10:56:57 +00:00
Simon Pilgrim	d34d2ec28b	[X86][SSE] Add original PR38522 test case	2020-03-28 10:56:57 +00:00
Simon Pilgrim	8d85da5f5a	[X86][SSE] Add combine tests for PTEST/TESTPS/TESTPD instructions Including some test coverage for PR38522	2020-03-28 10:56:57 +00:00
Evan LeClercq	37943e518c	[docs] Added solutions to slow build under common problems. I added a list of options to configure should someone have issues with long build time or running out of memory. This was added under common problems in the getting started section of the documentation. Reviewed By: Meinersbur, dim, e-leclercq Differential Revision: https://reviews.llvm.org/D75425	2020-03-28 04:19:45 -05:00
Dmitry Vyukov	65b4695375	tsan: fix build bugs 1. Fix expression inside of DCHECK_LE. 2. Add a debug build test to buildgo.sh. 3. Fix race binary path in buildgo.sh.	2020-03-28 09:09:39 +01:00
Uday Bondhugula	06066c4003	[NFC] Attributor comment updates / cast cleanup Minor update/fixes to comments for the Attributor pass, and dyn_cast -> cast. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76972	2020-03-28 13:36:43 +05:30
Siva Chandra Reddy	51b899c846	[libc] Extend add_object rule to handle helper object libraries. The rule is now called add_object_library. Reviewers: abrachet Differential Revision: https://reviews.llvm.org/D76826	2020-03-28 00:07:46 -07:00
Alex Brachet	255e634bf7	[libc][NFC] Ensure internal implementation is in __llvm_libc namespace Summary: In preparation for D76818. Reviewers: PaulkaToast, sivachandra, gchatelet Reviewed By: PaulkaToast, sivachandra Subscribers: MaskRay, tschuett, libc-commits Differential Revision: https://reviews.llvm.org/D76967	2020-03-28 02:59:42 -04:00
Jason Molenda	f0a5af906b	Merge in symbols from Mach-O dyld trie to the symbol table In ObjectFileMachO we construct the symbol table from multiple sources -- primarily the binary's nlist records, but when the nlist symbols have been stripped, we would augment those with function start address from the LC_FUNCTION_STARTS or eh_frame. This patch adds another source of symbols - the exported symbols that the dynamic linker, dyld, uses at runtime from its trie structure. This provides us names and addresses for these functions/data. This patch removes the code from ParseSymtab that would reject an empty symbol table / nlist source. It adds a new symbols_added set which tracks the address of every symbol we've added to the symtab. We add symbols in most-information-ful order, and before adding a symbol from less-informational-ful source (e.g. LC_FUNCTION_STARTS with no function name), we check if that symbol has already been added. On targets with thumb code generation, instead of using the 0th bit in these addresses in FunctionStarts (or now the trie entries), we use the data field of FunctionStarts (formerly used to track if the func_start should be added) and a flag for the trie entries to encode this, and only store the actual addresses in the symbols_seen and these vectors. <rdar://problem/50791451> Differential revision: https://reviews.llvm.org/D76758	2020-03-27 22:53:15 -07:00
Serge Pavlov	f398739152	[FEnv] Constfold some unary constrained operations This change implements constant folding to constrained versions of intrinsics, implementing rounding: floor, ceil, trunc, round, rint and nearbyint. Differential Revision: https://reviews.llvm.org/D72930	2020-03-28 12:28:33 +07:00
Yaxun (Sam) Liu	369e26ca9e	[AMDGPU] Add __builtin_amdgcn_workgroup_size_x/y/z The main purpose of introducing these builtins is to add a range metadata [1, 1025) on the work group size loaded from dispatch ptr, which cannot be done by source code. Differential Revision: https://reviews.llvm.org/D76772	2020-03-28 01:03:20 -04:00
Richard Smith	499b2a8d63	PR45294: Fix handling of assumed template names looked up in the lexical scope. There are a few contexts in which we assume a name is a template name; if such a context is one where we should perform an unqualified lookup, and lookup finds nothing, we would form a dependent template name even if the name is not dependent. This happens in particular for the lookup of a pseudo-destructor. In passing, rename ActOnDependentTemplateName to just ActOnTemplateName given that we apply it for non-dependent template names too.	2020-03-27 21:07:06 -07:00
Mehdi Amini	9dd92a5697	Remove spurious duplicated header include (NFC)	2020-03-28 03:49:53 +00:00
Richard Smith	88c7ffaf94	Form invalid template-id annotations when parsing a construct that is required to be a template-id but names an undeclared identifier.	2020-03-27 20:27:42 -07:00
Sean Silva	6677477190	Verify number of result types in generated builder. Summary: This just bit me and is nasty to debug. Differential Revision: https://reviews.llvm.org/D76888	2020-03-27 20:05:42 -07:00
Jonas Devlieghere	190df4a5bc	Revert "[FileCollector] Add a method to add a whole directory and it contents." This reverts commit `8913769e35` because the unit test is failing on the Windows bot.	2020-03-27 19:21:48 -07:00
Ilya Leoshkevich	fad47d2225	[compiler-rt] Fall back to internal_uname() when called early Summary: Commit `5f5fb56c68` ("[compiler-rt] Intercept the uname() function") broke sanitizer-x86_64-linux and clang-cmake-thumbv7-full-sh (again) builds: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/26313 http://lab.llvm.org:8011/builders/clang-cmake-thumbv7-full-sh/builds/4324 The reason is that uname() can be called as early as __pthread_initialize_minimal_internal(). When intercepted, this triggers ASan initialization, which eventually calls dlerror(), which in turn uses pthreads, causing all sorts of issues. Fix by falling back to internal_uname() when interceptor runs before ASan is initialized. This is only for Linux at the moment. Reviewers: eugenis, vitalybuka Reviewed By: eugenis Subscribers: dberris, #sanitizers, pcc Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D76919	2020-03-28 03:14:19 +01:00
Richard Smith	0c42539df3	Improve error recovery from missing '>' in template argument list. Produce the conventional "to match this '<'" note, so that the user knows why we expected a '>', and properly handle '>>' in C++11 onwards.	2020-03-27 18:59:01 -07:00
Uday Bondhugula	fec6c5ac5e	[MLIR][NFC] Move TestDialect -> Dialect/Test Move test/lib/TestDialect to test/lib/Dialect/Test - makes the dir structure more uniform. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76677	2020-03-28 07:12:55 +05:30
Uday Bondhugula	43a95a543f	[MLIR] Introduce full/partial tile separation using if/else This patch introduces a utility to separate full tiles from partial tiles when tiling affine loop nests where trip counts are unknown or where tile sizes don't divide trip counts. A conditional guard is generated to separate out the full tile (with constant trip count loops) into the then block of an 'affine.if' and the partial tile to the else block. The separation allows the 'then' block (which has constant trip count loops) to be optimized better subsequently: for eg. for unroll-and-jam, register tiling, vectorization without leading to cleanup code, or to offload to accelerators. Among techniques from the literature, the if/else based separation leads to the most compact cleanup code for multi-dimensional cases (because a single version is used to model all partial tiles). INPUT affine.for %i0 = 0 to %M { affine.for %i1 = 0 to %N { "foo"() : () -> () } } OUTPUT AFTER TILING W/O SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0)[s0] -> (d0 + 32, s0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.for %arg4 = #map0(%arg2) to min #map1(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map1(%arg3)[%N] { "foo"() : () -> () } } } } OUTPUT AFTER TILING WITH SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0) -> (d0 + 32)> map2 = affine_map<(d0)[s0] -> (d0 + 32, s0)> #set0 = affine_set<(d0, d1)[s0, s1] : (-d0 + s0 - 32 >= 0, -d1 + s1 - 32 >= 0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.if #set0(%arg2, %arg3)[%M, %N] { // Full tile. affine.for %arg4 = #map0(%arg2) to #map1(%arg2) { affine.for %arg5 = #map0(%arg3) to #map1(%arg3) { "foo"() : () -> () } } } else { // Partial tile. affine.for %arg4 = #map0(%arg2) to min #map2(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map2(%arg3)[%N] { "foo"() : () -> () } } } } } The separation is tested via a cmd line flag on the loop tiling pass. The utility itself allows one to pass in any band of contiguously nested loops, and can be used by other transforms/utilities. The current implementation works for hyperrectangular loop nests. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76700	2020-03-28 06:58:35 +05:30
Kazuaki Ishizaki	e5a8512655	[mlir] NFC: fix trivial typo in source files Summary: fix trivial typos in the source files Reviewers: mravishankar, antiagainst, nicolasvasilache, herhut, rriddle, aartbik Reviewed By: antiagainst, rriddle Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, bader, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76876	2020-03-28 10:12:49 +09:00
Jessica Paquette	98d05f88d5	[GlobalISel] Fix equality for copies from physregs in matchEqualDefs When we see this: ``` %a = COPY $physreg ... SOMETHING implicit-def $physreg ... %b = COPY $physreg ``` The two copies are not equivalent, and so we shouldn't perform any folding on them. When we have two instructions which use a physical register check that they define the same virtual register(s) as well. e.g., if we run into this case ``` %a = COPY $physreg ... %b = COPY %a ``` we can say that the two copies are the same, and can be folded. Differential Revision: https://reviews.llvm.org/D76890	2020-03-27 17:52:21 -07:00

... 3 4 5 6 7 ...

346701 Commits All Branches Search

346701 Commits

All Branches