llvm-project

Commit Graph

Author	SHA1	Message	Date
LLVM GN Syncbot	808ac8d595	[gn build] Port `208332de8a`	2021-06-21 07:27:34 +00:00
Ruiling Song	208332de8a	[AMDGPU] Add Optimize VGPR LiveRange Pass. This pass aims to optimize VGPR live-range in a typical divergent if-else control flow. For example: def(a) if(cond) use(a) ... // A else use(a) As AMDGPU access vgpr with respect to active-mask, we can mark `a` as dead in region A. For details, please refer to the comments in implementation file. The pass is enabled by default, the frontend can disable it through "-amdgpu-opt-vgpr-liverange=false". Differential Revision: https://reviews.llvm.org/D102212	2021-06-21 15:25:55 +08:00
Nicolas Vasilache	11e9a72dfc	[mlir][Linalg] NFC - Drop unused variable definition.	2021-06-21 07:08:02 +00:00
Nicolas Vasilache	e04533d38a	[mlir][Linalg] Introduce a BufferizationAliasInfo (6/n) This revision adds a BufferizationAliasInfo which maintains and updates information about which tensors will alias once bufferized, which bufferized tensors are equivalent to others and how to handle clobbers. Bufferization greedily tries to bufferize inplace by: 1. first trying to bufferize SubTensorInsertOp inplace, in reverse order (these are deemed the most expensives). 2. then trying to bufferize all non SubTensorOp / SubTensorInsertOp, in reverse order. 3. lastly trying to bufferize all SubTensorOp in reverse order. Reverse order is a heuristic that seems to work nicely because structured tensor codegen very often proceeds by: 1. take a subset of a tensor 2. compute on that subset 3. insert the result subset into the full tensor and yield a new tensor. BufferizationAliasInfo + equivalence sets + clobber analysis allows bufferizing nested subtensor/compute/subtensor_insert sequences inplace to a certain extent. To fully realize inplace bufferization, additional container-containee analysis will be necessary and is left for a subsequent commit. Differential revision: https://reviews.llvm.org/D104110	2021-06-21 06:59:42 +00:00
LLVM GN Syncbot	b746a8db84	[gn build] Port `80fd5fa526`	2021-06-21 06:23:08 +00:00
hsmahesha	80fd5fa526	[AMDGPU] Replace non-kernel function uses of LDS globals by pointers. The main motivation behind pointer replacement of LDS use within non-kernel functions is - to avoid subsequent LDS lowering pass from directly packing LDS (assume large LDS) into a struct type which would otherwise cause allocating huge memory for struct instance within every kernel. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D103225	2021-06-21 11:51:49 +05:30
Pushpinder Singh	7a97cd9da7	[AMDGPU][Libomptarget] Remove redundant functions There does not seem to be any use of these functions. They just put the value to a local which is never used again. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D104512	2021-06-21 06:13:24 +00:00
Max Kazantsev	3f2ff7cc8c	[Test] Add some tests showing room for optimization exploiting undef and UB	2021-06-21 13:11:46 +07:00
Nathan Ridge	e37653da13	[clangd] Type hints for C++14 return type deduction Differential Revision: https://reviews.llvm.org/D103789	2021-06-21 01:13:00 -04:00
Esme-Yi	657aa3a763	[yaml2obj] Add support for writing the long symbol name. Summary: This patch, as a follow-up of D95505, adds support for writing the long symbol name by implementing the StringTable. Only XCOFF32 is suppoted now. Reviewed By: jhenderson, shchenz Differential Revision: https://reviews.llvm.org/D103455	2021-06-21 05:09:56 +00:00
Max Kazantsev	bb1dc876eb	[LoopDeletion] Handle Phis with similar inputs from different blocks This patch lifts the requirement to have the only incoming live block for Phis. There can be multiple live blocks if the same value comes to phi from all of them. Differential Revision: https://reviews.llvm.org/D103959 Reviewed By: nikic, lebedev.ri	2021-06-21 11:37:06 +07:00
Zhouyi Zhou	735ad67a4c	[clang] NFC: adjust indentation of statements with more than one lines Hi, I think it will be more beautiful to adjust indentation of statements with more than one lines. In function TreeTransform<Derived>::TransformDependentScopeDeclRefExpr the second line of statement NestedNameSpecifierLoc QualifierLoc \newline = getDerived().TransformNestedNameSpecifierLoc(E->getQualifierLoc()); is no more indent than the first line There is a similar case in function TreeTransform<Derived>::TransformUnresolvedMemberExpr Also I use clang-format to fix above functions Thanks alot Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D104145	2021-06-21 10:17:10 +08:00
Nico Weber	3a6a60f6c9	[lld/mac] Make a variable more local; no behavior change The variable used to need the wider scope, but doesn't after the reland. See LC_LINKER_OPTIONS-related discussion on https://reviews.llvm.org/D104353 for background.	2021-06-20 21:59:15 -04:00
Juneyoung Lee	ce192ced2b	[InstCombine] Use poison constant to represent the result of unreachable instrs This patch updates InstCombine to use poison constant to represent the resulting value of (either semantically or syntactically) unreachable instrs, or a don't-care value of an unreachable store instruction. This allows more aggressive folding of unused results, as shown in llvm/test/Transforms/InstCombine/getelementptr.ll . Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104602	2021-06-21 09:58:44 +09:00
Nico Weber	e6cb55d5ce	[lld/mac] Test zerofill sections after __thread_bss Real zerofill sections go after __thread_bss, since zerofill sections must all be at the end of their segment and __thread_bss must be right after __thread_data. Works fine already, but wasn't tested as far as I can tell. Also tweak comment about zerofill sections a bit. No behavior change. Differential Revision: https://reviews.llvm.org/D104609	2021-06-20 20:44:29 -04:00
Eli Friedman	62ed024c74	[NFC][ScalarEvolution] Clean up ExitLimit constructors. Make all the constructors forward to one constructor. Remove redundant assertions.	2021-06-20 17:40:30 -07:00
Jim Lin	912b3b0348	[IVDescriptors] Fix comment that getUnsafeAlgebraInst has been renamed to getExactFPMathInst https://reviews.llvm.org/rG36a489d194750dc888f214240e9dec9122ca1f0e renamed the function call in the test from getUnsafeAlgebraInst to getExactFPMathInst. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D104441	2021-06-21 07:56:22 +08:00
Jez Ng	f79e7a5a48	[lld-macho] Have inputOrder default to less than INT_MAX We make it less than INT_MAX in order not to conflict with the ordering of zerofill sections, which must always be placed at the end of their segment. This is the more structural fix for the issue addressed in {D104596}. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104607	2021-06-20 19:49:27 -04:00
Fangrui Song	89e66a3ab3	[ELF] Delete --no-cref which does not exist in GNU ld Also delete the single dash form which does not appear to be used.	2021-06-20 14:28:56 -07:00
Fangrui Song	cd6b1b2b86	[ELF][test] Add missing tests for --no-export-dynamic & --no-warn-backrefs	2021-06-20 14:20:14 -07:00
Dmitri Gribenko	ffa252e8ce	[GCOVProfiling][test] Ensure that 'opt' drops any files in a temp directory	2021-06-20 22:48:35 +02:00
Jason Molenda	af913881e3	Try to unbreak the windows CI MSVC and clang seem to disagree with whether I can do this.	2021-06-20 13:13:46 -07:00
Craig Topper	3a8c7060cc	[TypePromotion] Prune Intrinsic includes. NFC TypePromotion is meant to be a generic pass and doesn't reference any ARM intrinsics so it shouldn't include IntrinsicsARM.h. The other Intrinsic related headers appear to be unneeded as well.	2021-06-20 13:04:02 -07:00
Jason Molenda	9ea6dd5cfa	Add a corefile style option to process save-core; skinny corefiles Add a new feature to process save-core on Darwin systems -- for lldb to create a user process corefile with only the dirty (modified memory) pages included. All of the binaries that were used in the corefile are assumed to still exist on the system for the duration of the use of the corefile. A new --style option to process save-core is added, so a full corefile can be requested if portability across systems, or across time, is needed for this corefile. debugserver can now identify the dirty pages in a memory region when queried with qMemoryRegionInfo, and the size of vm pages is given in qHostInfo. Create a new "all image infos" LC_NOTE for Mach-O which allows us to describe all of the binaries that were loaded in the process -- load address, UUID, file path, segment load addresses, and optionally whether code from the binary was executing on any thread. The old "read dyld_all_image_infos and then the in-memory Mach-O load commands to get segment load addresses" no longer works when we only have dirty memory. rdar://69670807 Differential Revision: https://reviews.llvm.org/D88387	2021-06-20 12:26:54 -07:00
Nikita Popov	1ae266f452	[LoopUnroll] Use smallest exact trip count from any exit This is a more general alternative/extension to D102635. Rather than handling the special case of "header exit with non-exiting latch", this unrolls against the smallest exact trip count from any exit. The latch exit is no longer treated as priviledged when it comes to full unrolling. The motivating case is in full-unroll-one-unpredictable-exit.ll. Here the header exit is an IV-based exit, while the latch exit is a data comparison. This kind of loop does not get rotated, because the latch is already exiting, and loop rotation doesn't try to distinguish IV-based/analyzable latches. Differential Revision: https://reviews.llvm.org/D102982	2021-06-20 20:58:26 +02:00
Fangrui Song	558ee5843f	[mlir] Fix -Wunused-but-set-variable in -DLLVM_ENABLE_ASSERTIONS=off build. NFC	2021-06-20 11:55:00 -07:00
Fangrui Song	50225112b5	[lld-link] Fix -Wunused-but-set-variable in -DLLVM_ENABLE_ASSERTIONS=off build. NFC	2021-06-20 11:35:02 -07:00
Fangrui Song	521d373274	Fix -Wunused-variable and -Wunused-but-set-variable in -DLLVM_ENABLE_ASSERTIONS=off build. NFC	2021-06-20 11:09:07 -07:00
Michał Górny	d4c437c428	[lldb] [Process/elf-core] Fix reading NetBSD/i386 core dumps Add support for extracting basic data from NetBSD/i386 core dumps. FPU registers are not supported at the moment. Differential Revision: https://reviews.llvm.org/D101091	2021-06-20 18:59:21 +02:00
David Green	a24b02193a	[DSE] Remove stores in the same loop iteration DSE will currently only remove stores in the same block unless they can be guaranteed to be loop invariant. This expands that to any stores that are in the same Loop, at the same loop level. This should still account for where AA/MSSA will not handle aliasing between loops, but allow the dead stores to be removed where they overlap in the same loop iteration. It requires adding loop info to DSE, but that looks fairly harmless. The test case this helps is from code like this, which can come up in certain matrix operations: for(i=..) dst[i] = 0; for(j=..) dst[i] += src[in+j]; After LICM, this becomes: for(i=..) dst[i] = 0; sum = 0; for(j=..) sum += src[in+j]; dst[i] = sum; The first store is dead, and with this patch is now removed. Differntial Revision: https://reviews.llvm.org/D100464	2021-06-20 17:03:30 +01:00
Sanjay Patel	4c44b02d87	[InstCombine] fold ctpop-of-select with 1 or more constant arms The general pattern is mentioned in: https://llvm.org/PR50140 ...but we need to do a bit more to handle intrinsics with extra operands like ctlz/cttz.	2021-06-20 11:28:45 -04:00
Raul Tambre	56aac567ac	[libcxx] Implement P0883R2 ("Fixing Atomic Initialization") Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D103769	2021-06-20 17:37:42 +03:00
Peter Steinfeld	e7f78fb917	[flang] Implement constant folding for the NOT intrinsic I implemented constant folding for the NOT intrinsic and added some tests. Differential Revision: https://reviews.llvm.org/D104587	2021-06-20 07:25:05 -07:00
Sanjay Patel	240acb0cff	[InstCombine] avoid infinite loops with select folds of constant expressions This pair of transforms was added recently with: `8591640379` And could lead to conflicting folds: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=35399	2021-06-20 09:46:25 -04:00
Roman Lebedev	e497b12a69	[NFC][AArch64][ARM][Thumb][Hexagon] Autogenerate some tests These all (and some others) are being affected by D104597, but they are manually-written, which rather complicates checking the effect that change has on them.	2021-06-20 14:12:45 +03:00
Roman Lebedev	b1f55c33d4	[UpdateTestUtils] Print test filename when complaining about conflicting prefix Now that FileCheck eagerly complains when prefixes are unused, the update script does the same, and is becoming very common to need to drop some prefixes, yet figuring out the file it complains about isn't obvious unless it actually tells us.	2021-06-20 14:12:39 +03:00
Roman Lebedev	c5b7335dc8	[SimplifyCFG] FoldTwoEntryPHINode(): don't fold if either block has it's address taken Same as with HoistThenElseCodeToIf() (`ad87761925`).	2021-06-20 12:37:14 +03:00
Roman Lebedev	ad87761925	[SimplifyCFG] HoistThenElseCodeToIf(): don't hoist if either block has it's address taken This problem is exposed by D104598, after it tail-merges `ret` in `@test_inline_constraint_S_label`, the verifier would start complaining `invalid operand for inline asm constraint 'S'`. Essentially, taking address of a block is mismodelled in IR. It should probably be an explicit instruction, a first one in block, that isn't identical to any other instruction of the same type, so that it can't be hoisted.	2021-06-20 12:18:15 +03:00
Juneyoung Lee	09e8c0d5aa	[InstSimplify] icmp poison, X -> poison This adds a simple transformation from icmp with poison constant to poison. Comparing poison with something else is poison, so this is okay. https://alive2.llvm.org/ce/z/e8iReb https://alive2.llvm.org/ce/z/q4MurY	2021-06-20 15:39:07 +09:00
Fangrui Song	0873016cef	[llvm-cov gcov] Support GCC 12 format GCC 12 will change the length field to represent the number of bytes instead of 32-bit words. This avoids padding for strings.	2021-06-19 22:51:20 -07:00
Fangrui Song	e85eecff30	[llvm-cov gcov] Change case to match the prevailing style && replace getString with readString	2021-06-19 22:50:52 -07:00
Michael Kruse	f075760317	[Flang][test] Fix Windows buildbot. Add REQUIRES: shell to tests that execute a UNIX shell script to not run on Windows.	2021-06-19 22:23:02 -05:00
Fangrui Song	cee85fcd76	[test] Fix nocompress.test	2021-06-19 16:27:53 -07:00
Petr Hosek	d4c2b973ed	[profile] Fix variable name This fixes a bug introduced in `d85c258fd1`.	2021-06-19 14:55:32 -07:00
Fangrui Song	8ea2a58a2e	[llvm-profdata] Make diagnostics consistent with the (no capitalization, no period) style The format is currently inconsistent. Use the https://llvm.org/docs/CodingStandards.html#error-and-warning-messages style. And add `error:` or `warning:` to CHECK lines wherever appropriate.	2021-06-19 14:54:25 -07:00
Petr Hosek	d85c258fd1	[profile] Don't publish VMO if there are no counters If there are no counters, there's no need to publish the VMO. Differential Revision: https://reviews.llvm.org/D102786	2021-06-19 14:47:57 -07:00
Martin Storsjö	e1adf90826	[LLD] [COFF] Avoid doing repeated fuzzy symbol lookup for each iteration. NFC. This is run every time around in the main linker loop. Once a match has been found, stop trying to rematch such a symbol. Not sure if this has any actual measurable performance impact though (SymbolTable::findMangle() iterates over the whole symbol table for each call and does fuzzy matching on top of that) but this makes the code more reassuring to read at least. (This is in practice run for def files listing undecorated stdcall functions to be exported.) Differential Revision: https://reviews.llvm.org/D104529	2021-06-19 22:32:37 +03:00
Martin Storsjö	1c8bb625b7	[LLD] [MinGW] Print errors/warnings in lld-link with a "ld.lld" prefix Pass the original argv[0] to the coff linker, as the coff linker uses the basename of argv[0] as the log prefix. This makes error messages to be printed with a "ld.lld:" prefix instead of "lld-link:". The current "lld-link:" prefix can be confusing to users, as they're invoking the MinGW linker (and might not even have a lld-link executable). Keep the first argument as lld-link when printing the command line, to make it an actually reproducible standalone command. Differential Revision: https://reviews.llvm.org/D104526	2021-06-19 22:32:37 +03:00
Fangrui Song	0f558db742	[llvm-profdata] Delete unneeded empty output filename check	2021-06-19 12:20:45 -07:00
Craig Topper	b663f30fa4	[RISCV] Prevent formation of shXadd(.uw) and add.uw if it prevents the use of addi. If the outer add has an simm12 immediate operand we should prefer it instead of materializing it in a register. This would guarantee and extra instruction and temporary register. Since we don't check one use on the shl or zext we might generate more instructions if there is an additional user.	2021-06-19 12:10:42 -07:00

1 2 3 4 5 ...

391573 Commits All Branches Search

391573 Commits

All Branches