llvm-project

Commit Graph

Author	SHA1	Message	Date
Patrick Oppenlander	b3163c1cdd	[Driver] Support PowerPC SPE musl dynamic linker name ld-musl-powerpc-sf.so.1 Musl treats PowerPC SPE as a soft-float target (as the PowerPC SPE ABI is soft-float compatible). Reviewed By: jhibbits, MaskRay Differential Revision: https://reviews.llvm.org/D105869	2021-11-28 15:39:55 -08:00
Fangrui Song	7ea662e2dd	[ELF] Replace one make_unique from r316378 with a stack object. NFC	2021-11-28 15:32:29 -08:00
Fangrui Song	25c7ec4fc6	[ELF] Simplify OutputSection::sectionIndex assignment. NFC And improve comments.	2021-11-28 14:56:29 -08:00
Fangrui Song	d060cc1f98	[ELF] Fix out-of-bounds write in memset(&Out::first, ...) Fix r285764: there is no guarantee that Out::first is placed before other static data members of `struct Out`. After `bufferStart` was introduced, this out-of-bounds write is destined in many compilers. It is likely benign, though. And move `Out::elfHeader->size` assignment beside `Out::elfHeader->sectionIndex`	2021-11-28 14:47:57 -08:00
Fangrui Song	cecc6893a0	[ELF] Simplify assignFileOffsets There is a difference with non-SHF_ALLOC SHT_NOBITS when off%sh_addralign!=0 which doesn't happen/matter in practice.	2021-11-28 13:44:42 -08:00
Nicolas Vasilache	f5a9bfdf8f	[mlir] NFC - Move invalid.mlir tests to the proper dialects	2021-11-28 21:30:40 +00:00
Dimitry Andric	df08b2fe8b	[AArch64] Avoid crashing on invalid -Wa,-march= values As reported in https://bugs.freebsd.org/260078, the gnutls Makefiles pass -Wa,-march=all to compile a number of assembly files. Clang does not support this -march value, but because of a mistake in handling the arguments, an unitialized Arg pointer is dereferenced, which can cause a segfault. Work around this by adding a check if the local WaMArch variable is initialized, and if so, using its value in the diagnostic message. Reviewed By: tschuett Differential Revision: https://reviews.llvm.org/D114677	2021-11-28 22:23:42 +01:00
Nikita Popov	3608e18a94	[DSE] Use MapVector for IOLs I'm not sure whether this can cause any actual non-determinism, but at least it makes the DSE debug log non-deterministic, which makes it harder to debug other non-determinism issues.	2021-11-28 21:54:29 +01:00
Fangrui Song	f9a4d9aa03	[ELF] -z separate-*: Use max-page-size instead of common-page-size for text/non-SHF_ALLOC transition and writeTrapInstr For -z separate-code and -z separate-loadable-segments: When RW is present, the RX to RW transition is aligned with max-page-size. When RW is absent, the RX to non-SHF_ALLOC transition should use max-page-size as well.	2021-11-28 12:47:50 -08:00
Fangrui Song	6c1c2313d1	[ELF] Simplify assignFileOffsets. NFC	2021-11-28 11:43:59 -08:00
Florian Hahn	3495090b9b	[LV] Move code from widenGEP to VPWidenGEPRecipe (NFC). The code in widenGEP has already been transitioned to only rely on information provided by VPWidenGEPRecipe directly. Moving the code directly to VPWidenGEPRecipe::execute completes the transition for the recipe. It provides the following advantages: 1. Less indirection, easier to see what's going on. 2. Removes accesses to fields of ILV. 2) in particular ensures that no dependencies on fields in ILV for GEP code generation are re-introduced. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D114321	2021-11-28 18:29:18 +00:00
Kazu Hirata	c73fc74ce0	[llvm] Use range-based for loops (NFC)	2021-11-28 10:04:54 -08:00
Nico Weber	8874ada906	[clang] Fix -Wreturn-type false positive in @try statements After `04f30795f1`, -Wreturn-type has an effect on functions that contain @try/@catch statements. CheckFallThrough() was missing a case for ObjCAtTryStmts, leading to a false positive. (What about the other two places in CheckFallThrough() that handle CXXTryStmt but not ObjCAtTryStmts? - I think the last use of CXXTryStmt is dead in practice: `04c6851cd` made it so that calls never add edges to try bodies, and the CFG block for a try statement is always an empty block containing just the try element itself as terminator (the try body itself is part of the normal flow of the function and not connected to the block for the try statement itself. The try statment cfg block is only connected to the catch bodies, and only reachable from throw expressions within the try body.) - The first use of CXXTryStmt might be important. It looks similar to the code that adds all cfg blocks for try statements as roots of the reachability graph for the reachability warnings, but I can't find a way to trigger it. So I'm omitting it for now. The CXXTryStmt code path seems to only be hit by try statements that are function bodies without a surrounding compound statements (`f() try { ... } catch ...`), and those don't exist for ObjC @try statements. ) Fixes PR52473. Differential Revfision: https://reviews.llvm.org/D114660	2021-11-28 12:56:46 -05:00
David Green	04b5c00952	[ARM] Add testing for various fptosi.sat patterns. NFC	2021-11-28 16:36:17 +00:00
Sanjay Patel	f55d1eb374	[InstCombine] use decomposeBitTestICmp to make icmp (trunc X), C more consistent This is a follow-on suggested in D112634. Two folds that were added with that patch are subsumed in the call to decomposeBitTestICmp, and two other folds are potentially inverted. The deleted folds were very specialized by instcombine standards because they were restricted to legal integer types based on the data layout. This generalizes the canonical form independent of target/types. This change has a reasonable chance of exposing regressions either in IR or codegen, but I don't have any evidence for either of those yet. A spot check of asm across several in-tree targets shows variations that I expect are mostly neutral. We have one improvement in an existing IR test that I noted with a comment. Using mask ops might also make more code match with D114272. Differential Revision: https://reviews.llvm.org/D114386	2021-11-28 09:59:37 -05:00
Sanjay Patel	97755ab1c6	[InstCombine] reduce code duplication; NFC	2021-11-28 09:27:20 -05:00
Mark de Wever	cb68fc814a	[libc++] Add myself to the credits. Noticed while updating the credits for the to_chars patch.	2021-11-28 15:24:45 +01:00
Mark de Wever	730dccb986	[libc++] Remove support for Clang 9 and older. I encountered this while reviewing an unrelated patch. Will land after the CI passes. Reviewed By: #libc, Mordante Differential Revision: https://reviews.llvm.org/D114673	2021-11-28 15:17:05 +01:00
Florian Hahn	7b75110fac	[SCEV] Turn validity check in getExistingSCEV into assert (NFC). Now that we track users of SCEV expressions, we should be able to always invalidate containing expressions. With that, I think the case where a value gets removed but SCEVs containing references to it should not be possible any longer. Turn check into an assert. This slightly reduces compile-time: NewPM-O3: -0.27% NewPM-ReleaseThinLTO: -0.21% NewPM-ReleaseLTO-g: -0.26% http://llvm-compile-time-tracker.com/compare.php?from=c3dc6b081da6ba503e67d260033f81f61eb38ea3&to=95a4a028b1f1dd0bc3d221435953b7d2c031b3d5&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D114633	2021-11-28 12:16:55 +00:00
Mark de Wever	01631ffcfc	[libc++][format] Adds formatting benchmarks. These benchmarks will be used to test the performance inpact of the next set of optimization patches. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D110501	2021-11-28 13:12:53 +01:00
Vitaly Buka	168bc7ce7e	[sanitizer] Remove storeIds and use padding of StackDepotNode Depends on D114489. Reviewed By: morehouse, dvyukov Differential Revision: https://reviews.llvm.org/D114490	2021-11-28 01:58:49 -08:00
Sander de Smalen	28a4deab92	[LV] Fix incorrectly marking a pointer indvar as 'scalar'. collectLoopScalars should only add non-uniform nodes to the list if they are used by a load/store instruction that is marked as CM_Scalarize. Before this patch, the LV incorrectly marked pointer induction variables as 'scalar' when they required to be widened by something else, such as a compare instruction, and weren't used by a node marked as 'CM_Scalarize'. This case is covered by sve-widen-phi.ll. This change also allows removing some code where the LV tried to widen the PHI nodes with a stepvector, even though it was marked as 'scalarAfterVectorization'. Now that this code is more careful about marking instructions that need widening as 'scalar', this code has become redundant. Differential Revision: https://reviews.llvm.org/D114373	2021-11-28 09:49:28 +00:00
Sander de Smalen	a9f837bbf0	NFC: Simplify sve-widen-phi.ll by unrolling once. The unroll factor > 1 has little value for what is being tested.	2021-11-28 09:49:28 +00:00
Vitaly Buka	cc2794abea	[sanitizer] Switch StackStore from pointers to 32bit IDs Depends on D114488. Reviewed By: morehouse, dvyukov, kstoimenov Differential Revision: https://reviews.llvm.org/D114489	2021-11-28 01:44:28 -08:00
Chia-hung Duan	2afd16fe72	[mlir] Enable MLIRDialectUtilsTests Also remove `TooFewDims` test which tried to create an invalid AffineMap. The creation of an invalid AffineMap is rejected by `willBeValidAffineMap`, as a result we can deprecate the test. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D114657	2021-11-27 22:36:43 +00:00
David Green	5c64d8ef8c	[ARM] CSINC/CSINV patterns from CMOV We sometimes end up generating CMOV with constant operands that can be simplified to CSINC or CSINV under Arm-8.1m. This adds some simple patterns for them. Differential Revision: https://reviews.llvm.org/D114349	2021-11-27 20:21:41 +00:00
Kazu Hirata	ff649e0802	[Target] Use range-based for loops (NFC)	2021-11-27 11:16:19 -08:00
David Green	7d5d063c77	[ARM] Fold away unnecessary CSET/CMPZ Codegen from expanded vector operations can end up with unnecessary CMPZ/CSINC, of the form: CSXYZ A, B, C1 (CMPZ (CSINC 0, 0, C2, D), 0) These can be converted to remove the CMPZ and CSINC, depending on the condition. if C1==NE -> CSXYZ A, B, C2, D if C1==EQ -> CSXYZ A, B, NOT(C2), D Differential Revision: https://reviews.llvm.org/D114013	2021-11-27 19:07:16 +00:00
Simon Pilgrim	7ba64ab05a	[PowerPC] Regenerate ppc64-P9-vabsd.ll tests	2021-11-27 16:43:50 +00:00
Kristina Bessonova	9043289326	[DwarfCompileUnit] Set parent DIE right after creating a local entity No functional changes intended. Before this patch DwarfCompileUnit::createScopeChildrenDIE() and DwarfCompileUnit::createAndAddScopeChildrenDIE() used to emit child subtrees and then when all the children get created, attach them to a parent scope DIE. However, when a DIE doesn't have a parent, all the requests for its unit DIE fail. Currently, this is not a big issue since it isn't usually needed to know unit DIE for a local (function-scoped) entity. But once we introduce lexical blocks as a valid scope for global variables (static locals) and type DIEs, any requests for a unit DIE need to be guarded against local scope due to the potential absence of the DIE's parent. To avoid the aforementioned issue, this patch refactors a few DwarfCompileUnit methods to support the idea of attaching a DIE to its parent as close to the creation of this DIE as possible. Reviewed By: ellis Differential Revision: https://reviews.llvm.org/D114350	2021-11-27 17:59:07 +02:00
Nikita Popov	f492a414ba	[SCEV] Simplify forgetSymbolicName() (NFCI) With the recently introduced tracking as well as D113349, we can greatly simplify forgetSymbolicName(). In fact, we can simply replace it with forgetMemoizedResults(). What forgetSymbolicName() used to do is to walk the IR use-def chain to find all SCEVs that mention the SymbolicName. However, thanks to use tracking, we can now determine the relevant SCEVs in a more direct way. D113349 is needed to also clear out the actual IR to SCEV mapping in ValueExprMap. Differential Revision: https://reviews.llvm.org/D114263	2021-11-27 16:42:38 +01:00
Nikita Popov	c2550e3427	[SCEV] Simplify invalidation after BE count calculation (NFCI) After backedge taken counts have been calculated, we want to invalidate all addrecs and dependent expressions in the loop, because we might compute better results with the newly available backedge taken counts. Previously this was done with a forgetLoop() style use-def walk. With recent improvements to SCEV invalidation, we can instead directly invalidate any SCEVs using addrecs in this loop. This requires a great deal less subtlety to avoid invalidating more than necessary, and in particular gets rid of the hack from D113349. The change is similar to D114263 in spirit.	2021-11-27 16:35:06 +01:00
David Green	1b2d58ba90	[ARM] Extra testing for v2i1 types. NFC This adds extra tests for various operations from making the v2i1 type legal.	2021-11-27 13:21:09 +00:00
Florian Hahn	25dad1064b	[DSE] Optimize defining access of defs while walking upwards. This patch extends the code that walks memory defs upwards to find clobbering accesses to also try to optimize the clobbering defining access. We should be able to find set the optimized access of our starting def (KillingDef), if the following holds: 1. It is the first call of getDomMemoryDef for KillingDef (so Current == KillingDef->getDefiningAccess(). 2. No potentially aliasing defs are skipped. Then if a (partly) aliasing def is encountered, it can be used as optimized access for KillingDef. No further optimizations can be applied to KillingDef. I'd appreciate a careful look, as the existing documentation is not too clear on what is expected for optimized accesses. The motivation for this patch is to use the optimized accesses to cover more cases of redundant stores as follow-up to D111727. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D112313	2021-11-27 13:04:28 +00:00
Kristina Bessonova	9f374a74c2	[NVPTX][AsmPrinter] Avoid removing globals before calling AsmPrinter::doFinalization() Instead of removing globals from a module, we, it seems, can just override AsmPrinter::emitGlobalVariable() to do nothing as NVPTXAsmPrinter already emitted globals by this time and we don't want to do it twice. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D113653	2021-11-27 14:02:34 +02:00
Nikita Popov	2b160e95c8	Reland [SCEV] Fix and validate ValueExprMap/ExprValueMap consistency Relative to the previous landing attempt, this introduces an additional flag on forgetMemoizedResults() to not remove SCEVUnknown phis from the value map. The invalidation after BECount calculation wants to leave these alone and skips them in its own use-def walk, but we can still end up invalidating them via forgetMemoizedResults() if there is another IR value with the same SCEV. This is intended as a temporary workaround only, and the need for this should go away once the getBackedgeTakenInfo() invalidation is refactored in the spirit of D114263. ----- This adds validation for consistency of ValueExprMap and ExprValueMap, and fixes identified issues: * Addrec construction directly wrote to ValueExprMap in a few places, without updating ExprValueMap. Add a helper to ensures they stay consistent. The adjustment in forgetSymbolicName() explicitly drops the old value from the map, so that we don't rely on it being overwritten. * forgetMemoizedResultsImpl() was dropping the SCEV from ExprValueMap, but not dropping the corresponding entries from ValueExprMap. Differential Revision: https://reviews.llvm.org/D113349	2021-11-27 12:37:15 +01:00
Ard Biesheuvel	da66263b6e	[ARM] implement support for ALU/LDR PC-relative group relocations Currently, LLD does not support the complete set of ARM group relocations. Given that I intend to start using these in the Linux kernel [0], let's add support for these. This implements the group processing as documented in the ELF psABI. Notably, this means support is dropped for very far symbol references that also carry a small component, where the immediate is rotated in such a way that only part of it wraps to the other end of the 32-bit word. To me, it seems unlikely that this is something anyone could be relying on, but of course I could be wrong. [0] https://lore.kernel.org/r/20211122092816.2865873-8-ardb@kernel.org/ Reviewed By: peter.smith, MaskRay Differential Revision: https://reviews.llvm.org/D114172	2021-11-27 10:26:37 +01:00
Bjorn Pettersson	00cf717f51	[test] Use -passes syntax in Feature/OperandBundles lit tests Simply use the new PM syntax instead of the deprecated legacy PM syntax when specifying the opt pipeline in some regression tests. Differential Revision: https://reviews.llvm.org/D114518	2021-11-27 09:52:55 +01:00
Bjorn Pettersson	407600604b	[test] Use -passes in lit tests for the UpdateTestChecks tool The UpdateTestChecks tool itself does not care about which pass manager that is used in the opt invocation. So the lit tests that are verifying the behavior of the UpdateTestChecks tool is updated to use the new-PM syntax (-passes=) when specifying the pass pipeline in the test cases that are used for verifying the UpdateTestChecks tool. Differential Revision: https://reviews.llvm.org/D114517	2021-11-27 09:52:55 +01:00
Bjorn Pettersson	8ebb3eac02	[test] Use -passes syntax when specifying pipeline in some more tests The legacy PM is deprecated, so update a bunch of lit tests running opt to use the new PM syntax when specifying the pipeline. In this patch focus has been put on test cases for ConstantMerge, ConstraintElimination, CorrelatedValuePropagation, GlobalDCE, GlobalOpt, SCCP, TailCallElim and PredicateInfo. Differential Revision: https://reviews.llvm.org/D114516	2021-11-27 09:52:55 +01:00
Hsiangkai Wang	b0c7421524	[RISCV] Emit DWARF location expression for RVV stack objects. VLENB is the length of a vector register in bytes. We use <vscale x 64 bits> to represent one vector register. The dwarf offset is VLENB * scalable_offset / 8. For the mask vector, it occupies one vector register. Differential Revision: https://reviews.llvm.org/D107432	2021-11-27 15:13:10 +08:00
Kazu Hirata	387927bbaf	[Target] Use range-based for loops (NFC)	2021-11-26 21:21:17 -08:00
Stanislav Funiak	a19e163526	Fixed broken build under GCC 5.4. This diff fixes broken build caused by D108550. Under GCC 5, auto lambdas that capture this require `this->` for member calls. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D114659	2021-11-27 09:03:27 +05:30
Fangrui Song	6fa8f7beb1	[ELF][test] Test that .o definition does not inherit .so STV_PROTECTED Test %t2.so %t.o beside %t.o %t2.so	2021-11-26 15:00:10 -08:00
Konstantin Boyarinov	56c8ad237a	[libcxx][NFC] Add tests for associative containers key_comp and value_comp Add missing tests to improve associative containers code coverage: - Tests for key_comp() and value_comp() observers - Tests for std::map and std::multimap value_compare member class Reviewed by: ldionne, rarutyun, #libc Differential Revision: https://reviews.llvm.org/D113998	2021-11-27 01:46:22 +03:00
Fangrui Song	f1ba48d508	[ELF] Simplify Symbol::extract. NFC	2021-11-26 14:10:55 -08:00
Vitaly Buka	8ae815cb19	[sanitizer] Switch StackStore to 8 MiB blocks Larger blocks are more convenient for compressions. Blocks are allocated with MmapNoReserveOrDie to save some memory. Also it's 15% faster on StackDepotBenchmarkSuite Depends on D114464. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D114488	2021-11-26 14:00:10 -08:00
Erik Desjardins	53b00b8215	[InstSimplify] Fold X {lshr,udiv} C <u X --> true for nonzero X, non-identity C This eliminates the bounds check in Rust code like pub fn mid(data: &[i32]) -> i32 { if data.is_empty() { return 0; } return data[data.len()/2]; } (from https://blog.sigplan.org/2021/11/18/undefined-behavior-deserves-a-better-reputation/) Alive proofs: lshr https://alive2.llvm.org/ce/z/nyTu8D udiv https://alive2.llvm.org/ce/z/CNUZH7 Differential Revision: https://reviews.llvm.org/D114279	2021-11-26 16:48:33 -05:00
Patrick Oppenlander	06f1d63cb1	[compiler-rt][CMake] Support powerpcspe build Reviewed By: jhibbits, MaskRay Differential Revision: https://reviews.llvm.org/D98836	2021-11-26 13:29:06 -08:00
Patrick Oppenlander	30029e7407	[compiler-rt][CMake] Support powerpc32 on Linux Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D98831	2021-11-26 13:28:20 -08:00

... 3 4 5 6 7 ...

406005 Commits All Branches Search

406005 Commits

All Branches