llvm-project

Commit Graph

Author	SHA1	Message	Date
Hans Wennborg	2bc57d85eb	Don't override __attribute__((no_stack_protector)) by inlining (PR52886) Since `26c6a3e736`, LLVM's inliner will "upgrade" the caller's stack protector attribute based on the callee. This lead to surprising results with Clang's no_stack_protector attribute added in `4fbf84c173` (D46300). Consider the following code compiled with clang -fstack-protector-strong -Os (https://godbolt.org/z/7s3rW7a1q). extern void h(int* p); inline __attribute__((always_inline)) int g() { return 0; } int __attribute__((__no_stack_protector__)) f() { int a[1]; h(a); return g(); } LLVM will inline g() into f(), and f() would get a stack protector, against the users explicit wishes, potentially breaking the program e.g. if h() changes the value of the stack cookie. That's a miscompile. More recently, `bc044a88ee` (D91816) addressed this problem by preventing inlining when the stack protector is disabled in the caller and enabled in the callee or vice versa. However, the problem remained if the callee is marked always_inline as in the example above. This affected users, see e.g. http://crbug.com/1274129 and http://llvm.org/pr52886. One way to fix this would be to prevent inlining also in the always_inline case. Despite the name, always_inline does not guarantee inlining, so this would be legal but potentially surprising to users. However, I think the better fix is to not enable the stack protector in a caller based on the callee. The motivation for the old behaviour is unclear, it seems counter-intuitive, and causes real problems as we've seen. This commit implements that fix, which means in the example above, g() gets inlined into f() (also without always_inline), and f() is emitted without stack protector. I think that matches most developers' expectations, and that's also what GCC does. Another effect of this change is that a no_stack_protector function can now be inlined into a stack protected function, e.g. (https://godbolt.org/z/hafP6W856): extern void h(int* p); inline int __attribute__((__no_stack_protector__)) __attribute__((always_inline)) g() { return 0; } int f() { int a[1]; h(a); return g(); } I think that's fine. Such code would be unusual since no_stack_protector is normally applied to a program entry point which sets up the stack canary. And even if such code exists, inlining doesn't change the semantics: there is still no stack cookie setup/check around entry/exit of the g() code region, but there may be in the surrounding context, as there was before inlining. This also matches GCC. See also the discussion at https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94722 Differential revision: https://reviews.llvm.org/D116589	2022-01-13 12:04:49 +01:00
Simon Moll	33efbc8184	[VP] llvm.vp.merge intrinsic and LangRef llvm.vp.merge interprets the %evl operand differently than the other vp intrinsics: all lanes at positions greater or equal than the %evl operand are passed through from the second vector input. Otherwise it behaves like llvm.vp.select. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D116725	2022-01-12 14:06:56 +01:00
Phoebe Wang	1bb0caf561	[X86][MS] Change the alignment of f80 to 16 bytes on Windows 32bits to match with ICC MSVC currently doesn't support 80 bits long double. ICC supports it when the option `/Qlong-double` is specified. Changing the alignment of f80 to 16 bytes so that we can be compatible with ICC's option. Reviewed By: rnk, craig.topper Differential Revision: https://reviews.llvm.org/D115942	2022-01-12 17:50:37 +08:00
David Sherwood	51497dc0b2	[IR] Change vector.splice intrinsic to reject out-of-bounds indices I've changed the definition of the experimental.vector.splice instrinsic to reject indices that are known to be or possibly out-of-bounds. In practice, this means changing the definition so that the index is now only valid in the range [-VL, VL-1] where VL is the known minimum vector length. We use the vscale_range attribute to take the minimum vscale value into account so that we can permit more indices when the attribute is present. The splice intrinsic is currently only ever generated by the vectoriser, which will never attempt to splice vectors with out-of-bounds values. Changing the definition also makes things simpler for codegen since we can always assume that the index is valid. This patch was created in response to review comments on D115863 Differential Revision: https://reviews.llvm.org/D115933	2022-01-11 09:37:39 +00:00
Serge Guelton	d2cc6c2d0c	Use a sorted array instead of a map to store AttrBuilder string attributes Using and std::map<SmallString, SmallString> for target dependent attributes is inefficient: it makes its constructor slightly heavier, and involves extra allocation for each new string attribute. Storing the attribute key/value as strings implies extra allocation/copy step. Use a sorted vector instead. Given the low number of attributes generally involved, this is cheaper, as showcased by https://llvm-compile-time-tracker.com/compare.php?from=5de322295f4ade692dc4f1823ae4450ad3c48af2&to=05bc480bf641a9e3b466619af43a2d123ee3f71d&stat=instructions Differential Revision: https://reviews.llvm.org/D116599	2022-01-10 14:49:53 +01:00
Nikita Popov	2c0fb96254	[TypeFinder] Support opaque pointers We need to explicitly visit a number of types, as these are no longer reachable through the pointer type if opaque pointers are enabled. This is similar to ValueEnumerator changes that have been done previously.	2022-01-10 14:46:45 +01:00
Kazu Hirata	b932bdf59f	[llvm] Remove redundant member initialization (NFC) Identified with readability-redundant-member-init.	2022-01-07 17:45:09 -08:00
Nikita Popov	e4d1779990	[IR] Add ConstraintInfo::hasArg() helper (NFC) Checking whether a constraint corresponds to an argument is a recurring pattern.	2022-01-07 10:44:38 +01:00
Nikita Popov	bec726f5d2	[Verifier] Enforce elementtype attr for inline asm indirect constraints This enforces the LangRef change from D116531 in the Verifier, now that clang and tests have been updated.	2022-01-06 15:22:00 +01:00
Nikita Popov	c41aa41957	[ConstFold] Add missing check for inbounds gep If the gep is not inbounds, then the gep might compute a null value even if the base pointer is non-null.	2022-01-06 09:59:40 +01:00
Nikita Popov	32808cfb24	[IR] Track users of comdats Track all GlobalObjects that reference a given comdat, which allows determining whether a function in a comdat is dead without scanning the whole module. In particular, this makes filterDeadComdatFunctions() have complexity O(#DeadFunctions) rather than O(#SymbolsInModule), which addresses half of the compile-time issue exposed by D115545. Differential Revision: https://reviews.llvm.org/D115864	2022-01-06 09:13:58 +01:00
Luís Ferreira	34435fd105	[llvm] Add support for DW_TAG_immutable_type Added documentation about DW_TAG_immutable_type too. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D113633	2022-01-05 19:17:08 +00:00
Philip Reames	c16fd6a376	Rename doesNotReadMemory to onlyWritesMemory globally [NFC] The naming has come up as a source of confusion in several recent reviews. onlyWritesMemory is consist with onlyReadsMemory which we use for the corresponding readonly case as well.	2022-01-05 08:52:55 -08:00
Nikita Popov	6c031780aa	[ConstantFold] Remove another incorrect icmp of gep fold This folded (null + X) == g to false, but of course this is incorrect if X == g. Possibly this got confused with the null == g case, which is already handled elsewhere.	2022-01-04 16:08:09 +01:00
serge-sans-paille	9290ccc3c1	Introduce the AttributeMask class This class is solely used as a lightweight and clean way to build a set of attributes to be removed from an AttrBuilder. Previously AttrBuilder was used both for building and removing, which introduced odd situation like creation of Attribute with dummy value because the only relevant part was the attribute kind. Differential Revision: https://reviews.llvm.org/D116110	2022-01-04 15:37:46 +01:00
Nikita Popov	d74212987b	[ConstantFold] Remove unnecessary bounded index restriction The fold for merging a GEP of GEP into a single GEP currently bails if doing so would result in notional overindexing. The justification given in the comment above this check is dangerously incorrect: GEPs with notional overindexing are perfectly fine, and if some code treats them incorrectly, then that code is broken, not the GEP. Such a GEP might legally appear in source IR, so only preventing its creation cannot be sufficient. (The constant folder also ends up canonicalizing the GEP to remove the notional overindexing, but that's neither here nor there.) This check dates back to `bd4fef4a89`, and as far as I can tell the original issue this was trying to patch around has since been resolved. Differential Revision: https://reviews.llvm.org/D116587	2022-01-04 15:23:09 +01:00
Nikita Popov	1379eb5776	[ConstFold] Slightly clean up icmp of two geps fold (NFC) As we're only dealing with one type of constant expression here, try to directly cast to GEPOperator.	2022-01-04 12:33:38 +01:00
Nikita Popov	75db002725	[ConstantFold] Remove another incorrect icmp of GEP fold This fold is not correct, because indices might evaluate to zero even if they are not a literal zero integer. Additionally, this fold would be wrong (in the general case) for non-i8 types as well, due to index overflow. Drop this fold and instead let the target-dependent constant folder compute the actual offset and fold the comparison based on that.	2022-01-04 12:27:40 +01:00
Nikita Popov	8484bab9cd	[LangRef] Require elementtype attribute for indirect inline asm operands Indirect inline asm operands may require the materialization of a memory access according to the pointer element type. As this will no longer be available with opaque pointers, we require it to be explicitly annotated using the elementtype attribute, for example: define void @test(i32* %p, i32 %x) { call void asm "addl $1, $0", "=rm,r"(i32 elementtype(i32) %p, i32 %x) ret void } This patch only includes the LangRef change and Verifier updates to allow adding the elementtype attribute in this position. It does not yet enforce this, as this will require changes on the clang side (and test updates) first. Something I'm a bit unsure about is whether we really need the elementtype for all indirect constraints, rather than only indirect register constraints. I think indirect memory constraints might not strictly need it (though the backend code is written in a way that does require it). I think it's okay to just make this a general requirement though, as this means we don't need to carefully deal with multiple or alternative constraints. In addition, I believe that MemorySanitizer benefits from having the element type even in cases where it may not be strictly necessary for normal lowering (`cd2b050fa4/llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp (L4066)`). Differential Revision: https://reviews.llvm.org/D116531	2022-01-04 10:02:06 +01:00
Kazu Hirata	e5947760c2	Revert "[llvm] Remove redundant member initialization (NFC)" This reverts commit `fd4808887e`. This patch causes gcc to issue a lot of warnings like: warning: base class ‘class llvm::MCParsedAsmOperand’ should be explicitly initialized in the copy constructor [-Wextra]	2022-01-03 11:28:47 -08:00
Fraser Cormack	d762794040	[IR] Allow the 'align' param attr on vectors of pointers This patch extends the available uses of the 'align' parameter attribute to include vectors of pointers. The attribute specifies pointer alignment element-wise. This change was previously requested and discussed in D87304. The vector predication (VP) intrinsics intend to use this for scatter and gather operations, as they lack the explicit alignment parameter that the masked versions use. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D115161	2022-01-03 12:32:46 +00:00
Nikita Popov	127d955441	[ConstantFold] Drop unused function (NFC) isMaybeZeroSizeType() is no longer used after `5afbfe33e7`.	2022-01-03 10:14:52 +01:00
Nikita Popov	5afbfe33e7	[ConstantFold] Make icmp of gep fold offset based We can fold an equality or unsigned icmp between base+offset1 and base+offset2 with inbounds offsets by comparing the offsets directly. This replaces a pair of specialized folds that tried to reason based on the GEP structure instead. One of those folds was plain wrong (because it does not account for negative offsets), while the other is unnecessarily complicated and limited (e.g. it will fail with bitcasts involved). The disadvantage of this change is that it requires data layout, so the fold is no longer performed by datalayout-independent constant folding. I don't think this is a loss in practice, but it does regress the ConstantExprFold.ll test, which checks folding without running any passes. Differential Revision: https://reviews.llvm.org/D116332	2022-01-03 09:41:37 +01:00
Kazu Hirata	fd4808887e	[llvm] Remove redundant member initialization (NFC) Identified with readability-redundant-member-init.	2022-01-01 16:18:18 -08:00
Serge Pavlov	ecfd9196d5	[ConstantFolding] Use ICmpInst::Predicate instead of plain integer The function `ConstantFoldCompareInstruction` uses `unsigned short` to represent compare predicate, although all usesrs of the respective include file use definition of CmpInst also. This change replaces predicate argument type in this function to `ICmpInst::Predicate`, which allows to make code a bit clearer and simpler. No functional changes. Differential Revision: https://reviews.llvm.org/D116379	2021-12-30 14:31:44 +07:00
Kazu Hirata	5a667c0e74	[llvm] Use nullptr instead of 0 (NFC) Identified with modernize-use-nullptr.	2021-12-28 08:52:25 -08:00
Nikita Popov	23de66d163	[ConstFold] Don't fold signed comparison of gep of global An inbounds GEP may still cross the sign boundary, so signed icmps cannot be folded (https://alive2.llvm.org/ce/z/XSgi4D). This was previously fixed for other folds in this function, but this one was missed.	2021-12-28 14:13:33 +01:00
Shao-Ce SUN	ec501f15a8	[clang][CodeGen] Remove the signed version of createExpression Fix a TODO. Remove the callers of this signed version and delete. Reviewed By: CodaFi Differential Revision: https://reviews.llvm.org/D116014	2021-12-27 14:16:08 +08:00
Serge Pavlov	d86e2cc2e3	[NFC] Method for evaluation of FCmpInst for constant operands New method `FCmpInst::compare` is added, which evaluates the given compare predicate for constant operands. Interface is made similar to `ICmpInst::compare`. Differential Revision: https://reviews.llvm.org/D116168	2021-12-25 17:37:38 +07:00
Kazu Hirata	2d303e6781	Remove redundant return and continue statements (NFC) Identified with readability-redundant-control-flow.	2021-12-24 23:17:54 -08:00
Kazu Hirata	9c0a4227a9	Use Optional::getValueOr (NFC)	2021-12-24 20:57:40 -08:00
Florian Hahn	5d68dc184e	[Verifier] Iteratively traverse all indirect users. The recursive implementation can run into stack overflows, e.g. like in PR52844. The order the users are visited changes, but for the current use case this only impacts the order error messages are emitted.	2021-12-23 23:20:12 +01:00
Kazu Hirata	500c4b68dc	[llvm] Construct SmallVector with iterator ranges (NFC)	2021-12-20 23:43:24 -08:00
Sami Tolvanen	5dc8aaac39	[llvm][IR] Add no_cfi constant With Control-Flow Integrity (CFI), the LowerTypeTests pass replaces function references with CFI jump table references, which is a problem for low-level code that needs the address of the actual function body. For example, in the Linux kernel, the code that sets up interrupt handlers needs to take the address of the interrupt handler function instead of the CFI jump table, as the jump table may not even be mapped into memory when an interrupt is triggered. This change adds the no_cfi constant type, which wraps function references in a value that LowerTypeTestsModule::replaceCfiUses does not replace. Link: https://github.com/ClangBuiltLinux/linux/issues/1353 Reviewed By: nickdesaulniers, pcc Differential Revision: https://reviews.llvm.org/D108478	2021-12-20 12:55:32 -08:00
Serge Guelton	9cd55c7c34	Prevent copy of AttrBuilder It's a relatively heavy data structure, make sure it's not copied. Differential Revision: https://reviews.llvm.org/D116034	2021-12-20 10:33:32 -05:00
Nikita Popov	6e30cb7673	[Attributes] Add AttributeList ctor from AttributeSet (NFC) It was already possible to create an AttributeList from an Index and an AttributeSet. However, this would actually end up using the implicit constructor on AttrBuilder, thus doing an unnecessary conversion from AttributeSet to AttrBuilder to AttributeSet. Instead we can accept the AttributeSet directly, as that is what we need anyway.	2021-12-20 11:37:01 +01:00
Nikita Popov	65777addbd	[llvm-c] Accept GEP operators in some APIs As requested in D115787, I've added a test for LLVMConstGEP2 and LLVMConstInBoundsGEP2. However, to make this work in the echo test, I also had to change a couple of APIs to work on GEP operators, rather than only GEP instructions. Differential Revision: https://reviews.llvm.org/D115858	2021-12-17 08:54:18 +01:00
Nikita Popov	68cb111f3a	[llvm-c] Make LLVMConstGEP/LLVMConstInBoundsGEP opaque pointer compatible Weirdly, the opaque pointer compatible variants LLVMConstGEP2 and LLVMConstInBoundsGEP2 were already declared in the header, but not actually implemented. This adds the missing implementations and deprecates the incompatible functions. Differential Revision: https://reviews.llvm.org/D115787	2021-12-16 09:38:52 +01:00
Yuanfang Chen	ebf65d4842	[Verifier] Make error message precise about which variable is being diagnosed. NFCI.	2021-12-15 16:05:31 -08:00
Arthur Eubanks	5a81a60391	[NFC] Remove more calls to getAlignment() These are deprecated and should be replaced with getAlign(). Some of these asserts don't do anything because Load/Store/AllocaInst never have a 0 align value.	2021-12-15 14:40:57 -08:00
Mingming Liu	09a704c5ef	[LTO] Ignore unreachable virtual functions in WPD in hybrid LTO. Differential Revision: https://reviews.llvm.org/D115492	2021-12-14 20:18:04 +00:00
Philip Reames	423f19680a	Add FMF to hasPoisonGeneratingFlags/dropPoisonGeneratingFlags These flags are documented as generating poison values for particular input values. As such, we should really be consistent about their handling with how we handle nsw/nuw/exact/inbounds. Differential Revision: https://reviews.llvm.org/D115460	2021-12-14 08:43:00 -08:00
Nikita Popov	6213f1dd03	[IR] Make VPIntrinsic::getDeclarationForParams() opaque pointer compatible The vp.load and vp.gather intrinsics require the intrinsic return type to determine the correct function signature. With opaque pointers, it cannot be derived from the parameter pointee types. Differential Revision: https://reviews.llvm.org/D115632	2021-12-14 14:20:59 +01:00
Augie Fackler	b575405cc3	Verifier: accept enums as scopes Rust allows enums to be scopes, as shown by the previous change. Sadly, D111770 disallowed enums-as-scopes in the LLVM Verifier, which means that LLVM HEAD stopped working for Rust compiles. As a result, we back out the verifier part of D111770 with a modification to the testcase so we don't break this in the future. The testcase is now actual IR from rustc at commit 8f8092cc3, which is the nightly as of 2021-09-28. I would expect rustc 1.57 to produce similar or identical IR if someone wants to reproduce this IR in the future with minimal changes. A recipe for reproducing the IR using rustc is included in the test file. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D115353	2021-12-10 12:19:56 -08:00
Nikita Popov	1d1e29ba6c	[IR] Extract method to get single GEP index from offset (NFC) This exposes the core logic of getGEPIndicesForOffset() as a getGEPIndexForOffset() method that only returns a single offset, instead of following the whole chain.	2021-12-10 17:22:46 +01:00
Sameer Sahasrabuddhe	1d0244aed7	Reapply CycleInfo: Introduce cycles as a generalization of loops Reverts `02940d6d22`. Fixes breakage in the modules build. LLVM loops cannot represent irreducible structures in the CFG. This change introduce the concept of cycles as a generalization of loops, along with a CycleInfo analysis that discovers a nested hierarchy of such cycles. This is based on Havlak (1997), Nesting of Reducible and Irreducible Loops. The cycle analysis is implemented as a generic template and then instatiated for LLVM IR and Machine IR. The template relies on a new GenericSSAContext template which must be specialized when used for each IR. This review is a restart of an older review request: https://reviews.llvm.org/D83094 Original implementation by Nicolai Hähnle <nicolai.haehnle@amd.com>, with recent refactoring by Sameer Sahasrabuddhe <sameer.sahasrabuddhe@amd.com> Differential Revision: https://reviews.llvm.org/D112696	2021-12-10 14:36:43 +05:30
Arthur Eubanks	f5687e0fd0	[NFC] Use getAlign() instead of getAlignment() in haveSameSpecialState() getAlignment() is deprecated.	2021-12-09 13:19:42 -08:00
Kazu Hirata	ccdd5bb2c2	[llvm] Use range-based for loops (NFC)	2021-12-09 09:37:29 -08:00
Arthur Eubanks	1172712f46	[NFC] Replace some deprecated getAlignment() calls with getAlign() Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D115370	2021-12-09 08:43:19 -08:00
Arthur Eubanks	cd11312607	[NFC][Verifier] Remove checks for atomic loads/stores that alignment is non-zero The alignment is never 0 since getAlign() returns 1 << bits. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D115388	2021-12-08 23:17:08 -08:00

1 2 3 4 5 ...

5146 Commits