llvm-project

Commit Graph

Author	SHA1	Message	Date
serge-sans-paille	bda6e5bee0	[NFC] remove explicit default value for strboolattr attribute in tests Since `d6de1e1a71`, no attributes is quivalent to setting attribute to false. This is a preliminary commit for https://reviews.llvm.org/D99080	2021-05-24 19:31:04 +02:00
Craig Topper	3c0735c6d8	[X86] Call insertDAGNode on trunc/zext created in tryShiftAmountMod. This puts the new nodes in the proper place in the topologically sorted list of nodes. Fixes PR50431, which was introduced recently in D101944.	2021-05-24 10:23:22 -07:00
LLVM GN Syncbot	f55a733506	[gn build] Port `095e91c973`	2021-05-24 17:18:43 +00:00
Vitaly Buka	6435ca4e2b	[NFC][scudo] Small test cleanup Fixing issues raised on D102979 review. Reviewed By: cryptoad Differential Revision: https://reviews.llvm.org/D102994	2021-05-24 10:16:44 -07:00
Jon Roelofs	095e91c973	[Remarks] Add analysis remarks for memset/memcpy/memmove lengths Re-landing now that the crasher this patch previously uncovered has been fixed in: https://reviews.llvm.org/D102935 Differential revision: https://reviews.llvm.org/D102452	2021-05-24 10:10:44 -07:00
Roman Lebedev	c666208f63	[X86][Costmodel] getMaskedMemoryOpCost(): don't scalarize non-power-of-two vectors with legal element type This follows in steps of similar `getMemoryOpCost()` changes, D100099/D100684. Intel SDM, `VPMASKMOV — Conditional SIMD Integer Packed Loads and Stores`: ``` Faults occur only due to mask-bit required memory accesses that caused the faults. Faults will not occur due to referencing any memory location if the corresponding mask bit for that memory location is 0. For example, no faults will be detected if the mask bits are all zero. ``` I.e., if mask is all-zeros, any address is fine. Masked load/store's prime use-case is e.g. tail masking the loop remainder, where for the last iteration, only first some few elements of a vector exist. So much similarly, i don't see why must we scalarize non-power-of-two vectors, iff the element type is something we can masked- store/load. We simply need to legalize it, widen the mask, and be done with it. And we even already count the cost of widening the mask. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D102990	2021-05-24 20:09:54 +03:00
luxufan	d70e9195a3	[RISCV] Optimize getVLENFactoredAmount function. If the local variable `NumOfVReg` isPowerOf2_32(NumOfVReg - 1) or isPowerOf2_32(NumOfVReg + 1), the ADDI and MUL instructions can be replaced with SLLI and ADD(or SUB) instructions. Based on original patch by StephenFan. Reviewed By: frasercrmck, StephenFan Differential Revision: https://reviews.llvm.org/D100577	2021-05-24 10:04:37 -07:00
Markus Böck	d35bd98651	[mlir][doc] Fix links and references in top level docs directory This is the fourth and final patch in a series of patches fixing markdown links and references inside the mlir documentation. This patch combined with the other three should fix almost every broken link on mlir.llvm.org as far as I can tell. This patch in particular addresses all Markdown files in the top level docs directory. Differential Revision: https://reviews.llvm.org/D103032	2021-05-24 18:43:00 +02:00
Jon Roelofs	694068d0db	[Remarks] Look through inttoptr/ptrtoint for -ftrivial-auto-var-init remarks. The crasher is a related problem that @aemerson found broke speck2k6/403.gcc when I landed https://reviews.llvm.org/D102452. It has been reduced & modified to reproduce without that patch. Differential revision: https://reviews.llvm.org/D102935	2021-05-24 09:23:22 -07:00
Adrian Prantl	4cba0a4f11	CoroSplit: Replace ad-hoc implementation of reachability with API from CFG.h The current ad-hoc implementation used to determine whether a basic block is unreachable doesn't work correctly in the general case (for example it won't detect successors of unreachable blocks as unreachable). This patch replaces it with the correct API that uses a DominatorTree to answer the question correctly and quickly. rdar://77181156 Differential Revision: https://reviews.llvm.org/D102963	2021-05-24 09:18:33 -07:00
Steven Wu	0346514984	[llvm] Revert align attr test in test/Bitcode/attribute-3.3.ll Revert testcase changed in D87304 now the upgrader can correctly handle the align attribute in upgrader. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D102880	2021-05-24 09:15:27 -07:00
Suraj Sudhir	1ceff40df0	[mlir][tosa] Align tensor rank specifications with current spec Deconstrains several TOSA operators to align with the current TOSA spec, including all the elementwise ops. Note: some more ops are under consideration for further cleanup; they will follow once the spec has been updated. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D102958	2021-05-24 16:03:42 +00:00
Kostya Kortchinsky	20c1f94220	[scudo] Separate Fuchsia & Default SizeClassMap The Fuchsia allocator config was using the default size class map. This CL gives Fuchsia its own size class map and changes a couple of things in the default one: - make `SizeDelta` configurable in `Config` for a fixed size class map as it currently is for a table size class map; - switch `SizeDelta` to 0 for the default config, it allows for size classes that allow for power of 2s, and overall better wrt pages filling; - increase the max number of caches pointers to 14 in the default, this makes the transfer batch 64/128 bytes on 32/64-bit platforms, which is cache-line friendly (previous size was 48/96 bytes). The Fuchsia size class map remains untouched for now, this doesn't impact Android which uses the table size class map. Differential Revision: https://reviews.llvm.org/D102783	2021-05-24 08:54:08 -07:00
Nikita Popov	e42636d3c1	[CVP] Add additional test for phi common val transform (NFC)	2021-05-24 17:28:38 +02:00
Nikita Popov	a832e83bcb	[LoopUnroll] Add additional trip multiple test (NFC) This uses a trip multiple on a (unique) non-latch exit.	2021-05-24 17:26:07 +02:00
Nikita Popov	971a2ae8b3	[LoopUnroll] Regenerate test checks (NFC)	2021-05-24 17:26:07 +02:00
Ivan Murashko	7f2f0247f8	Remark was added to clang tooling Diagnostic The diff adds Remark to Diagnostic::Level for clang tooling. That makes Remark diagnostic level ready to use in clang-tidy checks: the clang-diagnostic-module-import becomes visible as a part of the change.	2021-05-24 11:21:44 -04:00
Simon Pilgrim	dcaca7206e	[CostModel][X86] Add missing SSE41 v2iX sext/zext costs Also fix existing v4i8->v4i16 sext cost to match the equivalents	2021-05-24 15:53:43 +01:00
Mark de Wever	7b2606b0b6	[libc++][doc] Update format paper status. - Fixes paper number P1862 -> P1868. (The title was correct.) - Marks P1868 as in progress. - Marks P1892 as in progress. - Marks LWG-3327 as nothing to do, since the wording change doesn't impact the code. (Also updated on the general C++20 status page.)	2021-05-24 16:48:44 +02:00
thomasraoux	505933a489	[NVPTX] Fix lowering of frem for negative values to match fmod frem result must have the dividend sign. Previous implementation had the wrong sign when passing negative numbers. For ex: frem(-16, 7) was returning 5 instead of -2. We should just a ftrunc instead of floor when lowering to get the right behavior. Differential Revision: https://reviews.llvm.org/D102528	2021-05-24 07:45:03 -07:00
Simon Pilgrim	60b33ebe8b	[CostModel][X86] Regenerate sse-itoi.ll test checks	2021-05-24 15:41:01 +01:00
Sanjay Patel	a0e71f1832	[ConstProp] propagate poison from vector reduction element(s) to result This follows from the underlying logic for binops and min/max. Although it does not appear that we handle this for min/max intrinsics currently. https://alive2.llvm.org/ce/z/Kq9Xnh	2021-05-24 10:34:40 -04:00
Sanjay Patel	3dd2063671	[ConstProp] add tests for vector reductions with poison elements; NFC	2021-05-24 10:34:40 -04:00
Florian Hahn	65d3dd7c88	[VPlan] Add first VPlan version of sinkScalarOperands. This patch adds a first VPlan-based implementation of sinking of scalar operands. The current version traverse a VPlan once and processes all operands of a predicated REPLICATE recipe. If one of those operands can be sunk, it is moved to the block containing the predicated REPLICATE recipe. Continue with processing the operands of the sunk recipe. The initial version does not re-process candidates after other recipes have been sunk. It also cannot partially sink induction increments at the moment. The VPlan only contains WIDEN-INDUCTION recipes and if the induction is used for example in a GEP, only the first lane is used and in the lowered IR the adds for the other lanes can be sunk into the predicated blocks. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D100258	2021-05-24 15:29:58 +01:00
Raphael Isemann	5d7c1d8f33	[lldb] Readd deleted variable in the sample test In D102771 wanted to make `test_var` global to demonstrate the a no-launch test, but the old variable is still needed for another test. This just creates the global var with a different name to demonstrate the no-launch functionality.	2021-05-24 16:29:25 +02:00
Raphael Isemann	54c2687292	[lldb] Introduce createTestTarget for creating a valid target in API tests At the moment nearly every test calls something similar to `self.dbg.CreateTarget(self.getBuildArtifact("a.out"))` and them sometimes checks if the created target is actually valid with something like `self.assertTrue(target.IsValid(), "some useless text")`. Beside being really verbose the error messages generated by this pattern are always just indicating that the target failed to be created but now why. This patch introduces a helper function `createTestTarget` to our Test class that creates the target with the much more verbose `CreateTarget` overload that gives us back an SBError (with a fancy error). If the target couldn't be created the function prints out the SBError that LLDB returned and asserts for us. It also defaults to the "a.out" build artifact path that nearly all tests are using to avoid to hardcode "a.out" in every test. I converted a bunch of tests to the new function but I'll do the rest of the test suite as follow ups. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D102771	2021-05-24 16:18:44 +02:00
Raphael Isemann	42a9c0c80c	[lldb] Reland "Fix UB in half2float" to fix the ubsan bot. This relands part of the UB fix in `4b074b49be`. The original commit also added some additional tests that uncovered some other issues (see D102845). I landed all the passing tests in `48780527dd` and this patch is now just fixing the UB in half2float. See D102846 for a proposed rewrite of the function. Original commit message: The added DumpDataExtractorTest uncovered that this is lshifting a negative integer which upsets ubsan and breaks the sanitizer bot. This patch just changes the variable we shift to be unsigned.	2021-05-24 15:23:32 +02:00
Anastasia Stulova	5ccc79dc38	[OpenCL][Docs] Minor update to OpenCL 3.0	2021-05-24 14:19:22 +01:00
Simon Pilgrim	1ad4f887bd	[CostModel][X86] Improve accuracy of vector non-uniform shift costs on XOP/AVX2 targets By llvm-mca analysis, Haswell/Broadwell has a non-uniform vector shift recip-throughput cost of the AVX2 targets at 2 for both 128 and 256-bit vectors - XOP capable targets have better 128-bit vector shifts so improve the fallback in those cases.	2021-05-24 14:18:21 +01:00
Florian Hahn	d251d6f812	[VectorCombine] Fix load extract scalarization tests with assumes. The input IR for @load_extract_idx_var_i64_known_valid_by_assume and @load_extract_idx_var_i64_not_known_valid_by_assume_after_load has been swapped. This patch fixes the test so that @load_extract_idx_var_i64_known_valid_by_assume has the assume before the load and the other test has it after.	2021-05-24 13:14:13 +01:00
Florian Hahn	e9d97d7d9d	[VPlan] Add mayReadOrWriteMemory & friends. This patch adds initial implementation of mayReadOrWriteMemory, mayReadFromMemory and mayWriteToMemory to VPRecipeBase. Used by D100258.	2021-05-24 13:11:32 +01:00
Anastasia Stulova	626e9641a2	[OpenCL] Fix test by adding SPIR triple	2021-05-24 13:03:50 +01:00
Bradley Smith	e40513252a	[AArch64][SVE] Add fixed length codegen for FP_ROUND/FP_EXTEND Depends on D102498 Differential Revision: https://reviews.llvm.org/D102607	2021-05-24 13:02:30 +01:00
Bradley Smith	4bc14be259	[AArch64][SVE] Improve codegen for fixed length vector concat Differential Revision: https://reviews.llvm.org/D102498	2021-05-24 12:56:02 +01:00
Anastasia Stulova	237c6924bd	[OpenCL] Add clang extension for bit-fields. Allow use of bit-fields as a clang extension in OpenCL. The extension can be enabled using pragma directives. This fixes PR45339! Differential Revision: https://reviews.llvm.org/D101843	2021-05-24 12:42:17 +01:00
David Green	543406a69b	[ARM] Allow findLoopPreheader to return headers with multiple loop successors The findLoopPreheader function will currently not find a preheader if it branches to multiple different loop headers. This patch adds an option to relax that, allowing ARMLowOverheadLoops to process more loops successfully. This helps with WhileLoopStart setup instructions that can branch/fallthrough to the low overhead loop and to branch to a separate loop from the same preheader (but I don't believe it is possible for both loops to be low overhead loops). Differential Revision: https://reviews.llvm.org/D102747	2021-05-24 12:22:15 +01:00
Florian Hahn	4e8c28b6fb	Recommit "[VectorCombine] Scalarize vector load/extract." This reverts commit `94d54155e2`. This fixes a sanitizer failure by moving scalarizeLoadExtract(I) before foldSingleElementStore(I), which may remove instructions.	2021-05-24 11:35:07 +01:00
David Green	53c42f7700	[ARM] Ensure WLS preheader blocks have branches during memcpy lowering This makes sure that the blocks created for lowering memcpy to loops end up with branches, even if they fall through to the successor. Otherwise IfCvt is getting confused with unanalyzable branches and creating invalid block layouts. The extra branches should be removed as the tail predicated loop is finalized in almost all cases.	2021-05-24 11:26:45 +01:00
David Green	6cc78b9245	[ARM] Fix inline memcpy trip count sequence The trip count for a memcpy/memset will be n/16 rounded up to the nearest integer. So (n+15)>>4. The old code was including a BIC too, to clear one of the bits, which does not seem correct. This remove the extra BIC. Note that ideally this would never actually be generated, as in the creation of a tail predicated loop we will DCE that setup code, letting the WLSTP perform the trip count calculation. So this doesn't usually come up in testing (and apparently the ARMLowOverheadLoops pass does not do any sort of validation on the tripcount). Only if the generation of the WLTP fails will it use the incorrect BIC instructions. Differential Revision: https://reviews.llvm.org/D102629	2021-05-24 11:01:58 +01:00
Uday Bondhugula	587408c199	[MLIR] Drop old cmake var names Drop old cmake variable names that were kept around so that zorg buildbot could be migrated, which has now happened (D102977). D102976 had fixed the inconsistent names. Differential Revision: https://reviews.llvm.org/D102997	2021-05-24 15:30:01 +05:30
Fraser Cormack	7a211ed110	[RISCV] Prevent store combining from infinitely looping RVV code generation does not successfully custom-lower BUILD_VECTOR in all cases. When it resorts to default expansion it may, on occasion, be expanded to scalar stores through the stack. Unfortunately these stores may then be picked up by the post-legalization DAGCombiner which merges them again. The merged store uses a BUILD_VECTOR which is then expanded, and so on. This patch addresses the issue by overriding the `mergeStoresAfterLegalization` hook. A lack of granularity in this method (being passed the scalar type) means we opt out in almost all cases when RVV fixed-length vector support is enabled. The only exception to this rule are mask vectors, which are always either custom-lowered or are expanded to a load from a constant pool. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D102913	2021-05-24 10:19:32 +01:00
James Henderson	5c4a5daf29	[debuginfo-tests] Stop using installed LLDB and remove redundancy The removed code just replicated what use_llvm_tool does, plus looked for an installed LLDB on the PATH to use. In a monorepo world, it seems likely that if people want to run the tests that require LLDB, they should enable and build LLDB itself. If users really want to use the installed LLDB executable, they can specify the path to the executable as an environment variable "LLDB". See the discussion in https://reviews.llvm.org/D95339#2638619 for more details. Reviewed by: jmorse, aprantl Differential Revision: https://reviews.llvm.org/D102680	2021-05-24 10:16:36 +01:00
Roman Lebedev	32bee42719	[NFCI][LoopIdiom] 'left-shift until bittest': assert that BaseX is loop-invariant Given that BaseX is an incoming value when coming from the preheader, it should be loop-invariant, but let's just document this assumption.	2021-05-24 12:15:06 +03:00
Roman Lebedev	aa3dac95ed	[LoopIdiom] 'logical right shift until zero': the value must be loop-invariant As per the reproducer provided by Mikael Holmén in post-commit review.	2021-05-24 12:15:06 +03:00
Thorsten Schütt	0f140ce33d	flang: include limits	2021-05-24 11:12:12 +02:00
Florian Hahn	94d54155e2	Revert "[VectorCombine] Scalarize vector load/extract." This reverts commit `86497785d5`. One of the tests causes an ASAN failure. https://lab.llvm.org/buildbot/#/builders/5/builds/7927/steps/12/logs/stdio	2021-05-24 10:11:00 +01:00
Simon Pilgrim	243e588681	[CostModel][X86] Improve accuracy of vXi64 MUL costs on AVX2/AVX512 targets By llvm-mca analysis, Haswell/Broadwell has the worst v4i64 recip-throughput cost of the AVX2 targets at 6 (vs the currently used cost of 8). Similarly SkylakeServer (our only AVX512 target model) implements PMULLQ with an average cost of 1.5 (rounded up to 2.0), and the PMULUDQ-sequence (without AVX512DQ) as a cost of 6.	2021-05-24 09:48:32 +01:00
Pushpinder Singh	486110eb41	[AMDGPU][Libomptarget] Remove global KernelNameMap KernelNameMap contains entries like "key.kd" => key which clearly could be replaced by simple logic of removing suffix from the key. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D102691	2021-05-24 08:46:08 +00:00
Chen Zheng	486d6d2b8e	[Debug-Info]update section name to match AIX behaviour; nfc	2021-05-24 04:33:41 -04:00
Florian Hahn	86497785d5	[VectorCombine] Scalarize vector load/extract. This patch adds a new combine that tries to scalarize chains of `extractelement (load %ptr), %idx` to `load (gep %ptr, %idx)`. This is profitable when extracting only a few elements out of a large vector. At the moment, `store (extractelement (load %ptr), %idx), %ptr` operations on large vectors result in huge code in the backend. This can easily be triggered by using the matrix extension, e.g. https://clang.godbolt.org/z/qsccPdPf4 This should complement D98240. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D100273	2021-05-24 09:29:08 +01:00

1 2 3 4 5 ...

389273 Commits All Branches Search

389273 Commits

All Branches