llvm-project

Commit Graph

Author	SHA1	Message	Date
Carl Ritson	976f3b3c9e	[AMDGPU] Only allow implicit WQM in pixel shaders Implicit derivatives are only valid in pixel shaders, hence only implicitly enable WQM for pixel shaders. This avoids unintended WQM in other shader types (e.g. compute) when image sampling instructions are used. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D114414	2021-11-24 20:04:42 +09:00
David Green	581f837355	[ARM] Fold (fadd x, (vselect c, y, -1.0)) into (vselect c, (fadd x, y), x) This is similar to D113574, but as a DAG combine, not tablegen patterns. Doing the fold as a DAG combine allows the fadd to be folded with a fmul, finally producing a predicated vfma. It performs the same fold of fadd(x, vselect(p, y, -0.0)) to vselect p, (fadd x, y), x) using -0.0 as the identity value of a fadd. Differential Revision: https://reviews.llvm.org/D113584	2021-11-24 10:41:00 +00:00
Matthias Springer	ca9d149e07	[mlir][linalg][bufferize][NFC] Move vector interface impl to new build target This makes ComprehensiveBufferize entirely independent of the vector dialect. Differential Revision: https://reviews.llvm.org/D114218	2021-11-24 19:36:12 +09:00
David Sherwood	cf40ca026f	[NFC] Tidy up SelectionDAGBuilder::visitIntrinsicCall to use existing sdl debug loc In quite a few places we were calling getCurSDLoc() to get the debug location, but this is already a local variable `sdl`. Differential Revision: https://reviews.llvm.org/D114447	2021-11-24 10:35:49 +00:00
Jeremy Morse	b8f68ad9cd	[DebugInfo][InstrRef] Avoid crash when values optimised out late in sdag It appears that we can emit all the instructions for a function, including debug instructions, and then optimise some of the values out late. Specifically, in the attached test case, an argument gets optimised out after DBG_VALUE / DBG_INSTR_REFs are created. This confuses MachineFunction::finalizeDebugInstrRefs, which expects to be able to find a defining instruction, and crashes instead. Fix this by identifying when there's no defining instruction, and translating that instead into a DBG_VALUE $noreg. Differential Revision: https://reviews.llvm.org/D114476	2021-11-24 10:34:48 +00:00
David Green	d9af9c2c5a	[ARM] Fold floating point select(binop) patterns Similar to D84091 which added extra predicated folds for integer operations using the identity element of the operation, this adds them for floating point operations for the form `BinOp(x, select(p, y, Identity))`. They are folded back to predicated versions of the operator, with fadd having the identity -0.0, fsub using the identity 0.0 and fmul using 1.0. Differential Revision: https://reviews.llvm.org/D113574	2021-11-24 10:22:20 +00:00
Dmitry Vyukov	764b35d89f	tsan: extend mmap test Test size larger than clear_shadow_mmap_threshold, which is handled differently. Depends on D114348. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D114366	2021-11-24 10:57:21 +01:00
David Green	734e2386ff	[ARM] Add fma and update fadd/fmul predicated select tests. NFC	2021-11-24 09:51:33 +00:00
mydeveloperday	93fc91610f	[clang-format] NFC - recent changes caused clang-format to no longer be clang-formatted. The following 2 commits caused files in clang-format to no longer be clang-formatted. we would lose our "clean" status https://releases.llvm.org/13.0.0/tools/clang/docs/ClangFormattedStatus.html `c2271926a4` - Make clang-format fuzz through Lexing with asserts enabled (https://github.com/llvm/llvm-project/commit/c2271926a4fc ) `84bf5e3286` - Fix various problems found by fuzzing. (https://github.com/llvm/llvm-project/commit/84bf5e328664) Reviewed By: HazardyKnusperkeks, owenpan Differential Revision: https://reviews.llvm.org/D114430	2021-11-24 09:45:32 +00:00
Matthias Springer	bb273a35a0	[mlir][linalg][bufferize][NFC] Move tensor interface impl to new build target This makes ComprehensiveBufferize entirely independent of the tensor dialect. Differential Revision: https://reviews.llvm.org/D114217	2021-11-24 18:25:17 +09:00
Florian Hahn	8ef460fc51	[llvm-reduce] Add parallel chunk processing. This patch adds parallel processing of chunks. When reducing very large inputs, e.g. functions with 500k basic blocks, processing chunks in parallel can significantly speed up the reduction. To allow modifying clones of the original module in parallel, each clone needs their own LLVMContext object. To achieve this, each job parses the input module with their own LLVMContext. In case a job successfully reduced the input, it serializes the result module as bitcode into a result array. To ensure parallel reduction produces the same results as serial reduction, only the first successfully reduced result is used, and results of other successful jobs are dropped. Processing resumes after the chunk that was successfully reduced. The number of threads to use can be configured using the -j option. It defaults to 1, which means serial processing. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113857	2021-11-24 09:23:52 +00:00
Pavel Labath	6f82264dbb	[lldb/gdb-remote] Remove more non-stop mode remnants The read thread handling is completely dead code now that non-stop mode no longer exists.	2021-11-24 10:00:43 +01:00
Rosie Sumpter	df32a39dd0	[LoopVectorize][CostModel] Update cost model for fmuladd intrinsic This patch updates the cost model for ordered reductions so that a call to the llvm.fmuladd intrinsic is modelled as a normal fmul instruction plus the cost of an ordered fadd reduction. Differential Revision: https://reviews.llvm.org/D111630	2021-11-24 08:50:05 +00:00
Rosie Sumpter	2d33327f9d	[LoopVectorize] Print fast-math flags for VPReductionRecipe	2021-11-24 08:50:05 +00:00
Rosie Sumpter	991074012a	[LoopVectorize] Propagate fast-math flags for VPInstruction In-loop vector reductions which use the llvm.fmuladd intrinsic involve the creation of two recipes; a VPReductionRecipe for the fadd and a VPInstruction for the fmul. If the call to llvm.fmuladd has fast-math flags these should be propagated through to the fmul instruction, so an interface setFastMathFlags has been added to the VPInstruction class to enable this. Differential Revision: https://reviews.llvm.org/D113125	2021-11-24 08:50:04 +00:00
Rosie Sumpter	c2441b6b89	[LoopVectorize] Add vector reduction support for fmuladd intrinsic Enables LoopVectorize to handle reduction patterns involving the llvm.fmuladd intrinsic. Differential Revision: https://reviews.llvm.org/D111555	2021-11-24 08:50:04 +00:00
Butygin	7f5d9bf13a	[mlir][scf] Canonicalize scf.while with unused results Differential Revision: https://reviews.llvm.org/D114291	2021-11-24 11:11:22 +03:00
Clement Courbet	ba4411e7c6	[clang-tidy] performance-unnecessary-copy-initialization: Fix false negative. `isConstRefReturningMethodCall` should be considering `CXXOperatorCallExpr` in addition to `CXXMemberCallExpr`. Clang considers these to be distinct (`CXXOperatorCallExpr` derives from `CallExpr`, not `CXXMemberCallExpr`), but we don't care in the context of this check. This is important because of `std::vector<Expensive>::operator[](size_t) const`. Differential Revision: https://reviews.llvm.org/D114249	2021-11-24 08:07:21 +01:00
Vitaly Buka	b9fd7247a7	[sanitizer] Add Abs<T>	2021-11-23 22:25:36 -08:00
Abinav Puthan Purayil	078da26b1c	[AMDGPU] Check for unneeded shift mask in shift PatFrags. The existing constrained shift PatFrags only dealt with masked shift from OpenCL front-ends. This change copies the X86DAGToDAGISel::isUnneededShiftMask() function to AMDGPU and uses it in the shift PatFrag predicates. Differential Revision: https://reviews.llvm.org/D113448	2021-11-24 10:53:12 +05:30
Igor Kudrin	8cdf1c1edb	[ELF] Support the "read-only" memory region attribute The attribute 'r' allows (or disallows for the negative case) read-only sections, i.e. ones without the SHF_WRITE flag, to be assigned to the memory region. Before the patch, lld could put a section in the wrong region or fail with "error: no memory region specified for section". Differential Revision: https://reviews.llvm.org/D113771	2021-11-24 12:17:09 +07:00
Vitaly Buka	55792b5ac4	[sanitizer] Fail instead of crash without real_pthread_create	2021-11-23 20:32:09 -08:00
Bixia Zheng	02710413a3	Accept symmetric sparse matrix in Matrix Market Exchange Format. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D114402	2021-11-23 19:53:17 -08:00
Weverything	1150f02c77	Revert "tsan: new runtime (v3)" This reverts commit `ebd47b0fb7`. This was causing unexpected behavior in programs.	2021-11-23 18:32:32 -08:00
Uday Bondhugula	8bd08a9fd7	[MLIR] Remove duplicate `Pass` suffix from ViewOpGraph class name Remove duplicate `Pass` suffix from view-op-graph pass class name. The extra suffix would lead to methods like registerViewOpGraphPassPass being generated. Differential Revision: https://reviews.llvm.org/D114459	2021-11-24 08:00:16 +05:30
wren romano	d7d7ffe254	[mlir][sparse] Adding wrappers for constantOverheadTypeEncoding Minor code cleanup Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D114392	2021-11-23 18:30:06 -08:00
Jun Ma	17eb6b61de	Revert "[Taildup] Don't tail-duplicate loop header with multiple successors as its latches" This reverts commit `1f9fa54984`.	2021-11-24 10:26:37 +08:00
Jun Ma	07333810ca	Revert "Revert "Revert "Recommit "Revert "[CVP] processSwitch: Remove default case when switch cover all possible values.""""" This reverts commit `c93f93b2e3`.	2021-11-24 10:26:37 +08:00
Mehdi Amini	8ec0f22184	Update fir.insert_on_range syntax to make the range more explicit (NFC) Also replace ArrayAttr with IndexElementsAttr to model subscript dimensions. An array of attribute is a sparse inefficient storage, with an API that requires to unpack/repack integers at every call site. Instead we can store dense array of integer as IndexElementsAttr. Reviewed By: clementval, kiranchandramohan Differential Revision: https://reviews.llvm.org/D112899	2021-11-24 02:06:17 +00:00
Zequan Wu	22ced33a2f	[LLDB][NativePDB] Allow find functions by full names I don't see a reason why not to. If we allows lookup functions by full names, I can change the test case in D113930 to use `lldb-test symbols --find=function --name=full::name --function-flags=full ...`, though the duplicate method decl prolem is still there for `lldb-test symbols --dump-ast`. That's a seprate bug, we can fix it later. Differential Revision: https://reviews.llvm.org/D114467	2021-11-23 17:57:05 -08:00
Vitaly Buka	6889592ebc	[NFC][sanitizer] Limit StackStore stack size/tag to 1 byte Nothing uses more than 8bit now. So the rest of the headers can store other data. kStackTraceMax is 256 now, but all sanitizers by default store just 20-30 frames here.	2021-11-23 16:56:34 -08:00
Vitaly Buka	402a406323	[NFC][sanitizer] Test for `b80affb8a1`	2021-11-23 16:56:24 -08:00
Stanislav Mekhanoshin	661a232e34	[AMDGPU] Remove a no-op check in the gfx90a hazard recognizer Also rename helper function accordingly. Differential Revision: https://reviews.llvm.org/D114289	2021-11-23 15:35:19 -08:00
Butygin	75a1bee05d	[mlir][spirv] Add math to OpenCL conversion Differential Revision: https://reviews.llvm.org/D113780	2021-11-24 02:31:21 +03:00
Florian Mayer	26d1edfb10	[hwasan] support python3 in hwasan_sanitize Verified no diff exist between previous version, new version python 2, and python 3 for an example stack. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D114404	2021-11-23 15:30:30 -08:00
Florian Mayer	6c06d8e310	[stack-safety] Check SCEV constraints at memory instructions. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D113160	2021-11-23 15:29:23 -08:00
Vitaly Buka	4058637f7a	[NFC][sanitizer] Reuse forEach for operator==	2021-11-23 15:23:25 -08:00
Vitaly Buka	09256fe980	[sanitizer] Add DenseMap::forEach	2021-11-23 15:23:25 -08:00
Nemanja Ivanovic	c9cb8edc51	[PowerPC] Allow scalars for asm constraint "v" with VSX Similarly to what GCC does, we should allow scalars with the "v" constraint rather than introducing unnecessary new constraints for scalars in Altivec registers. Differential revision: https://reviews.llvm.org/D113635	2021-11-23 17:03:04 -06:00
Matt Arsenault	273a0c8bc9	PrologEpilogInserter: Use explicit control for scavenge slot placement AMDGPU is unusual in that the both stack is indexed in the same direction as stack growth (up). We therefore always need the emergency stack slots placed as low as possible to ensure they are in range of load/store instruction immediate offsets. The existing logic is mostly OK, but failed if we required stack realignment. I don't understand what the existing control isFPCloseToIncomingSP is supposed to mean, but can only be used to stop placing the scavenge slots earlier. Make this explicit so that targets can opt-in rather than opt-out only.	2021-11-23 18:01:12 -05:00
Florian Hahn	73a05cc8df	[LAA] Move visitPointers up in file (NFC). This allows easier re-use in earlier functions.	2021-11-23 22:47:26 +00:00
Walter Erquinigo	877433ad45	Fix `a48501150b` Issue in https://lab.llvm.org/buildbot/#/builders/96/builds/14682. Making the test deterministic.	2021-11-23 14:23:38 -08:00
Danil Stefaniuc	9a9d9a9b00	[formatters] List and forward_list capping_size determination and application This diff is adding the capping_size determination for the list and forward list, to limit the number of children to be displayed. Also it modifies and unifies tests for libcxx and libstdcpp list data formatter. Reviewed By: wallace Differential Revision: https://reviews.llvm.org/D114433	2021-11-23 14:18:51 -08:00
Rahul Joshi	4961fcfbcf	Move dependency llvm:AllTargetsAsmParsers from Translation to ExecutionEngine. - Fixes a minor issue in https://reviews.llvm.org/D114338, which seems incorrectly added the llvm:AllTargetsAsmParsers dependency to Translation in bazel build files. Differential Revision: https://reviews.llvm.org/D114471	2021-11-23 14:10:26 -08:00
Danil Stefaniuc	193bf2e820	[formatters] Capping size limitation avoidance for the libcxx and libcpp bitset data formatters. This diff is avoiding the size limitation introduced by the capping size for the libcxx and libcpp bitset data formatters. Reviewed By: wallace Differential Revision: https://reviews.llvm.org/D114461	2021-11-23 14:03:59 -08:00
Walter Erquinigo	a48501150b	Make some libstd++ formatters safer We need to add checks that ensure that some core variables are valid, so that we avoid printing out garbage data. The worst that could happen is that an non-initialized variable is being printed as something with 123123432 children instead of 0. Differential Revision: https://reviews.llvm.org/D114458	2021-11-23 13:52:32 -08:00
Walter Erquinigo	4ba5da8e3d	Improve optional formatter As suggested by @labath in https://reviews.llvm.org/D114403, we should make the formatter more resilient to corrupted data. The Libcxx version explicitly checks for engaged = 1, so we can do that as well for safety. Differential Revision: https://reviews.llvm.org/D114450	2021-11-23 13:52:17 -08:00
Sanjay Patel	892648b18a	[InstSimplify] fold xor logic of 2 variables (a & b) ^ (~a \| b) --> ~a I was looking for a shortcut to reduce some of the complex logic folds that are currently up for review (D113216 and others in that stack), and I found this missing from instcombine/instsimplify. There is a trade-off in putting it into instsimplify: because we can't create new values here, we need a strict 'not' op (no undef elements). Otherwise, the fold is not valid: https://alive2.llvm.org/ce/z/k_AGGj If this was in instcombine instead, we could create the proper 'not'. But having the fold here benefits other passes like GVN that use instsimplify as an analysis. There is a related fold where 'and' and 'or' are swapped, and that is planned as a follow-up commit. Differential Revision: https://reviews.llvm.org/D114462	2021-11-23 16:50:23 -05:00
Vitaly Buka	b1a68b170c	[NFC][sanitizer] Make method const	2021-11-23 13:50:07 -08:00
Vitaly Buka	abd86619cf	[NFC][sanitizer] Extract StackTraceHeader struct	2021-11-23 13:50:06 -08:00

... 2 3 4 5 6 ...

405686 Commits All Branches Search

405686 Commits

All Branches