llvm-project

Commit Graph

Author	SHA1	Message	Date
Julian Lettner	284934fbc1	Make linter happy	2020-06-04 15:14:48 -07:00
Vedant Kumar	24660ea11c	[docs] HowToUpdateDebugInfo: Minor cleanups - Change the reference to salvageDebugInfoOrUndef to salvageDebugInfo (in accordance with https://reviews.llvm.org/D78369). - Reorganize a few sections in preparation for an upcoming change that attempts to specify rules for updating debug locations. - Fix some intra-document links. - Some spelling / wording fixes.	2020-06-04 14:56:01 -07:00
Yuanfang Chen	f9ea86eaa1	[Docs] Add the entry for `Advanced builds` in UserGuide.rst Also add a link to it from ThinLTO.rst.	2020-06-04 14:52:51 -07:00
Alexey Bataev	4e3d4622b1	Fix undefined behaviour when trying to deref nullptr.	2020-06-04 17:52:06 -04:00
Craig Topper	3ad8fbd205	[Reassociate] Teach ConvertShiftToMul to preserve nsw flag if the shift amount is not bitwidth - 1. Multiply and shl have different signed overflow behavior in some cases. But it looks like we should be ok as long as the shift amount is less than bitwidth - 1. Alive2: http://volta.cs.utah.edu:8080/z/MM4WZP Differential Revision: https://reviews.llvm.org/D81189	2020-06-04 14:51:34 -07:00
Matt Arsenault	1657f0ebc2	AMDGPU: Fix overriding global FP atomic feature predicates Global TableGen let override blocks are pretty dangerous and override any local special cases. In this case, the broader HasFlatGlobalInsts was overriding the more specific predicate for FeatureAtomicFaddInsts. Make sure HasFlatGlobalInsts is implied by FeatureAtomicFaddInsts, and make sure the right predicate is used. One issue with independently setting the subtarget features on incompatible targets is all of the encoding families do not define all opcodes. This will hit an assert on gfx10 for example, since we set the encoding independently based on the generation and not based on a feature.	2020-06-04 17:50:38 -04:00
Matt Arsenault	651c36b508	AMDGPU: Select strict_fmul	2020-06-04 17:49:00 -04:00
Matt Arsenault	483d4daa5e	AMDGPU: Select strict_fma Like with strict_fadd, the legalization is scalarizing the v4f16 when it should split.	2020-06-04 17:49:00 -04:00
Matt Arsenault	ae26c064ce	AMDGPU: Select strict_fadd	2020-06-04 17:49:00 -04:00
Diego Caballero	5c990d6994	[mlir] Add support for bf16 to StandardToLLVM conversion Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D81127	2020-06-04 14:36:36 -07:00
Matt Arsenault	b71f574e7f	AMDGPU: Add test for fdiv nofpexcept preservation This logically belongs with `89d48ccabe`, but this order was needed to avoid regressions before adding mayRaiseFPExceptions to relevant instructions.	2020-06-04 17:35:27 -04:00
Matt Arsenault	d259668731	AMDGPU: Set mayRaiseFPException This may be missing a few overrides to set it off still in some special cases. Since the flags set during selection should now be reliably preserved, this should not change codegen for non-strictfp functions.	2020-06-04 17:35:27 -04:00
Sanjay Patel	192cb71836	[InstCombine] avoid crashing on select-shuffle detection As mentioned in the post-commit comments of D81013 - the mask check API has to assume the shuffle is not length-changing, but we have not ruled that out in this code. Use the ShuffleVectorInst call instead.	2020-06-04 17:27:14 -04:00
Petr Hosek	d510542174	[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime Follow the model used on Linux, where the clang driver passes the linker a -u switch to force the profile runtime to be linked in, rather than having every TU emit a dead function with a reference. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D79835	2020-06-04 14:25:19 -07:00
Shilei Tian	a014fbbc21	[OpenMP] Improve D2D memcpy to use more efficient driver API Summary: In current implementation, D2D memcpy is first to copy data back to host and then copy from host to device. This is very efficient if the device supports D2D memcpy, like CUDA. In this patch, D2D memcpy will first try to use native supported driver API. If it fails, fall back to original way. It is worth noting that D2D memcpy in this scenerio contains two ideas: - Same devices: this is the D2D memcpy in the CUDA context. - Different devices: this is the PeerToPeer memcpy in the CUDA context. My implementation merges this two parts. It chooses the best API according to the source device and destination device. Reviewers: jdoerfert, AndreyChurbanov, grokos Reviewed By: jdoerfert Subscribers: yaxunl, guansong, sstefan1, openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D80649	2020-06-04 16:59:06 -04:00
Yaxun (Sam) Liu	263390d4f5	[CUDA][HIP] Fix implicit HD function resolution recommit `e03394c6a6` with fix When implicit HD function calls a function in device compilation, if one candidate is an implicit HD function, current resolution rule is: D wins over HD and H HD and H are equal this caused regression when there is an otherwise worse D candidate This patch changes that to D, HD and H are all equal The rationale is that we already know for host compilation there is already a valid candidate in HD and H candidates that will not cause error. Allowing HD and H gives us a fall back candidate that will not cause error. If D wins, that means D has to be a better match otherwise, therefore D should also be a valid candidate that will not cause error. In this way, we can guarantee no regression. Differential Revision: https://reviews.llvm.org/D80450	2020-06-04 16:54:52 -04:00
Matt Arsenault	54a8a8d509	AMDGPU: Fix using unencodable instructions in tests There are a number of MIR tests using instructions on subtargets where they don't really exist. These are some of the easy cases that don't require splitting up test functions.	2020-06-04 16:50:19 -04:00
Matt Arsenault	fe0d5121fa	AMDGPU/GlobalISel: Fix making LDS FP atomics legal on SI/CI	2020-06-04 16:50:19 -04:00
Matt Arsenault	16acc12e1d	AMDGPU/GlobalISel: Fix trying to use wave32 for gfx9 test	2020-06-04 16:50:19 -04:00
Alexey Bataev	bd1c03d7b7	[OPENMP50]Codegen for inscan reductions in worksharing directives. Summary: Implemented codegen for reduction clauses with inscan modifiers in worksharing constructs. Emits the code for the directive with inscan reductions. The code is the following: ``` size num_iters = <num_iters>; <type> buffer[num_iters]; for (i: 0..<num_iters>) { <input phase>; buffer[i] = red; } for (int k = 0; k != ceil(log2(num_iters)); ++k) for (size cnt = last_iter; cnt >= pow(2, k); --k) buffer[i] op= buffer[i-pow(2,k)]; for (0..<num_iters>) { red = InclusiveScan ? buffer[i] : buffer[i-1]; <scan phase>; } ``` Reviewers: jdoerfert Subscribers: yaxunl, guansong, arphaman, cfe-commits, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D79948	2020-06-04 16:29:33 -04:00
Thomas Lively	a07c08f74f	[WebAssembly] Lower llvm.debugtrap properly Summary: Unlike normal traps, debug traps are allowed to return and can have additional instructions in the same basic block. Without explicit backend support for debug traps, they are lowered in ISel as normal traps. Since normal traps are lowered in the WebAssembly backend to the UNREACHABLE instruction, which is a terminator, using debug traps could lead to invalid MBBs when there are additional instructions after the trap. This patch fixes the issue by lowering debug traps to a new version of the UNREACHABLE instruction, DEBUG_UNREACHABLE, that is not a terminator. An alternative approach would have been to make UNREACHABLE not a terminator, but that breaks a large number of tests. In particular, it would require removing the traps inserted after noreturn calls to @llvm.wasm.throw because otherwise the terminator throw would be followed by a non-terminator UNREACHABLE and we would be back to having invalid MBBs. Overall the approach in this patch seems simpler. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81055	2020-06-04 13:25:10 -07:00
Pete Steinfeld	1746c8ed26	[flang] Fixed crash on forward referenced `len` parameter Summary: Using a forward reference to define a `len` parameter causes a crash. The underlying cause was that a previously declared type had an erroneous expression for its `LEN` param value. When this expression was referenced to evaluate a subsequent expression, bad things happened. I fixed this by putting in code to detect this case. Reviewers: tskeith, klausler, DavidTruby Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80593	2020-06-04 13:12:11 -07:00
Huihui Zhang	42048ff972	[NFC] Move test vscale-factor-out-constant.ll to AArch64 sub-directory. Vscale scalable vector is specific to AArch64 target. Bring back 'uglygep' check.	2020-06-04 12:55:28 -07:00
Eric Schweitz	baa12ddb6f	[flang] Add the conversions for types. Part of lowering is to convert the front-end types to their FIR dialect representations. These conversions are done by here in the ConvertType module. proactively update the code to conform better with LLVM coding conventions Differential Revision: https://reviews.llvm.org/D81034	2020-06-04 12:54:50 -07:00
aartbik	c19fae507e	[mlir] [VectorOps] Add missing comments to CreateMaskOp lowering Summary: Add missing comment to CreateMask. Fixed typo in ConstantMask comment. Reviewers: nicolasvasilache, rriddle, reidtatge, ftynse Reviewed By: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul Tags: #mlir Differential Revision: https://reviews.llvm.org/D81125	2020-06-04 12:50:47 -07:00
Florian Hahn	714e84be46	[SemaOverload] Use iterator_range to iterate over VectorTypes (NFC). We can simplify the code a bit by using iterator_range instead of plain iterators. Matrix type support here (added in `6f6e91d193`) already uses an iterator_range. Reviewers: rjmccall, arphaman, jfb, Bigcheese Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D81138	2020-06-04 20:47:16 +01:00
Dmitri Gribenko	a180d5409f	AST Matchers test: use arrays instead of vectors Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81180	2020-06-04 21:40:30 +02:00
Valentin Clement	3d9bb031d1	[flang] avoid GCC < 8 compiler failure after D80794 Summary: Patch D80794 remove the custom flags for release build for flang. This leads to build failure with GCC < 8. This patch add upperbound check in order to avoid the -Werror=array-bounds to trigger a build failure. ``` /home/4vn/versioning/llvm-project/flang/lib/Decimal/big-radix-floating-point.h:183:29: error: array subscript is above array bounds [-Werror=array-bounds] digit_[j] = digit_[j + remove]; ~~~~~~^ /home/4vn/versioning/llvm-project/flang/lib/Decimal/big-radix-floating-point.h:183:29: error: array subscript is above array bounds [-Werror=array-bounds] digit_[j] = digit_[j + remove]; ~~~~~~^ /home/4vn/versioning/llvm-project/flang/lib/Decimal/big-radix-floating-point.h:183:29: error: array subscript is above array bounds [-Werror=array-bounds] digit_[j] = digit_[j + remove]; ~~~~~~^ /home/4vn/versioning/llvm-project/flang/lib/Decimal/big-radix-floating-point.h:183:29: error: array subscript is above array bounds [-Werror=array-bounds] digit_[j] = digit_[j + remove]; ~~~~~~^ /home/4vn/versioning/llvm-project/flang/lib/Decimal/big-radix-floating-point.h:183:29: error: array subscript is above array bounds [-Werror=array-bounds] digit_[j] = digit_[j + remove]; ``` ``` /home/4vn/versioning/llvm-project/flang/include/flang/Evaluate/integer.h:809:28: error: array subscript is above array bounds [-Werror=array-bounds] xy += product[to]; ~~~~~~~^ /home/4vn/versioning/llvm-project/flang/include/flang/Evaluate/integer.h:810:22: error: array subscript is above array bounds [-Werror=array-bounds] product[to] = xy & partMask; ~~~~~~~^ /home/4vn/versioning/llvm-project/flang/include/flang/Evaluate/integer.h:809:28: error: array subscript is above array bounds [-Werror=array-bounds] xy += product[to]; ~~~~~~~^ ``` Reviewers: DavidTruby, sscalpone, jdoerfert Reviewed By: DavidTruby Subscribers: llvm-commits Tags: #llvm, #flang Differential Revision: https://reviews.llvm.org/D81179	2020-06-04 14:48:39 -04:00
Hiroshi Yamauchi	e52a38db07	[PGO] Enable the working set size scaling under the partial sample PGO. Summary: Following up D79831. Reviewers: davidxl Subscribers: eraman, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80939	2020-06-04 11:30:54 -07:00
Sanjay Patel	8a96c1f627	[InstCombine] move vector select ahead of select-shuffle select Cond, (shuf_sel X, Y), X --> shuf_sel X, (select Cond, Y, X) A select of a select-shuffle ("blend" in x86 lingo) can be reversed so that the select is done first. This is a more limited version of what I was trying in D80658, but it enables existing demanded bits transforms to catch some of the motivating cases. The tricky bit in that seems to be that by moving the shuffle later, we can always guarantee that poison is correctly inhibited by the shuffle mask in the final value. Alive2 checks for the basic tests: http://volta.cs.utah.edu:8080/z/Qqd3RK http://volta.cs.utah.edu:8080/z/S4wchM http://volta.cs.utah.edu:8080/z/wf9zPL http://volta.cs.utah.edu:8080/z/wJeEGk Differential Revision: https://reviews.llvm.org/D81013	2020-06-04 14:29:13 -04:00
Jan Korous	5f5d972d83	[docs] Fix self-contradictory description of llvm_unreachable Just two paragraphs above it says: "If the compiler does not support this [skipping code generation for a particular branch], it will fall back to the "abort" implementation." And that actually correctly describes llvm_unreachable implementation. Differential Revision: https://reviews.llvm.org/D81130	2020-06-04 11:15:20 -07:00
Eduardo Caldas	42f6fec387	Propose naming principle for NodeRole and apply it Reviewers: gribozavr2 Reviewed By: gribozavr2 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81157	2020-06-04 20:08:35 +02:00
Louis Dionne	cc78f1e0fe	[libc++] Avoid warning for large types with std::atomic in the test suite It is legitimate for the test suite to use types that are slow to use with std::atomic, since we need coverage for those too. If we don't disable the warning, it is promoted to an error, which prevents us from testing such types.	2020-06-04 14:06:04 -04:00
LLVM GN Syncbot	8b5ee3b9b6	[gn build] Port `e53f558057`	2020-06-04 17:56:21 +00:00
LLVM GN Syncbot	5c55033dce	[gn build] Port `c973ad1878`	2020-06-04 17:56:21 +00:00
LLVM GN Syncbot	60c2fee426	[gn build] Port `ba2a01645b`	2020-06-04 17:56:20 +00:00
LLVM GN Syncbot	3a4bf99f0b	[gn build] Port `69fa84a6e9`	2020-06-04 17:56:20 +00:00
LLVM GN Syncbot	48a50fcc9a	[gn build] Port `6756a2c953`	2020-06-04 17:56:19 +00:00
LLVM GN Syncbot	9034dc9c59	[gn build] Port `49a4f3f7d8`	2020-06-04 17:56:19 +00:00
Huihui Zhang	f7f1abdb88	[NFC] Temporarily disable check for 'uglygep' while investigating some buildbot failure. The purpose of vscale-factor-out-constant.ll is to check we are crashing with blind cast 'Factor' in a MulExpr to SCEVConstant.	2020-06-04 10:54:02 -07:00
Thomas Raoux	661235e126	[mlir][gpu] Add subgroup Id/Size/Num to GPU dialect Add SubgroupId, SubgroupSize and NumSubgroups to GPU dialect ops and add the lowering of those ops to SPIRV. Differential Revision: https://reviews.llvm.org/D81042	2020-06-04 10:52:40 -07:00
Amara Emerson	e53f558057	[AArch64][GlobalISel] Move GlobalISel source files to a dedicated subdir. Differential Revision: https://reviews.llvm.org/D81116	2020-06-04 10:51:38 -07:00
Jim Ingham	a976a7fcae	Disable this test for Windows. The printf expression crashes with the message: Attempted to dereference an invalid pointer Someone who knows more about Windows should suggest how to fix this.	2020-06-04 10:51:01 -07:00
Hans Wennborg	fcc199d696	Make regcoal_remat_empty_subrange.ll test require asserts build. The -stress-sched flag is only available when asserts are enabled.	2020-06-04 19:46:22 +02:00
Jan Kratochvil	476f520a0b	[lldb] Fix SLEB128 decoding Bug 46181 shows SLEB128 0xED9A924C00011151 decoded as 0xffffffff80011151. LLDB show a wrong value for function argument https://bugs.llvm.org/show_bug.cgi?id=46181 Differential Revision: https://reviews.llvm.org/D81119	2020-06-04 19:41:24 +02:00
Layton Kifer	7381fcdf62	[TRE] Allow accumulator elimination when base case returns non-constant Remove the requirement, that when performing accumulator elimination, all other cases must return the same dynamic constant. We can do this by initializing the accumulator with the identity value of the accumulation operation, and inserting an additional operation before any return. Differential Revision: https://reviews.llvm.org/D80844	2020-06-04 10:34:42 -07:00
Huihui Zhang	bd43f78c76	[LSR][SCEVExpander] Avoid blind cast 'Factor' to SCEVConstant in FactorOutConstant. Summary: In SCEVExpander FactorOutConstant(), when GEP indexing into/over scalable vector, it is legal for the 'Factor' in a MulExpr to be the size of a scalable vector instead of a compile-time constant. Current upstream crash with the test attached. Reviewers: efriedma, sdesmalen, sanjoy.google, mkazantsev Reviewed By: efriedma Subscribers: hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80973	2020-06-04 10:33:39 -07:00
Christopher Tetreault	c2625f330f	[SVE] Eliminate calls to default-false VectorType::get() from SystemZ Reviewers: efriedma, jnspaulsson, kmclaughlin, sdesmalen, samparker, uweigand Reviewed By: uweigand Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80329	2020-06-04 10:05:38 -07:00
Nathan James	e21c3f223a	[clang-tidy] ignore builtin varargs from pro-type-vararg-check Disables the check from warning on some built in vararg functions, Address [[ https://bugs.llvm.org/show_bug.cgi?id=45860 \| Clang-tidy should not consider __builtin_constant_p a variadic function. ]] Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D80887	2020-06-04 17:58:23 +01:00
Zinovy Nis	6271b96bef	[clang-tidy][modernize-loop-convert] Make loop var type human readable Differential Revision: https://reviews.llvm.org/D80536	2020-06-04 19:51:45 +03:00

... 3 4 5 6 7 ...

356475 Commits All Branches Search

356475 Commits

All Branches