llvm-project

Commit Graph

Author	SHA1	Message	Date
Lucas Prates	f56550cf7f	[ARM] Enabling range checks on Neon intrinsics' lane arguments Summary: Range checks were not properly performed in the lane arguments of Neon intrinsics implemented based on splat operations. Calls to those intrinsics where translated to `__builtin__shufflevector` calls directly by the pre-processor through the arm_neon.h macros, missing the chance for the proper range checks. This patch enables the range check by introducing an auxiliary splat instruction in arm_neon.td, delaying the translation to shufflevector calls to CGBuiltin.cpp in clang after the checks were performed. Reviewers: jmolloy, t.p.northover, rsmith, olista01, ostannard Reviewed By: ostannard Subscribers: ostannard, dnsampaio, danielkiss, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74619	2020-03-19 12:07:23 +00:00
Lucas Prates	7bf23563f4	Revert "[ARM] Setting missing isLaneQ attribute on Neon Intrisics definitions" This reverts commit `62ab15ffa3`. Multiple commits were unintentionally squashed into this one. Reverting so each of them can be pushed properly.	2020-03-19 12:01:13 +00:00
Lucas Prates	62ab15ffa3	[ARM] Setting missing isLaneQ attribute on Neon Intrisics definitions Summary: Some of the `*_laneq` intrinsics defined in arm_neon.td were missing the setting of the `isLaneQ` attribute. This patch sets the attribute on the related definitions, as they will be required to properly perform range checks on their lane arguments. Reviewers: jmolloy, t.p.northover, rsmith, olista01, dnsampaio Reviewed By: dnsampaio Subscribers: dnsampaio, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74616	2020-03-19 11:52:41 +00:00
Fangrui Song	dbc96b518b	Revert "[CodeGenModule] Assume dso_local for -fpic -fno-semantic-interposition" This reverts commit `789a46f2d7`. Accidentally committed.	2020-02-03 10:09:39 -08:00
Fangrui Song	789a46f2d7	[CodeGenModule] Assume dso_local for -fpic -fno-semantic-interposition Summary: Clang -fpic defaults to -fno-semantic-interposition (GCC -fpic defaults to -fsemantic-interposition). Users need to specify -fsemantic-interposition to get semantic interposition behavior. Semantic interposition is currently a best-effort feature. There may still be some cases where it is not handled well. Reviewers: peter.smith, rnk, serge-sans-paille, sfertile, jfb, jdoerfert Subscribers: dschuff, jyknight, dylanmckay, nemanjai, jvesely, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, arphaman, PkmX, jocewei, jsji, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73865	2020-02-03 09:52:48 -08:00
Cameron McInally	20b8ed2c2b	[IRBuilder] Update IRBuilder::CreateFNeg(...) to return a UnaryOperator Reapply r374240 with fix for Ocaml test, namely Bindings/OCaml/core.ml. Differential Revision: https://reviews.llvm.org/D61675 llvm-svn: 374782	2019-10-14 15:35:01 +00:00
Dmitri Gribenko	eaf6dd482b	Revert "[IRBuilder] Update IRBuilder::CreateFNeg(...) to return a UnaryOperator" This reverts commit r374240. It broke OCaml tests: http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/19014 llvm-svn: 374354	2019-10-10 14:13:54 +00:00
Cameron McInally	47363a148f	[IRBuilder] Update IRBuilder::CreateFNeg(...) to return a UnaryOperator Also update Clang to call Builder.CreateFNeg(...) for UnaryMinus. Differential Revision: https://reviews.llvm.org/D61675 llvm-svn: 374240	2019-10-09 21:52:15 +00:00
Craig Topper	3113ec3dc7	[CodeGen] Update min-legal-vector width based on function argument and return types This is a continuation of my patches to inform the X86 backend about what the largest IR types are in the function so that we can restrict the backend type legalizer to prevent 512-bit vectors on SKX when -mprefer-vector-width=256 is specified if no explicit 512 bit vectors were specified by the user. This patch updates the vector width based on the argument and return types of the current function and from the types of any functions it calls. This is intended to make sure the backend type legalizer doesn't disturb any types that are required for ABI. Differential Revision: https://reviews.llvm.org/D52441 llvm-svn: 345168	2018-10-24 17:42:17 +00:00
Mehdi Amini	6aa9e9b41a	IRGen: Add optnone attribute on function during O0 Amongst other, this will help LTO to correctly handle/honor files compiled with O0, helping debugging failures. It also seems in line with how we handle other options, like how -fnoinline adds the appropriate attribute as well. Differential Revision: https://reviews.llvm.org/D28404 llvm-svn: 304127	2017-05-29 05:38:20 +00:00
Renato Golin	fa007aeef4	Revert "set the underlying value of “#pragma STDC FP_CONTRACT” on by default" This reverts commit r282259, as it broke the AArch64 test-suite bots. llvm-svn: 282289	2016-09-23 20:32:52 +00:00
Sebastian Pop	6919ae5abc	set the underlying value of “#pragma STDC FP_CONTRACT” on by default Clang has the default FP contraction setting of “-ffp-contract=on”, which doesn't really mean “on” in the conventional sense of the word, but rather really means “according to the per-statement effective value of the relevant pragma”. Before this patch, Clang has that pragma defaulting to “off”. Since the “-ffp-contract=on” mode is really an AND of two booleans and the second of them defaults to “off”, the whole thing effectively defaults to “off”. This patch changes the default value of the pragma to “on”, thus making the default pair of booleans (on, on) rather than (on, off). This makes FP optimization slightly more aggressive than before when not using either “-Ofast”, “-ffast-math”, or “-ffp-contract=fast”. Even with this patch the compiler still respects “-ffp-contract=off”. As per a suggestion by Steve Canon, the added code does _not_ require “-O3” or higher. This is so as to try our best to preserve identical floating-point results for unchanged source code compiling for an unchanged target when only changing from any optimization level in the set (“-O0”, “-O1”, “-O2”, “-O3”) to any other optimization level in that set. “-Os” and “-Oz” seem to be behaving identically, i.e. should probably be considered a part of the aforementioned set, but I have not reviewed this rigorously. “-Ofast” is explicitly _not_ a member of that set. Patch authored by Abe Skolnik [a.skolnik@samsung.com] and Stephen Canon [scanon@apple.com]. Differential Revision: https://reviews.llvm.org/D24481 llvm-svn: 282259	2016-09-23 16:16:25 +00:00
David Majnemer	12b9e76b62	Update for LLVM changes InstSimplify has gained the ability to remove needless bitcasts which perturbed some clang codegen tests. llvm-svn: 276728	2016-07-26 05:52:37 +00:00
Ahmed Bougacha	1d9de10130	[ARM NEON] Define vfms_f32 on ARM, and all vfms using vfma. r259537 added vfma/vfms to armv7, but the builtin was only lowered on the AArch64 side. Instead of supporting it on ARM, get rid of it. The vfms builtin lowered to: %nb = fsub float -0.0, %b %r = @llvm.fma.f32(%a, %nb, %c) Instead, define the operation in terms of vfma, and swap the multiplicands. It now lowers to: %na = fsub float -0.0, %a %r = @llvm.fma.f32(%na, %b, %c) This matches the instruction more closely, and lets current LLVM generate the "natural" operand ordering: fmls.2s v0, v1, v2 instead of the crooked (but equivalent): fmls.2s v0, v2, v1 Except for theses changes, assembly is identical. LLVM accepts both commutations, and the LLVM tests in: test/CodeGen/AArch64/arm64-fmadd.ll test/CodeGen/AArch64/fp-dp3.ll test/CodeGen/AArch64/neon-fma.ll test/CodeGen/ARM/fusedMAC.ll already check either the new one only, or both. Also verified against the test-suite unittests. llvm-svn: 266807	2016-04-19 19:44:45 +00:00
Tim Northover	58672974a9	ARM & AArch64: convert asm tests to LLVM IR and restrict optimizations. This is mostly a one-time autoconversion of tests that checked assembly after "-Owhatever" compiles to only run "opt -mem2reg" and check the assembly. This should make them much more stable to changes in LLVM so they won't break on unrelated changes. "opt -mem2reg" is a compromise designed to increase the readability of tests that check dataflow, while minimizing dependency on LLVM. Hopefully mem2reg is stable enough that no surpises will come along. Should address http://llvm.org/PR26815. llvm-svn: 263048	2016-03-09 18:54:42 +00:00
Manuel Klimek	ff39366de5	Revert "Make FP_CONTRACT ON the default." This reverts commit r253269. This leads to assert / segfault triggering on the following reduced example: float foo(float U, float base, float cell) { return (U = 2 * base) - cell; } llvm-svn: 253337	2015-11-17 15:40:10 +00:00
Stephen Canon	916be92955	Make FP_CONTRACT ON the default. Differential Revision: D14200 llvm-svn: 253269	2015-11-16 23:09:11 +00:00
Tim Northover	831d728f9a	AArch64: re-enable tests that were looking for a non-existent backend. In the final phase of the merge, I managed to disable a bunch of Clang tests accidentally. Fortunately none of them seem to have broken in the interim. llvm-svn: 211149	2014-06-18 08:37:28 +00:00
Tim Northover	25e8a6754e	AArch64/ARM64: update Clang after AArch64 removal. A few (mostly CodeGen) parts of Clang were tightly coupled to the AArch64 backend. Now that it's gone, they will not even compile. I've also deduplicated RUN lines in many of the AArch64 tests. This might improve "make check-all" time noticably: some of those NEON tests were monsters. llvm-svn: 209578	2014-05-24 12:51:25 +00:00
James Molloy	75f5f9e629	[ARM64] Allow the disabling of NEON and crypto instructions. Update tests to pass -target-feature +neon. llvm-svn: 206394	2014-04-16 15:33:48 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Tim Northover	3d4575cc1b	AArch64 NEON: add 64-bit scalar intrinsics for _f64 mla/mls etc. These seem to be supported by GCC, and do make sense architecturally so we should probably have them. llvm-svn: 202138	2014-02-25 11:13:49 +00:00
Jiangning Liu	38799b1471	Add some missing test cases for ACLE intrinsics of AArch64 NEON. llvm-svn: 197994	2013-12-25 01:23:43 +00:00

23 Commits