llvm-project

Commit Graph

Author	SHA1	Message	Date
Bryan Chan	223307b3dc	[AArch64] Implement FP16FML intrinsics Generate the FP16FML intrinsics into arm_neon.h (AArch64 only for now). Add two new type modifiers to NeonEmitter to handle the new prototypes. Define __ARM_FEATURE_FP16FML when +fp16fml is enabled and guard the intrinsics with the macro in arm_neon.h. Based on a patch by Gao Yiling. Differential Revision: https://reviews.llvm.org/D53633 llvm-svn: 345344	2018-10-25 23:47:00 +00:00
Fangrui Song	55fab260ca	llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...) Summary: The convenience wrapper in STLExtras is available since rL342102. Reviewers: rsmith, #clang, dblaikie Reviewed By: rsmith, #clang Subscribers: mgrang, arphaman, kadircet, cfe-commits Differential Revision: https://reviews.llvm.org/D52576 llvm-svn: 343147	2018-09-26 22:16:28 +00:00
Diogo N. Sampaio	bac6c88da2	Replaces __inline by __inline__ / C89 compatible llvm-svn: 341644	2018-09-07 09:37:27 +00:00
Diogo N. Sampaio	fcc97daa8a	Fix arm_neon.h and arm_fp16.h generation for compiling with std=c89 Summary: The inline attribute is not valid for C standard 89. Replace the argument in the generation of header files with __inline, as well adding tests for both header files. Reviewers: pbarrio, SjoerdMeijer, javed.absar, t.p.northover Subscribers: t.p.northover, kristof.beyls, chrib, cfe-commits Differential Revision: https://reviews.llvm.org/D51683 test/Headers/arm-fp16-header.c test/Headers/arm-neon-header.c utils/TableGen/NeonEmitter.cpp llvm-svn: 341475	2018-09-05 14:56:21 +00:00
Luke Geeson	dc54b37414	[AArch64] Corrected FP16 Intrinsic range checks in Clang + added Sema tests Summary: This fixes the ranges for the vcvth family of FP16 intrinsics in the clang front end. Previously it was accepting incorrect ranges -Changed builtin range checking in SemaChecking -added tests SemaCheck changes - included in their own file since no similar one exists -modified existing tests to reflect new ranges Reviewers: SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Subscribers: kristof.beyls, cfe-commits Differential Revision: https://reviews.llvm.org/D47592 llvm-svn: 334489	2018-06-12 09:54:27 +00:00
Oliver Stannard	2fcee8bd52	[ARM,AArch64] Add intrinsics for dot product instructions The ACLE spec which describes these intrinsics hasn't been published yet, but this is based on the final draft which will be published soon, and these have already been implemented by GCC. Differential revision: https://reviews.llvm.org/D46109 llvm-svn: 331039	2018-04-27 14:03:32 +00:00
Alexander Kornienko	2a8c18d991	Fix typos in clang Found via codespell -q 3 -I ../clang-whitelist.txt Where whitelist consists of: archtype cas classs checkk compres definit frome iff inteval ith lod methode nd optin ot pres statics te thru Patch by luzpaz! (This is a subset of D44188 that applies cleanly with a few files that have dubious fixes reverted.) Differential revision: https://reviews.llvm.org/D44188 llvm-svn: 329399	2018-04-06 15:14:32 +00:00
Mandeep Singh Grang	c205d8cc8d	[clang] Change std::sort to llvm::sort in response to r327219 r327219 added wrappers to std::sort which randomly shuffle the container before sorting. This will help in uncovering non-determinism caused due to undefined sorting order of objects having the same key. To make use of that infrastructure we need to invoke llvm::sort instead of std::sort. llvm-svn: 328636	2018-03-27 16:50:00 +00:00
Adrian Prantl	6691e112ce	Mark fallthrough with LLVM_FALLTHROUGH llvm-svn: 323986	2018-02-01 18:10:20 +00:00
Abderrazek Zaafrani	ce8746d178	[AArch64] Add ARMv8.2-A FP16 scalar intrinsics https://reviews.llvm.org/D41792 llvm-svn: 323006	2018-01-19 23:11:18 +00:00
Benjamin Kramer	3a13ed60ba	Avoid int to string conversion in Twine or raw_ostream contexts. Some output changes from uppercase hex to lowercase hex, no other functionality change intended. llvm-svn: 321526	2017-12-28 16:58:54 +00:00
Abderrazek Zaafrani	f58a132eef	[AARch64] Add ARMv8.2-A FP16 vector intrinsics Putting back the code that was reverted few weeks ago. Differential Revision: https://reviews.llvm.org/D34161 llvm-svn: 321294	2017-12-21 19:20:01 +00:00
Adrian Prantl	f3b3ccda59	Silence a bunch of implicit fallthrough warnings llvm-svn: 321115	2017-12-19 22:06:11 +00:00
Sjoerd Meijer	98ee78578b	This reverts r305820 (ARMv.2-A FP16 vector intrinsics) because it shows problems in testing, see comments in D34161 for some more details. A fix is in progres in D35011, but a revert seems better now as the fix will probably take some more time to land. llvm-svn: 307277	2017-07-06 16:37:31 +00:00
Abderrazek Zaafrani	f10ca93f34	[AArch64] ADD ARMv.2-A FP16 vector intrinsics Differential Revision: https://reviews.llvm.org/D34161 llvm-svn: 305820	2017-06-20 18:54:57 +00:00
Vedant Kumar	a44a6ac81f	Revert "[AArch64] Add ARMv8.2-A FP16 vefctor intrinsics" This reverts commit r304493. It breaks all the Darwin bots: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental_check/37168 Failure: Failing Tests (2): Clang :: CodeGen/aarch64-v8.2a-neon-intrinsics.c Clang :: CodeGen/arm_neon_intrinsics.c llvm-svn: 304509	2017-06-02 01:22:14 +00:00
Abderrazek Zaafrani	a44e5f601d	[AArch64] Add ARMv8.2-A FP16 vefctor intrinsics llvm-svn: 304493	2017-06-01 23:22:29 +00:00
Matthias Braun	f1b01996ef	Adapt to llvm/TableGen DagInit changes. llvm-svn: 288645	2016-12-05 06:00:51 +00:00
Eugene Zelenko	58ab22fe48	Fix some Clang-tidy and Include What You Use warnings; other minor fixes (NFC). This preparation to remove SetVector.h dependency on SmallSet.h. llvm-svn: 288213	2016-11-29 22:44:24 +00:00
Mehdi Amini	9670f847b8	[NFC] Header cleanup Summary: Removed unused headers, replaced some headers with forward class declarations Patch by: Eugene <claprix@yandex.ru> Differential Revision: https://reviews.llvm.org/D20100 llvm-svn: 275882	2016-07-18 19:02:11 +00:00
Benjamin Kramer	cfeacf56f0	Apply clang-tidy's misc-move-constructor-init throughout Clang. No functionality change intended, maybe a tiny performance improvement. llvm-svn: 270996	2016-05-27 14:27:13 +00:00
Craig Topper	054b391cf4	No need to use utostr when putting integers into a raw_ostream. NFC llvm-svn: 259310	2016-01-31 00:20:26 +00:00
Craig Topper	2576124eb5	[TableGen] Merge the SuperClass Record and SMRange vector a single vector. This removes the state needed to manage the extract vector. NFC llvm-svn: 258066	2016-01-18 19:52:54 +00:00
Ahmed Bougacha	86da3d8d7c	[ARM NEON] Remove special-case for f16 vcvt handling. NFCI. We can use the 'H' typespec modifier to use 128-bit vectors directly in the only two users of this special-case: the vcvt f16 intrinsics. This also lets us use more meaningful prototype modifiers. llvm-svn: 245778	2015-08-22 01:30:13 +00:00
Ahmed Bougacha	cd5b8a0235	[ARM NEON] Use the common naming scheme for vcvt f16 builtins. NFC. We had "vcvt_f16" and "VCVT_HIGH_F16": for other FP types, this naming is used for intrinsics with integer overloads. The FP->FP conversions, on the other hand, use the full "vcvt_f32_f64" name instead. Use the same naming convention for the f16<->f32 conversions. While there, reorder the definitions a little bit. llvm-svn: 245763	2015-08-21 23:34:20 +00:00
Ahmed Bougacha	22a16965d6	[ARM NEON] Factor out FP-prototype checking. NFC. llvm-svn: 245761	2015-08-21 23:24:18 +00:00
David Blaikie	4c96a5ef1c	Fix memory ownership in the NeonEmitter by using values instead of pointers (smart or otherwise) Improvement to the memory leak fix in 244196. Address validity is required for the Intrinsic objects, but since the collections only ever grow (no elements are removed), deque provides sufficient guarantees (that the objects will never be reallocated/moved around) for this use case. llvm-svn: 244241	2015-08-06 18:29:32 +00:00
Yaron Keren	9f168530a2	Plug a memory leak in NeonEmitter: Intrinsics allocated were never released. llvm-svn: 244196	2015-08-06 07:28:36 +00:00
Alexander Kornienko	ab9db51042	Revert r240270 ("Fixed/added namespace ending comments using clang-tidy"). llvm-svn: 240353	2015-06-22 23:07:51 +00:00
Alexander Kornienko	3d9d929e42	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: $ tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ work/llvm/tools/clang To reduce churn, not touching namespaces spanning less than 10 lines. llvm-svn: 240270	2015-06-22 09:47:44 +00:00
Ahmed Bougacha	94df730f7d	[CodeGen][NEON] Emit constants for "immediate" intrinsic arguments. On ARM/AArch64, we currently always use EmitScalarExpr for the immediate builtin arguments, instead of directly emitting the constant. When the overflow sanitizer is enabled, this generates overflow intrinsics instead of constants, breaking assumptions in various places. Instead, use the knowledge of "immediates" to directly emit a constant: - teach the tablegen backend to emit the "immediate" modifiers - use those modifiers in the NEON CodeGen, on ARM and AArch64. Fixes PR23517. Differential Revision: http://reviews.llvm.org/D10045 llvm-svn: 239002	2015-06-04 01:43:41 +00:00
Benjamin Kramer	3204b152b5	Replace push_back(Constructor(foo)) with emplace_back(foo) for non-trivial types If the type isn't trivially moveable emplace can skip a potentially expensive move. It also saves a couple of characters. Call sites were found with the ASTMatcher + some semi-automated cleanup. memberCallExpr( argumentCountIs(1), callee(methodDecl(hasName("push_back"))), on(hasType(recordDecl(has(namedDecl(hasName("emplace_back")))))), hasArgument(0, bindTemporaryExpr( hasType(recordDecl(hasNonTrivialDestructor())), has(constructExpr()))), unless(isInTemplateInstantiation())) No functional change intended. llvm-svn: 238601	2015-05-29 19:42:19 +00:00
Craig Topper	bccb773ebc	[TableGen] Clang changes for r235697 to stop leaking Expanders and Operators in SetTheory. llvm-svn: 235698	2015-04-24 06:53:50 +00:00
Benjamin Kramer	8017237277	Remove empty non-virtual destructors or mark them =default when non-public These add no value but can make a class non-trivially copyable. NFC. llvm-svn: 234689	2015-04-11 15:58:30 +00:00
Alexander Kornienko	34eb20725d	Use 'override/final' instead of 'virtual' for overridden methods Summary: The patch is generated using clang-tidy misc-use-override check. This command was used: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py \ -checks='-*,misc-use-override' -header-filter='llvm\|clang' -j=32 -fix Reviewers: dblaikie Reviewed By: dblaikie Subscribers: klimek, cfe-commits Differential Revision: http://reviews.llvm.org/D8926 llvm-svn: 234678	2015-04-11 02:00:23 +00:00
James Dennett	fa24549492	Fix a call to std::unique to actually discard the trailing (junk) elements. Found by inspection. (No other instances of this problem were found.) llvm-svn: 234221	2015-04-06 21:09:24 +00:00
Alexander Kornienko	6ee521c7eb	Replace size() calls on containers with empty() calls where appropriate. NFC http://reviews.llvm.org/D7090 Patch by Gábor Horváth! llvm-svn: 226914	2015-01-23 15:36:10 +00:00
Chandler Carruth	575bc3ba62	[cleanup] Re-sort the #include lines using llvm/utils/sort_includes.py No functionality changed, this is a purely mechanical cleanup to ensure the #include order remains consistent across the project. llvm-svn: 225975	2015-01-14 11:23:58 +00:00
Craig Topper	5fc8fc2d31	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created. llvm-svn: 216528	2014-08-27 06:28:36 +00:00
Alp Toker	958027b698	Fix typos Also consolidate 'backward compatibility' llvm-svn: 212974	2014-07-14 19:42:55 +00:00
James Molloy	b452f78ad2	[ARM-BE] Generate correct NEON intrinsics for big endian systems. The NEON intrinsics in arm_neon.h are designed to work on vectors "as-if" loaded by (V)LDR. We load vectors "as-if" (V)LD1, so the intrinsics are currently incorrect. This patch adds big-endian versions of the intrinsics that does the "obvious but dumb" thing of reversing all vector inputs and all vector outputs. This will produce extra REVs, but we trust the optimizer to remove them. llvm-svn: 211893	2014-06-27 11:53:35 +00:00
Craig Topper	0039f3f060	Replace some assert(0)'s with llvm_unreachable. llvm-svn: 211139	2014-06-18 03:57:25 +00:00
Craig Topper	c7193c48d9	Convert assert(0) to llvm_unreachable to silence a warning about Addend being uninitialized in default case. llvm-svn: 211138	2014-06-18 03:13:41 +00:00
James Molloy	dee4ab08ba	Rewrite ARM NEON intrinsic emission completely. There comes a time in the life of any amateur code generator when dumb string concatenation just won't cut it any more. For NeonEmitter.cpp, that time has come. There were a bunch of magic type codes which meant different things depending on the context. There were a bunch of special cases that really had no reason to be there but the whole thing was so creaky that removing them would cause something weird to fall over. There was a 1000 line switch statement for code generation involving string concatenation, which actually did lexical scoping to an extent (!!) with a bunch of semi-repeated cases. I tried to refactor this three times in three different ways without success. The only way forward was to rewrite the entire thing. Luckily the testing coverage on this stuff is absolutely massive, both with regression tests and the "emperor" random test case generator. The main change is that previously, in arm_neon.td a bunch of "Operation"s were defined with special names. NeonEmitter.cpp knew about these Operations and would emit code based on a huge switch. Actually this doesn't make much sense - the type information was held as strings, so type checking was impossible. Also TableGen's DAG type actually suits this sort of code generation very well (surprising that...) So now every operation is defined in terms of TableGen DAGs. There are a bunch of operators to use, including "op" (a generic unary or binary operator), "call" (to call other intrinsics) and "shuffle" (take a guess...). One of the main advantages of this apart from making it more obvious what is going on, is that we have proper type inference. This has two obvious advantages: 1) TableGen can error on bad intrinsic definitions easier, instead of just generating wrong code. 2) Calls to other intrinsics are typechecked too. So we no longer need to work out whether the thing we call needs to be the Q-lane version or the D-lane version - TableGen knows that itself! Here's an example: before: case OpAbdl: { std::string abd = MangleName("vabd", typestr, ClassS) + "(__a, __b)"; if (typestr[0] != 'U') { // vabd results are always unsigned and must be zero-extended. std::string utype = "U" + typestr.str(); s += "(" + TypeString(proto[0], typestr) + ")"; abd = "(" + TypeString('d', utype) + ")" + abd; s += Extend(utype, abd) + ";"; } else { s += Extend(typestr, abd) + ";"; } break; } after: def OP_ABDL : Op<(cast "R", (call "vmovl", (cast $p0, "U", (call "vabd", $p0, $p1))))>; As an example of what happens if you do something wrong now, here's what happens if you make $p0 unsigned before the call to "vabd" - that is, $p0 -> (cast "U", $p0): arm_neon.td:574:1: error: No compatible intrinsic found - looking up intrinsic 'vabd(uint8x8_t, int8x8_t)' Available overloads: - float64x2_t vabdq_v(float64x2_t, float64x2_t) - float64x1_t vabd_v(float64x1_t, float64x1_t) - float64_t vabdd_f64(float64_t, float64_t) - float32_t vabds_f32(float32_t, float32_t) ... snip ... This makes it seriously easy to work out what you've done wrong in fairly nasty intrinsics. As part of this I've massively beefed up the documentation in arm_neon.td too. Things still to do / on the radar: - Testcase generation. This was implemented in the previous version and not in the new one, because - Autogenerated tests are not being run. The testcase in test/ differs from the autogenerated version. - There were a whole slew of special cases in the testcase generation that just felt (and looked) like hacks. If someone really feels strongly about this, I can try and reimplement it too. - Big endian. That's coming soon and should be a very small diff on top of this one. llvm-svn: 211101	2014-06-17 13:11:27 +00:00
Craig Topper	8ae1203992	[C++11] Use 'nullptr'. llvm-svn: 208163	2014-05-07 06:21:57 +00:00
Tim Northover	87da936164	ARM NEON: add _f16 support to a couple of vector-shuffling intrinsics. llvm-svn: 202137	2014-02-25 11:13:42 +00:00
Kevin Qin	ad64f6d4e5	[AArch64] Change int64_t from 'long long int' to 'long int' for AArch64 target. Most 64-bit targets define int64_t as long int, and AArch64 should make same definition to follow LP64 model. In GNU tool chain, int64_t is defined as long int for 64-bit target. So to get consistent with GNU, it's better Changing int64_t from 'long long int' to 'long int', otherwise clang will get different name mangling suffix compared with g++. llvm-svn: 202004	2014-02-24 02:45:03 +00:00
Tim Northover	db3e5e2408	AArch64: look up EmitAArch64Scalar support before calling. This fixes one immediate bug where an expression with side-effects could be emitted twice during a NEON call. It also prepares the way for folding CodeGen for many of the SISD intrinsics into a table, reducing code size and hopefully increasing performance eventually ("binary search + few switch cases" should be better than "lots of switch cases"). llvm-svn: 201667	2014-02-19 11:55:06 +00:00
Tim Northover	2163a0e497	ARM & AArch64: move struct definition outside function. Apparently it's not True C++. rdar://problem/16035743 still. llvm-svn: 201663	2014-02-19 10:56:23 +00:00
Tim Northover	544e79eb30	ARM NEON: use more flexible TableGen field for defs. We used to have special handling for isCrypto and isA64 bits in the NeonEmitter.cpp file (it knew the former was predicated on __ARM_FEATURE_CRYPTO and the latter on __aarch64__ and went through various contortions to make sure the correct intrinsics were emitted under the correct guard. This is ugly and has obvious scalability problems (e.g. vcvtX intrinsics are needed, which are ARMv8 only but available on both, yet another category). This patch moves the #if predicate into the arm_neon.td file directly and makes NeonEmitter.cpp agnostic about what goes in there. It also deduplicates arm_neon.td so that each desired intrinsic is mentioned in just one place (necessary because of the new mechanism for creating arm_neon.h). rdar://problem/16035743 llvm-svn: 201660	2014-02-19 10:37:09 +00:00

1 2 3

116 Commits