llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Parzyszek	783e28a508	[Hexagon] Split pair-based masked memops	2020-09-10 14:24:42 -05:00
Krzysztof Parzyszek	0ee54cf883	[Hexagon] Account for truncating pairs to non-pairs when widening truncates Added missing selection patterns for vpackl.	2020-09-09 14:31:52 -05:00
Krzysztof Parzyszek	d183f47261	[Hexagon] Handle widening of truncation's operand with legal result Failing example: v8i8 = truncate v8i32. v8i8 is legal, but v8i32 was widened to HVX. Make sure that v8i8 does not get altered (even if it's changed to another legal type).	2020-09-08 16:07:39 -05:00
Krzysztof Parzyszek	9518f032e4	[Hexagon] When widening truncate result, also widen operand if necessary	2020-09-05 18:19:32 -05:00
Krzysztof Parzyszek	8789f2bbde	[Hexagon] Resize the mem operand when widening loads and stores	2020-09-05 18:17:48 -05:00
Krzysztof Parzyszek	1387f96ab3	[Hexagon] Handle widening of vector truncate	2020-09-05 15:07:38 -05:00
Krzysztof Parzyszek	69fac677bc	[Hexagon] Fix perfect shuffle generation for single vectors Perfect shuffle instruction (vdealvdd/vshuffvdd) work on vector pairs. When given a single input vector, half of it first needs to be transposed into the other vector before the generated shuffles can take effect. Also the first transpose needs to be undone at the end (this last step was missing).	2020-08-30 06:43:16 -05:00
Krzysztof Parzyszek	4ef9275b9b	[Hexagon] Emit better 32-bit multiplication sequence for HVXv62+	2020-08-27 15:24:32 -05:00
Krzysztof Parzyszek	154daf1f94	[Hexagon] Widen short vector stores to HVX vectors using masked stores Also invent a flag -hexagon-hvx-widen=N to set the minimum threshold for widening short vectors to HVX vectors.	2020-08-27 09:25:08 -05:00
Krzysztof Parzyszek	e15143d31b	[Hexagon] Implement llvm.masked.load and llvm.masked.store for HVX	2020-08-26 13:10:22 -05:00
Arthur Eubanks	f50b3ff02e	[Hexagon] Use InstSimplify instead of ConstantProp This is the last remaining use of ConstantProp, migrate it to InstSimplify in the goal of removing ConstantProp. Add -hexagon-instsimplify option to enable skipping of instsimplify in tests that can't handle the extra optimization. Differential Revision: https://reviews.llvm.org/D85047	2020-08-04 15:42:39 -07:00
Krzysztof Parzyszek	c8bfed05e2	Reland `7691790dfd` with a MSAN fix In some cases when HexagonTargetLowering::allowsMemoryAccess returned true, it did not set the "Fast" argument, leaving it uninitialized. [Hexagon] Improve casting of boolean HVX vectors to scalars - Mark memory access for bool vectors as disallowed in target lowering. This will prevent combining bitcasts of bool vectors with stores. - Replace the actual bitcasting code with a faster version. - Handle casting of v16i1 to i16.	2020-02-28 08:32:58 -06:00
Kirill Bobyrev	014728413f	Revert "[Hexagon] Improve casting of boolean HVX vectors to scalars" This reverts commit `7691790dfd`. The patch is failing tests with MSAN: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/39054/steps/check-llvm%20msan/logs/stdio	2020-02-27 11:58:32 +01:00
Krzysztof Parzyszek	7691790dfd	[Hexagon] Improve casting of boolean HVX vectors to scalars - Mark memory access for bool vectors as disallowed in target lowering. This will prevent combining bitcasts of bool vectors with stores. - Replace the actual bitcasting code with a faster version. - Handle casting of v16i1 to i16.	2020-02-26 12:46:52 -06:00
Ikhlas Ajbar	a8a4f99afb	[Hexagon] Lower bitcast of a vector predicate This patch lowers bitcast of vector predicate of type v32i1/v64i1 to i32/i64 type.	2020-02-24 15:25:51 -06:00
Krzysztof Parzyszek	c51b0bede8	[Hexagon] Introduce noop intrinsic to cast between vector predicate types The (overloaded) intrinsic is llvm.hexagon.V6.pred.typecast[.128B]. The types of the operand and the return value are HVX boolean vector types. For each cast, there needs to be a corresponding intrinsic declared, with different suffixes appended to the name, e.g. ; cast <128 x i1> to <32 x i1> declare <32 x i1> @llvm.hexagon.V6.pred.typecast.128B.s1(<128 x i1>) ; cast <32 x i1> to <64 x i1> declare <64 x i1> @llvm.hexagon.V6.pred.typecast.128B.s2(<32 x i1>) etc.	2020-02-21 07:37:59 -06:00
Krzysztof Parzyszek	b1d47467e2	[Hexagon] Change HVX vector predicate types from v512/1024i1 to v64/128i1 This commit removes the artificial types <512 x i1> and <1024 x i1> from HVX intrinsics, and makes v512i1 and v1024i1 no longer legal on Hexagon. It may cause existing bitcode files to become invalid. * Converting between vector predicates and vector registers must be done explicitly via vandvrt/vandqrt instructions (their intrinsics), i.e. (for 64-byte mode): %Q = call <64 x i1> @llvm.hexagon.V6.vandvrt(<16 x i32> %V, i32 -1) %V = call <16 x i32> @llvm.hexagon.V6.vandqrt(<64 x i1> %Q, i32 -1) The conversion intrinsics are: declare <64 x i1> @llvm.hexagon.V6.vandvrt(<16 x i32>, i32) declare <128 x i1> @llvm.hexagon.V6.vandvrt.128B(<32 x i32>, i32) declare <16 x i32> @llvm.hexagon.V6.vandqrt(<64 x i1>, i32) declare <32 x i32> @llvm.hexagon.V6.vandqrt.128B(<128 x i1>, i32) They are all pure. * Vector predicate values cannot be loaded/stored directly. This directly reflects the architecture restriction. Loading and storing or vector predicates must be done indirectly via vector registers and explicit conversions via vandvrt/vandqrt instructions.	2020-02-19 14:14:56 -06:00
Krzysztof Parzyszek	2b5d7e93dd	[MVT] Add v256i1 to MachineValueType This type can show up when lowering some HVX vector code on Hexagon. llvm-svn: 372403	2019-09-20 15:19:20 +00:00
Krzysztof Parzyszek	8460301d58	[Hexagon] Generate vector min/max for HVX llvm-svn: 369014	2019-08-15 16:13:17 +00:00
Simon Pilgrim	d395bc1cc2	[Hexagon] Remove fcmp undef from reduced tests Pre-commit for D60006 (Add fcmp UNDEF handling to SelectionDAG::FoldSetCC) Approved by @kparzysz (Krzysztof Parzyszek) llvm-svn: 357301	2019-03-29 19:14:52 +00:00
Simon Pilgrim	55e1330eda	[Hexagon] Remove icmp undef from reduced tests Pre-commit for D59363 (Add icmp UNDEF handling to SelectionDAG::FoldSetCC) Approved by @kparzysz (Krzysztof Parzyszek) llvm-svn: 356267	2019-03-15 15:07:44 +00:00
Krzysztof Parzyszek	6128ac5a8f	[Hexagon] Split vector pairs for ISD::SIGN_EXTEND and ISD::ZERO_EXTEND llvm-svn: 354473	2019-02-20 15:05:19 +00:00
Sanjay Patel	f24900b934	[DAGCombiner] allow hoisting vector bitwise logic ahead of truncates The transform performs a bitwise logic op in a wider type followed by truncate when both inputs are truncated from the same source type: logic_op (truncate x), (truncate y) --> truncate (logic_op x, y) There are a bunch of other checks that should prevent doing this when it might be harmful. We already do this transform for scalars in this spot. The vector limitation was shared with a check for the case when the operands are extended. I'm not sure if that limit is needed either, but that would be a separate patch. Differential Revision: https://reviews.llvm.org/D55448 llvm-svn: 349303	2018-12-16 14:57:04 +00:00
Sanjay Patel	25fc03c5c0	[Hexagon] make test immune to scalarization improvements; NFC llvm-svn: 349163	2018-12-14 17:23:01 +00:00
Sanjay Patel	08c0a0ac58	[Hexagon] make test immune to improvements in undef simplification llvm-svn: 347218	2018-11-19 15:34:09 +00:00
Stanislav Mekhanoshin	0ff7c8309d	DAG combiner: fold (select, C, X, undef) -> X Differential Revision: https://reviews.llvm.org/D54646 llvm-svn: 347110	2018-11-16 23:13:38 +00:00
Krzysztof Parzyszek	a6d4fc0e29	[Hexagon] Use shuffles when lowering "gather" shufflevectors Shufflevector instructions in LLVM IR that extract a subset of elements of a longer input into a shorter vector can be done using VECTOR_SHUFFLEs. This will avoid expanding them into constly extracts and inserts. llvm-svn: 342091	2018-09-12 22:14:52 +00:00
Krzysztof Parzyszek	2ff9aa15e4	[Hexagon] Enable interleaving in loop vectorizer llvm-svn: 340447	2018-08-22 20:15:04 +00:00
Krzysztof Parzyszek	d91a9e27a9	[Hexagon] Simplify CFG after atomic expansion This will remove suboptimal branching from the generated ll/sc loops. The extra simplification pass affects a lot of testcases, which have been modified to accommodate this change: either by modifying the test to become immune to the CFG simplification, or (less preferablt) by adding option -hexagon-initial-cfg-clenaup=0. llvm-svn: 338774	2018-08-02 22:17:53 +00:00
Krzysztof Parzyszek	bea23d065e	[Hexagon] Make floating point operations expensive for vectorization llvm-svn: 334508	2018-06-12 15:12:50 +00:00
Krzysztof Parzyszek	c1e712baa5	[Hexagon] Implement vector-pair zero as V6_vsubw_dv llvm-svn: 334123	2018-06-06 19:34:40 +00:00
Krzysztof Parzyszek	0da1fe3770	[Hexagon] Split CTPOP of vector pairs llvm-svn: 334109	2018-06-06 18:03:29 +00:00
Krzysztof Parzyszek	aec2c0c9b6	[Hexagon] Select HVX code for vector CTPOP, CTLZ, and CTTZ llvm-svn: 333760	2018-06-01 14:52:58 +00:00
Krzysztof Parzyszek	8987174627	[Hexagon] Use vector align-left when shift amount fits in 3 bits This saves an instruction because for align-right the shift amount would need to be put in a register first. llvm-svn: 333543	2018-05-30 13:45:34 +00:00
Krzysztof Parzyszek	95b073525b	[Hexagon] Fix packing source vectors in shufflevector selection When the shuffle mask selected a subvector of the second input vector, and aligning of the source was performed, the shuffle mask was updated incorrectly, resulting in an ICE further in the selection process. llvm-svn: 333279	2018-05-25 14:53:14 +00:00
Krzysztof Parzyszek	840b02bccf	[Hexagon] Add patterns for accumulating HVX compares llvm-svn: 333009	2018-05-22 18:27:02 +00:00
Krzysztof Parzyszek	e8a0ae7346	[Hexagon] Mark HVX vector predicate bitwise ops as legal, add patterns llvm-svn: 332525	2018-05-16 21:00:24 +00:00
Krzysztof Parzyszek	cff73a2118	[Hexagon] Add patterns for vector shift-and-accumulate llvm-svn: 331918	2018-05-09 21:10:41 +00:00
Krzysztof Parzyszek	41a24b7b13	[Hexagon] Improve HVX instruction selection (bitcast, vsplat) There was some unfortunate interaction between VSPLAT and BITCAST related to the selection of constant vectors (coming from selecting shuffles). Introduce VSPLATW that always splats a 32-bit word, and can have arbitrary result type (to avoid BITCASTs of VSPLAT). Clean up the previous selection of BITCAST/VSPLAT. llvm-svn: 330471	2018-04-20 19:38:37 +00:00
Krzysztof Parzyszek	2a9a83cd3f	[Hexagon] Use legal types when lowering CONCAT_VECTORS via BUILD_VECTOR llvm-svn: 330344	2018-04-19 17:11:58 +00:00
Krzysztof Parzyszek	d92c37e090	[Hexagon] Generate code for vector bswap intrinsics llvm-svn: 330333	2018-04-19 14:46:44 +00:00
Krzysztof Parzyszek	0375cd46ef	[Hexagon] Implement TTI::shouldMaximizeVectorBandwidth llvm-svn: 328648	2018-03-27 18:10:47 +00:00
Krzysztof Parzyszek	65059ee284	[Hexagon] Add heuristic to exclude critical path cost for scheduling Patch by Brendon Cahoon. llvm-svn: 328022	2018-03-20 19:26:27 +00:00
Krzysztof Parzyszek	dca383123f	[Hexagon] Improve scheduling based on register pressure Patch by Brendon Cahoon. llvm-svn: 327975	2018-03-20 12:28:43 +00:00
Krzysztof Parzyszek	480ab2bbc4	[Hexagon] Ignore indexed loads when handling unaligned loads llvm-svn: 327037	2018-03-08 18:15:13 +00:00
Krzysztof Parzyszek	2c3edf0567	[Hexagon] Rewrite non-HVX unaligned loads as pairs of aligned ones This is a follow-up to r325169, this time for all types, not just HVX vector types. Disable this by default, since it's not always safe. llvm-svn: 326915	2018-03-07 17:27:18 +00:00
Krzysztof Parzyszek	e3e963236a	[Hexagon] Generate valignb for shifting shuffles (instead of vdelta) llvm-svn: 326627	2018-03-02 22:22:19 +00:00
Jonas Paulsson	77cdf3881c	[Hexagon] Return true in enableMultipleCopyHints(). Enable multiple COPY hints to eliminate more COPYs during register allocation. Note that this is something all targets should do, see https://reviews.llvm.org/D38128. Review: Krzysztof Parzyszek llvm-svn: 325697	2018-02-21 16:37:45 +00:00
Krzysztof Parzyszek	ad83ce4cb4	[Hexagon] Split HVX vector pair loads/stores, expand unaligned loads llvm-svn: 325169	2018-02-14 20:46:06 +00:00
Krzysztof Parzyszek	9b48e8d233	[Hexagon] Add code to select QTRUE and QFALSE Fixes http://llvm.org/PR36320. llvm-svn: 324763	2018-02-09 19:10:46 +00:00

1 2

82 Commits