llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Parzyszek	8a7fb0fe51	[Hexagon] Skip mux generation when predicate register is undefined llvm-svn: 305014	2017-06-08 20:56:36 +00:00
Krzysztof Parzyszek	5ba13825f0	[Hexagon] Generate 'inbounds' GEPs in HexagonCommonGEP llvm-svn: 304937	2017-06-07 20:04:33 +00:00
Krzysztof Parzyszek	066e8b56a0	[Hexagon] Return 0 from getDotNewPredOp when .new opcode does not exist This allows using this function to test if an instruction can be converted to a .new form. llvm-svn: 304549	2017-06-02 14:07:06 +00:00
Krzysztof Parzyszek	3cf16576d5	[Hexagon] Fix dependence check in the packetizer An incorrect check in the packetizer lead to an attempt to convert an unconditional branch to a .new (conditional) form. llvm-svn: 304442	2017-06-01 18:02:40 +00:00
Krzysztof Parzyszek	51fd5405d5	[Hexagon] Handle long-running simplification loop in idiom recognition The initial assumption was that the simplification would converge to a fixed point relatvely quickly. Turns out that there are legitimate situa- tions where the complexity of the code causes it to take a large number of iterations. Two main changes: - Instead of aborting upon hitting the limit, simply return nullptr. - Reduce the limit to 10,000 from 100,000. llvm-svn: 304441	2017-06-01 18:00:47 +00:00
Krzysztof Parzyszek	ef58017b35	[Hexagon] Improve code generation for 32x32-bit multiplication For multiplications of 64-bit values (giving 64-bit result), detect cases where the arguments are sign-extended 32-bit values, on a per- operand basis. This will allow few patterns to match a wider variety of combinations in which extensions can occur. llvm-svn: 304223	2017-05-30 17:47:51 +00:00
Matthias Braun	868bbd4022	ScheduleDAGInstrs: Fix fixupKills() Rewrite fixupKills() to use the LivePhysRegs class. Simplifies the code and fixes a bug where the CSR registers in return blocks where missed leading to invalid kill flags. Also remove the unnecessary rule that we wouldn't set kill flags on tied operands. No tests as I have an upcoming commit improving MachineVerifier checks to catch these cases in multiple existing lit tests. llvm-svn: 304055	2017-05-27 02:50:50 +00:00
Serge Pavlov	d526b13e61	Add extra operand to CALLSEQ_START to keep frame part set up previously Using arguments with attribute inalloca creates problems for verification of machine representation. This attribute instructs the backend that the argument is prepared in stack prior to CALLSEQ_START..CALLSEQ_END sequence (see http://llvm.org/docs/InAlloca.htm for details). Frame size stored in CALLSEQ_START in this case does not count the size of this argument. However CALLSEQ_END still keeps total frame size, as caller can be responsible for cleanup of entire frame. So CALLSEQ_START and CALLSEQ_END keep different frame size and the difference is treated by MachineVerifier as stack error. Currently there is no way to distinguish this case from actual errors. This patch adds additional argument to CALLSEQ_START and its target-specific counterparts to keep size of stack that is set up prior to the call frame sequence. This argument allows MachineVerifier to calculate actual frame size associated with frame setup instruction and correctly process the case of inalloca arguments. The changes made by the patch are: - Frame setup instructions get the second mandatory argument. It affects all targets that use frame pseudo instructions and touched many files although the changes are uniform. - Access to frame properties are implemented using special instructions rather than calls getOperand(N).getImm(). For X86 and ARM such replacement was made previously. - Changes that reflect appearance of additional argument of frame setup instruction. These involve proper instruction initialization and methods that access instruction arguments. - MachineVerifier retrieves frame size using method, which reports sum of frame parts initialized inside frame instruction pair and outside it. The patch implements approach proposed by Quentin Colombet in https://bugs.llvm.org/show_bug.cgi?id=27481#c1. It fixes 9 tests failed with machine verifier enabled and listed in PR27481. Differential Revision: https://reviews.llvm.org/D32394 llvm-svn: 302527	2017-05-09 13:35:13 +00:00
Krzysztof Parzyszek	d0c71ef8ab	[RDF] Remove covered parts of reached uses for phi and use in same block llvm-svn: 302305	2017-05-05 22:10:32 +00:00
Krzysztof Parzyszek	31d4b3b247	Remove stale live-ins in the branch folder Hoisting common code can cause registers that live-in in the successor blocks to no longer be live-in. The live-in information needs to be updated to reflect this, or otherwise incorrect code can be generated later on. Differential Revision: https://reviews.llvm.org/D32661 llvm-svn: 302228	2017-05-05 12:20:07 +00:00
Krzysztof Parzyszek	2af5037d34	[Hexagon] Use automatically-generated scheduling information for HVX Patch by Jyotsna Verma. llvm-svn: 302073	2017-05-03 20:10:36 +00:00
Krzysztof Parzyszek	4763c2d999	[Hexagon] Adjust latency between allocframe and the first store on stack Allocframe and the following stores on the stack have a latency of 2 cycles when not in the same packet. This happens because R29 is needed early by the store instruction. Since one of such stores can be packetized along with allocframe and use old value of R29, we can assign it 0 cycle latency while leaving latency of other stores to the default value of 2 cycles. Patch by Jyotsna Verma. llvm-svn: 302034	2017-05-03 15:33:09 +00:00
Krzysztof Parzyszek	a750383d0f	[Hexagon] Add extenders for GD_PLT_B22_PCREL and LD_PLT_B22_PCREL Patch by Sid Manning. llvm-svn: 301955	2017-05-02 18:15:33 +00:00
Krzysztof Parzyszek	9aaf923376	[Hexagon] Don't ignore mult-cycle latency information The compiler was generating code that ends up ignoring a multiple latency dependence between two instructions by scheduling the intructions in back-to-back packets. The packetizer needs to end a packet if the latency of the current current insruction and the source in the previous packet is greater than 1 cycle. This case occurs when there is still room in the current packet, but scheduling the instruction causes a stall. Instead, the packetizer should start a new packet. Also, if the current packet already contains a stall, then it is okay to add another instruction to the packet that also causes a stall. This occurs when there are no instructions that can be scheduled in between the producer and consumer instructions. This patch changes the latency for loads to 2 cycles from 3 cycles. This change refects that a load only needs to be separated by one extra packet to eliminate the stall. Patch by Ikhlas Ajbar. llvm-svn: 301954	2017-05-02 18:12:19 +00:00
Krzysztof Parzyszek	072ddb383c	[RDF] Correctly calculate lane masks for defs llvm-svn: 301700	2017-04-28 21:57:53 +00:00
Krzysztof Parzyszek	2065a2f4e6	Properly handle PHIs with subregisters in UnreachableBlockElim When a PHI operand has a subregister, create a COPY instead of simply replacing the PHI output with the input it. Differential Revision: https://reviews.llvm.org/D32650 llvm-svn: 301699	2017-04-28 21:56:33 +00:00
Krzysztof Parzyszek	0b3acbb1dd	[Hexagon] Do not move a block if it is on a fall-through path llvm-svn: 301698	2017-04-28 21:54:11 +00:00
Krzysztof Parzyszek	333b2bf2ed	[Hexagon] Generate proper offset in opt-addr-mode Also, make a few changes to allow using the pass in .mir testcases. Among other things, change the abbreviation from opt-amode to amode-opt, because otherwise lit would expand the "opt" part to the full path to the opt binary. llvm-svn: 300707	2017-04-19 15:15:51 +00:00
Matt Arsenault	f10061ec70	Add address space mangling to lifetime intrinsics In preparation for allowing allocas to have non-0 addrspace. llvm-svn: 299876	2017-04-10 20:18:21 +00:00
Krzysztof Parzyszek	2182b4b7b3	[Hexagon] Use -mattr to select HVX mode in a testcase, NFC llvm-svn: 299582	2017-04-05 19:46:37 +00:00
Krzysztof Parzyszek	b326411fdc	[Hexagon] Fix typo in HexagonEarlyIfCConv.cpp Found by PVS-Studio. Fixes llvm.org/PR32480. llvm-svn: 299258	2017-03-31 20:36:00 +00:00
Krzysztof Parzyszek	10fbac009d	[Hexagon] Avoid infinite loops in HexagonLoopIdiomRecognition - Avoid explosive growth of the simplification queue by not queuing expressions that are alredy in it. - Add an iteration counter and abort after a sufficiently large number of iterations (assuming that it's a symptom of an infinite loop). llvm-svn: 298655	2017-03-23 23:01:22 +00:00
Krzysztof Parzyszek	d033d1fd82	Recommit r298282 with fixes for memory allocation/deallocation [Hexagon] Recognize polynomial-modulo loop idiom again Regain the ability to recognize loops calculating polynomial modulo operation. This ability has been lost due to some changes in the preceding optimizations. Add code to preprocess the IR to a form that the pattern matching code can recognize. llvm-svn: 298400	2017-03-21 17:09:27 +00:00
Krzysztof Parzyszek	5e7f06f354	[Hexagon] Add -march=hexagon to a testcase llvm-svn: 298395	2017-03-21 16:59:40 +00:00
Vitaly Buka	c12716e742	Revert "[Hexagon] Recognize polynomial-modulo loop idiom again" Fix memory leaks on check-llvm tests detected by Asan. This reverts commit r298282. llvm-svn: 298329	2017-03-21 00:59:51 +00:00
Krzysztof Parzyszek	8490251de3	[Hexagon] Recognize polynomial-modulo loop idiom again Regain the ability to recognize loops calculating polynomial modulo operation. This ability has been lost due to some changes in the preceding optimizations. Add code to preprocess the IR to a form that the pattern matching code can recognize. llvm-svn: 298282	2017-03-20 18:12:58 +00:00
Krzysztof Parzyszek	0e7b1f83b7	[RDF] Remove the map of reaching defs from copy propagation Use Liveness::getNearestAliasedRef to find the reaching def instead. llvm-svn: 297526	2017-03-10 22:44:24 +00:00
Krzysztof Parzyszek	544210304f	[Hexagon] Fixes to the bitsplit generation - Fix the insertion point, which occasionally could have been incorrect. - Avoid creating multiple bitsplits with the same operands, if an old one could be reused. llvm-svn: 297414	2017-03-09 22:02:14 +00:00
Krzysztof Parzyszek	78c4fcf12e	[Hexagon] Propagate zext of i1 into arithmetic code in selection DAG (op ... (zext i1 c) ...) -> (select c (op ... 1 ...), (op ... 0 ...)) llvm-svn: 297391	2017-03-09 16:29:30 +00:00
Krzysztof Parzyszek	1b7197e690	[Hexagon] Use correct offset when extracting from the high word When extracting a bitfield from the high register in a register pair, the final offset should be relative to the high register (for 32-bit extracts). llvm-svn: 297288	2017-03-08 15:46:28 +00:00
Krzysztof Parzyszek	434d50a796	[Hexagon] Check for presence before looking registers up in bit tracker llvm-svn: 297240	2017-03-07 23:12:04 +00:00
Krzysztof Parzyszek	8e4d2e0512	[Hexagon] Generate bitsplit instruction llvm-svn: 297239	2017-03-07 23:08:35 +00:00
Krzysztof Parzyszek	3cceffb752	[Hexagon] Do not insert instructions before PHI nodes llvm-svn: 297141	2017-03-07 14:20:19 +00:00
Krzysztof Parzyszek	9e60e51a71	Revert r297039, it's causing some mysterious buildbot failures llvm-svn: 297062	2017-03-06 20:24:21 +00:00
Krzysztof Parzyszek	5b8fae5edd	[IfConversion] Only renormalize probabilities if branches are analyzable If a block has non-analyzable branches, the listed successors don't need to add up to one. For example, if a block has a conditional tail call, that tail call will not have a corresponding successor in the successor list, but will still be a possible branch. Differential Revision: https://reviews.llvm.org/D30556 llvm-svn: 297054	2017-03-06 19:12:42 +00:00
Krzysztof Parzyszek	03c5c21568	[TableGen] Ensure proper ordering of subtarget feature names llvm-svn: 297039	2017-03-06 18:08:37 +00:00
Krzysztof Parzyszek	8a4c601abc	[Hexagon] Early-if-convert branches that may exit the loop Merge the tail block into the loop in cases where the main loop body exits early, subject to profitability constraints. This will coalesce the loop body into fewer blocks. For example: loop: loop: // loop body // loop body if (...) jump exit --> // more body more: if (...) jump exit // more body jump loop jump loop llvm-svn: 297033	2017-03-06 17:24:04 +00:00
Krzysztof Parzyszek	e16ce15687	[Hexagon] Mark dead defs as <dead> in expand-condsets The code in updateDeadFlags removed unnecessary <dead> flags, but there can be cases where such a flag is not set, and yet a register has become dead. For example, if a mux with identical inputs is replaced with a COPY, the predicate register may no longer be used after that. llvm-svn: 297032	2017-03-06 17:09:06 +00:00
Krzysztof Parzyszek	143158b72e	[Hexagon] Pick a dot-old instruction that matches the architecture llvm-svn: 297031	2017-03-06 17:03:16 +00:00
Krzysztof Parzyszek	e720feb1c6	[Hexagon] Pick the right branch opcode depending on branch probabilities Specifically, pick the opcode with the correct branch prediction, i.e. jump:t or jump:nt. llvm-svn: 296821	2017-03-02 21:49:49 +00:00
Krzysztof Parzyszek	056c945a5d	[Hexagon] Skip blocks that define vector predicate registers in early-if llvm-svn: 296777	2017-03-02 18:10:59 +00:00
Krzysztof Parzyszek	fcbb7d10fe	[Hexagon] Properly handle 'q' constraint in 128-byte vector mode llvm-svn: 296772	2017-03-02 17:50:24 +00:00
Krzysztof Parzyszek	8f23dd6d68	[Hexagon] Fix lowering of formal arguments of type i1 On Hexagon, values of type i1 are passed in registers of type i32, even though i1 is not a legal value for these registers. This is a special case and needs special handling to maintain consistency of the lowering information. This fixes PR32089. llvm-svn: 296645	2017-03-01 17:30:10 +00:00
Krzysztof Parzyszek	33fd0bbbe8	[Hexagon] Generate extract instructions more aggressively llvm-svn: 296537	2017-02-28 23:27:33 +00:00
Krzysztof Parzyszek	f208681731	[Hexagon] Fix instruction selection for sign-extending i1 to i64 llvm-svn: 296532	2017-02-28 22:37:01 +00:00
Krzysztof Parzyszek	0d67b10a3c	[Hexagon] Undo shift folding where it could simplify addressing mode For example, avoid (single shift): r0 = and(##536870908,lsr(r0,#3)) r0 = memw(r1+r0<<#0) in favor of (two shifts): r0 = lsr(r0,#5) r0 = memw(r1+r0<<#2) llvm-svn: 296196	2017-02-24 23:34:24 +00:00
Sanjay Patel	832b1622d8	[DAGCombiner] add missing folds for scalar select of {-1,0,1} The motivation for filling out these select-of-constants cases goes back to D24480, where we discussed removing an IR fold from add(zext) --> select. And that goes back to: https://reviews.llvm.org/rL75531 https://reviews.llvm.org/rL159230 The idea is that we should always canonicalize patterns like this to a select-of-constants in IR because that's the smallest IR and the best for value tracking. Note that we currently do the opposite in some cases (like the cases in this patch). Ie, the proposed folds in this patch already exist in InstCombine today: https://github.com/llvm-mirror/llvm/blob/master/lib/Transforms/InstCombine/InstCombineSelect.cpp#L1151 As this patch shows, most targets generate better machine code for simple ext/add/not ops rather than a select of constants. So the follow-up steps to make this less of a patchwork of special-case folds and missing IR canonicalization: 1. Have DAGCombiner convert any select of constants into ext/add/not ops. 2 Have InstCombine canonicalize in the other direction (create more selects). Differential Revision: https://reviews.llvm.org/D30180 llvm-svn: 296137	2017-02-24 17:17:33 +00:00
Krzysztof Parzyszek	128e191eac	[Hexagon] Handle saturations in Hexagon bit tracker llvm-svn: 296026	2017-02-23 22:11:52 +00:00
Krzysztof Parzyszek	2cfc7a48de	[Hexagon] Avoid IMPLICIT_DEFs as new-value producers llvm-svn: 295997	2017-02-23 17:47:34 +00:00
Krzysztof Parzyszek	af5ff65d67	[Hexagon] Patterns for CTPOP, BSWAP and BITREVERSE llvm-svn: 295981	2017-02-23 15:02:09 +00:00

1 2 3 4 5 ...

443 Commits