llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Parzyszek	1fd0c7e598	[Hexagon] Recognize C4_cmpneqi, C4_cmpltei and C4_cmplteui in NewValueJump llvm-svn: 308914	2017-07-24 19:35:48 +00:00
Krzysztof Parzyszek	3ad0d01e9e	[Hexagon] Add inline-asm constraint 'a' for modifier register class For example asm ("memw(%0++%1) = %2" : : "r"(addr),"a"(mod),"r"(val) : "memory") llvm-svn: 308761	2017-07-21 17:51:27 +00:00
Krzysztof Parzyszek	ac01994db9	[Hexagon] Fix a bug in r308502: post-inc offset is always 0 llvm-svn: 308510	2017-07-19 19:17:32 +00:00
Sumanth Gundapaneni	d5aa0f3464	[Hexagon] Emit lookup tables in text section based on a flag The flag "-hexagon-emit-lut-text" (defaulted to false) is added to decide on where to keep the switch generated lookup table. Differential Revision: https://reviews.llvm.org/D34818 llvm-svn: 308316	2017-07-18 15:31:37 +00:00
Mandeep Singh Grang	ed64963f1e	[llvm] Remove redundant check-prefix=CHECK from tests. NFC. Reviewers: t.p.northover, oren_ben_simhon, niravd, mcrosier Reviewed By: oren_ben_simhon, mcrosier Subscribers: nhaehnle, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D35466 llvm-svn: 308193	2017-07-17 17:32:45 +00:00
Krzysztof Parzyszek	5eef92eb7f	[Hexagon] Remove custom lowering of loads of v4i16 The target-independent lowering works fine, except concatenating 32-bit words. Add a pattern to generate A2_combinew instead of 64-bit asl/or. llvm-svn: 308186	2017-07-17 15:45:45 +00:00
Krzysztof Parzyszek	9c084fc55d	[Hexagon] Add intrinsics for data cache operations This is the LLVM part, adding definitions for void @llvm.hexagon.Y2.dccleana(i8) void @llvm.hexagon.Y2.dccleaninva(i8) void @llvm.hexagon.Y2.dcinva(i8) void @llvm.hexagon.Y2.dczeroa(i8) void @llvm.hexagon.Y4.l2fetch(i8, i32) void @llvm.hexagon.Y5.l2fetch(i8, i64) The clang part will follow. llvm-svn: 308032	2017-07-14 15:58:48 +00:00
Krzysztof Parzyszek	f67cd8259d	[Hexagon] Do not rely on callee-saved info in hasFP llvm-svn: 307675	2017-07-11 17:11:54 +00:00
Krzysztof Parzyszek	c86e2ef3f5	[Hexagon] Add support for nontemporal loads and stores on HVX Patch by Michael Wu. Differential Revision: https://reviews.llvm.org/D35104 llvm-svn: 307671	2017-07-11 16:39:33 +00:00
Krzysztof Parzyszek	df4a05d6fb	[Hexagon] Fix check for HMOTF_ConstExtend operand flag This fixes https://llvm.org/PR33718. llvm-svn: 307566	2017-07-10 18:38:52 +00:00
Nirav Dave	65b7ab1be4	[Hexagon] Preclude non-memory test from being optimized away. NFC. llvm-svn: 307153	2017-07-05 13:08:03 +00:00
Krzysztof Parzyszek	9eb75c4520	[Hexagon] Implement frame pointer elimination with -fomit-frame-pointer It applies to leaf functions that are otherwise not required to have a frame pointer. llvm-svn: 306888	2017-06-30 21:21:40 +00:00
Sumanth Gundapaneni	8c5d59557d	[Hexagon] Emit jump tables in text section based on a flag This patch adds a new LLVM flag -hexagon-emit-jt-text which is defaulted to "false". The value "true" emits the switch generated jump tables in text section. Differential Revision: https://reviews.llvm.org/D34820 llvm-svn: 306872	2017-06-30 20:21:48 +00:00
Sumanth Gundapaneni	19b74203b1	Revert "[Hexagon] Guard the generation of lookup table" This reverts commit ae521f4192c3ed0202c047fec993cb59133dd1a0. Wrong commit message llvm-svn: 306871	2017-06-30 20:20:00 +00:00
Sumanth Gundapaneni	cf73758dc8	[Hexagon] Guard the generation of lookup table The llvm flag "-hexagon-emit-lookup-tables" guards the generation of lookup table from a switch statement. Differential Revision: https://reviews.llvm.org/D34819 llvm-svn: 306869	2017-06-30 20:10:28 +00:00
Krzysztof Parzyszek	0089419417	[Hexagon] Keep all phi nodes when building DFG in addr-mode-opt The dead phis are needed for finding correct would-be reaching defs in register propagation. llvm-svn: 306690	2017-06-29 15:55:59 +00:00
Krzysztof Parzyszek	3008594cd4	Missed a check for UndefVI in r306466 llvm-svn: 306553	2017-06-28 15:46:16 +00:00
Krzysztof Parzyszek	0b7688e6c0	Create a PHI value when merging with a known undef live-in Differential Revision: https://reviews.llvm.org/D34640 llvm-svn: 306466	2017-06-27 21:30:46 +00:00
Krzysztof Parzyszek	25173e4cba	[Hexagon] Use proper predicate register state when expanding PS_vselect llvm-svn: 306458	2017-06-27 19:59:46 +00:00
Krzysztof Parzyszek	5ddd2e5899	[Hexagon] Update kills in hexagon-nvj even more properly than before Account for the fact that both the feeder and the compare can be moved over instructions that kill registers. llvm-svn: 306443	2017-06-27 18:37:16 +00:00
Krzysztof Parzyszek	918e6d70bd	[Hexagon] Handle cases when the aligned stack pointer is missing llvm-svn: 306288	2017-06-26 14:17:58 +00:00
Krzysztof Parzyszek	717021772b	Revert "[Hexagon] Handle decreasing of stack alignment in frame lowering" This breaks passing of aligned function arguments. llvm-svn: 306145	2017-06-23 19:47:04 +00:00
Krzysztof Parzyszek	bb2fcd1921	[Hexagon] Handle decreasing of stack alignment in frame lowering llvm-svn: 306124	2017-06-23 16:53:59 +00:00
Krzysztof Parzyszek	9b7c1d2dcf	[Hexagon] Properly update kill flags in HexagonNewValueJump The feeder instruction will be moved to right before the compare, so the updating code should not be looking for kills past the compare. llvm-svn: 306059	2017-06-22 21:11:44 +00:00
Krzysztof Parzyszek	1a0da8d5a3	[Hexagon] Use LivePhysRegs to fix up kills in HexagonGenMux Remove the previous, manual shuffling of the kill flags. llvm-svn: 306054	2017-06-22 20:43:02 +00:00
Krzysztof Parzyszek	9bdb460f64	[Hexagon] Fix typo in a testcase llvm-svn: 306030	2017-06-22 16:25:46 +00:00
Krzysztof Parzyszek	f63ad39e7d	[Hexagon] Handle a global operand to A2_addi when creating duplexes llvm-svn: 306012	2017-06-22 15:53:31 +00:00
Krzysztof Parzyszek	69ffba4595	[Hexagon] Recognize potential offset overflow for store-imm to stack Reserve an extra scavenging stack slot if the offset field in store-immediate instructions may overflow. llvm-svn: 306004	2017-06-22 14:11:23 +00:00
Krzysztof Parzyszek	fd048cc0ec	[Hexagon] Handle more types of immediate operands in expand-condsets llvm-svn: 305943	2017-06-21 19:21:30 +00:00
Krzysztof Parzyszek	3a40b34123	[Hexagon] Don't kill live registers when creating mux out of tfr The second part of r305300: when placing the mux at the later location, make sure that it won't use any register that was killed between the two original instructions. Remove any such kills and transfer them to the mux. llvm-svn: 305553	2017-06-16 12:24:03 +00:00
Krzysztof Parzyszek	b3a8d20e27	[Hexagon] Generate store-immediate instructions for stack objects Store-immediate instructions have a non-extendable offset. Since the actual offset for a stack object is not known until much later, only generate these stores when the stack size (at the time of instruction selection) is small. llvm-svn: 305305	2017-06-13 17:10:16 +00:00
Krzysztof Parzyszek	c83c267b84	[Hexagon] Generate multiply-high instruction in isel llvm-svn: 305302	2017-06-13 16:21:57 +00:00
Krzysztof Parzyszek	de2ac17b7b	[Hexagon] Don't kill live registers when creating mux out of tfr When a mux instruction is created from a pair of complementary conditional transfers, it can be placed at the location of either the earlier or the later of the transfers. Since it will use the operands of the original transfers, putting it in the earlier location may hoist a kill of a source register that was originally further down. Make sure the kill flag is removed if the register is still used afterwards. llvm-svn: 305300	2017-06-13 16:07:36 +00:00
Krzysztof Parzyszek	9bd4d91037	[Hexagon] Stop pmpy recognition when shift conversion fails The conversion of shifts from right shifts to left shifts may fail. In such case, the pmpy recognition cannot proceed. llvm-svn: 305289	2017-06-13 13:51:49 +00:00
Krzysztof Parzyszek	8a7fb0fe51	[Hexagon] Skip mux generation when predicate register is undefined llvm-svn: 305014	2017-06-08 20:56:36 +00:00
Krzysztof Parzyszek	5ba13825f0	[Hexagon] Generate 'inbounds' GEPs in HexagonCommonGEP llvm-svn: 304937	2017-06-07 20:04:33 +00:00
Krzysztof Parzyszek	066e8b56a0	[Hexagon] Return 0 from getDotNewPredOp when .new opcode does not exist This allows using this function to test if an instruction can be converted to a .new form. llvm-svn: 304549	2017-06-02 14:07:06 +00:00
Krzysztof Parzyszek	3cf16576d5	[Hexagon] Fix dependence check in the packetizer An incorrect check in the packetizer led to an attempt to convert an unconditional branch to a .new (conditional) form. llvm-svn: 304442	2017-06-01 18:02:40 +00:00
Krzysztof Parzyszek	51fd5405d5	[Hexagon] Handle long-running simplification loop in idiom recognition The initial assumption was that the simplification would converge to a fixed point relatively quickly. Turns out that there are legitimate situations where the complexity of the code causes it to take a large number of iterations. Two main changes: - Instead of aborting upon hitting the limit, simply return nullptr. - Reduce the limit to 10,000 from 100,000. llvm-svn: 304441	2017-06-01 18:00:47 +00:00
Krzysztof Parzyszek	ef58017b35	[Hexagon] Improve code generation for 32x32-bit multiplication For multiplications of 64-bit values (giving 64-bit result), detect cases where the arguments are sign-extended 32-bit values, on a per-operand basis. This will allow few patterns to match a wider variety of combinations in which extensions can occur. llvm-svn: 304223	2017-05-30 17:47:51 +00:00
Matthias Braun	868bbd4022	ScheduleDAGInstrs: Fix fixupKills() Rewrite fixupKills() to use the LivePhysRegs class. Simplifies the code and fixes a bug where the CSR registers in return blocks were missed, leading to invalid kill flags. Also remove the unnecessary rule that we wouldn't set kill flags on tied operands. No tests as I have an upcoming commit improving MachineVerifier checks to catch these cases in multiple existing lit tests. llvm-svn: 304055	2017-05-27 02:50:50 +00:00
Serge Pavlov	d526b13e61	Add extra operand to CALLSEQ_START to keep frame part set up previously Using arguments with attribute inalloca creates problems for verification of machine representation. This attribute instructs the backend that the argument is prepared in stack prior to CALLSEQ_START..CALLSEQ_END sequence (see http://llvm.org/docs/InAlloca.htm for details). Frame size stored in CALLSEQ_START in this case does not count the size of this argument. However CALLSEQ_END still keeps total frame size, as caller can be responsible for cleanup of entire frame. So CALLSEQ_START and CALLSEQ_END keep different frame size and the difference is treated by MachineVerifier as stack error. Currently there is no way to distinguish this case from actual errors. This patch adds additional argument to CALLSEQ_START and its target-specific counterparts to keep size of stack that is set up prior to the call frame sequence. This argument allows MachineVerifier to calculate actual frame size associated with frame setup instruction and correctly process the case of inalloca arguments. The changes made by the patch are: - Frame setup instructions get the second mandatory argument. It affects all targets that use frame pseudo instructions and touched many files although the changes are uniform. - Access to frame properties are implemented using special instructions rather than calls getOperand(N).getImm(). For X86 and ARM such replacement was made previously. - Changes that reflect appearance of additional argument of frame setup instruction. These involve proper instruction initialization and methods that access instruction arguments. - MachineVerifier retrieves frame size using method, which reports sum of frame parts initialized inside frame instruction pair and outside it. The patch implements approach proposed by Quentin Colombet in https://bugs.llvm.org/show_bug.cgi?id=27481#c1. It fixes 9 tests failed with machine verifier enabled and listed in PR27481. 
Differential Revision: https://reviews.llvm.org/D32394 llvm-svn: 302527	2017-05-09 13:35:13 +00:00
Krzysztof Parzyszek	d0c71ef8ab	[RDF] Remove covered parts of reached uses for phi and use in same block llvm-svn: 302305	2017-05-05 22:10:32 +00:00
Krzysztof Parzyszek	31d4b3b247	Remove stale live-ins in the branch folder Hoisting common code can cause registers that are live-in in the successor blocks to no longer be live-in. The live-in information needs to be updated to reflect this, or otherwise incorrect code can be generated later on. Differential Revision: https://reviews.llvm.org/D32661 llvm-svn: 302228	2017-05-05 12:20:07 +00:00
Krzysztof Parzyszek	2af5037d34	[Hexagon] Use automatically-generated scheduling information for HVX Patch by Jyotsna Verma. llvm-svn: 302073	2017-05-03 20:10:36 +00:00
Krzysztof Parzyszek	4763c2d999	[Hexagon] Adjust latency between allocframe and the first store on stack Allocframe and the following stores on the stack have a latency of 2 cycles when not in the same packet. This happens because R29 is needed early by the store instruction. Since one of such stores can be packetized along with allocframe and use old value of R29, we can assign it 0 cycle latency while leaving latency of other stores to the default value of 2 cycles. Patch by Jyotsna Verma. llvm-svn: 302034	2017-05-03 15:33:09 +00:00
Krzysztof Parzyszek	a750383d0f	[Hexagon] Add extenders for GD_PLT_B22_PCREL and LD_PLT_B22_PCREL Patch by Sid Manning. llvm-svn: 301955	2017-05-02 18:15:33 +00:00
Krzysztof Parzyszek	9aaf923376	[Hexagon] Don't ignore multi-cycle latency information The compiler was generating code that ends up ignoring a multiple latency dependence between two instructions by scheduling the instructions in back-to-back packets. The packetizer needs to end a packet if the latency of the current instruction and the source in the previous packet is greater than 1 cycle. This case occurs when there is still room in the current packet, but scheduling the instruction causes a stall. Instead, the packetizer should start a new packet. Also, if the current packet already contains a stall, then it is okay to add another instruction to the packet that also causes a stall. This occurs when there are no instructions that can be scheduled in between the producer and consumer instructions. This patch changes the latency for loads to 2 cycles from 3 cycles. This change reflects that a load only needs to be separated by one extra packet to eliminate the stall. Patch by Ikhlas Ajbar. llvm-svn: 301954	2017-05-02 18:12:19 +00:00
Krzysztof Parzyszek	072ddb383c	[RDF] Correctly calculate lane masks for defs llvm-svn: 301700	2017-04-28 21:57:53 +00:00
Krzysztof Parzyszek	2065a2f4e6	Properly handle PHIs with subregisters in UnreachableBlockElim When a PHI operand has a subregister, create a COPY instead of simply replacing the PHI output with the input. Differential Revision: https://reviews.llvm.org/D32650 llvm-svn: 301699	2017-04-28 21:56:33 +00:00


477 Commits