llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Sanders	3d3ea53f32	[mips][mips64r6] bc1[tf] are not available on MIPS32r6/MIPS64r6 Summary: Also tightened up the acceptable condition operand for these instructions on MIPS-I to MIPS-III. Support for $fcc[1-7] was added in MIPS-IV. Prior to that only $fcc0 is acceptable. We currently don't optimize (BEQZ (NOT $a), $target) and similar. It's probably best to do this in InstCombine. Depends on D4111 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4112 llvm-svn: 210787	2014-06-12 15:00:17 +00:00
Daniel Sanders	39a1ca75ba	[mips][mips64r6] bc2[ft] are not available on MIPS32r6/MIPS64r6 Summary: These instructions are not implemented for any MIPS ISA so we only need testcases. Depends on D4110 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4111 llvm-svn: 210786	2014-06-12 14:54:13 +00:00
Daniel Sanders	fd61fd3b6f	[mips][mips64r6] [sl][duw]xc1 are not available on MIPS32r6/MIPS64r6 Summary: Folded mips64-fp-indexed-ls.ll into fp-indexed-ls.ll. To do so, the zext's in mips64-fp-indexed-ls.ll were changed to implicit sign extensions (performed by getelementptr). This does not affect the purpose of the test. Depends on D4004 Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4110 llvm-svn: 210784	2014-06-12 14:19:28 +00:00
Dinesh Dwivedi	95f0d51bd3	This removes TODO added in http://reviews.llvm.org/D3658 The patch transforms ABS(NABS(X)) -> ABS(X) NABS(ABS(X)) -> NABS(X) Differential Revision: http://reviews.llvm.org/D4040 llvm-svn: 210782	2014-06-12 14:06:00 +00:00
Daniel Sanders	6c97d979df	[mips][mips64r6] prefx is not available on MIPS32r6/MIPS64r6 Summary: We haven't implemented this instruction so we only add a test case. Reviewers: vmedic, zoran.jovanovic, jkolek Reviewed By: jkolek Differential Revision: http://reviews.llvm.org/D4004 llvm-svn: 210779	2014-06-12 13:51:27 +00:00
Daniel Sanders	0fa6041625	[mips][mips64r6] c.cond.fmt, mov[fntz], and mov[fntz].[ds] are not available on MIPS32r6/MIPS64r6 Summary: c.cond.fmt has been replaced by cmp.cond.fmt. Where c.cond.fmt wrote to dedicated condition registers, cmp.cond.fmt writes 1 or 0 to normal FGR's (like the GPR comparisons). mov[fntz] have been replaced by seleqz and selnez. These instructions conditionally zero a register based on a bool in a GPR. The results can then be or'd together to act as a select without, for example, requiring a third register read port. mov[fntz].[ds] have been replaced with sel.[ds] MIPS64r6 currently generates unnecessary sign-extensions for most selects. This is because the result of a SETCC is currently an i32. Bits 32-63 are undefined in i32 and the behaviour of seleqz/selnez would otherwise depend on undefined bits. Later, we will fix this by making the result of SETCC an i64 on MIPS64 targets. Depends on D3958 Reviewers: jkolek, vmedic, zoran.jovanovic Reviewed By: vmedic, zoran.jovanovic Differential Revision: http://reviews.llvm.org/D4003 llvm-svn: 210777	2014-06-12 13:39:06 +00:00
Daniel Sanders	559dd851b6	[mips][mips64r6] jalx is not available on MIPS32r6/MIPS64r6 Summary: Depends on D3957 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3958 llvm-svn: 210775	2014-06-12 12:58:20 +00:00
Zoran Jovanovic	b9c07f3b86	[mips][mips64r6] Add R_MIPS_PC19_S2 Differential Revision: http://reviews.llvm.org/D3866 llvm-svn: 210773	2014-06-12 12:40:00 +00:00
Daniel Sanders	1f6f0f4b54	[mips] Use MTHC1 when it is available (MIPS32r2 and later) for both FP32 and FP64 Summary: To make this work for both AFGR64 and FGR64 register sets, I've had to make the instruction definition consistent with the white lie (that it reads the lower 32-bits of the register) when they are generated by expandBuildPairF64(). Corrected the definition of hasMips32r2() and hasMips64r2() to include MIPS32r6 and MIPS64r6. Depends on D3956 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3957 llvm-svn: 210771	2014-06-12 11:55:58 +00:00
Zoran Jovanovic	28a0ca0759	[mips][mips64r6] Add bgec and bgeuc instructions Differential Revision: http://reviews.llvm.org/D4017 llvm-svn: 210770	2014-06-12 11:47:44 +00:00
Daniel Sanders	ded02af45e	[mips][mips64r6] madd.[ds], msub.[ds], nmadd.[ds], and nmsub.[ds] are not available on MIPS32r6/MIPS64r6 Summary: This patch updates both the assembler and the code generator. MIPS32r6/MIPS64r6 replaces them with maddf.[ds] and msubf.[ds] which are fused multiply-add/sub operations. We don't emit these yet, this patch only prevents the removed instructions from being emitted. Depends on D3955 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3956 llvm-svn: 210763	2014-06-12 11:04:18 +00:00
Daniel Sanders	826f8b3d0c	[mips][mips64r6] madd/maddu/msub/msubu are not available on MIPS32r6/MIPS64r6 Summary: This patch disables madd/maddu/msub/msubu in both the assembler and code generator. Depends on D3896 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3955 llvm-svn: 210762	2014-06-12 10:54:16 +00:00
Andrea Di Biagio	972ff97f8c	[X86] Teach how to combine AVX and AVX2 horizontal binop on packed 256-bit vectors. This patch adds target combine rules to match: - [AVX] Horizontal add/sub of packed single/double precision floating point values from 256-bit vectors; - [AVX2] Horizontal add/sub of packed integer values from 256-bit vectors. llvm-svn: 210761	2014-06-12 10:53:48 +00:00
Daniel Sanders	308181eaa0	[mips][mips64r6] Replace m[tf]hi, m[tf]lo, mult, multu, dmult, dmultu, div, ddiv, divu, ddivu for MIPS32r6/MIPS64. Summary: The accumulator-based (HI/LO) multiplies and divides from earlier ISA's have been removed and replaced with GPR-based equivalents. For example: div $1, $2 mflo $3 is now: div $3, $1, $2 This patch disables the accumulator-based multiplies and divides for MIPS32r6/MIPS64r6 and uses the GPR-based equivalents instead. Renamed expandPseudoDiv to insertDivByZeroTrap to better describe the behaviour of the function. MipsDelaySlotFiller now invalidates the liveness information when moving instructions to the delay slot. Without this, divrem.ll will abort since %GP ends up used before it is defined. Reviewers: vmedic, zoran.jovanovic, jkolek Reviewed By: jkolek Differential Revision: http://reviews.llvm.org/D3896 llvm-svn: 210760	2014-06-12 10:44:10 +00:00
Matheus Almeida	789ba73570	[mips] Move CHECK lines to the same line as the instruction it's testing for consistency with the other tests. No functional changes. llvm-svn: 210757	2014-06-12 09:50:17 +00:00
Matt Arsenault	2c81994f92	R600/SI: Use a register set to -1 for data0 on ds_inc/ds_dec There is not such thing as a 0-data ds instruction, and the data operand needs to be a vgpr set to something meaningful. llvm-svn: 210756	2014-06-12 08:21:54 +00:00
Juergen Ributzka	b43a559514	[FastISel][x86] Add testcase for r210719. llvm-svn: 210746	2014-06-12 03:54:05 +00:00
Juergen Ributzka	7eac929609	[x86] Improve frameaddress test from r210709. llvm-svn: 210743	2014-06-12 03:29:29 +00:00
Juergen Ributzka	04558dc77a	[FastISel] Add support for the stackmap intrinsic. This implements target-independent FastISel lowering for the stackmap intrinsic. llvm-svn: 210742	2014-06-12 03:29:26 +00:00
Bob Wilson	2f7cc01895	Fix verifier for GlobalAliases to avoid recursing into global initializers. The verifier follows GlobalAlias operands so that it can detect cycles of alias definitions. It was doing this in a way that caused it to also recurse through initializers for the GlobalValue aliasees, and it would fail when an initializer refers to a global that is a declaration and not a definition. This patch causes it to stop recursing when it hits a global definition. <rdar://problem/17277451> llvm-svn: 210734	2014-06-12 01:46:54 +00:00
Eli Bendersky	899bef099f	Teach LoopUnrollPass to respect loop unrolling hints in metadata. See http://reviews.llvm.org/D4090 for more details. The Clang change that produces this metadata was committed in r210667 Patch by Mark Heffernan. llvm-svn: 210721	2014-06-11 23:15:35 +00:00
Juergen Ributzka	272b570a80	[FastISel][X86] Add support for the sqrt intrinsic. llvm-svn: 210720	2014-06-11 23:11:02 +00:00
Juergen Ributzka	4dc958777c	[FastISel][X86] Add support for the frameaddress intrinsic. llvm-svn: 210709	2014-06-11 21:44:44 +00:00
Chad Rosier	2205d4ef05	[AArch64] Basic Sched Model for Cortex-A57. Patch by Dave Estes<cestes@codeaurora.org> Differential Revision: http://reviews.llvm.org/D4008 llvm-svn: 210705	2014-06-11 21:06:56 +00:00
Jim Grosbach	7a930bf9ef	ARM: honor hex immediate formatting for ldr/str i12 offsets. Previously we would always print the offset as decimal, regardless of the formatting requested. Now we use the formatImm() helper so the value is printed as the client (LLDB in the motivating example) requested. Before: ldr.w r8, [sp, #180] @ always After: ldr.w r8, [sp, #0xb4] @ when printing hex immediates ldr.w r8, [sp, #0180] @ when printing decimal immediates rdar://17237103 llvm-svn: 210701	2014-06-11 20:26:45 +00:00
Jim Grosbach	3fdf7cfba0	llvm-mc: Add option for prefering hex format disassembly. Previously there was a separate mode entirely (--hdis vs. --disassemble). It makes a bit more sense for the immediate printing style to be a flag for --disassmeble rather than an entirely different thing. llvm-svn: 210700	2014-06-11 20:26:40 +00:00
Matt Arsenault	2acc7a4570	R600/SI: Fix bitcast between v2i32 and f64 This is the same problem fixed in r210664 for more types. The test passes without this fix. For some reason I'm only hitting this when creating selects lowered to v2i32 selects. llvm-svn: 210692	2014-06-11 19:31:13 +00:00
Rafael Espindola	5c4f829424	Use std::error_code instead of llvm::error_code. The idea of this patch is to turn llvm/Support/system_error.h into a transitional header that just brings in the erorr_code api to the llvm namespace. I will remove it shortly afterwards. The cases where the general idea needed some tweaking: * std::errc is a namespace in msvc, so we cannot use "using std::errc". I could add an #ifdef, but there were not that many uses, so I just added std:: to them in this patch. * Template specialization had to be moved to the std namespace in this patch set already. * The msvc implementation of default_error_condition doesn't seem to provide the same transformations as we need. Not too surprising since the standard doesn't actually say what "equivalent" means. I fixed the problem by keeping our old mapping and using it at error_code construction time. Despite these shortcomings I think this is still a good thing. Some reasons: * The different implementations of system_error might improve over time. * It removes 925 lines of code from llvm already. * It removes 6313 bytes from the text segment of the clang binary when it is built with gcc and 2816 bytes when building with clang and libstdc++. llvm-svn: 210687	2014-06-11 19:05:50 +00:00
Chad Rosier	5ea14e09e9	[Reassociate] FileCheckize and cleanup a few testcases. No functional change intended. llvm-svn: 210685	2014-06-11 18:28:45 +00:00
Matt Arsenault	caa0ec2851	R600/SI: Add common 64-bit LDS atomics llvm-svn: 210680	2014-06-11 18:08:54 +00:00
Matt Arsenault	c793e1d9dc	R600/SI: Add 32-bit LDS atomic cmpxchg llvm-svn: 210678	2014-06-11 18:08:48 +00:00
Matt Arsenault	9e874541ac	R600/SI: Use LDS atomic inc / dec llvm-svn: 210677	2014-06-11 18:08:45 +00:00
Matt Arsenault	0e69e8128c	R600/SI: Add other LDS atomic operations llvm-svn: 210676	2014-06-11 18:08:42 +00:00
Matt Arsenault	7ddcd83d49	R600/SI: Fix backwards names for local atomic instructions. The manual lists them as _RTN_U32, not _U32_RTN, which is more consistent with how every other sized instruction is named. llvm-svn: 210674	2014-06-11 18:08:37 +00:00
Matt Arsenault	725741004c	R600/SI: Refactor local atomics. Use patterns that will also match the immediate offset to match the normal read / writes. llvm-svn: 210673	2014-06-11 18:08:34 +00:00
Matt Arsenault	364a6747aa	R600/SI: Use v_cvt_f32_ubyte* instructions This eliminates extra extract instructions when loading an i8 vector to a float vector. llvm-svn: 210666	2014-06-11 17:50:44 +00:00
Matt Arsenault	064c206d23	R600/SI: Fix selection failure on scalar_to_vector There seem to be only 2 places that produce these, and it's kind of tricky to hit them. Also fixes failure to bitcast between i64 and v2f32, although this for some reason wasn't actually broken in the simple bitcast testcase, but did in the scalar_to_vector one. llvm-svn: 210664	2014-06-11 17:40:32 +00:00
Daniel Sanders	8bb4c858bc	[mips][mips64r6] Improve tests affected by the changes to multiplies and divides Summary: MIPS32r6/MIPS64r6 support has not been added yet. inlineasm-cnstrnt-reg.ll: Explicitly specify the CPU since it will not work on MIPS32r6/MIPS64r6 when -integrated-as is the default. We can't change the mnemonic since the LO register is an implicit def of mtlo and MIPS32r6/MIPS64r6 has no instructions that use LO. 2008-08-01-AsmInline.ll: Explicitly specify the CPU since MIPS32r6/MIPS64r6 will correctly emit different code and this is a regression test. mips64instrs.ll and mips64muldiv.ll Check registers and the way the multiply is used in m1 divrem.ll Check registers and use multiple filecheck prefixes to limit redundancy Reviewers: vmedic, jkolek, zoran.jovanovic, matheusalmeida Reviewed By: matheusalmeida Subscribers: matheusalmeida Differential Revision: http://reviews.llvm.org/D3894 llvm-svn: 210656	2014-06-11 15:48:00 +00:00
Matheus Almeida	595fcab2d0	[mips] Implement jr.hb and jalr.hb (Jump Register and Jump and Link Register with Hazard Barrier). Summary: These instructions are available in ISAs >= mips32/mips64. For mips32r6/mips64r6, jr.hb has a new encoding format. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4019 llvm-svn: 210654	2014-06-11 15:05:56 +00:00
Cameron McInally	5d1b7b94e4	Add AVX512 masked leadz instrinsic support. llvm-svn: 210652	2014-06-11 12:54:45 +00:00
Evgeniy Stepanov	6e8e1abfd5	Improve the test for inlining of __no_debug__ functions. llvm-svn: 210645	2014-06-11 08:46:45 +00:00
Andrea Di Biagio	c7af75f9a7	[X86] Refactor the logic to select horizontal adds/subs to a helper function. This patch moves part of the logic implemented by the target specific combine rules added at r210477 to a separate helper function. This should make easier to add more rules for matching AVX/AVX2 horizontal adds/subs. This patch also fixes a problem caused by a wrong check performed on indices of extract_vector_elt dag nodes in input to the scalar adds/subs. New tests have been added to verify that we correctly check indices of extract_vector_elt dag nodes when selecting a horizontal operation. llvm-svn: 210644	2014-06-11 07:57:50 +00:00
Jiangning Liu	b2ae37fb67	Global merge for global symbols. This commit is to improve global merge pass and support global symbol merge. The global symbol merge is not enabled by default. For aarch64, we need some more back-end fix to make it really benifit ADRP CSE. llvm-svn: 210640	2014-06-11 06:44:53 +00:00
Jiangning Liu	3e5b855a51	Rename global-merge to enable-global-merge. llvm-svn: 210639	2014-06-11 06:35:26 +00:00
Juergen Ributzka	2dace6e54b	[FastISel][X86] Extend support for {s\|u}{add\|sub\|mul}.with.overflow intrinsics. llvm-svn: 210610	2014-06-10 23:52:44 +00:00
Reid Kleckner	52073f74d2	Rearrange the CHECK lines in this test to make failure more obvious. llvm-svn: 210575	2014-06-10 20:16:47 +00:00
Reid Kleckner	b01961c2c1	Revert "Patch by Ray Donnelly to print register names instead of numbers." This reverts commit r206683. The code was confusing SEH register numbers with DWARF register numbers. The test case it was committed with was obviously incorrect. The disassembler was roundtripping '.seh_pushreg %rsi' as '.seh_pushreg %rbp', and other exciting things. Noticed by Vadim Chugunov. llvm-svn: 210574	2014-06-10 20:16:36 +00:00
Matt Arsenault	a73fd935d8	Fix error in tablegen when either operand of !if is an empty list. !if([Something], []) would error with "No type for list". llvm-svn: 210572	2014-06-10 20:10:08 +00:00
Matt Arsenault	6042506b5c	R600: Use BCNT_INT for evergreen llvm-svn: 210569	2014-06-10 19:18:28 +00:00
Matt Arsenault	8333e4378e	R600/SI: Implement i64 ctpop llvm-svn: 210568	2014-06-10 19:18:24 +00:00

1 2 3 4 5 ...

24614 Commits