llvm-project

Commit Graph

Author	SHA1	Message	Date
Richard Sandiford	e3827751e2	[SystemZ] Fix handling of 64-bit memcmp results Generalize r188163 to cope with return types other than MVT::i32, just as the existing visitMemCmpCall code did. I've split this out into a subroutine so that it can be used for other upcoming patches. I also noticed that I'd used the wrong API to record the out chain. It's a load that uses DAG.getRoot() rather than getRoot(), so the out chain should go on PendingLoads. I don't have a testcase for that because we don't do any interesting scheduling on z yet. llvm-svn: 188540	2013-08-16 10:55:47 +00:00
Richard Sandiford	a59012577c	[SystemZ] Fix sign of integer memcmp result r188163 used CLC to implement memcmp. Code that compares the result directly against zero can test the CC value produced by CLC, but code that needs an integer result must use IPM. The sequence I'd used was: ipm <reg> sll <reg>, 2 sra <reg>, 30 but I'd forgotten that this inverts the order, so that CC==1 ("less") becomes an integer greater than zero, and CC==2 ("greater") becomes an integer less than zero. This sequence should only be used if the CLC arguments are reversed to compensate. The problem then is that the branch condition must also be reversed when testing the CLC result directly. Rather than do that, I went for a different sequence that works with the natural CLC order: ipm <reg> srl <reg>, 28 rll <reg>, <reg>, 31 One advantage of this is that it doesn't clobber CC. A disadvantage is that any sign extension to 64 bits must be done separately, rather than being folded into the shifts. llvm-svn: 188538	2013-08-16 10:22:54 +00:00
Vladimir Medic	2df9ee6ec8	This patch implements wait instruction for mips. Examples are added in test files. llvm-svn: 188537	2013-08-16 10:17:03 +00:00
Craig Topper	8c929627d9	Don't use v16i32 for load pattern matching. All 512-bit loads are cated to v8i64. llvm-svn: 188534	2013-08-16 06:07:34 +00:00
Tom Stellard	dba25713a6	Revert "R600/SI: Fix incorrect encoding of DS_WRITE_B32 instructions" This reverts commit a6a39ced095c2f453624ce62c4aead25db41a18f. This is the wrong version of this fix. llvm-svn: 188523	2013-08-16 01:18:43 +00:00
Tom Stellard	82bef57f20	R600/SI: Fix incorrect encoding of DS_WRITE_B32 instructions The SIInsertWaits pass was overwriting the first operand (gds bit) of DS_WRITE_B32 with the second operand (value to write). This meant that any time the value to write was stored in an odd number VGPR, the gds bit would be set causing the instruction to write to GDS instead of LDS. llvm-svn: 188522	2013-08-16 01:12:20 +00:00
Tom Stellard	b03edeca67	R600: Add support for global vector loads with element types less than 32-bits Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188521	2013-08-16 01:12:16 +00:00
Tom Stellard	fbab827e2a	R600: Add support for global vector stores with elements less than 32-bits Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188520	2013-08-16 01:12:11 +00:00
Tom Stellard	d3ee8c103a	R600: Add support for i16 and i8 global stores Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188519	2013-08-16 01:12:06 +00:00
Tom Stellard	6d1379e180	R600: Add support for v4i32 stores on Cayman Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188518	2013-08-16 01:12:00 +00:00
Tom Stellard	16da74c205	R600: Enable folding of inline literals into REQ_SEQUENCE instructions Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188517	2013-08-16 01:11:55 +00:00
Tom Stellard	676c16d088	R600: Add IsExport bit to TableGen instruction definitions Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188516	2013-08-16 01:11:51 +00:00
Tom Stellard	ac00f9df79	R600: Change the RAT instruction assembly names so they match the docs Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188515	2013-08-16 01:11:46 +00:00
Matt Arsenault	5cae894a13	Fix spelling llvm-svn: 188506	2013-08-15 23:11:03 +00:00
Renato Golin	ca570633c5	make arm-use-movt available for all ARM Before this patch this flag is IOS specific, but is also useful for bare project like bootloaders / kernels etc, since movw / movt prevents simple relocation. Therefore make this flag more commonly available. note: this patch depends on a similiar rename in clang Patch by Jeroen Hofstee. llvm-svn: 188487	2013-08-15 20:54:38 +00:00
Renato Golin	0a41d9ae7f	make arm-reserve-r9 available for all ARM r9 is defined as a platform-specific register in the ARM EABI. It can be reserved for a special purpose or be used as a general purpose register. Add support for reserving r9 for all ARM, while leaving the IOS usage unchanged. Patch by Jeroen Hofstee. llvm-svn: 188485	2013-08-15 20:45:13 +00:00
Bill Wendling	2851907cdb	Constify the function parameters. llvm-svn: 188469	2013-08-15 18:46:14 +00:00
Mihai Popa	d79f00ba68	This fixes three issues related to Thumb literal loads: 1. The offset range for Thumb1 PC relative loads is [0..1020] and not [-1024..1020] 2. Thumb2 PC relative loads may define the PC, so the restriction placed on target register is removed 3. Removes unneeded alias between "ldr.n" and t1LDRpci. ".n" is actually stripped by both tablegen and the ASM parser, so this alias rule really does nothing llvm-svn: 188466	2013-08-15 15:43:06 +00:00
Jack Carter	d12e837f05	[Mips][msa] Added the simple builtins (madd_q to xori) Includes: madd_q, maddr_q, maddv, max_[asu], maxi_[su], min_[asu], mini_[su], mod_[su], msub_q, msubr_q, msubv, mul_q, mulr_q, mulv, nloc, nlzc, nori, ori, pckev, pckod, pcnt, sat_[su], shf, sld, sldi, sll, slli, splat, splati, sr[al], sr[al]i, subs_[su], subss_u, subus_s, subv, subvi, vshf, xori Patch by Daniel Sanders llvm-svn: 188460	2013-08-15 14:22:07 +00:00
Jack Carter	b95ee69163	[Mips][msa] Added the simple builtins (fadd to ftq) Includes: fadd, fceq, fcg[et], fclass, fcl[et], fcne, fcun, fdiv, fexdo, fexp2, fexup[lr], ffint_[su], ffql, ffqr, fill, flog2, fmadd, fmax, fmax_a, fmin, fmin_a, fmsub, fmul, frint, frcp, frsqrt, fseq, fsge, fsgt, fsle, fslt, fsne, fsqr, fsub, ftint_s, ftq Patch by Daniel Sanders llvm-svn: 188458	2013-08-15 13:45:36 +00:00
Jack Carter	babdcc8c2c	[Mips][msa] Added the simple builtins (add_a to dpsub[su], ilvev to ldi) Includes: add_a, adds_[asu], addv, addvi, andi.b, asub_[su].[bhwd], aver?_[su]_[bhwd], bclr, bclri, bins[lr], bins[lr]i, bmnzi, bmzi, bneg, bnegi, bseli, bset, bseti, c(eq\|ne), c(eq\|ne)i, cl[et]_[su], cl[et]i_[su], copy_[su].[bhw], div_[su], dotp_[su], dpadd_[su], dpsub_[su], ilvev, ilvl, ilvod, ilvr, insv, insve, ldi Patch by Daniel Sanders llvm-svn: 188457	2013-08-15 12:24:57 +00:00
Craig Topper	8dbc7e9d35	Revert r188449 as it turns out we're just missing the instructions that need the v16i32/v16f32 matching. llvm-svn: 188454	2013-08-15 08:38:25 +00:00
Hao Liu	cd8b02dce3	Clang and AArch64 backend patches to support shll/shl and vmovl instructions and ACLE functions llvm-svn: 188451	2013-08-15 08:26:11 +00:00
Craig Topper	2ffd06528d	Don't let isPermImmMask handle v16i32 since VPERMI doesn't match on that type. Remove 128-bit vector handling from isPermImmMask too, it's covered by isPSHUFDMask. llvm-svn: 188449	2013-08-15 07:30:51 +00:00
Alexey Samsonov	3186eb3efd	Tentative fix for global-buffer-overflow caused by r188426. Found by AddressSanitizer llvm-svn: 188448	2013-08-15 07:11:34 +00:00
Craig Topper	83e042a21b	Use MVT instead of EVT in X86ISelDAGToDAG since all the types should be legal. llvm-svn: 188446	2013-08-15 05:57:07 +00:00
Craig Topper	6f4dd2dacf	Use MVT in place of EVT in more X86 operation lowering functions. llvm-svn: 188445	2013-08-15 05:33:45 +00:00
Craig Topper	d9c2783d8f	Replace getValueType().getSimpleVT() with getSimpleValueType(). llvm-svn: 188442	2013-08-15 02:44:19 +00:00
Craig Topper	5671010cbb	Replace getValueType().getSimpleVT() with getSimpleValueType(). Also remove one weird cast from MVT->EVT just to call getSimpleVT(). llvm-svn: 188441	2013-08-15 02:33:50 +00:00
Tom Stellard	d86003e31f	R600/SI: Improve legalization of vector operations This should fix hangs in the OpenCL piglit tests. llvm-svn: 188431	2013-08-14 23:25:00 +00:00
Tom Stellard	6785065ace	R600/SI: Replace v1i32 type with i32 in imageload and sample intrinsics llvm-svn: 188430	2013-08-14 23:24:53 +00:00
Tom Stellard	9fa1791a1b	R600/SI: Convert v16i8 resource descriptors to i128 Now that compute support is better on SI, we can't continue using v16i8 for descriptors since this is also a legal type in OpenCL. This patch fixes numerous hangs with the piglit OpenCL test and since we now use a target specific DAG node for LOAD_CONSTANT with the correct MemOperandFlags, this should also fix: https://bugs.freedesktop.org/show_bug.cgi?id=66805 llvm-svn: 188429	2013-08-14 23:24:45 +00:00
Tom Stellard	8e5da41374	R600/SI: Lower BUILD_VECTOR to REG_SEQUENCE v2 Using REG_SEQUENCE for BUILD_VECTOR rather than a series of INSERT_SUBREG instructions should make it easier for the register allocator to coalasce unnecessary copies. v2: - Use an SGPR register class if all the operands of BUILD_VECTOR are SGPRs. llvm-svn: 188427	2013-08-14 23:24:32 +00:00
Tom Stellard	df94dc3917	R600/SI: Choose the correct MOV instruction for copying immediates The instruction selector will now try to infer the destination register so it can decided whether to use V_MOV_B32 or S_MOV_B32 when copying immediates. llvm-svn: 188426	2013-08-14 23:24:24 +00:00
Tom Stellard	16a9a205c8	R600/SI: Assign a register class to the $vaddr operand for MIMG instructions The previous code declared the operand as unknown:$vaddr, which made it possible for scalar registers to be used instead of vector registers. llvm-svn: 188425	2013-08-14 23:24:17 +00:00
Tom Stellard	3494b7ee42	R600/SI: Handle MSAA texture targets Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188421	2013-08-14 22:22:14 +00:00
Tom Stellard	20ee94f152	R600/SI: Allow conversion between v32i8 and v8i32 Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188420	2013-08-14 22:22:09 +00:00
Tom Stellard	a36f077159	R600/SI: Fix an obvious typo Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188419	2013-08-14 22:22:03 +00:00
Tom Stellard	73c31d541e	R600/SI: Add pattern for fp_to_uint This fixes the F2U opcode for the Mesa driver. Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188418	2013-08-14 22:21:57 +00:00
Hal Finkel	b3ca00d2a3	Actually fix PPC64 64-bit GPR inline asm constraint matching This is a follow-up to r187693, correcting that code to request the correct register class. The previous version, with the wrong register class, was not really correcting the constraints, but rather was removing them. Coincidentally, this fixed the failing test case in r187693, but obviously created other problems. llvm-svn: 188407	2013-08-14 20:05:04 +00:00
Renato Golin	b184cd99ba	Let t2LDRBi8 and t2LDRBi12 have same Base Pointer When determining if two different loads are from the same base address, this patch allows one load to use a t2LDRi8 address mode and another to use a t2LDRi12 address mode. The current implementation is very conservative and this allows the case of differing Thumb2 byte loads to be considered. Allowing these differing modes instead of forcing the exact same opcode is useful for situations where one opcodes loads from a base address+1 and a second opcode loads for a base address-1. Patch by Daniel Stewart. llvm-svn: 188385	2013-08-14 16:35:29 +00:00
Craig Topper	d03748cf5e	Make more helper methods into static functions. llvm-svn: 188366	2013-08-14 07:53:41 +00:00
Craig Topper	7b7b159574	Remove tab characters. llvm-svn: 188365	2013-08-14 07:35:18 +00:00
Craig Topper	d905fded68	Make some helper methods static. llvm-svn: 188364	2013-08-14 07:34:43 +00:00
Craig Topper	60769e050d	Use MVT in more lowering code. llvm-svn: 188363	2013-08-14 07:04:42 +00:00
Craig Topper	52b00359b1	Replace EVT with MVT in isVectorShift. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 188362	2013-08-14 06:21:10 +00:00
Craig Topper	67476d7485	Replace EVT with MVT in many of the shuffle lowering functions. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 188361	2013-08-14 05:58:39 +00:00
Akira Hatanaka	274d24c8bc	[mips] Fix bug in parsing accumulator registers. llvm-svn: 188344	2013-08-14 01:15:52 +00:00
Akira Hatanaka	feb7ee84c5	[mips] Use register operands instead of register classes in DSP instruction definitions. llvm-svn: 188343	2013-08-14 01:02:20 +00:00
Akira Hatanaka	654655f1c5	[mips] Rename DSPRegs. llvm-svn: 188342	2013-08-14 00:53:38 +00:00
Akira Hatanaka	8002a3f6d8	[mips] Rename HIRegs and LORegs. llvm-svn: 188341	2013-08-14 00:47:08 +00:00
Akira Hatanaka	7473b4705a	[mips] Properly parse registers that appear in inline-asm constraints. llvm-svn: 188336	2013-08-14 00:21:25 +00:00
Jack Carter	3a2c2d42b8	[Mips][msa] Added initial MSA support. * msa SubtargetFeature * registers * ld.[bhwd], and st.[bhwd] instructions Does not correctly prohibit use of both 32-bit FPU registers and MSA together. Patch by Daniel Sanders llvm-svn: 188313	2013-08-13 20:54:07 +00:00
Jack Carter	9770097727	[Mips] Support for unaligned load/store microMips instructions This includes instructions lwl, lwr, swl and swr. Patch by Zoran Jovnovic llvm-svn: 188312	2013-08-13 20:19:16 +00:00
Michael Gottesman	7a8017290a	Update makeLibCall to return both the call and the chain associated with the libcall instead of just the call. This allows us to specify libcalls that return void. LowerCallTo returns a pair with the return value of the call as the first element and the chain associated with the return value as the second element. If we lower a call that has a void return value, LowerCallTo returns an SDValue with a NULL SDNode and the chain for the call. Thus makeLibCall by just returning the first value makes it impossible for you to set up the chain so that the call is not eliminated as dead code. I also updated all references to makeLibCall to reflect the new return type. llvm-svn: 188300	2013-08-13 17:54:56 +00:00
Joey Gouly	9960e764aa	ARMv8: SWP and SWPB are obsoleted on ARMv8. llvm-svn: 188288	2013-08-13 16:40:47 +00:00
Evgeniy Stepanov	7dee697faa	Fix compiler warnings. ../lib/Target/X86/X86ISelLowering.cpp:9715:7: error: unused variable 'OpVT' [-Werror,-Wunused-variable] EVT OpVT = Op0.getValueType(); ^ ../lib/Target/X86/X86ISelLowering.cpp:9763:14: error: unused variable 'NumElems' [-Werror,-Wunused-variable] unsigned NumElems = VT.getVectorNumElements(); llvm-svn: 188269	2013-08-13 14:04:20 +00:00
Mihai Popa	0e1012f0f4	Fix signed overflow in when computing encodings for ADR instructions llvm-svn: 188268	2013-08-13 14:02:13 +00:00
Elena Demikhovsky	60b1f289f2	AVX-512: Added CMP and BLEND instructions. Lowering for SETCC. llvm-svn: 188265	2013-08-13 13:24:07 +00:00
Vladimir Medic	27c87ea6bb	This patch introduces changes to MipsAsmParser register parsing routines. The code now follows more deterministic path and makes the code more efficient and easier to maintain. llvm-svn: 188264	2013-08-13 13:07:09 +00:00
Kevin Enderby	b03f3fe4e8	Fix a crash with X86 Mach-O and a subtraction expression where both symbols are undefined and produce an error message instead as this is a non-relocatable expression with X86 Mach-O. rdar://8920876 llvm-svn: 188218	2013-08-12 22:45:44 +00:00
Tom Stellard	fc455471c3	R600: Set scheduling preference to Sched::Source R600 doesn't need to do any scheduling on the SelectionDAG now that it has a very good MachineScheduler. Also, using the VLIW SelectionDAG scheduler was having a major impact on compile times. For example with the phatk kernel here are the LLVM IR to machine code compile times: With Sched::VLIW Total Compile Time: 1.4890 Seconds (User + System) SelectionDAG Instruction Scheduling: 1.1670 Seconds (User + System) With Sched::Source Total Compile Time: 0.3330 Seconds (User + System) SelectionDAG Instruction Scheduling: 0.0070 Seconds (User + System) The code ouput was identical with both schedulers. This may not be true for all programs, but it gives me confidence that there won't be much reduction, if any, in code quality by using Sched::Source. llvm-svn: 188215	2013-08-12 22:33:21 +00:00
Vladimir Medic	939877ee14	This patch implements ei and di instructions for mips. Test cases are added. llvm-svn: 188176	2013-08-12 13:07:23 +00:00
Richard Sandiford	564681c88d	[SystemZ] Use CLC and IPM to implement memcmp For now this is restricted to fixed-length comparisons with a length in the range [1, 256], as for memcpy() and MVC. llvm-svn: 188163	2013-08-12 10:28:10 +00:00
Richard Sandiford	761703a248	[SystemZ] Add a definition of the CLC instruction llvm-svn: 188162	2013-08-12 10:17:33 +00:00
Richard Sandiford	87326c73c6	[SystemZ] Add a definition of the IPM instruction llvm-svn: 188161	2013-08-12 10:05:58 +00:00
Benjamin Kramer	c9b7d47b21	Remove global construction. const char* is sufficient here. No functionality change. llvm-svn: 188158	2013-08-12 09:37:29 +00:00
Reed Kotler	d265e88827	Don't generate floating point stubs for mips16 code if the function is actually an instrinsic that will not occur in libc. This list here is not exhaustive but fixes the one places in test-suite where this occurs. I have filed a bug against myself to research the full list and add them to the array of such cases. In the future, actual stub generation will occur in a later phase and we won't need this code because we will know at that time during the compilation that in fact no helper function was even needed. llvm-svn: 188149	2013-08-11 21:30:27 +00:00
Elena Demikhovsky	5fed3b95db	AVX-512: Added more tests for BROADCAST llvm-svn: 188148	2013-08-11 12:29:16 +00:00
Elena Demikhovsky	cf5b1458e6	AVX-512: Added VPERM* instructons and MOV* zmm-to-zmm instructions. Added a test for shuffles using VPERM. llvm-svn: 188147	2013-08-11 07:55:09 +00:00
Reed Kotler	705c5951ca	Incorrect JAL instruction attributes caused the optimizer to make a wrong instruction move. Just affects static relocation. -static works fine now with mips16 for the most part. llvm-svn: 188143	2013-08-10 22:18:22 +00:00
Venkatraman Govindaraju	b50bf5a0e3	[Sparc] Enable xword directive in sparcv9. llvm-svn: 188141	2013-08-10 20:13:20 +00:00
Niels Ole Salscheider	d3a039fed2	R600/SI: FMA is faster than fmul and fadd for f64 llvm-svn: 188136	2013-08-10 10:38:54 +00:00
Niels Ole Salscheider	6509ac65a9	R600/SI: Add FMA pattern llvm-svn: 188135	2013-08-10 10:38:47 +00:00
Reed Kotler	be316cffa7	Add another intrinsic that LLVM gives an incorrect prototype to. I need to go through all the runtime routine list and see if there are any more I need to add for mips16 floating point. Prototypes must be correct or else I don't know to add a helper function call. llvm-svn: 188106	2013-08-09 21:33:41 +00:00
Benjamin Kramer	21585fd9c1	Add a overload to CostTable which allows it to infer the size of the table. Use it to avoid repeating ourselves too often. Also store MVT::SimpleValueType in the TTI tables so they can be statically initialized, MVT's constructors create bloated initialization code otherwise. llvm-svn: 188095	2013-08-09 19:33:32 +00:00
Mihai Popa	4c2801f7fd	This fixes the Thumb2 CPS assembly syntax. In Thumb1, only one variant is supported: CPS{effect} {flags} Thumb2 supports three: CPS{effect}.W {flags} CPS{effect} {flags} {mode} CPS {mode} Canonically, .W should be used only when ambiguity is present between encodings of different width. The wide suffix is still accepted for the latter two forms via aliases. llvm-svn: 188071	2013-08-09 13:52:32 +00:00
Mihai Popa	ad18d3ce53	Fix assembling of Thumb2 branch instructions. The long encoding for Thumb2 unconditional branches is broken. Additionally, there is no range checking for target operands; as such for instructions originating in assembly code, only short Thumb encodings are generated, regardless of the bitsize needed for the offset. Adding range checking is non trivial due to the representation of Thumb branch instructions. There is no true difference between conditional and unconditional branches in terms of operands and syntax - even unconditional branches have a predicate which is expected to match that of the IT block they are in. Yet, the encodings and the permitted size of the offset differ. Due to this, for any mnemonic there are really 4 encodings to choose for. The problem cannot be handled in the parser alone or by manipulating td files. Because the parser builds first a set of match candidates and then checks them one by one, whatever tablegen-only solution might be found will ultimately be dependent of the parser's evaluation order. What's worse is that due to the fact that all branches have the same syntax and the same kinds of operands, that order is governed by the lexicographical ordering of the names of operand classes... To circumvent all this, any necessary disambiguation is added to the instruction validation pass. llvm-svn: 188067	2013-08-09 10:38:32 +00:00
Richard Sandiford	9140910be8	[SystemZ] Update README llvm-svn: 188062	2013-08-09 09:25:57 +00:00
Jack Carter	7bd3c7d1fc	Mips ELF: MicroMips direct object Little endian support. Test included. Patch by Zoran Jovanovich llvm-svn: 188024	2013-08-08 23:30:40 +00:00
Michael J. Spencer	126973ba93	[Object] Split the ELF interface into 3 parts. * ELFTypes.h contains template magic for defining types based on endianess, size, and alignment. * ELFFile.h defines the ELFFile class which provides low level ELF specific access. * ELFObjectFile.h contains ELFObjectFile which uses ELFFile to implement the ObjectFile interface. llvm-svn: 188022	2013-08-08 22:27:13 +00:00
Akira Hatanaka	00fcf2e169	[mips] Rename accumulator register classes and FP register operands. llvm-svn: 188020	2013-08-08 21:54:26 +00:00
Akira Hatanaka	6bf3c03861	[mips] Mark pseudo instructions as code-gen only. llvm-svn: 188017	2013-08-08 21:44:39 +00:00
Akira Hatanaka	85ccf23d7d	[mips] Delete register class HWRegs64. No functionality change. llvm-svn: 188016	2013-08-08 21:37:32 +00:00
David Fang	2f1b0b55b8	cast fix to appease buildbot llvm-svn: 188014	2013-08-08 21:29:30 +00:00
David Fang	b88cdf62f5	initial draft of PPCMachObjectWriter.cpp this records relocation entries in the mach-o object file for PIC code generation. tested on powerpc-darwin8, validated against darwin otool -rvV llvm-svn: 188004	2013-08-08 20:14:40 +00:00
Niels Ole Salscheider	719fbc9ae7	R600/SI: Implement fp32<->fp64 conversions llvm-svn: 187988	2013-08-08 16:06:15 +00:00
Niels Ole Salscheider	4715d886f8	R600/SI: Implement sint<->fp64 conversions llvm-svn: 187987	2013-08-08 16:06:08 +00:00
Jakub Staszak	9c34922ff2	Use pop_back() instead of pop_back_val() when the returned value is not used. llvm-svn: 187986	2013-08-08 15:48:46 +00:00
Silviu Baranga	82656be84d	Remove the now redundant FeatureFP16 from the Cortex-A15 feature list. It was made redundant when FeatureVFP4 was added which implies FP16. llvm-svn: 187985	2013-08-08 15:47:33 +00:00
Jakub Staszak	b5ab81d5d0	Fix the comment. llvm-svn: 187984	2013-08-08 15:19:25 +00:00
Mihai Popa	b8d900e304	The name "tCDP" isn't used anywhere else in the source code, so renaming it for consistency doesn't cause any problems. This is the only Thumb2 instruction defined with "t" prefix; all other Thumb2 instructions have "t2" prefix (e.g. "t2CDP2" which is defined immediately afterwards). Patch by Artyom Skrobov. llvm-svn: 187973	2013-08-08 10:20:41 +00:00
Hal Finkel	2b7b2f373b	PPC: Map frin to round() not nearbyint() and rint() Making use of the recently-added ISD::FROUND, which allows for custom lowering of round(), the PPC backend will now map frin to round(). Previously, we had been using frin to lower nearbyint() (and rint() via some custom lowering to handle the extra fenv flags requirements), but only in fast-math mode because frin does not tie-to-even. Several users had complained about this behavior, and this new mapping of frin to round is certainly more appropriate (and does not require fast-math mode). In effect, this reverts r178362 (and part of r178337, replacing the nearbyint mapping with the round mapping). llvm-svn: 187960	2013-08-08 04:31:34 +00:00
Hal Finkel	171817ee8a	Add ISD::FROUND for libm round() All libm floating-point rounding functions, except for round(), had their own ISD nodes. Recent PowerPC cores have an instruction for round(), and so here I'm adding ISD::FROUND so that round() can be custom lowered as well. For the most part, this is straightforward. I've added an intrinsic and a matching ISD node just like those for nearbyint() and friends. The SelectionDAG pattern I've named frnd (because ISD::FP_ROUND has already claimed fround). This will be used by the PowerPC backend in a follow-up commit. llvm-svn: 187926	2013-08-07 22:49:12 +00:00
Elena Demikhovsky	45c54ad8dc	AVX-512 set: Added BROADCAST instructions with lowering logic and a test. llvm-svn: 187884	2013-08-07 12:34:55 +00:00
Richard Sandiford	0897fce2f4	[SystemZ] Optimize floating-point comparisons with zero This follows the same lines as the integer code. In the end it seemed easier to have a second 4-bit mask in TSFlags to specify the compare-like CC values. That eats one more TSFlags bit than adding a CCHasUnordered would have done, but it feels more concise. llvm-svn: 187883	2013-08-07 11:10:06 +00:00
Richard Sandiford	9f11bc1956	[SystemZ] Add floating-point load-and-test instructions These instructions can also be used as comparisons with zero. llvm-svn: 187882	2013-08-07 11:03:34 +00:00
Craig Topper	c5b0ad27ab	Simplify code. No functional change intended. llvm-svn: 187870	2013-08-07 08:16:07 +00:00
Evgeniy Stepanov	bc8808ce4a	Initialize SIInsertWaits::ExpInstrTypesSeen in the pass constructor. This value may be used uninitialized in SIInsertWaits::insertWait. Found with MemorySanitizer. llvm-svn: 187869	2013-08-07 07:47:41 +00:00
Reed Kotler	bb870e20e2	Create a pattern for the "trap" instruction. llvm-svn: 187863	2013-08-07 04:00:26 +00:00

1 2 3 4 5 ...

25282 Commits