llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	0b03cbd416	Hexagon: Initialize TBB to 0. Found by valgrind. llvm-svn: 156744	2012-05-13 15:13:22 +00:00
Sirish Pande	8bb9745a5e	Make sure new value jump is enabled for Hexagon V5 as well. llvm-svn: 156700	2012-05-12 05:54:15 +00:00
Sirish Pande	4bd20c50eb	Support for Hexagon feature, New Value Jump. llvm-svn: 156698	2012-05-12 05:10:30 +00:00
Akira Hatanaka	a6c3fd8317	Remove MipsEmitGPRestore.cpp. llvm-svn: 156696	2012-05-12 03:24:03 +00:00
Akira Hatanaka	3ecc5273c1	Delete all functions that are no longer needed in MipsFunctionInfo, including the ones that get or set the frame index for the $gp save slot. Remove the piece of code in MipsFunctionInfo::getGlobalBaseReg() which returns GP. This function should always return a virtual register. llvm-svn: 156695	2012-05-12 03:22:13 +00:00
Akira Hatanaka	2e31e036b6	Stop reserving register $gp. Do not call isGPFI to check whether a frame object is the $gp save slot. llvm-svn: 156694	2012-05-12 03:21:18 +00:00
Akira Hatanaka	0fb87feb39	Do not add the pass which restores $gp after every function call. llvm-svn: 156693	2012-05-12 03:19:51 +00:00
Akira Hatanaka	f542ebd958	Make the following changes in MipsISelLowering.cpp: - Stop creating stack frame objects needed for saving $gp. - Insert a node that copies the global pointer register to register $gp before the call node. This will ensure $gp is valid at the entry of the called function. llvm-svn: 156692	2012-05-12 03:19:04 +00:00
Akira Hatanaka	c980f8453a	Make the following changes in MipsFrameLowering.cpp: - Stop emitting instructions needed to initialize the global pointer register. - Stop emitting .cprestore directive. - Do not take into account the $gp save slot when computing stack size. llvm-svn: 156691	2012-05-12 03:18:00 +00:00
Akira Hatanaka	8f3573034b	Make the following changes in MipsAsmPrinter.cpp: - Remove code which lowers pseudo SETGP01. - Fix LowerSETGP01. The first two of the three instructions that are emitted to initialize the global pointer register now use register $2. - Stop emitting .cpload directive. llvm-svn: 156689	2012-05-12 00:48:43 +00:00
Akira Hatanaka	d918f77ba3	Insert instructions to the entry basic block which initializes the global pointer register. This is the first of the series of patches which clean up the way global pointer register is used. The patches will make the following improvements: - Make $gp an allocatable temporary register rather than reserving it. - Use a virtual register as the global pointer register and let the register allocator decide which register to assign to it or whether spill/reloads are needed. - Make sure $gp is valid at the entry of a called function, which is necessary for functions using lazy binding. - Remove the need for emitting .cprestore and .cpload directives. llvm-svn: 156671	2012-05-12 00:17:17 +00:00
Akira Hatanaka	0661b81bca	Do not replace operands of pseudo instructions with register $zero. llvm-svn: 156663	2012-05-11 23:22:18 +00:00
Chad Rosier	aa9cb9df59	[fast-isel] Add support for selecting @llvm.trap(). llvm-svn: 156646	2012-05-11 21:33:49 +00:00
Brendon Cahoon	5edcf8822d	Updated instruction table due to addded intrinsics. llvm-svn: 156644	2012-05-11 21:10:16 +00:00
Sirish Pande	95d0117bb3	Remove warnings from HexagonVLIWPacketizer. llvm-svn: 156636	2012-05-11 20:00:34 +00:00
Brendon Cahoon	31f8723ef3	Hexagon constant extender support. Patch by Jyotsna Verma. llvm-svn: 156634	2012-05-11 19:56:59 +00:00
Chad Rosier	06e34d9220	Typo. llvm-svn: 156633	2012-05-11 19:43:29 +00:00
Chad Rosier	3268692aa8	[fast-isel] Remove -disable-arm-fast-isel option. -fast-isel=0 suffices. Minor cleanup. llvm-svn: 156632	2012-05-11 19:40:25 +00:00
Sirish Pande	83ccb6ce08	Hexagon V5 intrinsics support. llvm-svn: 156631	2012-05-11 19:39:13 +00:00
Chad Rosier	90f9afe659	[fast-isel] Cleaner fix for when we're unable to handle a non-double multi-reg retval. Hoists check before emitting the call to avoid unnecessary work. rdar://11430407 PR12796 llvm-svn: 156628	2012-05-11 18:51:55 +00:00
Chad Rosier	519b12f927	[fast-isel] Rather then assert (or segfault in a non-asserts build), fall back to selection DAG isel if we're unable to handle a non-double multi-reg retval. rdar://11430407 PR12796 llvm-svn: 156622	2012-05-11 17:41:06 +00:00
Chad Rosier	466d3d8faa	The return type is an unsigned, not a bool. llvm-svn: 156621	2012-05-11 16:41:38 +00:00
Manman Ren	0d5ec28ccc	Add space before an open parenthesis in control flow statements. llvm-svn: 156620	2012-05-11 15:36:46 +00:00
Preston Gurd	09de6ae399	Added X86 Atom latencies to X86InstrMMX.td. llvm-svn: 156615	2012-05-11 14:27:12 +00:00
Hans Wennborg	f9d0e44b82	Implement initial-exec TLS model for 32-bit PIC x86 This fixes a TODO from 2007 :) Previously, LLVM would emit the wrong code here (see the update to test/CodeGen/X86/tls-pie.ll). llvm-svn: 156611	2012-05-11 10:11:01 +00:00
Silviu Baranga	ddc67a7655	Added the missing bit definition for the 4th bit of the STR (post reg) instruction. It is now set to 0. The patch also sets the unpredictable mask for SEL and SXTB-type instructions. llvm-svn: 156609	2012-05-11 09:28:27 +00:00
Silviu Baranga	5a719f9b9a	Fixed the LLVM ARM v7 assembler and instruction printer for 8-bit immediate offset addressing. The assembler and instruction printer were not properly handeling the #-0 immediate. llvm-svn: 156608	2012-05-11 09:10:54 +00:00
Akira Hatanaka	e37614438f	Fix a misleading comment. llvm-svn: 156603	2012-05-11 01:45:15 +00:00
Manman Ren	dc8ad0058f	ARM: peephole optimization to remove cmp instruction This patch will optimize the following cases: sub r1, r3 \| sub r1, imm cmp r3, r1 or cmp r1, r3 \| cmp r1, imm bge L1 TO subs r1, r3 bge L1 or ble L1 If the branch instruction can use flag from "sub", then we can replace "sub" with "subs" and eliminate the "cmp" instruction. rdar: 10734411 llvm-svn: 156599	2012-05-11 01:30:47 +00:00
Dan Gohman	dfab443ae8	Define a new intrinsic, @llvm.debugger. It will be similar to __builtin_trap(), but it generates int3 on x86 instead of ud2. llvm-svn: 156593	2012-05-11 00:19:32 +00:00
Preston Gurd	4fe10a5d9a	Added X86 Atom latencies for instructions in X86InstrInfo.td. llvm-svn: 156579	2012-05-10 21:58:35 +00:00
Eric Christopher	ed51b9ec0b	Add support for the 'X' inline asm operand modifier. Patch by Jack Carter. llvm-svn: 156577	2012-05-10 21:48:22 +00:00
Sirish Pande	fc8118bf41	Hexagon V5 Support - V5 td file. llvm-svn: 156569	2012-05-10 20:24:28 +00:00
Sirish Pande	69295b8963	Hexagon V5 FP Support. llvm-svn: 156568	2012-05-10 20:20:25 +00:00
Manman Ren	b555b382bd	Revert: 156550 "ARM: peephole optimization to remove cmp instruction" This commit broke an external linux bot and gave a compile-time warning. llvm-svn: 156556	2012-05-10 18:49:43 +00:00
Manman Ren	c860887b2d	ARM: peephole optimization to remove cmp instruction This patch will optimize the following cases: sub r1, r3 \| sub r1, imm cmp r3, r1 or cmp r1, r3 \| cmp r1, imm bge L1 TO subs r1, r3 bge L1 or ble L1 If the branch instruction can use flag from "sub", then we can replace "sub" with "subs" and eliminate the "cmp" instruction. rdar: 10734411 llvm-svn: 156550	2012-05-10 16:48:21 +00:00
Nadav Rotem	1a65397017	Fix merge-typo and cleanup llvm-svn: 156541	2012-05-10 12:50:02 +00:00
Nadav Rotem	15946e50c1	AVX2: Add an additional broadcast idiom. llvm-svn: 156540	2012-05-10 12:39:13 +00:00
Nadav Rotem	b86a3fb8d0	Generate AVX/AVX2 shuffles even when there is a memory op somewhere else in the program. Starting r155461 we are able to select patterns for vbroadcast even when the load op is used by other users. Fix PR11900. llvm-svn: 156539	2012-05-10 12:22:05 +00:00
Roman Divacky	e07cc042f6	Mark .opd @progbits, thus avoiding a warning from asm. llvm-svn: 156494	2012-05-09 18:24:23 +00:00
Akira Hatanaka	ca41d13bbd	Add another peephole pattern for conditional moves. llvm-svn: 156460	2012-05-09 02:29:29 +00:00
Jakob Stoklund Olesen	7e21d617ef	Use ptr_rc_tailcall instead of GR32_TC. The getPointerRegClass() hook will return GR32_TC, or whatever is appropriate for the current function. Patch by Yiannis Tsiouris! llvm-svn: 156459	2012-05-09 01:50:09 +00:00
Akira Hatanaka	05b9dad1e6	Make register FP allocatable if the compiled function does not have dynamic allocas. llvm-svn: 156458	2012-05-09 01:38:13 +00:00
Akira Hatanaka	0a8ab718cb	Expand 64-bit shifts if target ABI is O32. llvm-svn: 156457	2012-05-09 00:55:21 +00:00
Richard Trieu	edf46e6b6e	Remove unused variable to silence compiler warning. llvm-svn: 156456	2012-05-09 00:30:21 +00:00
Jakob Stoklund Olesen	10191fd44f	Use a shared function for a common operation. llvm-svn: 156441	2012-05-08 23:27:30 +00:00
Eric Christopher	d666bb0dd8	Remove excess semi-colons to quiet warnings. llvm-svn: 156416	2012-05-08 20:45:04 +00:00
Sirish Pande	1c9f7dbc10	Update load/store instruction patterns in Hexagon V4. llvm-svn: 156411	2012-05-08 19:50:20 +00:00
Akira Hatanaka	c515bfb9e7	Define mips16 instruction formats. Patch by Reed Kotler. llvm-svn: 156408	2012-05-08 19:08:58 +00:00
Jakob Stoklund Olesen	276ae14023	s/CSR_Ghc/CSR_NoRegs/ Share the CalleeSavedRegs defs between all calling conventions having no callee-saved registers. Patch by Yiannis Tsiouris! llvm-svn: 156382	2012-05-08 15:07:29 +00:00
Craig Topper	7daf897678	Remove 256-bit AVX non-temporal store intrinsics. Similar was previously done for 128-bit. llvm-svn: 156375	2012-05-08 06:58:15 +00:00
Jakob Stoklund Olesen	3c52f0281f	Add an MF argument to TRI::getPointerRegClass() and TII::getRegClass(). The getPointerRegClass() hook can return register classes that depend on the calling convention of the current function (ptr_rc_tailcall). So far, we have been able to infer the calling convention from the subtarget alone, but as we add support for multiple calling conventions per target, that no longer works. Patch by Yiannis Tsiouris! llvm-svn: 156328	2012-05-07 22:10:26 +00:00
Jakob Stoklund Olesen	c4b3a7a1d7	Fix bug in TRI::getCommonSuperRegClass(). Test cases for this code are coming. It is not used for anything yet. llvm-svn: 156327	2012-05-07 21:59:31 +00:00
Jakob Stoklund Olesen	65a6dafc8d	Add TRI::getCommonSuperRegClass(). This function is a generalization of getMatchingSuperRegClass() to the symmetric case where both sides are using a sub-register index. It will find a super-register class and sub-register indexes that make this diagram commute: PreA SuperRC ----------> RCA \| \| \| \| PreB \| \| SubA \| \| \| \| V V RCB ----------> SubRC SubB This can be used to coalesce copies like: %vreg1:sub16 = COPY %vreg2:sub16; GR64:%vreg1, GR32: %vreg2 llvm-svn: 156317	2012-05-07 19:14:58 +00:00
Chad Rosier	d8287fec17	Fix a regression from r147481. This combine should only happen if there is a single use. rdar://11360370 llvm-svn: 156316	2012-05-07 18:47:44 +00:00
Manman Ren	ef4e0479ec	X86: optimization for -(x != 0) This patch will optimize -(x != 0) on X86 FROM cmpl $0x01,%edi sbbl %eax,%eax notl %eax TO negl %edi sbbl %eax %eax In order to generate negl, I added patterns in Target/X86/X86InstrCompiler.td: def : Pat<(X86sub_flag 0, GR32:$src), (NEG32r GR32:$src)>; rdar: 10961709 llvm-svn: 156312	2012-05-07 18:06:23 +00:00
Eric Christopher	0d8c15d20f	Add support for the 'x' constraint. Patch by Jack Carter. llvm-svn: 156295	2012-05-07 06:25:19 +00:00
Eric Christopher	9c492e6ebf	Add support for the 'l' constraint. Patch by Jack Carter. llvm-svn: 156294	2012-05-07 06:25:15 +00:00
Eric Christopher	e3c494de82	Add support for the 'c' constraint. Patch by Jack Carter. llvm-svn: 156293	2012-05-07 06:25:10 +00:00
Eric Christopher	c18ae4a3b1	Add support for the 'P' constraint. Patch by Jack Carter. llvm-svn: 156292	2012-05-07 06:25:02 +00:00
Craig Topper	dbb98b4917	Fix some issues in the f16c instructions. llvm-svn: 156287	2012-05-07 06:00:15 +00:00
Eric Christopher	470578a91b	Add support for the 'O' constraint. Patch by Jack Carter. llvm-svn: 156285	2012-05-07 05:46:48 +00:00
Eric Christopher	e07aa430b8	Add support for the 'N' inline asm constraint. Patch by Jack Carter. llvm-svn: 156284	2012-05-07 05:46:43 +00:00
Eric Christopher	1109b3406d	Add support for the 'L' inline asm constraint. Patch by Jack Carter. llvm-svn: 156283	2012-05-07 05:46:37 +00:00
Eric Christopher	3ff88a05b7	Add support for the inline asm constraint 'K'. llvm-svn: 156282	2012-05-07 05:46:29 +00:00
Craig Topper	d4e1894ec1	Add SSE4A MOVNTSS/MOVNTSD instructions. llvm-svn: 156281	2012-05-07 05:36:19 +00:00
Eric Christopher	7201e1b4b9	Support the 'J' constraint. Patch by Jack Carter. llvm-svn: 156280	2012-05-07 03:13:42 +00:00
Eric Christopher	1d6c89eea1	Add support for the 'I' inline asm constraint. Also add tests from the previous 2 patches. Patch by Jack Carter. llvm-svn: 156279	2012-05-07 03:13:32 +00:00
Eric Christopher	58daf04681	Allow 64 bit integer values in gpu registers if arch and abi are 64 bit. Patch by Jack Carter. llvm-svn: 156278	2012-05-07 03:13:22 +00:00
Eric Christopher	cfcd77b0bc	When using inline asm constraints representing non-floating point general registers allow 8 and 16-bit elements. Patch by Jack Carter. llvm-svn: 156277	2012-05-07 03:13:16 +00:00
Craig Topper	00a1e6d48b	Use MVT instead of EVT as the argument to all the shuffle decode functions. Simplify some of the decode functions. llvm-svn: 156268	2012-05-06 19:46:21 +00:00
Craig Topper	804be3b546	Add VPERMQ/VPERMPD to the list of target specific shuffles that can be looked through for DAG combine purposes. llvm-svn: 156266	2012-05-06 18:54:26 +00:00
Craig Topper	54bdb350e2	Add shuffle decode support for VPERMQ/VPERMPD. llvm-svn: 156265	2012-05-06 18:44:02 +00:00
Jim Grosbach	7ce129268e	Nuke a few dead remnants of the CBE. llvm-svn: 156241	2012-05-05 17:45:12 +00:00
Benjamin Kramer	e31f31e5c0	Add a new target hook "predictableSelectIsExpensive". This will be used to determine whether it's profitable to turn a select into a branch when the branch is likely to be predicted. Currently enabled for everything but Atom on X86 and Cortex-A9 devices on ARM. I'm not entirely happy with the name of this flag, suggestions welcome ;) llvm-svn: 156233	2012-05-05 12:49:14 +00:00
Benjamin Kramer	a25a61b9e8	NVPTX: Initialize the UseF32FTZ flag. llvm-svn: 156232	2012-05-05 11:22:02 +00:00
Eric Christopher	de9e92ed9b	Typo. llvm-svn: 156226	2012-05-05 01:16:06 +00:00
David Blaikie	891d0a3d20	Fix warnings in release build. This fixes a couple of Clang warnings in release builds of LLVM: * Missing return in ISelLowering * Unused variable in NVPTXutil.cpp llvm-svn: 156216	2012-05-04 22:34:16 +00:00
Kevin Enderby	cabbae653e	Tweak to the fix in r156212, as with the change in removing the shift the SignExtend32<22>(Val<<1) also needs to change to SignExtend32<21>(Val) . llvm-svn: 156213	2012-05-04 22:09:52 +00:00
Kevin Enderby	8ce1ada1be	Fix a bug in the ARM disassembler for wide branch conditional instructions where the symbolic operand's displacement was incorrectly shifted left by 1. rdar://11387046 llvm-svn: 156212	2012-05-04 22:02:27 +00:00
Chandler Carruth	cd3464ee22	Fix a Clang warning in the new NVPTX backend: In file included from ../lib/Target/NVPTX/VectorElementize.cpp:53: ../lib/Target/NVPTX/NVPTX.h:44:3: warning: default label in switch which covers all enumeration values [-Wcovered-switch-default] default: assert(0 && "Unknown condition code"); ^ 1 warning generated. The prevailing pattern in LLVM is to not use a default label, and instead to use llvm_unreachable to denote that the switch in fact covers all return paths from the function. llvm-svn: 156209	2012-05-04 21:35:49 +00:00
Justin Holewinski	ae556d3ef7	This patch adds a new NVPTX back-end to LLVM which supports code generation for NVIDIA PTX 3.0. This back-end will (eventually) replace the current PTX back-end, while maintaining compatibility with it. The new target machines are: nvptx (old ptx32) => 32-bit PTX nvptx64 (old ptx64) => 64-bit PTX The sources are based on the internal NVIDIA NVPTX back-end, and contain more functionality than the current PTX back-end currently provides. NV_CONTRIB llvm-svn: 156196	2012-05-04 20:18:50 +00:00
Sebastian Pop	2420e8b7d5	Added missing CMN case in Thumb2SizeReduction pass so that LLVM emits 16-bits encoding of CMN instructions. llvm-svn: 156195	2012-05-04 19:53:56 +00:00
Preston Gurd	d6c440cd4c	Adds Intel Atom scheduling latencies to X86InstrSystem.td. llvm-svn: 156194	2012-05-04 19:26:37 +00:00
Matt Beaumont-Gay	e82ab6baa7	Pacify GCC's -Wreturn-type llvm-svn: 156189	2012-05-04 18:34:27 +00:00
Hans Wennborg	aea412008e	Make ARM and Mips use TargetMachine::getTLSModel() This moves the logic for selecting a TLS model to a single place, instead of the previous three (ARM, Mips, and X86 which already uses this function). llvm-svn: 156162	2012-05-04 09:40:39 +00:00
Craig Topper	bdd2e34b1f	Fix some loops to match coding standards. No functional change intended. llvm-svn: 156159	2012-05-04 06:39:13 +00:00
Craig Topper	d4d3237bb8	Fix up some spacing. No functional change. llvm-svn: 156158	2012-05-04 06:18:33 +00:00
Craig Topper	e2ae413746	Simplify broadcast lowering code. No functional change intended. llvm-svn: 156157	2012-05-04 05:49:51 +00:00
Craig Topper	42f2182366	Allow v16i16 and v32i8 shuffles to be rewritten as narrower shuffles. llvm-svn: 156156	2012-05-04 04:44:49 +00:00
Craig Topper	59063c0a3d	Simplify shuffle narrowing code a bit. No functional change intended. llvm-svn: 156154	2012-05-04 04:08:44 +00:00
Jakob Stoklund Olesen	796e5272ab	Remove the SubRegClasses field from RegisterClass descriptions. This information in now computed by TableGen. llvm-svn: 156152	2012-05-04 03:30:34 +00:00
Jakob Stoklund Olesen	34a8f13e5f	Initialize SparcInstrInfo before SparcTargetLowering. The TargetLowering construction needs to use a valid TargetRegisterInfo instance. llvm-svn: 156146	2012-05-04 02:16:39 +00:00
Jakob Stoklund Olesen	57c7050675	Add a SuperRegClassIterator class. This iterator class provides a more abstract interface to the (Idx, Mask) lists of super-registers for a register class. The layout of the tables shouldn't be exposed to clients. llvm-svn: 156144	2012-05-04 01:48:29 +00:00
Jakob Stoklund Olesen	2f460ae3b4	Use a shared implementation of getMatchingSuperRegClass(). TargetRegisterClass now gives access to the necessary tables. llvm-svn: 156122	2012-05-03 22:49:04 +00:00
Kevin Enderby	914223010c	Fix issues with the ARM bl and blx thumb instructions and the J1 and J2 bits for the assembler and disassembler. Which were not being set/read correctly for offsets greater than 22 bits in some cases. Changes to lib/Target/ARM/ARMAsmBackend.cpp from Gideon Myles! llvm-svn: 156118	2012-05-03 22:41:56 +00:00
Sirish Pande	f8e5e3c072	Support for target dependent Hexagon VLIW packetizer. This patch creates and optimizes packets as per Hexagon ISA rules. llvm-svn: 156109	2012-05-03 21:52:53 +00:00
Silviu Baranga	9560af848c	Fixed disassembler for vstm/vldm ARM VFP instructions. llvm-svn: 156077	2012-05-03 16:38:40 +00:00
Sirish Pande	c92c31674e	Extensions of Hexagon V4 instructions. This adds new instructions for Hexagon V4 architecture. llvm-svn: 156071	2012-05-03 16:18:50 +00:00
Craig Topper	242183834a	Use 'unsigned' instead of 'int' in a few places dealing with counts of vector elements. llvm-svn: 156060	2012-05-03 07:26:59 +00:00
Craig Topper	315a5cc789	Fix 256-bit vpshuflw and vpshufhw immediate encoding to handle undefs in the lower half correctly. Missed in r155982. llvm-svn: 156059	2012-05-03 07:12:59 +00:00
Andrew Trick	32aea358e1	Added TargetRegisterInfo::getAllocatableClass. The ensures that virtual registers always belong to an allocatable class. If your target attempts to create a vreg for an operand that has no allocatable register subclass, you will crash quickly. This ensures that targets define register classes as intended. llvm-svn: 156046	2012-05-03 01:14:37 +00:00
Preston Gurd	926afd7401	For Intel Atom, use ILP scheduling always, instead of ILP for 64 bit and Hybrid for 32 bit, since benchmarks show ILP scheduling is better most of the time. llvm-svn: 156028	2012-05-02 22:02:02 +00:00
Preston Gurd	c0b976c42a	Change the Intel Atom detection code to recognize Lincroft and Medfield. llvm-svn: 156025	2012-05-02 21:38:46 +00:00
Jim Grosbach	28b0b7279e	ARM: Add missing two-operand VBIC aliases. llvm-svn: 156019	2012-05-02 21:11:56 +00:00
Preston Gurd	fa3f6cb830	This patch continues the work of adding instruction latencies for X86 Atom, by providing the latencies for the instructions in X86InstrFPStack.td. llvm-svn: 155996	2012-05-02 16:03:35 +00:00
Manman Ren	f02efc8731	Revert r155853 The commit is intended to fix rdar://10961709. But it is the root cause of PR12720. Revert it for now. llvm-svn: 155992	2012-05-02 15:24:32 +00:00
Richard Barton	0fc56890ba	Disallow YIELD and other allocated nop hints in pre-ARMv6 architectures. llvm-svn: 155983	2012-05-02 09:43:18 +00:00
Craig Topper	c73bc39c22	Add support for selecting AVX2 vpshuflw and vpshufhw. Add decoding support for AsmPrinter. llvm-svn: 155982	2012-05-02 08:03:44 +00:00
Jakub Staszak	6126401c83	Remove unneeded break. llvm-svn: 155959	2012-05-01 23:08:16 +00:00
Jakub Staszak	339380286b	Remove trailing spaces. llvm-svn: 155956	2012-05-01 23:04:38 +00:00
Jim Grosbach	1d20efb837	ARM: Add a few missing add->sub aliases w/ 'w' suffix. Aliases for adding a negative immediate when using an explicit 'w' suffix. E.g., adds.w r2, #-16 adds.w r2, r2, #-16 addw r2, #-16 addw r2, #-16 addw r2, r2, #-16 rdar://11330769 llvm-svn: 155946	2012-05-01 21:17:34 +00:00
Jim Grosbach	70bed4faaf	ARM: allow vanilla expressions for movw/movt. Expressions for movw/movt don't always have an :upper16: or :lower16: on them and that's ok. When they don't, it's just a plain [0-65536] immediate result, effectively the same as a :lower16: variant kind. rdar://10550147 llvm-svn: 155941	2012-05-01 20:43:21 +00:00
Preston Gurd	5ae5278ca1	This patch marks the X86 floating point stack registers ST0-ST7 as reserved in order to avoid assertion failures in the register scavenger. The assertion failures were “Bad machine code: Using an undefined physical register” and “Bad machine code: MBB exits via unconditional fall-through but its successor differs from its CFG successor!”. llvm-svn: 155930	2012-05-01 19:50:22 +00:00
Manman Ren	425a55c1ce	X86: optimization for max-like struct This patch will optimize the following cases on X86 (a > b) ? (a-b) : 0 (a >= b) ? (a-b) : 0 (b < a) ? (a-b) : 0 (b <= a) ? (a-b) : 0 FROM movl %edi, %ecx subl %esi, %ecx cmpl %edi, %esi movl $0, %eax cmovll %ecx, %eax TO xorl %eax, %eax subl %esi, %edi cmovll %eax, %edi movl %edi, %eax rdar: 10734411 llvm-svn: 155919	2012-05-01 17:16:15 +00:00
Alexey Samsonov	c4b3ad8195	X86: Use StackRegister instead of FrameRegister in getFrameIndexReference (to generate debug info for local variables) if stack needs realignment llvm-svn: 155917	2012-05-01 15:16:06 +00:00
Benjamin Kramer	cb3e98cf44	Move MipsDisassembler classes into an anonymous namespace. llvm-svn: 155915	2012-05-01 14:34:24 +00:00
Benjamin Kramer	512c1dce8f	Value-initialize global to avoid global construction. llvm-svn: 155909	2012-05-01 10:48:02 +00:00
Bill Wendling	b12f16e75f	Change the PassManager from a reference to a pointer. The TargetPassManager's default constructor wants to initialize the PassManager to 'null'. But it's illegal to bind a null reference to a null l-value. Make the ivar a pointer instead. PR12468 llvm-svn: 155902	2012-05-01 08:27:43 +00:00
Craig Topper	05eb6e096a	Allow BMI, AES, F16C, POPCNT, FMA3, and CLMUL to be detected on AMD processors. llvm-svn: 155899	2012-05-01 07:10:32 +00:00
Craig Topper	bae0e9ea1d	Make XOP and FMA4 require SSE4A to match GCC behavior. Use this to simplify Bulldozer feature list. llvm-svn: 155897	2012-05-01 06:54:48 +00:00
Craig Topper	d32ebcc36b	Attempt to handle MRMInitReg in emitVEXOpcodePrefix. Hopefully fixes PR12711. llvm-svn: 155896	2012-05-01 06:34:01 +00:00
Craig Topper	43518cc55f	Make XOP imply AVX as its needed to legalize the registers types. llvm-svn: 155891	2012-05-01 05:41:41 +00:00
Craig Topper	c0cef32b83	Remove HasSSE2 from AES and CLMUL predicates. It's now implied by the HasAES and HasCLMUL predicates. llvm-svn: 155890	2012-05-01 05:35:02 +00:00
Craig Topper	29dd148a71	Make CLMUL and AES imply SSE2 since its needed to legalize the type. llvm-svn: 155888	2012-05-01 05:28:32 +00:00
Craig Topper	0eacda5f69	Enable AVX and FMA4 for AMD Bulldozer processors. llvm-svn: 155885	2012-05-01 05:18:13 +00:00
Manman Ren	4f4d5c8fc8	X86: optimization for -(x != 0) This patch will optimize -(x != 0) on X86 FROM cmpl $0x01,%edi sbbl %eax,%eax notl %eax TO negl %edi sbbl %eax %eax llvm-svn: 155853	2012-04-30 22:51:25 +00:00
Jim Grosbach	e78031a9f3	ARM: Diagnostics for out of range fixups. Replace some assert() calls w/ actual diagnostics. In a perfect world, there'd be range checks on these values long before things ever reached this code. For now, though, issuing a better-late-than-never diagnostic is still a big improvement over assert(). rdar://11347287 llvm-svn: 155851	2012-04-30 22:30:43 +00:00
Jakob Stoklund Olesen	8503ba984f	Fix address calculation error from r155744. This was exposed by SingleSource/UnitTests/Vector/constpool.c. The computed size of a basic block isn't always a multiple of its known alignment, and that can introduce extra alignment padding after the block. <rdar://problem/11347135> llvm-svn: 155845	2012-04-30 20:19:00 +00:00
Chad Rosier	d427d51c2b	Tidy up. No functional change intended. llvm-svn: 155832	2012-04-30 17:47:15 +00:00
Derek Schuff	b051adf263	Fix fastcc structure return with fast-isel on x86-32 On x86-32, structure return via sret lets the callee pop the hidden pointer argument off the stack, which the caller then re-pushes. However if the calling convention is fastcc, then a register is used instead, and the caller should not adjust the stack. This is implemented with a check of IsTailCallConvention X86TargetLowering::LowerCall but is now checked properly in X86FastISel::DoSelectCall. (this time, actually commit what was reviewed!) llvm-svn: 155825	2012-04-30 16:57:15 +00:00
Bob Wilson	9245c93656	Don't introduce illegal types when creating vmull operations. <rdar://11324364> ARM BUILD_VECTORs created after type legalization cannot use i8 or i16 operands, since those types are not legal. Instead use i32 operands, which will be implicitly truncated by the BUILD_VECTOR to match the element type. llvm-svn: 155824	2012-04-30 16:53:34 +00:00
Craig Topper	55b3990837	No need to normalize index before calling Extract128BitVector llvm-svn: 155811	2012-04-30 05:17:10 +00:00
Pete Cooper	f76b5fe5ab	Copied all the VEX prefix encoding code from X86MCCodeEmitter to the x86 JIT emitter. Needs some major refactoring as these two code emitters are almost identical llvm-svn: 155810	2012-04-30 03:56:44 +00:00
Jakub Staszak	da03f3ba64	Remove unneeded casts. No functionality change. llvm-svn: 155800	2012-04-29 20:52:53 +00:00
Craig Topper	3b94fa63d6	Simplify code a bit. No functional change intended. llvm-svn: 155798	2012-04-29 20:22:05 +00:00
Kalle Raiskila	4c5f83ea19	Update the documentation of CellSPU, in case it gets removed in 3.1. llvm-svn: 155797	2012-04-29 20:00:55 +00:00
Jakob Stoklund Olesen	ae7521d1e4	Fix a problem with blocks that need to be split twice. The code could search past the end of the basic block when there was already a constant pool entry after the block. Test case with giant basic block in SingleSource/UnitTests/Vector/constpool.c llvm-svn: 155753	2012-04-28 06:21:38 +00:00
Jim Grosbach	c6f32b3295	ARM: Thumb add(sp plus register) asm constraints. Make sure when parsing the Thumb1 sp+register ADD instruction that the source and destination operands match. In thumb2, just use the wide encoding if they don't. In Thumb1, issue a diagnostic. rdar://11219154 llvm-svn: 155748	2012-04-27 23:51:36 +00:00
Jim Grosbach	9d8f6f3d9d	ARM: Tweak tADDrSP definition for consistent operand order. Make the operand order of the instruction match that of the asm syntax. llvm-svn: 155747	2012-04-27 23:51:33 +00:00
Derek Schuff	a99b168145	Revert r155745 llvm-svn: 155746	2012-04-27 23:37:41 +00:00
Derek Schuff	bbf8b83e90	Fix fastcc structure return with fast-isel on x86-32 On x86-32, structure return via sret lets the callee pop the hidden pointer argument off the stack, which the caller then re-pushes. However if the calling convention is fastcc, then a register is used instead, and the caller should not adjust the stack. This is implemented with a check of IsTailCallConvention X86TargetLowering::LowerCall but is now checked properly in X86FastISel::DoSelectCall. llvm-svn: 155745	2012-04-27 23:27:17 +00:00
Jakob Stoklund Olesen	5f0d1b462c	Track worst case alignment padding more accurately. Previously, ARMConstantIslandPass would conservatively compute the address of an aligned basic block as: RoundUpToAlignment(Offset + UnknownPadding) This worked fine for the layout algorithm itself, but it could fool the verify() function because it accounts for alignment padding twice: Once when adding the worst case UnknownPadding, and again by rounding up the fictional block offset. This meant that when optimizeThumb2Instructions would shrink an instruction, the conservative distance estimate could grow. That shouldn't be possible since the woorst case alignment padding wss already included. This patch drops the use of RoundUpToAlignment, and depends only on worst case padding to compute conservative block offsets. This has the weird effect that the computed offset for an aligned block may not be aligned. The important difference is that shrinking an instruction can never cause the estimated distance between two instructions to grow. The estimated distance is always larger than the real distance that only the assembler knows. <rdar://problem/11339352> llvm-svn: 155744	2012-04-27 22:58:38 +00:00
Craig Topper	0fa6c7e593	Use 'unsigned' instead of 'int' in several places when retrieving number of vector elements. llvm-svn: 155742	2012-04-27 22:54:43 +00:00
Chad Rosier	32c2178ef3	Add x86-specific DAG combine to simplify: x == -y --> x+y == 0 x != -y --> x+y != 0 On x86, the generated code goes from negl %esi cmpl %esi, %edi je .LBB0_2 to addl %esi, %edi je .L4 This case is correctly handled for ARM with "cmn". Patch by Manman Ren. rdar://11245199 PR12545 llvm-svn: 155739	2012-04-27 22:33:25 +00:00
Craig Topper	42cd8d2c00	Tidy up spacing. llvm-svn: 155733	2012-04-27 21:05:09 +00:00
Lang Hames	ea001225c1	Fix the order of the operands in the llvm.fma intrinsic patterns for ARM, <rdar://problem/11325085>. llvm-svn: 155724	2012-04-27 18:51:24 +00:00
Richard Barton	82f95ea2ad	Fix ARM assembly parsing for upper case condition codes on IT instructions. llvm-svn: 155720	2012-04-27 17:34:01 +00:00
Benjamin Kramer	913da4b261	X86: Don't emit conditional floating point moves on when targeting pre-pentiumpro architectures. * Model FPSW (the FPU status word) as a register. * Add ISel patterns for the FUCOM, FNSTSW and SAHF instructions. During Legalize/Lowering, build a node sequence to transfer the comparison result from FPSW into EFLAGS. If you're wondering about the right-shift: That's an implicit sub-register extraction (%ax -> %ah) which is handled later on by the instruction selector. Fixes PR6679. Patch by Christoph Erhardt! llvm-svn: 155704	2012-04-27 12:07:43 +00:00
Richard Barton	f435b09eaf	Refactor IT handling not to store the bottom bit of the condition code in the mask operand in the MCInst. llvm-svn: 155700	2012-04-27 08:42:59 +00:00
Evan Cheng	1ec87ee096	Implement a bastardized ABI. llvm-svn: 155686	2012-04-27 02:11:10 +00:00
Evan Cheng	f52003de56	- thumbv6 shouldn't imply +thumb2. Cortex-M0 doesn't suppport 32-bit Thumb2 instructions. - However, it does support dmb, dsb, isb, mrs, and msr. rdar://11331541 llvm-svn: 155685	2012-04-27 01:27:19 +00:00
Jim Grosbach	3d6c629e26	ARM: Thumb ldr(literal) base address alignment is 32-bits. The base address for the PC-relative load is Align(PC,4), so it's the address of the word containing the 16-bit instruction, not the address of the instruction itself. Ugh. rdar://11314619 llvm-svn: 155659	2012-04-26 20:48:12 +00:00
Preston Gurd	81290f4be5	Trivial change to set UseLeaForSP flag in addition to toggling the FeatureLeaForSP feature bit when llvm auto detects Intel Atom. Patch by Andy Zhang llvm-svn: 155655	2012-04-26 19:52:27 +00:00
Tim Northover	3de97b7a86	Use VLD1 in NEON extenting-load patterns instead of VLDR. On some cores it's a bad idea for performance to mix VFP and NEON instructions and since these patterns are NEON anyway, the NEON load should be used. llvm-svn: 155630	2012-04-26 08:46:29 +00:00
Tim Northover	6699a60b0e	Test commit. llvm-svn: 155626	2012-04-26 08:24:07 +00:00
Craig Topper	08ccfbe57b	Enable detection of AVX and AVX2 support through CPUID. Add AVX/AVX2 to corei7-avx, core-avx-i, and core-avx2 cpu names. llvm-svn: 155618	2012-04-26 06:40:15 +00:00
Evan Cheng	9f7ad310b5	If triple is armv7 / thumbv7 and a CPU is specified, do not automatically assume the feature set of v7a. This comes about if the user specifies something like -arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as uxtab in this case. rdar://11318438 llvm-svn: 155601	2012-04-26 01:13:36 +00:00
Richard Barton	ba5b0cc82e	Unify internal representation of ARM instructions with a register right-shifted by #32 . These are stored as shifts by #0 in the MCInst and correctly marshalled when transforming from or to assembly representation. llvm-svn: 155565	2012-04-25 18:00:18 +00:00
Craig Topper	3ec7c2aa84	Add ifdef around getSubtargetFeatureName in tablegen output file so that only targets that want the function get it. This prevents other targets from getting an unused function warning. llvm-svn: 155538	2012-04-25 06:56:34 +00:00
Craig Topper	5ff6dc34b9	Use vector_shuffles instead of target specific unpack nodes for AVX ZERO_EXTEND/ANY_EXTEND combine. These will be converted to target specific nodes during lowering. This is more consistent with other code. llvm-svn: 155537	2012-04-25 06:39:39 +00:00
Akira Hatanaka	2020e27d6d	Do not use $gp as a dedicated global register if the target ABI is not O32. llvm-svn: 155522	2012-04-25 01:24:52 +00:00
Jim Grosbach	5117ef7453	ARM: improved assembler diagnostics for missing CPU features. When an instruction match is found, but the subtarget features it requires are not available (missing floating point unit, or thumb vs arm mode, for example), issue a diagnostic that identifies what the feature mismatch is. rdar://11257547 llvm-svn: 155499	2012-04-24 22:40:08 +00:00
Jim Grosbach	1e75fc1fe1	ARM: Nuke remnant bogus code. r154362 was supposed to delete this bit, but obviously didn't. rdar://11305594 llvm-svn: 155465	2012-04-24 18:39:47 +00:00
Nadav Rotem	810734b7f4	AVX: Add additional vbroadcast replacement sequences for integers. Remove the v2f64 patterns because it does not match any vbroadcast instruction. llvm-svn: 155461	2012-04-24 18:09:59 +00:00
Nadav Rotem	7b7b99c74a	AVX2: The BLENDPW instruction selects between vectors of v16i16 using an i8 immediate. We can't use it here because the shuffle code does not check that the lower part of the word is identical to the upper part. llvm-svn: 155440	2012-04-24 11:27:53 +00:00
Richard Barton	e9600009e9	Refactor Thumb ITState handling in ARM Disassembler to more efficiently use its vector llvm-svn: 155439	2012-04-24 11:13:20 +00:00
Nadav Rotem	aa3ff8da00	AVX: We lower VECTOR_SHUFFLE and BUILD_VECTOR nodes into vbroadcast instructions using the pattern (vbroadcast (i32load src)). In some cases, after we generate this pattern new users are added to the load node, which prevent the selection of the blend pattern. This commit provides fallback patterns which perform in-vector broadcast (using in-vector vbroadcast in AVX2 and pshufd on AVX1). llvm-svn: 155437	2012-04-24 11:07:03 +00:00
Craig Topper	0b65c40821	Remove dangling spaces. Fix some other formatting. llvm-svn: 155429	2012-04-24 06:36:35 +00:00
Craig Topper	6f2a535de2	Simplify code a bit and make it compile better. Remove unused parameters. llvm-svn: 155428	2012-04-24 06:02:29 +00:00
Jim Grosbach	671ad2a572	Tidy up. 80 columns, whitespace, et. al. llvm-svn: 155399	2012-04-23 22:04:10 +00:00
Nadav Rotem	3f8acfc3c4	Optimize the vector UINT_TO_FP, SINT_TO_FP and FP_TO_SINT operations where the integer type is i8 (commonly used in graphics). llvm-svn: 155397	2012-04-23 21:53:37 +00:00
Preston Gurd	9a0914753a	This patch fixes a problem which arose when using the Post-RA scheduler on X86 Atom. Some of our tests failed because the tail merging part of the BranchFolding pass was creating new basic blocks which did not contain live-in information. When the anti-dependency code in the Post-RA scheduler ran, it would sometimes rename the register containing the function return value because the fact that the return value was live-in to the subsequent block had been lost. To fix this, it is necessary to run the RegisterScavenging code in the BranchFolding pass. This patch makes sure that the register scavenging code is invoked in the X86 subtarget only when post-RA scheduling is being done. Post RA scheduling in the X86 subtarget is only done for Atom. This patch adds a new function to the TargetRegisterClass to control whether or not live-ins should be preserved during branch folding. This is necessary in order for the anti-dependency optimizations done during the PostRASchedulerList pass to work properly when doing Post-RA scheduling for the X86 in general and for the Intel Atom in particular. The patch adds and invokes the new function trackLivenessAfterRegAlloc() instead of using the existing requiresRegisterScavenging(). It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of requiresRegisterScavenging(). It changes the all the targets that implemented requiresRegisterScavenging() to also implement trackLivenessAfterRegAlloc(). It adds an assertion in the Post RA scheduler to make sure that post RA liveness information is available when it is needed. It changes the X86 break-anti-dependencies test to use –mcpu=atom, in order to avoid running into the added assertion. Finally, this patch restores the use of anti-dependency checking (which was turned off temporarily for the 3.1 release) for Intel Atom in the Post RA scheduler. Patch by Andy Zhang! Thanks to Jakob and Anton for their reviews. llvm-svn: 155395	2012-04-23 21:39:35 +00:00
Jim Grosbach	41e94d79be	ARM: VSLI two-operand assmebly aliases are tblgen'erated. llvm-svn: 155393	2012-04-23 21:22:04 +00:00
Jim Grosbach	3dada484c3	ARM: tblgen'erate VSRA/VRSRA/VSRI assembly two-operand aliases. llvm-svn: 155392	2012-04-23 21:00:49 +00:00
Jim Grosbach	e5012fbad3	ARM: vqdmulh two-operand aliases are tblgen'erated now. llvm-svn: 155387	2012-04-23 20:37:20 +00:00
Chandler Carruth	3c3bb55a85	Revert r155365, r155366, and r155367. All three of these have regression test suite failures. The failures occur at each stage, and only get worse, so I'm reverting all of them. Please resubmit these patches, one at a time, after verifying that the regression test suite passes. Never submit a patch without running the regression test suite. llvm-svn: 155372	2012-04-23 18:25:57 +00:00
Sirish Pande	a3f8ba2439	Hexagon V5 (floating point) support. llvm-svn: 155367	2012-04-23 17:49:40 +00:00
Sirish Pande	2c7bf00fba	Support for Hexagon architectural feature, new value jump. llvm-svn: 155366	2012-04-23 17:49:28 +00:00
Sirish Pande	6cd2251598	Support for Hexagon VLIW Packetizer. llvm-svn: 155365	2012-04-23 17:49:20 +00:00
Craig Topper	153bb34a3c	Use MVT instead of EVT through all of LowerVECTOR_SHUFFLEtoBlend and not just the switch. Saves a little bit of binary size. llvm-svn: 155339	2012-04-23 07:36:33 +00:00
Craig Topper	0a2c809d09	Make getZeroVector and getOnesVector more alike as far as how they detect 128-bit versus 256-bit vectors. Be explicit about both sizes and use llvm_unreachable. Similar changes to getLegalSplat. llvm-svn: 155337	2012-04-23 07:24:41 +00:00
Craig Topper	2bbe8bcf4e	Tidy up by removing some 'else' after 'return' llvm-svn: 155336	2012-04-23 06:57:04 +00:00
Craig Topper	5c51eeecfc	Tidy up spacing in LowerVECTOR_SHUFFLEtoBlend. Remove code that checks if shuffle operand has a different type than the the shuffle result since it can never happen. llvm-svn: 155333	2012-04-23 06:38:28 +00:00
Craig Topper	a52f0d09b6	Add a couple llvm_unreachables. llvm-svn: 155332	2012-04-23 03:42:40 +00:00
Craig Topper	984dc015ae	Remove some tab characers. llvm-svn: 155331	2012-04-23 03:28:34 +00:00
Craig Topper	ea428fd79c	Remove some 'else' after 'return'. No functional change. llvm-svn: 155330	2012-04-23 03:26:18 +00:00
Craig Topper	bf7d5666f0	Make Extract128BitVector and Insert128BitVector take an unsigned instead of an ConstantNode SDValue. getConstant was almost always called just before only to have the functions take it apart and build a new ConstantSDNode. llvm-svn: 155325	2012-04-22 20:55:18 +00:00
Craig Topper	2d474d6d92	Convert getNode(UNDEF) to getUNDEF. llvm-svn: 155321	2012-04-22 19:29:34 +00:00
Craig Topper	860ed0d20a	Make calls to getVectorShuffle more consistent. Use shuffle VT for calls to getUNDEF instead of requerying. Use &Mask[0] instead of Mask.data(). llvm-svn: 155320	2012-04-22 19:17:57 +00:00
Craig Topper	43397c0900	Tidy up. 80 columns and argument alignment. llvm-svn: 155319	2012-04-22 18:51:37 +00:00
Craig Topper	ad56a744f1	Simplify code by converting multiple places that were manually concatenating 128-bit vectors to use either CONCAT_VECTORS or a helper function. CONCAT_VECTORS will itself be lowered to the same pattern as before. The helper function is needed for concats of BUILD_VECTORs since getNode(CONCAT_VECTORS) will just return a large BUILD_VECTOR and we may be trying to lower large BUILD_VECTORS when this occurs. llvm-svn: 155318	2012-04-22 18:15:59 +00:00
Benjamin Kramer	8877d68db7	ARM: Initialize the HasRAS bit. Found by valgrind. llvm-svn: 155313	2012-04-22 11:52:41 +00:00
Elena Demikhovsky	8d7e56c409	ZERO_EXTEND/SIGN_EXTEND/TRUNCATE optimization for AVX2 llvm-svn: 155309	2012-04-22 09:39:03 +00:00
Bill Wendling	f9774c3253	Remove some potential warnings about variables used uninitialized. llvm-svn: 155307	2012-04-22 07:23:04 +00:00
Craig Topper	6eadae8e60	Make some fixed arrays const. Use array_lengthof in a couple places instead of a hardcoded number. llvm-svn: 155294	2012-04-21 18:58:38 +00:00
Craig Topper	2568bf3089	Tidy up. 80 columns and some other spacing issues. llvm-svn: 155291	2012-04-21 18:13:35 +00:00
NAKAMURA Takumi	e30303fa86	llvm/lib/Target: [PR12611] Add "llvm/Support/raw_ostream.h" for Debug build on MSVC. Thanks to Andy Gibbs, to report the issue. llvm-svn: 155287	2012-04-21 15:31:45 +00:00
NAKAMURA Takumi	54eed760da	HexagonISelLowering.cpp: Reorder #includes. llvm-svn: 155286	2012-04-21 15:31:36 +00:00
NAKAMURA Takumi	df3d5ea990	HexagonInstPrinter.cpp: Suppress -Wunused-variable warnings with -Asserts. llvm-svn: 155281	2012-04-21 11:24:55 +00:00
Jim Grosbach	c931d451cd	ARM: tblgen'erate more NEON two-operand aliases. VMUL and VEXT. llvm-svn: 155258	2012-04-20 23:46:33 +00:00
Jim Grosbach	b4e849b924	ARM: tblgen'erate more NEON two-operand aliases. llvm-svn: 155254	2012-04-20 23:30:14 +00:00
Jim Grosbach	2937df45a8	ARM: Update NEON assembly two-operand aliases. Use the new TwoOperandAliasConstraint to handle lots of the two-operand aliases for NEON instructions. There's still more to go, but this is a good chunk of them. llvm-svn: 155210	2012-04-20 18:12:54 +00:00
Gabor Greif	c8a9abe9df	effectively back out my last change (r155190) llvm-svn: 155195	2012-04-20 11:41:38 +00:00
Gabor Greif	9eccbe9c82	fix obviously bogus (IMO) operand index of the load in asserts (load only has one operand) and smuggle in some whitespace changes too NB: I am obviously testing the water here, and believe that the unguarded cast is still wrong, but why is the getZExtValue of the load's operand tested against zero here? Any review is appreciated. llvm-svn: 155190	2012-04-20 08:58:49 +00:00
Craig Topper	c7242e054d	Convert more uses of XXXRegisterClass to &XXXRegClass. No functional change since they are equivalent. llvm-svn: 155188	2012-04-20 07:30:17 +00:00
Craig Topper	abadc660e0	Convert some uses of XXXRegisterClass to &XXXRegClass. No functional change since they are equivalent. llvm-svn: 155186	2012-04-20 06:31:50 +00:00
Jim Grosbach	9cc324d31a	ARM some VFP tblgen'erated two-operand aliases. llvm-svn: 155178	2012-04-20 00:15:00 +00:00
Jim Grosbach	6b46134862	ARM let TableGen handle a few two-operand aliases. No need for these explicit aliases anymore. Nuke 'em. llvm-svn: 155173	2012-04-19 23:59:26 +00:00
Gabor Greif	180c4445cf	zap tabs llvm-svn: 155128	2012-04-19 15:16:31 +00:00
Kevin Enderby	ec4bd31206	Fixed the llvm-mv X86 disassembler so the 'C' API gets jumps properly symbolicated. These have and operand type of TYPE_RELv which was not handled as isBranch in translateImmediate() in X86Disassembler.cpp. rdar://11268426 llvm-svn: 155074	2012-04-18 23:12:11 +00:00
Chandler Carruth	b415bf98f0	This reverts a long string of commits to the Hexagon backend. These commits have had several major issues pointed out in review, and those issues are not being addressed in a timely fashion. Furthermore, this was all committed leading up to the v3.1 branch, and we don't need piles of code with outstanding issues in the branch. It is possible that not all of these commits were necessary to revert to get us back to a green state, but I'm going to let the Hexagon maintainer sort that out. They can recommit, in order, after addressing the feedback. Reverted commits, with some notes: Primary commit r154616: HexagonPacketizer - There are lots of review comments here. This is the primary reason for reverting. In particular, it introduced large amount of warnings due to a bad construct in tablegen. - Follow-up commits that should be folded back into this when reposting: - r154622: CMake fixes - r154660: Fix numerous build warnings in release builds. - Please don't resubmit this until the three commits above are included, and the issues in review addressed. Primary commit r154695: Pass to replace transfer/copy ... - Reverted to minimize merge conflicts. I'm not aware of specific issues with this patch. Primary commit r154703: New Value Jump. - Primarily reverted due to merge conflicts. - Follow-up commits that should be folded back into this when reposting: - r154703: Remove iostream usage - r154758: Fix CMake builds - r154759: Fix build warnings in release builds - Please incorporate these fixes and and review feedback before resubmitting. Primary commit r154829: Hexagon V5 (floating point) support. - Primarily reverted due to merge conflicts. - Follow-up commits that should be folded back into this when reposting: - r154841: Remove unused variable (fixing build warnings) There are also accompanying Clang commits that will be reverted for consistency. llvm-svn: 155047	2012-04-18 21:31:19 +00:00
Akira Hatanaka	fc1d00bbd6	Mark instruction classes ArithLogicR, ArithLogicI and LoadUpper as isRematerializable. llvm-svn: 155031	2012-04-18 18:52:10 +00:00
Akira Hatanaka	4167bb9346	Delete blank line. llvm-svn: 155030	2012-04-18 18:47:17 +00:00
Silviu Baranga	ca45af9a75	Added support for disassembling unpredictable swp/swpb ARM instructions. llvm-svn: 155004	2012-04-18 14:18:57 +00:00
Silviu Baranga	d5c6a63a50	Fix the bahavior of the disassembler when decoding unpredictable mrs instructions on ARM. Now the diasassembler emmits warnings instead of errors. llvm-svn: 155002	2012-04-18 14:09:07 +00:00
Silviu Baranga	41f1fcd80e	Added support for unpredictable mcrr/mcrr2/mrrc/mrrc2 ARM instruction in the disassembler. Since the upredicability conditions are complex, C++ code was added to handle them. llvm-svn: 155001	2012-04-18 13:12:50 +00:00
Silviu Baranga	a2944116dc	Fixed decoding for the ARM cdp2 instruction. The restriction on the coprocessor number was removed for this instruction. llvm-svn: 155000	2012-04-18 13:02:55 +00:00
Silviu Baranga	9da1918c84	Add suport for unpredicatble cases of the cmp, tst, teq and cmnz ARM instructions in the disassembler. llvm-svn: 154999	2012-04-18 12:48:43 +00:00
Craig Topper	d3c9e404ba	Remove AVX vpermil intrinsics. I removed their uses from clang headers and builtins a while back. llvm-svn: 154985	2012-04-18 05:24:00 +00:00
Joe Groff	a81bcbb9bb	fix pr12559: mark unavailable win32 math libcalls also fix SimplifyLibCalls to use TLI rather than compile-time conditionals to enable optimizations on floor, ceil, round, rint, and nearbyint llvm-svn: 154960	2012-04-17 23:05:54 +00:00
Chad Rosier	41675546eb	Typo. llvm-svn: 154953	2012-04-17 21:48:36 +00:00
Akira Hatanaka	236e14017f	Delete latter half of CMakeLists.txt. llvm-svn: 154936	2012-04-17 18:18:09 +00:00
Akira Hatanaka	71928e681b	Add disassembler to MIPS. Patch by Vladimir Medic. llvm-svn: 154935	2012-04-17 18:03:21 +00:00
Jay Foad	08a0598cd4	Remove unused CCIfSubtarget. llvm-svn: 154921	2012-04-17 11:29:05 +00:00
James Molloy	a9bcf20d22	Fix bad EXTRACT_SUBREG in instruction selection for extending-loads on NEON. llvm-svn: 154915	2012-04-17 08:18:00 +00:00
Craig Topper	354103d8ca	Don't decode vperm2i128 or vperm2f128 into a shuffle if bit 3 or 7 of the immediate is set. llvm-svn: 154907	2012-04-17 05:54:54 +00:00
Kevin Enderby	29ae538647	Fix ARM disassembly of VLD2 (single 2-element structure to all lanes) instructions with writebacks. And add test a case for all opcodes handed by DecodeVLD2DupInstruction() in ARMDisassembler.cpp . llvm-svn: 154884	2012-04-17 00:49:27 +00:00
Jim Grosbach	2bf5f73977	ARM two-operand forms for vhadd and vhsub instructions. rdar://11252521 llvm-svn: 154875	2012-04-16 23:00:25 +00:00
Preston Gurd	5333e2e5ce	Temporarily turn off anti-dependency checking during Post RA scheduling in X86, until the X86 target is changed to properly set up post RA liveness. llvm-svn: 154874	2012-04-16 22:52:28 +00:00
Jim Grosbach	003607f474	ARM handle :lower16: and :upper16: after a '#' prefix. rdar://11252521 llvm-svn: 154862	2012-04-16 21:18:46 +00:00
Richard Smith	12da79b859	Fix incorrect atomics codegen introduced in r154705, and extend test to catch it. llvm-svn: 154845	2012-04-16 18:43:53 +00:00
David Blaikie	e67cdc07a5	Remove unused variable llvm-svn: 154841	2012-04-16 18:10:13 +00:00
Jim Grosbach	6068d0014a	ARM assembly two-operand forms for VRSHL. rdar://11252521 llvm-svn: 154840	2012-04-16 18:03:16 +00:00
Akira Hatanaka	3e9d81f47c	Do not add offset in applyFixup. This has already been accounted for in Value. llvm-svn: 154838	2012-04-16 18:00:19 +00:00
Jim Grosbach	cd1c000a9f	ARM two-operand aliases for VRHADD instructions. rdar://11252521 llvm-svn: 154832	2012-04-16 17:14:11 +00:00
Sirish Pande	96e8ee17e0	Hexagon V5 (Floating Point) Support. llvm-svn: 154829	2012-04-16 17:05:06 +00:00
Craig Topper	4badeb3f0d	Replace vpermd/vpermps intrinic patterns with custom lowering to target specific nodes. llvm-svn: 154801	2012-04-16 07:13:00 +00:00
Craig Topper	26d7a94981	Change type profile for vpermv back to using operand type for the mask argument to match intrinsic behavior. Add a bitcast to the lowering code to convert mask from v8i32 to v8f32 for vpermps. llvm-svn: 154798	2012-04-16 06:43:40 +00:00
Craig Topper	c0075aa7ff	Flip the arguments when converting vpermd/vpermps intrinsics into instructions. The intrinsic has the mask as the last operand, but the instruction has it as the second. llvm-svn: 154797	2012-04-16 06:26:15 +00:00
Craig Topper	b86fa404d3	Merge vpermps/vpermd and vpermpd/vpermq SD nodes. llvm-svn: 154782	2012-04-16 00:41:45 +00:00
Craig Topper	b04fe34030	Fix SDTypeProfile for vpermps. The mask operand should be v8i32. llvm-svn: 154781	2012-04-16 00:12:20 +00:00
Craig Topper	1f8c9eb925	Spacing fixes and 80 column fixes. Use 0 instead of 0x80 for undef indices in vpermps/vpermd. Hardware only looks at lower 3-bits. llvm-svn: 154780	2012-04-15 23:48:57 +00:00
Craig Topper	bfc9a5f7d3	Remove AVX2 vpermq and vpermpd intrinsics. These can now be handled with normal shuffle vectors. llvm-svn: 154778	2012-04-15 22:43:31 +00:00
Nadav Rotem	42bcd04ee3	Fix PR12529. The Vxx family of instructions are only supported by AVX. Use non-vex instructions for SSE4. llvm-svn: 154770	2012-04-15 19:36:44 +00:00
Benjamin Kramer	673824b4a1	Wire up support for diagnostic ranges in the ARMAsmParser. As an example, attach range info to the "invalid instruction" message: $ clang -arch arm -c asm.c asm.c:2:11: error: invalid instruction __asm__("foo r0"); ^ <inline asm>:1:2: note: instantiated into assembly here foo r0 ^~~ llvm-svn: 154765	2012-04-15 17:04:27 +00:00
Elena Demikhovsky	779a72b49e	Added VPERM optimization for AVX2 shuffles llvm-svn: 154761	2012-04-15 11:18:59 +00:00
NAKAMURA Takumi	67de410135	HexagonCopyToCombine.cpp: Silence two warnings, -Wunused-variable, with -Asserts. llvm-svn: 154759	2012-04-15 05:33:43 +00:00
NAKAMURA Takumi	355eebf4cf	Target/Hexagon: Tweak to fix msvc build. llvm-svn: 154758	2012-04-15 05:09:09 +00:00
Richard Smith	3e8f1f6aea	Fix X86 codegen for 'atomicrmw nand' to generate x = ~(x & y), not x = ~x & y. llvm-svn: 154705	2012-04-13 22:47:00 +00:00

... 3 4 5 6 7 ...

21525 Commits