llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	ad100ec90e	Contrary to what the documentation says, .lcomm alignment on COFF is in bytes, not power of 2. llvm-svn: 163405	2012-09-07 18:56:10 +00:00
Chad Rosier	b759ede963	Update function names to conform to guidelines. No functional change intended. llvm-svn: 163401	2012-09-07 18:16:38 +00:00
Jakob Stoklund Olesen	e45e22b20f	Custom DAGCombine for and/or/xor are for all ARMs. The 'select' transformations apply to all ARM architectures and don't require hasV6T2Ops. llvm-svn: 163396	2012-09-07 17:34:15 +00:00
Benjamin Kramer	47f9ec92cb	MC: Overhaul handling of .lcomm - Darwin lied about not supporting .lcomm and turned it into zerofill in the asm parser. Push the zerofill-conversion down into macho-specific code. - This makes the tri-state LCOMMType enum superfluous, there are no targets without .lcomm. - Do proper error reporting when trying to use .lcomm with alignment on a target that doesn't support it. - .comm and .lcomm alignment was parsed in bytes on COFF, should be power of 2. - Fixes PR13755 (.lcomm crashes on ELF). llvm-svn: 163395	2012-09-07 17:25:13 +00:00
Benjamin Kramer	e3d658bb6c	PR13754: llvm-mc/x86 crashes on .cfi directives without the % prefix for registers. gas accepts this and it seems to be common enough to be worth supporting. This doesn't affect the parsing of reg operands outside of .cfi directives. llvm-svn: 163390	2012-09-07 14:51:35 +00:00
Benjamin Kramer	fa530570db	MipsAsmParser: Fix a couple of string use-after-frees and misuses of classof. llvm-svn: 163383	2012-09-07 09:47:42 +00:00
Hans Wennborg	08238adbbb	SimplifyCFG: ValidLookupTableConstant should be static llvm-svn: 163378	2012-09-07 08:22:57 +00:00
Michael Liao	b7cd341901	Stop emitting lifetime region info when stack coloring is not enabled in O0 - this should fix PR13780 llvm-svn: 163370	2012-09-07 05:13:00 +00:00
Jack Carter	97158ca5c2	The Mips standalone assembler aliased instruction support. The assembler can alias one instruction into another based on the operands. For example the jump instruction "J" takes and immediate operand, but if the operand is a register the assembler will change it into a jump register "JR" instruction. These changes are in the instruction td file. Test cases included Contributer: Vladimir Medic llvm-svn: 163368	2012-09-07 01:42:38 +00:00
Jack Carter	be33217bb8	The Mips standalone assembler intial directive support. Actually these are just stubs for parsing the directives. Semantic support will come later. Test cases included Contributer: Vladimir Medic llvm-svn: 163364	2012-09-07 00:48:02 +00:00
Jack Carter	a63b16ac1e	The Mips standalone assembler fpu instruction support. Test cases included Contributer: Vladimir Medic llvm-svn: 163363	2012-09-07 00:23:42 +00:00
Michael Liao	026f833368	Re-work bit/bits value resolving in tblgen - This patch is inspired by the failure of the following code snippet which is used to convert enumerable values into encoding bits to improve the readability of td files. class S<int s> { bits<2> V = !if(!eq(s, 8), {0, 0}, !if(!eq(s, 16), {0, 1}, !if(!eq(s, 32), {1, 0}, !if(!eq(s, 64), {1, 1}, {?, ?})))); } Later, PR8330 is found to report not exactly the same bug relevant issue to bit/bits values. - Instead of resolving bit/bits values separately through resolveBitReference(), this patch adds getBit() for all Inits and resolves bit value by resolving plus getting the specified bit. This unifies the resolving of bit with other values and removes redundant logic for resolving bit only. In addition, BitsInit::resolveReferences() is optimized to take advantage of this origanization by resolving VarBitInit's variable reference first and then getting bits from it. - The type interference in '!if' operator is revised to support possible combinations of int and bits/bit in MHS and RHS. - As there may be illegal assignments from integer value to bit, says assign 2 to a bit, but we only check this during instantiation in some cases, e.g. bit V = !if(!eq(x, 17), 0, 2); Verbose diagnostic message is generated when invalid value is resolveed to help locating the error. - PR8330 is fixed as well. llvm-svn: 163360	2012-09-06 23:32:48 +00:00
David Blaikie	4fc4775561	Remove unused variable introduced by r163346. llvm-svn: 163359	2012-09-06 23:31:29 +00:00
Jack Carter	dc1e35d418	The Mips standalone assembler memory instruction support. This includes sb,sc,sh,sw,lb,lw,lbu,lh,lhu,ll,lw Test case included Contributer: Vladimir Medic llvm-svn: 163346	2012-09-06 20:00:02 +00:00
Manman Ren	c3366ccecb	Release build: guard dump functions with "ifndef NDEBUG" No functional change. llvm-svn: 163344	2012-09-06 19:55:56 +00:00
Manman Ren	742534c4dc	Release build: guard dump functions with "ifndef NDEBUG" No functional change. llvm-svn: 163339	2012-09-06 19:06:06 +00:00
Jakob Stoklund Olesen	866908c42c	Allow overlaps between virtreg and physreg live ranges. The RegisterCoalescer understands overlapping live ranges where one register is defined as a copy of the other. With this change, register allocators using LiveRegMatrix can do the same, at least for copies between physical and virtual registers. When a physreg is defined by a copy from a virtreg, allow those live ranges to overlap: %CL<def> = COPY %vreg11:sub_8bit; GR32_ABCD:%vreg11 %vreg13<def,tied1> = SAR32rCL %vreg13<tied0>, %CL<imp-use,kill> We can assign %vreg11 to %ECX, overlapping the live range of %CL. llvm-svn: 163336	2012-09-06 18:15:23 +00:00
Jakob Stoklund Olesen	bb4bdd8912	Handle overlapping regunit intervals in LiveIntervals::addKillFlags(). We will soon allow virtual register live ranges to overlap regunit live ranges when the physreg is defined as a copy of the virtreg: %EAX = COPY %vreg5 FOO %vreg5 BAR %EAX<kill> There is no real interference since %vreg5 and %EAX have the same value where they overlap. This patch prevents addKillFlags from adding virtreg kill flags to FOO where the assigned physreg is overlapping the virtual register live range. llvm-svn: 163335	2012-09-06 18:15:18 +00:00
Jakob Stoklund Olesen	4aed470376	Clear kill flags while computing live ranges. Kill flags are difficult to maintain, and liveness queries are better handled by live intervals. Kill flags are reinserted after register allocation by addKillFlags(). llvm-svn: 163334	2012-09-06 18:15:15 +00:00
Roman Divacky	4717a8d654	Dont cast away const needlessly. Found by gcc48 -Wcast-qual. llvm-svn: 163324	2012-09-06 15:42:13 +00:00
Tim Northover	00e071ad52	Diagnose invalid alignments on duplicating VLDn instructions. Patch by Chris Lidbury. llvm-svn: 163323	2012-09-06 15:27:12 +00:00
Tim Northover	fb3cdd83b0	Check for invalid alignment values when decoding VLDn/VSTn (single ln) instructions. Patch by Chris Lidbury. llvm-svn: 163321	2012-09-06 15:17:49 +00:00
Arnold Schwaighofer	8dc34cfb99	BasicAA: Recognize cyclic NoAlias phis Enhances basic alias analysis to recognize phis whose first incoming values are NoAlias and whose other incoming values are just the phi node itself through some amount of recursion. Example: With this change basicaa reports that ptr_phi and ptr_phi2 do not alias each other. bb: ptr = ptr2 + 1 loop: ptr_phi = phi [bb, ptr], [loop, ptr_plus_one] ptr2_phi = phi [bb, ptr2], [loop, ptr2_plus_one] ... ptr_plus_one = gep ptr_phi, 1 ptr2_plus_one = gep ptr2_phi, 1 This enables the elimination of one load in code like the following: extern int foo; int test_noalias(int ptr, int num, int coeff) { int ptr2 = ptr; int result = (ptr++) * (coeff--); while (num--) { ptr2++ = ptr; result += (coeff--) * (ptr++); } ptr = foo; return result; } Part 2/2 of fix for PR13564. llvm-svn: 163319	2012-09-06 14:41:53 +00:00
Tim Northover	262f6f564f	Use correct part of complex operand to encode VST1 alignment. Patch by Chris Lidbury. llvm-svn: 163318	2012-09-06 14:36:55 +00:00
Arnold Schwaighofer	76dca58c66	BasicAA: GEPs of NoAlias'ing base ptr with equivalent indices are NoAlias If we can show that the base pointers of two GEPs don't alias each other using precise analysis and the indices and base offset are equal then the two GEPs also don't alias each other. This is primarily needed for the follow up patch that analyses NoAlias'ing PHI nodes. Part 1/2 of fix for PR13564. llvm-svn: 163317	2012-09-06 14:31:51 +00:00
Nadav Rotem	9e3cc9f884	Disable stack coloring by default in order to resolve the i386 failures. llvm-svn: 163316	2012-09-06 14:27:06 +00:00
Elena Demikhovsky	42777877c2	AVX2 optimization. Added generation of VPSHUB instruction for <32 x i8> vector shuffle when possible. llvm-svn: 163312	2012-09-06 12:42:01 +00:00
Nadav Rotem	a8e15b0892	Fix a few old-GCC warnings. No functional change. llvm-svn: 163309	2012-09-06 11:13:55 +00:00
James Molloy	9d30dc2432	Fix self-host; ensure signedness is consistent. llvm-svn: 163306	2012-09-06 10:32:08 +00:00
Hans Wennborg	feb4d07d88	Fix switch_to_lookup_table.ll test from r163302. The lookup tables did not get built in a deterministic order. This makes them get built in the order that the corresponding phi nodes were found. llvm-svn: 163305	2012-09-06 10:10:35 +00:00
James Molloy	49bdbce8e1	Improve codegen for BUILD_VECTORs on ARM. If we have a BUILD_VECTOR that is mostly a constant splat, it is often better to splat that constant then insertelement the non-constant lanes instead of insertelementing every lane from an undef base. llvm-svn: 163304	2012-09-06 09:55:02 +00:00
Hans Wennborg	8a62fc5294	Build lookup tables for switches (PR884) This adds a transformation to SimplifyCFG that attemps to turn switch instructions into loads from lookup tables. It works on switches that are only used to initialize one or more phi nodes in a common successor basic block, for example: int f(int x) { switch (x) { case 0: return 5; case 1: return 4; case 2: return -2; case 5: return 7; case 6: return 9; default: return 42; } This speeds up the code by removing the hard-to-predict jump, and reduces code size by removing the code for the jump targets. llvm-svn: 163302	2012-09-06 09:43:28 +00:00
Nadav Rotem	7c277da364	Add a new optimization pass: Stack Coloring, that merges disjoint static allocations (allocas). Allocas are known to be disjoint if they are marked by disjoint lifetime markers (@llvm.lifetime.XXX intrinsics). llvm-svn: 163299	2012-09-06 09:17:37 +00:00
James Molloy	34e9931bec	Optimize codegen for VSETLNi{8,16,32} operating on Q registers. Degenerate to a VSETLN on D registers, instead of an (INSERT_SUBREG (VSETLN (EXTRACT_SUBREG ))) sequence to help the register coalescer. llvm-svn: 163298	2012-09-06 09:16:01 +00:00
Michael Liao	2d95a2b5c4	Remove duplicated helper function llvm-svn: 163295	2012-09-06 07:11:22 +00:00
Craig Topper	f3e4aa8cdd	Use iPTR instead of i32 for extract_subvector/insert_subvector index in lowering and patterns. This makes it consistent with the incoming DAG nodes from the DAG builder. llvm-svn: 163293	2012-09-06 06:09:01 +00:00
Craig Topper	daa5ed1e0a	Add patterns for converting stores of subvector_extracts of lower 128-bits of a 256-bit vector to VMOVAPSmr/VMOVUPSmr. llvm-svn: 163292	2012-09-06 05:15:01 +00:00
NAKAMURA Takumi	ffa157177e	Whitespace. llvm-svn: 163289	2012-09-06 03:02:56 +00:00
NAKAMURA Takumi	8a54d812c5	Unix/Signals.inc: Fix a typo. Thanks to Dani Berg! llvm-svn: 163288	2012-09-06 03:01:43 +00:00
Jack Carter	0744143370	There are some Mips instructions that are lowered by the assembler such as shifts greater than 32. In the case of direct object, the code gen needs to do this lowering since the assembler is not involved. With the advent of the llvm-mc assembler, it also needs to do the same lowering. This patch makes that specific lowering code accessible to both the direct object output and the assembler. This patch does not affect generated output. llvm-svn: 163287	2012-09-06 02:31:34 +00:00
Jim Grosbach	30c4282f88	Update function names to conform to guidelines. No functional change. llvm-svn: 163279	2012-09-06 00:59:08 +00:00
Jack Carter	71e6a7492e	Mips specific llvm assembler support for branch and jump instructions. Test case included. Contributer: Vladimir Medic llvm-svn: 163277	2012-09-06 00:43:26 +00:00
Jakob Stoklund Olesen	b7a50ecdb6	Remove predicated pseudo-instructions. These pseudos are no longer needed now that it is possible to represent predicated instructions in SSA form. llvm-svn: 163275	2012-09-05 23:58:04 +00:00
Jakob Stoklund Olesen	f831059f60	Use predication instead of pseudo-opcodes when folding into MOVCC. Now that it is possible to dynamically tie MachineInstr operands, predicated instructions are possible in SSA form: %vreg3<def> = SUBri %vreg1, -2147483647, pred:14, pred:%noreg, %opt:%noreg %vreg4<def,tied1> = MOVCCr %vreg3<tied0>, %vreg1, %pred:12, pred:%CPSR Becomes a predicated SUBri with a tied imp-use: SUBri %vreg1, -2147483647, pred:13, pred:%CPSR, opt:%noreg, %vreg1<imp-use,tied0> This means that any instruction that is safe to move can be folded into a MOVCC, and the *CC pseudo-instructions are no longer needed. The test case changes reflect that Thumb2SizeReduce recognizes the predicated instructions. It didn't understand the pseudos. llvm-svn: 163274	2012-09-05 23:58:02 +00:00
Chad Rosier	f24ae7b084	[ms-inline asm] Use the asm dialect from the MI to set the parser dialect. llvm-svn: 163273	2012-09-05 23:57:37 +00:00
Manman Ren	f3fedb6935	JumpThreading: when default destination is the destination of some cases in a switch, make sure we include the value for the cases when calculating edge value from switch to the default destination. rdar://12241132 llvm-svn: 163270	2012-09-05 23:45:58 +00:00
Jack Carter	b4dbc17acd	Mips specific llvm assembler support for ALU instructions. This includes register support. Test case included. Contributer: Vladimir Medic llvm-svn: 163268	2012-09-05 23:34:03 +00:00
Chad Rosier	e53314f7e3	Cleanup a few magic numbers. llvm-svn: 163263	2012-09-05 22:40:13 +00:00
Roman Divacky	ad06cee239	Stop casting away const qualifier needlessly. llvm-svn: 163258	2012-09-05 22:26:57 +00:00
Chad Rosier	cbd2a1983f	[ms-inline asm] We only need one bit to represent the AsmDialect in the MachineInstr. llvm-svn: 163257	2012-09-05 22:17:43 +00:00
Roman Divacky	9338344acb	Constify this properly. Found by gcc48 -Wcast-qual. llvm-svn: 163256	2012-09-05 22:15:49 +00:00
Roman Divacky	9f33d68668	Mark checkSignature const, and in turn stop casting away const from ArchiveMemberHeader. Found by gcc48 -Wcast-qual. llvm-svn: 163255	2012-09-05 22:09:23 +00:00
Roman Divacky	665260222f	Constify SDNodeIterator an stop its only non-const user being cast stripped of its constness. Found by gcc48 -Wcast-qual. llvm-svn: 163254	2012-09-05 22:03:34 +00:00
Roman Divacky	77198de4d6	Constify subtarget info properly so that we dont cast away the const in the SubtargetInfoKV tables. Found by gcc48 -Wcast-qual. llvm-svn: 163251	2012-09-05 21:43:57 +00:00
Roman Divacky	6792380e7b	Use const properly so that we dont remove const qualifier from region and MII by casting. Found with gcc48. llvm-svn: 163247	2012-09-05 21:17:34 +00:00
Chad Rosier	994f4040f5	[ms-inline asm] Propagate the asm dialect into the MachineInstr representation. llvm-svn: 163243	2012-09-05 21:00:58 +00:00
Hal Finkel	efe4a44106	Move the PPC TOC defs into the PPC64 InstrInfo file. Since TOC is just defined for PPC64, move its definition to PPC64 td file. Patch by Adhemerval Zanella. llvm-svn: 163234	2012-09-05 19:22:27 +00:00
Chad Rosier	d8c7610781	[ms-inline asm] Enumerate the InlineAsm dialects and rename the nsdialect to inteldialect. llvm-svn: 163231	2012-09-05 19:00:49 +00:00
Tim Northover	c8d867d42d	Strip old MachineInstrs after we know we can put them back. Previous patch accidentally decided it couldn't convert a VFP to a NEON instruction after it had already destroyed the old one. Not a good move. llvm-svn: 163230	2012-09-05 18:37:53 +00:00
Roman Divacky	09c8a3dde5	Remove unused typedefs gcc4.8 warns about. llvm-svn: 163225	2012-09-05 17:55:46 +00:00
Jim Grosbach	dc1123fcab	MCJIT: getPointerToFunction() references target address space. Make sure to return a pointer into the target memory, not the local memory. Often they are the same, but we can't assume that. llvm-svn: 163217	2012-09-05 16:50:40 +00:00
Benjamin Kramer	6c2649ca4e	Switch BasicAliasAnalysis' cache to SmallDenseMap. It relies on clear() being fast and the cache rarely has more than 1 or 2 elements, so give it an inline capacity and always shrink it back down in case it grows. DenseMap will grow to 64 buckets which makes clear() a lot slower. llvm-svn: 163215	2012-09-05 16:49:37 +00:00
Pranav Bhandarkar	823f9ebaa3	LLVM Bug Fix 13709: Remove needless lsr(Rp, #32 ) instruction access the subreg_hireg of register pair Rp. * lib/Target/Hexagon/HexagonPeephole.cpp(PeepholeDoubleRegsMap): New DenseMap similar to PeepholeMap that additionally records subreg info too. (runOnMachineFunction): Record information in PeepholeDoubleRegsMap and copy propagate the high sub-reg of Rp0 in Rp1 = lsr(Rp0, #32) to the instruction Rx = COPY Rp1:logreg_subreg. * test/CodeGen/Hexagon/remove_lsr.ll: New test. llvm-svn: 163214	2012-09-05 16:01:40 +00:00
Kostya Serebryany	5f5973df08	[asan] fix lint llvm-svn: 163205	2012-09-05 09:00:18 +00:00
Silviu Baranga	3f40d87207	Fixed the DAG combiner to better handle the folding of AND nodes for vector types. The previous code was making the assumption that the length of the bitmask returned by isConstantSplat was equal to the size of the vector type. Now we first make sure that the splat value has at least the length of the vector lane type, then we only use as many fields as we have available in the splat value. llvm-svn: 163203	2012-09-05 08:57:21 +00:00
Kostya Serebryany	2fa38f8ce0	[asan] extend the blacklist functionality to handle global-init. Patch by Reid Watson llvm-svn: 163199	2012-09-05 07:29:56 +00:00
Craig Topper	81f06df699	Remove some of the patterns added in r163196. Increasing the complexity on insert_subvector into undef accomplishes the same thing. llvm-svn: 163198	2012-09-05 07:26:35 +00:00
Craig Topper	f7c87d6eea	Add patterns for integer forms of VINSERTF128/VINSERTI128 folded with loads. Also add patterns to turn subvector inserts with loads to index 0 of an undef into VMOVAPS. llvm-svn: 163196	2012-09-05 06:58:39 +00:00
Chad Rosier	5895edaf66	Add a FIXME that assumes we maintain backward compatibility until the next major release. llvm-svn: 163195	2012-09-05 06:28:52 +00:00
Logan Chien	1b170de77a	Reorder the comments of EmitExceptionTable. llvm-svn: 163194	2012-09-05 06:28:26 +00:00
Logan Chien	eeaaf65cb6	Fix UseInitArray option for MIPS target. llvm-svn: 163193	2012-09-05 06:17:17 +00:00
Craig Topper	2db2353b21	Convert vextracti128/vextractf128 intrinsics to extract_subvector at DAG build time. Similar was previously done for vinserti128/vinsertf128. Add patterns for folding these extract_subvectors with stores. llvm-svn: 163192	2012-09-05 05:48:09 +00:00
Richard Smith	398bd48172	Remove redundant semicolons to fix -pedantic-errors build. llvm-svn: 163190	2012-09-05 01:41:37 +00:00
Chad Rosier	a05ea0f3e3	Fix function name per coding standard. llvm-svn: 163187	2012-09-05 01:15:43 +00:00
Chad Rosier	18fcdcfb9e	[ms-inline asm] Add support for the nsdialect keyword in the Bitcode Reader/Writer. llvm-svn: 163185	2012-09-05 00:56:20 +00:00
Chad Rosier	9772d82df4	[ms-inline asm] Add the nsdialect keyword to the lexer. llvm-svn: 163184	2012-09-05 00:51:02 +00:00
Chad Rosier	f42fad6251	[ms-inline asm] Emit the (new) inline asm Non-Standard Dialect attribute. llvm-svn: 163181	2012-09-05 00:08:17 +00:00
Dan Gohman	df476e5e93	Make provenance checking conservative in cases when pointers-to-strong-pointers may be in play. These can lead to retains and releases happening in unstructured ways, foiling the optimizer. This fixes rdar://12150909. llvm-svn: 163180	2012-09-04 23:16:20 +00:00
Jakub Staszak	e535c1a12e	BypassSlowDivision: Assign to reference, don't copy the object. llvm-svn: 163179	2012-09-04 23:11:11 +00:00
Jakob Stoklund Olesen	ade363e86c	Search the whole instruction for tied operands. Implicit uses can be dynamically tied to defs. This will soon be used for predicated instructions on ARM. llvm-svn: 163177	2012-09-04 22:59:30 +00:00
Chad Rosier	8b3014ea04	[ms-inline asm] Add the inline assembly dialect, AsmDialect, to the InlineAsm class. llvm-svn: 163175	2012-09-04 22:46:24 +00:00
Chad Rosier	38d24e6751	[ms-inline asm] Remove the Inline Asm Non-Standard Dialect attribute. This implementation does not co-exist well with how the sideeffect and alignstack attributes are handled. The reverts r161641. llvm-svn: 163174	2012-09-04 22:29:45 +00:00
Jakub Staszak	85a7787588	Fix my previous patch (r163164). It does now what it is supposed to do: Doesn't set MadeChange to TRUE if BypassSlowDivision doesn't change anything. llvm-svn: 163165	2012-09-04 21:16:59 +00:00
Jakub Staszak	46beca6364	Return false if BypassSlowDivision doesn't change anything. Also a few minor changes: - use pre-inc instead of post-inc - use isa instead of dyn_cast - 80 col - trailing spaces llvm-svn: 163164	2012-09-04 20:48:24 +00:00
Jakob Stoklund Olesen	d92e2bc2e9	Typo. llvm-svn: 163154	2012-09-04 18:44:43 +00:00
Jakob Stoklund Olesen	9fceda741d	Actually use the MachineOperand field for isRegTiedToDefOperand(). The MachineOperand::TiedTo field was maintained, but not used. This patch enables it in isRegTiedToDefOperand() and isRegTiedToUseOperand() which are the actual functions use by the register allocator. llvm-svn: 163153	2012-09-04 18:43:25 +00:00
Jakob Stoklund Olesen	c7579cdded	Move tie checks into MachineVerifier::visitMachineOperand. llvm-svn: 163152	2012-09-04 18:38:28 +00:00
Jakob Stoklund Olesen	0a09da83b6	Allow tied uses and defs in different orders. After much agonizing, use a full 4 bits of precious MachineOperand space to encode this. This uses existing padding, and doesn't grow MachineOperand beyond its current 32 bytes. This allows tied defs among the first 15 operands on a normal instruction, just like the current MCInstrDesc constraint encoding. Inline assembly needs to be able to tie more than the first 15 operands, and gets special treatment. Tied uses can appear beyond 15 operands, as long as they are tied to a def that's in range. llvm-svn: 163151	2012-09-04 18:36:28 +00:00
Preston Gurd	cdf540d5d6	Generic Bypass Slow Div - CodeGenPrepare pass for identifying div/rem ops - Backend specifies the type mapping using addBypassSlowDivType - Enabled only for Intel Atom with O2 32-bit -> 8-bit - Replace IDIV with instructions which test its value and use DIVB if the value is positive and less than 256. - In the case when the quotient and remainder of a divide are used a DIV and a REM instruction will be present in the IR. In the non-Atom case they are both lowered to IDIVs and CSE removes the redundant IDIV instruction, using the quotient and remainder from the first IDIV. However, due to this optimization CSE is not able to eliminate redundant IDIV instructions because they are located in different basic blocks. This is overcome by calculating both the quotient (DIV) and remainder (REM) in each basic block that is inserted by the optimization and reusing the result values when a subsequent DIV or REM instruction uses the same operands. - Test cases check for the presents of the optimization when calculating either the quotient, remainder, or both. Patch by Tyler Nowicki! llvm-svn: 163150	2012-09-04 18:22:17 +00:00
Sergei Larin	4d8986af12	Porting Hexagon MI Scheduler to the new API. Change current Hexagon MI scheduler to use new converging scheduler. Integrates DFA resource model into it. llvm-svn: 163137	2012-09-04 14:49:56 +00:00
Arnold Schwaighofer	f00fb1c581	Patch to implement UMLAL/SMLAL instructions for the ARM architecture This patch corrects the definition of umlal/smlal instructions and adds support for matching them to the ARM dag combiner. Bug 12213 Patch by Yin Ma! llvm-svn: 163136	2012-09-04 14:37:49 +00:00
Elena Demikhovsky	cbe99bbb36	This patch optimizes shuffle instruction - generates 2 instructions instead of 4. Since this specific shuffle is widely used in many workloads we have ~10% performance on them. shufflevector <8 x float> %A, <8 x float> %B, <8 x i32> <i32 0, i32 8, i32 2, i32 10, i32 4, i32 12, i32 6, i32 14> vmovaps (%rdx), %ymm0 vshufps $8, %ymm0, %ymm0, %ymm0 vmovaps (%rcx), %ymm1 vshufps $8, %ymm0, %ymm1, %ymm1 vunpcklps %ymm0, %ymm1, %ymm0 vmovaps (%rcx), %ymm0 vmovsldup (%rdx), %ymm1 vblendps $85, %ymm0, %ymm1, %ymm0 llvm-svn: 163134	2012-09-04 12:49:02 +00:00
Nadav Rotem	03dcd85b56	LICM may hoist an instruction with undefined behavior above a trap. Scan the body of the loop and find instructions that may trap. Use this information when deciding if it is safe to hoist or sink instructions. Notice that we can optimize the search of instructions that may throw in the case of nested loops. rdar://11518836 llvm-svn: 163132	2012-09-04 10:25:04 +00:00
Evgeniy Stepanov	c439a42649	Fix Android build of gtest and lib/Support. llvm-svn: 163131	2012-09-04 09:14:45 +00:00
Alexey Samsonov	c942e6b781	Add support for fetching inlining context (stack of source code locations) by instruction address from DWARF. Add --inlining flag to llvm-dwarfdump to demonstrate and test this functionality, so that "llvm-dwarfdump --inlining --address=0x..." now works much like "addr2line -i 0x...", provided that the binary has debug info (Clang's -gline-tables-only is enough). llvm-svn: 163128	2012-09-04 08:12:33 +00:00
Bob Wilson	01cfbfe9d0	Be conservative about allocations that may alias the accessed pointer. If an allocation has a must-alias relation to the access pointer, we treat it as a Def. Otherwise, without this check, the code here was just skipping over the allocation call and ignoring it. I noticed this by inspection and don't have a specific testcase that it breaks, but it seems like we need to treat a may-alias allocation as a Clobber. llvm-svn: 163127	2012-09-04 03:30:13 +00:00
Chad Rosier	9e2aff8b6d	[ms-inline asm] Asm operands can map to one or more MCOperands. Therefore, add the NumMCOperands argument to the GetMCInstOperandNum() function that is set to the number of MCOperands this asm operand mapped to. llvm-svn: 163124	2012-09-03 20:31:23 +00:00
Chad Rosier	0f13c3ccf8	[ms-inline asm] Add a comment. llvm-svn: 163123	2012-09-03 19:04:35 +00:00
Chad Rosier	391d299737	[ms-inline asm] Add an interface to the GetMCInstOperandNum() function in the MCTargetAsmParser class. llvm-svn: 163122	2012-09-03 18:47:45 +00:00
Roman Divacky	2be394bdcd	Remove always true checks. Noticed by Adhemerval Zanella. llvm-svn: 163117	2012-09-03 16:55:42 +00:00
Chad Rosier	c942943103	Add braces to the case statement. llvm-svn: 163116	2012-09-03 16:21:15 +00:00
Benjamin Kramer	8d9890ab69	IRBuilderify the SjlLjEHPrepare pass. No functionality change. llvm-svn: 163115	2012-09-03 12:27:43 +00:00
Lang Hames	90152701eb	When updating live range endpoints, make sure to preserve the early clobber bit. Fixs PR13719. llvm-svn: 163107	2012-09-03 06:31:45 +00:00
Bob Wilson	dcc54decd5	Fix more fallout from r158919, similar to PR13547. This code used to only handle malloc-like calls, which do not read memory. r158919 changed it to check isNoAliasFn(), which includes strdup-like and realloc-like calls, but it was not checking for dependencies on the memory read by those calls. llvm-svn: 163106	2012-09-03 05:15:15 +00:00
Chad Rosier	a353dba17d	Removed unused argument. llvm-svn: 163104	2012-09-03 03:16:09 +00:00
Chris Lattner	ba3ba8fa1f	some peepholes that should match horizontal add/sub operations. llvm-svn: 163103	2012-09-03 02:58:21 +00:00
Chad Rosier	e38bb6a34e	[ms-inline asm] Expose the Kind and Opcode variables from the MatchInstructionImpl() function. These values are used by the ConvertToMCInst() function to index into the ConversionTable. The values are also needed to call the GetMCInstOperandNum() function. llvm-svn: 163101	2012-09-03 02:06:46 +00:00
Chad Rosier	10f14bcbda	Move ErrorLoc decl into the scope where it's actually used. llvm-svn: 163100	2012-09-03 01:55:11 +00:00
Nadav Rotem	10f6b8802b	Fix a typo. llvm-svn: 163094	2012-09-02 12:21:50 +00:00
Nadav Rotem	9d83202620	Not all targets have efficient ISel code generation for select instructions. For example, the ARM target does not have efficient ISel handling for vector selects with scalar conditions. This patch adds a TLI hook which allows the different targets to report which selects are supported well and which selects should be converted to CF duting codegen prepare. llvm-svn: 163093	2012-09-02 12:10:19 +00:00
Benjamin Kramer	599a4bb6ea	LoopRotation: Make the brute force DomTree update more brute force. We update until we hit a fixpoint. This is probably slow but also slightly simplifies the code. It should also fix the occasional invalid domtrees observed when building with expensive checking. I couldn't find a case where this had a measurable slowdown, but if someone finds a pathological case where it does we may have to find a cleverer way of updating dominators here. Thanks to Duncan for the test case. llvm-svn: 163091	2012-09-02 11:57:22 +00:00
Logan Chien	9ab55b8d59	Rename ANDROIDEABI to Android. Most of the code guarded with ANDROIDEABI are not ARM-specific, and having no relation with arm-eabi. Thus, it will be more natural to call this environment "Android" instead of "ANDROIDEABI". Note: We are not using ANDROID because several projects are using "-DANDROID" as the conditional compilation flag. llvm-svn: 163087	2012-09-02 09:29:46 +00:00
Nadav Rotem	500d691d4a	Generate better select code by allowing the target to use scalar select, and not sign-extend. llvm-svn: 163086	2012-09-02 08:20:07 +00:00
Pete Cooper	2455e9c4a5	Only legalise a VSELECT in to bitwise operations if the vector mask bool is zeros or all ones. A vector bool with just ones isn't suitable for masking with. No test case unfortunately as i couldn't find a target which fit all the conditions needed to hit this code. llvm-svn: 163075	2012-09-01 22:27:48 +00:00
Tim Northover	726d32cdfa	Limit domain conversion to cases where it won't break dep chains. NEON domain conversion was too heavy-handed with its widened registers, which could have stripped existing instructions of their dependency, leaving them vulnerable to scheduling errors. llvm-svn: 163070	2012-09-01 18:07:29 +00:00
Pete Cooper	2117ac40c9	Revert "Take account of boolean vector contents when promoting a build vector from i1 to some other type. rdar://problem/12210060" This reverts commit 5dd9e214fb92847e947f9edab170f9b4e52b908f. Thanks to Duncan for explaining how this should have been done. Conflicts: test/CodeGen/X86/vec_select.ll llvm-svn: 163064	2012-09-01 17:37:55 +00:00
Logan Chien	cea0354c1b	Fix Thumb2 fixup kind in the integrated-as. llvm-svn: 163063	2012-09-01 15:06:36 +00:00
Logan Chien	64f361e0e1	Fix typo. llvm-svn: 163059	2012-09-01 12:11:41 +00:00
Benjamin Kramer	3be6a480a4	LoopRotation: Check some invariants of the dominator updating code. llvm-svn: 163058	2012-09-01 12:04:51 +00:00
Craig Topper	d6cc4062be	Typos llvm-svn: 163053	2012-09-01 06:33:50 +00:00
Owen Anderson	90e0eaffa8	Teach DAG combine a number of tricks to simplify FMA expressions in fast-math mode. llvm-svn: 163051	2012-09-01 06:04:27 +00:00
Michael Liao	ec385012ae	Fix typo llvm-svn: 163049	2012-09-01 04:09:16 +00:00
Manman Ren	26c5d0f607	SelectionDAG: when constructing VZEXT_LOAD from other loads, make sure its output chain is correctly setup. As an example, if the original load must happen before later stores, we need to make sure the constructed VZEXT_LOAD is constrained to be before the stores. rdar://11457792 llvm-svn: 163036	2012-08-31 23:16:57 +00:00
Craig Topper	908e685102	Mark FMA4 instructions as commutable and add them to the folding tables. llvm-svn: 163035	2012-08-31 23:10:34 +00:00
Chad Rosier	451ef13cde	Remove an unused argument. The MCInst opcode is set in the ConvertToMCInst() function nowadays. llvm-svn: 163030	2012-08-31 22:12:31 +00:00
Craig Topper	7573c8f081	Add selection of RegOp2MemOpTable3 to canFoldMemoryOperand llvm-svn: 163029	2012-08-31 22:12:16 +00:00
Jakob Stoklund Olesen	5c8eda0ebc	Add MachineInstr::tieOperands, remove setIsTied(). Manage tied operands entirely internally to MachineInstr. This makes it possible to change the representation of tied operands, as I will do shortly. The constraint that tied uses and defs must be in the same order was too restrictive. llvm-svn: 163021	2012-08-31 20:50:53 +00:00
Michael Liao	3224543bf9	Fix PR12359 - In addition to undefined, if V2 is zero vector, skip 2nd PSHUFB and POR as well as PSHUFB will zero elements with negative indices. Patch by Sriram Murali <sriram.murali@intel.com> llvm-svn: 163018	2012-08-31 20:12:31 +00:00
Jack Carter	b3f3b17e16	The instruction DINS may be transformed into DINSU or DEXTM depending on the size of the extraction and its position in the 64 bit word. This patch allows support of the dext transformations with mips64 direct object output. 0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32 DINS The field is entirely contained in the right-most word of the doubleword 32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64 DINSM The field straddles the words of the doubleword 32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32 DINSU The field is entirely contained in the left-most word of the doubleword llvm-svn: 163010	2012-08-31 18:06:48 +00:00
Bill Wendling	6bbe48967a	Move the GCOVFormat enums into their own namespace per the LLVM coding standard. llvm-svn: 163008	2012-08-31 17:31:28 +00:00
Chad Rosier	9d1fc3672b	Add a comment to explain what's really going on. llvm-svn: 163005	2012-08-31 17:24:10 +00:00
Chad Rosier	a8f3c4fe35	The ConvertToMCInst() function can't fail, so remove the now dead Match_ConversionFail enum. llvm-svn: 163002	2012-08-31 16:41:07 +00:00
Craig Topper	c0387f6b23	Mark FMA3 instructions as commutable so that the operands to the multiply part can be commuted. llvm-svn: 163001	2012-08-31 16:31:13 +00:00
Craig Topper	a8227cb76a	Use CloneMachineInstr to make a new MI in commuteInstruction to make the code tolerant of instructions with more than two input operands. llvm-svn: 163000	2012-08-31 16:30:05 +00:00
Craig Topper	c30fdbc46c	Add support for converting llvm.fma to fma4 instructions. llvm-svn: 162999	2012-08-31 15:40:30 +00:00
Jakob Stoklund Olesen	96f87069c4	Don't enforce ordered inline asm operands. I was too optimistic, inline asm can have tied operands that don't follow the def order. Fixes PR13742. llvm-svn: 162998	2012-08-31 15:34:59 +00:00
Benjamin Kramer	e7e5235726	Clean up ProfileDataLoader a bit. - Overloading operator<< for raw_ostream and pointers is dangerous, it alters the behavior of code that includes the header. - Remove unused ID. - Use LLVM's byte swapping helpers instead of a hand-coded. - Make ReadProfilingData work directly on a pointer. No functionality change. llvm-svn: 162992	2012-08-31 12:43:07 +00:00
Bill Wendling	5aed004cf1	Cleanups due to feedback. No functionality change. Patch by Alistair. llvm-svn: 162979	2012-08-31 05:18:31 +00:00
Michael Liao	969f3913dd	Clean up AddedComplexity further after adding UseSSEx llvm-svn: 162973	2012-08-31 03:01:35 +00:00
Jakob Stoklund Olesen	d3bda3c5b9	Fix a couple of typos in EmitAtomic. Thumb2 instructions are mostly constrained to rGPR, not tGPR which is for Thumb1. rdar://problem/12203728 llvm-svn: 162968	2012-08-31 02:08:34 +00:00
Jim Grosbach	e423e865fe	X86: Fix encoding of 'movd %xmm0, %rax' The assembly string for the VMOVPQIto64rr instruction incorrectly lacked the 'v' prefix, resulting in mis-assembly of the vanilla movd instruction. llvm-svn: 162963	2012-08-31 00:30:30 +00:00
Chad Rosier	98cfa1044f	With the fix in r162954/162955 every cvt function returns true. Thus, have the ConvertToMCInst() return void, rather then a bool. Update all the cvt functions as well. llvm-svn: 162961	2012-08-31 00:03:31 +00:00
Pete Cooper	e969340fea	Take account of boolean vector contents when promoting a build vector from i1 to some other type. rdar://problem/12210060 llvm-svn: 162960	2012-08-30 23:58:52 +00:00
Owen Anderson	cc61f87cf7	Teach the DAG combiner to turn chains of FADDs (x+x+x+x+...) into FMULs by constants. This is only enabled in unsafe FP math mode, since it does not preserve rounding effects for all such constants. llvm-svn: 162956	2012-08-30 23:35:16 +00:00
Chad Rosier	db482ef7a7	Fix for r162954. Return the Error. llvm-svn: 162955	2012-08-30 23:22:05 +00:00
Chad Rosier	8513ffbb83	Move a check to the validateInstruction() function where it more properly belongs. llvm-svn: 162954	2012-08-30 23:20:38 +00:00
Chad Rosier	5eec49fe09	Typo. llvm-svn: 162952	2012-08-30 23:00:00 +00:00
Nadav Rotem	ea973bda26	Currently targets that do not support selects with scalar conditions and vector operands - scalarize the code. ARM is such a target because it does not support CMOV of vectors. To implement this efficientlyi, we broadcast the condition bit and use a sequence of NAND-OR to select between the two operands. This is the same sequence we use for targets that don't have vector BLENDs (like SSE2). rdar://12201387 llvm-svn: 162926	2012-08-30 19:17:29 +00:00
Michael Liao	bbd10792c2	Introduce 'UseSSEx' to force SSE legacy encoding - Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is enabled. As the penalty of inter-mixing SSE and AVX instructions, we need prevent SSE legacy insn from being generated except explicitly specified through some intrinsics. For patterns supported by both SSE and AVX, so far, we force AVX insn will be tried first relying on AddedComplexity or position in td file. It's error-prone and introduces bugs accidentally. 'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited by AVX, we need this predicate to force VEX encoding or SSE legacy encoding only. For insns not inherited by AVX, we still use the previous predicates, i.e. 'HasSSEx'. So far, these insns fall into the following categories: * SSE insns with MMX operands * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH, CRC, and etc.) * SSE4A insns. * MMX insns. * x87 insns added by SSE. 2 test cases are modified: - test/CodeGen/X86/fast-isel-x86-64.ll AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be selected by fast-isel due to complicated pattern and fast-isel fallback to materialize it from constant pool. - test/CodeGen/X86/widen_load-1.ll AVX code generation is different from SSE one after fixing SSE/AVX inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of 'vmovaps'. llvm-svn: 162919	2012-08-30 16:54:46 +00:00
NAKAMURA Takumi	fa81438042	Apply "/Og-" also to MSC15(aka VS9) on VMCore/Function.cpp. llvm-svn: 162917	2012-08-30 16:22:26 +00:00

1 2 3 4 5 ...

56180 Commits