llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	599a4bb6ea	LoopRotation: Make the brute force DomTree update more brute force. We update until we hit a fixpoint. This is probably slow but also slightly simplifies the code. It should also fix the occasional invalid domtrees observed when building with expensive checking. I couldn't find a case where this had a measurable slowdown, but if someone finds a pathological case where it does we may have to find a cleverer way of updating dominators here. Thanks to Duncan for the test case. llvm-svn: 163091	2012-09-02 11:57:22 +00:00
Logan Chien	9ab55b8d59	Rename ANDROIDEABI to Android. Most of the code guarded with ANDROIDEABI are not ARM-specific, and having no relation with arm-eabi. Thus, it will be more natural to call this environment "Android" instead of "ANDROIDEABI". Note: We are not using ANDROID because several projects are using "-DANDROID" as the conditional compilation flag. llvm-svn: 163087	2012-09-02 09:29:46 +00:00
Nadav Rotem	500d691d4a	Generate better select code by allowing the target to use scalar select, and not sign-extend. llvm-svn: 163086	2012-09-02 08:20:07 +00:00
Pete Cooper	2455e9c4a5	Only legalise a VSELECT in to bitwise operations if the vector mask bool is zeros or all ones. A vector bool with just ones isn't suitable for masking with. No test case unfortunately as i couldn't find a target which fit all the conditions needed to hit this code. llvm-svn: 163075	2012-09-01 22:27:48 +00:00
Tim Northover	726d32cdfa	Limit domain conversion to cases where it won't break dep chains. NEON domain conversion was too heavy-handed with its widened registers, which could have stripped existing instructions of their dependency, leaving them vulnerable to scheduling errors. llvm-svn: 163070	2012-09-01 18:07:29 +00:00
Pete Cooper	2117ac40c9	Revert "Take account of boolean vector contents when promoting a build vector from i1 to some other type. rdar://problem/12210060" This reverts commit 5dd9e214fb92847e947f9edab170f9b4e52b908f. Thanks to Duncan for explaining how this should have been done. Conflicts: test/CodeGen/X86/vec_select.ll llvm-svn: 163064	2012-09-01 17:37:55 +00:00
Logan Chien	cea0354c1b	Fix Thumb2 fixup kind in the integrated-as. llvm-svn: 163063	2012-09-01 15:06:36 +00:00
Logan Chien	64f361e0e1	Fix typo. llvm-svn: 163059	2012-09-01 12:11:41 +00:00
Benjamin Kramer	3be6a480a4	LoopRotation: Check some invariants of the dominator updating code. llvm-svn: 163058	2012-09-01 12:04:51 +00:00
Craig Topper	d6cc4062be	Typos llvm-svn: 163053	2012-09-01 06:33:50 +00:00
Owen Anderson	90e0eaffa8	Teach DAG combine a number of tricks to simplify FMA expressions in fast-math mode. llvm-svn: 163051	2012-09-01 06:04:27 +00:00
Michael Liao	ec385012ae	Fix typo llvm-svn: 163049	2012-09-01 04:09:16 +00:00
Manman Ren	26c5d0f607	SelectionDAG: when constructing VZEXT_LOAD from other loads, make sure its output chain is correctly setup. As an example, if the original load must happen before later stores, we need to make sure the constructed VZEXT_LOAD is constrained to be before the stores. rdar://11457792 llvm-svn: 163036	2012-08-31 23:16:57 +00:00
Craig Topper	908e685102	Mark FMA4 instructions as commutable and add them to the folding tables. llvm-svn: 163035	2012-08-31 23:10:34 +00:00
Chad Rosier	451ef13cde	Remove an unused argument. The MCInst opcode is set in the ConvertToMCInst() function nowadays. llvm-svn: 163030	2012-08-31 22:12:31 +00:00
Craig Topper	7573c8f081	Add selection of RegOp2MemOpTable3 to canFoldMemoryOperand llvm-svn: 163029	2012-08-31 22:12:16 +00:00
Jakob Stoklund Olesen	5c8eda0ebc	Add MachineInstr::tieOperands, remove setIsTied(). Manage tied operands entirely internally to MachineInstr. This makes it possible to change the representation of tied operands, as I will do shortly. The constraint that tied uses and defs must be in the same order was too restrictive. llvm-svn: 163021	2012-08-31 20:50:53 +00:00
Michael Liao	3224543bf9	Fix PR12359 - In addition to undefined, if V2 is zero vector, skip 2nd PSHUFB and POR as well as PSHUFB will zero elements with negative indices. Patch by Sriram Murali <sriram.murali@intel.com> llvm-svn: 163018	2012-08-31 20:12:31 +00:00
Jack Carter	b3f3b17e16	The instruction DINS may be transformed into DINSU or DEXTM depending on the size of the extraction and its position in the 64 bit word. This patch allows support of the dext transformations with mips64 direct object output. 0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32 DINS The field is entirely contained in the right-most word of the doubleword 32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64 DINSM The field straddles the words of the doubleword 32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32 DINSU The field is entirely contained in the left-most word of the doubleword llvm-svn: 163010	2012-08-31 18:06:48 +00:00
Bill Wendling	6bbe48967a	Move the GCOVFormat enums into their own namespace per the LLVM coding standard. llvm-svn: 163008	2012-08-31 17:31:28 +00:00
Chad Rosier	9d1fc3672b	Add a comment to explain what's really going on. llvm-svn: 163005	2012-08-31 17:24:10 +00:00
Chad Rosier	a8f3c4fe35	The ConvertToMCInst() function can't fail, so remove the now dead Match_ConversionFail enum. llvm-svn: 163002	2012-08-31 16:41:07 +00:00
Craig Topper	c0387f6b23	Mark FMA3 instructions as commutable so that the operands to the multiply part can be commuted. llvm-svn: 163001	2012-08-31 16:31:13 +00:00
Craig Topper	a8227cb76a	Use CloneMachineInstr to make a new MI in commuteInstruction to make the code tolerant of instructions with more than two input operands. llvm-svn: 163000	2012-08-31 16:30:05 +00:00
Craig Topper	c30fdbc46c	Add support for converting llvm.fma to fma4 instructions. llvm-svn: 162999	2012-08-31 15:40:30 +00:00
Jakob Stoklund Olesen	96f87069c4	Don't enforce ordered inline asm operands. I was too optimistic, inline asm can have tied operands that don't follow the def order. Fixes PR13742. llvm-svn: 162998	2012-08-31 15:34:59 +00:00
Benjamin Kramer	e7e5235726	Clean up ProfileDataLoader a bit. - Overloading operator<< for raw_ostream and pointers is dangerous, it alters the behavior of code that includes the header. - Remove unused ID. - Use LLVM's byte swapping helpers instead of a hand-coded. - Make ReadProfilingData work directly on a pointer. No functionality change. llvm-svn: 162992	2012-08-31 12:43:07 +00:00
Bill Wendling	5aed004cf1	Cleanups due to feedback. No functionality change. Patch by Alistair. llvm-svn: 162979	2012-08-31 05:18:31 +00:00
Michael Liao	969f3913dd	Clean up AddedComplexity further after adding UseSSEx llvm-svn: 162973	2012-08-31 03:01:35 +00:00
Jakob Stoklund Olesen	d3bda3c5b9	Fix a couple of typos in EmitAtomic. Thumb2 instructions are mostly constrained to rGPR, not tGPR which is for Thumb1. rdar://problem/12203728 llvm-svn: 162968	2012-08-31 02:08:34 +00:00
Jim Grosbach	e423e865fe	X86: Fix encoding of 'movd %xmm0, %rax' The assembly string for the VMOVPQIto64rr instruction incorrectly lacked the 'v' prefix, resulting in mis-assembly of the vanilla movd instruction. llvm-svn: 162963	2012-08-31 00:30:30 +00:00
Chad Rosier	98cfa1044f	With the fix in r162954/162955 every cvt function returns true. Thus, have the ConvertToMCInst() return void, rather then a bool. Update all the cvt functions as well. llvm-svn: 162961	2012-08-31 00:03:31 +00:00
Pete Cooper	e969340fea	Take account of boolean vector contents when promoting a build vector from i1 to some other type. rdar://problem/12210060 llvm-svn: 162960	2012-08-30 23:58:52 +00:00
Owen Anderson	cc61f87cf7	Teach the DAG combiner to turn chains of FADDs (x+x+x+x+...) into FMULs by constants. This is only enabled in unsafe FP math mode, since it does not preserve rounding effects for all such constants. llvm-svn: 162956	2012-08-30 23:35:16 +00:00
Chad Rosier	db482ef7a7	Fix for r162954. Return the Error. llvm-svn: 162955	2012-08-30 23:22:05 +00:00
Chad Rosier	8513ffbb83	Move a check to the validateInstruction() function where it more properly belongs. llvm-svn: 162954	2012-08-30 23:20:38 +00:00
Chad Rosier	5eec49fe09	Typo. llvm-svn: 162952	2012-08-30 23:00:00 +00:00
Nadav Rotem	ea973bda26	Currently targets that do not support selects with scalar conditions and vector operands - scalarize the code. ARM is such a target because it does not support CMOV of vectors. To implement this efficientlyi, we broadcast the condition bit and use a sequence of NAND-OR to select between the two operands. This is the same sequence we use for targets that don't have vector BLENDs (like SSE2). rdar://12201387 llvm-svn: 162926	2012-08-30 19:17:29 +00:00
Michael Liao	bbd10792c2	Introduce 'UseSSEx' to force SSE legacy encoding - Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is enabled. As the penalty of inter-mixing SSE and AVX instructions, we need prevent SSE legacy insn from being generated except explicitly specified through some intrinsics. For patterns supported by both SSE and AVX, so far, we force AVX insn will be tried first relying on AddedComplexity or position in td file. It's error-prone and introduces bugs accidentally. 'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited by AVX, we need this predicate to force VEX encoding or SSE legacy encoding only. For insns not inherited by AVX, we still use the previous predicates, i.e. 'HasSSEx'. So far, these insns fall into the following categories: * SSE insns with MMX operands * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH, CRC, and etc.) * SSE4A insns. * MMX insns. * x87 insns added by SSE. 2 test cases are modified: - test/CodeGen/X86/fast-isel-x86-64.ll AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be selected by fast-isel due to complicated pattern and fast-isel fallback to materialize it from constant pool. - test/CodeGen/X86/widen_load-1.ll AVX code generation is different from SSE one after fixing SSE/AVX inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of 'vmovaps'. llvm-svn: 162919	2012-08-30 16:54:46 +00:00
NAKAMURA Takumi	fa81438042	Apply "/Og-" also to MSC15(aka VS9) on VMCore/Function.cpp. llvm-svn: 162917	2012-08-30 16:22:26 +00:00
NAKAMURA Takumi	ac49029fd9	PPCISelLowering.cpp: Fix r162725. [Tobias von Koch] What's happening here is that the CR6SET/CR6UNSET is breaking the chain of register copies glued to the function call (BL_SVR4 node). The scheduler then moves other instructions in between those and the function call, which isn't good! Right. That's the case where there is no chain of register copies before the call, so InFlag == 0... Attached is a new revision of the patch which should fix this for good. llvm-svn: 162916	2012-08-30 15:52:29 +00:00
NAKAMURA Takumi	8ad54e04d2	PPCISelLowering.cpp: Whitespace. llvm-svn: 162915	2012-08-30 15:52:23 +00:00
Michael Ilseman	30c3e14e8e	test llvm-svn: 162914	2012-08-30 15:45:16 +00:00
Benjamin Kramer	afdfdb5cff	LoopRotate: Also rotate loops with multiple exits. The old PHI updating code in loop-rotate was replaced with SSAUpdater a while ago, it has no problems with comples PHIs. What had to be fixed is detecting whether a loop was already rotated and updating dominators when multiple exits were present. This change increases overall code size a bit, mostly due to additional loop unrolling opportunities. Passes test-suite and selfhost with -verify-dom-info. Fixes PR7447. Thanks to Andy for the input on the domtree updating code. llvm-svn: 162912	2012-08-30 15:39:42 +00:00
Benjamin Kramer	d4a64716ab	InstCombine: Fix comment to reflect the code. llvm-svn: 162911	2012-08-30 15:07:40 +00:00
Jakob Stoklund Olesen	0eecbbeb5b	Don't use MCInstrDesc flags for implicit operands. When a MachineInstr is constructed, its implicit operands are added first, then the explicit operands are inserted before the implicits. MCInstrDesc has oprand flags like early clobber and operand ties that apply to the explicit operands. Don't look at those flags when the implicit operands are first added in the explicit operands's positions. llvm-svn: 162910	2012-08-30 14:39:06 +00:00
Alexey Samsonov	f54e3aaeaa	Whitespace llvm-svn: 162907	2012-08-30 13:47:13 +00:00
Nadav Rotem	d5f5777b77	It is illegal to transform (sdiv (ashr X c1) c2) -> (sdiv x (2^c1 * c2)), because C always rounds towards zero. Thanks Dirk and Ben. llvm-svn: 162899	2012-08-30 11:23:20 +00:00
Tim Northover	ca9f384ff8	Add support for moving pure S-register to NEON pipeline if desired llvm-svn: 162898	2012-08-30 10:17:45 +00:00
Alexey Samsonov	45be793e3a	Refactor fetching file/line info from DWARFContext to simplify the code and allow better code reuse. Make the code a bit more conforming to LLVM code style. No functionality change. llvm-svn: 162895	2012-08-30 07:49:50 +00:00
Craig Topper	2da13f9ef8	Add FMA to switch statement in VectorLegalizer::LegalizeOp so that it can be expanded when it isn't legal. llvm-svn: 162894	2012-08-30 07:34:22 +00:00
Craig Topper	c8f5d77e75	Add support for FMA to WidenVectorResult. llvm-svn: 162893	2012-08-30 07:13:41 +00:00
Craig Topper	e39ad7b549	Only perform DAG combine on FMAs of legal types. llvm-svn: 162892	2012-08-30 06:56:15 +00:00
Bill Wendling	14c8a051ca	Pass by pointer and not std::string. llvm-svn: 162888	2012-08-30 01:32:31 +00:00
Bill Wendling	1f6f8c2cb7	Revert r162855 in favor of changing clang to emit the absolute coverage file path. llvm-svn: 162883	2012-08-30 00:34:21 +00:00
Michael Liao	3c8980646b	Fix PR13727 - The root cause is that target constant materialization in X86 fast-isel creates a PC-rel addressing which may overflow 32-bit range in non-Small code model if .rodata section is allocated too far away from code segment in MCJIT, which uses Large code model so far. - Follow the similar logic to fix non-Small code model in fast-isel by skipping non-Small code model. llvm-svn: 162881	2012-08-30 00:30:16 +00:00
Jakob Stoklund Olesen	ffba07b927	Verify the order of tied operands in inline asm. When there are multiple tied use-def pairs on an inline asm instruction, the tied uses must appear in the same order as the defs. It is possible to write an LLVM IR inline asm instruction that breaks this constraint, but there is no reason for a front end to emit the operands out of order. The gnu inline asm syntax specifies tied operands as a single read/write constraint "+r", so ouf of order operands are not possible. llvm-svn: 162878	2012-08-29 23:52:52 +00:00
Benjamin Kramer	ffa24e0438	Add some __builtin_expect magic to StringMap. Tombstones and full hash collisions are rare, mark the "empty" and "no collision" paths as likely. The bug in simplifycfg that prevented the hints from being picked during selfhost up was fixed recently :) llvm-svn: 162874	2012-08-29 22:57:04 +00:00
Benjamin Kramer	bd7f8d0260	Replace the BUILTIN_EXPECT macro with a less horrible LLVM_LIKELY/LLVM_UNLIKELY interface. llvm-svn: 162873	2012-08-29 22:57:00 +00:00
Owen Anderson	9d0f923e7c	Allow targets to specify a minimum supported NOP size when performing NOP padding. If the desired padding is smaller than the supported NOP size, we will enlarge the padding to make it work. llvm-svn: 162870	2012-08-29 22:18:56 +00:00
Jakob Stoklund Olesen	b2bef482fd	Set the isTied flags when building INLINEASM MachineInstrs. For normal instructions, isTied() is set automatically by addOperand(), based on MCInstrDesc, but inline asm has tied operands outside the descriptor. llvm-svn: 162869	2012-08-29 22:02:00 +00:00
Andrew Trick	3051aa1cb8	Preserve branch profile metadata during switch formation. Patch by Michael Ilseman! This fixes SimplifyCFGOpt::FoldValueComparisonIntoPredecessors to preserve metata when folding conditional branches into switches. void foo(int x) { if (x == 0) bar(1); else if (__builtin_expect(x == 10, 1)) bar(2); else if (x == 20) bar(3); } CFG: B0 \| \ \| X0 B10 \| \ \| X10 B20 \| \ E X20 Merge B0-B10: w(B0-X0) = w(B0-X0)sum-weights(B10) = w(B0-X0) (w(B10-X10) + w(B10-B20)) w(B0-X10) = w(B0-B10) * w(B10-X10) w(B0-B20) = w(B0-B10) * w(B10-B20) B0 __ \| \ \ \| X10 X0 B20 \| \ E X20 Merge B0-B20: w(B0-X0) = w(B0-X0) * sum-weights(B20) = w(B0-X0) * (w(B20-E) + w(B20-X20)) w(B0-X10) = w(B0-X10) * sum-weights(B20) = ... w(B0-X20) = w(B0-B20) * w(B20-X20) w(B0-E) = w(B0-B20) * w(B20-E) llvm-svn: 162868	2012-08-29 21:46:38 +00:00
Andrew Trick	f3cf1932b3	whitespace llvm-svn: 162867	2012-08-29 21:46:36 +00:00
Jakob Stoklund Olesen	cea3e77433	Rename hasVolatileMemoryRef() to hasOrderedMemoryRef(). Ordered memory operations are more constrained than volatile loads and stores because they must be ordered with respect to all other memory operations. llvm-svn: 162861	2012-08-29 21:19:21 +00:00
Jakob Stoklund Olesen	813a109fa5	Don't move normal loads across volatile/atomic loads. It is technically allowed to move a normal load across a volatile load, but probably not a good idea. It is not allowed to move a load across an atomic load with Ordering > Monotonic, and we model those with MOVolatile as well. I recently removed the mayStore flag from atomic load instructions, so they don't need a pseudo-opcode. This patch makes up for the difference. llvm-svn: 162857	2012-08-29 20:48:45 +00:00
Bill Wendling	11e61b9557	Use the full path to output the .gcda file. This lets the user run the program from a different directory and still have the .gcda files show up in the correct place. <rdar://problem/12179524> llvm-svn: 162855	2012-08-29 20:30:44 +00:00
Hal Finkel	1859d26528	Reserve space for the mandatory traceback fields on PPC64. We need to reserve space for the mandatory traceback fields, though leaving them as zero is appropriate for now. Although the ABI calls for these fields to be filled in fully, no compiler on Linux currently does this, and GDB does not read these fields. GDB uses the first word of zeroes during exception handling to find the end of the function and the size field, allowing it to compute the beginning of the function. DWARF information is used for everything else. We need the extra 8 bytes of pad so the size field is found in the right place. As a comparison, GCC fills in a few of the fields -- language, number of saved registers -- but ignores the rest. IBM's proprietary OSes do make use of the full traceback table facility. Patch by Bill Schmidt. llvm-svn: 162854	2012-08-29 20:22:24 +00:00
Bill Wendling	e8aee6b8a5	Use ArrayRef instead of SmallVector when passing vector into function. llvm-svn: 162851	2012-08-29 18:45:41 +00:00
Jakob Stoklund Olesen	7a837b9a76	Verify the consistency of inline asm operands. The operands on an INLINEASM machine instruction are divided into groups headed by immediate flag operands. Verify this structure. Extract verifyTiedOperands(), and only call it for non-inlineasm instructions. llvm-svn: 162849	2012-08-29 18:11:05 +00:00
Eric Christopher	2a4e616df6	Clean this up slightly, doesn't really fall through. llvm-svn: 162848	2012-08-29 17:59:32 +00:00
Tim Northover	771f160758	Refactor setExecutionDomain to be clearer about what it's doing and more robust. llvm-svn: 162844	2012-08-29 16:36:07 +00:00
Benjamin Kramer	8f5c5ded4e	Make helper function static. llvm-svn: 162843	2012-08-29 16:17:01 +00:00
Benjamin Kramer	8bcc971174	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Craig Topper	a999c66292	Convert FMA4 patterns to use target specific nodes instead of intrinsics to align with FMA3. llvm-svn: 162829	2012-08-29 07:18:25 +00:00
Craig Topper	5f96ca51b6	Add virtual keywords for methods that override the base class. llvm-svn: 162826	2012-08-29 05:48:09 +00:00
Andrew Trick	b57e225742	Cleanup sloppy code. Jakob's review. llvm-svn: 162825	2012-08-29 04:41:37 +00:00
Jush Lu	e87e559e62	[arm-fast-isel] Add support for ARM PIC. llvm-svn: 162823	2012-08-29 02:41:21 +00:00
Andrew Trick	bd0073ddd7	Fix ARM vector copies of overlapping register tuples. I have tested the fix, but have not been successfull in generating a robust unit test. This can only be exposed through particular register assignments. llvm-svn: 162821	2012-08-29 01:58:55 +00:00
Andrew Trick	4cc6949a2b	cleanup llvm-svn: 162820	2012-08-29 01:58:52 +00:00
Jakob Stoklund Olesen	dbbff7899d	Verify the tied operand flags. WHen running with -verify-machineinstrs, check that tied operands come in matching use/def pairs, and that they are consistent with MCInstrDesc when it applies. llvm-svn: 162816	2012-08-29 00:38:03 +00:00
Jakob Stoklund Olesen	2b16664522	Maintain a vaild isTied bit as operands are added and removed. The isTied bit is set automatically when a tied use is added and MCInstrDesc indicates a tied operand. The tie is broken when one of the tied operands is removed. llvm-svn: 162814	2012-08-29 00:37:58 +00:00
Chad Rosier	3b1336ceb9	Typo. llvm-svn: 162807	2012-08-28 23:57:47 +00:00
Michael Liao	407d659fa5	Add comments on the literal value used. llvm-svn: 162805	2012-08-28 23:42:17 +00:00
Manman Ren	abbb01abea	Profile: set branch weight metadata with data generated from profiling. This patch implements ProfileDataLoader which loads profile data generated by -insert-edge-profiling and updates branch weight metadata accordingly. Patch by Alastair Murray. llvm-svn: 162799	2012-08-28 22:21:25 +00:00
Jack Carter	cd6b0e1368	The instruction DEXT may be transformed into DEXTU or DEXTM depending on the size of the extraction and its position in the 64 bit word. This patch allows support of the dext transformations with mips64 direct object output. 0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32 DINS The field is entirely contained in the right-most word of the doubleword 32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64 DINSM The field straddles the words of the doubleword 32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32 DINSU The field is entirely contained in the left-most word of the doubleword llvm-svn: 162782	2012-08-28 20:07:41 +00:00
Michael Liao	710e1a594b	Explicitly update the number of nodes to be traversed llvm-svn: 162780	2012-08-28 19:20:29 +00:00
Jack Carter	c20a21b855	Some instructions are passed to the assembler to be transformed to the final instruction variant. An example would be dsrll which is transformed into dsll32 if the shift value is greater than 32. For direct object output we need to do this transformation in the codegen. If the instruction was inside branch delay slot, it was being missed. This patch corrects this oversight. llvm-svn: 162779	2012-08-28 19:07:39 +00:00
Roman Divacky	8c4b6a307e	Emit word of zeroes after the last instruction as a start of the mandatory traceback table on PowerPC64. This helps gdb handle exceptions. The other mandatory fields are ignored by gdb and harder to implement so just add there a FIXME. Patch by Bill Schmidt. PR13641. llvm-svn: 162778	2012-08-28 19:06:55 +00:00
Akira Hatanaka	206cefe66c	Follow-up patch to r162731. Fix a couple of bugs in mips' long branch pass. This patch was supposed to be committed along with r162731, so I don't have a new test case. llvm-svn: 162777	2012-08-28 18:58:57 +00:00
Jakob Stoklund Olesen	e56c60c5eb	Add a MachineOperand::isTied() flag. While in SSA form, a MachineInstr can have pairs of tied defs and uses. The tied operands are used to represent read-modify-write operands that must be assigned the same physical register. Previously, tied operand pairs were computed from fixed MCInstrDesc fields, or by using black magic on inline assembly instructions. The isTied flag makes it possible to add tied operands to any instruction while getting rid of (some of) the inlineasm magic. Tied operands on normal instructions are needed to represent predicated individual instructions in SSA form. An extra <tied,imp-use> operand is required to represent the output value when the instruction predicate is false. Adding a predicate to: %vreg0<def> = ADD %vreg1, %vreg2 Will look like: %vreg0<tied,def> = ADD %vreg1, %vreg2, pred:3, %vreg7<tied,imp-use> The virtual register %vreg7 is the value given to %vreg0 when the predicate is false. It will be assigned the same physreg as %vreg0. This commit adds the isTied flag and sets it based on MCInstrDesc when building an instruction. The flag is not used for anything yet. llvm-svn: 162774	2012-08-28 18:34:41 +00:00
Jakob Stoklund Olesen	dba99d0dfa	Don't allow TargetFlags on MO_Register MachineOperands. Register operands are manipulated by a lot of target-independent code, and it is not always possible to preserve target flags. That means it is not safe to use target flags on register operands. None of the targets in the tree are using register operand target flags. External targets should be using immediate operands to annotate instructions with operand modifiers. llvm-svn: 162770	2012-08-28 18:05:48 +00:00
Hal Finkel	742b535e40	Add PPC Freescale e500mc and e5500 subtargets. Add subtargets for Freescale e500mc (32-bit) and e5500 (64-bit) to the PowerPC backend. Patch by Tobias von Koch. llvm-svn: 162764	2012-08-28 16:12:39 +00:00
Benjamin Kramer	1e1a1dedc6	InstCombine: Defensively avoid undefined shifts by limiting the amount to the bit width. No test case, undefined shifts get folded early, but can occur when other transforms generate a constant. Thanks to Duncan for bringing this up. llvm-svn: 162755	2012-08-28 13:59:23 +00:00
Benjamin Kramer	9c0a807c27	InstCombine: Guard the transform introduced in r162743 against large ints and non-const shifts. llvm-svn: 162751	2012-08-28 13:08:13 +00:00
Nadav Rotem	d457787fed	Make sure that we don't call getZExtValue on values > 64 bits. Thanks Benjamin for noticing this. llvm-svn: 162749	2012-08-28 12:23:22 +00:00
Nadav Rotem	11935b29f3	Teach InstCombine to canonicalize [SU]div+[AL]shl patterns. For example: %1 = lshr i32 %x, 2 %2 = udiv i32 %1, 100 rdar://12182093 llvm-svn: 162743	2012-08-28 10:01:43 +00:00
Bill Wendling	cc56718038	The commutative flag is already correctly set within the multiclass. If we set it here, then a 'register-memory' version would wrongly get the commutative flag. <rdar://problem/12180135> llvm-svn: 162741	2012-08-28 07:36:46 +00:00
Craig Topper	72f51c3986	Convert V_SETALLONES/AVX_SETALLONES/AVX2_SETALLONES to Post-RA pseudos. llvm-svn: 162740	2012-08-28 07:30:47 +00:00
Craig Topper	bd509eea4a	Merge AVX_SET0PSY/AVX_SET0PDY/AVX2_SET0 into a single post-RA pseudo. llvm-svn: 162738	2012-08-28 07:05:28 +00:00
Michael Liao	b7d85b6328	Fix PR12312 - Add a target-specific DAG optimization to recognize a pattern PTEST-able. Such a pattern is a OR'd tree with X86ISD::OR as the root node. When X86ISD::OR node has only its flag result being used as a boolean value and all its leaves are extracted from the same vector, it could be folded into an X86ISD::PTEST node. llvm-svn: 162735	2012-08-28 03:34:40 +00:00
Jakob Stoklund Olesen	87cb471e52	Remove extra MayLoad/MayStore flags from atomic_load/store. These extra flags are not required to properly order the atomic load/store instructions. SelectionDAGBuilder chains atomics as if they were volatile, and SelectionDAG::getAtomic() sets the isVolatile bit on the memory operands of all atomic operations. The volatile bit is enough to order atomic loads and stores during and after SelectionDAG. This means we set mayLoad on atomic_load, mayStore on atomic_store, and mayLoad+mayStore on the remaining atomic read-modify-write operations. llvm-svn: 162733	2012-08-28 03:11:32 +00:00
Jakob Stoklund Olesen	b3de7b1790	Revert r162713: "Add ATOMIC_LDR* pseudo-instructions to model atomic_load on ARM." This wasn't the right way to enforce ordering of atomics. We are already setting the isVolatile bit on memory operands of atomic operations which is good enough to enforce the correct ordering. llvm-svn: 162732	2012-08-28 03:11:27 +00:00
Akira Hatanaka	b5af7121b1	Fix mips' long branch pass. Instructions emitted to compute branch offsets now use immediate operands instead of symbolic labels. This change was needed because there were problems when R_MIPS_HI16/LO16 relocations were used to make shared objects. llvm-svn: 162731	2012-08-28 03:03:05 +00:00
Hal Finkel	679c73cb33	Split several PPC instruction classes. Slight reorganisation of PPC instruction classes for scheduling. No functionality change for existing subtargets. - Clearly separate load/store-with-update instructions from regular loads and stores. - Split IntRotateD -> IntRotateD and IntRotateDI - Split out fsub and fadd from FPGeneral -> FPAddSub - Update existing itineraries Patch by Tobias von Koch. llvm-svn: 162729	2012-08-28 02:49:14 +00:00
Akira Hatanaka	adb14f56c7	Fix bug 13532. In SelectionDAGLegalize::ExpandLegalINT_TO_FP, expand INT_TO_FP nodes without using any f64 operations if f64 is not a legal type. Patch by Stefan Kristiansson. llvm-svn: 162728	2012-08-28 02:12:42 +00:00
Hal Finkel	686f2ee226	Allow remat of LI on PPC. Allow load-immediates to be rematerialised in the register coalescer for PPC. This makes test/CodeGen/PowerPC/big-endian-formal-args.ll fail, because it relies on a register move getting emitted. The immediate load is equivalent, so change this test case. Patch by Tobias von Koch. llvm-svn: 162727	2012-08-28 02:10:33 +00:00
Hal Finkel	b5d177e5b0	Add the Freescale vendor to Triple. Adds the vendor 'fsl' (used by Freescale SDK) to Triple. This will allow clang support for Freescale cross-compile configurations. Patch by Tobias von Koch. llvm-svn: 162726	2012-08-28 02:10:30 +00:00
Hal Finkel	5ab378037f	Eliminate redundant CR moves on PPC32. The 32-bit ABI requires CR bit 6 to be set if the call has fp arguments and unset if it doesn't. The solution up to now was to insert a MachineNode to set/unset the CR bit, which produces a CR vreg. This vreg was then copied into CR bit 6. When the register allocator saw a bunch of these in the same function, it allocated the set/unset CR bit in some random CR register (1 extra instruction) and then emitted CR moves before every vararg function call, rather than just setting and unsetting CR bit 6 directly before every vararg function call. This patch instead inserts a PPCcrset/PPCcrunset instruction which are then matched by a dedicated instruction pattern. Patch by Tobias von Koch. llvm-svn: 162725	2012-08-28 02:10:27 +00:00
Hal Finkel	e39526a789	Optimize zext on PPC64. The zeroextend IR instruction is lowered to an 'and' node with an immediate mask operand, which in turn gets legalised to a sequence of ori's & ands. This can be done more efficiently using the rldicl instruction. Patch by Tobias von Koch. llvm-svn: 162724	2012-08-28 02:10:15 +00:00
Jakob Stoklund Olesen	89d6b29d16	More missing mayLoad flags on AVX multiclasses. llvm-svn: 162714	2012-08-28 00:02:01 +00:00
Jakob Stoklund Olesen	b24cb8c541	Add ATOMIC_LDR* pseudo-instructions to model atomic_load on ARM. It is not safe to use normal LDR instructions because they may be reordered by the scheduler. The ATOMIC_LDR pseudos have a mayStore flag that prevents reordering. Atomic loads are also prevented from participating in rematerialization and load folding. llvm-svn: 162713	2012-08-27 23:58:52 +00:00
Marshall Clow	ef271cce0a	Fix compile error when building with C++11 - clang thinks that PRIx64 is a user-defined suffix or something llvm-svn: 162704	2012-08-27 22:53:35 +00:00
Bill Wendling	988a47d7e5	Make sure we add the predicate after all of the registers are added. <rdar://problem/12183003> llvm-svn: 162703	2012-08-27 22:12:44 +00:00
Dan Gohman	10c82cee04	Don't use for loops for code that is only intended to execute once. No intended functionality change. Thanks to Ahmed Charles for spotting it. llvm-svn: 162686	2012-08-27 18:31:36 +00:00
Rafael Espindola	073ee7d0a8	Fix comment. llvm-svn: 162678	2012-08-27 16:04:24 +00:00
Danil Malyshev	97714bc149	Fix comment for function RuntimeDyldImpl.resolveRelocation() llvm-svn: 162677	2012-08-27 15:34:01 +00:00
Hongbin Zheng	14c05c409a	Remove the the block_node_iterator of Region, replace it by the block_iterator. llvm-svn: 162672	2012-08-27 13:49:24 +00:00
NAKAMURA Takumi	37338a8352	DWARFDebugRangeList.cpp: Use PRIx64 for uint64_t as format string. llvm-svn: 162665	2012-08-27 10:10:10 +00:00
Craig Topper	a737ef8964	Remove MMX shift intrinsic handling code that also exists in SelectionDAGBuilder. llvm-svn: 162661	2012-08-27 08:08:30 +00:00
Alexey Samsonov	b4c95daf86	[DebugInfo] fixup for r162657: update CMakeLists.txt llvm-svn: 162659	2012-08-27 07:24:43 +00:00
Craig Topper	5af2fed5f2	Don't allow vextractf128 to be folded with unaligned stores. We don't fold unaligned loads so shouldn't fold unaligned stores as it can cause an alignment fault to occur. llvm-svn: 162658	2012-08-27 07:19:59 +00:00
Alexey Samsonov	034e57a297	Add basic support for .debug_ranges section to LLVM's DebugInfo library. This section (introduced in DWARF-3) is used to define instruction address ranges for functions that are not contiguous and can't be described by low_pc/high_pc attributes (this is the usual case for inlined subroutines). The patch is the first step to support fetching complete inlining info from DWARF. Reviewed by Benjamin Kramer. llvm-svn: 162657	2012-08-27 07:17:47 +00:00
Craig Topper	6d44554cd4	Fold some patterns into instruction definitons so tablegen can infer flags removing the need for an explicit 'neverHasSideEffects = 1' llvm-svn: 162656	2012-08-27 07:04:50 +00:00
Craig Topper	f7828f91ee	Add HasAVX1Only predicate and use it for patterns that have an AVX1 instruction and an AVX2 instruction rather than relying on AddedComplexity. llvm-svn: 162654	2012-08-27 06:08:57 +00:00
Richard Smith	228e6d4cf3	Fix integer undefined behavior due to signed left shift overflow in LLVM. Reviewed offline by chandlerc. llvm-svn: 162623	2012-08-24 23:29:28 +00:00
Jakob Stoklund Olesen	3d91b43ad2	Add missing mayLoad flags to a large class of AVX *_Int instructions. llvm-svn: 162622	2012-08-24 23:29:07 +00:00
Jakob Stoklund Olesen	74352494a6	Missed tLEApcrelJT. ARMConstantIslandPass expects this instruction to stay in the same basic block as the jump table branch. llvm-svn: 162615	2012-08-24 22:46:55 +00:00
Jakob Stoklund Olesen	47ac1a8ec0	Explicitly mark LEApcrel pseudos with hasSideEffects. It's not clear that they should be marked as such, but tbb formation fails if t2LEApcrelJT is hoisted of of a loop. This doesn't change the flags on these instructions, UnmodeledSideEffects was already inferred from the missing pattern. llvm-svn: 162603	2012-08-24 21:44:11 +00:00
Jakob Stoklund Olesen	e6afde59db	Fix call instruction operands in ARMFastISel. The ARM BL and BLX instructions don't have predicate operands, but the thumb variants tBL and tBLX do. The argument registers should be added as implicit uses. llvm-svn: 162593	2012-08-24 20:52:46 +00:00
Jakob Stoklund Olesen	b50cf8b30f	Mark X86::RET and RETI instructions as variadic. There is special magic happening when returning floating point values on the x87 stack. The RET instructions get extra f80 operands. llvm-svn: 162592	2012-08-24 20:52:44 +00:00
Jakob Stoklund Olesen	10cdd09318	Avoid including explicit uses when counting SDNode imp-uses. It is legal to have a register node as an explicit operand, it shouldn't be counted as an implicit use. llvm-svn: 162591	2012-08-24 20:52:42 +00:00
Akira Hatanaka	4a08a4a8b6	Disable Mips' delay slot filler when optimization level is O0. llvm-svn: 162589	2012-08-24 20:40:15 +00:00
Akira Hatanaka	e8e4ef102d	In MipsDAGToDAGISel::SelectAddr, fold add node into address operand, if its second operand is MipsISD::GPRel. llvm-svn: 162584	2012-08-24 20:21:49 +00:00
Manman Ren	cf10446ffa	BranchProb: modify the definition of an edge in BranchProbabilityInfo to handle the case of multiple edges from one block to another. A simple example is a switch statement with multiple values to the same destination. The definition of an edge is modified from a pair of blocks to a pair of PredBlock and an index into the successors. Also set the weight correctly when building SelectionDAG from LLVM IR, especially when converting a Switch. IntegersSubsetMapping is updated to calculate the weight for each cluster. llvm-svn: 162572	2012-08-24 18:14:27 +00:00
Kostya Serebryany	4cc511daf0	[asan/tsan] rename FunctionBlackList* to BlackList* as this class is not limited to functions any more llvm-svn: 162566	2012-08-24 16:44:47 +00:00
Kostya Serebryany	36dfc5ceab	[asan/tsan] extend the functionality of FunctionBlackList to globals and modules. Patch by Reid Watson. llvm-svn: 162565	2012-08-24 16:40:11 +00:00
Roman Divacky	ace4707ea6	Lower constant pools and jump tables via TOC on PPC64/SVR4. In collaboration with Adhemerval Zanella. llvm-svn: 162562	2012-08-24 16:26:02 +00:00
Benjamin Kramer	dd62d6b6c8	GVN: Fix quadratic runtime on the number of switch cases. No intended behavior change. This was introduced in r162023. With the fixed algorithm a Release build of ARMInstPrinter.cpp goes from 16s to 10s on a 2011 MBP. llvm-svn: 162559	2012-08-24 15:06:28 +00:00
Jakob Stoklund Olesen	3ac45d9a1f	Fix load/store SDNode flags. llvm-svn: 162558	2012-08-24 14:43:30 +00:00
Jakob Stoklund Olesen	a954e92053	Add missing SDNPSideEffect flags. llvm-svn: 162557	2012-08-24 14:43:27 +00:00
Jakob Stoklund Olesen	8ff666fcb6	Remove more mayLoad workarounds. llvm-svn: 162556	2012-08-24 14:43:22 +00:00
Craig Topper	663d160adb	Custom lower FMA intrinsics to target specific nodes and remove the patterns. llvm-svn: 162534	2012-08-24 04:03:22 +00:00
Eric Christopher	bb69a27dbc	Use DW_FORM_flag_present to save space in debug information if we're not in darwin gdb compat mode. Fixes rdar://10975088 llvm-svn: 162526	2012-08-24 01:14:27 +00:00
Eric Christopher	d999bb7ff4	Add support for some missing DW_FORM_*. TODO: Fix code duplication and coding style. llvm-svn: 162525	2012-08-24 01:14:23 +00:00
Eric Christopher	67942acc6b	Formatting. llvm-svn: 162524	2012-08-24 01:14:21 +00:00
Richard Smith	f3c75f7e7c	Fix undefined behavior (negation of INT_MIN) in ARM backend. llvm-svn: 162520	2012-08-24 00:35:46 +00:00
Richard Smith	c621af1f60	Fix floating-point divide by zero, in a case where the value was not going to be used anyway. llvm-svn: 162518	2012-08-24 00:31:45 +00:00
Jakob Stoklund Olesen	d3511235d1	Remove some spurious mayLoad = 0 flags. They were inserted to silence TableGen's warning about redundant properties. That warning is now gone. llvm-svn: 162517	2012-08-24 00:31:20 +00:00
Jakob Stoklund Olesen	acf7c47e64	Add missing SDNP properties on the flushw node. llvm-svn: 162515	2012-08-24 00:31:13 +00:00
Jakob Stoklund Olesen	df1faa0503	X86MemBarrier has unmodeled side effects. llvm-svn: 162514	2012-08-24 00:31:10 +00:00

1 2 3 4 5 ...

56070 Commits