llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	b70d9780ac	80 col. comment. llvm-svn: 198653	2014-01-07 01:02:52 +00:00
Jack Carter	0cd3c19f33	[Mips] TargetStreamer Support for .abicalls and .set pic0. This patch adds .abicalls and .set pic0 support which affects the ELF ABI and its flags. In addition the patch uses a common interface for both the MipsTargetStreamer and MipsObjectStreamer that both the integrated and standalone assemblers will use for the output for these directives. llvm-svn: 198646	2014-01-06 23:27:31 +00:00
Kevin Enderby	f16c8c5162	For the 'C' disassembler API, add a new ReferenceType for the SymbolLookUp() call back to return a demangled C++ name to be used as a comment. For example, darwin's otool(1) program, which uses the llvm disassembler, can now produce disassembly like: callq __ZNK4llvm6Target20createMCDisassemblerERKNS_15MCSubtargetInfoE ## llvm::Target::createMCDisassembler(llvm::MCSubtargetInfo const&) const Also fix a bug in LLVMDisasmInstruction() that was not flushing the raw_svector_ostream for the disassembled instruction string before copying it to the output buffer, which was causing truncation of the output. rdar://10173828 llvm-svn: 198637	2014-01-06 22:08:08 +00:00
Rafael Espindola	abdd726ce5	Improve documentation of the 'a' specifier and the '<abi>:<pref>' align pair. llvm-svn: 198636	2014-01-06 21:40:24 +00:00
Andrew Trick	6796ab424c	Reapply r198478 "Fix PR18361: Invalidate LoopDispositions after LoopSimplify hoists things." Now with a fix for PR18384: ValueHandleBase::ValueIsDeleted. We need to invalidate SCEV's loop info when we delete a block, even if no values are hoisted. llvm-svn: 198631	2014-01-06 19:43:14 +00:00
Rafael Espindola	d61a0c028d	Remove dead code. llvm-svn: 198624	2014-01-06 18:14:34 +00:00
Tim Northover	d6a729bb85	ARM MachO: sort out isTargetDarwin/isTargetIOS/... checks. The ARM backend has been using most of the MachO related subtarget checks almost interchangeably, and since the only target it's had to run on has been IOS (which is all three of MachO, Darwin and IOS) it's worked out OK so far. But we'd like to support embedded targets under the "--none-macho" triple, which means everything starts falling apart and inconsistent behaviours emerge. This patch should pick a reasonably sensible set of behaviours for the new triple (and any others that come along, with luck). Some choices were debatable (notably FP == r7 or r11), but we can revisit those later when deficiencies become apparent. llvm-svn: 198617	2014-01-06 14:28:05 +00:00
Robert Lytton	9523aa41fb	XCore Target: correct callee save register spilling when callsUnwindInit is true. llvm-svn: 198616	2014-01-06 14:21:12 +00:00
Robert Lytton	c8c4aa667b	XCore target: Lower EH_RETURN llvm-svn: 198615	2014-01-06 14:21:07 +00:00
Robert Lytton	5da175214b	XCore target: Lower FRAME_TO_ARGS_OFFSET This requires a knowledge of the stack size which is not known until the frame is complete, hence the need for the XCoreFTAOElim pass which lowers the XCoreISD::FRAME_TO_ARGS_OFFSET instruction into its final form. llvm-svn: 198614	2014-01-06 14:21:00 +00:00
Robert Lytton	dec798751a	XCore target: Lower RETURNADDR Only handles a depth of zero (the same as FRAMEADDR) llvm-svn: 198613	2014-01-06 14:20:53 +00:00
Robert Lytton	cbb588a264	XCore target: Optimise entsp / retsp selection llvm-svn: 198612	2014-01-06 14:20:47 +00:00
Robert Lytton	a53360a339	XCore target: Refactor LR handling We also narrow the liveness of FP & LR during the prologue to reflect the actual usage of the registers. I have been unable to construct a test to prove the previous live range was too large. llvm-svn: 198611	2014-01-06 14:20:41 +00:00
Robert Lytton	9288eea910	XCore target: Refactor the loading of constants into a register This common functionality will be used to lower FRAME_TO_ARGS_OFFSET. llvm-svn: 198610	2014-01-06 14:20:37 +00:00
Robert Lytton	bc4d976152	XCore target: fix handling of unsized global arrays in large code model llvm-svn: 198609	2014-01-06 14:20:32 +00:00
Tim Northover	7649ebacd6	ARM: keep special non-AEABIness of "-darwin-eabi" triples for now Longer term, we want to move users to "---macho" for embedded work, but for now people are relying on the last thing we told them, which is unfortunately "-*-darwin-eabi". rdar://problem/15703934 llvm-svn: 198602	2014-01-06 12:00:44 +00:00
Elena Demikhovsky	3629b4aa0e	AVX-512: added intrinsic vcvtpd2ps (with rounding mode and without) llvm-svn: 198593	2014-01-06 08:45:54 +00:00
Venkatraman Govindaraju	2bab98bbae	[Sparc] Explicitly cast -1 to unsigned to fix buildbot errors. llvm-svn: 198592	2014-01-06 08:24:44 +00:00
Venkatraman Govindaraju	dfcccc7db0	[Sparc] Add initial implementation of disassembler for sparc llvm-svn: 198591	2014-01-06 08:08:58 +00:00
David Majnemer	a45a176ebc	MC: Fatally error if subtraction operand is bad Instead of crashing, raise an error when a subtraction expression involves an undefined symbol. This fixes PR18375. llvm-svn: 198590	2014-01-06 07:39:46 +00:00
Craig Topper	7c6baa7834	Remove SegOvrBits from X86 TSFlags since they weren't being used. llvm-svn: 198588	2014-01-06 06:51:58 +00:00
Craig Topper	78e58b28a5	Remove argument to fix build bot failure. llvm-svn: 198587	2014-01-06 06:09:03 +00:00
Craig Topper	7ceb54a2a1	Add OpSize16 bit, for instructions which need 0x66 prefix in 16-bit mode The 0x66 prefix toggles between 16-bit and 32-bit addressing mode. So in 32-bit mode it is used to switch to 16-bit addressing mode for the following instruction, while in 16-bit mode it's the other way round — it's used to switch to 32-bit mode instead. Thus, emit the 0x66 prefix byte for OpSize only in 32-bit (and 64-bit) mode, and introduce a new OpSize16 bit which is used in 16-bit mode instead. This is just the basic infrastructure for that change; a subsequent patch will add the new OpSize16 bit to the 32-bit instructions that need it. Patch from David Woodhouse. llvm-svn: 198586	2014-01-06 06:02:58 +00:00
Bill Wendling	13199b17f8	Remove unnecessary #includes. llvm-svn: 198585	2014-01-06 06:00:00 +00:00
Craig Topper	3c80d62a6c	[x86] Add basic support for .code16 This is not really expected to work right yet. Mostly because we will still emit the OpSize (0x66) prefix in all the wrong places, along with a number of other corner cases. Those will all be fixed in the subsequent commits. Patch from David Woodhouse. llvm-svn: 198584	2014-01-06 04:55:54 +00:00
Kevin Qin	5cd73c9e0a	[AArch64 NEON] Fix invalid constant used in vselect condition. There is a wrong assumption that the vector element type and the type of each ConstantSDNode in the build_vector were the same. However, when promoting the integer operand of a legally typed build_vector, the operand type and the vector element type do not need to be the same (See method 'DAGTypeLegalizer::PromoteIntOp_BUILD_VECTOR' in LegalizeIntegerTypes.cpp). In the AArch64 backend, the following dag sequence: C0: i1 = Constant<0> C1: i1 = Constant<-1> V: v8i1 = BUILD_VECTOR C1, C1, C0, C0, C0, C0, C0, C0 is type-legalized into: NewC0: i32 = Constant<0> NewC1: i32 = Constant<1> V: v8i8 = BUILD_VECTOR NewC1, NewC1, NewC0, NewC0, NewC0, NewC0, NewC0, NewC0 Force a getZeroExtend to VTBits to ensure that the new constant is correct. llvm-svn: 198582	2014-01-06 02:26:10 +00:00
Venkatraman Govindaraju	b73aeca888	[Sparc] Add ELF Object Writer for Sparc. llvm-svn: 198580	2014-01-06 01:22:54 +00:00
Bill Wendling	908bf814e7	Refactor function that checks that __builtin_returnaddress's argument is constant. This moves the check up into the parent class so that all targets can use it without having to copy (and keep in sync) the same error message. llvm-svn: 198579	2014-01-06 00:43:20 +00:00
Saleem Abdulrasool	b961c99f1a	ARM: move ARMUnwindOp.h into Support Move the ARM EHABI unwind opcode definitions from the ARM MCTargetDesc into LLVM Support. This enables sharing of the definitions across the ARM target code as well as llvm-readobj. This will allow implementation of the unwind decoding in llvm-readobj. llvm-svn: 198576	2014-01-06 00:15:00 +00:00
Benjamin Kramer	db5122f6da	SPARC: Make helper function static. llvm-svn: 198567	2014-01-05 20:26:05 +00:00
Craig Topper	21ba8fbc18	Fix ModR/M byte output for 16-bit addressing modes (PR18220) Add some tests to validate correct register selection, including a fix to an existing test which was requiring the wrong output. Patch from David Woodhouse. llvm-svn: 198566	2014-01-05 19:40:56 +00:00
Craig Topper	792587cc7b	Remove opcode from MOV32r0 that I accidentally left when I converted it to Pseudo. Remove FIXME as well. llvm-svn: 198564	2014-01-05 19:25:13 +00:00
Saleem Abdulrasool	681e0bb3a6	ARM: style changes to LDRD, STRD definition Fix indentation, name registers similar to ARM ARM. No functionality change! llvm-svn: 198563	2014-01-05 16:36:37 +00:00
Elena Demikhovsky	f404e054a1	AVX-512: changed property name from "neverHasSideEffects=1" to "hasSideEffects=0", added this property to VMOVSS/VMOVSD; Optimized a truncate pattern. llvm-svn: 198562	2014-01-05 14:21:07 +00:00
Elena Demikhovsky	52e4a0e109	AVX-512: Added more intrinsics for convert and min/max. Removed vzeroupper from AVX-512 mode - our optimization guide does not recommend inserting vzeroupper at all. llvm-svn: 198557	2014-01-05 10:46:09 +00:00
Chandler Carruth	c4ddab6ff2	[PM] Add a definition for the static PassID in the CallGraphAnalysis. Missed this when adding the skeleton analysis. Caught by a build break in the next patch I'm working on when trying to use the analysis. llvm-svn: 198556	2014-01-05 10:38:52 +00:00
Craig Topper	7894e812bb	Add the other form of movq xmm,xmm for the disassembler. llvm-svn: 198551	2014-01-05 07:16:04 +00:00
Craig Topper	d9e1669d1c	Use patterns to remove some duplicate instructions. llvm-svn: 198550	2014-01-05 06:55:48 +00:00
Craig Topper	34db6523f3	Fix encoding for PUSH64i16. Add In64BitMode Predicate. Remove disassembler hack. llvm-svn: 198547	2014-01-05 05:46:38 +00:00
Craig Topper	0550ce7ac1	Mark x86 _alt instructions as AsmParserOnly so they will be omitted from disassembler without string matches. llvm-svn: 198545	2014-01-05 04:55:55 +00:00
Craig Topper	5165cf78b0	Use new ForceDisassemble flag on the 2-byte forms of INC/DEC for 32-bit mode and remove disassembler table emitter hack. llvm-svn: 198544	2014-01-05 04:32:42 +00:00
Craig Topper	3484fc2161	Add a new x86 specific instruction flag to force some isCodeGenOnly instructions to go through to the disassembler tables without resorting to string matches. Apply flag to all _REV instructions. llvm-svn: 198543	2014-01-05 04:17:28 +00:00
Venkatraman Govindaraju	5f1cce50e6	[Sparc] Add initial implementation of MC Code emitter for sparc. llvm-svn: 198533	2014-01-05 02:13:48 +00:00
Bill Wendling	df7dd28dc8	Emit an error message if the value passed to __builtin_returnaddress isn't a constant __builtin_returnaddress requires that the value passed into it be a constant. However, at -O0 even a constant expression may not be converted to a constant. Emit an error message instead of crashing. llvm-svn: 198531	2014-01-05 01:47:20 +00:00
Craig Topper	5999d47538	Mark the 64-bit x86 push/pop instructions as In64BitMode. Mark the corresponding 32-bit versions with the same encodings Not64BitMode. Remove hack from tablegen disassembler table emitter. Fix bad test. llvm-svn: 198530	2014-01-05 01:35:51 +00:00
Alp Toker	f929e09b10	Add missed cleanup from r198456 All other uses of this macro in LLVM/clang have been moved to the function definition so follow suit (and the usage advice) here too for consistency. llvm-svn: 198516	2014-01-04 22:47:48 +00:00
Craig Topper	bc281ad8c1	Tag x86 move to/from debug/control registers with Not64BitMode/In64BitMode. Remove disassembler hack. llvm-svn: 198515	2014-01-04 22:29:41 +00:00
Alp Toker	5e9f3265f8	Revert "Fix PR18361: Invalidate LoopDispositions after LoopSimplify hoists things." This commit was the source of crasher PR18384: While deleting: label %for.cond127 An asserting value handle still pointed to this value! UNREACHABLE executed at llvm/lib/IR/Value.cpp:671! Reverting to get the builders green, feel free to re-land after fixing up. (Renato has a handy isolated repro if you need it.) This reverts commit r198478. llvm-svn: 198503	2014-01-04 17:00:45 +00:00
Venkatraman Govindaraju	c2dee7dc74	[Sparc] Add the initial implementation of an asm parser for sparc/sparcv9. llvm-svn: 198484	2014-01-04 11:30:13 +00:00
Venkatraman Govindaraju	96ab3bc5bd	[SparcV9]: Implement RETURNADDR and FRAMEADDR lowering in SPARC64. Fixes PR18356. llvm-svn: 198480	2014-01-04 07:17:21 +00:00
Andrew Trick	aceac9746d	Fix PR18361: Invalidate LoopDispositions after LoopSimplify hoists things. getSCEV for an ashr instruction creates an intermediate zext expression when it truncates its operand. The operand is initially inside the loop, so the narrow zext expression has a non-loop-invariant loop disposition. LoopSimplify then runs on an outer loop, hoists the ashr operand, and properly invalidate the SCEVs that are mapped to value. The SCEV expression for the ashr is now an AddRec with the hoisted value as the now loop-invariant start value. The LoopDisposition of this wide value was properly invalidated during LoopSimplify. However, if we later get the ashr SCEV again, we again try to create the intermediate zext expression. We get the same SCEV that we did earlier, and it is still cached because it was never mapped to a Value. When we try to create a new AddRec we abort because we're using the old non-loop-invariant LoopDisposition. I don't have a solution for this other than to clear LoopDisposition when LoopSimplify hoists things. I think the long-term strategy should be to perform LoopSimplify on all loops before computing SCEV and before running any loop opts on individual loops. It's possible we may want to rerun LoopSimplify on individual loops, but it should rarely do anything, so rarely require invalidating SCEV. llvm-svn: 198478	2014-01-04 05:52:49 +00:00
Craig Topper	1da8582322	Remove JMP64pcrel32 (jmpq ). There are no tests for it. I'm pretty sure it won't be emitted correctly since it was set to NoImm. And I can't prove that gas accepts 'jmpq' with an immediate either. Remove the special case for it from the disassembler table generator. llvm-svn: 198475	2014-01-04 05:09:27 +00:00
Nico Weber	7408c7066a	Add a LLVM_DUMP_METHOD macro. The motivation is to mark dump methods as used in debug builds so that they can be called from lldb, but to not do so in release builds so that they can be dead-stripped. There's lots of potential follow-up work suggested in the thread "Should dump methods be LLVM_ATTRIBUTE_USED only in debug builds?" on cfe-dev, but everyone seems to agree on this subset. Macro name chosen by fair coin toss. llvm-svn: 198456	2014-01-03 22:53:37 +00:00
Reid Kleckner	19bccb790e	Revert "For disassembly when adding a symbolic operand that is a C++ symbol name, also put the human readable name in a comment." This reverts commit r198441. This change doesn't build on Windows, and doesn't do the right thing on Linux and other platforms that don't use a _Z prefix instead of __Z for C++ names. It also had no tests, so it wasn't clear how to fix it forward. llvm-svn: 198445	2014-01-03 19:56:20 +00:00
Kevin Enderby	b05bec7ce8	For disassembly when adding a symbolic operand that is a C++ symbol name, also put the human readable name in a comment. Also fix a bug in LLVMDisasmInstruction() that was not flushing the raw_svector_ostream for the disassembled instruction string before copying it to the output buffer that was causing truncation of the output. rdar://10173828 llvm-svn: 198441	2014-01-03 19:33:09 +00:00
Rafael Espindola	58873566b3	Make the llvm mangler depend only on DataLayout. Before this patch any program that wanted to know the final symbol name of a GlobalValue had to link with Target. This patch implements a compromise solution where the mangler uses DataLayout. This way, any tool that already links with Target (llc, clang) gets the exact behavior as before and new IR files can be mangled without linking with Target. With this patch the mangler is constructed with just a DataLayout and DataLayout is extended to include the information the Mangler needs. llvm-svn: 198438	2014-01-03 19:21:54 +00:00
Ana Pazos	e891c5f264	[AArch64][NEON] Added SXTL and SXTL2 instruction aliases llvm-svn: 198437	2014-01-03 19:20:31 +00:00
David Blaikie	cfb2115e66	Revert "Revert "Debug Info: Type Units: Simplify type hashing using IR-provided unique names."" This reverts commit r198398, thus reapplying r198397. I had accidentally introduced an endianness issue when applying the hash to the type unit. Using support::ulittle64_t in the reinterpret_cast in addDwarfTypeUnitType fixes this issue. Original commit message: Debug Info: Type Units: Simplify type hashing using IR-provided unique names. What's good for LTO metadata size problems ought to be good for non-LTO debug info size too, so let's rely on the same uniqueness in both cases. If it's insufficient for non-LTO for whatever reason (since we now won't be uniquing CU-local types or any C types - but these are likely to not be the most significant contributors to type bloat) we should consider a frontend solution that'll help both LTO and non-LTO alike, rather than using DWARF-level DIE-hashing that only helps non-LTO debug info size. It's also much simpler this way and benefits C++ even more since we can deduplicate lexically separate definitions of the same C++ type since they have the same mangled name. llvm-svn: 198436	2014-01-03 18:59:42 +00:00
David Peixotto	ea9ba446d5	Fix loop rerolling pass failure with non-constant loop lower bound The loop rerolling pass was failing with an assertion failure from a failed cast on loops like this: void foo(int *A, int *B, int m, int n) { for (int i = m; i < n; i+=4) { A[i+0] = B[i+0] * 4; A[i+1] = B[i+1] * 4; A[i+2] = B[i+2] * 4; A[i+3] = B[i+3] * 4; } } The code was casting the SCEV-expanded code for the new induction variable to a phi-node. When the loop had a non-constant lower bound, the SCEV expander would end the code expansion with an add instead of a phi node and the cast would fail. It looks like the cast to a phi node was only needed to get the induction variable value coming from the backedge to compute the end of loop condition. This patch changes the loop reroller to compare the induction variable to the number of times the backedge is taken instead of the iteration count of the loop. In other words, we stop the loop when the current value of the induction variable == IterationCount-1. Previously, the comparison was comparing the induction variable value from the next iteration == IterationCount. This problem only seems to occur on 32-bit targets. For some reason, the loop is not rerolled on 64-bit targets. PR18290 llvm-svn: 198425	2014-01-03 17:20:01 +00:00
Arnold Schwaighofer	833a82ecde	BasicAA: Use reachability instead of dominance for checking value equality in phi cycles This allows the value equality check to work even if we don't have a dominator tree. Also add some more comments. I was worried about compile time impacts and did not implement reachability but used the dominance check in the initial patch. The trade-off was that the dominator tree was required. The llvm utility function isPotentiallyReachable cuts off the recursive search after 32 visits. Testing did not show any compile time regressions, showing my worries unjustified. No compile time or performance regressions at O3 -flto -mavx on test-suite + externals. Addresses review comments from r198290. llvm-svn: 198400	2014-01-03 05:47:03 +00:00
David Blaikie	ab0ba24983	Revert "Debug Info: Type Units: Simplify type hashing using IR-provided unique names." Reverting due to bot failure I won't have time to investigate until tomorrow. This reverts commit r198397. llvm-svn: 198398	2014-01-03 04:49:04 +00:00
David Blaikie	ddb66281cd	Debug Info: Type Units: Simplify type hashing using IR-provided unique names. What's good for LTO metadata size problems ought to be good for non-LTO debug info size too, so let's rely on the same uniqueness in both cases. If it's insufficient for non-LTO for whatever reason (since we now won't be uniquing CU-local types or any C types - but these are likely to not be the most significant contributors to type bloat) we should consider a frontend solution that'll help both LTO and non-LTO alike, rather than using DWARF-level DIE-hashing that only helps non-LTO debug info size. It's also much simpler this way and benefits C++ even more since we can deduplicate lexically separate definitions of the same C++ type since they have the same mangled name. llvm-svn: 198397	2014-01-03 04:20:26 +00:00
Eric Christopher	4d214b9e9c	80-column. llvm-svn: 198394	2014-01-03 02:17:35 +00:00
Eric Christopher	50effa0437	Remove TextSectionSym as it is unused. llvm-svn: 198393	2014-01-03 02:16:44 +00:00
David Blaikie	22b29a5f1a	Revert "Reverting r193835 due to weirdness with Go..." The cgo problem was that it wants dwarf2 which doesn't support direct constant encoding of the location. So let's add support for dwarf2 encoding (using a location expression) of data member locations. This reverts commit r198385. llvm-svn: 198389	2014-01-03 01:30:05 +00:00
David Blaikie	2ada116a34	Reverting r193835 due to weirdness with Go... Apologies for the noise - we're seeing some Go failures with cgo interacting with Clang's debug info due to this change. llvm-svn: 198385	2014-01-03 00:48:38 +00:00
Quentin Colombet	1fb3362a6e	[RegAlloc] Make tryInstructionSplit less aggressive. The greedy register allocator tries to split a live-range around each instruction where it is used or defined to relax the constraints on the entire live-range (this is a last chance split before falling back to spill). The goal is to have a big live-range that is unconstrained (i.e., that can use the largest legal register class) and several small local live-range that carry the constraints implied by each instruction. E.g., Let csti be the constraints on operation i. V1= op1 V1(cst1) op2 V1(cst2) V1 live-range is constrained on the intersection of cst1 and cst2. tryInstructionSplit relaxes those constraints by aggressively splitting each def/use point: V1= V2 = V1 V3 = V2 op1 V3(cst1) V4 = V2 op2 V4(cst2) Because of how the coalescer infrastructure works, each new variable (V3, V4) that is alive at the same time as V1 (or its copy, here V2) interfere with V1. Thus, we end up with an uncoalescable copy for each split point. To make tryInstructionSplit less aggressive, we check if the split point actually relaxes the constraints on the whole live-range. If it does not, we do not insert it. Indeed, it will not help the global allocation problem: - V1 will have the same constraints. - V1 will have the same interference + possibly the newly added split variable VS. - VS will produce an uncoalesceable copy if alive at the same time as V1. <rdar://problem/15570057> llvm-svn: 198369	2014-01-02 22:47:22 +00:00
Hal Finkel	860fa9052e	[PPC] Fix comment to match function name llvm-svn: 198362	2014-01-02 22:09:39 +00:00
Eric Christopher	94932438d4	Remove comments on CU skeleton construction, they're probably obvious. llvm-svn: 198361	2014-01-02 22:04:47 +00:00
Hal Finkel	1d429f2ee0	[PPC] Fix the scheduling of CR logicals on the P7 CR logicals (crand, crxor, etc.) on the P7 need to be in the first slot of each dispatch group. The old itinerary entry was just wrong (but has not mattered because we don't generate these instructions). This will matter when, in an upcoming commit, we start generating these instructions. llvm-svn: 198359	2014-01-02 21:38:26 +00:00
Eric Christopher	d8beca3b78	Elaborate on comment for skeleton CU construction. llvm-svn: 198358	2014-01-02 21:38:18 +00:00
Eric Christopher	40734c4c0c	Revert seemingly unnecessary section sym for the data section. llvm-svn: 198357	2014-01-02 21:38:13 +00:00
Hal Finkel	77c8dc1da3	[PPC] Use the correct immediate operands on 64-bit instructions Several of the 64-bit fixed-point instructions with immediate operands were using the 32-bit (i32) operand nodes instead of the corresponding 64-bit (i64) operand definitions (u16imm instead of u16imm64, for example). This error has had no effect so far, but would have caused type-checking violations with an upcoming change. llvm-svn: 198356	2014-01-02 21:26:59 +00:00
Hal Finkel	decb024c86	Disable compare sinking in CodeGenPrepare when multiple condition registers are available As noted in the comment above CodeGenPrepare::OptimizeInst, which aggressively sinks compares to reduce pressure on the condition register(s), for targets such as PowerPC with multiple condition registers, this may not be the right thing to do. This adds an HasMultipleConditionRegisters boolean to TLI, and CodeGenPrepare::OptimizeInst is skipped when HasMultipleConditionRegisters is true. This functionality will be used by the PowerPC backend in an upcoming commit. Especially when the PowerPC backend starts tracking individual condition register bits as separate allocatable entities (which will happen in this upcoming commit), this sinking from CodeGenPrepare::OptimizeInst is significantly suboptimal. llvm-svn: 198354	2014-01-02 21:13:43 +00:00
Andrew Trick	b6bc783060	indvars: cleanup the IV visitor. It does more than gather sext/zext info. llvm-svn: 198353	2014-01-02 21:12:11 +00:00
Eric Christopher	d4368fde45	Fix up a couple of review comments: Use an if statement instead of a pair of ternary operators checking the same condition. Use a cheap method call rather than returning the local symbol. llvm-svn: 198351	2014-01-02 21:03:28 +00:00
Eric Christopher	8bdb6e1d49	Simplify conditional. llvm-svn: 198350	2014-01-02 21:03:22 +00:00
Matt Arsenault	00436ea156	Allow addrspacecast in global aliases llvm-svn: 198349	2014-01-02 20:55:01 +00:00
Hal Finkel	a8c1f46767	[TableGen] Correctly generate implicit anonymous prototype defs in multiclasses Even within a multiclass, we had been generating concrete implicit anonymous defs when parsing values (generally in value lists). This behavior was incorrect, and led to errors when multiclass parameters were used in the parameter list of the implicit anonymous def. If we had some multiclass: multiclass mc<string n> { ... : SomeClass<SomeOtherClass<n> > The capture of the multiclass parameter 'n' would not work correctly, and depending on how the implicit SomeOtherClass was used, either TableGen would ignore something it shouldn't, or would crash. To fix this problem, when inside a multiclass, we generate prototype anonymous defs for implicit anonymous defs (just as we do for explicit anonymous defs). Within the multiclass, the current record prototype is populated with a node that is essentially: !cast<SomeOtherClass>(!strconcat(NAME, anon_value_name)). This is then resolved to the correct concrete anonymous def, in the usual way, when NAME is resolved during multiclass instantiation. llvm-svn: 198348	2014-01-02 20:47:09 +00:00
Matt Arsenault	461c8e0a8c	Delete unread globals through addrspacecast llvm-svn: 198346	2014-01-02 20:01:43 +00:00
Matt Arsenault	da1deabb16	Fix addrspacecast with metadata globals llvm-svn: 198345	2014-01-02 19:53:49 +00:00
Lang Hames	8e6e6abf53	Remove redundant fold call introduced in r195944. Thanks very much to Juergen for pointing this out. llvm-svn: 198341	2014-01-02 19:38:41 +00:00
Hal Finkel	f2a0b2b340	[TableGen] Use the same anonymous name as the prefix on all multiclass defs TableGen had been generating a different name for an anonymous multiclass's NAME for every def in the multiclass. This had an unfortunate side effect: it was impossible to reference one def within the multiclass from another (in the parameter list, for example). By making sure we only generate an anonymous name once per multiclass (which, as it turns out, requires only changing the name parameter to reference type), we can now concatenate NAME within the multiclass with a def name in order to generate a reference to that def. This does not matter so much, in and of itself, but is necessary for a follow-up commit that will fix variable capturing in implicit anonymous multiclass defs (and that is important). llvm-svn: 198340	2014-01-02 19:35:33 +00:00
Andrew Trick	020dd898fc	indvars: insert truncate at loop boundary to avoid redundant IVs. When widening an IV to remove s/zext, we generally try to eliminate the original narrow IV. However, LCSSA phi nodes outside the loop were still using the original IV. Clean this up more aggressively to avoid redundancy in generated code. llvm-svn: 198338	2014-01-02 19:29:38 +00:00
Craig Topper	66c20f344e	Mark REX64_PREFIX as In64BitMode, remove hack from X86RecognizableInstr. llvm-svn: 198336	2014-01-02 19:12:10 +00:00
David Blaikie	7a2380486c	Make llvm::Regex non-copyable but movable. Based on a patch by Maciej Piechotka. llvm-svn: 198334	2014-01-02 19:04:59 +00:00
Adrian Prantl	fd3279f27f	Revert "Debug info: Add enumerators to the __apple_names accelerator table." This reverts r197927 until the discussion on llvm-commits comes to a conclusion. llvm-svn: 198333	2014-01-02 18:48:24 +00:00
Craig Topper	eabdbcb8a9	Mark PUSHFS64/PUSHGS64/POPFS64/POPGS64 as In64BitMode and remove the hack from the disassembler table builder. llvm-svn: 198327	2014-01-02 18:20:48 +00:00
Craig Topper	9dd48c8ed4	Mark all x86 Int_ and _Int patterns as isCodeGenOnly so the disassembler table builder doesn't need to string match them to exclude them. llvm-svn: 198323	2014-01-02 17:28:14 +00:00
Logan Chien	05ae744813	[arm] Add softvfp to supported FPU names. llvm-svn: 198313	2014-01-02 15:50:02 +00:00
Rafael Espindola	d89b16dcb8	Make the ARM ABI selectable via SubtargetFeature. This patch makes it possible to select the ABI with -mattr. It will be used to forward clang's -target-abi option to llvm's CodeGen. llvm-svn: 198304	2014-01-02 13:40:08 +00:00
Arnold Schwaighofer	0d10a9d579	BasicAA: Fix value equality and phi cycles When there are cycles in the value graph we have to be careful interpreting "Value" identity as "value" equivalence. We interpret the value of a phi node as the value of its operands. When we check for value equivalence now we make sure that the "Value" dominates all cycles (phis). %0 = phi [%noaliasval, %addr2] %l = load %ptr %addr1 = gep @a, 0, %l %addr2 = gep @a, 0, (%l + 1) store %ptr ... Before this patch we would return NoAlias for (%0, %addr1) which is wrong because the value of the load is from different iterations of the loop. Tested on x86_64 -mavx at O3 and O3 -flto with no performance or compile time regressions. PR18068 radar://15653794 llvm-svn: 198290	2014-01-02 03:31:36 +00:00
Rafael Espindola	6994fdf33c	Remove the 's' DataLayout specification During the years there have been some attempts at figuring out how to align byval arguments. A look at the commit log suggests that they were * Use the ABI alignment. * When that was not sufficient for x86-64, I added the 's' specification to DataLayout. * When that was not sufficient Evan added the virtual getByValTypeAlignment. * When even that was not sufficient, we just got the FE to add the alignment to the byval. This patch is just a simple cleanup that removes my first attempt at fixing the problem. I also added an AArch64 implementation of getByValTypeAlignment to make sure this patch is a nop. I also left the 's' parsing for backward compatibility. I will send a short email to llvmdev about the change for anyone maintaining an out of tree target. llvm-svn: 198287	2014-01-01 22:29:43 +00:00
Venkatraman Govindaraju	9a3da52ea2	[Sparc] Handle atomic loads/stores in sparc backend. llvm-svn: 198286	2014-01-01 22:11:54 +00:00
Craig Topper	3321c99a06	Remove modifierType/Base from X86 disassembler tables as they are no longer used. Removes ~11.5K from static tables. llvm-svn: 198284	2014-01-01 21:52:57 +00:00
Venkatraman Govindaraju	77011e861b	[SparcV9]: Custom lower UMULO/SMULO so that the arguments are sent to __multi3() in correct order. llvm-svn: 198281	2014-01-01 20:22:45 +00:00
Venkatraman Govindaraju	acf0233a46	[SparcV9]: Use SRL instead of SLL to clear top 32-bits in ctpop:i32. SLL does not clear top 32 bit, only SRL does. llvm-svn: 198280	2014-01-01 19:00:10 +00:00
NAKAMURA Takumi	545b6803c3	X86Disassembler.cpp: Prune stray @return on translateFPRegister(). [-Wdocumentation] llvm-svn: 198279	2014-01-01 16:19:26 +00:00
Craig Topper	9155118602	Remove need for MODIFIER_OPCODE in the disassembler tables. AddRegFrms are really more like OrRegFrm so we don't need a difference since we can just mask bits. llvm-svn: 198278	2014-01-01 15:29:32 +00:00
Elena Demikhovsky	de3f751baf	AVX-512: Added intrinsics for vcvt, vcvtt, vrndscale, vcmp Printing rounding control. Encoding for EVEX_RC (rounding control). llvm-svn: 198277	2014-01-01 15:12:34 +00:00
Craig Topper	623b0d64b3	Second attempt at Removing special form of AddRegFrm used by FP instructions. These instructions can be handled by MRMXr instead. llvm-svn: 198276	2014-01-01 14:22:37 +00:00
Nick Lewycky	2d4ba2ebba	Fold vector selects with undef elements in the condition. Fixes PR18319. Patch by Ilia Filippov! llvm-svn: 198267	2013-12-31 19:30:47 +00:00
Craig Topper	e98c8cb9f0	Revert r198238 and add FP disassembler tests. It didn't work and I didn't realize we had no FP disassembler test cases. llvm-svn: 198265	2013-12-31 17:21:44 +00:00
Craig Topper	b771ffaf4c	Remove old comment referring to an argument that no longer exists. llvm-svn: 198263	2013-12-31 15:29:14 +00:00
Mark Seaborn	c3bd177ec2	Fix misaligned indentation in "if" block in MipsMCCodeEmitter.cpp llvm-svn: 198262	2013-12-31 13:05:15 +00:00
Craig Topper	df912ba6ec	Add missing MRM_XX forms to the old JIT emitter for consistency. llvm-svn: 198258	2013-12-31 03:26:24 +00:00
Craig Topper	99f02458e5	Remove MRMInitReg form now that its last use is gone. llvm-svn: 198257	2013-12-31 03:19:03 +00:00
Alp Toker	1bcdd6ae02	Silence g++ 4.9 build issue lib/Support/ThreadLocal.cpp:53:15: error: typedef 'SIZE_TOO_BIG' locally defined but not used [-Werror=unused-local-typedefs] typedef int SIZE_TOO_BIG[sizeof(pthread_key_t) <= sizeof(data) ? 1 : -1]; Done the C++11 way, switching on and using LLVM_STATIC_ASSERT() instead of LLVM_ATTRIBUTE_UNUSED. llvm-svn: 198255	2013-12-31 03:16:55 +00:00
Craig Topper	854f644781	Handle MOV32r0 in expandPostRAPseudo instead of MCInst lowering. No functional change intended. llvm-svn: 198254	2013-12-31 03:05:38 +00:00
Craig Topper	258ab6abc9	Merge case statements to remove redundant code. llvm-svn: 198241	2013-12-30 19:47:49 +00:00
Craig Topper	0e21bca6dd	Remove special form of AddRegFrm used by FP instructions. These instructions can be handled by MRMXr instead. llvm-svn: 198238	2013-12-30 19:16:48 +00:00
Saleem Abdulrasool	e3a9dc134d	ARM IAS: account for predicated pre-UAL mnemonics Checking the trailing letter of the mnemonic is insufficient. Be more thorough in the scanning of the instruction to ensure that we correctly work with the predicated mnemonics. llvm-svn: 198235	2013-12-30 18:38:01 +00:00
Eric Christopher	05893f475b	Refactor and reduce code duplication for non-split dwarf strings. llvm-svn: 198233	2013-12-30 18:32:31 +00:00
Eric Christopher	d86672037b	Revert r198208 and reapply: r198196: Use a pointer to keep track of the skeleton unit for each normal unit and construct it up front. r198199: Reapply r198196 with a fix to zero initialize the skeleton pointer. r198202: Fix aranges and split dwarf by ensuring that the symbol and relocation back to the compile unit from the aranges section is to the skeleton unit and not the one in the dwo. with a fix to use integer 0 for DW_AT_low_pc since the relocation to the text section symbol was causing issues with COFF. Accordingly remove addLocalLabelAddress and machinery since we're not currently using it. llvm-svn: 198222	2013-12-30 17:22:27 +00:00
NAKAMURA Takumi	17b7310858	Revert r198199 (and r198202). It broke 3 DebugInfo tests for targeting i686-cygming. r198196: Use a pointer to keep track of the skeleton unit for each normal unit and construct it up front. r198199: Reapply r198196 with a fix to zero initialize the skeleton pointer. r198202: Fix aranges and split dwarf by ensuring that the symbol and relocation back to the compile unit from the aranges section is to the skeleton unit and not the one in the dwo. They could be reproducible with explicit target. llvm/lib/MC/WinCOFFObjectWriter.cpp:224: bool {anonymous}::COFFSymbol::should_keep() const: Assertion `Section->Number != -1 && "Sections with relocations must be real!"' failed. llvm-svn: 198208	2013-12-30 09:26:10 +00:00
Eric Christopher	c2d401e952	Fix aranges and split dwarf by ensuring that the symbol and relocation back to the compile unit from the aranges section is to the skeleton unit and not the one in the dwo. Do this by adding a method to grab a forwarded on local sym and local section by querying the skeleton if one exists and using that. Add a few tests to verify the relocations are back to the correct section. llvm-svn: 198202	2013-12-30 05:25:49 +00:00
Bill Wendling	6c1d9599d4	Keep comment with 'Subtarget' ivar. llvm-svn: 198201	2013-12-30 05:17:29 +00:00
Eric Christopher	d039baad05	Reapply r198196 with a fix to zero initialize the skeleton pointer. llvm-svn: 198199	2013-12-30 03:40:32 +00:00
Eric Christopher	be4c91c57c	Temporarily revert "Use a pointer to keep track of the skeleton unit for each normal unit" as it seems to be causing problems in the asan tests. llvm-svn: 198197	2013-12-30 03:12:31 +00:00
Eric Christopher	83fff3fce7	Use a pointer to keep track of the skeleton unit for each normal unit and construct it up front. Add address ranges at the end and a helper routine so that we're not needlessly using an indirection in the case of split dwarf. Update testcases according to the new ordering of attributes on the compile unit. llvm-svn: 198196	2013-12-30 03:02:12 +00:00
Jiangning Liu	a0acf70af1	For AArch64 Neon, simplify scalar dup by lane0 for fp. llvm-svn: 198194	2013-12-30 02:44:35 +00:00
Hao Liu	fe3bfc8c41	[AArch64]Add code to spill/fill Q register tuples such as QPair/QTriple/QQuad. llvm-svn: 198193	2013-12-30 02:38:12 +00:00
Hao Liu	b591f835d6	[AArch64]Can't select shift left 0 of type v1i64 llvm-svn: 198192	2013-12-30 02:12:46 +00:00
Kevin Qin	ede9ce1933	Fix a bug in DAGCombiner about zero-extend after setcc. For the AArch64 backend, if DAGCombiner sees "sext(setcc)", it will combine them into a single setcc with an extended value type. Then if it sees "zext(setcc)", it assumes setcc is Vxi1 and tries to create "(and (vsetcc), (1, 1, ...)". When setcc isn't Vxi1, DAGCombiner will create a wrong node and wrong code gets emitted. llvm-svn: 198190	2013-12-30 02:05:13 +00:00
Hao Liu	74107fe526	[AArch64]Fix the problem that can't select mul of v1i64/v2i64 types. E.g. Can't select such IR: %tmp = mul <2 x i64> %a, %b llvm-svn: 198188	2013-12-30 01:38:41 +00:00
Nico Weber	1226531099	Set LLVM_EXPORTED_SYMBOL_FILE in CMakeLists whose corresponding Makefiles do so. (unittests/ExecutionEngine/JIT/CMakeLists.txt is still missing for now, since it handles export files in a strange way: It generates a .exports file from a .def file instead of the other way round.) llvm-svn: 198183	2013-12-29 23:06:49 +00:00
Saleem Abdulrasool	aca443c02c	ARM IAS: fix after r198172 The DPR and SPR register lists are also register lists. Furthermore, the registers need not be checked individually since the register type can be checked via the list kind. Use that to simplify the logic and fix the incorrect assertion. llvm-svn: 198174	2013-12-29 18:53:16 +00:00
Saleem Abdulrasool	4da9c6e566	ARM: provide VFP aliases for pre-V6 mnemonics In order to provide compatibility with the GNU assembler, provide aliases for pre-UAL mnemonics for floating point operations. llvm-svn: 198172	2013-12-29 17:58:35 +00:00
Saleem Abdulrasool	a1937cbc62	ARM: fix a few typos in comments llvm-svn: 198171	2013-12-29 17:58:31 +00:00
Saleem Abdulrasool	da96a81ee6	ARM: fix typo in VFP instruction definition The vstm family of VFP instructions belong to the VFP store itinerary class, not the VFP load itinerary class. llvm-svn: 198170	2013-12-29 17:58:27 +00:00
Mark Seaborn	774c24385e	Fix indentation alignment of a declaration in MipsMCCodeEmitter.cpp llvm-svn: 198162	2013-12-29 10:47:04 +00:00
Bill Wendling	76cce1906a	Store the global variable that's created so that it's reclaimed afterwards. This plugs a memory leak in ARM's FastISel by storing the GV in Module so that it's reclaimed. PR17978 llvm-svn: 198160	2013-12-29 08:00:04 +00:00
Venkatraman Govindaraju	3e3a29a2e9	[SparcV9] Use separate instruction patterns for 64 bit arithmetic instructions instead of reusing 32 bit instruction patterns. This is done to avoid spilling the result of the 64-bit instructions to a 4-byte slot. llvm-svn: 198157	2013-12-29 07:15:09 +00:00
Venkatraman Govindaraju	5ac9c8faec	[SparcV9] For codegen generated library calls that return float, set inreg flag manually in LowerCall(). This makes the sparc backend generate Sparc64 ABI compliant code. llvm-svn: 198149	2013-12-29 04:27:21 +00:00
Craig Topper	a448bd868f	Make more of the x86 lowering helper functions static. llvm-svn: 198146	2013-12-29 01:48:38 +00:00
Venkatraman Govindaraju	0776cc0acd	[SparcV9]: Implement lowering of long double (fp128) arguments in Sparc64 ABI. Also, pass fp128 arguments to varargs through integer registers if necessary. llvm-svn: 198145	2013-12-29 01:20:36 +00:00
Craig Topper	059e8e0da1	Switch from EVT to MVT in more of the x86 instruction lowering code. llvm-svn: 198144	2013-12-29 01:10:06 +00:00
Saleem Abdulrasool	7230b377df	CodeGen: silence a C++11 feature warning llvm-svn: 198133	2013-12-28 22:47:55 +00:00
Saleem Abdulrasool	0c4b10264b	ARM IAS: handle errors more appropriately Directive parsers must return false if the target assembler is interested in handling the directive. The Error member function returns true always. Using the 'return Error()' pattern would incorrectly indicate to the general parser that the target was not interested in the directive, when in reality it simply encountered a badly formed directive or some other error. This corrects the behaviour to ensure that the parser behaves appropriately. llvm-svn: 198132	2013-12-28 22:47:53 +00:00
Andrew Trick	7afe481801	Uninitialized variable (in never taken path) after factoring. llvm-svn: 198131	2013-12-28 22:25:57 +00:00
Andrew Trick	3ca67d6404	New machine model for cortex-a9. Schedule for resources and latency. Schedule more conservatively to account for stalls on floating point resources and latency. Use the AGU resource to model latency stalls since it's shared between FP and LD/ST instructions. This might not be completely accurate but should work well in practice. llvm-svn: 198125	2013-12-28 21:57:05 +00:00
Andrew Trick	33e05d7665	Added debugging options: -misched-only-func/block llvm-svn: 198124	2013-12-28 21:57:02 +00:00
Andrew Trick	03b22e39be	The Cortex-A9 machine model is incomplete. Mark it as such. Many vector operations never had itineraries. Since the new machine model was a mapping from existing itinerary classes, we don't have a model for these. We still want to migrate A9 even though no one has invested in a complete model, so mark it incomplete to avoid the scheduler asserting. llvm-svn: 198123	2013-12-28 21:57:00 +00:00
Andrew Trick	d14d7c20f5	Add a PostMachineScheduler pass with generic implementation. PostGenericScheduler uses either the new machine model or the hazard checker for top-down scheduling. Most of the infrastructure for PreRA machine scheduling is reused. With some tuning, this should allow MachineScheduler to be default for all ARM targets, including cortex-A9, using the new machine model. Likewise, with additional tuning, it should be able to replace PostRAScheduler for all targets. The PostMachineScheduler pass does not currently run the AntiDepBreaker. There is less need for it on targets that are already running preRA MachineScheduler. I want to prove it's necessary before committing to the maintenance burden. The PostMachineScheduler also currently removes kill flags and adds them all back later. This is a bit ridiculous. I'd prefer passes to directly use a liveness utility than rely on flags. A test case that enables this scheduler will be included in a subsequent checkin that updates the A9 model. llvm-svn: 198122	2013-12-28 21:56:57 +00:00
Andrew Trick	6b104f8b9e	Move the PostRA scheduler's fixupKills function for reuse. llvm-svn: 198121	2013-12-28 21:56:55 +00:00
Andrew Trick	17080b9bf2	Stub out a PostMachineScheduler pass. Placeholder and boilerplate for a PostRA MachineScheduler pass. llvm-svn: 198120	2013-12-28 21:56:51 +00:00
Andrew Trick	d7f890edb0	Factor MI-Sched in preparation for post-ra scheduling support. Factor the MachineFunctionPass into MachineSchedulerBase. Split the DAG class into ScheduleDAGMI and SchedulerDAGMILive. llvm-svn: 198119	2013-12-28 21:56:47 +00:00
Craig Topper	bf096926c9	Use getSimpleValueType in a few spots where the type should be simple. llvm-svn: 198117	2013-12-28 18:35:48 +00:00
Craig Topper	e829fe42af	Minor indentation fix to match other switch statements. Change llvm_unreachable text to match similar places. llvm-svn: 198116	2013-12-28 17:37:32 +00:00
Craig Topper	8c4ac147ec	Mark some Type and EVT methods as LLVM_READONLY. llvm-svn: 198115	2013-12-28 16:17:26 +00:00
Andrea Di Biagio	eaceba0ed0	[X86] Teach the backend how to fold target specific dag node for packed vector shift by immediate count (VSHLI/VSRLI/VSRAI) into a build_vector when the vector in input to the shift is a build_vector of all constants or UNDEFs. Target specific nodes for packed shifts by immediate count are in general introduced by function 'getTargetVShiftByConstNode' (in X86ISelLowering.cpp) when lowering shift operations, SSE/AVX immediate shift intrinsics and (only in very few cases) SIGN_EXTEND_INREG dag nodes. This patch adds extra rules for simplifying vector shifts inside function 'getTargetVShiftByConstNode'. Added file test/CodeGen/X86/vec_shift5.ll to verify that packed shifts by immediate are correctly folded into a build_vector when the input vector to the shift dag node is a vector of constants or undefs. llvm-svn: 198113	2013-12-28 11:11:52 +00:00
Saleem Abdulrasool	51cff7199d	AsmParser: cleanup diagnostics for .rep/.rept Avoid double diagnostics for invalid expressions for count. Improve caret location for negative count. llvm-svn: 198099	2013-12-28 06:39:29 +00:00
Saleem Abdulrasool	d743d0ab8c	IAS: support .rep as an alias for .rept The GNU assembler supports .rep as an alias for .rept. This simply creates the alias for it and introduces a test for both .rept and .rep. llvm-svn: 198097	2013-12-28 05:54:33 +00:00
Saleem Abdulrasool	83e3770ae7	ARMAsmParser: fix typo in comment llvm-svn: 198095	2013-12-28 03:07:12 +00:00
Chandler Carruth	f5689f8304	Disable transforms that introduce calls to exp10*() on Linux due to widespread glibc bugs. The glibc implementation of exp10 has a very serious precision bug in version 2.15 (and older versions). This is still very widely used (the current Ubuntu LTS for example uses it) and so it isn't reasonable to make transforms that produce these functions. This fixes many miscompiles introduced when we started transforming pow(10.0, ...) into exp10, and it may have fixed other latent miscompiles where exp10 provided sufficient precision but exp10f did not. This is all really horrible. The primary bug has been fixed for over a year and glibc 2.18 works correctly for the test cases I have, but it will be 2017 before the LTS using 2.15 is no longer supported by Ubuntu (and thus reasonable for folks to be relying on). =[ We're either going to need to live without these optimizations, or find a way to switch behavior more dynamically than using simply the fact that the OS is "Linux". To make matters worse, there appears to be significant testing and fixing of numerous other bugs in the exp10 family of functions right now in glibc. While those haven't been causing problems I've seen in the wild, it gives me concerns that we may need to wait until an even later release of glibc before we can reliably transform code into exp10. llvm-svn: 198093	2013-12-28 02:40:19 +00:00
Eric Christopher	8458862f20	Remove AsmPrinter::needsRelocationsForDwarfStringPool() since it's just calling into MAI and is only abstracting for a single interface that we actually need to check in multiple places. llvm-svn: 198092	2013-12-28 01:39:17 +00:00
Andrea Di Biagio	46dcddb350	Teach DAGCombiner how to fold a SIGN_EXTEND_INREG of a BUILD_VECTOR of ConstantSDNodes (or UNDEFs) into a simple BUILD_VECTOR. For example, given the following sequence of dag nodes: i32 C = Constant<1> v4i32 V = BUILD_VECTOR C, C, C, C v4i32 Result = SIGN_EXTEND_INREG V, ValueType:v4i1 The SIGN_EXTEND_INREG node can be folded into a build_vector since the vector in input is a BUILD_VECTOR of constants. The optimized sequence is: i32 C = Constant<-1> v4i32 Result = BUILD_VECTOR C, C, C, C llvm-svn: 198084	2013-12-27 20:20:28 +00:00
David Blaikie	ac2002973c	DebugInfo: Remove dead code, DICompositeType::addMember(DIDescriptor D) It's no longer necessary to lazily add members to the DICompositeType member list. Instead any lazy members (special member functions and member template instantiations) are added to the parent late based on their context link, the same way that nested types have always been handled (never being in the member list - just added to the parent DIE lazily based on context). Clang's been updated not to use this function anymore as it improves type unit consistency by never emitting lazy members in type units. llvm-svn: 198079	2013-12-27 19:11:52 +00:00
Chandler Carruth	87c3a0cfa6	Use two variables here rather than reusing (and abusing) one. This is much more clear to me. I meant to make this change before committing the original patch, but forgot to merge it in. Sorry. llvm-svn: 198069	2013-12-27 04:44:35 +00:00
Chandler Carruth	f8c5281c87	Introduce a simple line-by-line iterator type into the Support library. This is an iterator which you can build around a MemoryBuffer. It will iterate through the non-empty, non-comment lines of the buffer as a forward iterator. It should be small and reasonably fast (although it could be made much faster if anyone cares, I don't really...). This will be used to more simply support the text-based sample profile file format, and is largely based on the original patch by Diego. I've re-worked the style of it and separated it from the work of producing a MemoryBuffer from a file which both simplifies the interface and makes it easier to test. The style of the API follows the C++ standard naming conventions to fit in better with iterators in general, much like the Path and FileSystem interfaces follow standard-based naming conventions. llvm-svn: 198068	2013-12-27 04:28:57 +00:00
Reid Kleckner	f4355eef5e	TLI: Make exp10* available on Linux/Mac/iOS and unavailable elsewhere This makes it unavailable on NetBSD, Android, etc. Patch by Brad Smith! llvm-svn: 198056	2013-12-26 19:17:04 +00:00
Joerg Sonnenberger	a13f8b4f36	Recognize armv7a and friends as aliases for armv7-a etc. for the purpose of architecture naming. llvm-svn: 198043	2013-12-26 11:50:28 +00:00
Saleem Abdulrasool	a554968dde	ARM IAS: support .even directive The .even directive aligns content to an even-numbered address. This is an ARM specific directive applicable to any section. llvm-svn: 198031	2013-12-26 01:52:28 +00:00
Venkatraman Govindaraju	bf683fd15c	[Sparc] Lower and MachineInstr to MC and print assembly using MCInstPrinter. llvm-svn: 198030	2013-12-26 01:49:59 +00:00
Venkatraman Govindaraju	08bcf29068	[Sparc] Add target specific MCExpr class to handle sparc specific modifiers like %hi, %lo, etc., llvm-svn: 198029	2013-12-26 00:01:52 +00:00
Venkatraman Govindaraju	0b938652d3	[Sparc] Add MCInstPrinter implementation for SPARC. llvm-svn: 198028	2013-12-25 23:43:39 +00:00
Simon Atanasyan	fde102cb77	[Mips] Do not take into account the 'use-soft-float' attribute's value when considering whether to generate stubs for mips16 hard-float mode. The patch was reviewed by Reed Kotler. llvm-svn: 198019	2013-12-25 17:00:27 +00:00
Alexander Potapenko	4f0335f863	[ASan] Fix the test for __asan_gen_ globals and actually fix http://llvm.org/bugs/show_bug.cgi?id=17976 by setting the correct linkage (as stated in the bug). llvm-svn: 198018	2013-12-25 16:46:27 +00:00
Alexander Potapenko	daf96ae81b	[ASan] Make sure none of the __asan_gen_ global strings end up in the symbol table, add a test. This should fix http://llvm.org/bugs/show_bug.cgi?id=17976 Another test checking for the global variables' locations and prefixes on Darwin will be committed separately. llvm-svn: 198017	2013-12-25 14:22:15 +00:00
Elena Demikhovsky	371e363833	AVX-512: decoder for AVX-512, made by Alexey Bader. llvm-svn: 198013	2013-12-25 11:40:51 +00:00
Zoran Jovanovic	bd28c373c4	Support for microMIPS load effective address. llvm-svn: 198010	2013-12-25 10:14:07 +00:00
Zoran Jovanovic	8876be39c7	Support for microMIPS FPU instructions 2. llvm-svn: 198009	2013-12-25 10:09:27 +00:00
Elena Demikhovsky	b64d7e8586	AVX-512: Result type of scalar SETCC is MVT::i1 for AVX-512. llvm-svn: 198008	2013-12-25 10:06:40 +00:00
Hao Liu	83799741fb	[AArch64]Fix a problem that the register order of fmls/fmla by element is incorrect. E.g. the codegen result is fmls v1.2s, v0.2s, v2.s[3] which is expected to be fmls v0.2s, v1.2s, v2.s[3] llvm-svn: 198001	2013-12-25 07:12:34 +00:00
Richard Sandiford	002019a285	Fix typo. llvm-svn: 197986	2013-12-24 15:22:39 +00:00
Richard Sandiford	41350a52ca	[SystemZ] Use interlocked-access 1 instructions for CodeGen ...namely LOAD AND ADD, LOAD AND AND, LOAD AND OR and LOAD AND EXCLUSIVE OR. LOAD AND ADD LOGICAL isn't really separately useful for LLVM. I'll look at adding reusing the CC results in new year. llvm-svn: 197985	2013-12-24 15:18:04 +00:00
Richard Sandiford	45645a2c1c	[SystemZ] Add MC support for interlocked-access 1 instructions llvm-svn: 197984	2013-12-24 15:14:05 +00:00
Elena Demikhovsky	64c9548d66	AVX-512: fixed some patterns for MVT::i1 llvm-svn: 197981	2013-12-24 14:24:07 +00:00
Hao Liu	ce7a12be8f	[AArch64]Add patterns to match normal shift nodes: shl, sra and srl. llvm-svn: 197969	2013-12-24 09:00:21 +00:00
Kevin Qin	82bd84aadf	[AArch64 NEON] Fix a bug when lowering BUILD_VECTOR. DAG.getVectorShuffle() doesn't always return a vector_shuffle node. If mask is the exact sequence of its operand (for example, operand_0 is v8i8, and the mask is 0, 1, 2, 3, 4, 5, 6, 7), it will directly return that operand. So a check is added here. llvm-svn: 197967	2013-12-24 08:16:06 +00:00
Kevin Qin	cd5f3153f5	[AArch64 NEON] Fix a pattern match failure with NEON_VDUP. This failure caused by improper condition when lowering shuffle_vector to scalar_to_vector. After this patch NEON_VDUP with v1i64 will not be generated. llvm-svn: 197966	2013-12-24 08:11:47 +00:00
Ana Pazos	bc2996b30f	[AArch64] Check fmul node single use in fused multiply patterns Check for single use of fmul node in fused multiply patterns to allow generation of fused multiply add/sub instructions. Otherwise fmul operation ends up being repeated more than once which does not help performance on targets with only one MAC unit, as for example cortex-a53. llvm-svn: 197929	2013-12-24 00:47:29 +00:00
Ana Pazos	3ca23915cd	[AArch64 NEON] Fixed fused multiply negate add/sub patterns The correct pattern matching should be: - fnmadd is (-Ra) + (-Rn)Rm which should be matched as: fma (fneg node:$Rn), node:$Rm, (fneg node:$Ra) and as (f32 (fsub (f32 (fneg FPR32:$Ra)), (f32 (fmul FPR32:$Rn, FPR32:$Rm)))) - fnmsub is (-Ra) + RnRm which should be matched as fma node:$Rn, node:$Rm, (fneg node:$Ra) and as (f32 (fsub (f32 (fmul FPR32:$Rn, FPR32:$Rm)), FPR32:$Ra)))) llvm-svn: 197928	2013-12-24 00:40:10 +00:00
Adrian Prantl	ad64aeac44	Debug info: Add enumerators to the __apple_names accelerator table. rdar://problem/11516681. llvm-svn: 197927	2013-12-23 23:50:20 +00:00
Andrew Trick	0ba77a0740	Add support to indvars for optimizing sadd.with.overflow. Split sadd.with.overflow into add + sadd.with.overflow to allow analysis and optimization. This should ideally be done after InstCombine, which can perform code motion (eventually indvars should run after all canonical instcombines). We want ISEL to recombine the add and the check, at least on x86. This is currently under an option for reducing live induction variables: -liv-reduce. The next step is reducing liveness of IVs that are live out of the overflow check paths. Once the related optimizations are fully developed, reviewed and tested, I do expect this to become default. llvm-svn: 197926	2013-12-23 23:31:49 +00:00
Adrian Prantl	edb61f02b6	Debug info: On ARM ensure that the data sections come before the (optional) DWARF sections, so compiling with -g does not result in different code being generated. rdar://problem/15623193 llvm-svn: 197922	2013-12-23 22:24:47 +00:00
Saleem Abdulrasool	701875542d	ARM: bkpt has an implicit immediate constant 0 The bkpt mnemonic has an implicit immediate constant of 0 unless otherwise specified. Add an instruction alias for the unvalued breakpoint mnemonic to treat it as a 0. This improves compatibility with GNU AS. Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org> llvm-svn: 197913	2013-12-23 17:23:58 +00:00
Richard Sandiford	1fb5c13e3a	Fix Scalarizer insertion point when replacing PHIs with insertelements If the Scalarizer scalarized a vector PHI but could not scalarize all uses of it, it would insert a series of insertelements to reconstruct the vector PHI value from the scalar ones. The problem was that it would emit these insertelements immediately after the PHI, even if there were other PHIs after it. llvm-svn: 197909	2013-12-23 14:51:56 +00:00
Richard Sandiford	3548cbb980	Fix Scalarizer handling of vector GEPs with multiple index operands The old code only worked for one index operand. Also handle "inbounds". llvm-svn: 197908	2013-12-23 14:45:00 +00:00
Kostya Serebryany	530e207d8a	[asan] don't unpoison redzones on function exit in use-after-return mode. Summary: Before this change the instrumented code before Ret instructions looked like: <Unpoison Frame Redzones> if (Frame != OriginalFrame) // I.e. Frame is fake <Poison Complete Frame> Now the instrumented code looks like: if (Frame != OriginalFrame) // I.e. Frame is fake <Poison Complete Frame> else <Unpoison Frame Redzones> Reviewers: eugenis Reviewed By: eugenis CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2458 llvm-svn: 197907	2013-12-23 14:15:08 +00:00
Kostya Serebryany	ff7bde1582	[asan] produce fewer stores when poisoning stack shadow llvm-svn: 197904	2013-12-23 09:24:36 +00:00
Roman Divacky	bc1655b4b0	Use r2 when encoding tls on ppc32. Fixes PR18305. llvm-svn: 197878	2013-12-22 10:45:37 +00:00
Benjamin Kramer	2151e63c71	Dwarf: Fix a copy-paste bug. This tag isn't emitted by any compiler at the moment. PR18306. llvm-svn: 197877	2013-12-22 10:23:23 +00:00
Elena Demikhovsky	fe24a30e38	AVX512: SETCC returns i1 for AVX-512 and i8 for all others llvm-svn: 197876	2013-12-22 10:13:18 +00:00
Roman Divacky	8854e76944	Add some comments. llvm-svn: 197875	2013-12-22 09:48:38 +00:00
Alp Toker	ce91fe5569	TableGen: Generate valid identifiers for anonymous records Backends like OptParserEmitter assume that record names can be used as valid identifiers. The period '.' in generated anonymous names broke that assumption, causing a build-time error and in practice forcing all records to be named. llvm-svn: 197869	2013-12-21 18:51:00 +00:00
Mark Lacey	1d7c97eff3	Fix typo in assert message: s/load/store llvm-svn: 197846	2013-12-21 00:00:49 +00:00
Yuchen Wu	5947c8fa99	BlockFrequencyInfo: Readded getEntryFreq. llvm-svn: 197839	2013-12-20 22:11:11 +00:00
Lang Hames	18c98a587f	ARM AnalyzeBranch should ignore DEBUG_VALUES while analyzing terminators. Found by inspection by Julien Lerouge. Thanks Julian! llvm-svn: 197833	2013-12-20 20:27:51 +00:00
Timur Iskhodzhanov	09069e0ff3	clang-format a couple of mis-formatted functions llvm-svn: 197831	2013-12-20 20:16:51 +00:00
Timur Iskhodzhanov	c1fb2d6111	[COFF] Add support for the .secidx directive Reviewed at http://llvm-reviews.chandlerc.com/D2445 llvm-svn: 197826	2013-12-20 18:15:00 +00:00
Roman Divacky	32143e2bda	Implement initial-exec TLS for PPC32. llvm-svn: 197824	2013-12-20 18:08:54 +00:00
Zoran Jovanovic	ce02486d16	Support for microMIPS FPU instructions 1. llvm-svn: 197815	2013-12-20 15:44:08 +00:00
Rafael Espindola	e23b87746a	Make this array const. llvm-svn: 197814	2013-12-20 15:21:32 +00:00
Richard Sandiford	83a0b6abd0	[SystemZ] Optimize comparisons with truncated extended loads If the extension of a loaded value is compared against zero and used in other arithmetic, InstCombine will change the comparison to use the unextended load. It's also possible that the comparison could be against the unextended load from the outset. In DAG form this becomes a truncation of an extending load. We want to strip the truncation if possible so that we can use load-and-test instructions. llvm-svn: 197804	2013-12-20 11:56:02 +00:00
Richard Sandiford	220ee49bce	[SystemZ] Extend RISBG optimization The handling of ANY_EXTEND and ZERO_EXTEND was too strict. In this context we can treat ZERO_EXTEND in much the same way as an AND and then also handle outermost ZERO_EXTENDs. I couldn't find a test that benefited from the ANY_EXTEND change, but it's more obvious to write it this way once SIGN_EXTEND and ZERO_EXTEND are handled differently. llvm-svn: 197802	2013-12-20 11:49:48 +00:00
Kai Nacke	b38bf9626a	Add support for krait cpu in llvm::sys::getHostCPUName() Recently, support for krait cpu was added. This commit extends getHostCPUName() to return krait as cpu for the APQ8064 (a Krait 300). llvm-svn: 197792	2013-12-20 09:24:13 +00:00
Justin Bogner	0ba3f211c4	Transforms: Don't create bad weights when eliminating dead cases If we happen to eliminate every case in a switch that has branch weights, we currently try to create metadata for the one remaining branch, triggering an assert. Instead, we need to check that the metadata we're trying to create is sensible. llvm-svn: 197791	2013-12-20 08:21:30 +00:00
Saleem Abdulrasool	6e6c239e33	ARM IAS: add support for the .pool directive The .pool directive is an alias for the .ltorg directive used to create a literal pool. Simply treat .pool as if .ltorg was passed. llvm-svn: 197787	2013-12-20 07:21:16 +00:00
Tom Stellard	eddfa69465	R600: Allow ftrunc v2: Add ftrunc->TRUNC pattern instead of replacing int_AMDGPU_trunc v3: move ftrunc pattern next to TRUNC definition, it's available since R600 Patch By: Jan Vesely Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 197783	2013-12-20 05:11:55 +00:00
Eric Christopher	565ab11a35	Ranges in the .debug_range section need to have begin and end labels, assert that this is so. llvm-svn: 197780	2013-12-20 04:34:22 +00:00
Eric Christopher	46e2343554	Add support for a CU to output a set of ranges for the CU. This is useful when you want to have the full list of addresses for a particular CU or when you have multiple modules linked together and can't depend upon the ordering of a single CU for begin/end ranges. llvm-svn: 197776	2013-12-20 04:16:18 +00:00
Dmitri Gribenko	8da5f7a96d	When parsing data layout string looking for endianness, use the correct default llvm-svn: 197771	2013-12-20 02:54:35 +00:00
Dmitri Gribenko	5362ad579e	Correctly apply the default pointer size llvm-svn: 197770	2013-12-20 02:46:23 +00:00
Eric Christopher	c0a5aaeab0	[x86] Rename In32BitMode predicate to Not64BitMode That's what it actually means, and with 16-bit support it's going to be a little more relevant since in a few corner cases we may actually want to distinguish between 16-bit and 32-bit mode (for example the bare 'push' aliases to pushw/pushl etc.) Patch by David Woodhouse llvm-svn: 197768	2013-12-20 02:04:49 +00:00
Alp Toker	171b0c36a3	Fix documentation typos llvm-svn: 197757	2013-12-20 00:33:39 +00:00
Kevin Enderby	36eba25fee	Un-revert: the buildbot failure in LLVM on lld-x86_64-win7 had me with this commit as the only one on the Blamelist so I quickly reverted this. However it was actually Nick's change who has since fixed that issue. Original commit message: Changed the X86 assembler for intel syntax to work with directional labels. The X86 assembler has separate code to parse the intel assembly syntax in X86AsmParser::ParseIntelOperand(). This did not parse directional labels. And if something like 1f was used as a branch target it would get an "Unexpected token" error. The fix starts in X86AsmParser::ParseIntelExpression() in the case for AsmToken::Integer, it needs to grab the IntVal from the current token then look for a 'b' or 'f' following an Integer. Then it basically needs to do what is done in AsmParser::parsePrimaryExpr() for directional labels. It saves the MCExpr it creates in the IntelExprStateMachine in the Sym field. When it returns to X86AsmParser::ParseIntelOperand() it looks for a non-zero Sym field in the IntelExprStateMachine and if set it creates a memory operand not an immediate operand it would normally do for the Integer. rdar://14961158 llvm-svn: 197744	2013-12-19 23:16:14 +00:00
Rafael Espindola	458a4851dd	Change getStringRepresentation to skip defaults. I have a pending change for clang to use getStringRepresentation to check that its DataLayout is in sync with llvm's. getStringRepresentation is not called from llvm itself, so far it is mostly a debugging aid, so the shorter strings are an independent improvement. llvm-svn: 197740	2013-12-19 23:03:03 +00:00
David Peixotto	52303f6ed3	Ensure deterministic output when printing ARM assembler constant pools We dump any non-empty assembler constant pools after a successful parse of an assembly file that uses the ldr pseudo opcode. These per-section constant pools should be output in a deterministic order to ensure that we always generate the same output when printing the output with an AsmStreamer. This patch changes the map data structure used to associate a section with its constant pool to a MapVector to ensure deterministic output. Because this map type does not support deletion, we now check that the constant pool is not empty before dumping its entries and clear the entries after emitting them with the streamer. llvm-svn: 197735	2013-12-19 22:41:56 +00:00
Kevin Enderby	d6f2a63791	Revert my change to the X86 assembler for intel syntax to work with directional labels. Because it doesn't work for windows :) llvm-svn: 197731	2013-12-19 22:24:09 +00:00
Kevin Enderby	592d3ac226	Changed the X86 assembler for intel syntax to work with directional labels. The X86 assembler has separate code to parse the intel assembly syntax in X86AsmParser::ParseIntelOperand(). This did not parse directional labels. And if something like 1f was used as a branch target it would get an "Unexpected token" error. The fix starts in X86AsmParser::ParseIntelExpression() in the case for AsmToken::Integer, it needs to grab the IntVal from the current token then look for a 'b' or 'f' following the Integer. Then it basically needs to do what is done in AsmParser::parsePrimaryExpr() for directional labels. It saves the MCExpr it creates in the IntelExprStateMachine in the Sym field. When it returns to X86AsmParser::ParseIntelOperand() it looks for a non-zero Sym field in the IntelExprStateMachine and if set it creates a memory operand not an immediate operand it would normally do for the Integer. rdar://14961158 llvm-svn: 197728	2013-12-19 22:02:03 +00:00
Hans Wennborg	fabf8bfdea	Make sys::ThreadLocal<> zero-initialized on non-thread builds (PR18205) According to the docs, ThreadLocal<>::get() should return NULL if no object has been set. This patch makes that the case also for non-thread builds and adds a very basic unit test to check it. (This was causing PR18205 because PrettyStackTraceHead didn't get zero-initialized and we'd crash trying to read past the end of that list. We didn't notice this so much on Linux since we'd crash after printing all the entries, but on Mac we print into a SmallString, and would crash before printing that.) llvm-svn: 197718	2013-12-19 20:32:44 +00:00
Kay Tiong Khoo	e37d52095e	Stay classy (and legal) LLVM. Remove links to 3rd party SMT solver whose links may not be permanent. llvm-svn: 197713	2013-12-19 18:35:54 +00:00
Quentin Colombet	90a646e4d1	[X86][fast-isel] Fix select lowering. The condition in selects is supposed to be i1. Make sure we are just reading the least significant bit of the 8-bit-wide value to match this constraint. <rdar://problem/15651765> llvm-svn: 197712	2013-12-19 18:32:04 +00:00
David Peixotto	80c083a678	Implement the .ltorg directive for ARM assembly This directive will write out the assembler-maintained constant pool for the current section. These constant pools are created to support the ldr-pseudo instruction (e.g. ldr r0, =val). The directive can be used by the programmer to place the constant pool in a location that can be reached by a pc-relative offset in the ldr instruction. llvm-svn: 197711	2013-12-19 18:26:07 +00:00
David Peixotto	e407d093e8	Implement the ldr-pseudo opcode for ARM assembly The ldr-pseudo opcode is a convenience for loading 32-bit constants. It is converted into a pc-relative load from a constant pool. For example, ldr r0, =0x10001 ldr r1, =bar will generate this output in the final assembly ldr r0, .Ltmp0 ldr r1, .Ltmp1 ... .Ltmp0: .long 0x10001 .Ltmp1: .long bar Sketch of the LDR pseudo implementation: Keep a map from Section => ConstantPool When parsing ldr r0, =val parse val as an MCExpr get ConstantPool for current Section Label = CreateTempSymbol() remember val in ConstantPool at next free slot add operand to ldr that is MCSymbolRef of Label On finishParse() callback Write out all non-empty constant pools for each Entry in ConstantPool Emit Entry.Label Emit Entry.Value Possible improvements to be added in a later patch: 1. Does not convert load of small constants to mov (e.g. ldr r0, =0x1 => mov r0, 0x1) 2. Does not reuse constant pool entries for same constant The implementation was tested for ARM, Thumb1, and Thumb2 targets on linux and darwin. llvm-svn: 197708	2013-12-19 18:12:36 +00:00
David Peixotto	308e7e4367	Add a finishParse() callback to the target asm parser This callback is invoked when the parse has finished successfully. It will be used to write out ARM constant pools to implement the ldr pseudo. llvm-svn: 197706	2013-12-19 18:08:08 +00:00
Kay Tiong Khoo	a570b5adb5	Improved fix for PR17827 (instcombine of shift/and/compare). This change fixes the case of arithmetic shift right - do not attempt to fold that case. This change also relaxes the conditions when attempting to fold the logical shift right and shift left cases. No additional IR-level test cases included at this time. See http://llvm.org/bugs/show_bug.cgi?id=17827 for proofs that these are correct transformations. llvm-svn: 197705	2013-12-19 18:07:17 +00:00
Rafael Espindola	4fa79758b7	Small simplification, p0 is the same as p. llvm-svn: 197699	2013-12-19 16:51:03 +00:00
Zoran Jovanovic	8e918c3c4d	Support for microMIPS control instructions. llvm-svn: 197696	2013-12-19 16:25:00 +00:00
Rafael Espindola	9ec26f395b	Long doubles are required to be aligned to 128 bits and svr4 32 bits. Clang was already getting this right. llvm-svn: 197694	2013-12-19 16:23:59 +00:00
Hal Finkel	2345347eb9	Add a disassembler to the PowerPC backend The tests for the disassembler were adapted from the encoder tests, and for the most part, the output from the disassembler matches that encoder-test inputs. There are some places where more-informative mnemonics could be produced (notably for the branch instructions), and those cases are noted in the tests with FIXMEs. Future work includes: - Generating more-informative mnemonics when possible (this may also be done in the printer). - Remove the dependence on positional "numbered" operand-to-variable mapping (for both encoding and decoding). - Internally using 64-bit instruction variants in 64-bit mode (if this turns out to matter). llvm-svn: 197693	2013-12-19 16:13:01 +00:00
Zoran Jovanovic	ff9d5f3284	Support for microMIPS LL and SC instructions. llvm-svn: 197692	2013-12-19 16:12:56 +00:00
Zoran Jovanovic	69be811a6e	Support for microMIPS TLS relocations. llvm-svn: 197685	2013-12-19 16:02:32 +00:00
Evgeniy Stepanov	a284e559d7	[dfsan] Simplify code after r197677. llvm-svn: 197679	2013-12-19 14:37:03 +00:00
Evgeniy Stepanov	a9164e9e2a	Add an explicit insert point argument to SplitBlockAndInsertIfThen. Currently SplitBlockAndInsertIfThen requires that branch condition is an Instruction itself, which is very inconvenient, because it is sometimes an Operator, or even a Constant. llvm-svn: 197677	2013-12-19 13:29:56 +00:00
NAKAMURA Takumi	6e3c4235be	GCOV.cpp: Fix format strings, %lf. Don't use %lf to double. llvm-svn: 197663	2013-12-19 08:46:28 +00:00
Matt Arsenault	a98cd6a56e	R600/SI: Make private pointers be 32-bit. Different sized address spaces should theoretically work most of the time now, and since 64-bit add is currently disabled, using more 32-bit pointers fixes some cases. llvm-svn: 197659	2013-12-19 05:32:55 +00:00
Saleem Abdulrasool	c0da2cb3b4	ARM IAS: support .inst directive This adds support for the .inst directive. This is an ARM specific directive to indicate an instruction encoded as a constant expression. The major difference between .word, .short, or .byte and .inst is that the latter will be disassembled as an instruction since it does not get flagged as data. llvm-svn: 197657	2013-12-19 05:17:58 +00:00
Josh Magee	22b8ba2d67	[stackprotector] Use analysis from the StackProtector pass for stack layout in PEI and LocalStackSlot passes. This changes the MachineFrameInfo API to use the new SSPLayoutKind information produced by the StackProtector pass (instead of a boolean flag) and updates a few pass dependencies (to preserve the SSP analysis). The stack layout follows the same approach used prior to this change - i.e., only LargeArray stack objects will be placed near the canary and everything else will be laid out normally. After this change, structures containing large arrays will also be placed near the canary - a case previously missed by the old implementation. Out of tree targets will need to update their usage of MachineFrameInfo::CreateStackObject to remove the MayNeedSP argument. The next patch will implement the rules for sspstrong and sspreq. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D2158 llvm-svn: 197653	2013-12-19 03:17:11 +00:00
Rafael Espindola	2fc7101e3c	Add stack alignment information for Sparc. This matches the data in clang which was added by Jakob Stoklund Olesen in r179596. Thanks for erikjv on irc for pointing me to the relevant documents: http://sparc.com/standards/64.psabi.1.35.ps.Z page 25: Every stack frame must be 16-byte aligned. http://sparc.com/standards/psABI3rd.pdf page 3-10: Although the architecture requires only word alignment, software convention and the operating system require every stack frame to be doubleword aligned. I tried to add a test, but it looks like sparc doesn't implement dynamic stack realignment. This will be tested in clang shortly. llvm-svn: 197646	2013-12-19 02:21:16 +00:00
Reid Kleckner	a534a38130	Begin adding docs and IR-level support for the inalloca attribute The inalloca attribute is designed to support passing C++ objects by value in the Microsoft C++ ABI. It behaves the same as byval, except that it always implies that the argument is in memory and that the bytes are never copied. This attribute allows the caller to take the address of an outgoing argument's memory and execute arbitrary code to store into it. This patch adds basic IR support, docs, and verification. It does not attempt to implement any lowering or fix any possibly broken transforms. When this patch lands, a complete description of this feature should appear at http://llvm.org/docs/InAlloca.html . Differential Revision: http://llvm-reviews.chandlerc.com/D2173 llvm-svn: 197645	2013-12-19 02:14:12 +00:00
Rafael Espindola	ddb913cc8f	Synchronize the NaCl DataLayout strings with the ones in clang. Patch by Derek Schuff. llvm-svn: 197640	2013-12-19 00:44:37 +00:00
Reed Kotler	47f3c64a48	Make cosmetic changes as part of Mips internal post commit review of patch r196331. llvm-svn: 197638	2013-12-19 00:43:08 +00:00
Yuchen Wu	bb6a477131	llvm-cov: Added -f option for function summaries. Similar to the file summaries, the function summaries output line, branching and call statistics. The file summaries have been moved outside the initial loop so that all of the function summaries can be outputted before file summaries. Also updated test cases. llvm-svn: 197633	2013-12-19 00:29:25 +00:00
Reed Kotler	2500bd6c20	Fix a problem with mips16 stubs when calls are transformed during tail call optimization. Some more work may be needed for indirect calls but this patch fixes the current regression in Prolangc++/trees. S2 optimization as part of the general cleanup and optimization of prolog and epilog was not saving S2 in this case and needed to. llvm-svn: 197630	2013-12-18 23:57:48 +00:00
Weiming Zhao	63871d255f	[aarch32] fix bug 18268: Incorrect condition of vsel Given vsel_cc, op1, op2, since vsel has no LE/LT, to generate vsel for such selection, it needs to inverse cc and swap op1 and op2. To inverse cc, both L/G and E bits should be flipped. llvm-svn: 197615	2013-12-18 22:25:17 +00:00
Adrian Prantl	99c7af26b7	Debug info: Implement (rvalue) reference qualifiers for C++11 non-static member functions. Paired commit with CFE. rdar://problem/15356637 llvm-svn: 197613	2013-12-18 21:48:19 +00:00
Adrian Prantl	31631e4a47	Pull in a couple of new constants from the upcoming DWARF 5 standard. llvm-svn: 197611	2013-12-18 21:48:14 +00:00
Rafael Espindola	84a8726a31	Correctly handle the degenerated triple "thumb". Fixes a crash in llc where some parts think the target is thumb and others think it is ARM. llvm-svn: 197607	2013-12-18 21:29:44 +00:00
Yuchen Wu	8256ee6d4a	llvm-cov: Print coverage summary to STDOUT. File summaries will now be optionally outputted which will give line, branching and call coverage info. Unfortunately, clang's current instrumentation does not give enough information to deduce function calls, something that gcc is able to do. Thus, no calls are always outputted to be consistent with gcov output. Also updated tests. llvm-svn: 197606	2013-12-18 21:12:51 +00:00
Yuchen Wu	c9b2dcdbee	llvm-cov: s/(.*)Executed/\1Exec/ llvm-svn: 197595	2013-12-18 18:46:25 +00:00
Yuchen Wu	73dc38187b	llvm-cov: Added -c option for branch counts. This will cause llvm-cov to output branch counts instead of branch probabilities. -b must be enabled. Also updated tests. llvm-svn: 197594	2013-12-18 18:40:15 +00:00
Logan Chien	a39510aeaa	[arm] Rename Tag_VFP_arch to Tag_FP_arch. According to "Addenda to ABI for ARM architecture", Tag_FP_arch is the new name for the equivalent Tag_VFP_arch. This commit renames Tag_VFP_arch to Tag_FP_arch. llvm-svn: 197587	2013-12-18 17:23:15 +00:00
Rafael Espindola	988f35e999	Fix f64 and f128 for ppc-darwin. This patch adds -f64:32:64 to 32 bit ppc darwin since an f64 inside a structure is only 32 bit aligned. The patch also drops -f128:64:128 from all ppc darwin, since f128 is 128 bit aligned. llvm-svn: 197574	2013-12-18 15:06:25 +00:00
Rafael Espindola	382ee385fd	On ppc32-darwin, an i64 inside a structure can have 32 bit alignment. Thanks to Iain Sandoe for testing this with the original gcc. Clang was already getting this right. llvm-svn: 197572	2013-12-18 14:35:37 +00:00
Tim Northover	f1c31b95e0	ARM: update comment to match reality llvm-svn: 197570	2013-12-18 14:18:36 +00:00
Tobias Grosser	84db1e744d	DiagnosticInfo: Add missing namespace llvm-svn: 197556	2013-12-18 10:12:06 +00:00
Tim Northover	44594ad7e2	ARM: set default float ABI based on triple. Clang sets the float-abi target option manually, but no longer annotates each function with its ABI. This can lead to confusing mismatch between "clang -emit-llvm | llc" and normal clang invocations. Besides which, gnueabihf actually is hard-float. Defaulting to soft was just perverse. llvm-svn: 197554	2013-12-18 09:27:33 +00:00
Kevin Qin	53eaea0104	[AArch64 NEON] Implement loading vector constant from constant pool. llvm-svn: 197551	2013-12-18 06:26:04 +00:00
Saleem Abdulrasool	88186c49c5	AsmParser: add support for .end directive The .end directive indicates the end of the file. No further instructions are processed after a .end directive is encountered. One potential (glaringly obvious) optimisation that could be pursued here is to extend MCAsmParser with a DiscardRemainder method to avoid processing lexemes to the end of the file. It was unclear at this point if that would be worth adding, and could easily be added in a follow on change. Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org> llvm-svn: 197547	2013-12-18 02:53:03 +00:00
David Blaikie	47f615eae5	DebugInfo: Introduce new DIValue, DIETypeSignature to encode references to type units via their signatures This simplifies type unit and type unit reference creation as well as setting the stage for inter-type hashing across type unit boundaries. llvm-svn: 197539	2013-12-17 23:32:35 +00:00
Rafael Espindola	febb8d2b96	Fix N32 registers and stack alignment. This patch fixes the "n" and "S" components of the data layout for mips. Clang already gets this right. This will be tested in clang. llvm-svn: 197536	2013-12-17 23:15:58 +00:00
Hal Finkel	b4b99e545b	Eliminate PPC instruction decoding ambiguities The instruction definitions in the PPC backend have a number of variants defined for the same instruction to represent differences between 64-bit and 32-bit semantics. In order to generate a disassembler for the PPC backend, we need to mark all but one of these as CodeGen only. No functionality change intended; this is prep work for PPC disassembly support. llvm-svn: 197535	2013-12-17 23:05:18 +00:00
Quentin Colombet	98e79a0604	[DiagnosticPrinter] Use the appropriate method to print a Twine object in a raw_ostream. llvm-svn: 197531	2013-12-17 22:35:07 +00:00
Reid Kleckner	d4e53f55f1	MC COFF: Emit the 'b' section flag for .bss sections in GNU assembly Without this, assembling clang's disassembly would produce an object file with the IMAGE_SCN_CNT_INITIALIZED_DATA section characteristic rather than the uninitialized one. link.exe would warn when merging comdats with different flags. llvm-svn: 197529	2013-12-17 22:12:40 +00:00
Rafael Espindola	8c08120dba	On APCS, only try to align aggregates to 32 bits instead of 64. This matches clang's behavior and since it is only a preference, it is not an ABI issue. llvm-svn: 197526	2013-12-17 21:36:54 +00:00
Rafael Espindola	9704fd03d1	Handle i64 first for clarity. No functionality change. llvm-svn: 197524	2013-12-17 21:28:36 +00:00
Duncan P. N. Exon Smith	ab5dbebc11	Assert that the last operand is actually EFLAGS This is another follow-up to r197503, after a post-commit review by Andy. <rdar://problem/15627766> llvm-svn: 197520	2013-12-17 20:28:21 +00:00
Andrew Trick	e4083f9e85	Disabled subregister copy coalescing during MachineCSE. This effectively backs out r197465 but leaves some of the general fixes in place. Not all targets are ready to handle this feature. To enable it, some infrastructure work is needed to better handle register class constraints. llvm-svn: 197514	2013-12-17 19:29:36 +00:00
Quentin Colombet	b4c44d239c	Add warning capabilities in LLVM. This reapplies r197438 and fixes the link-time circular dependency between IR and Support. The fix consists in moving the diagnostic support into IR. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197508	2013-12-17 17:47:22 +00:00
Matheus Almeida	8cc8b35a73	[mips] Fix off by one issue when applying a fixup. The branch offset for a R_MIPS_PC16 relocation is indeed a 16-bit signed immediate. llvm-svn: 197506	2013-12-17 17:10:00 +00:00
Duncan P. N. Exon Smith	512601d77f	Revert "Revert "Mark vastart_save_xmm_regs as changing EFLAGS"" This reverts commit r197481, recommiting r197469 with an extra fix. The vastart_save_xmm_regs pseudo-instruction expands to a test and a branch, so it modifies EFLAGS. Mark it so, or else the scheduler might place it in the middle of another test+branch. This fixes a bug exposed by r192750, which changed the initial scheduler to source-order as part of enabling the MI Scheduler for X86. This re-commit changes the VASTART_SAVE_XMM_REGS custom inserter not to try to save %flags, and adds a test that catches the bad behavior of r197469. <rdar://problem/15627766> llvm-svn: 197503	2013-12-17 15:54:45 +00:00
Rafael Espindola	345d718d16	Fix the pointer size for the PS3 datalayout. This will be tested from clang. llvm-svn: 197501	2013-12-17 15:29:48 +00:00
Stepan Dyatkovskiy	7f7c2710e0	Fix for PR18045: http://llvm.org/bugs/show_bug.cgi?id=18045 Short issue description: For X86 machines with sse < sse4.1 we got failures for some particular load/store vector sequences: $ clang-trunk -m32 -O2 test-case.c fatal error: error in backend: Cannot select: 0x4200920: v4i32,ch = load 0x41d6ab0, 0x4205850, 0x41dcb10<LD16[getelementptr inbounds ([4 x i32]* @e, i32 0, i32 0)](align=4)> [ORD=82] [ID=58] 0x4205850: i32 = X86ISD::Wrapper 0x41d5490 [ORD=26] [ID=43] 0x41d5490: i32 = TargetGlobalAddress<[4 x i32]* @e> 0 [ORD=26] [ID=23] 0x41dcb10: i32 = undef [ID=2] The reason is that EltsFromConsecutiveLoads could emit such load instruction both before and after legalize stage. Though this instruction is not legal for machines with SSSE3 and lower. The fix: In EltsFromConsecutiveLoads, if we have passed legalize stage, we check whether nodes it emits are legal. P.S.: If you get failure in time from 12:00 and till 22:00 (UTC-8), perhaps I'll slow with response, so you better reject this commit. Thanks! llvm-svn: 197492	2013-12-17 12:07:33 +00:00
Yaron Keren	7da8e45b57	There are no __register_frame and __deregister_frame functions when using structured exception handling (SEH) on Windows 64. http://llvm-reviews.chandlerc.com/D2378 Patch by Jonathan Liu! llvm-svn: 197483	2013-12-17 08:40:11 +00:00
Elena Demikhovsky	c5f6726a24	AVX-512: Added implementation of CONCAT_VECTORS for v8i1 vectors (by Alexey Bader). Added implementation of "truncate" from integer type (i64/i32/i16/i8) to i1. llvm-svn: 197482	2013-12-17 08:33:15 +00:00
Duncan P. N. Exon Smith	b2d4274d3f	Revert "Mark vastart_save_xmm_regs as changing EFLAGS" This reverts commit r197469. The sanitizer and dragonegg buildbots are failing, I think because of this change. Reverting until I figure out why. llvm-svn: 197481	2013-12-17 07:13:58 +00:00
Duncan P. N. Exon Smith	a4acde39e9	Mark vastart_save_xmm_regs as changing EFLAGS The vastart_save_xmm_regs pseudo-instruction expands to a test and a branch, so it modifies EFLAGS. Mark it so, or else the scheduler might place it in the middle of another test+branch. This fixes a bug exposed by r192750, which turned on the MI Scheduler for X86. <rdar://problem/15627766> llvm-svn: 197469	2013-12-17 06:12:05 +00:00
Andrew Trick	e339828b90	Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies. Without this, MachineCSE is powerless to handle redundant operations with truncated source operands. This required fixing the 2-addr pass to handle tied subregisters. It isn't clear what combinations of subregisters can legally be tied, but the simple case of truncated source operands is now safely handled: %vreg11<def> = COPY %vreg1:sub_32bit; GR32:%vreg11 GR64:%vreg1 %vreg12<def> = COPY %vreg2:sub_32bit; GR32:%vreg12 GR64:%vreg2 %vreg13<def,tied1> = ADD32rr %vreg11<tied0>, %vreg12<kill>, %EFLAGS<imp-def> Test case: cse-add-with-overflow.ll. This exposed an existing bug in PPCInstrInfo::commuteInstruction. Thanks to Rafael for the test case: PowerPC/crash.ll. llvm-svn: 197465	2013-12-17 04:50:45 +00:00
Andrew Trick	9defbd882b	whitespace llvm-svn: 197464	2013-12-17 04:50:40 +00:00
Jim Grosbach	04caa27387	Make comment more explicit. Re-reading the comment I updated in previous commit, it's better to make it more explicit and avoid ambiguity more effectively. llvm-svn: 197458	2013-12-17 02:18:02 +00:00
Jim Grosbach	dde043b3fd	Typo. s/reserved/preserved/ llvm-svn: 197457	2013-12-17 02:01:13 +00:00
Jim Grosbach	ea2db453dd	Add a machine code print in DEBUG() following instruction selection. Make debugging ISel a bit easier by printing out a dump of the generated code at the end. llvm-svn: 197456	2013-12-17 02:01:10 +00:00
Quentin Colombet	382b135d92	Revert r197438 and r197447 until we figure out how to avoid circular dependency at link time llvm-svn: 197451	2013-12-17 01:19:59 +00:00
Arnold Schwaighofer	50b8302c55	LoopVectorizer: Don't if-convert constant expressions that can trap A phi node operand or an instruction operand could be a constant expression that can trap (division). Check that we don't vectorize such cases. PR16729 radar://15653590 llvm-svn: 197449	2013-12-17 01:11:01 +00:00
Quentin Colombet	0caf4fef47	[LLVM Diagnostic Capabilities] Remove useless includes from DiagnosticPrinter.cpp. These were creating link-time dependencies of IR on CodeGen and Analysis. Part of <rdar://problem/15515174> llvm-svn: 197447	2013-12-17 00:56:19 +00:00
Quentin Colombet	66673f4075	Add warning capabilities in LLVM. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197438	2013-12-16 23:22:51 +00:00
Yi Jiang	6ab044ee35	Enable double to float shrinking optimizations for binary functions like 'fmin/fmax'. Fix radar:15283121 llvm-svn: 197434	2013-12-16 22:42:40 +00:00
Yuchen Wu	66d93b82ac	llvm-cov: Added -u option for unconditional branch info. Outputs branch information for unconditional branches in addition to conditional branches. -b option must be enabled. Also updated tests. llvm-svn: 197432	2013-12-16 22:14:02 +00:00
Juergen Ributzka	9ed985baad	[Stackmap] Allow WebKit_JS calling convention to store 4 byte sized and aligned arguments. This allows the WebKit_JS calling convention to perform partial writes on a 4 byte granularity to stack slots. llvm-svn: 197431	2013-12-16 22:05:32 +00:00
Matt Arsenault	cb34f84e39	Fix typo in instruction name. SI_KIL -> SI_KILL llvm-svn: 197425	2013-12-16 20:58:33 +00:00
Rafael Espindola	f152836788	Revert "Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies." This reverts commit r197414. It broke the ppc64 bootstrap. I will post a testcase in a sec. llvm-svn: 197424	2013-12-16 20:57:09 +00:00
Yuchen Wu	8742a28560	llvm-cov: Removed extra semicolon from ;;. llvm-svn: 197418	2013-12-16 20:03:11 +00:00
Juergen Ributzka	b1612c18ab	[Stackmap] The first integer argument is passed in register for the WebKit_JS calling convention. Pass the first integer argument (callee) in register to optimize inline caches. llvm-svn: 197416	2013-12-16 19:53:31 +00:00
Andrew Trick	88bd8629b2	Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies. Without this, MachineCSE is powerless to handle redundant operations with truncated source operands. This required fixing the 2-addr pass to handle tied subregisters. It isn't clear what combinations of subregisters can legally be tied, but the simple case of truncated source operands is now safely handled: %vreg11<def> = COPY %vreg1:sub_32bit; GR32:%vreg11 GR64:%vreg1 %vreg12<def> = COPY %vreg2:sub_32bit; GR32:%vreg12 GR64:%vreg2 %vreg13<def,tied1> = ADD32rr %vreg11<tied0>, %vreg12<kill>, %EFLAGS<imp-def> llvm-svn: 197414	2013-12-16 19:36:21 +00:00
Andrew Trick	cccd82f21f	whitespace llvm-svn: 197413	2013-12-16 19:36:18 +00:00
Rafael Espindola	e89b41495a	One last cleanup of LLVM's DataLayout strings. Produce them in the same order on every target. The order is that of getStringRepresentation: e|E-i-f-v-a-s-n-S*. llvm-svn: 197411	2013-12-16 19:31:14 +00:00
Rafael Espindola	0eb1ebeaac	Structure R600's computeDataLayout more like every other target. While there, simplify "p3:32:32:32" to "p3:32:32". llvm-svn: 197407	2013-12-16 19:18:57 +00:00
Joerg Sonnenberger	8fe41b7319	Recognize EABIHF as environment and use it for RTAPI + VFP. llvm-svn: 197405	2013-12-16 18:51:28 +00:00

66493 Commits