llvm-project

Commit Graph

Author	SHA1	Message	Date
Josh Magee	22b8ba2d67	[stackprotector] Use analysis from the StackProtector pass for stack layout in PEI a nd LocalStackSlot passes. This changes the MachineFrameInfo API to use the new SSPLayoutKind information produced by the StackProtector pass (instead of a boolean flag) and updates a few pass dependencies (to preserve the SSP analysis). The stack layout follows the same approach used prior to this change - i.e., only LargeArray stack objects will be placed near the canary and everything else will be laid out normally. After this change, structures containing large arrays will also be placed near the canary - a case previously missed by the old implementation. Out of tree targets will need to update their usage of MachineFrameInfo::CreateStackObject to remove the MayNeedSP argument. The next patch will implement the rules for sspstrong and sspreq. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D2158 llvm-svn: 197653	2013-12-19 03:17:11 +00:00
Rafael Espindola	2fc7101e3c	Add stack alignment information for Sparc. This matches the data in clang which was added by Jakob Stoklund Olesen in r179596. Thanks for erikjv on irc for pointing me to the relevant documents: http://sparc.com/standards/64.psabi.1.35.ps.Z page 25: Every stack frame must be 16-byte aligned. http://sparc.com/standards/psABI3rd.pdf page 3-10: Although the architecture requires only word alignment, software convention and the operating system require every stack frame to be doubleword aligned. I tried to add a test, but it looks like sparc doesn't implement dynamic stack realignment. This will be tested in clang shortly. llvm-svn: 197646	2013-12-19 02:21:16 +00:00
Reid Kleckner	a534a38130	Begin adding docs and IR-level support for the inalloca attribute The inalloca attribute is designed to support passing C++ objects by value in the Microsoft C++ ABI. It behaves the same as byval, except that it always implies that the argument is in memory and that the bytes are never copied. This attribute allows the caller to take the address of an outgoing argument's memory and execute arbitrary code to store into it. This patch adds basic IR support, docs, and verification. It does not attempt to implement any lowering or fix any possibly broken transforms. When this patch lands, a complete description of this feature should appear at http://llvm.org/docs/InAlloca.html . Differential Revision: http://llvm-reviews.chandlerc.com/D2173 llvm-svn: 197645	2013-12-19 02:14:12 +00:00
Rafael Espindola	ddb913cc8f	Synchronize the NaCl DataLayout strings with the ones in clang. Patch by Derek Schuff. llvm-svn: 197640	2013-12-19 00:44:37 +00:00
Reed Kotler	47f3c64a48	Make cosmetic changes as part of Mips internal post commit review of patch r196331. llvm-svn: 197638	2013-12-19 00:43:08 +00:00
Yuchen Wu	bb6a477131	llvm-cov: Added -f option for function summaries. Similar to the file summaries, the function summaries output line, branching and call statistics. The file summaries have been moved outside the initial loop so that all of the function summaries can be outputted before file summaries. Also updated test cases. llvm-svn: 197633	2013-12-19 00:29:25 +00:00
Reed Kotler	2500bd6c20	Fix a problem with mips16 stubs when calls are transformed during tail call optimization. Some more work may be needed for indirect calls but this patch fixes the current regression in Prolangc++/trees. S2 optimization as part of the general cleanup and optimization of prolog and epilog was not saving S2 in this case and needed to. llvm-svn: 197630	2013-12-18 23:57:48 +00:00
Weiming Zhao	63871d255f	[aarch32] fix bug 18268: Incorrect condition of vsel Given vsel_cc, op1, op2, since vsel has no LE/LT, to generate vsel for such selection, it needs to inverse cc and swap op1 and op2. To inverse cc, both L/G and E bits should be flipped. llvm-svn: 197615	2013-12-18 22:25:17 +00:00
Adrian Prantl	99c7af26b7	Debug info: Implement (rvalue) reference qualifiers for C++11 non-static member functions. Paired commit with CFE. rdar://problem/15356637 llvm-svn: 197613	2013-12-18 21:48:19 +00:00
Adrian Prantl	31631e4a47	Pull in a couple of new constants from the upcoming DWARF 5 standard. llvm-svn: 197611	2013-12-18 21:48:14 +00:00
Rafael Espindola	84a8726a31	Correctly handle the degenerated triple "thumb". Fixes a crash in llc where some parts think the target is thumb and others think it is ARM. llvm-svn: 197607	2013-12-18 21:29:44 +00:00
Yuchen Wu	8256ee6d4a	llvm-cov: Print coverage summary to STDOUT. File summaries will now be optionally outputted which will give line, branching and call coverage info. Unfortunately, clang's current instrumentation does not give enough information to deduce function calls, something that gcc is able to do. Thus, no calls are always outputted to be consistent with gcov output. Also updated tests. llvm-svn: 197606	2013-12-18 21:12:51 +00:00
Yuchen Wu	c9b2dcdbee	llvm-cov: s/(.*)Executed/\1Exec/ llvm-svn: 197595	2013-12-18 18:46:25 +00:00
Yuchen Wu	73dc38187b	llvm-cov: Added -c option for branch counts. This will cause llvm-cov to output branch counts instead of branch probabilities. -b must be enabled. Also updated tests. llvm-svn: 197594	2013-12-18 18:40:15 +00:00
Logan Chien	a39510aeaa	[arm] Rename Tag_VFP_arch to Tag_FP_arch. According to "Addenda to ABI for ARM architecture", Tag_FP_arch is the new name for the equivalent Tag_VFP_arch. This commit renames Tag_VFP_arch to Tag_FP_arch. llvm-svn: 197587	2013-12-18 17:23:15 +00:00
Rafael Espindola	988f35e999	Fix f64 and f128 for ppc-darwin. This patch adds -f64:32:64 to 32 bit ppc darwin since a f64 inside a structure are only 32 bit aligned. The patch also drop -f128:64:128 from all ppc darwin, since f128 is 128 bit aligned. llvm-svn: 197574	2013-12-18 15:06:25 +00:00
Rafael Espindola	382ee385fd	One ppc32-darwin, a i64 inside a structure can have 32 bit alignment. Thanks for Iain Sandoe for testing this with the original gcc. Clang was already getting this right. llvm-svn: 197572	2013-12-18 14:35:37 +00:00
Tim Northover	f1c31b95e0	ARM: update comment to match reality llvm-svn: 197570	2013-12-18 14:18:36 +00:00
Tobias Grosser	84db1e744d	DiagnosticInfo: Add missing namespace llvm-svn: 197556	2013-12-18 10:12:06 +00:00
Tim Northover	44594ad7e2	ARM: set default float ABI based on triple. Clang sets the float-abi target option manually, but no longer annotates each function with its ABI. This can lead to confusing mistmatch between "clang -emit-llvm \| llc" and normal clang invocations. Besides which, gnueabihf actually is hard-float. Defaulting to soft was just perverse. llvm-svn: 197554	2013-12-18 09:27:33 +00:00
Kevin Qin	53eaea0104	[AArch64 NEON]Implment loading vector constant form constant pool. llvm-svn: 197551	2013-12-18 06:26:04 +00:00
Saleem Abdulrasool	88186c49c5	AsmParser: add support for .end directive The .end directive indicates the end of the file. No further instructions are processed after a .end directive is encountered. One potential (glaringly obvious) optimisation that could be pursued here is to extend MCAsmParser with a DiscardRemainder method to avoid processing lexemes to the end of the file. It was unclear at this point if that would be worth adding, and could easily be added in a follow on change. Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org> llvm-svn: 197547	2013-12-18 02:53:03 +00:00
David Blaikie	47f615eae5	DebugInfo: Introduce new DIValue, DIETypeSignature to encode references to type units via their signatures This simplifies type unit and type unit reference creation as well as setting the stage for inter-type hashing across type unit boundaries. llvm-svn: 197539	2013-12-17 23:32:35 +00:00
Rafael Espindola	febb8d2b96	Fix N32 registers and stack alignment. This patch fixes the "n" and "S" components of the data layout for mips. Clang already gets this right. This will be tested in clang. llvm-svn: 197536	2013-12-17 23:15:58 +00:00
Hal Finkel	b4b99e545b	Eliminate PPC instruction decoding ambiguities The instruction definitions in the PPC backend have a number of variants defined for the same instruction to represent differences between 64-bit and 32-bit semantics. In order to generate a disassembler for the PPC backend, we need to mark all but one of these as CodeGen only. No functionality change intended; this is prep work for PPC disassembly support. llvm-svn: 197535	2013-12-17 23:05:18 +00:00
Quentin Colombet	98e79a0604	[DiagnosticPrinter] Use the appropriate method to print a Twine object in a raw_ostream. llvm-svn: 197531	2013-12-17 22:35:07 +00:00
Reid Kleckner	d4e53f55f1	MC COFF: Emit the 'b' section flag for .bss sections in GNU assembly Without this, assembling clang's disassembly would produce an object file with the IMAGE_SCN_CNT_INITIALIZED_DATA section characteristic rather than the uninitialized one. link.exe would warn when merging comdats with different flags. llvm-svn: 197529	2013-12-17 22:12:40 +00:00
Rafael Espindola	8c08120dba	On APCS, only try to align aggregates to 32 bits instead of 64. This matches clang's behavior and since it is only a preference, it is not an ABI issue. llvm-svn: 197526	2013-12-17 21:36:54 +00:00
Rafael Espindola	9704fd03d1	Handle i64 first for clarity. No functionality change. llvm-svn: 197524	2013-12-17 21:28:36 +00:00
Duncan P. N. Exon Smith	ab5dbebc11	Assert that the last operand is actually EFLAGS This is another follow-up to r197503, after a post-commit review by Andy. <rdar://problem/15627766> llvm-svn: 197520	2013-12-17 20:28:21 +00:00
Andrew Trick	e4083f9e85	Disabled subregister copy coalescing during MachineCSE. This effectively backs out r197465 but leaves some of the general fixes in place. Not all targets are ready to handle this feature. To enable it, some infrastructure work is needed to better handle register class constraints. llvm-svn: 197514	2013-12-17 19:29:36 +00:00
Quentin Colombet	b4c44d239c	Add warning capabilities in LLVM. This reapplies r197438 and fixes the link-time circular dependency between IR and Support. The fix consists in moving the diagnostic support into IR. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197508	2013-12-17 17:47:22 +00:00
Matheus Almeida	8cc8b35a73	[mips] Fix off by one issue when applying a fixup. The branch offset for a R_MIPS_PC16 relocation is indeed a 16-bit signed immediate. llvm-svn: 197506	2013-12-17 17:10:00 +00:00
Duncan P. N. Exon Smith	512601d77f	Revert "Revert "Mark vastart_save_xmm_regs as changing EFLAGS"" This reverts commit r197481, recommiting r197469 with an extra fix. The vastart_save_xmm_regs pseudo-instruction expands to a test and a branch, so it modifies EFLAGS. Mark it so, or else the scheduler might place it in the middle of another test+branch. This fixes a bug exposed by r192750, which changed the initial scheduler to source-order as part of enabling the MI Scheduler for X86. This re-commit changes the VASTART_SAVE_XMM_REGS custom inserter not to try to save %flags, and adds a test that catches the bad behavior of r197469. <rdar://problem/15627766> llvm-svn: 197503	2013-12-17 15:54:45 +00:00
Rafael Espindola	345d718d16	Fix the pointer size for the PS3 datalayout. This will be tested from clang. llvm-svn: 197501	2013-12-17 15:29:48 +00:00
Stepan Dyatkovskiy	7f7c2710e0	Fix for PR18045: http://llvm.org/bugs/show_bug.cgi?id=18045 Short issue description: For X86 machines with sse < sse4.1 we got failures for some particular load/store vector sequences: $ clang-trunk -m32 -O2 test-case.c fatal error: error in backend: Cannot select: 0x4200920: v4i32,ch = load 0x41d6ab0, 0x4205850, 0x41dcb10<LD16[getelementptr inbounds ([4 x i32]* @e, i32 0, i32 0)](align=4)> [ORD=82] [ID=58] 0x4205850: i32 = X86ISD::Wrapper 0x41d5490 [ORD=26] [ID=43] 0x41d5490: i32 = TargetGlobalAddress<[4 x i32]* @e> 0 [ORD=26] [ID=23] 0x41dcb10: i32 = undef [ID=2] The reason is that EltsFromConsecutiveLoads could emit such load instruction both before and after legalize stage. Though this instruction is not legal for machines with SSSE3 and lower. The fix: In EltsFromConsecutiveLoads, if we have passed legalize stage, we check whether nodes it emits are legal. P.S.: If you get failure in time from 12:00 and till 22:00 (UTC-8), perhaps I'll slow with response, so you better reject this commit. Thanks! llvm-svn: 197492	2013-12-17 12:07:33 +00:00
Yaron Keren	7da8e45b57	There are no __register_frame and __deregister_frame functions when using structured exception handling (SEH) on Windows 64. http://llvm-reviews.chandlerc.com/D2378 Patch by Jonathan Liu! llvm-svn: 197483	2013-12-17 08:40:11 +00:00
Elena Demikhovsky	c5f6726a24	AVX-512: Added implementation of CONCAT_VECTORS for v8i1 vectors (by Alexey Bader). Added implementation of "truncate" from integer type (i64/i32/i16/i8) to i1. llvm-svn: 197482	2013-12-17 08:33:15 +00:00
Duncan P. N. Exon Smith	b2d4274d3f	Revert "Mark vastart_save_xmm_regs as changing EFLAGS" This reverts commit r197469. The sanitizer and dragonegg buildbots are failing, I think because of this change. Reverting until I figure out why. llvm-svn: 197481	2013-12-17 07:13:58 +00:00
Duncan P. N. Exon Smith	a4acde39e9	Mark vastart_save_xmm_regs as changing EFLAGS The vastart_save_xmm_regs pseudo-instruction expands to a test and a branch, so it modifies EFLAGS. Mark it so, or else the scheduler might place it in the middle of another test+branch. This fixes a bug exposed by r192750, which turned on the MI Scheduler for X86. <rdar://problem/15627766> llvm-svn: 197469	2013-12-17 06:12:05 +00:00
Andrew Trick	e339828b90	Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies. Without this, MachineCSE is powerless to handle redundant operations with truncated source operands. This required fixing the 2-addr pass to handle tied subregisters. It isn't clear what combinations of subregisters can legally be tied, but the simple case of truncated source operands is now safely handled: %vreg11<def> = COPY %vreg1:sub_32bit; GR32:%vreg11 GR64:%vreg1 %vreg12<def> = COPY %vreg2:sub_32bit; GR32:%vreg12 GR64:%vreg2 %vreg13<def,tied1> = ADD32rr %vreg11<tied0>, %vreg12<kill>, %EFLAGS<imp-def> Test case: cse-add-with-overflow.ll. This exposed an existing bug in PPCInstrInfo::commuteInstruction. Thanks to Rafael for the test case: PowerPC/crash.ll. llvm-svn: 197465	2013-12-17 04:50:45 +00:00
Andrew Trick	9defbd882b	whitespace llvm-svn: 197464	2013-12-17 04:50:40 +00:00
Jim Grosbach	04caa27387	Make comment more explicit. Re-reading the comment I updated in previous commit, it's better to make it more explicit and avoid ambiguity more effectively. llvm-svn: 197458	2013-12-17 02:18:02 +00:00
Jim Grosbach	dde043b3fd	Typo. s/reserved/preserved/ llvm-svn: 197457	2013-12-17 02:01:13 +00:00
Jim Grosbach	ea2db453dd	Add a machine code print in DEBUG() following instruction selection. Make debugging ISel a bit easier by printing out a dump of the generated code at the end. llvm-svn: 197456	2013-12-17 02:01:10 +00:00
Quentin Colombet	382b135d92	Revert r197438 and r197447 until we figure out how to avoid circular dependency at link time llvm-svn: 197451	2013-12-17 01:19:59 +00:00
Arnold Schwaighofer	50b8302c55	LoopVectorizer: Don't if-convert constant expressions that can trap A phi node operand or an instruction operand could be a constant expression that can trap (division). Check that we don't vectorize such cases. PR16729 radar://15653590 llvm-svn: 197449	2013-12-17 01:11:01 +00:00
Quentin Colombet	0caf4fef47	[LLVM Diagnostic Capabilities] Remove useless includes from DiagnosticPrinter.cpp. These was creating a link time dependencies of IR on CodeGen and Analysis. Part of <rdar://problem/15515174> llvm-svn: 197447	2013-12-17 00:56:19 +00:00
Quentin Colombet	66673f4075	Add warning capabilities in LLVM. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197438	2013-12-16 23:22:51 +00:00
Yi Jiang	6ab044ee35	Enable double to float shrinking optimizations for binary functions like 'fmin/fmax'. Fix radar:15283121 llvm-svn: 197434	2013-12-16 22:42:40 +00:00
Yuchen Wu	66d93b82ac	llvm-cov: Added -u option for unconditional branch info. Outputs branch information for unconditional branches in addition to conditional branches. -b option must be enabled. Also updated tests. llvm-svn: 197432	2013-12-16 22:14:02 +00:00
Juergen Ributzka	9ed985baad	[Stackmap] Allow WebKit_JS calling convention to store 4 byte sized and aligned arguments. This allows the WebKit_JS calling convention to perform partial writes on a 4 byte granularity to stack slots. llvm-svn: 197431	2013-12-16 22:05:32 +00:00
Matt Arsenault	cb34f84e39	Fix typo in instruction name. SI_KIL -> SI_KILL llvm-svn: 197425	2013-12-16 20:58:33 +00:00
Rafael Espindola	f152836788	Revert "Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies." This reverts commit r197414. It broke the ppc64 bootstrap. I will post a testcase in a sec. llvm-svn: 197424	2013-12-16 20:57:09 +00:00
Yuchen Wu	8742a28560	llvm-cov: Removed extra semicolon from ;;. llvm-svn: 197418	2013-12-16 20:03:11 +00:00
Juergen Ributzka	b1612c18ab	[Stackmap] The first integer argument is passed in register for the WebKit_JS calling convention. Pass the first integer argument (callee) in register to optimize inline caches. llvm-svn: 197416	2013-12-16 19:53:31 +00:00
Andrew Trick	88bd8629b2	Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies. Without this, MachineCSE is powerless to handle redundant operations with truncated source operands. This required fixing the 2-addr pass to handle tied subregisters. It isn't clear what combinations of subregisters can legally be tied, but the simple case of truncated source operands is now safely handled: %vreg11<def> = COPY %vreg1:sub_32bit; GR32:%vreg11 GR64:%vreg1 %vreg12<def> = COPY %vreg2:sub_32bit; GR32:%vreg12 GR64:%vreg2 %vreg13<def,tied1> = ADD32rr %vreg11<tied0>, %vreg12<kill>, %EFLAGS<imp-def> llvm-svn: 197414	2013-12-16 19:36:21 +00:00
Andrew Trick	cccd82f21f	whitespace llvm-svn: 197413	2013-12-16 19:36:18 +00:00
Rafael Espindola	e89b41495a	One last cleanup of LLVM's DataLayout strings. Produce them in the same order on every target. The order is that of getStringRepresentation: e\|E-i-f-v-a-s-n-S*. llvm-svn: 197411	2013-12-16 19:31:14 +00:00
Rafael Espindola	0eb1ebeaac	Structure R600's computeDataLayout more like every other target. While there, simplify "p3:32:32:32" to "p3:32:32". llvm-svn: 197407	2013-12-16 19:18:57 +00:00
Joerg Sonnenberger	8fe41b7319	Recognize EABIHF as environment and use it for RTAPI + VFP. llvm-svn: 197405	2013-12-16 18:51:28 +00:00
Chad Rosier	5f87edb484	[AArch64] Fix v1fx patterns for Floating-point Multiply Extend and Floating-point Compare to Zero. llvm-svn: 197402	2013-12-16 18:29:35 +00:00
Reid Kleckner	86a8e1e0e4	MemoryBuffer: Increase the alignment of small file buffers to 16 This was manifesting as an LLVM_ASSUME_ALIGNED() failure in an ELF debug info test when building LLVM with clang in the Microsoft C++ ABI. llvm-svn: 197401	2013-12-16 18:18:12 +00:00
Rafael Espindola	bccb9d45ad	The preferred alignment defaults to the abi alignment. Omit if it is the same. llvm-svn: 197400	2013-12-16 18:01:51 +00:00
Rafael Espindola	f057093fdc	Don't duplicate the DataLayout defaults for integer, floats and vectors. llvm-svn: 197398	2013-12-16 17:41:15 +00:00
Rafael Espindola	8afbb28cea	On DataLayout, omit the default of p:64:64:64. llvm-svn: 197397	2013-12-16 17:15:29 +00:00
Hal Finkel	0a576d52fa	Set has_asmparser in PowerPC/LLVMBuild.txt PowerPC now has an asm parser (and has for many months now); indicate this in PowerPC/LLVMBuild.txt. llvm-svn: 197393	2013-12-16 15:48:09 +00:00
Elena Demikhovsky	47fc44e52e	AVX-512: Added legal type MVT::i1 and VK1 register for it. Added scalar compare VCMPSS, VCMPSD. Implemented LowerSELECT for scalar FP operations. I replaced FSETCCss, FSETCCsd with one node type FSETCCs. Node extract_vector_elt(v16i1/v8i1, idx) returns an element of type i1. llvm-svn: 197384	2013-12-16 13:52:35 +00:00
Evgeniy Stepanov	a1df6379a6	Fix Android regression in r197332. llvm-svn: 197366	2013-12-16 07:02:51 +00:00
Hao Liu	774cabb538	[AArch64]Fix the pattern match failure for v1i8/v1i16/v1i32 types. Currently we have such types as legal vector types. The DAG combiner may generate some DAG nodes having such types but we don't have patterns to match them. E.g. a load i32 and a bitcast i32 to v1i32 will be combined into a load v1i32: bitcast (load i32) to v1i32 -> load v1i32. So this patch fixes such problems for load/dup instructions. If v1i8/v1i16/v1i32 are not legal any more, the code in this patch can be deleted. So I also add some FIXME. llvm-svn: 197361	2013-12-16 02:51:28 +00:00
Reed Kotler	b69ea1e92e	remove an uneeded statement (condition is covered by the statement that follows). llvm-svn: 197358	2013-12-15 23:33:59 +00:00
Reed Kotler	06b3c4f484	Fix some indentation. llvm-svn: 197357	2013-12-15 23:03:35 +00:00
Reed Kotler	4d030b4e89	Get rid of an superfluous tab in the .s file. This was originally part of a multi-line pseudo which worked around a linker bug for mips16. llvm-svn: 197356	2013-12-15 22:02:31 +00:00
Reed Kotler	5c29d63a66	Last change for mips16 prolog/epilog cleanup and optimization. Some tiny cosmetic code changes to follow. Because of the wide ranging nature of the patch a full 24 test cycle was needed to check against regression. This was the smallest patch I could make to progress from the earlier ones in the series. llvm-svn: 197350	2013-12-15 20:49:30 +00:00
Joerg Sonnenberger	ddb582896a	There is no exp10 on NetBSD. llvm-svn: 197348	2013-12-15 20:36:17 +00:00
Michael Kuperstein	e31b486cdd	Fix AsmWriter's handling of SPIR calling conventions. Patch by Boaz Ouriel. llvm-svn: 197335	2013-12-15 10:01:20 +00:00
Joerg Sonnenberger	7466979f20	Replace string matching with a switch on Triple::getEnvironment. llvm-svn: 197332	2013-12-15 00:12:52 +00:00
Juergen Ributzka	c26b68a94f	[Stackmap] Refactor operand parsing. llvm-svn: 197329	2013-12-14 23:06:19 +00:00
Matt Arsenault	52226f9a8e	Don't manually calculate size in bytes llvm-svn: 197327	2013-12-14 18:21:59 +00:00
Iain Sandoe	e0b4cb62f5	[Powerpc darwin] AsmParser Base implementation. This is a base implementation of the powerpc-apple-darwin asm parser dialect. * Enables infrastructure (essentially isDarwin()) and fixes up the parsing of asm directives to separate out ELF and MachO/Darwin additions. * Enables parsing of {r,f,v}XX as register identifiers. * Enables parsing of lo16() hi16() and ha16() as modifiers. The changes to the test case are from David Fang (fangism). llvm-svn: 197324	2013-12-14 13:34:02 +00:00
Juergen Ributzka	db9ee00b59	Remove weak vtables. No functional change. llvm-svn: 197323	2013-12-14 12:23:14 +00:00
Juergen Ributzka	e82947539e	[Stackmap] Liveness Analysis Pass This optional register liveness analysis pass can be enabled with either -enable-stackmap-liveness, -enable-patchpoint-liveness, or both. The pass traverses each basic block in a machine function. For each basic block the instructions are processed in reversed order and if a patchpoint or stackmap instruction is encountered the current live-out register set is encoded as a register mask and attached to the instruction. Later on during stackmap generation the live-out register mask is processed and also emitted as part of the stackmap. This information is optional and intended for optimization purposes only. This will enable a client of the stackmap to reason about the registers it can use and which registers need to be preserved. Reviewed by Andy llvm-svn: 197317	2013-12-14 06:53:06 +00:00
Juergen Ributzka	36f4619753	[Stackmap] Only the AnyReg calling convention should preserve all registers. llvm-svn: 197316	2013-12-14 06:52:59 +00:00
Juergen Ributzka	310034e166	Convert register liveness tracking to work on a sub-register level instead of just register units. Reviewed by Andy llvm-svn: 197315	2013-12-14 06:52:56 +00:00
Rafael Espindola	456f047546	Refactor NVPTX's computeDataLayout. No functionality change. llvm-svn: 197312	2013-12-14 06:42:48 +00:00
Rafael Espindola	307d7abc7f	Turn NVPTXSubtarget::getDataLayout into a static function. No functionality change. llvm-svn: 197311	2013-12-14 06:36:30 +00:00
Rafael Espindola	ceb0c4962a	Turn AMDGPUSubtarget::getDataLayout into a static function. No functionality change. llvm-svn: 197310	2013-12-14 06:13:44 +00:00
Michael Gottesman	5e985ee5b5	[block-freq] Rename getEntryFrequency() -> getEntryFreq() to match getBlockFreq() in all BlockFrequencyInfo. llvm-svn: 197304	2013-12-14 02:37:38 +00:00
Michael Gottesman	fb9164f0d2	[block-freq] Teach branch probability how to return the edge weight in between a BasicBlock and one of its successors. IMHO At some point BasicBlock should be refactored along the lines of MachineBasicBlock so that successors/weights are actually embedded within the block. Now is not that time though. llvm-svn: 197303	2013-12-14 02:24:25 +00:00
Michael Gottesman	8f17dccdcb	[block-freq] Add a right shift to BlockFrequency that saturates at 1. llvm-svn: 197302	2013-12-14 02:24:22 +00:00
Michael Gottesman	8c79ee409a	[block-freq] Remove old BlockFrequency entry frequency and printing code. llvm-svn: 197297	2013-12-14 00:57:18 +00:00
Michael Gottesman	9f49d74413	[block-freq] Refactor LiveInterals::getSpillWeight to use the new MachineBlockFrequencyInfo methods. This is slightly more interesting than the previous batch of changes. Specifically: 1. We refactor getSpillWeight to take a MachineBlockFrequencyInfo (MBFI) object. This enables us to completely encapsulate the actual manner we use the MachineBlockFrequencyInfo to get our spill weights. This yields cleaner code since one does not need to fetch the actual block frequency before getting the spill weight if all one wants it the spill weight. It also gives us access to entry frequency which we need for our computation. 2. Instead of having getSpillWeight take a MachineBasicBlock (as one might think) to look up the block frequency via the MBFI object, we instead take in a MachineInstr object. The reason for this is that the method is supposed to return the spill weight for an instruction according to the comments around the function. llvm-svn: 197296	2013-12-14 00:53:32 +00:00
Matt Arsenault	d3ee7af2f4	Teach MemoryBuiltins about address spaces llvm-svn: 197292	2013-12-14 00:27:48 +00:00
Michael Gottesman	092647b37a	[block-freq] Store MBFI as a field on SpillPlacement so we can access it to get the entry frequency while processing data. llvm-svn: 197291	2013-12-14 00:25:47 +00:00
Michael Gottesman	b78dec8faf	[block-freq] Update MachineBlockPlacement and RegAllocGreedy to use the new MachineBlockFrequencyInfo methods. llvm-svn: 197290	2013-12-14 00:25:45 +00:00
Michael Gottesman	b0c1ed8f4c	[block-freq] Update BlockFrequencyInfo/MachineBlockFrequencyInfo to use the new print methods. llvm-svn: 197289	2013-12-14 00:25:42 +00:00
Matt Arsenault	68c38fd6d1	Print the address space of a MachineMemOperand llvm-svn: 197288	2013-12-14 00:24:02 +00:00
Michael Gottesman	fd5c4b2c09	[block-freq] Add the equivalent methods to MachineBlockFrequencyInfo and BlockFrequencyInfo that were added to BlockFrequencyImpl in r197285 and r197284. llvm-svn: 197287	2013-12-14 00:06:03 +00:00
Rafael Espindola	f39136c39f	Pointer sizes are stored in Bytes. Fix variables names to say so. Also update for the current naming style. llvm-svn: 197283	2013-12-13 23:15:20 +00:00
Kevin Enderby	651898c19f	Fixed a bug in getARMFixupKindMachOInfo() where three ARM fixup kinds were falling into the cases for 24-bit branch kinds which are not 24-bit branches. The routine is to return false for fixups are expected to always be resolvable at assembly time. Which these three fixups are as they have limited displacement and are for local references within a function. rdar://15586725 llvm-svn: 197282	2013-12-13 22:46:54 +00:00
Andrew Trick	60cf0adeb5	comment typo. llvm-svn: 197278	2013-12-13 22:23:54 +00:00
Michael Gottesman	e1fad2b560	Remove APInt::extractBit since it is already implemented via operator[]. Change tests for extractBit to test operator[]. llvm-svn: 197277	2013-12-13 22:00:19 +00:00
David Blaikie	bc563276e0	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. Originally committed as r197073 and reverted in r197079. Recommitted as r197197 to reproduce the failure and reverted as r197199 Turns out there was unstable ordering in the type unit dumping code. Fixed by using MapVector in DWARFContext to store the debug_types comdat sections. Recommitted as r197210 with a fix to dumping and reverted as r197211 because I was a bit gun shy and thought I saw a failure that turned out to be unrelated. So here we go - once more with feeling! \o/ llvm-svn: 197275	2013-12-13 21:33:40 +00:00
Michael Gottesman	4497d963fb	[block-freq] Add the APInt method extractBit. llvm-svn: 197271	2013-12-13 20:47:34 +00:00
Andrew Trick	27709d0b3c	Revert "Convert liveness tracking to work on a sub-register level instead of just register units." This reverts commit r197253. This was a great change, but Juergen should be the commit author. llvm-svn: 197262	2013-12-13 19:04:08 +00:00
Andrew Trick	7bcb0100df	Revert "Liveness Analysis Pass" This reverts commit r197254. This was an accidental merge of Juergen's patch. It will be checked in shortly, but wasn't meant to go in quite yet. Conflicts: include/llvm/CodeGen/StackMaps.h lib/CodeGen/StackMaps.cpp test/CodeGen/X86/stackmap-liveness.ll llvm-svn: 197260	2013-12-13 18:57:20 +00:00
Andrew Trick	e8cba373a3	Grow the stackmap/patchpoint format to hold 64-bit IDs. llvm-svn: 197255	2013-12-13 18:37:10 +00:00
Andrew Trick	8d6a658430	Liveness Analysis Pass llvm-svn: 197254	2013-12-13 18:37:03 +00:00
Andrew Trick	8df84fa2f2	Convert liveness tracking to work on a sub-register level instead of just register units. llvm-svn: 197253	2013-12-13 18:36:56 +00:00
Chad Rosier	e139dd4fe6	[AArch64] Simplify the Neon Scalar3Same patterns for floating-point reciprocal step, floating-point reciprocal square root step, floating-point absolute difference, and integer/floating-point compare instructions. Also, move the scalar general arithmetic operation patterns closer to similar code. No functional change intended. llvm-svn: 197250	2013-12-13 17:56:44 +00:00
Rafael Espindola	1caa693a7b	Assume defaults to produce smaller datalayout strings. llvm-svn: 197249	2013-12-13 17:56:11 +00:00
Rafael Espindola	dfc1470d2d	Fix pr18235. The cpp backend is not a reasonable fallback for a missing target. It is a very special backend, so it is reasonable to use it only if explicitly requested. While at it, simplify the interface a bit. llvm-svn: 197241	2013-12-13 16:05:32 +00:00
Richard Sandiford	0847c450b6	[SystemZ] Optimize X [!=]= Y in cases where X - Y or Y - X is also computed In those cases it's better to compare the result of the subtraction against zero. llvm-svn: 197239	2013-12-13 15:50:30 +00:00
Richard Sandiford	c3dc44781b	[SystemZ] Make more use of TMHH This originally came about after noticing that InstCombine turns some of the TMHH (icmp (and...), ...) tests into plain comparisons. Since there is no instruction to compare with a 64-bit immediate, TMHH is generally better than an ordered comparison for the cases that it can handle. llvm-svn: 197238	2013-12-13 15:46:55 +00:00
Iain Sandoe	680385830f	test commit. Amend a comment. llvm-svn: 197237	2013-12-13 15:46:48 +00:00
Richard Sandiford	57485472e2	[SystemZ] Extend integer absolute selection This patch makes more use of LPGFR and LNGFR. It builds on top of the LTGFR selection from r197234. Most of the tests are motivated by what InstCombine would produce. llvm-svn: 197236	2013-12-13 15:35:00 +00:00
Richard Sandiford	d420f7344f	[SystemZ] Add a structure to represent a selected comparison ...in an attempt to rein back the increasingly complex selection code. A knock-on effect is that ICmpType is exposed from the outset, which slightly simplifies adjustSubwordCmp. The code is no piece of art even after this change, but at least it should be slightly better. No behavioral change intended. llvm-svn: 197235	2013-12-13 15:28:45 +00:00
Richard Sandiford	bd2f0e9cd0	[SystemZ] Make more use of LTGFR InstCombine turns (sext (trunc)) into (ashr (shl)), then converts any comparison of the ashr against zero into a comparison of the shl against zero. This makes sense in itself, but we want to undo it for z, since the sign- extension instruction has a CC-setting form. I've included tests for both the original and InstCombined variants, but the former already worked. The patch fixes the latter. llvm-svn: 197234	2013-12-13 15:07:39 +00:00
Benjamin Kramer	e723bb10b0	X86: When lowering shl_parts, don't emit shift amounts larger than the bit width. While it's safe for the X86-specific shift nodes, dag combining will kill generic nodes. Insert an AND to make it safe, isel will nuke it as x86's shift instructions have an implicit AND. Fixes PR16108, which contains a contraption to hit this case in between constant folders. llvm-svn: 197228	2013-12-13 13:40:24 +00:00
Joerg Sonnenberger	002a14765e	Enabling thumb2 mode used to force support for armv6t2. Replace this with a temporary assertion and adjust the various test cases. llvm-svn: 197224	2013-12-13 11:16:00 +00:00
Matheus Almeida	e0d75aacf1	[mips] Add checks for alignment and maximum displacements for most of the branch instructions for mips and micromips instruction sets thus avoiding the situation of generating branches to undesired locations if offsets cannot be encoded. This patch also checks if a fixup cannot be applied and returns a fatal error if that's the case. llvm-svn: 197223	2013-12-13 11:11:02 +00:00
Chandler Carruth	37d25de459	[inliner] Fix PR18206 by preventing inlining functions that call setjmp through an invoke instruction. The original patch for this was written by Mark Seaborn, but I've reworked his test case into the existing returns_twice test case and implemented the fix by the prior refactoring to actually run the cost analysis over invoke instructions, and then here fixing our detection of the returns_twice attribute to work for both calls and invokes. We never noticed because we never saw an invoke. =[ llvm-svn: 197216	2013-12-13 08:00:01 +00:00
Chandler Carruth	0814d2adf0	[inliner] Completely change (and fix) how the inline cost analysis handles terminator instructions. The inline cost analysis inheritted some pretty rough handling of terminator insts from the original cost analysis, and then made it much, much worse by factoring all of the important analyses into a separate instruction visitor. That instruction visitor never visited the terminator. This works fine for things like conditional branches, but for many other things we simply computed The Wrong Value. First example are unconditional branches, which should be free but were counted as full cost. This is most significant for conditional branches where the condition simplifies and folds during inlining. We paid a 1 instruction tax on every branch in a straight line specialized path. =[ Oh, we also claimed that the unreachable instruction had cost. But it gets worse. Let's consider invoke. We never applied the call penalty. We never accounted for the cost of the arguments. Nope. Worse still, we didn't handle the correctness constraints of not inlining recursive invokes, or exception throwing returns_twice functions. Oops. See PR18206. Sadly, PR18206 requires yet another fix, but this refactoring is at least a huge step in that direction. llvm-svn: 197215	2013-12-13 07:59:56 +00:00
David Blaikie	04adff775f	Revert "DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers" This reverts commit r197210. llvm-svn: 197211	2013-12-13 06:43:32 +00:00
David Blaikie	753c6e4eb2	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. Originally committed as r197073 and reverted in r197079. Recommitted as r197197 to reproduce the failure and reverted as r197199 Turns out there was unstable ordering in the type unit dumping code. Fixed by using MapVector in DWARFContext to store the debug_types comdat sections. llvm-svn: 197210	2013-12-13 06:27:38 +00:00
Kai Nacke	87b23aec08	Change stack probing code for MingW. Since gcc 4.6 the compiler uses ___chkstk_ms which has the same semantics as the MS CRT function __chkstk. This simplifies the prologue generation a bit. Reviewed by Rafael Espíndola. llvm-svn: 197205	2013-12-13 05:37:05 +00:00
David Blaikie	6201712bb0	Revert "DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers" This reverts commit r197197. llvm-svn: 197199	2013-12-13 01:24:54 +00:00
Yuchen Wu	342714c11c	llvm-cov: Added -b option for branch probabilities. This option tells llvm-cov to print out branch probabilities when a basic block contains multiple branches. It also prints out some function summary info including the number of times the function enters, the percent of time it returns, and how many blocks were executed. Also updated tests. llvm-svn: 197198	2013-12-13 01:15:07 +00:00
David Blaikie	baaf74d4ca	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. Originally committed as r197073 and reverted in r197079. This commit originally got jumbled up with another build-breaking commit and I can't find the failures I thought this caused anymore. Recommitting to hopefully get some clean buildbot results to work from. I have a sneaking suspicion there's unstable output in the comdat group output of MCStreamer... llvm-svn: 197197	2013-12-13 01:06:41 +00:00
Hal Finkel	f59fd7dcb4	Fix a use-after-free error in GlobalOpt CleanupConstantGlobalUsers GlobalOpt's CleanupConstantGlobalUsers function uses a worklist array to manage constant users to be visited. The pointers in this array need to be weak handles because when we delete a constant array, we may also be holding a pointer to one of its elements (or an element of one of its elements if we're dealing with an array of arrays) in the worklist. Fixes PR17347. llvm-svn: 197178	2013-12-12 20:45:24 +00:00
Hal Finkel	26fc4c29c6	Initialize the barrier pass llvm::initializeIPO The barrier pass is a temporary hack, and should go away soon. Nevertheless, if we don't initialize it, then opt will not understand -barrier, and this will break bugpoint (because when it dumps the passes from the default pass manager -barrier will be there). llvm-svn: 197177	2013-12-12 20:45:08 +00:00
Rafael Espindola	720ae4f885	Simplify the datalayout string of ARM and AArch64. No functionality change. Reviewed by Tim Northover. llvm-svn: 197172	2013-12-12 17:43:37 +00:00
Rafael Espindola	3db958387f	Simplify the SystemZ datalayout string. Reviewed by Richard Sandiford. llvm-svn: 197170	2013-12-12 17:30:07 +00:00
Rafael Espindola	e8f4d58700	Use "a" instead of "a0" in DataLayout. It means exactly the same and is just a bit shorter. llvm-svn: 197169	2013-12-12 17:21:51 +00:00
Rafael Espindola	b75ea019ea	Fix Typo. llvm-svn: 197168	2013-12-12 16:17:40 +00:00
Rafael Espindola	1f58e4dc11	Convert the other getHostByName implementations to StringRef. llvm-svn: 197166	2013-12-12 16:10:48 +00:00
Rafael Espindola	32cb5ac904	Switch to the new MingW ABI. GCC 4.7 changed the MingW ABI. On the LLVM side it means that sret functions don't pop the stack. llvm-svn: 197163	2013-12-12 16:06:58 +00:00
Chad Rosier	4055f42d22	[AArch64] Removed unnecessary copy patterns with v1fx types. - Copy patterns with float/double types are enough. - Fix typos in test case names that were using v1fx. - There is no ACLE intrinsic that uses v1f32 type. And there is no conflict of neon and non-neon ovelapped operations with this type, so there is no need to support operations with this type. - Remove v1f32 from FPR32 register and disallow v1f32 as a legal type for operations. Patch by Ana Pazos! llvm-svn: 197159	2013-12-12 15:46:29 +00:00
Rafael Espindola	74f444cde5	Return a StringRef from getHostCPUName. llvm-svn: 197158	2013-12-12 15:45:32 +00:00
Chandler Carruth	cb5beb347a	[cleanup] Remove trailing whitespace before I start changing this file. llvm-svn: 197149	2013-12-12 11:59:26 +00:00
Andrea Di Biagio	9b5c3dcf01	Added new X86 patterns to select SSE scalar fp arithmetic instructions from a vector packed single/double fp operation followed by a vector insert. The effect is that the backend coverts the packed fp instruction followed by a vectro insert into a SSE or AVX scalar fp instruction. For example, given the following code: __m128 foo(__m128 A, __m128 B) { __m128 C = A + B; return (__m128) {c[0], a[1], a[2], a[3]}; } previously we generated: addps %xmm0, %xmm1 movss %xmm1, %xmm0 we now generate: addss %xmm1, %xmm0 llvm-svn: 197145	2013-12-12 11:50:47 +00:00
Gabor Greif	5fde43bf2e	typo in comment llvm-svn: 197136	2013-12-12 08:00:34 +00:00
Hao Liu	46a10eec28	[AArch64]Fix the problem that AArch64 backend fails to select scalar_to_vector of vector types having more than one element. llvm-svn: 197135	2013-12-12 07:36:26 +00:00
Alp Toker	d0d1a74ac9	Add missing escape characters to the new Regex::escape() function The old AddFixedStringToRegEx() it was based on got away with this for the longest time, but the problem became easy to spot after the cleanup in r197096. Also add a quick unit test to cover regex escaping. llvm-svn: 197121	2013-12-12 02:51:58 +00:00
Reed Kotler	3230e725aa	Check for null pointer before dereferencing. A careless typo on my part. I don't know why this did not show up earlier. This code has been around for ages. llvm-svn: 197119	2013-12-12 02:41:11 +00:00
Yi Jiang	f92a574246	Resubmit r196544: Apply transformation on OS X 10.9+ and iOS 7.0+: pow(10, x) ―> __exp10(x) llvm-svn: 197109	2013-12-12 01:55:04 +00:00
Yi Jiang	53823be49d	Add TargetLibraryInfo in LTO passes builder llvm-svn: 197105	2013-12-12 01:37:39 +00:00
Hal Finkel	fa50630e43	Remove unused multiclass from PPCInstrInfo.td llvm-svn: 197100	2013-12-12 00:23:29 +00:00
Hal Finkel	ceb1f12d9a	Improve instruction scheduling for the PPC POWER7 Aside from a few minor latency corrections, the major change here is a new hazard recognizer which focuses on better dispatch-group formation on the POWER7. As with the PPC970's hazard recognizer, the most important thing it does is avoid load-after-store hazards within the same dispatch group. It uses the POWER7's special dispatch-group-terminating nop instruction (instead of inserting multiple regular nop instructions). This new hazard recognizer makes use of the scheduling dependency graph itself, built using AA information, to robustly detect the possibility of load-after-store hazards. significant test-suite performance changes (the error bars are 99.5% confidence intervals based on 5 test-suite runs both with and without the change -- speedups are negative): speedups: MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2 -0.55171% +/- 0.333168% MultiSource/Benchmarks/TSVC/CrossingThresholds-dbl/CrossingThresholds-dbl -17.5576% +/- 14.598% MultiSource/Benchmarks/TSVC/Reductions-dbl/Reductions-dbl -29.5708% +/- 7.09058% MultiSource/Benchmarks/TSVC/Reductions-flt/Reductions-flt -34.9471% +/- 11.4391% SingleSource/Benchmarks/BenchmarkGame/puzzle -25.1347% +/- 11.0104% SingleSource/Benchmarks/Misc/flops-8 -17.7297% +/- 9.79061% SingleSource/Benchmarks/Shootout-C++/ary3 -35.5018% +/- 23.9458% SingleSource/Regression/C/uint64_to_float -56.3165% +/- 25.4234% SingleSource/UnitTests/Vectorizer/gcc-loops -18.5309% +/- 6.8496% regressions: MultiSource/Benchmarks/ASCI_Purple/SMG2000/smg2000 18.351% +/- 12.156% SingleSource/Benchmarks/Shootout-C++/methcall 27.3086% +/- 14.4733% llvm-svn: 197099	2013-12-12 00:19:11 +00:00
Quentin Colombet	18b779e3f4	Fix an over-constrained assertion in MachineFunction::addLiveIn. The assertion was checking that the virtual register VReg used to represent the physical register PReg uses the same register class as the one passed to MachineFunction::addLiveIn. This is over-constraining because it is sufficient to check that the register class of VReg (VRegRC) is a subclass of the register class of PReg (PRegRC) and that VRegRC contains PReg. Indeed, if VReg gets constrained because of some operation constraints between two calls of MachineFunction::addLiveIn, the original assertion cannot match. This fixes <rdar://problem/15633429>. llvm-svn: 197097	2013-12-12 00:15:47 +00:00
Hans Wennborg	6f4f77b7e9	Expose FileCheck's AddFixedStringToRegEx as Regex::escape Both FileCheck and clang's -verify need to escape strings for regexes, so let's expose this as a utility in the Regex class. llvm-svn: 197096	2013-12-12 00:06:41 +00:00
Chad Rosier	446d8ea0fb	[AArch64] Refactor NEON floating-point Max/Min/Maxnm/Minnm across vector AArch64 intrinsics to use f32 types, rather than their vector equivalents. llvm-svn: 197090	2013-12-11 23:21:25 +00:00
Hal Finkel	94a6f380bb	Fix the PPC subsumes-predicate check For one predicate to subsume another, they must both check the same condition register. Failure to check this prerequisite was causing miscompiles. Fixes PR18003. llvm-svn: 197089	2013-12-11 23:12:25 +00:00
Hal Finkel	4fd3b1de2a	Add two additional hazard recognizer functions This adds two additional functions to the hazard recognizer interface. These are optional (in the sense that the default implementations preserve the current behavior), and used by the post-RA scheduler. Upcoming commits will use this functionality in order to improve dispatch-group formation on the POWER7 and related cores. Dispatch groups are an odd construct: sometimes we need to insert nops to force a new one to start (for performance reasons), and some instructions need to appear in certain positions within a group, but the groups are not fundamentally cycle based (they can contain instructions with data dependencies with non-trivial latencies). Motivation: unsigned PreEmitNoops(SUnit ) - Used to force the post-RA scheduler to insert nops to force a new dispatch group to begin. We already have a NoopHazard, and this is also still needed. However, NoopHazard only causes a nop to be inserted if there are no other available instructions, and so is not always sufficient. The number of nops to insert depends on state that only the hazard recognizer has, so a general callback is necessary. bool ShouldPreferAnother(SUnit ) - Used to avoid scheduling instructions that would start a new dispatch group when others are available that could be part of the current dispatch group. In this case, we don't want to issue nops, because the non-preferred instruction will implicitly start a new dispatch group regardless. Although the motivation for these functions is driven by the PowerPC backend, they are completely general. llvm-svn: 197084	2013-12-11 22:33:43 +00:00
Rafael Espindola	2b5a0c9e68	On ELF and COFF treat linker_private like private. The linkers on these systems don't have anything special to do with these symbols. Since the intent is for them to be absent from the final object, just treat them as private. llvm-svn: 197080	2013-12-11 22:18:44 +00:00
David Blaikie	727747eb29	Revert "DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers" This reverts commit r197073. The test seems to be failing on some buildbots for unknown reasons. Reverting until I can figure that out. If anyone's got a reproduction (.s and .o together would be great) - I'd really appreciate it. llvm-svn: 197079	2013-12-11 22:08:39 +00:00
David Blaikie	4fe3c00eed	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. llvm-svn: 197073	2013-12-11 21:36:27 +00:00
David Blaikie	3332d4c75f	DwarfUnit: LLVM_OVERRIDE and constify some functions llvm-svn: 197072	2013-12-11 21:14:02 +00:00
Chad Rosier	088f93d4b5	[AArch64] Add NEON scalar floating-point compare LLVM AArch64 intrinsics that use f32/f64 types, rather than their vector equivalents. llvm-svn: 197068	2013-12-11 21:03:46 +00:00
Chad Rosier	473a01e1c9	[AArch64] Refactor the NEON scalar floating-point reciprocal step and floating-point reciprocal square root step LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197067	2013-12-11 21:03:43 +00:00
Chad Rosier	7098fcc062	[AArch64] Refactor the NEON scalar floating-point reciprocal estimate, floating- point reciprocal exponent, and floating-point reciprocal square root estimate LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197066	2013-12-11 21:03:40 +00:00
Rafael Espindola	009e758628	Don't set unused variable. llvm-svn: 197064	2013-12-11 20:40:57 +00:00
Tom Stellard	d7e146ede6	R600: Re-format Processors.td This makes it a little easier to read. Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 197058	2013-12-11 17:51:51 +00:00
Tom Stellard	f2ba972af6	R600: Register AMDGPUCFGStructurizer pass This enables -print-before-all to dump MachineInstrs after it is run. Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 197057	2013-12-11 17:51:47 +00:00
Tom Stellard	1de5582d06	R600: Register R600EmitClauseMarkers pass This enables -print-before-all to dump MachineInstrs after it is run. Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 197056	2013-12-11 17:51:41 +00:00
Logan Chien	439e8f9e38	[arm] Implement ARM .arch directive. llvm-svn: 197052	2013-12-11 17:16:25 +00:00
Benjamin Kramer	671a596282	SelectionDAG: Fix a typo. Found by "cppcheck". PR18208. llvm-svn: 197047	2013-12-11 16:36:09 +00:00
Tim Northover	76fc8a4c40	ARM: constrain register-class in fast-isel The tests were no longer using fast-isel at all (MachO needs an "ios" rather than "darwin" triple at the moment and Linux needs ARM mode). Once that was corrected, the verifier complained about a t2ADDri created for the alloca. llvm-svn: 197046	2013-12-11 16:04:57 +00:00
Alp Toker	b30f01ee42	Build fix for Android NDK which has neither futimes nor futimens Based on a patch by Neil Henning! llvm-svn: 197045	2013-12-11 15:42:33 +00:00
Elena Demikhovsky	cf08809813	AVX-512: Removed "z" suffix from AVX-512 instructions, since it is incompatible with GCC. I moved a test from avx512-vbroadcast-crash.ll to avx512-vbroadcast.ll I defined HasAVX512 predicate as AssemblerPredicate. It means that you should invoke llvm-mc with "-mcpu=knl" to get encoding for AVX-512 instructions. I need this to let AsmMatcher to set different encoding for AVX and AVX-512 instructions that have the same mnemonic and operands (all scalar instructions). llvm-svn: 197041	2013-12-11 14:31:04 +00:00
Richard Sandiford	73170f8488	[SystemZ] Optimize fcmp X, 0 in cases where X is also negated In such cases it's often better to test the result of the negation instead, since the negation also sets CC. llvm-svn: 197032	2013-12-11 11:45:08 +00:00
Richard Sandiford	d1093636cc	Extend (truncate (load)) folding DAGCombiner could fold (truncate (load)) -> smaller load if the original load was the width of the truncation result or wider. This patch extends it to handle cases where the original load was narrower (and so the extension type stays the same). llvm-svn: 197030	2013-12-11 11:37:27 +00:00
Andrew Trick	2d8826a1b5	Add TargetRegisterInfo::reverseLocalAssignment hook. This hook reverses the order of assignment for local live ranges. This will generally allocate shorter local live ranges first. For targets with many registers, this could reduce regalloc compile time by a large factor. It should still achieve optimal coloring; however, it can change register eviction decisions. It is disabled by default for two reasons: (1) Top-down allocation is simpler and easier to debug for targets that don't benefit from reversing the order. (2) Bottom-up allocation could result in poor evicition decisions on some targets affecting the performance of compiled code. llvm-svn: 197001	2013-12-11 03:40:15 +00:00
Reed Kotler	5bde5c35f4	Distinguish and choose 16 or 32 bit forms of save/restore for Mips16. llvm-svn: 196999	2013-12-11 03:32:44 +00:00
Kevin Qin	310b6c08ba	[AArch64 NEON] Get instruction BSL matched to VSELECT. llvm-svn: 196998	2013-12-11 02:33:50 +00:00
Rafael Espindola	b2fb78d45a	Move mips' datalayout computation out of line and add comments. llvm-svn: 196996	2013-12-11 01:41:10 +00:00
Rafael Espindola	60f48e5a67	Move Sparc's getDataLayout out of line and add comments. llvm-svn: 196990	2013-12-11 01:07:43 +00:00
NAKAMURA Takumi	8bc9bfaa5a	Prune redundant dependencies in LLVMBuild.txt. llvm-svn: 196988	2013-12-11 00:30:57 +00:00
Rafael Espindola	5b3585871b	Move PPC's getDataLayoutString out of line and document it better. llvm-svn: 196987	2013-12-11 00:09:06 +00:00
Reid Kleckner	ad92aca47c	Revert the backend fatal error from r196939 The combination of inline asm, stack realignment, and dynamic allocas turns out to be too common to reject out of hand. ASan inserts empy inline asm fragments and uses aligned allocas. Compiling any trivial function containing a dynamic alloca with ASan is enough to trigger the check. XFAIL the test cases that would be miscompiled and add one that uses the relevant functionality. llvm-svn: 196986	2013-12-10 23:23:52 +00:00
Rafael Espindola	002f8aa584	Refactor the computation of the x86 datalayout. llvm-svn: 196976	2013-12-10 22:05:32 +00:00
Reid Kleckner	30b2a9a59f	[asan] Fix the coverage.cc test broken by r196939 It was failing because ASan was adding all of the following to one function: - dynamic alloca - stack realignment - inline asm This patch avoids making the static alloca dynamic when coverage is used. ASan should probably not be inserting empty inline asm blobs to inhibit duplicate tail elimination. llvm-svn: 196973	2013-12-10 21:49:28 +00:00
Matt Arsenault	eaa3a7efab	Use llvm_unreachable instead of assert(0) llvm-svn: 196971	2013-12-10 21:37:42 +00:00
David Fang	1b01849f2d	on darwin<10, fallback to .weak_definition (PPC,X86) .weak_def_can_be_hidden was not yet supported by the system assembler llvm-svn: 196970	2013-12-10 21:37:41 +00:00
Chad Rosier	f70af21651	[AArch64] Refactor the NEON floating-point absolute difference LLVM AArch64 intrinsic to use f32/f64 types, rather than their vector equivalents. llvm-svn: 196965	2013-12-10 21:33:59 +00:00
Chad Rosier	07cc3f9100	[AArch64] Refactor the NEON signed/unsigned floating-point convert to fixed-point LLVM AArch64 intrinsics to use f32/f64, rather than their vector equivalents. llvm-svn: 196964	2013-12-10 21:33:56 +00:00
Chad Rosier	98b8baa35c	[AArch64] Overload NEON signed/unsigned floating-point convert to fixed-point and fixed-point convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196963	2013-12-10 21:33:53 +00:00
Chad Rosier	cc34d187b8	[AArch64] Overload NEON signed/unsigned integer convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196962	2013-12-10 21:33:50 +00:00
Matt Arsenault	0f5f015bfd	Fix gcc warnings. Unused variable and unused typedef in release build. llvm-svn: 196947	2013-12-10 18:55:37 +00:00
Reid Kleckner	ee08897fb8	Reland "Fix miscompile of MS inline assembly with stack realignment" This re-lands commit r196876, which was reverted in r196879. The tests have been fixed to pass on platforms with a stack alignment larger than 4. Update to clang side tests will land shortly. llvm-svn: 196939	2013-12-10 18:27:32 +00:00
Tim Northover	9653eb5759	Make Triple's isOSBinFormatXXX functions partition triple-space. Most users would be surprised if "isCOFF" and "isMachO" were simultaneously true, unless they'd put the compiler in a box with a gun attached to a photon detector. This makes sure precisely one of the three formats is true for any triple and simplifies some target logic based on that. llvm-svn: 196934	2013-12-10 16:57:43 +00:00
Chad Rosier	7a9bba442f	[AArch64] Refactor the Neon vector/scalar floating-point convert intrinsics so that they use float/double rather than the vector equivalents when appropriate. llvm-svn: 196930	2013-12-10 16:11:39 +00:00
Chad Rosier	fcc4c366d1	[AArch64] Refactor the Neon vector/scalar floating-point convert implementation. Specifically, reuse the ARM intrinsics when possible. llvm-svn: 196926	2013-12-10 15:35:33 +00:00
Andrea Di Biagio	f7c33c8162	Ensure that the backend no longer emits unnecessary vector insert instructions immediately after SSE scalar fp instructions like addss or mulss. Added patterns to select SSE scalar fp arithmetic instructions from a scalar fp operation followed by a blend. For example, given the following code: __m128 foo(__m128 A, __m128 B) { A[0] += B[0]; return A; } previously we generated: addss %xmm0, %xmm1 movss %xmm1, %xmm0 now we generate: addss %xmm1, %xmm0 llvm-svn: 196925	2013-12-10 15:22:48 +00:00
Vincent Lejeune	cc0ea74c7b	R600: Fix an infinite loop when trying to reorganize export/tex vector input llvm-svn: 196923	2013-12-10 14:43:31 +00:00
Vincent Lejeune	f92d64d160	R600: Fix input modifiers lost for Cayman llvm-svn: 196922	2013-12-10 14:43:27 +00:00
Reed Kotler	0ff4001781	Next step in Mips16 prologue/epilogue cleanup. Save S2(reg 18) only when we are calling floating point stubs that have a return value of float or complex. Some more work to make this better but this is the first step. llvm-svn: 196921	2013-12-10 14:29:38 +00:00
Elena Demikhovsky	e382c3fdcd	AVX-512: changed intrinsics for mask operations llvm-svn: 196918	2013-12-10 13:53:10 +00:00
Elena Demikhovsky	6270b388c8	AVX-512: Changed intrinsics of VPCONFLICT to match GCC builtin form llvm-svn: 196914	2013-12-10 11:58:35 +00:00
Tim Northover	3e8df696ea	Darwin: update default iOS version to 5.0 Defaulting to iOS 3.0 when LLVM has to guess the version is no longer a useful option and can give surprising results (like tail calls being disabled). 5.0 seems like a reasonable compromise as a platform that's still interesting to some people. rdar://problem/15567348 llvm-svn: 196912	2013-12-10 11:53:16 +00:00

... 2 3 4 5 6 ...

66154 Commits