llvm-project

Commit Graph

Author	SHA1	Message	Date
Kostya Serebryany	4d237a8503	[asan] decrease asan-instrumentation-with-call-threshold from 10000 to 7000, see PR17409 llvm-svn: 209623	2014-05-26 11:57:16 +00:00
Owen Anderson	115aa160e6	Make the LoopRotate pass's maximum header size configurable both programmatically and via the command line, mirroring similar functionality in LoopUnroll. In situations where clients used custom unrolling thresholds, their intent could previously be foiled by LoopRotate having a hardcoded threshold. llvm-svn: 209617	2014-05-26 08:58:51 +00:00
David Blaikie	ab53c91010	DwarfUnit: Remove some misleading no-op code introduced in r204162. Post commit review feedback from Manman called this out, but it looks like it slipped through the cracks. llvm-svn: 209611	2014-05-26 05:32:21 +00:00
David Blaikie	ea86226774	DebugInfo: Fix inlining with #file directives a little harder Seems my previous fix was insufficient - we were still not adding the inlined function to the abstract scope list. Which meant it wasn't flagged as inline, didn't have nested lexical scopes in the abstract definition, and didn't have abstract variables - so the inlined variable didn't reference an abstract variable, instead being described completely inline. llvm-svn: 209602	2014-05-25 18:11:35 +00:00
Rafael Espindola	4a04c4b69c	Emit data or code export directives based on the type. Currently we look at the Aliasee to decide what type of export directive to use. It seems better to use the type of the alias directly. This is similar to how we handle the alias having the same address but other attributes (linkage, visibility) from the aliasee. With this patch it is now possible to do things like target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128" target triple = "x86_64-pc-windows-msvc" @foo = global [6 x i8] c"\B8\00\00\00\C3", section ".text", align 16 @f = dllexport alias i32 (), [6 x i8] @foo !llvm.module.flags = !{!0} !0 = metadata !{i32 6, metadata !"Linker Options", metadata !1} !1 = metadata !{metadata !2, metadata !3} !2 = metadata !{metadata !"/DEFAULTLIB:libcmt.lib"} !3 = metadata !{metadata !"/DEFAULTLIB:oldnames.lib"} llvm-svn: 209600	2014-05-25 12:49:07 +00:00
Peter Collingbourne	0a4376190f	Add an extension point for peephole optimizers. This extension point allows adding passes that perform peephole optimizations similar to the instruction combiner. These passes will be inserted after each instance of the instruction combiner pass. Differential Revision: http://reviews.llvm.org/D3905 llvm-svn: 209595	2014-05-25 10:27:02 +00:00
Hans Wennborg	12d1e24da2	Fix some misplaced spaces around 'override' llvm-svn: 209589	2014-05-24 20:19:40 +00:00
Tim Northover	391f93a554	AArch64: disable FastISel for large code model. The code emitted is what would be expected for the small model, so it shouldn't be used when objects can be the full 64-bits away. This fixes MCJIT tests on Linux. llvm-svn: 209585	2014-05-24 19:45:41 +00:00
Benjamin Kramer	5256ce37ac	MachineVerifier: Clean up some syntactic weirdness left behind by find&replace. No functionality change. llvm-svn: 209581	2014-05-24 13:31:10 +00:00
Benjamin Kramer	389cec0d3e	CodeGen: Make MachineBasicBlock::back skip to the beginning of the last bundle. This makes front/back symmetric with begin/end, avoiding some confusion. Added instr_front/instr_back for the old behavior, corresponding to instr_begin/instr_end. Audited all three in-tree users of back(), all of them look like they don't want to look inside bundles. Fixes an assertion (PR19815) when generating debug info on mips, where a delay slot was bundled at the end of a branch. llvm-svn: 209580	2014-05-24 13:13:17 +00:00
Tim Northover	3b0846e8f7	AArch64/ARM64: move ARM64 into AArch64's place This commit starts with a "git mv ARM64 AArch64" and continues out from there, renaming the C++ classes, intrinsics, and other target-local objects for consistency. "ARM64" test directories are also moved, and tests that began their life in ARM64 use an arm64 triple, those from AArch64 use an aarch64 triple. Both should be equivalent though. This finishes the AArch64 merge, and everyone should feel free to continue committing as normal now. llvm-svn: 209577	2014-05-24 12:50:23 +00:00
Tim Northover	cc08e1fe1b	AArch64/ARM64: remove AArch64 from tree prior to renaming ARM64. I'm doing this in two phases for a better "git blame" record. This commit removes the previous AArch64 backend and redirects all functionality to ARM64. It also deduplicates test-lines and removes orphaned AArch64 tests. The next step will be "git mv ARM64 AArch64" and rewire most of the tests. Hopefully LLVM is still functional, though it would be even better if no-one ever had to care because the rename happens straight afterwards. llvm-svn: 209576	2014-05-24 12:42:26 +00:00
Michael Zolotukhin	d4c724625a	Implement sext(C1 + C2X) --> sext(C1) + sext(C2X) and sext{C1,+,C2} --> sext(C1) + sext{0,+,C2} transformation in Scalar Evolution. That helps SLP-vectorizer to recognize consecutive loads/stores. <rdar://problem/14860614> llvm-svn: 209568	2014-05-24 08:09:57 +00:00
Tim Northover	e471e43484	ARM64: extract a 32-bit subreg when selecting an inreg extend After the load/store refactoring, we were sometimes trying to feed a GPR64 into a 32-bit register offset operand. This failed in copyPhysReg. llvm-svn: 209566	2014-05-24 07:05:42 +00:00
Rafael Espindola	ef2c4fb25b	clang-format function. llvm-svn: 209550	2014-05-23 20:39:23 +00:00
Rafael Espindola	d246759973	Remove a confusing use of a static method. No functionality change. llvm-svn: 209548	2014-05-23 20:35:47 +00:00
David Blaikie	169ffe41af	DebugInfo: Put concrete definitions referencing abstract definitions in the same scope as the abstract definition. This seems like a simple cleanup/improved consistency, but also helps lay the foundation to fix the bug mentioned in the test case: concrete definitions preceeding any inlined usage aren't properly split into concrete + abstract (because they're not known to need it until it's too late). Once we start deferring this choice until later, we won't have the choice to put concrete definitions for inlined subroutines in a different scope from concrete definitions for non-inlined subroutines (since we won't know at time-of-construction which one it'll be). This change brings those two cases into alignment ahead of that future chaneg/fix. llvm-svn: 209547	2014-05-23 20:25:15 +00:00
Andrew Trick	839e30b2c0	Fix and improve SCEV ComputeBackedgeTankCount. This is a follow-up to r209358: PR19799: Indvars miscompile due to an incorrect max backedge taken count from SCEV. That fix was incomplete as pointed out by Arnold and Michael Z. The code was also too confusing. It needed a careful rewrite with more unit tests. This version will also happen to optimize more cases. <rdar://17005101> PR19799: Indvars miscompile... llvm-svn: 209545	2014-05-23 19:47:13 +00:00
Rafael Espindola	a5bb2f61cf	Use alias linkage and visibility to decide tls access mode. This matches both what we do for the non-thread case and what gcc does. With this patch clang would match gcc's behaviour in static __thread int a = 42; extern __thread int b __attribute__((alias("a"))); int f(void) { return &a; } int g(void) { return &b; } if not for pr19843. Manually writing the IL does produce the same access modes. It is also a step in the direction of fixing pr19844. llvm-svn: 209543	2014-05-23 19:16:56 +00:00
Jingyue Wu	bbb6e4a885	Add the extracted constant offset using GEP Fixed a TODO in r207783. Add the extracted constant offset using GEP instead of ugly ptrtoint+add+inttoptr. Using GEP simplifies future optimizations and makes IR easier to understand. Updated all affected tests, and added a new test in split-gep.ll to cover a corner case where emitting uglygep is necessary. llvm-svn: 209537	2014-05-23 18:39:40 +00:00
Lang Hames	8e30e4b9b7	[RuntimeDyld] Remove relocation bounds check introduced in r208375 (MachO only). We do all of our address arithmetic in 64-bit, and operations involving logically negative 32-bit offsets (actually represented as unsigned 64 bit ints) often overflow into higher bits. The overflow check could be preserved by casting to uint32 at the callsite for applyRelocationValue, but this would eliminate the value of the check. The right way to handle overflow in relocations is to make relocation processing target specific, and compute the values for RelocationEntry objects in the appropriate types (32-bit for 32-bit targets, 64-bit for 64-bit targets). This is coming as part of the cleanup I'm working on. This fixes another i386 regression test. <rdar://problem/16889891> llvm-svn: 209536	2014-05-23 18:35:44 +00:00
David Blaikie	05b8584f16	Add FIXME comment based on code review feedback by Hal Finkel on r209338 llvm-svn: 209529	2014-05-23 16:53:14 +00:00
Rafael Espindola	6314ad41d1	Aliases are always definition, delete dead code. While at it, use a range loop. llvm-svn: 209519	2014-05-23 15:18:06 +00:00
Rafael Espindola	a31f3e50dc	Delete dead code. GV is never used past this point. This was probably a copy and paste error. llvm-svn: 209518	2014-05-23 15:07:51 +00:00
Daniel Sanders	683ed961e1	[mips] Work around inconsistency in llvm-mc's placement of fixup markers Summary: Add a second fixup table to MipsAsmBackend::getFixupKindInfo() to correctly position llvm-mc's fixup placeholders for big-endian. See PR19836 for full details of the issue. To summarize, the fixup placeholders do not account for endianness properly and the implementations of getFixupKindInfo() for each target are measuring MCFixupKindInfo.TargetOffset from different ends of the instruction encoding to compensate. Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3889 llvm-svn: 209514	2014-05-23 13:35:24 +00:00
Daniel Sanders	8966caab05	[mips][mips64r6] t(eq\|ge\|lt\|ne)i and t(ge\|lt)iu are not available in MIPS32r6/MIPS64r6 Summary: Depends on D3872 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3891 llvm-svn: 209513	2014-05-23 13:24:08 +00:00
Daniel Sanders	ac27263512	[mips][mips64r6] [ls][dw][lr] are not available in MIPS32r6/MIPS64r6 Summary: Instead the system is required to provide some means of handling unaligned load/store without special instructions. Options include full hardware support, full trap-and-emulate, and hybrids such as hardware support within a cache line and trap-and-emulate for multi-line accesses. MipsSETargetLowering::allowsUnalignedMemoryAccesses() has been configured to assume that unaligned accesses are 'fast' on the basis that I expect few hardware implementations will opt for pure-software handling of unaligned accesses. The ones that do handle it purely in software can override this. mips64-load-store-left-right.ll has been merged into load-store-left-right.ll The stricter testing revealed a Bits!=Bytes bug in passByValArg(). This has been fixed and the variables renamed to clarify the units they hold. Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3872 llvm-svn: 209512	2014-05-23 13:18:02 +00:00
Kostya Serebryany	c7895a83d2	[asan] properly instrument memory accesses that have small alignment (smaller than min(8,size)) by making two checks instead of one. This may slowdown some cases, e.g. long long on 32-bit or wide loads produced after loop unrolling. The benefit is higher sencitivity. llvm-svn: 209508	2014-05-23 11:52:07 +00:00
Bradley Smith	63c8b1bcb3	Fixup sys::getHostCPUFeatures crypto names so it doesn't clash with kernel headers llvm-svn: 209506	2014-05-23 10:14:13 +00:00
Simon Atanasyan	84242dc774	[YAML] Add an optional argument `EnumMask` to the `yaml::IO::bitSetCase()`. Some bit-set fields used in ELF file headers in fact contain two parts. The first one is a regular bit-field. The second one is an enumeraion. For example ELF header `e_flags` for MIPS target might contain the following values: Bit-set values: EF_MIPS_NOREORDER = 0x00000001 EF_MIPS_PIC = 0x00000002 EF_MIPS_CPIC = 0x00000004 EF_MIPS_ABI2 = 0x00000020 Enumeration: EF_MIPS_ARCH_32 = 0x50000000 EF_MIPS_ARCH_64 = 0x60000000 EF_MIPS_ARCH_32R2 = 0x70000000 EF_MIPS_ARCH_64R2 = 0x80000000 For printing bit-sets we use the `yaml::IO::bitSetCase()`. It does not support bit-set/enumeration combinations and prints too many flags from an enumeration part. This patch fixes this problem. New method `yaml::IO::maskedBitSetCase()` handle "enumeration" part of bitset defined by provided mask. Patch reviewed by Nick Kledzik and Sean Silva. llvm-svn: 209504	2014-05-23 08:07:09 +00:00
Jingyue Wu	69a6685c8d	Test commit. The keyword "virtual" is not necessary. llvm-svn: 209501	2014-05-23 06:30:12 +00:00
David Blaikie	4860225570	Rename a couple of variables to be more accurate. It's not really a "ScopeDIE", as such - it's the abstract function definition's DIE. And we usually use "SP" for subprograms, rather than "Sub". llvm-svn: 209499	2014-05-23 05:03:23 +00:00
David Blaikie	96fb9024f2	DebugInfo: Fix cross-CU references for scopes (and variables within those scopes) in abstract definitions of cross-CU inlined functions Found by Adrian Prantl during post-commit review of r209335. llvm-svn: 209498	2014-05-23 04:23:06 +00:00
Jiangning Liu	4b5b757d65	[ARM64] Fix a bug in shuffle vector lowering to generate corect vext ISD with swapped input vectors. llvm-svn: 209495	2014-05-23 02:54:50 +00:00
Justin Bogner	cbb8438bb3	ScalarEvolution: Fix handling of AddRecs in isKnownPredicate ScalarEvolution::isKnownPredicate() can wrongly reduce a comparison when both the LHS and RHS are SCEVAddRecExprs. This checks that both LHS and RHS are guarded in the case when both are SCEVAddRecExprs. The test case is against indvars because I could not find a way to directly test SCEV. Patch by Sanjay Patel! llvm-svn: 209487	2014-05-23 00:06:56 +00:00
Lang Hames	7f9fc2b339	[RuntimeDyld] Teach RuntimeDyldMachO how to handle scattered VANILLA relocs on i386. This fixes two more MCJIT regression tests on i386: ExecutionEngine/MCJIT/2003-05-06-LivenessClobber.ll ExecutionEngine/MCJIT/2013-04-04-RelocAddend.ll The implementation of processScatteredVANILLA is tasteless (ba-dum-ching), but I'm working on a substantial tidy-up of RuntimeDyldMachO that should improve things. This patch also fixes a type-o in RuntimeDyldMachO::processSECTDIFFRelocation, and teaches that method to skip over the PAIR reloc following the SECTDIFF. <rdar://problem/16961886> llvm-svn: 209478	2014-05-22 22:30:13 +00:00
Matt Arsenault	46b51b7f62	R600: Add definition for flat address space ID. Use 4 since that's probably what it will be for spir. Move ADDRESS_NONE to the end to keep the constant_buffer_* values unchanged, since apparently a bunch of r600 tests use those directly. llvm-svn: 209463	2014-05-22 18:27:07 +00:00
Matt Arsenault	05e96f4444	R600: Try to convert BFE back to standard bit ops when possible. This allows existing DAG combines to work on them, and then we can re-match to BFE if necessary during instruction selection. llvm-svn: 209462	2014-05-22 18:09:12 +00:00
Matt Arsenault	5565f65e13	R600: Add dag combine for BFE llvm-svn: 209461	2014-05-22 18:09:07 +00:00
Matt Arsenault	bf8694d36d	R600: Implement ComputeNumSignBitsForTargetNode for BFE llvm-svn: 209460	2014-05-22 18:09:03 +00:00
Matt Arsenault	af6df9d943	R600: Implement computeMaskedBitsForTargetNode for BFE llvm-svn: 209459	2014-05-22 18:09:00 +00:00
Matt Arsenault	493c5f1bc4	R600: Expand mul24 for GPUs without it llvm-svn: 209458	2014-05-22 18:00:24 +00:00
Matt Arsenault	f15a05623e	R600: Expand mad24 for GPUs without it llvm-svn: 209457	2014-05-22 18:00:20 +00:00
Matt Arsenault	eb260206c3	R600: Add intrinsics for mad24 llvm-svn: 209456	2014-05-22 18:00:15 +00:00
Eric Christopher	9eff5178f1	Return false if we're not going to do anything. llvm-svn: 209455	2014-05-22 17:49:33 +00:00
Matt Arsenault	f37abc71de	R600/SI: Move instruction pattern to instruction definition llvm-svn: 209454	2014-05-22 17:45:20 +00:00
Diego Novillo	0b761a48cf	Remove LLVMContextImpl::optimizationRemarkEnabledFor. Summary: This patch moves the handling of -pass-remarks* over to lib/DiagnosticInfo.cpp. This allows the removal of the optimizationRemarkEnabledFor functions from LLVMContextImpl, as they're not needed anymore. Reviewers: qcolombet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3878 llvm-svn: 209453	2014-05-22 17:19:01 +00:00
Andrea Di Biagio	c8dd1ad85b	[X86] Improve the lowering of BITCAST from MVT::f64 to MVT::v4i16/MVT::v8i8. This patch teaches the x86 backend how to efficiently lower ISD::BITCAST dag nodes from MVT::f64 to MVT::v4i16 (and vice versa), and from MVT::f64 to MVT::v8i8 (and vice versa). This patch extends the logic from revision 208107 to also handle MVT::v4i16 and MVT::v8i8. Also, this patch correctly propagates Undef values when performing the widening of a vector (example: when widening from v2i32 to v4i32, the upper 64bits of the resulting vector are 'undef'). llvm-svn: 209451	2014-05-22 16:21:39 +00:00
Tim Northover	b2a6fdb11a	ARM64: remove '#' from annotation of add/sub immediate The full string used to be "// =#12" for example, which looks too busy. llvm-svn: 209443	2014-05-22 14:20:05 +00:00
Diego Novillo	7f8af8bf91	Add support for missed and analysis optimization remarks. Summary: This adds two new diagnostics: -pass-remarks-missed and -pass-remarks-analysis. They take the same values as -pass-remarks but are intended to be triggered in different contexts. -pass-remarks-missed is used by LLVMContext::emitOptimizationRemarkMissed, which passes call when they tried to apply a transformation but couldn't. -pass-remarks-analysis is used by LLVMContext::emitOptimizationRemarkAnalysis, which passes call when they want to inform the user about analysis results. The patch also: 1- Adds support in the inliner for the two new remarks and a test case. 2- Moves emitOptimizationRemark* functions to the llvm namespace. 3- Adds an LLVMContext argument instead of making them member functions of LLVMContext. Reviewers: qcolombet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3682 llvm-svn: 209442	2014-05-22 14:19:46 +00:00
Tim Northover	f9e798ba6a	Segmented stacks: omit __morestack call when there's no frame. Patch by Florian Zeitz llvm-svn: 209436	2014-05-22 13:03:43 +00:00
Tim Northover	2dce43c26f	ARM64: these work too llvm-svn: 209430	2014-05-22 12:14:49 +00:00
Tim Northover	5949b60550	Yes they do llvm-svn: 209429	2014-05-22 12:14:02 +00:00
Tim Northover	4a3ab28ac7	ARM64: model pre/post-indexed operations properly. We should be keeping track of the writeback on these instructions, otherwise we're relying on LLVM's stupidity for correct code. Fortunately, the MC layer can now handle all required constraints, which means we can get rid of the CodeGen only PseudoInsts too. llvm-svn: 209426	2014-05-22 11:56:20 +00:00
Tim Northover	c350acfda5	ARM64: separate load/store operands to simplify assembler This changes ARM64 to use separate operands for each component of an address, and look for separate '[', '$Rn, ..., ']' tokens when parsing. This allows us to do away with quite a bit of special C++ code to handle monolithic "addressing modes" in the MC components. The more incremental matching of the assembler operands also allows for better diagnostics when LLVM is presented with invalid input. Most of the complexity here is with the register-offset instructions, which were extremely dodgy beforehand: even when the instruction used wM, LLVM's model had xM as an operand. We papered over this discrepancy before, but that approach doesn't work now so I split them into separate X and W variants. llvm-svn: 209425	2014-05-22 11:56:09 +00:00
Bradley Smith	9288b2181f	Extend sys::getHostCPUFeatures to work on AArch64 platforms llvm-svn: 209420	2014-05-22 11:44:34 +00:00
Daniel Sanders	36ff7c2adc	[mips][mips64r6] addi is not available on MIPS32r6/MIPS64r6 Summary: Depends on D3787. Tablegen will raise an assertion without it. Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3842 llvm-svn: 209419	2014-05-22 11:42:31 +00:00
Daniel Sanders	a3566c633d	[mips][mips64r6] Test that paired single instructions are invalid Summary: These emit the 'unknown instruction' instead of the correct error because they have not been implemented in LLVM for any MIPS ISA. Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3841 llvm-svn: 209418	2014-05-22 11:37:38 +00:00
Daniel Sanders	5c582b2f6d	[mips][mips64r6] Add b[on]vc Summary: This required me to implement the disassembler for MIPS64r6 since the encodings are ambiguous with other instructions. This in turn revealed a few assembly/disassembly bugs which I have fixed. * da[ht]i only take two operands according to the spec, not three. * DecodeBranchTarget2[16] correctly handles wider immediates than simm16 * Also made non-functional change to DecodeBranchTarget and DecodeBranchTargetMM to keep implementation style consistent between them. * Difficult encodings are handled by a custom decode method on the most general encoding in the group. This method will convert the MCInst to a different opcode if necessary. DecodeBranchTarget is not currently the inverse of getBranchTargetOpValue so disassembling some branch instructions emit incorrect output. This seems to affect branches with delay slots on all MIPS ISA's. I've left this bug for now and temporarily removed the check for the immediate on bc[12]eqz/bc[12]nez in the MIPS32r6/MIPS64r6 tests. jialc and jic crash the disassembler for some reason. I've left these instructions commented out for the moment. Depends on D3760 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3761 llvm-svn: 209415	2014-05-22 11:23:21 +00:00
Tim Northover	0f6272271e	ARM64: assert if we see i64 -> i64 extend in the DAG. Should be no change in behaviour, but it makes the intended functionality a bit clearer and means we only have to reason about real extend operations. llvm-svn: 209409	2014-05-22 07:41:37 +00:00
Saleem Abdulrasool	9dd60cfb64	MC: initialise MCAsmParser variable Properly initialise HadError to false during construction. Detected as use-of-uninitialised variable by MSan! llvm-svn: 209393	2014-05-22 06:02:59 +00:00
Eric Christopher	65382d7316	Remove unused variable. llvm-svn: 209391	2014-05-22 05:33:03 +00:00
Saleem Abdulrasool	2bd1262a32	ARM: introduce llvm.arm.undefined intrinsic This intrinsic permits the emission of platform specific undefined sequences. ARM has reserved the 0xde opcode which takes a single integer parameter (ignored by the CPU). This permits the operating system to implement custom behaviour on this trap. The llvm.arm.undefined intrinsic is meant to provide a means for generating the target specific behaviour from the frontend. This is particularly useful for Windows on ARM which has made use of a series of these special opcodes. llvm-svn: 209390	2014-05-22 04:46:46 +00:00
Matt Arsenault	c3a73c3087	R600/SI: Match fp_to_uint / uint_to_fp for f64 llvm-svn: 209388	2014-05-22 03:20:30 +00:00
Saleem Abdulrasool	6663f8f2c0	MC: formalise some assertions into proper errors Now that clang can be used as an assembler via the IAS, invalid assembler inputs would cause the assertions to trigger. Although we cannot recover from the errors here, nor provide caret diagnostics, attempt to handle them slightly more gracefully by reporting a fatal error. llvm-svn: 209387	2014-05-22 02:18:10 +00:00
Eric Christopher	0e6e7cf385	Override runOnMachineFunction for ARMISelDAGToDAG so that we can reset the subtarget on each function. llvm-svn: 209386	2014-05-22 02:00:27 +00:00
Eric Christopher	4f09c59243	Override runOnMachineFunction for X86ISelDAGToDAG so that we can reset the subtarget on each function. llvm-svn: 209384	2014-05-22 01:53:26 +00:00
Eric Christopher	0d5c99eb08	Avoid using subtarget features when adding X86 specific passes to the pass pipeline. llvm-svn: 209382	2014-05-22 01:46:02 +00:00
Eric Christopher	e0bd2fa927	Remove extra local variable. llvm-svn: 209381	2014-05-22 01:45:59 +00:00
Eric Christopher	463b84b48b	Rename createGlobalBaseRegPass -> createX86GlobalBaseRegPass to make it obvious that it's a target specific pass. llvm-svn: 209380	2014-05-22 01:45:57 +00:00
Eric Christopher	89f18805f4	Fix typo. llvm-svn: 209377	2014-05-22 01:21:44 +00:00
Eric Christopher	d71e4441c9	Avoid using subtarget features when initializing the pass pipeline on PPC. llvm-svn: 209376	2014-05-22 01:21:35 +00:00
Eric Christopher	1b8e763630	Reset the subtarget for DAGToDAG on every iteration of runOnMachineFunction. This required updating the generated functions and TD file accordingly to be pointers rather than const references. llvm-svn: 209375	2014-05-22 01:07:24 +00:00
Eric Christopher	0ecfbdf4ad	Reset the subtarget for DAGToDAG on every iteration of runOnMachineFunction. llvm-svn: 209374	2014-05-22 01:07:21 +00:00
Eric Christopher	e43ecace70	Sort includes. llvm-svn: 209373	2014-05-22 01:07:18 +00:00
David Blaikie	8729bca333	DebugInfo: Simplify dead variable collection slightly. constructSubprogramDIE was already called for every subprogram in every CU when the module was started - there's no need to call it again at module finalization. llvm-svn: 209372	2014-05-22 00:48:36 +00:00
Andrew Trick	e255359b57	Fix a bug in SCEV's backedge taken count computation from my prior fix in Jan. This has to do with the trip count computation for loops with multiple exits, which is quite subtle. Most passes just ask for a single trip count number, so we must be conservative assuming any exit could be taken. Normally, we rely on the "exact" trip count, which was correctly given as "unknown". However, SCEV also gives a "max" back-edge taken count. The loops max BE taken count is conservatively a maximum over the max of each exit's non-exiting iterations count. Note that some exit tests can be skipped so the max loop back-edge taken count can actually exceed the max non-exiting iterations for some exits. However, when we know the loop latch cannot be skipped, we can directly use its max taken count disregarding other exits. I previously took the minimum here without checking whether the other exit could be skipped. The correct, and simpler thing to do here is just to directly use the loop latch's max non-exiting iterations as the loops max back-edge count. In the problematic test case, the first loop exit had a max of zero non-exiting iterations, but could be skipped. The loop latch was known not to be skipped but had max of one non-exiting iteration. We incorrectly claimed the loop back-edge could be taken zero times, when it is actually taken one time. Fixes Loop %for.body.i: <multiple exits> Unpredictable backedge-taken count. Loop %for.body.i: max backedge-taken count is 1. llvm-svn: 209358	2014-05-22 00:37:03 +00:00
Eli Bendersky	f13a05607c	Similar to bitcast, treat addrspacecast as a foldable operand. Added a test sink-addrspacecast.ll to verify this change. Patch by Jingyue Wu. llvm-svn: 209343	2014-05-22 00:02:52 +00:00
Eric Christopher	3470bbbd54	Fix compilation issues. llvm-svn: 209342	2014-05-21 23:51:57 +00:00
Eric Christopher	6b0fcfee36	Make early if conversion dependent upon the subtarget and add a subtarget hook to enable. Unconditionally add to the pass pipeline for targets that might want to use it. No functional change. llvm-svn: 209340	2014-05-21 23:40:26 +00:00
David Blaikie	2da282b860	Revert "DebugInfo: Don't put fission type units in comdat sections." This reverts commit r208930, r208933, and r208975. It seems not all fission consumers are ready to handle this behavior. Reverting until tools are brought up to spec. llvm-svn: 209338	2014-05-21 23:27:41 +00:00
Saleem Abdulrasool	0bd31835ea	MC: correct IMAGE_REL_ARM_MOV32T relocation emission This corrects the emission of IMAGE_REL_ARM_MOV32T relocations. Previously, we were avoiding the high portion of the relocation too early. If there was a section-relative relocation with an offset greater than 16-bits (65535), you would end up truncating the high order bits of the offset. Allow the current relocation representation to flow through out the MC layer to the object writer. Use the new ability to restrict recorded relocations to avoid emitting the relocation into the final object. llvm-svn: 209337	2014-05-21 23:17:56 +00:00
Saleem Abdulrasool	54bed12082	MC: introduce ability to restrict recorded relocations Add support to allow a target specific COFF object writer to restrict the recorded resolutions in the emitted object files. This is motivated by the need in Windows on ARM, where an intermediate relocation needs to be prevented from being emitted in the object file. llvm-svn: 209336	2014-05-21 23:17:50 +00:00
David Blaikie	1ea9db2dce	DebugInfo: Use the SPMap to find the parent CU of inlined functions as they may not be in the current CU Committed in r209178 then reverted in r209251 due to LTO breakage, here's a proper fix for the case of the missing subprogram DIE. The DIEs were there, just in other compile units. Using the SPMap we can find the right compile unit to search for and produce cross-unit references to describe this kind of inlining. One existing test case needed to be updated because it had a function that wasn't in the CU's subprogram list, so it didn't appear in the SPMap. llvm-svn: 209335	2014-05-21 23:14:12 +00:00
Matt Arsenault	40100887b6	R600: Add comment describing problems with LowerConstantInitializer llvm-svn: 209333	2014-05-21 22:59:17 +00:00
Matt Arsenault	6a57fd8b47	R600: Partially fix constant initializers for structs and vectors. This should extend the current workaround to work with structs that only contain legal, scalar types. llvm-svn: 209331	2014-05-21 22:42:42 +00:00
Eric Christopher	0120db5f8a	Remove getTargetLowering from TargetPassConfig as the target lowering can change depending upon subtarget/subtarget features for a function. llvm-svn: 209329	2014-05-21 22:42:07 +00:00
Eric Christopher	1e65e7cab5	Remove unused member variable from hexagon pass. llvm-svn: 209328	2014-05-21 22:42:02 +00:00
David Blaikie	825bdd2fc6	DebugInfo: Ensure concrete out of line variables from inlined functions reference their abstract origins. llvm-svn: 209327	2014-05-21 22:41:17 +00:00
Quentin Colombet	b4d53f1afa	[X86] Fix a bug in the lowering of BLENDI introduced in r209043. ISD::VSELECT mask uses 1 to identify the first argument and 0 to identify the second argument. On the other hand, BLENDI uses 0 to identify the first argument and 1 to identify the second argument. Fix the generation of the blend mask to account for this difference. The bug did not show up with r209043, because we were not checking for the actual arguments of the blend instruction! This commit also fixes the test cases. Note: The same mask works for the BLENDr variant because the arguments are swapped during instruction selection (see the BLENDXXrr patterns). <rdar://problem/16975435> llvm-svn: 209324	2014-05-21 22:00:39 +00:00
David Blaikie	ce7a1bd038	DebugInfo: Simplify subprogram declaration creation/references and accidentally refix PR11300. Also simplifies the linkage name handling a little too. llvm-svn: 209311	2014-05-21 18:04:33 +00:00
Matt Arsenault	03df7eeda1	Use cast<> instead of unchecked dyn_cast llvm-svn: 209310	2014-05-21 18:03:59 +00:00
Saleem Abdulrasool	6eae1e6bbf	MC: loosen an overzealous assertion Permit active macro expansions when terminating the assembler if there were errors during the expansion. This would only trigger on invalid input when built with assertions. llvm-svn: 209309	2014-05-21 17:53:18 +00:00
Daniel Sanders	2a83d68081	[mips][mips64r6] Add bc[12](eq\|ne)z Summary: Depends on D3691 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3760 llvm-svn: 209292	2014-05-21 12:56:39 +00:00
Evgeniy Stepanov	fc9c78a6b6	[asan] Fix x86-32 asm instrumentation to preserve flags. Patch by Yuri Gorshenin. llvm-svn: 209280	2014-05-21 08:14:24 +00:00
Saleem Abdulrasool	ec8c2db283	MC: mark COFF .drectve section as REMOVE The .drectve section should be marked as IMAGE_SCN_LNK_REMOVE. This matches what the MSVC toolchain does and accurately reflects that this section should not be emitted into the final binary. This section is merely information for the linker, comprising of additional linker directives. llvm-svn: 209273	2014-05-21 05:15:01 +00:00
Richard Smith	56f9c191e1	[modules] Add module maps for LLVM. These are not quite ready for prime-time yet, but only a few more Clang patches need to land. (I have 'ninja check' passing locally.) llvm-svn: 209269	2014-05-21 02:46:14 +00:00
Saleem Abdulrasool	8d60fdc50d	ARM: correct bundle generation for MOV32T relocations Although the previous code would construct a bundle and add the correct elements to it, it would not finalise the bundle. This resulted in the InternalRead markers not being added to the MachineOperands nor, more importantly, the externally visible defs to the bundle itself. So, although the bundle was not exposing the def, the generated code would be correct because there was no optimisations being performed. When optimisations were enabled, the post register allocator would kick in, and the hazard recognizer would reorder operations around the load which would define the value being operated upon. Rather than manually constructing the bundle, simply construct and finalise the bundle via the finaliseBundle call after both MIs have been emitted. This improves the code generation with optimisations where IMAGE_REL_ARM_MOV32T relocations are emitted. The changes to the other tests are the result of the bundle generation preventing the scheduler from hoisting the moves across the loads. The net effect of the generated code is equivalent, but, is much more identical to what is actually being lowered. llvm-svn: 209267	2014-05-21 01:25:24 +00:00
Eric Christopher	eb71972887	Move the verbose asm option to be part of the options struct and set appropriately. llvm-svn: 209258	2014-05-20 23:59:50 +00:00
Kevin Enderby	1b985af0ba	Update MachOObjectFile::getSymbolAddress so it returns UnknownAddressOrSize for undefined symbols, so it matches what COFFObjectFile::getSymbolAddress does. This allows llvm-nm to print spaces instead of 0’s for the value of undefined symbols in Mach-O files. To make this change other uses of MachOObjectFile::getSymbolAddress are updated to handle when the Value is returned as UnknownAddressOrSize. Which is needed to keep two of the ExecutionEngine tests working for example. llvm-svn: 209253	2014-05-20 23:04:47 +00:00
David Blaikie	374af662e9	Revert "DebugInfo: Assume all subprogram DIEs have been created before any abstract subprograms are constructed." This reverts commit r209178. This seems to be asserting in an LTO build on some internal Apple buildbots. No upstream reproduction (and I don't have an LLVM-aware gold built right now to reproduce it personally) but it's a small patch & the failure's semi-plausible so I'm going to revert first while I try to reproduce this. llvm-svn: 209251	2014-05-20 22:33:09 +00:00
Adam Nemet	2ba6492b7b	[ARM64] PR19792: Fix cycle in DAG after performPostLD1Combine Povray and dealII currently assert with "Overran sorted position" in AssignTopologicalOrder. The problem is that performPostLD1Combine can introduce cycles. Consider: (insert_vector_elt (INSERT_SUBREG undef, (load (add %vreg0, Constant<8>), undef), <= A TargetConstant<2>), (load %vreg0, undef), <= B Constant<1>) This is turned into a LD1LANEpost node. However the address in A is not a valid user of the post-incremented address of B in LD1LANEpost. llvm-svn: 209242	2014-05-20 21:47:07 +00:00
David Blaikie	93ef46b02a	Unbreak the sanitizer buildbots after r209226 due to SROA issue described in http://reviews.llvm.org/D3714 Undecided whether this should include a test case - SROA produces bad dbg.value metadata describing a value for a reference that is actually the value of the thing the reference refers to. For now, loosening the assert lets this not assert, but it's still bogus/wrong output... If someone wants to tell me to add a test, I'm willing/able, just undecided. Hopefully we'll get SROA fixed soon & we can tighten up this assertion again. llvm-svn: 209240	2014-05-20 21:40:13 +00:00
Eric Christopher	2feed5fd68	Move the function and data section flags into the options struct and make the functions to set them non-static. Move and rename the llvm specific backend options to avoid conflicting with the clang option. Paired with a backend commit to update. llvm-svn: 209238	2014-05-20 21:25:34 +00:00
Kevin Enderby	fcbed5af67	Revert r209235 as it broke two tests: Failing Tests (2): LLVM :: ExecutionEngine/MCJIT/stubs-sm-pic.ll LLVM :: ExecutionEngine/MCJIT/stubs.ll llvm-svn: 209236	2014-05-20 21:10:15 +00:00
Kevin Enderby	1126d02c0c	Update MachOObjectFile::getSymbolAddress so it returns UnknownAddressOrSize for undefined symbols. Allowing llvm-nm to print spaces instead of 0’s for the value of undefined symbols in Mach-O files. llvm-svn: 209235	2014-05-20 20:32:18 +00:00
Quentin Colombet	c88baa5c10	[LSR] Canonicalize reg1 + ... + regN into reg1 + ... + 1*regN. This commit introduces a canonical representation for the formulae. Basically, as soon as a formula has more that one base register, the scaled register field is used for one of them. The register put into the scaled register is preferably a loop variant. The commit refactors how the formulae are built in order to produce such representation. This yields a more accurate, but still perfectible, cost model. <rdar://problem/16731508> llvm-svn: 209230	2014-05-20 19:25:04 +00:00
David Blaikie	1d9aec67b0	Fix test breakage introduced in r209223. Oops, broke the broken enum constants again. llvm-svn: 209226	2014-05-20 18:36:35 +00:00
Alexey Samsonov	dfcaf9c8d8	Rewrite calculateDbgValueHistory to make it (hopefully) more transparent. This change preserves the original algorithm of generating history for user variables, but makes it more clear. High-level description of algorithm: Scan all the machine basic blocks and machine instructions in the order they are emitted to the object file. Do the following: 1) If we see a DBG_VALUE instruction, add it to the history of the corresponding user variable. Keep track of all user variables, whose locations are described by a register. 2) If we see a regular instruction, look at all the registers it clobbers, and terminate the location range for all variables described by these registers. 3) At the end of the basic block, terminate location ranges for all user variables described by some register. Although this change shouldn't be user-visible (the contents of .debug_loc section should be the same), it changes some internal assumptions about the set of instructions used to track the variable locations. Watching the bots. llvm-svn: 209225	2014-05-20 18:34:54 +00:00
David Blaikie	2af1c805b4	PR19767: DebugInfo emission of pointer constants. In refactoring DwarfUnit::isUnsignedDIType I restricted it to only work on values with signedness (unsigned or signed), asserting on anything else (which did uncover some bugs). But it turns out that we do need to emit constants of signless data, such as pointer constants - only null pointer constants are known to need this so far, but it's conceivable that there might be non-null pointer constants at some point (hardcoded address offsets for device drivers?). This patch just uses 'unsigned' for signless data such as pointer constants. Arguably we could use signless representations (DW_FORM_dataN) instead, allowing a trinary result from isUnsignedDIType (signed, unsigned, signless), but this seems reasonable for now. llvm-svn: 209223	2014-05-20 18:21:51 +00:00
Adam Nemet	571eb5fc91	[PowerPC] PR19796: Also match ISD::TargetConstant in isIntS16Immediate The SplitIndexingFromLoad changes exposed a latent isel bug in the PowerPC64 backend. We matched an immediate offset with STWX8 even though it only supports register offset. The culprit is the complex-pattern predicate, SelectAddrIdx, which decides that if the offset is not ISD::Constant it must be a register. Many thanks to Bill Schmidt for testing this. llvm-svn: 209219	2014-05-20 17:20:34 +00:00
Eric Christopher	650c8f2a06	Clean up language and grammar. Based on a patch by jfcaron3@gmail.com! PR19806 llvm-svn: 209216	2014-05-20 17:11:11 +00:00
Daniel Sanders	a714fcb02a	Temporarily revert: r209129 - [mips][mips64r6] Sorted _ENC, _DESC classes and tests After discussion with Zoran, we have decided to temporarily revert this commit. It's causing some difficult to resolve conflicts and we are under time pressure to deliver an initial MIPS64r6 compiler. We will re-apply an equivalent patch once the time pressure has passed. llvm-svn: 209211	2014-05-20 14:46:24 +00:00
Tim Northover	c807a17a9b	TableGen: permit non-leaf ComplexPattern uses This allows the results of a ComplexPattern check to be distributed to separate named Operands, instead of the current system where all results must apply (and match perfectly) with a single Operand. For example, if "some_addrmode" is a ComplexPattern producing two results, you can write: def : Pat<(load (some_addrmode GPR64:$base, imm:$offset)), (INST GPR64:$base, imm:$offset)>; This should allow neater instruction definitions in TableGen that don't put all possible aspects of addressing into a single operand, but are still usable with relatively simple C++ CodeGen idioms. llvm-svn: 209206	2014-05-20 11:52:46 +00:00
Simon Atanasyan	e7fa2314af	Add parentheses to suppress the gcc warning '-Wparentheses'. No functional changes. llvm-svn: 209203	2014-05-20 10:23:04 +00:00
Benjamin Kramer	7bd6bee385	Legalizer: Make bswap promotion safe for vectors. llvm-svn: 209202	2014-05-20 09:42:31 +00:00
Simon Atanasyan	9cb4090867	[Mips] Add more relocation types and MIPS specific e_flags constants. llvm-svn: 209201	2014-05-20 09:27:49 +00:00
Christian Pirker	875629f713	ARMEB: Additional test files for ARM fixups llvm-svn: 209200	2014-05-20 09:24:37 +00:00
Tim Northover	9a24f88a37	TableGen: convert InstAlias's Emit bit to an int. When multiple aliases overlap, the correct string to print can often be determined purely by considering the InstAlias declarations in some particular order. This allows the user to specify that order manually when desired, without resorting to hacking around with the default lexicographical order on Record instantiation, which is error-prone and ugly. I was also mistaken about "add w2, w3, w4" being the same as "add w2, w3, w4, uxtw". That's only true if Rn is the stack pointer. llvm-svn: 209199	2014-05-20 09:17:16 +00:00
Alexey Volkov	6226de6721	[X86] Tune LEA usage for Silvermont According to Intel Software Optimization Manual on Silvermont in some cases LEA is better to be replaced with ADD instructions: "The rule of thumb for ADDs and LEAs is that it is justified to use LEA with a valid index and/or displacement for non-destructive destination purposes (especially useful for stack offset cases), or to use a SCALE. Otherwise, ADD(s) are preferable." Differential Revision: http://reviews.llvm.org/D3826 llvm-svn: 209198	2014-05-20 08:55:50 +00:00
Zinovy Nis	abdf44e7f3	[LV][REFACTOR] One more tiny fix for printing debug locations in loop vectorizer. Now consistent with the remarks emitter. Differential Revision: http://reviews.llvm.org/D3821 llvm-svn: 209197	2014-05-20 08:26:20 +00:00
Nick Lewycky	ec373545b8	Teach isKnownNonNull that a nonnull return is not null. Add a test for this case as well as the case of a nonnull attribute (already handled but not tested). llvm-svn: 209193	2014-05-20 05:13:21 +00:00
David Blaikie	8e1d489351	DebugInfo: Emit function definitions within their namespace scope. This workaround (presumably for ancient GDB) doesn't appear to be required (GDB 7.5 seems to tolerate function definition DIEs in namespace scope just fine). llvm-svn: 209189	2014-05-20 03:23:24 +00:00
Nick Lewycky	d52b1528c0	Add 'nonnull', a new parameter and return attribute which indicates that the pointer is not null. Instcombine will elide comparisons between these and null. Patch by Luqman Aden! llvm-svn: 209185	2014-05-20 01:23:40 +00:00
David Blaikie	424b59b1ce	DebugInfo: Assume all subprogram DIEs have been created before any abstract subprograms are constructed. Since we visit the whole list of subprograms for each CU at module start, this is clearly true - don't test for the case, just assert it. A few old test cases seemed to have incomplete subprogram lists, but any attempt to reproduce them shows full subprogram lists that even include entities that have been completely inlined and the out of line definition removed. llvm-svn: 209178	2014-05-19 23:16:19 +00:00
Chad Rosier	b06ed63ebf	[ARM64] Adds Cortex-A53 scheduling support for vector load/store post. Patch by Dave Estes<cestes@codeaurora.org>! PR19761 http://reviews.llvm.org/D3829 llvm-svn: 209176	2014-05-19 22:59:51 +00:00
Matt Arsenault	0a3b8f5507	Remove unused method declaration llvm-svn: 209174	2014-05-19 22:55:35 +00:00
David Blaikie	973141a035	DebugInfo: Don't include DW_AT_inline on each abstract definition multiple times. When I refactored this in r208636 I accidentally caused this to be added multiple times to each abstract subprogram (not accounting for the deduplicating effect of the InlinedSubprogramDIEs set). This got better in r208798 when the abstract definitions got the attribute added to them at construction time, but still had the redundant copies introduced in r208636. This commit removes those excess DW_AT_inlines and relies solely on the insertion in r208798. llvm-svn: 209166	2014-05-19 22:07:16 +00:00
David Blaikie	48b056bab0	DebugInfo: Fix missing inlined_subroutines caused by r208748. The check in DwarfDebug::constructScopeDIE was meant to consider inlined subroutines as any non-top-level scope that was a subprogram. Instead of checking "not top level scope" it was checking if the /subprogram's/ scope was non-top-level. Fix this and beef up a test case to demonstrate some of the missing inlined_subroutines are no longer missing. In the course of fixing this I also found that r208748 (with this fix) found one /extra/ inlined_subroutine in concrete_out_of_line.ll due to two inlined_subroutines having the same inlinedAt location. The previous implementation was collapsing these into a single inlined subroutine. I'm not sure what the original code was that created this .ll file so I'm not sure if this actually happens in practice today. Since we deliberately include column information to disambiguate two calls on the same line, that may've addressed this bug in the frontend, but it's good to know that workaround isn't necessary for this particular case anymore. llvm-svn: 209165	2014-05-19 21:54:31 +00:00
Eric Christopher	710c0ae7de	Fix typos. llvm-svn: 209164	2014-05-19 21:18:47 +00:00
Juergen Ributzka	431761771c	[ConstantHoisting][X86] Change the cost model to never hoist constants for types larger than i128. Currently the X86 backend doesn't support types larger than i128 very well. For example an i192 multiply will assert in codegen when the 2nd argument is a constant and the constant got hoisted. This fix changes the cost model to never hoist constants for types larger than i128. Once the codegen issues have been resolved, the cost model can be updated to allow also larger types. This is related to <rdar://problem/16954938> llvm-svn: 209162	2014-05-19 21:00:53 +00:00
Andrea Di Biagio	7a85cadfd6	[X86] Add ISel patterns to improve the selection of TZCNT and LZCNT. Instructions TZCNT (requires BMI1) and LZCNT (requires LZCNT), always provide the operand size as output if the input operand is zero. We can take advantage of this knowledge during instruction selection stage in order to simplify a few corner case. llvm-svn: 209159	2014-05-19 20:38:59 +00:00
Kevin Enderby	403258f5a3	Implement MachOObjectFile::isSectionData() and MachOObjectFile::isSectionBSS so that llvm-size will total up all the sections in the Berkeley format. This allows for rough categorizations for Mach-O sections. And allows the total of llvm-size’s Berkeley and System V formats to be the same. llvm-svn: 209158	2014-05-19 20:36:02 +00:00
Filipe Cabecinhas	dc92102766	Added more insertps optimizations Summary: When inserting an element that's coming from a vector load or a broadcast of a vector (or scalar) load, combine the load into the insertps instruction. Added PerformINSERTPSCombine for the case where we need to fix the load (load of a vector + insertps with a non-zero CountS). Added patterns for the broadcasts. Also added tests for SSE4.1, AVX, and AVX2. Reviewers: delena, nadav, craig.topper Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3581 llvm-svn: 209156	2014-05-19 19:45:57 +00:00
Lang Hames	1fcbc08500	[RuntimeDyld] Fix x86-64 MachO GOT relocation handling. For GOT relocations the addend should modify the offset to the GOT entry, not the value of the entry itself. Teach RuntimeDyldMachO to do The Right Thing here. Fixes <rdar://problem/16961886>. llvm-svn: 209154	2014-05-19 19:21:25 +00:00
Peter Collingbourne	68a889757d	Check the alwaysinline attribute on the call as well as on the caller. Differential Revision: http://reviews.llvm.org/D3815 llvm-svn: 209150	2014-05-19 18:25:54 +00:00
Matt Arsenault	04b67ceeeb	Use range for llvm-svn: 209147	2014-05-19 17:52:48 +00:00
Jyotsna Verma	9a103563f4	reverting r209132 llvm-svn: 209139	2014-05-19 16:22:11 +00:00
Alp Toker	d71b6dfd85	MemoryBuffer: Use GetNativeSystemInfo() Removes old 4096 byte workaround. This functionality has been available since Windows XP. llvm-svn: 209137	2014-05-19 16:13:28 +00:00
Eric Christopher	a5ec92556f	Revert "Patch for function cloning to inline all blocks whose address is taken" as it was causing build failures in ruby. This reverts commit r207713. llvm-svn: 209135	2014-05-19 16:04:10 +00:00
Bradley Smith	c3b931d005	[ARM64] Split tbz/tbnz into W/X register variant llvm-svn: 209134	2014-05-19 15:58:15 +00:00
Jyotsna Verma	daeb25d4e0	Hexagon: Add encoding bits to the mpy instructions. llvm-svn: 209132	2014-05-19 15:32:07 +00:00
Zoran Jovanovic	b2b1f98da4	[mips][mips64r6] Sorted _ENC, _DESC classes and tests Differential Revision: http://reviews.llvm.org/D3808 llvm-svn: 209129	2014-05-19 14:57:46 +00:00
Aaron Ballman	0dfed533ec	Resolving MSVC warnings about switch statements with a default label, but no case labels. No functional changes intended. llvm-svn: 209126	2014-05-19 14:29:04 +00:00
Benjamin Kramer	f3ad23551d	SDAG: Legalize vector BSWAP into a shuffle if the shuffle is legal but the bswap not. - On ARM/ARM64 we get a vrev because the shuffle matching code is really smart. We still unroll anything that's not v4i32 though. - On X86 we get a pshufb with SSSE3. Required more cleverness in isShuffleMaskLegal. - On PPC we get a vperm for v8i16 and v4i32. v2i64 is unrolled. llvm-svn: 209123	2014-05-19 13:12:38 +00:00
Dinesh Dwivedi	f82f16e3e6	Added inst-combine for 'MIN(MIN(A, 97), 23)' and 'MAX(MAX(A, 23), 97)' This removes TODO added in r208849 [http://reviews.llvm.org/D3629] MIN(MIN(A, 97), 23) -> MIN(A, 23) MAX(MAX(A, 23), 97) -> MAX(A, 97) Differential Revision: http://reviews.llvm.org/D3785 llvm-svn: 209110	2014-05-19 07:08:32 +00:00
Craig Topper	b816593cf0	Remove last uses of OwningPtr from llvm. As far as I can tell these method versions are not used by lldb, lld, or clang. llvm-svn: 209103	2014-05-18 21:55:38 +00:00
Saleem Abdulrasool	8bfb192ecd	ARM: make libcall setup more table driven Rather than create a series of function calls to setup the library calls, create a table with the information and just use the table to drive the configuration of the library calls. This makes it easier to both inspect the list as well as to modify it. NFC. llvm-svn: 209089	2014-05-18 16:39:11 +00:00
Benjamin Kramer	f9d2d512c7	Options: Use erase_if to remove Args from the list. While there make getOption return a const reference so we don't have to put it on the stack when calling methods on it. No functionality change. llvm-svn: 209088	2014-05-18 15:14:13 +00:00
Saleem Abdulrasool	a521845381	ARM: improve WoA ABI conformance for frame register Windows on ARM uses R11 for the frame pointer even though the environment is a pure Thumb-2, thumb-only environment. Replicate this behaviour to improve Windows ABI compatibility. This register is used for fast stack walking, and thus is part of the Windows ABI. llvm-svn: 209085	2014-05-18 04:12:52 +00:00
Saleem Abdulrasool	f11f4b4e20	ARM: consolidate frame pointer register knowledge Use the ARMBaseRegisterInfo to query the frame register. The base register info is aware of the frame register that is used for the frame pointer. Use that to determine the frame register rather than duplicating the knowledge. Although, the code path is slightly different in that it may return SP, that can only occur if the frame pointer has been omitted in the machine function, which is supposed to contain the desired value in that case. llvm-svn: 209084	2014-05-18 03:18:09 +00:00
Saleem Abdulrasool	f3a5a5c546	Target: remove old constructors for CallLoweringInfo This is mostly a mechanical change changing all the call sites to the newer chained-function construction pattern. This removes the horrible 15-parameter constructor for the CallLoweringInfo in favour of setting properties of the call via chained functions. No functional change beyond the removal of the old constructors are intended. llvm-svn: 209082	2014-05-17 21:50:17 +00:00
Saleem Abdulrasool	9f664c1083	Target: change member from reference to pointer This is a preliminary step to help ease the construction of CallLoweringInfo. Changing the construction to a chained function pattern requires that the parameter be nullable. However, rather than copying the vector, save a pointer rather than the reference to permit a late binding of the arguments. llvm-svn: 209080	2014-05-17 21:50:01 +00:00
Saleem Abdulrasool	6d11b7cd7a	ARM: whitespace Remove some whitespace. NFC. llvm-svn: 209079	2014-05-17 21:49:54 +00:00
Rafael Espindola	f1bedd3747	Use create methods since msvc doesn't handle delegating constructors. llvm-svn: 209076	2014-05-17 21:29:57 +00:00
Rafael Espindola	77bbb54fbf	Handle ConstantAggregateZero when upgrading global_ctors. llvm-svn: 209075	2014-05-17 21:00:22 +00:00
Rafael Espindola	8370565820	Reduce abuse of default values in the GlobalAlias constructor. This is in preparation for adding an optional offset. llvm-svn: 209073	2014-05-17 19:57:46 +00:00
NAKAMURA Takumi	7ef81a4f98	Revert r209049 and r209065, "Add support for combining GEPs across PHI nodes" It broke clang selfhosting even after r209065. llvm-svn: 209067	2014-05-17 14:39:21 +00:00
Louis Gerbarg	455805694e	Fix for sanitizer crash introduced in r209049 This patch fixes 3 issues introduced by r209049 that only showed up in on the sanitizer buildbots. One was a typo in a compare. The other is a check to confirm that the single differing value in the two incoming GEPs is the same type. The final issue was the the IRBuilder under some circumstances would build PHIs in the middle of the block. llvm-svn: 209065	2014-05-17 06:51:36 +00:00
David Majnemer	483e4e08ac	Target: Replace getSection().empty() with hasSection() No functional change, just a small cleanup. llvm-svn: 209064	2014-05-17 05:18:40 +00:00
Saleem Abdulrasool	46fed305db	ARM: use the proper target object format for WoA WoA uses COFF, not ELF. ARMISelLowering::createTLOF would previously return ELF for any non-MachO platform. This was a missed site when the original change for target format support for Windows on ARM was done. llvm-svn: 209057	2014-05-17 04:28:08 +00:00
Chandler Carruth	c85473143c	[x86] Fix a bad predicate I spotted by inspection -- pshufhw and pshuflw were added in SSE2, no SSSE3. Found this while auditing all uses of SSSE3 in the X86 target. I don't actually expect this to make a significant difference on anything and I don't have any detailed test cases but I updated the existing test cases that already covered some of this code path. llvm-svn: 209056	2014-05-17 03:29:20 +00:00
Alexey Samsonov	cd01472a9b	[DWARF parser] Teach DIContext to fetch short (non-linkage) function names for a given address. Change --functions option in llvm-symbolizer tool to accept values "none", "short" or "linkage". Update the tests and docs accordingly. llvm-svn: 209050	2014-05-17 00:07:48 +00:00
Louis Gerbarg	8d2a43e9be	Add support for combining GEPs across PHI nodes Currently LLVM will generally merge GEPs. This allows backends to use more complex addressing modes. In some cases this is not happening because there is PHI inbetween the two GEPs: GEP1--\ \|-->PHI1-->GEP3 GEP2--/ This patch checks to see if GEP1 and GEP2 are similiar enough that they can be cloned (GEP12) in GEP3's BB, allowing GEP->GEP merging (GEP123): GEP1--\ --\ --\ \|-->PHI1-->GEP3 ==> \|-->PHI2->GEP12->GEP3 == > \|-->PHI2->GEP123 GEP2--/ --/ --/ This also breaks certain use chains that are preventing GEP->GEP merges that the the existing instcombine would merge otherwise. Tests included. rdar://15547484 llvm-svn: 209049	2014-05-16 23:47:24 +00:00
Pete Cooper	0d4ea975ef	Use a sized enum for MachineOperandType. No functionality change llvm-svn: 209048	2014-05-16 23:28:17 +00:00
Filipe Cabecinhas	89654da069	Implemented special cases for PerformVSELECTCombine. vselects with constant masks, after legalization, will get turned into specialized shuffle_vectors so they can be matched to blend+imm instructions. Fixed some tests. llvm-svn: 209044	2014-05-16 22:47:54 +00:00
Filipe Cabecinhas	e15551832c	Lower vselects into X86ISD::BLENDI when appropriate. LowerVSELECT will, if possible, generate a X86ISD::BLENDI DAG node if the condition is constant and we can emit that instruction, given the subtarget. This is not enough for all cases. An additional SELECTCombine optimization will be committed. Fixed tests that were expecting variable blends but where a blend+imm can be generated. Added test where we can't emit blend+immediate. Added avx2 blend+imm tests. llvm-svn: 209043	2014-05-16 22:47:49 +00:00
Filipe Cabecinhas	17254aaead	Implemented LowerVSELECT to custom lower some instructions. No functionality change intended. The types that previously were set to lower as Expand or Legal are doing the same thing with this lowering function. llvm-svn: 209042	2014-05-16 22:47:43 +00:00
Rafael Espindola	e0098928c9	Delete getAliasedGlobal. llvm-svn: 209040	2014-05-16 22:37:03 +00:00
David Blaikie	48369d1b8e	DebugInfo: Assert rather than conditionalizing when a CU's subprogram list contains declarations. llvm-svn: 209039	2014-05-16 22:21:45 +00:00
David Blaikie	c405c9cb0b	DebugInfo: Handle emitting constants of C++ unicode character type. Patch by Stephan Tolksdorf! (with some test case stuff by me) Differential Revision: http://reviews.llvm.org/D3810 llvm-svn: 209037	2014-05-16 21:53:09 +00:00
Tom Stellard	c721a23882	R600/SI: Refactor the VOP3_32 tablegen class This will allow us to use a single MachineInstr to represent instructions which behave the same but have different encodings on some subtargets. llvm-svn: 209028	2014-05-16 20:56:47 +00:00
Tom Stellard	0e70de57a3	R600/SI: Add a PredicateControl class for managing TableGen predicates This was inspired by the PredicateControl class in the MIPS backend. llvm-svn: 209027	2014-05-16 20:56:45 +00:00
Tom Stellard	0289ff4a4f	R600/SI: Move tablegen patterns away from instruction defs llvm-svn: 209026	2014-05-16 20:56:44 +00:00
Tom Stellard	2671338497	R600/SI: Remove unused instruction llvm-svn: 209025	2014-05-16 20:56:43 +00:00
Tom Stellard	f719ee9e76	R600/SI: Promote f32 SELECT to i32 llvm-svn: 209024	2014-05-16 20:56:41 +00:00
Tom Stellard	725db5d2c8	R600/SI: Remove duplicate pattern llvm-svn: 209023	2014-05-16 20:56:37 +00:00
Reid Kleckner	fceb76f5f9	Add comdat key field to llvm.global_ctors and llvm.global_dtors This allows us to put dynamic initializers for weak data into the same comdat group as the data being initialized. This is necessary for MSVC ABI compatibility. Once we have comdats for guard variables, we can use the combination to help GlobalOpt fire more often for weak data with guarded initialization on other platforms. Reviewers: nlewycky Differential Revision: http://reviews.llvm.org/D3499 llvm-svn: 209015	2014-05-16 20:39:27 +00:00
Rafael Espindola	0bf5b143f5	Fix a warning in builds without asserts. llvm-svn: 209012	2014-05-16 20:05:08 +00:00
David Blaikie	46d0ca5b40	DebugInfo: Add an assert regarding the subprogram in the subprogram map matching the abstract subprogram. I'm not sure this is how it'll be going forward (I'd rather prefer the definition to be in the main SP mapping, for various reasons) but this helps me understand how it is today. llvm-svn: 209009	2014-05-16 19:42:10 +00:00
Rafael Espindola	6b238633b7	Fix most of PR10367. This patch changes the design of GlobalAlias so that it doesn't take a ConstantExpr anymore. It now points directly to a GlobalObject, but its type is independent of the aliasee type. To avoid changing all alias related tests in this patches, I kept the common syntax @foo = alias i32* @bar to mean the same as now. The cases that used to use cast now use the more general syntax @foo = alias i16, i32* @bar. Note that GlobalAlias now behaves a bit more like GlobalVariable. We know that its type is always a pointer, so we omit the '*'. For the bitcode, a nice surprise is that we were writing both identical types already, so the format change is minimal. Auto upgrade is handled by looking through the casts and no new fields are needed for now. New bitcode will simply have different types for Alias and Aliasee. One last interesting point in the patch is that replaceAllUsesWith becomes smart enough to avoid putting a ConstantExpr in the aliasee. This seems better than checking and updating every caller. A followup patch will delete getAliasedGlobal now that it is redundant. Another patch will add support for an explicit offset. llvm-svn: 209007	2014-05-16 19:35:39 +00:00
David Blaikie	825f487b68	DebugInfo: Assume the CU's Subprogram list only contains definitions. DIBuilder maintains this invariant and the current DwarfDebug code could end up doing weird things if it contained declarations (such as putting the definition DIE inside a CU that contained the declaration - this doesn't seem like a good idea, so rather than adding logic to handle this case we'll just ban in for now & cross that bridge if we come to it later). llvm-svn: 209004	2014-05-16 18:26:53 +00:00
Chad Rosier	d978ca0308	[ARM64] Increases the Sched Model accuracy for Cortex-A53. Patch by Dave Estes <cestes@codeaurora.org> http://reviews.llvm.org/D3769 llvm-svn: 209001	2014-05-16 17:15:33 +00:00
David Majnemer	78910fc4da	InstSimplify: Improve handling of ashr/lshr Summary: Analyze the range of values produced by ashr/lshr cst, %V when it is being used in an icmp. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3774 llvm-svn: 209000	2014-05-16 17:14:03 +00:00
David Majnemer	ea8d5dbf24	InstSimplify: Optimize using dividend in sdiv Summary: The dividend in an sdiv tells us the largest and smallest possible results. Use this fact to optimize comparisons against an sdiv with a constant dividend. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3795 llvm-svn: 208999	2014-05-16 16:57:04 +00:00
Tilmann Scheller	83c5743650	[ARM64] Fix wrong comment in load/store optimization pass. ldr x1, [x0, #64] add x0, x0, #64 -> ldr x1, [x0], #64 is not a valid transformation, the correct transformation (and what the code actually does) is: ldr x1, [x0, #64] add x0, x0, #64 -> ldr x1, [x0, #64]! llvm-svn: 208998	2014-05-16 16:50:13 +00:00
David Blaikie	4a3b84d2f5	DwarfDebug: Refactor AT_ranges/AT_high_pc+AT_low_pc emission into helper function. llvm-svn: 208997	2014-05-16 16:42:40 +00:00
Simon Atanasyan	b83f380ae4	[yaml2obj][ELF] Add an optional `Size` field to the YAML section declaration. Now the only method to configure ELF section's content and size is to assign a hexadecimal string to the `Content` field. Unfortunately this way is completely useless when you need to declare a really large section. To solve this problem this patch adds one more optional field `Size` to the `RawContentSection` structure. When yaml2obj generates an ELF file it uses the following algorithm: 1. If both `Content` and `Size` fields are missed create an empty section. 2. If only `Content` field is missed take section length from the `Size` field and fill the section by zero. 3. If only `Size` field is missed create a section using data from the `Content` field. 4. If both `Content` and `Size` fields are provided validate that the `Size` value is not less than size of `Content` data. Than take section length from the `Size`, fill beginning of the section by `Content` and the rest by zero. Examples -------- * Create a section 0x10000 bytes long filled by zero Name: .data Type: SHT_PROGBITS Flags: [ SHF_ALLOC ] Size: 0x10000 * Create a section 0x10000 bytes long starting from 'CA' 'FE' 'BA' 'BE' Name: .data Type: SHT_PROGBITS Flags: [ SHF_ALLOC ] Content: CAFEBABE Size: 0x10000 The patch reviewed by Michael Spencer. llvm-svn: 208995	2014-05-16 16:01:00 +00:00
James Molloy	a70697e10e	Re-enable inline memcpy expansion for Thumb1. Patch by Moritz Roth! llvm-svn: 208994	2014-05-16 14:24:22 +00:00
Rafael Espindola	a800445710	Small dyn_cast and auto cleanup. llvm-svn: 208993	2014-05-16 14:22:33 +00:00
James Molloy	556763d2ef	Fix the Load/Store optimization pass to work with Thumb1. Patch by Moritz Roth! llvm-svn: 208992	2014-05-16 14:14:30 +00:00
James Molloy	92a15078f1	Enable the Load/Store optimization pass for Thumb1 but make it return immediately for now. Patch by Moritz Roth! llvm-svn: 208991	2014-05-16 14:11:38 +00:00
James Molloy	bb73c23ffa	Fix a few comment typos and style issues. Patch by Moritz Roth! llvm-svn: 208990	2014-05-16 14:08:46 +00:00
Zoran Jovanovic	6110e3bce6	[mips][mips64r6] Add SELEQZ and SELNEZ instructions Differential Revision: http://reviews.llvm.org/D3743 llvm-svn: 208987	2014-05-16 13:40:57 +00:00
Rafael Espindola	4fe0094fd1	Change the GlobalAlias constructor to look a bit more like GlobalVariable. This is part of the fix for pr10367. A GlobalAlias always has a pointer type, so just have the constructor build the type. llvm-svn: 208983	2014-05-16 13:34:04 +00:00
Zoran Jovanovic	52c56b93e5	[mips][mips64r6] Add Compact indexed jumps. Differential Revision: http://reviews.llvm.org/D3707 llvm-svn: 208981	2014-05-16 13:19:46 +00:00
Yaron Keren	152172009a	Fix hardcoded slash to native path seperator which was exposed from llvm::sys::path. http://reviews.llvm.org/D3687 llvm-svn: 208980	2014-05-16 13:16:30 +00:00
Rafael Espindola	5a52b9f139	Revert "Implement global merge optimization for global variables." This reverts commit r208934. The patch depends on aliases to GEPs with non zero offsets. That is not supported and fairly broken. The good news is that GlobalAlias is being redesigned and will have support for offsets, so this patch should be a nice match for it. llvm-svn: 208978	2014-05-16 13:02:18 +00:00
Zoran Jovanovic	5a8c1e2900	[mips][mips64r6] Add Compact zero-compare branch-and-link instructions Differential Revision: http://reviews.llvm.org/D3718 llvm-svn: 208977	2014-05-16 12:27:19 +00:00
Stepan Dyatkovskiy	948366ac0b	MergeFunctions Pass, introduced total ordering among GEP operations. Patch replaces old isEquivalentGEP implementation, and changes type of comparison result from bool (equal or not) to {-1, 0, 1} (less, equal, greater). This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). llvm-svn: 208976	2014-05-16 11:55:02 +00:00

... 2 3 4 5 6 ...

69962 Commits