llvm-project

Commit Graph

Author	SHA1	Message	Date
David Blaikie	374af662e9	Revert "DebugInfo: Assume all subprogram DIEs have been created before any abstract subprograms are constructed." This reverts commit r209178. This seems to be asserting in an LTO build on some internal Apple buildbots. No upstream reproduction (and I don't have an LLVM-aware gold built right now to reproduce it personally) but it's a small patch & the failure's semi-plausible so I'm going to revert first while I try to reproduce this. llvm-svn: 209251	2014-05-20 22:33:09 +00:00
Adam Nemet	2ba6492b7b	[ARM64] PR19792: Fix cycle in DAG after performPostLD1Combine Povray and dealII currently assert with "Overran sorted position" in AssignTopologicalOrder. The problem is that performPostLD1Combine can introduce cycles. Consider: (insert_vector_elt (INSERT_SUBREG undef, (load (add %vreg0, Constant<8>), undef), <= A TargetConstant<2>), (load %vreg0, undef), <= B Constant<1>) This is turned into a LD1LANEpost node. However the address in A is not a valid user of the post-incremented address of B in LD1LANEpost. llvm-svn: 209242	2014-05-20 21:47:07 +00:00
David Blaikie	93ef46b02a	Unbreak the sanitizer buildbots after r209226 due to SROA issue described in http://reviews.llvm.org/D3714 Undecided whether this should include a test case - SROA produces bad dbg.value metadata describing a value for a reference that is actually the value of the thing the reference refers to. For now, loosening the assert lets this not assert, but it's still bogus/wrong output... If someone wants to tell me to add a test, I'm willing/able, just undecided. Hopefully we'll get SROA fixed soon & we can tighten up this assertion again. llvm-svn: 209240	2014-05-20 21:40:13 +00:00
Eric Christopher	2feed5fd68	Move the function and data section flags into the options struct and make the functions to set them non-static. Move and rename the llvm specific backend options to avoid conflicting with the clang option. Paired with a backend commit to update. llvm-svn: 209238	2014-05-20 21:25:34 +00:00
Kevin Enderby	fcbed5af67	Revert r209235 as it broke two tests: Failing Tests (2): LLVM :: ExecutionEngine/MCJIT/stubs-sm-pic.ll LLVM :: ExecutionEngine/MCJIT/stubs.ll llvm-svn: 209236	2014-05-20 21:10:15 +00:00
Kevin Enderby	1126d02c0c	Update MachOObjectFile::getSymbolAddress so it returns UnknownAddressOrSize for undefined symbols. Allowing llvm-nm to print spaces instead of 0’s for the value of undefined symbols in Mach-O files. llvm-svn: 209235	2014-05-20 20:32:18 +00:00
Quentin Colombet	c88baa5c10	[LSR] Canonicalize reg1 + ... + regN into reg1 + ... + 1*regN. This commit introduces a canonical representation for the formulae. Basically, as soon as a formula has more that one base register, the scaled register field is used for one of them. The register put into the scaled register is preferably a loop variant. The commit refactors how the formulae are built in order to produce such representation. This yields a more accurate, but still perfectible, cost model. <rdar://problem/16731508> llvm-svn: 209230	2014-05-20 19:25:04 +00:00
David Blaikie	1d9aec67b0	Fix test breakage introduced in r209223. Oops, broke the broken enum constants again. llvm-svn: 209226	2014-05-20 18:36:35 +00:00
Alexey Samsonov	dfcaf9c8d8	Rewrite calculateDbgValueHistory to make it (hopefully) more transparent. This change preserves the original algorithm of generating history for user variables, but makes it more clear. High-level description of algorithm: Scan all the machine basic blocks and machine instructions in the order they are emitted to the object file. Do the following: 1) If we see a DBG_VALUE instruction, add it to the history of the corresponding user variable. Keep track of all user variables, whose locations are described by a register. 2) If we see a regular instruction, look at all the registers it clobbers, and terminate the location range for all variables described by these registers. 3) At the end of the basic block, terminate location ranges for all user variables described by some register. Although this change shouldn't be user-visible (the contents of .debug_loc section should be the same), it changes some internal assumptions about the set of instructions used to track the variable locations. Watching the bots. llvm-svn: 209225	2014-05-20 18:34:54 +00:00
David Blaikie	2af1c805b4	PR19767: DebugInfo emission of pointer constants. In refactoring DwarfUnit::isUnsignedDIType I restricted it to only work on values with signedness (unsigned or signed), asserting on anything else (which did uncover some bugs). But it turns out that we do need to emit constants of signless data, such as pointer constants - only null pointer constants are known to need this so far, but it's conceivable that there might be non-null pointer constants at some point (hardcoded address offsets for device drivers?). This patch just uses 'unsigned' for signless data such as pointer constants. Arguably we could use signless representations (DW_FORM_dataN) instead, allowing a trinary result from isUnsignedDIType (signed, unsigned, signless), but this seems reasonable for now. llvm-svn: 209223	2014-05-20 18:21:51 +00:00
Adam Nemet	571eb5fc91	[PowerPC] PR19796: Also match ISD::TargetConstant in isIntS16Immediate The SplitIndexingFromLoad changes exposed a latent isel bug in the PowerPC64 backend. We matched an immediate offset with STWX8 even though it only supports register offset. The culprit is the complex-pattern predicate, SelectAddrIdx, which decides that if the offset is not ISD::Constant it must be a register. Many thanks to Bill Schmidt for testing this. llvm-svn: 209219	2014-05-20 17:20:34 +00:00
Eric Christopher	650c8f2a06	Clean up language and grammar. Based on a patch by jfcaron3@gmail.com! PR19806 llvm-svn: 209216	2014-05-20 17:11:11 +00:00
Daniel Sanders	a714fcb02a	Temporarily revert: r209129 - [mips][mips64r6] Sorted _ENC, _DESC classes and tests After discussion with Zoran, we have decided to temporarily revert this commit. It's causing some difficult to resolve conflicts and we are under time pressure to deliver an initial MIPS64r6 compiler. We will re-apply an equivalent patch once the time pressure has passed. llvm-svn: 209211	2014-05-20 14:46:24 +00:00
Tim Northover	c807a17a9b	TableGen: permit non-leaf ComplexPattern uses This allows the results of a ComplexPattern check to be distributed to separate named Operands, instead of the current system where all results must apply (and match perfectly) with a single Operand. For example, if "some_addrmode" is a ComplexPattern producing two results, you can write: def : Pat<(load (some_addrmode GPR64:$base, imm:$offset)), (INST GPR64:$base, imm:$offset)>; This should allow neater instruction definitions in TableGen that don't put all possible aspects of addressing into a single operand, but are still usable with relatively simple C++ CodeGen idioms. llvm-svn: 209206	2014-05-20 11:52:46 +00:00
Simon Atanasyan	e7fa2314af	Add parentheses to suppress the gcc warning '-Wparentheses'. No functional changes. llvm-svn: 209203	2014-05-20 10:23:04 +00:00
Benjamin Kramer	7bd6bee385	Legalizer: Make bswap promotion safe for vectors. llvm-svn: 209202	2014-05-20 09:42:31 +00:00
Simon Atanasyan	9cb4090867	[Mips] Add more relocation types and MIPS specific e_flags constants. llvm-svn: 209201	2014-05-20 09:27:49 +00:00
Christian Pirker	875629f713	ARMEB: Additional test files for ARM fixups llvm-svn: 209200	2014-05-20 09:24:37 +00:00
Tim Northover	9a24f88a37	TableGen: convert InstAlias's Emit bit to an int. When multiple aliases overlap, the correct string to print can often be determined purely by considering the InstAlias declarations in some particular order. This allows the user to specify that order manually when desired, without resorting to hacking around with the default lexicographical order on Record instantiation, which is error-prone and ugly. I was also mistaken about "add w2, w3, w4" being the same as "add w2, w3, w4, uxtw". That's only true if Rn is the stack pointer. llvm-svn: 209199	2014-05-20 09:17:16 +00:00
Alexey Volkov	6226de6721	[X86] Tune LEA usage for Silvermont According to Intel Software Optimization Manual on Silvermont in some cases LEA is better to be replaced with ADD instructions: "The rule of thumb for ADDs and LEAs is that it is justified to use LEA with a valid index and/or displacement for non-destructive destination purposes (especially useful for stack offset cases), or to use a SCALE. Otherwise, ADD(s) are preferable." Differential Revision: http://reviews.llvm.org/D3826 llvm-svn: 209198	2014-05-20 08:55:50 +00:00
Zinovy Nis	abdf44e7f3	[LV][REFACTOR] One more tiny fix for printing debug locations in loop vectorizer. Now consistent with the remarks emitter. Differential Revision: http://reviews.llvm.org/D3821 llvm-svn: 209197	2014-05-20 08:26:20 +00:00
Nick Lewycky	ec373545b8	Teach isKnownNonNull that a nonnull return is not null. Add a test for this case as well as the case of a nonnull attribute (already handled but not tested). llvm-svn: 209193	2014-05-20 05:13:21 +00:00
David Blaikie	8e1d489351	DebugInfo: Emit function definitions within their namespace scope. This workaround (presumably for ancient GDB) doesn't appear to be required (GDB 7.5 seems to tolerate function definition DIEs in namespace scope just fine). llvm-svn: 209189	2014-05-20 03:23:24 +00:00
Nick Lewycky	d52b1528c0	Add 'nonnull', a new parameter and return attribute which indicates that the pointer is not null. Instcombine will elide comparisons between these and null. Patch by Luqman Aden! llvm-svn: 209185	2014-05-20 01:23:40 +00:00
David Blaikie	424b59b1ce	DebugInfo: Assume all subprogram DIEs have been created before any abstract subprograms are constructed. Since we visit the whole list of subprograms for each CU at module start, this is clearly true - don't test for the case, just assert it. A few old test cases seemed to have incomplete subprogram lists, but any attempt to reproduce them shows full subprogram lists that even include entities that have been completely inlined and the out of line definition removed. llvm-svn: 209178	2014-05-19 23:16:19 +00:00
Chad Rosier	b06ed63ebf	[ARM64] Adds Cortex-A53 scheduling support for vector load/store post. Patch by Dave Estes<cestes@codeaurora.org>! PR19761 http://reviews.llvm.org/D3829 llvm-svn: 209176	2014-05-19 22:59:51 +00:00
Matt Arsenault	0a3b8f5507	Remove unused method declaration llvm-svn: 209174	2014-05-19 22:55:35 +00:00
David Blaikie	973141a035	DebugInfo: Don't include DW_AT_inline on each abstract definition multiple times. When I refactored this in r208636 I accidentally caused this to be added multiple times to each abstract subprogram (not accounting for the deduplicating effect of the InlinedSubprogramDIEs set). This got better in r208798 when the abstract definitions got the attribute added to them at construction time, but still had the redundant copies introduced in r208636. This commit removes those excess DW_AT_inlines and relies solely on the insertion in r208798. llvm-svn: 209166	2014-05-19 22:07:16 +00:00
David Blaikie	48b056bab0	DebugInfo: Fix missing inlined_subroutines caused by r208748. The check in DwarfDebug::constructScopeDIE was meant to consider inlined subroutines as any non-top-level scope that was a subprogram. Instead of checking "not top level scope" it was checking if the /subprogram's/ scope was non-top-level. Fix this and beef up a test case to demonstrate some of the missing inlined_subroutines are no longer missing. In the course of fixing this I also found that r208748 (with this fix) found one /extra/ inlined_subroutine in concrete_out_of_line.ll due to two inlined_subroutines having the same inlinedAt location. The previous implementation was collapsing these into a single inlined subroutine. I'm not sure what the original code was that created this .ll file so I'm not sure if this actually happens in practice today. Since we deliberately include column information to disambiguate two calls on the same line, that may've addressed this bug in the frontend, but it's good to know that workaround isn't necessary for this particular case anymore. llvm-svn: 209165	2014-05-19 21:54:31 +00:00
Eric Christopher	710c0ae7de	Fix typos. llvm-svn: 209164	2014-05-19 21:18:47 +00:00
Juergen Ributzka	431761771c	[ConstantHoisting][X86] Change the cost model to never hoist constants for types larger than i128. Currently the X86 backend doesn't support types larger than i128 very well. For example an i192 multiply will assert in codegen when the 2nd argument is a constant and the constant got hoisted. This fix changes the cost model to never hoist constants for types larger than i128. Once the codegen issues have been resolved, the cost model can be updated to allow also larger types. This is related to <rdar://problem/16954938> llvm-svn: 209162	2014-05-19 21:00:53 +00:00
Andrea Di Biagio	7a85cadfd6	[X86] Add ISel patterns to improve the selection of TZCNT and LZCNT. Instructions TZCNT (requires BMI1) and LZCNT (requires LZCNT), always provide the operand size as output if the input operand is zero. We can take advantage of this knowledge during instruction selection stage in order to simplify a few corner case. llvm-svn: 209159	2014-05-19 20:38:59 +00:00
Kevin Enderby	403258f5a3	Implement MachOObjectFile::isSectionData() and MachOObjectFile::isSectionBSS so that llvm-size will total up all the sections in the Berkeley format. This allows for rough categorizations for Mach-O sections. And allows the total of llvm-size’s Berkeley and System V formats to be the same. llvm-svn: 209158	2014-05-19 20:36:02 +00:00
Filipe Cabecinhas	dc92102766	Added more insertps optimizations Summary: When inserting an element that's coming from a vector load or a broadcast of a vector (or scalar) load, combine the load into the insertps instruction. Added PerformINSERTPSCombine for the case where we need to fix the load (load of a vector + insertps with a non-zero CountS). Added patterns for the broadcasts. Also added tests for SSE4.1, AVX, and AVX2. Reviewers: delena, nadav, craig.topper Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3581 llvm-svn: 209156	2014-05-19 19:45:57 +00:00
Lang Hames	1fcbc08500	[RuntimeDyld] Fix x86-64 MachO GOT relocation handling. For GOT relocations the addend should modify the offset to the GOT entry, not the value of the entry itself. Teach RuntimeDyldMachO to do The Right Thing here. Fixes <rdar://problem/16961886>. llvm-svn: 209154	2014-05-19 19:21:25 +00:00
Peter Collingbourne	68a889757d	Check the alwaysinline attribute on the call as well as on the caller. Differential Revision: http://reviews.llvm.org/D3815 llvm-svn: 209150	2014-05-19 18:25:54 +00:00
Matt Arsenault	04b67ceeeb	Use range for llvm-svn: 209147	2014-05-19 17:52:48 +00:00
Jyotsna Verma	9a103563f4	reverting r209132 llvm-svn: 209139	2014-05-19 16:22:11 +00:00
Alp Toker	d71b6dfd85	MemoryBuffer: Use GetNativeSystemInfo() Removes old 4096 byte workaround. This functionality has been available since Windows XP. llvm-svn: 209137	2014-05-19 16:13:28 +00:00
Eric Christopher	a5ec92556f	Revert "Patch for function cloning to inline all blocks whose address is taken" as it was causing build failures in ruby. This reverts commit r207713. llvm-svn: 209135	2014-05-19 16:04:10 +00:00
Bradley Smith	c3b931d005	[ARM64] Split tbz/tbnz into W/X register variant llvm-svn: 209134	2014-05-19 15:58:15 +00:00
Jyotsna Verma	daeb25d4e0	Hexagon: Add encoding bits to the mpy instructions. llvm-svn: 209132	2014-05-19 15:32:07 +00:00
Zoran Jovanovic	b2b1f98da4	[mips][mips64r6] Sorted _ENC, _DESC classes and tests Differential Revision: http://reviews.llvm.org/D3808 llvm-svn: 209129	2014-05-19 14:57:46 +00:00
Aaron Ballman	0dfed533ec	Resolving MSVC warnings about switch statements with a default label, but no case labels. No functional changes intended. llvm-svn: 209126	2014-05-19 14:29:04 +00:00
Benjamin Kramer	f3ad23551d	SDAG: Legalize vector BSWAP into a shuffle if the shuffle is legal but the bswap not. - On ARM/ARM64 we get a vrev because the shuffle matching code is really smart. We still unroll anything that's not v4i32 though. - On X86 we get a pshufb with SSSE3. Required more cleverness in isShuffleMaskLegal. - On PPC we get a vperm for v8i16 and v4i32. v2i64 is unrolled. llvm-svn: 209123	2014-05-19 13:12:38 +00:00
Dinesh Dwivedi	f82f16e3e6	Added inst-combine for 'MIN(MIN(A, 97), 23)' and 'MAX(MAX(A, 23), 97)' This removes TODO added in r208849 [http://reviews.llvm.org/D3629] MIN(MIN(A, 97), 23) -> MIN(A, 23) MAX(MAX(A, 23), 97) -> MAX(A, 97) Differential Revision: http://reviews.llvm.org/D3785 llvm-svn: 209110	2014-05-19 07:08:32 +00:00
Craig Topper	b816593cf0	Remove last uses of OwningPtr from llvm. As far as I can tell these method versions are not used by lldb, lld, or clang. llvm-svn: 209103	2014-05-18 21:55:38 +00:00
Saleem Abdulrasool	8bfb192ecd	ARM: make libcall setup more table driven Rather than create a series of function calls to setup the library calls, create a table with the information and just use the table to drive the configuration of the library calls. This makes it easier to both inspect the list as well as to modify it. NFC. llvm-svn: 209089	2014-05-18 16:39:11 +00:00
Benjamin Kramer	f9d2d512c7	Options: Use erase_if to remove Args from the list. While there make getOption return a const reference so we don't have to put it on the stack when calling methods on it. No functionality change. llvm-svn: 209088	2014-05-18 15:14:13 +00:00
Saleem Abdulrasool	a521845381	ARM: improve WoA ABI conformance for frame register Windows on ARM uses R11 for the frame pointer even though the environment is a pure Thumb-2, thumb-only environment. Replicate this behaviour to improve Windows ABI compatibility. This register is used for fast stack walking, and thus is part of the Windows ABI. llvm-svn: 209085	2014-05-18 04:12:52 +00:00
Saleem Abdulrasool	f11f4b4e20	ARM: consolidate frame pointer register knowledge Use the ARMBaseRegisterInfo to query the frame register. The base register info is aware of the frame register that is used for the frame pointer. Use that to determine the frame register rather than duplicating the knowledge. Although, the code path is slightly different in that it may return SP, that can only occur if the frame pointer has been omitted in the machine function, which is supposed to contain the desired value in that case. llvm-svn: 209084	2014-05-18 03:18:09 +00:00
Saleem Abdulrasool	f3a5a5c546	Target: remove old constructors for CallLoweringInfo This is mostly a mechanical change changing all the call sites to the newer chained-function construction pattern. This removes the horrible 15-parameter constructor for the CallLoweringInfo in favour of setting properties of the call via chained functions. No functional change beyond the removal of the old constructors are intended. llvm-svn: 209082	2014-05-17 21:50:17 +00:00
Saleem Abdulrasool	9f664c1083	Target: change member from reference to pointer This is a preliminary step to help ease the construction of CallLoweringInfo. Changing the construction to a chained function pattern requires that the parameter be nullable. However, rather than copying the vector, save a pointer rather than the reference to permit a late binding of the arguments. llvm-svn: 209080	2014-05-17 21:50:01 +00:00
Saleem Abdulrasool	6d11b7cd7a	ARM: whitespace Remove some whitespace. NFC. llvm-svn: 209079	2014-05-17 21:49:54 +00:00
Rafael Espindola	f1bedd3747	Use create methods since msvc doesn't handle delegating constructors. llvm-svn: 209076	2014-05-17 21:29:57 +00:00
Rafael Espindola	77bbb54fbf	Handle ConstantAggregateZero when upgrading global_ctors. llvm-svn: 209075	2014-05-17 21:00:22 +00:00
Rafael Espindola	8370565820	Reduce abuse of default values in the GlobalAlias constructor. This is in preparation for adding an optional offset. llvm-svn: 209073	2014-05-17 19:57:46 +00:00
NAKAMURA Takumi	7ef81a4f98	Revert r209049 and r209065, "Add support for combining GEPs across PHI nodes" It broke clang selfhosting even after r209065. llvm-svn: 209067	2014-05-17 14:39:21 +00:00
Louis Gerbarg	455805694e	Fix for sanitizer crash introduced in r209049 This patch fixes 3 issues introduced by r209049 that only showed up in on the sanitizer buildbots. One was a typo in a compare. The other is a check to confirm that the single differing value in the two incoming GEPs is the same type. The final issue was the the IRBuilder under some circumstances would build PHIs in the middle of the block. llvm-svn: 209065	2014-05-17 06:51:36 +00:00
David Majnemer	483e4e08ac	Target: Replace getSection().empty() with hasSection() No functional change, just a small cleanup. llvm-svn: 209064	2014-05-17 05:18:40 +00:00
Saleem Abdulrasool	46fed305db	ARM: use the proper target object format for WoA WoA uses COFF, not ELF. ARMISelLowering::createTLOF would previously return ELF for any non-MachO platform. This was a missed site when the original change for target format support for Windows on ARM was done. llvm-svn: 209057	2014-05-17 04:28:08 +00:00
Chandler Carruth	c85473143c	[x86] Fix a bad predicate I spotted by inspection -- pshufhw and pshuflw were added in SSE2, no SSSE3. Found this while auditing all uses of SSSE3 in the X86 target. I don't actually expect this to make a significant difference on anything and I don't have any detailed test cases but I updated the existing test cases that already covered some of this code path. llvm-svn: 209056	2014-05-17 03:29:20 +00:00
Alexey Samsonov	cd01472a9b	[DWARF parser] Teach DIContext to fetch short (non-linkage) function names for a given address. Change --functions option in llvm-symbolizer tool to accept values "none", "short" or "linkage". Update the tests and docs accordingly. llvm-svn: 209050	2014-05-17 00:07:48 +00:00
Louis Gerbarg	8d2a43e9be	Add support for combining GEPs across PHI nodes Currently LLVM will generally merge GEPs. This allows backends to use more complex addressing modes. In some cases this is not happening because there is PHI inbetween the two GEPs: GEP1--\ \|-->PHI1-->GEP3 GEP2--/ This patch checks to see if GEP1 and GEP2 are similiar enough that they can be cloned (GEP12) in GEP3's BB, allowing GEP->GEP merging (GEP123): GEP1--\ --\ --\ \|-->PHI1-->GEP3 ==> \|-->PHI2->GEP12->GEP3 == > \|-->PHI2->GEP123 GEP2--/ --/ --/ This also breaks certain use chains that are preventing GEP->GEP merges that the the existing instcombine would merge otherwise. Tests included. rdar://15547484 llvm-svn: 209049	2014-05-16 23:47:24 +00:00
Pete Cooper	0d4ea975ef	Use a sized enum for MachineOperandType. No functionality change llvm-svn: 209048	2014-05-16 23:28:17 +00:00
Filipe Cabecinhas	89654da069	Implemented special cases for PerformVSELECTCombine. vselects with constant masks, after legalization, will get turned into specialized shuffle_vectors so they can be matched to blend+imm instructions. Fixed some tests. llvm-svn: 209044	2014-05-16 22:47:54 +00:00
Filipe Cabecinhas	e15551832c	Lower vselects into X86ISD::BLENDI when appropriate. LowerVSELECT will, if possible, generate a X86ISD::BLENDI DAG node if the condition is constant and we can emit that instruction, given the subtarget. This is not enough for all cases. An additional SELECTCombine optimization will be committed. Fixed tests that were expecting variable blends but where a blend+imm can be generated. Added test where we can't emit blend+immediate. Added avx2 blend+imm tests. llvm-svn: 209043	2014-05-16 22:47:49 +00:00
Filipe Cabecinhas	17254aaead	Implemented LowerVSELECT to custom lower some instructions. No functionality change intended. The types that previously were set to lower as Expand or Legal are doing the same thing with this lowering function. llvm-svn: 209042	2014-05-16 22:47:43 +00:00
Rafael Espindola	e0098928c9	Delete getAliasedGlobal. llvm-svn: 209040	2014-05-16 22:37:03 +00:00
David Blaikie	48369d1b8e	DebugInfo: Assert rather than conditionalizing when a CU's subprogram list contains declarations. llvm-svn: 209039	2014-05-16 22:21:45 +00:00
David Blaikie	c405c9cb0b	DebugInfo: Handle emitting constants of C++ unicode character type. Patch by Stephan Tolksdorf! (with some test case stuff by me) Differential Revision: http://reviews.llvm.org/D3810 llvm-svn: 209037	2014-05-16 21:53:09 +00:00
Tom Stellard	c721a23882	R600/SI: Refactor the VOP3_32 tablegen class This will allow us to use a single MachineInstr to represent instructions which behave the same but have different encodings on some subtargets. llvm-svn: 209028	2014-05-16 20:56:47 +00:00
Tom Stellard	0e70de57a3	R600/SI: Add a PredicateControl class for managing TableGen predicates This was inspired by the PredicateControl class in the MIPS backend. llvm-svn: 209027	2014-05-16 20:56:45 +00:00
Tom Stellard	0289ff4a4f	R600/SI: Move tablegen patterns away from instruction defs llvm-svn: 209026	2014-05-16 20:56:44 +00:00
Tom Stellard	2671338497	R600/SI: Remove unused instruction llvm-svn: 209025	2014-05-16 20:56:43 +00:00
Tom Stellard	f719ee9e76	R600/SI: Promote f32 SELECT to i32 llvm-svn: 209024	2014-05-16 20:56:41 +00:00
Tom Stellard	725db5d2c8	R600/SI: Remove duplicate pattern llvm-svn: 209023	2014-05-16 20:56:37 +00:00
Reid Kleckner	fceb76f5f9	Add comdat key field to llvm.global_ctors and llvm.global_dtors This allows us to put dynamic initializers for weak data into the same comdat group as the data being initialized. This is necessary for MSVC ABI compatibility. Once we have comdats for guard variables, we can use the combination to help GlobalOpt fire more often for weak data with guarded initialization on other platforms. Reviewers: nlewycky Differential Revision: http://reviews.llvm.org/D3499 llvm-svn: 209015	2014-05-16 20:39:27 +00:00
Rafael Espindola	0bf5b143f5	Fix a warning in builds without asserts. llvm-svn: 209012	2014-05-16 20:05:08 +00:00
David Blaikie	46d0ca5b40	DebugInfo: Add an assert regarding the subprogram in the subprogram map matching the abstract subprogram. I'm not sure this is how it'll be going forward (I'd rather prefer the definition to be in the main SP mapping, for various reasons) but this helps me understand how it is today. llvm-svn: 209009	2014-05-16 19:42:10 +00:00
Rafael Espindola	6b238633b7	Fix most of PR10367. This patch changes the design of GlobalAlias so that it doesn't take a ConstantExpr anymore. It now points directly to a GlobalObject, but its type is independent of the aliasee type. To avoid changing all alias related tests in this patches, I kept the common syntax @foo = alias i32* @bar to mean the same as now. The cases that used to use cast now use the more general syntax @foo = alias i16, i32* @bar. Note that GlobalAlias now behaves a bit more like GlobalVariable. We know that its type is always a pointer, so we omit the '*'. For the bitcode, a nice surprise is that we were writing both identical types already, so the format change is minimal. Auto upgrade is handled by looking through the casts and no new fields are needed for now. New bitcode will simply have different types for Alias and Aliasee. One last interesting point in the patch is that replaceAllUsesWith becomes smart enough to avoid putting a ConstantExpr in the aliasee. This seems better than checking and updating every caller. A followup patch will delete getAliasedGlobal now that it is redundant. Another patch will add support for an explicit offset. llvm-svn: 209007	2014-05-16 19:35:39 +00:00
David Blaikie	825f487b68	DebugInfo: Assume the CU's Subprogram list only contains definitions. DIBuilder maintains this invariant and the current DwarfDebug code could end up doing weird things if it contained declarations (such as putting the definition DIE inside a CU that contained the declaration - this doesn't seem like a good idea, so rather than adding logic to handle this case we'll just ban in for now & cross that bridge if we come to it later). llvm-svn: 209004	2014-05-16 18:26:53 +00:00
Chad Rosier	d978ca0308	[ARM64] Increases the Sched Model accuracy for Cortex-A53. Patch by Dave Estes <cestes@codeaurora.org> http://reviews.llvm.org/D3769 llvm-svn: 209001	2014-05-16 17:15:33 +00:00
David Majnemer	78910fc4da	InstSimplify: Improve handling of ashr/lshr Summary: Analyze the range of values produced by ashr/lshr cst, %V when it is being used in an icmp. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3774 llvm-svn: 209000	2014-05-16 17:14:03 +00:00
David Majnemer	ea8d5dbf24	InstSimplify: Optimize using dividend in sdiv Summary: The dividend in an sdiv tells us the largest and smallest possible results. Use this fact to optimize comparisons against an sdiv with a constant dividend. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3795 llvm-svn: 208999	2014-05-16 16:57:04 +00:00
Tilmann Scheller	83c5743650	[ARM64] Fix wrong comment in load/store optimization pass. ldr x1, [x0, #64] add x0, x0, #64 -> ldr x1, [x0], #64 is not a valid transformation, the correct transformation (and what the code actually does) is: ldr x1, [x0, #64] add x0, x0, #64 -> ldr x1, [x0, #64]! llvm-svn: 208998	2014-05-16 16:50:13 +00:00
David Blaikie	4a3b84d2f5	DwarfDebug: Refactor AT_ranges/AT_high_pc+AT_low_pc emission into helper function. llvm-svn: 208997	2014-05-16 16:42:40 +00:00
Simon Atanasyan	b83f380ae4	[yaml2obj][ELF] Add an optional `Size` field to the YAML section declaration. Now the only method to configure ELF section's content and size is to assign a hexadecimal string to the `Content` field. Unfortunately this way is completely useless when you need to declare a really large section. To solve this problem this patch adds one more optional field `Size` to the `RawContentSection` structure. When yaml2obj generates an ELF file it uses the following algorithm: 1. If both `Content` and `Size` fields are missed create an empty section. 2. If only `Content` field is missed take section length from the `Size` field and fill the section by zero. 3. If only `Size` field is missed create a section using data from the `Content` field. 4. If both `Content` and `Size` fields are provided validate that the `Size` value is not less than size of `Content` data. Than take section length from the `Size`, fill beginning of the section by `Content` and the rest by zero. Examples -------- * Create a section 0x10000 bytes long filled by zero Name: .data Type: SHT_PROGBITS Flags: [ SHF_ALLOC ] Size: 0x10000 * Create a section 0x10000 bytes long starting from 'CA' 'FE' 'BA' 'BE' Name: .data Type: SHT_PROGBITS Flags: [ SHF_ALLOC ] Content: CAFEBABE Size: 0x10000 The patch reviewed by Michael Spencer. llvm-svn: 208995	2014-05-16 16:01:00 +00:00
James Molloy	a70697e10e	Re-enable inline memcpy expansion for Thumb1. Patch by Moritz Roth! llvm-svn: 208994	2014-05-16 14:24:22 +00:00
Rafael Espindola	a800445710	Small dyn_cast and auto cleanup. llvm-svn: 208993	2014-05-16 14:22:33 +00:00
James Molloy	556763d2ef	Fix the Load/Store optimization pass to work with Thumb1. Patch by Moritz Roth! llvm-svn: 208992	2014-05-16 14:14:30 +00:00
James Molloy	92a15078f1	Enable the Load/Store optimization pass for Thumb1 but make it return immediately for now. Patch by Moritz Roth! llvm-svn: 208991	2014-05-16 14:11:38 +00:00
James Molloy	bb73c23ffa	Fix a few comment typos and style issues. Patch by Moritz Roth! llvm-svn: 208990	2014-05-16 14:08:46 +00:00
Zoran Jovanovic	6110e3bce6	[mips][mips64r6] Add SELEQZ and SELNEZ instructions Differential Revision: http://reviews.llvm.org/D3743 llvm-svn: 208987	2014-05-16 13:40:57 +00:00
Rafael Espindola	4fe0094fd1	Change the GlobalAlias constructor to look a bit more like GlobalVariable. This is part of the fix for pr10367. A GlobalAlias always has a pointer type, so just have the constructor build the type. llvm-svn: 208983	2014-05-16 13:34:04 +00:00
Zoran Jovanovic	52c56b93e5	[mips][mips64r6] Add Compact indexed jumps. Differential Revision: http://reviews.llvm.org/D3707 llvm-svn: 208981	2014-05-16 13:19:46 +00:00
Yaron Keren	152172009a	Fix hardcoded slash to native path seperator which was exposed from llvm::sys::path. http://reviews.llvm.org/D3687 llvm-svn: 208980	2014-05-16 13:16:30 +00:00
Rafael Espindola	5a52b9f139	Revert "Implement global merge optimization for global variables." This reverts commit r208934. The patch depends on aliases to GEPs with non zero offsets. That is not supported and fairly broken. The good news is that GlobalAlias is being redesigned and will have support for offsets, so this patch should be a nice match for it. llvm-svn: 208978	2014-05-16 13:02:18 +00:00
Zoran Jovanovic	5a8c1e2900	[mips][mips64r6] Add Compact zero-compare branch-and-link instructions Differential Revision: http://reviews.llvm.org/D3718 llvm-svn: 208977	2014-05-16 12:27:19 +00:00
Stepan Dyatkovskiy	948366ac0b	MergeFunctions Pass, introduced total ordering among GEP operations. Patch replaces old isEquivalentGEP implementation, and changes type of comparison result from bool (equal or not) to {-1, 0, 1} (less, equal, greater). This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). llvm-svn: 208976	2014-05-16 11:55:02 +00:00
NAKAMURA Takumi	9ba87f4bfa	MC: Add DwarfTypesDWOSection also to MCCOFF. llvm-svn: 208975	2014-05-16 11:14:51 +00:00
Zoran Jovanovic	3c8869dc6a	[mips][mips64r6] Add compact branch instructions Differential Revision: http://reviews.llvm.org/D3691 llvm-svn: 208974	2014-05-16 11:03:45 +00:00
Stepan Dyatkovskiy	fa6820a035	MergeFunctions Pass, introduced total ordering among operations. Patch replaces old isEquivalentOperation implementation, and changes type of comparison result from bool (equal or not) to {-1, 0, 1} (less, equal, greater). This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). llvm-svn: 208973	2014-05-16 11:02:22 +00:00
Zoran Jovanovic	d04688d1c6	[mips][mips64r6] Add LWPC and LWUPC instructions Differential Revision: http://reviews.llvm.org/D3788 llvm-svn: 208971	2014-05-16 10:27:10 +00:00
Zoran Jovanovic	027a5df93d	[mips][mips64r6] Add Floating Point Compare setting Mask - CMP.condn.fmt Differential Revision: http://reviews.llvm.org/D3750 llvm-svn: 208970	2014-05-16 09:48:29 +00:00
Tim Northover	5896b066e6	TableGen: fix operand counting for aliases TableGen has a fairly dubious heuristic to decide whether an alias should be printed: does the alias have lest operands than the real instruction. This is bad enough (particularly with no way to override it), but it should at least be calculated consistently for both strings. This patch implements that logic: first get the correct string for the variant, in the same way as the Matcher, without guessing; then count the number of whitespace chars. There are basically 4 changes this brings about after the previous commits; all of these appear to be good, so I have changed the tests: + ARM64: we print "neg X, Y" instead of "sub X, xzr, Y". + ARM64: we skip implicit "uxtx" and "uxtw" modifiers. + Sparc: we print "mov A, B" instead of "or %g0, A, B". + Sparc: we print "fcmpX A, B" instead of "fcmpX %fcc0, A, B" llvm-svn: 208969	2014-05-16 09:42:04 +00:00
Tim Northover	28aef9e05d	ARM64: disable printing of "fcmXY ..., #0" aliases The canonical syntax is "fcmXY ..., #0.0". This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208968	2014-05-16 09:41:48 +00:00
Tim Northover	488e6206df	AArch64: disable printing of add/sub alias This alias appears not to have an appropriate PrintMethod. Normally, I'd look into it, but since AArch64 is disappearing soon it's probably not worth it. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208967	2014-05-16 09:41:43 +00:00
Tim Northover	a670f746a3	Sparc: disable printing of jmp/call aliases (C++ does it) These aliases are handled entirely in C++ and only having TableGen InstAliases for some of them was confusing LLVM. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208966	2014-05-16 09:41:39 +00:00
Tim Northover	ba101dd35d	Sparc: disable printing on longer "brX,pt" aliases This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208965	2014-05-16 09:41:35 +00:00
Tim Northover	fe6591ed77	Mips: don't print subu alias for addiu Certainly not without having a custom PrintMethod to invert the immediate beforehand. But probably not at all. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208964	2014-05-16 09:41:30 +00:00
Tim Northover	8f8df324f3	X86: disable printing of bare "mov" aliases In AT&T syntax, we should probably print the full "movl" or "movw". TableGen used to ignore these aliases because it was miscounting the number of operands. This fixes the issue. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208963	2014-05-16 09:41:26 +00:00
Tim Northover	32dcf2d042	AArch64: disable printing of MOV -> MOVZ aliases Actually, MOV sometimes is canonical, but for now this is a better approximation than what's there. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208962	2014-05-16 09:41:21 +00:00
Tim Northover	3e1c7e0198	ARM64: disable printing of swapped compare-mask aliases You can perform (say) an fcmle operation by swapping the operands on an fcmge, but it shouldn't be printed like that. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208961	2014-05-16 09:41:16 +00:00
Tim Northover	5420649590	ARM64: disable printing of LDUR -> LDR aliases We accept "ldr w3, [x1, #-1]" as a convenience, but we should still print the canonical "ldur" form. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208960	2014-05-16 09:41:12 +00:00
Tim Northover	5763670b09	ARM64: give TST aliases priority over ANDS. If an ANDS instruction has Rd == ZR it should be printed as TST since its only effect is on the flags register NZCV. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208959	2014-05-16 09:41:08 +00:00
Tim Northover	106ac4830e	ARM64: give MOV priority over shorter ORR when printing aliases. MOV is almost always the right thing to print if possile. People understand it. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208958	2014-05-16 09:41:03 +00:00
Tim Northover	51b4de90f2	ARM64: give NEG priority over SUB when printing aliases. For example, the full instruction "sub w0, wzr, w1, uxtw" could print as either "neg w0, w1" or "sub w0, wzr, w1". The former is better. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208957	2014-05-16 09:40:58 +00:00
Tim Northover	9c2b4c29ab	ARM64: disable printing of "lslv" type aliases You can write "lslv w0, w1, w2" (probably for legacy reasons), but it should be printed as simply "lsl". This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208956	2014-05-16 09:40:52 +00:00
Hao Liu	8579f0d4d1	[ARM64]Implement NEON post-increment LD1(lane) and post-increment LD1R. llvm-svn: 208955	2014-05-16 09:39:02 +00:00
Stepan Dyatkovskiy	5c2cc2506d	MergeFunctions Pass, introduced total ordering among function attributes. This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). llvm-svn: 208953	2014-05-16 08:55:34 +00:00
Zoran Jovanovic	b397fea9ab	[mips][mips64r6] Add Floating Point Fused Multiply Add Subtract Differential Revision: http://reviews.llvm.org/D3727 llvm-svn: 208952	2014-05-16 08:42:27 +00:00
Saleem Abdulrasool	056fc3da4a	ARM: add some integer/floating point conversion libcalls Add some Windows on ARM specific library calls. These are provided by msvcrt, and can be used to perform integer to floating-point conversions (and vice-versa) mirroring similar functions in the RTABI. llvm-svn: 208949	2014-05-16 05:41:33 +00:00
Juergen Ributzka	34390c70a5	Add C API for thread yielding callback. Sometimes a LLVM compilation may take more time then a client would like to wait for. The problem is that it is not possible to safely suspend the LLVM thread from the outside. When the timing is bad it might be possible that the LLVM thread holds a global mutex and this would block any progress in any other thread. This commit adds a new yield callback function that can be registered with a context. LLVM will try to yield by calling this callback function, but there is no guaranteed frequency. LLVM will only do so if it can guarantee that suspending the thread won't block any forward progress in other LLVM contexts in the same process. Once the client receives the call back it can suspend the thread safely and resume it at another time. Related to <rdar://problem/16728690> llvm-svn: 208945	2014-05-16 02:33:15 +00:00
Justin Bogner	a119f32344	ProfileData: Allow multiple profiles in RawInstrProfReader Allow multiple raw profiles to coexist in a single .profraw file, given the following conditions: - Zero padding at the end of or between profiles will be skipped. - Each profile must start with a valid header. - Mixing endianness or pointer sizes in concatenated profiles files is not allowed. This is needed to handle cases where a program's shared libraries are profiled as well as the main executable itself, as we'll need to emit each executable's counters. Combining the tables in the runtime would be expensive for the instrumented program. rdar://16918688 llvm-svn: 208938	2014-05-16 00:38:00 +00:00
Eric Christopher	c21d3d5f90	Remove the Options query functions and just access our Options directly. llvm-svn: 208937	2014-05-16 00:32:52 +00:00
Reid Kleckner	d20c970aac	musttail: Fix the verification of alignment attributes Previously this would fail with an assertion failure when trying to add an alignment attribute without a value. llvm-svn: 208935	2014-05-15 23:58:57 +00:00
Jiangning Liu	932e1c3924	Implement global merge optimization for global variables. This commit implements two command line switches -global-merge-on-external and -global-merge-aligned, and both of them are false by default, so this optimization is disabled by default for all targets. For ARM64, some back-end behaviors need to be tuned to get this optimization further enabled. llvm-svn: 208934	2014-05-15 23:45:42 +00:00
David Blaikie	962c9a2d54	DebugInfo: Follow up to r208930, comment usage of 'using' to bring in base class overload. Code review feedback from Eric Christopher. llvm-svn: 208933	2014-05-15 23:29:53 +00:00
Eric Christopher	5d376066df	Move more MC options into the MCTargetOptions structure. No functional change. llvm-svn: 208932	2014-05-15 23:27:49 +00:00
Eric Christopher	98dcb8c6b1	Remove unused llvm namespace bool variable. llvm-svn: 208931	2014-05-15 23:27:44 +00:00
David Blaikie	bc094f387b	DebugInfo: Don't put fission type units in comdat sections. Since type units in the dwo file are handled by a debug aware tool, they don't need to leverage the ELF comdat grouping to implement deduplication. Avoid creating all the .group sections for these as a space optimization. llvm-svn: 208930	2014-05-15 23:18:15 +00:00
Reed Kotler	6280d9711d	Finish materialize for ints Summary: We add code to materialize all integer literals. Test Plan: simplestorei.ll Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3596 llvm-svn: 208923	2014-05-15 21:54:15 +00:00
Matt Arsenault	d504a74e3c	Use range for llvm-svn: 208922	2014-05-15 21:44:05 +00:00
Alexey Samsonov	dce67348a8	[DWARF parser] Use enums instead of bitfields in DILineInfoSpecifier. It is more appropriate than the current situation, when one flag (AbsoluteFilePath) is relevant only if another flag is set. This refactoring would also simplify fetching the short function name (stored in DW_AT_name) instead of a linkage name returned currently. No functionality change. llvm-svn: 208921	2014-05-15 21:24:32 +00:00
Reid Kleckner	900d46ff39	Don't insert lifetime.end markers between a musttail call and ret The allocas going out of scope are immediately killed by the return instruction. This is a resend of r208912, which was committed accidentally. Reviewers: chandlerc Differential Revision: http://reviews.llvm.org/D3792 llvm-svn: 208920	2014-05-15 21:10:46 +00:00
Reid Kleckner	b16564109b	Revert "Don't insert lifetime.end markers between a musttail call and ret" This reverts commit r208912. It was committed accidentally without review. llvm-svn: 208914	2014-05-15 20:41:05 +00:00
Reid Kleckner	6af21245eb	Remove unused variable in inliner We have to iterate over all the calls that were inlined to find out if any were musttail. Sink another variable down to where its used. llvm-svn: 208913	2014-05-15 20:39:42 +00:00
Reid Kleckner	26ab7ead45	Don't insert lifetime.end markers between a musttail call and ret The allocas going out of scope are immediately killed by the return instruction. Reviewers: chandlerc Differential Revision: http://reviews.llvm.org/D3630 llvm-svn: 208912	2014-05-15 20:39:13 +00:00
David Blaikie	4c6d987b06	DebugInfo: Simplify retrieving filename/directory name for line table entry building. llvm-svn: 208911	2014-05-15 20:18:50 +00:00
Reid Kleckner	f0915aa0e6	Teach the inliner how to preserve musttail invariants The interesting case is what happens when you inline a musttail call through a musttail call site. In this case, we can't break perfect forwarding or allow any stack growth. Instead of merging control flow from the inlined return instruction after a musttail call into the body of the caller, leave the inlined return instruction in the caller so that the musttail call stays in the tail position. More work is required in http://reviews.llvm.org/D3630 to handle the case where the inlined function has dynamic allocas or byval arguments. Reviewers: chandlerc Differential Revision: http://reviews.llvm.org/D3491 llvm-svn: 208910	2014-05-15 20:11:28 +00:00
David Blaikie	6c21716439	DebugInfo: Add FIXME regarding DILexicalBlock uniquing fields. llvm-svn: 208909	2014-05-15 20:09:55 +00:00
Simon Atanasyan	8d59c8da41	[obj2yaml][ELF] Do not print empty Link and Info fields for ELF sections. llvm-svn: 208905	2014-05-15 18:04:02 +00:00
Juergen Ributzka	bcbed0a549	Revert "[PM] Add pass run listeners to the pass manager." Revert the current implementation and C API. New implementation and C APIs are in the works. llvm-svn: 208904	2014-05-15 17:49:20 +00:00
Bradley Smith	597122b026	[ARM64] Improve diagnostics for Cn operands in SYS instructions llvm-svn: 208902	2014-05-15 16:28:32 +00:00
Andrea Di Biagio	d621120533	[X86] Teach the backend how to fold SSE4.1/AVX/AVX2 blend intrinsics. Added target specific combine rules to fold blend intrinsics according to the following rules: 1) fold(blend A, A, Mask) -> A; 2) fold(blend A, B, <allZeros>) -> A; 3) fold(blend A, B, <allOnes>) -> B. Added two new tests to verify that the new folding rules work for all the optimized blend intrinsics. llvm-svn: 208895	2014-05-15 15:18:15 +00:00
Zoran Jovanovic	d6879febdc	[mips][mips64r6] Add CLASS.fmt instructions Differential Revision: http://reviews.llvm.org/D3712 llvm-svn: 208894	2014-05-15 15:16:36 +00:00
Zoran Jovanovic	bdf1cd374f	[mips][mips64r6] Add RINT.fmt instructions Differential Revision: http://reviews.llvm.org/D3711 llvm-svn: 208892	2014-05-15 15:04:37 +00:00
Zoran Jovanovic	702d27e4db	[mips][mips64r6] Add SELEQZ/SELNEZ.fmt instructions Differential Revision: http://reviews.llvm.org/D3710 llvm-svn: 208891	2014-05-15 14:58:42 +00:00
Zoran Jovanovic	11d4ce1788	[mips][mips64r6] Add MAX/MIN/MAXA/MINA.fmt instructions Differential Revision: http://reviews.llvm.org/D3709 llvm-svn: 208890	2014-05-15 14:54:06 +00:00
Tom Stellard	436780bebb	R600/SI: Stop using VSrc_* as the default register class for types. We now use SReg_* for integer types and VReg_* for floating-point types. This should help simplify the SIFixSGPRCopies pass and no longer causes ISel to insert a COPY after termiator instuctions that output a value. This change is covered by exisitng tests. llvm-svn: 208888	2014-05-15 14:41:57 +00:00
Tom Stellard	a568738b47	R600/SI: Fix a bug with handling of INSERT_SUBREG in SIFixSGPRCopies This prevents a future commit from regressing the load-i1.ll test. llvm-svn: 208887	2014-05-15 14:41:55 +00:00
Tom Stellard	73b98ed8cf	R600/SI: Only use SALU instructions for 64-bit add in a block of CF depth 0 llvm-svn: 208886	2014-05-15 14:41:54 +00:00
Tom Stellard	365a2b49f2	R600/SI: Use VALU instructions for i1 ops llvm-svn: 208885	2014-05-15 14:41:50 +00:00
Tim Northover	60091cfeb9	TableGen: use correct MIOperand when printing aliases Previously, TableGen assumed that every aliased operand consumed precisely 1 MachineInstr slot (this was reasonable because until a couple of days ago, nothing more complicated was eligible for printing). This allows a couple more ARM64 aliases to print so we can remove the special code. On the X86 side, I've gone for explicit AT&T size specifiers as the default, so turned off a few of the aliases that would have just started printing. llvm-svn: 208880	2014-05-15 13:36:01 +00:00
Daniel Sanders	e17212d621	[mips][mips64r6] Add bitswap, and dbitswap Summary: Depends on D3728 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3729 llvm-svn: 208877	2014-05-15 12:18:23 +00:00
Jay Foad	5a29c367f7	Instead of littering asserts throughout the code after every call to computeKnownBits, consolidate them into one assert at the end of computeKnownBits itself. llvm-svn: 208876	2014-05-15 12:12:55 +00:00
Tim Northover	2509a3fc64	ARM64: print correct aliases for NEON mov & mvn instructions In all cases, if a "mov" alias exists, it is the canonical form of the instruction. Now that TableGen can support aliases containing syntax variants, we can enable them and improve the quality of the asm output. llvm-svn: 208874	2014-05-15 12:11:02 +00:00
Daniel Sanders	01124a0132	[mips][mips64r6] Add align and dalign Summary: Depends on D3689 Reviewers: vmedic, zoran.jovanovic, jkolek Reviewed By: jkolek Differential Revision: http://reviews.llvm.org/D3728 llvm-svn: 208872	2014-05-15 12:06:36 +00:00
Tim Northover	d8d65a69cf	TableGen/ARM64: print aliases even if they have syntax variants. To get at least one use of the change (and some actual tests) in with its commit, I've enabled the AArch64 & ARM64 NEON mov aliases. llvm-svn: 208867	2014-05-15 11:16:32 +00:00
Tim Northover	dd8fca5136	ARM64: add correct vector registers during asm parsing Previously, we ignored the difference between V64 and V128 when parsing assembly: they both got mapped to registers in the FPR128 class. This is basically harmless at the moment because they both print and encode the same way. However, it will affect the printing of aliases. llvm-svn: 208866	2014-05-15 11:16:19 +00:00
Bradley Smith	5c44b08912	[ARM64] Improve load/store diagnostics and forbid 32-bit register addresses llvm-svn: 208864	2014-05-15 11:08:30 +00:00
Bradley Smith	e0483f9cd1	[ARM64] Parse fixed vector lanes properly so that diagnostics can be emitted llvm-svn: 208863	2014-05-15 11:07:57 +00:00
Bradley Smith	c294914adc	[ARM64] Add/Fixup diagnostics for floating point immediates llvm-svn: 208862	2014-05-15 11:07:28 +00:00
Bradley Smith	3d1a9ef162	[ARM64] Add condition code operand type such that proper diagnostics can be emitted llvm-svn: 208861	2014-05-15 11:06:51 +00:00
Bradley Smith	13a76e5b88	[ARM64] Add more simple diagnostics for immediate/shift ranges llvm-svn: 208860	2014-05-15 11:06:16 +00:00
Daniel Sanders	b59e1a41f4	[mips][mips64r6] Add addiupc, aluipc, and auipc Summary: No support for symbols in place of the immediate yet since it requires new relocations. Depends on D3671 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3689 llvm-svn: 208858	2014-05-15 10:45:58 +00:00
Daniel Sanders	a3412b13d4	[mips][mips64r6] Add aui, daui, dahi, and dati Summary: Depends on D3671 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3759 llvm-svn: 208857	2014-05-15 10:27:19 +00:00
Chandler Carruth	a0e5695ad9	Teach the constant folder to look through bitcast constant expressions much more effectively when trying to constant fold a load of a constant. Previously, we only handled bitcasts by trying to find a totally generic byte representation of the constant and use that. Now, we look through the bitcast to see what constant we might fold the load into, and then try to form a constant expression cast of the found value that would be equivalent to loading the value. You might wonder why on earth this actually matters. Well, turns out that the Itanium ABI causes us to create a single array for a vtable where the first elements are virtual base offsets, followed by the virtual function pointers. Because the array is homogenous the element type is consistently i8* and we inttoptr the virtual base offsets into the initial elements. Then constructors bitcast these pointers to i64 pointers prior to loading them. Boom, no more constant folding of virtual base offsets. This is the first fix to LLVM to address the insane performance Eric Niebler discovered with Clang on his range comprehensions[1]. There is more to come though, this doesn't really fix the problem fully. [1]: http://ericniebler.com/2014/04/27/range-comprehensions/ llvm-svn: 208856	2014-05-15 09:56:28 +00:00
Daniel Sanders	19627f470b	[mips][mips64r6] Test that branch likelies are not accepted on MIPS64r6. Summary: They aren't implemented for any ISA at the moment. Depends on D3670 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3671 llvm-svn: 208855	2014-05-15 09:47:43 +00:00
Dinesh Dwivedi	83c11da849	Reverting r208848, reason: build failure: sanitizer-x86_64-linux-bootstrap/builds/3399 llvm-svn: 208852	2014-05-15 08:22:55 +00:00
Dinesh Dwivedi	f675f4201b	Added instcombine for 'MIN(MIN(A, 27), 93)' and 'MAX(MAX(A, 93), 27)' MIN(MIN(A, 23), 97) -> MIN(A, 23) MAX(MAX(A, 97), 23) -> MAX(A, 97) Differential Revision: http://reviews.llvm.org/D3629 llvm-svn: 208849	2014-05-15 06:13:40 +00:00
Dinesh Dwivedi	837c16097e	Added inst combine transforms for single bit tests from Chris's note if ((x & C) == 0) x \|= C becomes x \|= C if ((x & C) != 0) x ^= C becomes x &= ~C if ((x & C) == 0) x ^= C becomes x \|= C if ((x & C) != 0) x &= ~C becomes x &= ~C if ((x & C) == 0) x &= ~C becomes nothing Z3 Verifications code for above transform http://rise4fun.com/Z3/Pmsh Differential Revision: http://reviews.llvm.org/D3717 llvm-svn: 208848	2014-05-15 06:01:33 +00:00
Jonathan Roelofs	4971b40d7e	Fix some dyslexia in an assert message llvm-svn: 208842	2014-05-15 02:24:50 +00:00
Alp Toker	beaca19c7c	Fix typos llvm-svn: 208839	2014-05-15 01:52:21 +00:00
Jiangning Liu	09cc564310	[ARM64] Support aggressive fastcc/tailcallopt breaking ABI by popping out argument stack from callee. llvm-svn: 208837	2014-05-15 01:33:17 +00:00
Eric Christopher	737e089bda	Move the TargetMachine MC options to MCTargetOptions. No functional change. llvm-svn: 208832	2014-05-15 01:08:00 +00:00
David Majnemer	186c94244c	InstCombine: Optimize -x s< cst Summary: This gets rid of a sub instruction by moving the negation to the constant when valid. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3773 llvm-svn: 208827	2014-05-15 00:02:20 +00:00
David Blaikie	91e8104622	DwarfDebug: Don't set frame index locations on abstract variables. Abstract variables should never have/use locations. In this case the data wasn't used, so no functional change intended here, just simplification. llvm-svn: 208820	2014-05-14 22:51:59 +00:00
David Blaikie	9ba7254688	DebugInfo: Sure up subprogram variable list handling with more assertions and fewer conditionals. Many old tests using prior schemas still had some brokenness here (both indirect arrays and arrays with single bogus elements). Fixed those up so they don't hit the new assertions. Also reduced nesting in some places, etc. llvm-svn: 208817	2014-05-14 21:52:46 +00:00
David Blaikie	7af6e6f267	DebugInfo: Assert that a CU's subprogram list contains only subprograms. llvm-svn: 208816	2014-05-14 21:52:37 +00:00
Kevin Enderby	e858a65323	Teach llvm-nm to know about fat archives (aka MachOUniversal files containing archives). First step as other tools will be updated next. llvm-svn: 208812	2014-05-14 21:18:50 +00:00
Jay Foad	a0653a3e6c	Rename ComputeMaskedBits to computeKnownBits. "Masked" has been inappropriate since it lost its Mask parameter in r154011. llvm-svn: 208811	2014-05-14 21:14:37 +00:00
David Majnemer	2d6c023576	InstSimplify: Optimize signed icmp of -(zext V) Summary: We know that -(zext V) will always be <= zero, simplify signed icmps that have these. Uncovered using http://www.cs.utah.edu/~regehr/souper/ Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3754 llvm-svn: 208809	2014-05-14 20:16:28 +00:00
David Blaikie	f662f0a65e	DebugInfo: Do not delay attaching DW_AT_inline attribute to abstract definitions. This is just unneccessary - we only create abstract definitions when we're inlining anyway, so there's no reason to delay this to see if we're going to inline anything. llvm-svn: 208798	2014-05-14 17:58:53 +00:00
Christian Pirker	6692e7c116	ARM-BE: test files for vector argument passing Reviewed at http://reviews.llvm.org/D3766 llvm-svn: 208793	2014-05-14 16:59:44 +00:00
Christian Pirker	85cdab63c4	[ARM64-BE] Fix byte order of CIE and FDE frames for exception handling Reviewed at http://reviews.llvm.org/D3741 llvm-svn: 208792	2014-05-14 16:51:58 +00:00
Logan Chien	95188b9092	Fix ARM EHABI when function has landingpad and nounwind. If the function has the landingpad instruction, then the handlerdata should be emitted even if the function has nouwnind attribute. Otherwise, following code will not work: void test1() noexcept { try { throw_exception(); } catch (...) { log_unexpected_exception(); } } Since the cantunwind was incorrectly emitted and the LSDA is not available. llvm-svn: 208791	2014-05-14 16:38:30 +00:00
Benjamin Kramer	594f963ea6	X86: If we have an instruction that sets a flag and a zero test on the input of that instruction try to eliminate the test. For example tzcntl %edi, %ebx testl %edi, %edi je .label can be rewritten into tzcntl %edi, %ebx jb .label A minor complication is that tzcnt sets CF instead of ZF when the input is zero, we have to rewrite users of the flags from ZF to CF. Currently we recognize patterns using lzcnt, tzcnt and popcnt. Differential Revision: http://reviews.llvm.org/D3454 llvm-svn: 208788	2014-05-14 16:14:45 +00:00
Daniel Sanders	a2ffa21d80	[mips][mips64r6] Add sel.s and sel.d Summary: Also use named constants for common opcode fields. Depends on D3669 Reviewers: vmedic, zoran.jovanovic, jkolek Reviewed By: jkolek Differential Revision: http://reviews.llvm.org/D3670 llvm-svn: 208784	2014-05-14 15:29:44 +00:00
Tim Northover	5f92cf60f9	ARM64: remove unneeded InstPrinter hacks Now that TableGen handles aliases, these are unneeded. Hopefully more will be able to go soon. llvm-svn: 208781	2014-05-14 14:44:18 +00:00
Evgeniy Stepanov	ed31ca4bdc	[asan] Fix compiler warnings. llvm-svn: 208769	2014-05-14 10:56:19 +00:00
Evgeniy Stepanov	aaf4bb2394	[asan] Set debug location in ASan function prologue. Most importantly, it gives debug location info to the coverage callback. This change also removes 2 cases of unnecessary setDebugLoc when IRBuilder is created with the same debug location. llvm-svn: 208767	2014-05-14 10:30:15 +00:00
Serge Pavlov	e6de9e39a8	Fix the case when reordering shuffle and binop produces a constant. This resolves PR19737. llvm-svn: 208762	2014-05-14 09:05:09 +00:00
Jay Foad	e48d9e8efe	Update the comments for ComputeMaskedBits, which lost its Mask parameter in r154011. llvm-svn: 208757	2014-05-14 08:00:07 +00:00
Simon Atanasyan	ae6bb33ac2	[obj2yaml] Support ELF input format in the obj2yaml tool. The ELF header e_flags field in the MIPS related test cases handled incorrectly. The obj2yaml prints too many flags. I will fix that in the next patches. The patch reviewed by Michael Spencer and Sean Silva. llvm-svn: 208752	2014-05-14 05:07:47 +00:00
Saleem Abdulrasool	27351f2022	ARM: implement support for the UDF mnemonic The UDF instruction is a reserved undefined instruction space. The assembler mnemonic was introduced with ARM ARM rev C.a. The instruction is not predicated and the immediate constant is ignored by the CPU. Add support for the three encodings for this instruction. The changes to the invalid instruction test is due to the fact that the invalid instructions actually overlap with the undefined instruction. Introduction of the new instruction results in a partial decode as an undefined sequence. Drop the tests as they are invalid instruction patterns anyways. llvm-svn: 208751	2014-05-14 03:47:39 +00:00
Nick Lewycky	f0cf8fa941	Optimize integral reciprocal (udiv 1, x and sdiv 1, x) to not use division. This fires exactly once in a clang bootstrap, but covers a few different results from http://www.cs.utah.edu/~regehr/souper/ llvm-svn: 208750	2014-05-14 03:03:05 +00:00
David Blaikie	9b8c8cda0d	Recommit r208506: DebugInfo: Include lexical scopes in inlined subroutines. This was reverted in r208642 due to regressions surrounding file changes within lexical scopes causing inlining information to be lost. The issue was in LexicalScopes::getOrCreateInlinedScope, where I was previously testing "isLexicalBlock" which is false for "DILexicalBlockFile" (a scope used to represent changes in the current file name) and assuming it was then a function (breaking out of the inlined scope path and reaching for the parent non-inlined scopes). By inverting the condition and testing for "isSubprogram" the correct behavior is attained. (also found some weirdness in Clang, see r208742 when reducing this test case - the resulting test case doesn't apply with the Clang fix, but I've added a more realistic test case to inline-scopes.ll which does reproduce the issue and demonstrate the fix) llvm-svn: 208748	2014-05-14 01:08:28 +00:00
Eric Christopher	02e1804d8d	Fix typo in function name. llvm-svn: 208743	2014-05-14 00:31:15 +00:00
Matt Arsenault	4b0402e317	R600/SI: Try to fix BFE operands when moving to VALU This was broken by r208479 llvm-svn: 208740	2014-05-13 23:45:50 +00:00
Lang Hames	890758d5b3	[RuntimeDyld] Fix handling of i386 PC-rel external relocations. This fixes several more i386 MCJIT regression test failures. <rdar://problem/16889891> llvm-svn: 208735	2014-05-13 22:09:07 +00:00
Louis Gerbarg	1b91aa2cf5	Add missing line breaks to debug output in CodeGenPrepare llvm-svn: 208731	2014-05-13 21:54:22 +00:00
Benjamin Kramer	3f085ba6d6	GVN: Fix non-determinism in map iteration. Iterating over a DenseMaop is non-deterministic and results to unpredictable IR output. Based on a patch by Daniel Reynaud! llvm-svn: 208728	2014-05-13 21:06:40 +00:00
Benjamin Kramer	d97f95e2f2	GVN: rangify a couple of loops. No functionality change. llvm-svn: 208727	2014-05-13 21:06:36 +00:00
Eric Christopher	d1309ee27d	Save the optimization level the subtarget was created with in a member variable and sink the initialization of crbits into the subtarget feature reset code. No functional change, but this refactor will be used in a future commit. llvm-svn: 208726	2014-05-13 20:49:08 +00:00
Eric Christopher	addf51ddde	Make the split function use StringRef::split. llvm-svn: 208723	2014-05-13 19:55:17 +00:00
Rafael Espindola	99e05cf163	Split GlobalValue into GlobalValue and GlobalObject. This allows code to statically accept a Function or a GlobalVariable, but not an alias. This is already a cleanup by itself IMHO, but the main reason for it is that it gives a lot more confidence that the refactoring to fix the design of GlobalAlias is correct. That will be a followup patch. llvm-svn: 208716	2014-05-13 18:45:48 +00:00
Joerg Sonnenberger	94cbb666b2	Check explicitly for EHABI and just use the default settings. Code depends on the assembler and linker to fix things up... llvm-svn: 208715	2014-05-13 17:58:13 +00:00
Christian Pirker	39db7ec81f	ARMEB: Fix byte order of EH frame unwinding instructions, with modified test file This commit was already commited as revision rL208689 and discussd in phabricator revision D3704. But the test file was crashing on OS X and windows. I fixed the test file in the same way as in rL208340. llvm-svn: 208711	2014-05-13 16:44:30 +00:00
Joey Gouly	12a8bf09d0	[CGP] r205941 changed the logic, so that a cast happens before 'Result' is compared to 'AddrMode.BaseReg'. In the case that 'AddrMode.BaseReg' is nullptr, 'Result' will also be nullptr, so the cast causes an assertion. We should use dyn_cast_or_null here to check 'Result' is not null and it is an instruction. Bug found by Mats Petersson, and I reduced his IR to get a test case. llvm-svn: 208705	2014-05-13 15:42:45 +00:00
Rafael Espindola	2e7eceb317	Revert "ARMEB: Fix byte order of EH frame unwinding instructions" This reverts commit r208689. The test was crashing on OS X and windows. llvm-svn: 208704	2014-05-13 15:19:56 +00:00
Daniel Sanders	387fc15d2c	[mips] Marked up instructions added in MIPS32r2 and tested that IAS for -mcpu=mips(2\|32) does not accept them Summary: This required a new instruction group representing the 32-bit subset of MIPS-3 that was available in MIPS32R2. To limit the number of tests required, only one 32-bit and one 64-bit ISA prior to MIPS32/MIPS64 are tested. rdhwr has been deliberately left without an ISA annotation for now. This is because the assembler and CodeGen disagree on when the instruction is available. Strictly speaking, it is only available in MIPS32r2 and MIPS64r2. However, it is emulated by a kernel trap on earlier ISA's and is necessary for TLS so CodeGen should emit it on older ISA's too. Depends on D3696 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3697 llvm-svn: 208690	2014-05-13 11:45:36 +00:00
Christian Pirker	ea3514ecdb	ARMEB: Fix byte order of EH frame unwinding instructions llvm-svn: 208689	2014-05-13 11:41:49 +00:00
Daniel Sanders	579168629e	[mips] Free up two values in SubtargetFeatureFlag by folding the redundant IsGP32/IsGP64 into IsGP32bit/IsGP64bit Summary: We are currently very close to the 32-bit limit of the current assembler implementation. This is because there is no way to represent an instruction that is available in, for example, Mips3 or Mips32. We have to define a feature bit that represents this. This patch cleans up a pair of redundant feature bits and slightly postpones the point we will reach the limit. Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3703 llvm-svn: 208685	2014-05-13 11:17:46 +00:00
Artyom Skrobov	6298b347b6	[un]wrap extracted from lib/Target/Target[MachineC].cpp, lib/ExecutionEngine/ExecutionEngineBindings.cpp into include/llvm/IR/DataLayout.h llvm-svn: 208680	2014-05-13 09:45:26 +00:00
Kevin Qin	97e5d98779	[ARM64] Fix the misleading diagnostic on bad extend amount of reg+reg addressing mode. A vague diagnostic replaced the misleading one. This can fix bug 19502. llvm-svn: 208669	2014-05-13 07:35:12 +00:00
Serge Pavlov	b575ee8294	Fix type of shuffle resulted from shuffle merge. This fix resolves PR19730. llvm-svn: 208666	2014-05-13 06:07:21 +00:00
Rafael Espindola	7e2b7567a8	Assert that we don't RAUW a Constant with a ConstantExpr that contains it. We already had an assert for foo->RAUW(foo), but not for something like foo->RAUW(GEP(foo)) and would go in an infinite loop trying to apply the replacement. llvm-svn: 208663	2014-05-13 01:23:21 +00:00
Weiming Zhao	dd83691cc3	Folding into CSEL when there is ZEXT between SETCC and ADD Normally, patterns like (add x, (setcc cc ...)) will be folded into (csel x, x+1, not cc). However, if there is a ZEXT after SETCC, they won't be folded. This patch recognizes the ZEXT and allows the generation of CSINC. This patch fixes bug 19680. llvm-svn: 208660	2014-05-13 00:40:58 +00:00
David Blaikie	290e22872d	Revert "DebugInfo: Include lexical scopes in inlined subroutines." This reverts commit r208506. Some inlined subroutine scopes appear to be missing with this change. Reverting while I investigate. llvm-svn: 208642	2014-05-12 23:53:03 +00:00
Pete Cooper	7fd1d725b9	Use a logical not when inverting SetCC. This unfortunately doesn't fire on any targets so I couldn't find a test case to trigger it. The problem occurs when a non-i1 setcc is inverted. For example 'i8 = setcc' will get 'xor 0xff' to invert this. This is clearly wrong when the boolean contents are ZeroOrOne. This patch introduces getLogicalNOT and updates SetCC legalisation to use it. Reviewed by Hal Finkel. llvm-svn: 208641	2014-05-12 23:26:58 +00:00
Adam Nemet	5d78558c2b	[DAGCombiner] Split up an indexed load if only the base pointer value is live Right now the load may not get DCE'd because of the side-effect of updating the base pointer. This can happen if we lower a read-modify-write of an illegal larger type (e.g. i48) such that the modification only affects one of the subparts (the lower i32 part but not the higher i16 part). See the testcase. In order to spot the dead load we need to revisit it when SimplifyDemandedBits decided that the value of the load is masked off. This is the CommitTargetLoweringOpt piece. I checked compile time with ARM64 by sending SPEC bitcode files through llc. No measurable change. Fixes <rdar://problem/16031651> llvm-svn: 208640	2014-05-12 23:00:03 +00:00
Reid Kleckner	7a59e0845f	Try to fix an SDAG dependence issue with sret r208453 added support for having sret on the second parameter. In that change, the code for copying sret into a virtual register was hoisted into the loop that lowers formal parameters. This caused a "Wrong topological sorting" assertion failure during scheduling when a parameter is passed in memory. This change undoes that by creating a second loop that deals with sret. I'm worried that this fix is incomplete. I don't fully understand the dependence issues. However, with this change we produce the same DAGs we used to produce, so if they are broken, they are just as broken as they have always been. llvm-svn: 208637	2014-05-12 22:01:27 +00:00
David Blaikie	525358db2c	DebugInfo: Attach DW_AT_inline to inlined subprograms at DIE-construction time rather than as a post-processing step. llvm-svn: 208636	2014-05-12 21:50:44 +00:00
Lang Hames	36072da3d9	[RuntimeDyld] Add support for MachO __jump_table and __pointers sections, and SECTDIFF relocations on 32-bit x86. This fixes several of the MCJIT regression test failures that show up on 32-bit builds. <rdar://problem/16886294> llvm-svn: 208635	2014-05-12 21:39:59 +00:00
Matt Arsenault	37c12d7343	Use cast<> for unchecked use llvm-svn: 208627	2014-05-12 20:42:57 +00:00
Sebastian Pop	05719e486f	use nullptr instead of NULL llvm-svn: 208622	2014-05-12 20:11:01 +00:00
Louis Gerbarg	efdcf23736	Add support bswap16 to/from memory compiling to rev16 on ARM/Thumb The current patterns for REV16 misses mostn __builtin_bswap16() due to legalization promoting the operands to from load/stores toi32s and then truncing/extending them. This patch adds new patterns that catch the resultant DAGs and codegens them to rev16 instructions. Tests included. rdar://15353652 llvm-svn: 208620	2014-05-12 19:53:52 +00:00
Matt Arsenault	b3ee388594	Use cast<> for unchecked use llvm-svn: 208618	2014-05-12 19:26:38 +00:00
Matt Arsenault	4d64f96530	Use range for llvm-svn: 208617	2014-05-12 19:23:21 +00:00
Sebastian Pop	b1a548f72d	do not assert when delinearization fails llvm-svn: 208615	2014-05-12 19:01:53 +00:00
Sebastian Pop	0e75c5cb64	use isZero() llvm-svn: 208614	2014-05-12 19:01:49 +00:00
David Blaikie	4abe19edad	DwarfDebug: Avoid an extra map lookup while constructing abstract scope DIEs and reduce nesting/conditionals. One test case had to be updated as it still had the extra indirection for the variable list - removing the extra indirection got it back to passing. llvm-svn: 208608	2014-05-12 18:23:35 +00:00
Tim Northover	ee20caaf82	TableGen: use PrintMethods to print more aliases llvm-svn: 208607	2014-05-12 18:04:06 +00:00
Tim Northover	0184206ff7	AArch64/ARM64: use InstAliases for NEON logical (imm) instructions. llvm-svn: 208606	2014-05-12 18:03:42 +00:00
Tim Northover	faaf00c402	AArch64/ARM64: implement "mov $Rd, $Imm" aliases in TableGen. This is a slightly different approach to AArch64 (the base instruction definitions aren't quite right for that to work), but achieves the same thing and reduces C++ hackery in AsmParser. llvm-svn: 208605	2014-05-12 18:03:36 +00:00
Matt Arsenault	62b1737081	R600: Add mul24 intrinsics llvm-svn: 208604	2014-05-12 17:49:57 +00:00
Matt Arsenault	2adca6090f	Make SimplifyDemandedBits understand BUILD_PAIR llvm-svn: 208598	2014-05-12 17:14:48 +00:00
Daniel Sanders	f99637cb4d	Revert: r208582 - [mips][mips64r6] Add sel.s and sel.d Accidentally committed an unreviewed patch. Reverted it. llvm-svn: 208583	2014-05-12 15:43:41 +00:00
Daniel Sanders	52de11e475	[mips][mips64r6] Add sel.s and sel.d Summary: Also use named constants for common opcode fields. Depends on D3669 Reviewers: jkolek, vmedic, zoran.jovanovic Differential Revision: http://reviews.llvm.org/D3670 llvm-svn: 208582	2014-05-12 15:39:10 +00:00
Daniel Sanders	08e1e0a873	[mips][mips64r6] Add d?div, d?mod, d?divu, d?modu Summary: Depends on D3668 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3669 llvm-svn: 208579	2014-05-12 15:24:16 +00:00
Daniel Sanders	0ac5ec58b8	[mips][mips64r6] Added mul/mulu/muh/muhu Summary: The 'mul' line of the test is temporarily commented out because it currently matches the MIPS32 mul instead of the MIPS32r6 mul. This line will be uncommented when we disable the MIPS32 mul on MIPS32r6. Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3668 llvm-svn: 208576	2014-05-12 15:12:45 +00:00
Rafael Espindola	05447dd278	Move EmitDwarfAdvanceLineAddr and EmitDwarfAdvanceFrameAddr to the obj streamer. This lets us delete the MCAsmStreamer implementation. No functionality change. llvm-svn: 208570	2014-05-12 14:43:25 +00:00
Rafael Espindola	1bb4a3f660	Pass a MCObjectStreamer instead of a MCStreamer when possible. No functionality change. llvm-svn: 208569	2014-05-12 14:40:12 +00:00
Rafael Espindola	4066e8dd64	Pass a MCObjectStreamer instead of a MCStreamer when possible. No functionality change. llvm-svn: 208567	2014-05-12 14:28:48 +00:00
Aaron Ballman	29fd7b9b20	Silencing an MSVC warning about not all control paths returning a value (even though the switch is fully covered). No functional change. llvm-svn: 208565	2014-05-12 14:22:58 +00:00
Tim Northover	120195542c	ARM64: remove dead validation code from the AsmParser. If this code triggers, any immediate has already been validated so it can't possibly trigger a diagnostic. llvm-svn: 208564	2014-05-12 14:13:21 +00:00
Tim Northover	2625a993f9	ARM64: merge "extend" and "shift" addressing-mode enums. In terms of assembly, these have too much overlap to be neatly modelled as disjoint classes: in many cases "lsl" is an acceptable alternative to either "uxtw" or "uxtx". llvm-svn: 208563	2014-05-12 14:13:17 +00:00
Rafael Espindola	3dd8ef6b49	Move EH/Debug frame handling to the object streamer. Now that the asm streamer doesn't use it, the MCStreamer doesn't need to know about it. llvm-svn: 208562	2014-05-12 14:02:44 +00:00
Rafael Espindola	aa7851d18d	Remove always true argument and unused field. llvm-svn: 208561	2014-05-12 13:47:05 +00:00
Rafael Espindola	01ee31bbad	Remove always true argument and field. llvm-svn: 208559	2014-05-12 13:40:49 +00:00
Rafael Espindola	8285b778f4	Remove always true argument. llvm-svn: 208558	2014-05-12 13:34:25 +00:00
Rafael Espindola	7f4ccced49	Remove an always true argument. llvm-svn: 208557	2014-05-12 13:30:10 +00:00
Rafael Espindola	dba6bbee0f	Remove write only field. llvm-svn: 208555	2014-05-12 13:20:37 +00:00
Rafael Espindola	bf520f23e8	Remove now empty method. llvm-svn: 208554	2014-05-12 13:18:13 +00:00
Rafael Espindola	d67df50f29	Remove the always true UseCFI member. llvm-svn: 208553	2014-05-12 13:12:22 +00:00
Benjamin Kramer	3b36b72a9c	X86: Make sure that we have SSE4.1 before we generate insertps nodes. PR19721. llvm-svn: 208552	2014-05-12 13:12:08 +00:00
Rafael Espindola	883cf7e656	Remove the useCFI constructor argument to MCAsmStreamer. llvm-svn: 208551	2014-05-12 13:07:11 +00:00
Daniel Sanders	aadc357e5f	[mips] Marked up instructions added in MIPS32 and tested that IAS for -mcpu=mips2 does not accept them Summary: To limit the number of tests required, only one 32-bit and one 64-bit ISA prior to MIPS32/MIPS64 are explicitly tested. Depends on D3695 Reviewers: vmedic Differential Revision: http://reviews.llvm.org/D3696 llvm-svn: 208549	2014-05-12 13:04:32 +00:00
Rafael Espindola	9e1b99cbcd	Remove MCUseCFI from TargetMachine. It was always true. llvm-svn: 208547	2014-05-12 13:01:42 +00:00
Daniel Sanders	07cdea2baa	[mips] Marked up instructions added in MIPS-V and tested that IAS for -mcpu=mips[1234] does not accept them Summary: This required a new instruction group representing the 32-bit subset of MIPS-V that was available in MIPS32R2 Most of these instructions are correctly rejected but with the wrong error message. These have been placed in a separate test for now. It happens because many of the MIPS V instructions have not been implemented. Depends on D3694 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3695 llvm-svn: 208546	2014-05-12 12:52:44 +00:00
Daniel Sanders	070fd1c42a	[mips] Fold FeatureBitCount into FeatureMips32 and FeatureMips64 Summary: DCL[ZO] are now correctly marked as being MIPS64 instructions. This has no effect on the CodeGen tests since expansion of i64 prevented their use anyway. The check for MIPS16 to prevent the use of CLZ no longer prevents DCLZ as well. This is not a functional change since DCLZ is still prohibited by being a MIPS64 instruction (MIPS16 is only compatible with MIPS32). No functional change Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3694 llvm-svn: 208544	2014-05-12 12:41:59 +00:00
Daniel Sanders	fcea8102e8	[mips] Fold FeatureSEInReg into FeatureMips32r2 Summary: No functional change Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3693 llvm-svn: 208543	2014-05-12 12:28:15 +00:00
Daniel Sanders	39d0051847	[mips] Fold FeatureSwap into FeatureMips32r2 and FeatureMips64r2 Summary: dsbh and dshd are not available on Mips32r2. No codegen test changes required since expansion of i64 prevented the use of these instructions anyway. Depends on D3690 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3692 llvm-svn: 208542	2014-05-12 12:15:41 +00:00
Daniel Sanders	94eda2e1ab	[mips] Replace FeatureFPIdx with FeatureMips4_32r2 Summary: No functional change. The minor change to the MIPS16 code is in preparation for a patch that will handle 32-bit FPIdx instructions separately to 64-bit (because they were added in different revisions) Depends on D3677 Reviewers: rkotler, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3690 llvm-svn: 208541	2014-05-12 11:56:16 +00:00
Bradley Smith	bbec45a4f1	[ARM64] Add proper bounds checking/diagnostics to logical shifts llvm-svn: 208540	2014-05-12 11:49:16 +00:00
Christian Pirker	238c7c165b	ARM: Implement big endian bit-conversion for NEON type llvm-svn: 208538	2014-05-12 11:19:20 +00:00
NAKAMURA Takumi	6c383021c9	X86ISelLowering.cpp:LowerINTRINSIC_W_CHAIN(): Prune impossible "default:" [-Wcovered-switch-default] llvm-svn: 208533	2014-05-12 10:16:46 +00:00
Serge Pavlov	02ff620c7b	Fix type of shuffle obtained from reordering with binary operation In transformation: BinOp(shuffle(v1,undef), shuffle(v2,undef)) -> shuffle(BinOp(v1, v2),undef) type of the undef argument must be same as type of BinOp. llvm-svn: 208531	2014-05-12 10:11:27 +00:00
Bradley Smith	d5de13e4d6	[ARM64] Add diagnostics for bitfield extract/insert instructions Unfortunately, since ARM64 models all these instructions as aliases, the checks need to be done at the time the alias is seen rather than during instruction validation as AArch64 does it. llvm-svn: 208529	2014-05-12 09:44:57 +00:00
Bradley Smith	9ba3c963ff	[ARM64] Correct more bounds checks/diagnostics for arithmetic shift operands llvm-svn: 208528	2014-05-12 09:41:43 +00:00
Bradley Smith	ad363d7121	[ARM64] Move register/register MOV handling into tablegen and improve diagnostics llvm-svn: 208527	2014-05-12 09:38:16 +00:00
Elena Demikhovsky	4f591c0d45	Fixed compilation issue llvm-svn: 208524	2014-05-12 07:45:41 +00:00
Elena Demikhovsky	8e8fde8e93	AVX-512: changes in intrinsics 1) Changed gather and scatter intrinsics. Now they are aligned with GCC built-ins. There is no more non-masked form. Masked intrinsic receives -1 if all lanes are executed. 2) I changed the function that works with intrinsics inside X86ISelLowering.cpp. I put all intrinsics in one table. I did it for INTRINSICS_W_CHAIN and plan to put all intrinsics from WO_CHAIN set to the same table in order to avoid the long-long "switch". (I wanted to use static map initialization that allowed by C++11 but I wasn't able to compile it on VS2012). 3) I added gather/scatter prefetch intrinsics. 4) I fixed MRMm encoding for masked instructions. llvm-svn: 208522	2014-05-12 07:18:51 +00:00
Saleem Abdulrasool	fba09d47e9	CodeGen: add parenthesis around complex expression Add missing parenthesis suggested by GCC. NFC. llvm-svn: 208519	2014-05-12 06:08:18 +00:00
Serge Pavlov	0581109708	Fix reordering of shuffles and binary operations Do not apply transformation: BinOp(shuffle(v1), shuffle(v2)) -> shuffle(BinOp(v1, v2)) if operands v1 and v2 are of different size. This change fixes PR19717, which was caused by r208488. llvm-svn: 208518	2014-05-12 05:44:53 +00:00
Matt Arsenault	46013d903f	Fix return before else llvm-svn: 208510	2014-05-11 21:24:41 +00:00
Hal Finkel	0d8db46799	[PowerPC] Add global named register support Support for the intrinsics that read from and write to global named registers is added for r1, r2 and r13 (depending on the subtarget). llvm-svn: 208509	2014-05-11 19:29:11 +00:00
Hal Finkel	f0e086a0bc	Pass the value type to TLI::getRegisterByName We must validate the value type in TLI::getRegisterByName, because if we don't and the wrong type was used with the IR intrinsic, then we'll assert (because we won't be able to find a valid register class with which to construct the requested copy operation). For PPC64, additionally, the type information is necessary to decide between the 64-bit register and the 32-bit subregister. No functionality change. llvm-svn: 208508	2014-05-11 19:29:07 +00:00
Hal Finkel	b33e9872a0	Add 'override' to getRegisterByName in *ISelLowering.h No functionality change intended. llvm-svn: 208507	2014-05-11 19:28:55 +00:00
David Blaikie	9576766be9	DebugInfo: Include lexical scopes in inlined subroutines. llvm-svn: 208506	2014-05-11 18:12:17 +00:00
David Blaikie	e0f14743c0	DwarfUnit: Make explicit a limitation/bug in enumeration constant emission. Filed as PR19712, LLVM fails to detect the right type of an enum constant when a frontend does not provide an underlying type for the enumeration type. llvm-svn: 208502	2014-05-11 17:04:05 +00:00
Hal Finkel	c4c6c87666	[PowerPC] On PPC32, 128-bit shifts might be runtime calls The counter-loops formation pass needs to know what operations might be function calls (because they can't appear in counter-based loops). On PPC32, 128-bit shifts might be runtime calls (even though you can't use __int128 on PPC32, it seems that SROA might form them). Fixes PR19709. llvm-svn: 208501	2014-05-11 16:23:29 +00:00
David Blaikie	60cae1ba49	DwarfUnit: Pick a winner between isTypeSigned and isUnsignedDIType. And the winner by a nose is isUnsignedDIType, for no particular reason. These two functions were just complements of each other and used in very related code, so refactor callers to just use one of them. llvm-svn: 208500	2014-05-11 16:08:41 +00:00
David Blaikie	c0a2841e2f	DwarfUnit: Factor out calling isUnsignedDIType into a utility function so each caller of emitConstantValue doesn't have to call it separately. llvm-svn: 208496	2014-05-11 15:56:59 +00:00
David Blaikie	c05c8f483b	DwarfUnit: Share common constant value emission between APInts of small (<= 64 bit) and MCOperand immediates. Doesn't seem a good reason to duplicate this code (it was more literally duplicated prior to r208494, and while the dataN code /does/ actually fire in this case, it doesn't seem necessary (and the DWARF standard recommends using udata/sdata pervasively instead of dataN, so as to indicate signedness of the values)) llvm-svn: 208495	2014-05-11 15:47:39 +00:00
David Blaikie	958647c36d	DebugInfo: Simplify constant value emission. This code looks to have become dead at some time in the past. I tried to reproduce cases where LLVM would emit constants with dataN, but could not. Upon inspection it seems the code doesn't do that anymore - the only time a size is provided by isTypeSigned is when the type is signed, and in those cases we use sdata. dataN is only used for unsigned types and isTypeSigned doesn't provide a value for sizeInBits in that case. Remove the dead cases/size plumbing. llvm-svn: 208494	2014-05-11 15:06:20 +00:00
Benjamin Kramer	1fef64214c	SLPVectorizer: Instead of just performing CSE on dead blocks ignore them completely. Turns out that there is a very cheap way of testing whether a block is dead, just look it up in the DomTree. We have to do this anyways so just ignore unreachable blocks before sorting by domination. This restores a proper ordering for std::stable_sort when dead code is present. Covered by existing tests & buildbots running in STL debug mode (MSVC). llvm-svn: 208492	2014-05-11 10:28:58 +00:00
Serge Pavlov	9ef66a8266	Reorder shuffle and binary operation. This patch enables transformations: BinOp(shuffle(v1), shuffle(v2)) -> shuffle(BinOp(v1, v2)) BinOp(shuffle(v1), const1) -> shuffle(BinOp, const2) They allow to eliminate extra shuffles in some cases. Differential Revision: http://reviews.llvm.org/D3525 llvm-svn: 208488	2014-05-11 08:46:12 +00:00
Filipe Cabecinhas	0e3d1cb5d6	Fixed a bug when lowering build_vector (PR19694) When lowering build_vector to an insertps, we would still lower it, even if the source vectors weren't v4x32. This would break on avx if the source was a v8x32. We now check the type of the source vectors. llvm-svn: 208487	2014-05-11 08:12:56 +00:00
Vincent Lejeune	29c0c210fc	R600/SI: Fold fabs/fneg into src input modifier llvm-svn: 208480	2014-05-10 19:18:39 +00:00
Vincent Lejeune	94af31fbe8	R600/SI: Prettier display of input modifiers llvm-svn: 208479	2014-05-10 19:18:33 +00:00
Vincent Lejeune	79a5834647	R600/SI: Use pseudo instruction for fabs/clamp/fneg llvm-svn: 208478	2014-05-10 19:18:25 +00:00
Benjamin Kramer	8cff45aa20	SCEV: Use range-based for loop and fold variable into assert. llvm-svn: 208476	2014-05-10 17:47:18 +00:00
Tim Northover	55b3e22927	ARM64: fix SELECT_CC lowering in absence of NaNs. We were swapping the true & false results while testing for FMAX/FMIN, but not putting them back to the original state if the later checks failed. Should fix PR19700. llvm-svn: 208469	2014-05-10 07:37:50 +00:00
Benjamin Kramer	8722aa5754	SLPVectorizer: When sorting by domination for CSE don't assert on unreachable code. There is no total ordering if the CFG is disconnected. We don't care if we catch all CSE opportunities in dead code either so just exclude ignore them in the assert. PR19646 llvm-svn: 208461	2014-05-09 23:28:49 +00:00
Reid Kleckner	c487d73f41	Revert "[ms-cxxabi] Add a new calling convention that swaps 'this' and 'sret'" This reverts commit r200561. This calling convention was an attempt to match the MSVC C++ ABI for methods that return structures by value. This solution didn't scale, because it would have required splitting every CC available on Windows into two: one for methods and one for free functions. Now that we can put sret on the second arg (r208453), and Clang does that (r208458), revert this hack. llvm-svn: 208459	2014-05-09 22:56:42 +00:00
Sebastian Pop	47fe7de1b5	move findArrayDimensions to ScalarEvolution we do not use the information from SCEVAddRecExpr to compute the shape of the array, so a better place for this function is in ScalarEvolution. llvm-svn: 208456	2014-05-09 22:45:07 +00:00
Sebastian Pop	444621abe1	fix typo in debug message llvm-svn: 208455	2014-05-09 22:45:02 +00:00

... 4 5 6 7 8 ...

69962 Commits