llvm-project

Commit Graph

Author	SHA1	Message	Date
Lang Hames	5dc14bd54c	[CodeGen] Teach the peephole optimizer to remember (and exploit) all folding opportunities in the current basic block, rather than just the last one seen. <rdar://problem/16478629> llvm-svn: 205481	2014-04-02 22:59:58 +00:00
Rafael Espindola	af9129468e	Fix a nomenclature error in llvm-nm. What llvm-nm prints depends on the file format. On ELF for example, if the file is relocatable, it prints offsets. If it is not, it prints addresses. Since it doesn't really need to care what it is that it is printing, use the generic term value. Fix or implement getSymbolValue to keep llvm-nm working. llvm-svn: 205479	2014-04-02 22:52:46 +00:00
Pete Cooper	4da0a0c87b	Add ability to disable building LLVM utils Patch by Chris Bieneman llvm-svn: 205478	2014-04-02 22:49:58 +00:00
Hal Finkel	f823380a44	[PowerPC] Make PPCTTI::getMemoryOpCost call BasicTTI::getMemoryOpCost PPCTTI::getMemoryOpCost will now make use of BasicTTI::getMemoryOpCost to calculate the base cost of the memory access, and then adjust on top of that. There is no functionality change from this modification, but it will become important so that PPCTTI can take advantage of scalarization information for which BasicTTI::getMemoryOpCost will account in the near future. llvm-svn: 205476	2014-04-02 22:43:49 +00:00
Juergen Ributzka	fcd2e94ecc	Add comments and test case for [DAG] Keep the opaque constant flag when performing unary constant folding operations (r204737). llvm-svn: 205474	2014-04-02 22:21:01 +00:00
Adrian Prantl	fd43f6ba3b	typo llvm-svn: 205473	2014-04-02 22:17:30 +00:00
Lang Hames	c2c751312e	[X86] Make the VFMA*231 variants commutable and relax the alignment restrictions on FMA3 memory operands. FMA3 instructions are VEX encoded, so they can load from unaligned memory. Testcase to follow, along with related patch. <rdar://problem/16478629> llvm-svn: 205472	2014-04-02 22:06:16 +00:00
Duncan P. N. Exon Smith	4680f40d28	Revert "Reapply "LTO: add API to set strategy for -internalize"" This reverts commit r199244. Conflicts: include/llvm-c/lto.h include/llvm/LTO/LTOCodeGenerator.h lib/LTO/LTOCodeGenerator.cpp llvm-svn: 205471	2014-04-02 22:05:57 +00:00
Juergen Ributzka	27435b3b8a	Add comments and test case for [X86TTI] Make constant base pointers for GetElementPtr opaque (r204739). llvm-svn: 205468	2014-04-02 21:45:36 +00:00
Saleem Abdulrasool	009d0e96b7	ARM: fixup tests to specify the target more explicitly This changes the tests that were targeting ARM EABI to explicitly specify the environment rather than relying on the default. This breaks with the new Windows on ARM support when running the tests on Windows where the default environment is no longer EABI. Take the opportunity to avoid a pointless redirect (helps when trying to debug with providing a command line invocation which can be copy and pasted) and removing a few greps in favour of FileCheck. llvm-svn: 205465	2014-04-02 21:22:03 +00:00
Juergen Ributzka	ab6f44efaf	Add test case for [Stackmaps][X86TTI] Fix think-o in getIntImmCost calculation (r204738). llvm-svn: 205464	2014-04-02 21:15:36 +00:00
Saleem Abdulrasool	cd1308296e	ARM: update subtarget information for Windows on ARM Update the subtarget information for Windows on ARM. This enables using the MC layer to target Windows on ARM. llvm-svn: 205459	2014-04-02 20:32:05 +00:00
Jim Grosbach	2a2459f365	Make a few more range-based loops use explicit types. No functional change. llvm-svn: 205458	2014-04-02 20:21:22 +00:00
Rafael Espindola	a1ec7409a6	Add back an assert that was lost in the ELFObjectFile.h split. llvm-svn: 205456	2014-04-02 20:00:33 +00:00
Tom Stellard	36a031870b	TargetLibraryInfo: Disable memcpy and memset on R600 There are no implementations of these for R600. llvm-svn: 205455	2014-04-02 19:53:29 +00:00
Jim Grosbach	36c4953348	Simplify resolveFrameIndex() signature. Just pass a MachineInstr reference rather than an MBB iterator. Creating a MachineInstr& is the first thing every implementation did anyway. llvm-svn: 205453	2014-04-02 19:28:18 +00:00
Jim Grosbach	4a1a9ce5e6	ARM: cortex-m0 doesn't support unaligned memory access. Unlike other v6+ processors, cortex-m0 never supports unaligned accesses. From the v6m ARM ARM: "A3.2 Alignment support: ARMv6-M always generates a fault when an unaligned access occurs." rdar://16491560 llvm-svn: 205452	2014-04-02 19:28:13 +00:00
Jim Grosbach	df1e05bb8a	Make some range based loop types more explicit. No functional change, but more readable code. llvm-svn: 205451	2014-04-02 19:28:08 +00:00
Kai Nacke	13673ac704	[mips] Add more Octeon cnMips instructions Adds the instructions ext/ext32/cins/cins32. It also changes pop/dpop to accept the two operand version and adds a simple pattern to generate baddu. Tests for the two operand versions (including baddu/dmul/dpop/pop) and the code generation pattern for baddu are included. Reviewed by: Daniel.Sanders@imgtec.com llvm-svn: 205449	2014-04-02 18:40:43 +00:00
Jim Grosbach	20b0790df7	[C++11,ARM64] Range based for and explicit 'override' in STP cleanup. No functional change intended. llvm-svn: 205446	2014-04-02 18:00:59 +00:00
Jim Grosbach	05abd709f3	[C++11,ARM64] Range based for loops in constant promotion. No functional change intended. llvm-svn: 205445	2014-04-02 18:00:56 +00:00
Jim Grosbach	7dc9edeaa5	[C++11,ARM64] Range based for loops in load/store pair optimizer. No functional change intended. llvm-svn: 205444	2014-04-02 18:00:53 +00:00
Jim Grosbach	020e657790	[C++11,ARM64] Range based for loops in target lowering. No functional change intended. llvm-svn: 205443	2014-04-02 18:00:51 +00:00
Jim Grosbach	91f1f47751	[C++11,ARM64] Range based for loops in frame lowering. No functional change intended. llvm-svn: 205442	2014-04-02 18:00:49 +00:00
Jim Grosbach	f39d752b03	[C++11,ARM64] Range based for loops in pseudo expansion. No functional change intended. llvm-svn: 205441	2014-04-02 18:00:46 +00:00
Jim Grosbach	673825ebac	[C++11,ARM64] Range based for loops for LOH No functional change intended. llvm-svn: 205440	2014-04-02 18:00:44 +00:00
Jim Grosbach	2539c3d07a	[C++11,ARM64] Range based for loops TLS cleanup. No functional change intended. llvm-svn: 205439	2014-04-02 18:00:41 +00:00
Jim Grosbach	0d0c5a614a	[C++11,ARM64] Range based for loops in branch relaxation. No functional change intended. llvm-svn: 205438	2014-04-02 18:00:39 +00:00
Jim Grosbach	1c762ca9bd	[C++11,ARM64] Range based for loops in address type promotion. No functional change intended. llvm-svn: 205437	2014-04-02 18:00:36 +00:00
Quentin Colombet	7bf9d8cd13	[ARM64][CollectLOH] Remove the link to the radar from the comments. llvm-svn: 205435	2014-04-02 16:40:49 +00:00
Simon Atanasyan	3ee21b014b	[yaml2obj][ELF] Convert some static functions into class members to reduce number of arguments. No functional changes. llvm-svn: 205434	2014-04-02 16:34:54 +00:00
Simon Atanasyan	ee7776d681	[yaml2obj][ELF] Remove unused typedef. No functional changes. llvm-svn: 205433	2014-04-02 16:34:48 +00:00
Simon Atanasyan	220c54a0e3	[yaml2obj][ELF] Move section index to the ELFState class. No functional changes. llvm-svn: 205432	2014-04-02 16:34:40 +00:00
Simon Atanasyan	ee882239cb	[yaml2obj][ELF] Remove relationship between ELFState and ContiguousBlobAccumulator classes. Pass ContiguousBlobAccumulator to the handleSymtabSectionHeader function directly. No functional changes. llvm-svn: 205431	2014-04-02 16:34:34 +00:00
Oliver Stannard	b14c625111	ARM: Add support for segmented stacks Patch by Alex Crichton, ILyoan, Luqman Aden and Svetoslav. llvm-svn: 205430	2014-04-02 16:10:33 +00:00
Adrian Prantl	a731cf0018	clarify comment llvm-svn: 205429	2014-04-02 15:49:45 +00:00
Adrian Prantl	f79621e440	fix a comment to use ASCII apostrophes. llvm-svn: 205428	2014-04-02 15:49:37 +00:00
Tim Northover	6d69168ffd	ARM64: use GOT for weak symbols & PIC. Weak symbols cannot use the small code model's usual ADRP sequences since the instruction simply may not be able to encode a value of 0. This redirects them to use the GOT, which hopefully linkers are able to cope with even in the static relocation model. llvm-svn: 205426	2014-04-02 14:39:11 +00:00
Tim Northover	0d80f70530	ARM64: fix lowering of fp128 fptosi/fptoui We were creating libcall nodes that returned an MVT::f128, when these particular operations actually return an int of some stripe. llvm-svn: 205425	2014-04-02 14:39:07 +00:00
Tim Northover	670df3d937	SLPVectorizer: compare entire intrinsic for SLP compatibility. Some Intrinsics are overloaded to the extent that return type equality (all that's been checked up to now) does not guarantee that the arguments are the same. In these cases SLP vectorizer should not recurse into the operands, which can be achieved by comparing them as "Function *" rather than simply the ID. llvm-svn: 205424	2014-04-02 14:39:02 +00:00
Tim Northover	ebd37ab382	ARM64: make sure first argument to INSERT_SUBVECTOR has right type. Again, coalescing and other optimisations swiftly made the MachineInstrs consistent again, but when compiled at -O0 a bad INSERT_SUBREGISTER was produced. llvm-svn: 205423	2014-04-02 14:38:58 +00:00
Tim Northover	5e3a484e3b	ARM64: convert fp16 narrowing ISel to pseudo-instruction The previous attempt was fine with optimisations, but was actually rather cavalier with its types. When compiled at -O0, it produced invalid COPY MachineInstrs. llvm-svn: 205422	2014-04-02 14:38:54 +00:00
Job Noorman	f7da105f39	Mark FPB as a reserved register when needed. llvm-svn: 205421	2014-04-02 13:13:56 +00:00
Rafael Espindola	b1b49789d0	Work around gold bug http://sourceware.org/PR16794 . llvm-svn: 205416	2014-04-02 12:15:20 +00:00
Renato Golin	d93295ea56	Remove duplicated DMB instructions ARM specific optimization, finding places in ARM machine code where 2 dmbs follow one another, and eliminating one of them. Patch by Reinoud Elhorst. llvm-svn: 205409	2014-04-02 09:03:43 +00:00
Yaron Keren	2895496852	Added isTargetWindowsMSVC(), renamed isTargetMingw() to isTargetWindowsGNU() and isTargetCygwin() to isTargetWindowsCygwin() to be consistent with the four Windows environments in Triple.h. Suggestion by Saleem Abdulrasool! llvm-svn: 205393	2014-04-02 04:27:51 +00:00
Hal Finkel	b0ebdc0f43	[LoopVectorizer] Count dependencies of consecutive pointers as uniforms For the purpose of calculating the cost of the loop at various vectorization factors, we need to count dependencies of consecutive pointers as uniforms (which means that the VF = 1 cost is used for all overall VF values). For example, the TSVC benchmark function s173 has: ... %3 = add nsw i64 %indvars.iv, 16000 %arrayidx8 = getelementptr inbounds %struct.GlobalData* @global_data, i64 0, i32 0, i64 %3 ... and we must realize that the add will be a scalar in order to correctly deduce it to be profitable to vectorize this on PowerPC with VSX enabled. In fact, all dependencies of a consecutive pointer must be a scalar (uniform), and so we simply need to add all consecutive pointers to the worklist that currently collects uniforms. Fixes PR19296. llvm-svn: 205387	2014-04-02 02:34:49 +00:00
David Blaikie	326e1fa13b	Adjust comments regarding non-relocated abbrev offset in debug_info.dwo I'm not sure the comment in the implementation really adds a lot of value (it's clear that we emit zero when no symbol is provided, but it doesn't explain why we would do that). Happy to iterate. llvm-svn: 205386	2014-04-02 02:04:51 +00:00
David Blaikie	94c1d7f174	Split debug_loc and debug_loc.dwo emission into two separate functions Based on code review feedback from Eric Christopher on r204697 llvm-svn: 205385	2014-04-02 01:50:20 +00:00
David Blaikie	0a456de5a2	DebugInfo: Introduce DebugLocList to encapsulate a list of DebugLocEntries and an MC Label to refer to them This removes the magic-number-esque code creating/retrieving the same label for a debug_loc entry from two places and removes the last small piece of reusable logic from emitDebugLoc so that there will be less duplication when refactoring it into two functions (one for debug_loc, the other for debug_loc.dwo). llvm-svn: 205382	2014-04-02 01:43:18 +00:00
Quentin Colombet	3c2b13b258	[ARM64][CollectLOH] Add some comments to explain how the LOHs framework works (for the compiler part), since the design document is not available. llvm-svn: 205379	2014-04-02 01:02:28 +00:00
Adrian Prantl	3c5453cb6e	Add a doxygen comment to DebugLocEntry::Merge. llvm-svn: 205374	2014-04-01 23:34:45 +00:00
David Blaikie	6fa9966ee6	DebugLocEntry: Actually merge the loc entry when returning true. Seems we didn't have any test coverage for merging... awesome. So I added some - but hit an llvm-objdump bug while I was there. I'm choosing not to shave that yak right now. Code review feedback/bug catch by Adrian Prantl in r205360. llvm-svn: 205373	2014-04-01 23:19:23 +00:00
David Blaikie	91567b6700	Fix accidental fallthrough in DebugLocEntry::hasSameValueOrLocation No test case (this would invoke UB by examining uninitialized members, etc, at best - and this code is apparently untested anyway - I'm about to fix that) Code review feedback from Adrian Prantl on r205360. llvm-svn: 205367	2014-04-01 22:25:09 +00:00
David Blaikie	c2af77b027	Remove unused function DebugLocEntry::isEmpty llvm-svn: 205365	2014-04-01 22:06:18 +00:00
David Blaikie	d306baf572	Refactor out the comparison of the location/value in a DebugLocEntry llvm-svn: 205364	2014-04-01 22:04:07 +00:00
David Blaikie	98caf8bc64	Add inequality operator for MachineLocation. Fixes the build I broke in r205360 llvm-svn: 205361	2014-04-01 21:54:52 +00:00
David Blaikie	1275e4f026	DebugInfo: Split DebugLocEntry into its own file. It seems big enough that it deserves its own file - but it is header only, so there's no need for another cpp file, etc. llvm-svn: 205360	2014-04-01 21:49:04 +00:00
Adrian Prantl	6b444c5c8e	Add a comment about the DIDescriptor class hierarchy. llvm-svn: 205358	2014-04-01 21:04:24 +00:00
Adrian Prantl	75ce62acef	DwarfDebug: Prevent DebugLocEntry merging from coalescing two different constants into only the first one. rdar://14874886. llvm-svn: 205357	2014-04-01 21:04:18 +00:00
Hal Finkel	9e0baa6d3a	[PowerPC] Add some missing VSX bitcast patterns llvm-svn: 205352	2014-04-01 19:24:27 +00:00
Yaron Keren	48d68d439a	If isKnownWindowsMSVCEnvironment then getOS == Triple::Win32 and Environment == Triple::MSVC so it will never be MinGW or Cygwin. llvm-svn: 205349	2014-04-01 18:52:55 +00:00
Hal Finkel	2eed29f3c8	Implement X86TTI::getUnrollingPreferences This provides an initial implementation of getUnrollingPreferences for x86. getUnrollingPreferences is used by the generic (concatenation) unroller, which is distinct from the unrolling done by the loop vectorizer. Many modern x86 cores have some kind of uop cache and loop-stream detector (LSD) used to efficiently dispatch small loops, and taking full advantage of this requires unrolling small loops (small here means 10s of uops). These caches also have limits on the number of taken branches in the loop, and so we also cap the loop unrolling factor based on the maximum "depth" of the loop. This is currently calculated with a partial DFS traversal (partial because it will stop early if the path length grows too much). This is still an approximation, and one that is both conservative (because it does not account for branches eliminated via block placement) and optimistic (because it is only recording the maximum depth over minimum paths). Nevertheless, because the loops that fit in these uop caches are so small, it is not clear how much the details matter. The original set of patches posted for review produced the following test-suite performance results (from the TSVC benchmark) at that time: ControlLoops-dbl - 13% speedup ControlLoops-flt - 15% speedup Reductions-dbl - 7.5% speedup llvm-svn: 205348	2014-04-01 18:50:34 +00:00
Hal Finkel	6386cb8d4d	Add some additional fields to TTI::UnrollingPreferences In preparation for an upcoming commit implementing unrolling preferences for x86, this adds additional fields to the UnrollingPreferences structure: - PartialThreshold and PartialOptSizeThreshold - Like Threshold and OptSizeThreshold, but used when not fully unrolling. These are necessary because we need different thresholds for full unrolling from those used when partially unrolling (the full unrolling thresholds are generally going to be larger). - MaxCount - A cap on the unrolling factor when partially unrolling. This can be used by a target to prevent the unrolled loop from exceeding some resource limit independent of the loop size (such as number of branches). There should be no functionality change for any in-tree targets. llvm-svn: 205347	2014-04-01 18:50:30 +00:00
Hal Finkel	b4e001cc81	Use TopTTI->getGEPCost from within getUserCost The implementation of getUserCost had duplicated (and hard-coded) the default logic in getGEPCost. Instead, it is better to use getGEPCost directly, which limits the default logic to the implementation of one function, and allows targets to override the behavior. No functionality change intended. llvm-svn: 205346	2014-04-01 18:50:06 +00:00
Kai Nacke	af47f60f83	[mips] Add Octeon cnMips instructions mtmX and mtpX Adds the Octeon cnMips instructions "load multiplier register MPLx" and "load product register Px". Includes tests. Reviews by: Daniel.Sanders@imgtec.com llvm-svn: 205343	2014-04-01 18:35:26 +00:00
Reid Kleckner	101102711d	Support segmented stacks on Win64 Identical to Win32 method except the GS segment register is used for TLS instead of FS and pvArbitrary is at TEB offset 0x28 instead of 0x14. llvm-svn: 205342	2014-04-01 18:34:21 +00:00
Matt Arsenault	553751b9bc	Fix missing RUN line in test llvm-svn: 205341	2014-04-01 18:34:13 +00:00
Yaron Keren	136fe7db46	isTargetWindows() renamed to isTargetKnownWindowsMSVC() to reflect its current functionality. Based on Takumi NAKAMURA suggestion. llvm-svn: 205338	2014-04-01 18:15:34 +00:00
Matt Arsenault	e407ae9846	Make isSetCCEquivalent respect the TargetBooleanContents llvm-svn: 205336	2014-04-01 18:13:26 +00:00
Matt Arsenault	6310c3f667	Add helpers for checking if a value is a target boolean constant. llvm-svn: 205335	2014-04-01 18:13:22 +00:00
David Blaikie	0e84adc621	DebugInfo: Factor out common functionality for rendering debug_loc and debug_loc.dwo location list entries In preparation for refactoring this function into two, one for debug_loc, one for debug_loc.dwo. llvm-svn: 205324	2014-04-01 16:17:41 +00:00
David Blaikie	7f1f8742ea	Cleanup remaining use of removed variable to fix the build llvm-svn: 205323	2014-04-01 16:13:29 +00:00
David Blaikie	e12ab1276d	Simplify debug_loc.dwo handling slightly. llvm-svn: 205322	2014-04-01 16:09:49 +00:00
Christian Pirker	dc9ff75554	ARM: rename ARMle/ARMbe with ARMLE/ARMBE, and Thumble/Thumbbe with ThumbLE/ThumbBE llvm-svn: 205317	2014-04-01 15:19:30 +00:00
Tim Northover	0feb91ef15	ARM: teach LLVM that Cortex-A7 is very similar to A8. llvm-svn: 205314	2014-04-01 14:10:07 +00:00
Aaron Ballman	8bf5a548ea	Attempting to fix r205124, which had failed asserts when built with MSVC. Suggestion from Yaron Keren. llvm-svn: 205313	2014-04-01 13:56:35 +00:00
Tim Northover	1351030801	ARM: add cyclone CPU with ZeroCycleZeroing feature. The Cyclone CPU is similar to swift for most LLVM purposes, but does have two preferred instructions for zeroing a VFP register. This teaches LLVM about them. llvm-svn: 205309	2014-04-01 13:22:02 +00:00
Daniel Sanders	21bce30fdc	[mips] Renamed ParseAnyRegisterWithoutDollar to MatchAnyRegisterWithoutDollar This is for consistency with other functions. The Parse* functions consume tokens and the Match* functions don't. No functional change. llvm-svn: 205305	2014-04-01 12:35:23 +00:00
Aaron Ballman	0947bb20d8	Fixing an MSVC warning about widening the result of a 32-bit shift implicitly. No functional change intended. llvm-svn: 205304	2014-04-01 12:24:25 +00:00
Tim Northover	4f1dd58e2e	ARM64: add intrinsic for pmull (p64 x p64 = p128) operations. llvm-svn: 205302	2014-04-01 12:22:37 +00:00
Aaron Ballman	d1726ee8fa	Fixing warnings in the MSVC build. No functional changes intended. llvm-svn: 205301	2014-04-01 12:22:20 +00:00
Daniel Sanders	ffd8436d6c	[mips] Extend ParseJumpTarget to support the full symbol expression syntax. Summary: This should fix the issues that D3222 caused in lld. Testcase is based on the one that failed in the buildbot. Depends on D3233 Reviewers: matheusalmeida, vmedic Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3234 llvm-svn: 205298	2014-04-01 10:41:48 +00:00
Daniel Sanders	315386c083	[mips] Use AsmLexer::peekTok() to resolve the conflict between $reg and $sym Summary: Parsing registers no longer consume the $ token before it's confirmed whether it really has a register or not, therefore it's no longer impossible to match symbols if registers were tried first. Depends on D3232 Reviewers: matheusalmeida, vmedic Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3233 llvm-svn: 205297	2014-04-01 10:40:14 +00:00
Daniel Sanders	0993457891	[mips] Hoist Parser.Lex() calls out of MatchAnyRegisterNameWithoutDollar() Summary: No functional change Depends on D3222 Reviewers: matheusalmeida, vmedic Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3232 llvm-svn: 205295	2014-04-01 10:37:46 +00:00
Tim Northover	ff179ba3d3	ARM64: add patterns for more lane-wise ld1/st1 operations. llvm-svn: 205294	2014-04-01 10:37:09 +00:00
Tim Northover	d8d613b979	ARM64: fix bug in ld3r (1d) SelectionDAG. llvm-svn: 205293	2014-04-01 10:37:03 +00:00
Daniel Sanders	b50ccf8e26	[mips] Rewrite MipsAsmParser and MipsOperand. Summary: Highlights: - Registers are resolved much later (by the render method). Prior to that point, GPR32's/GPR64's are GPR's regardless of register size. Similarly FGR32's/FGR64's/AFGR64's are FGR's regardless of register size or FR mode. Numeric registers can be anything. - All registers are parsed the same way everywhere (even when handling symbol aliasing) - One consequence is that all registers can be specified numerically almost anywhere (e.g. $fccX, $wX). The exception is symbol aliasing but that can be easily resolved. - Removes the need for the hasConsumedDollar hack - Parenthesis and Bracket suffixes are handled generically - Micromips instructions are parsed directly instead of going through the standard encodings first. - rdhwr accepts all 32 registers, and the following instructions that previously xfailed now work: ddiv, ddivu, div, divu, cvt.l.[ds], se[bh], wsbh, floor.w.[ds], c.ngl.d, c.sf.s, dsbh, dshd, madd.s, msub.s, nmadd.s, nmsub.s, swxc1 - Diagnostics involving registers point at the correct character (the $) - There's only one kind of immediate in MipsOperand. LSA immediates are handled by the predicate and renderer. Lowlights: - Hardcoded '$zero' in the div patterns is handled with a hack. MipsOperand::isReg() will return true for a k_RegisterIndex token with Index == 0 and getReg() will return ZERO for this case. Note that it doesn't return ZERO_64 on isGP64() targets. - I haven't cleaned up all of the now-unused functions. Some more of the generic parser could be removed too (integers and relocs for example). - insve.df needed a custom decoder to handle the implicit fourth operand that was needed to make it parse correctly. The difficulty was that the matcher expected a Token<'0'> but gets an Imm<0>. Adding an implicit zero solved this. Reviewers: matheusalmeida, vmedic Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3222 llvm-svn: 205292	2014-04-01 10:35:28 +00:00
Renato Golin	33f973a43a	Recover TableGen/LangRef, make it official Making the new TableGen documentation official and marking the old file as "Moved". Also, reverting the original LangRef as the normative formal description of the language, while keeping the "new" LangRef as LangIntro for the less inclined to reading language grammars. We should remove TableGenFundamentals.rst one day, but for now, just a warning that it moved will have to do, while we make sure there are no more links to it from elsewhere. llvm-svn: 205289	2014-04-01 09:51:49 +00:00
Alexey Volkov	1328b28dc6	[x86] Do not convert to cmp32 for Atom arch by Sergey Okunev Differential Revision: http://llvm-reviews.chandlerc.com/D2824 llvm-svn: 205288	2014-04-01 08:13:07 +00:00
David Blaikie	3464161070	DebugInfo: Avoid creating unnecessary/empty line tables and remove the special case of '0' in DwarfCompileUnit::initStmtList by just always using a label difference This moves one case of raw text checking down into the MCStreamer interfaces in the form of a virtual function, even if we ultimately end up consolidating on the one-or-many line tables issue one day, this is nicer in the interim. This just generally streamlines a bunch of use cases into a common code path. llvm-svn: 205287	2014-04-01 08:07:52 +00:00
David Blaikie	8bf66c4f3f	DebugInfo: Emit relocation to debug_line section when emitting asm for asm I don't think this is reachable by any frontend (why would you transform asm to asm+debug info?) but it helps tidy up some of this code, avoid the weird special case of "emit the first CU, store the label, then emit the rest" in MCDwarfLineTable::Emit by instead having the DWARF-for-assembly case use the same codepath as DwarfDebug.cpp, by registering the label of the debug_line section, thus causing it to be emitted. (with a special case in asm output to just emit the label since asm output uses the .loc directives, etc, rather than the debug_loc directly) llvm-svn: 205286	2014-04-01 07:35:52 +00:00
Adrian Prantl	762bdd5118	Remove FIXMEs. The scope of a Variable is always a lexical scope; there is nothing to be gained from switching this over to a DIScopeRef. llvm-svn: 205281	2014-04-01 03:50:01 +00:00
Adrian Prantl	d09ba23faf	LTO type uniquing: store the Decl field of a DIImportedEntity as a DIRef. No other functionality changes, DIBuilder testcase is included in a paired CFE commit. This relaxes the assertion in isScopeRef to also accept subclasses of DIScope. llvm-svn: 205279	2014-04-01 03:41:04 +00:00
Adrian Prantl	c3264f393d	Add a comment about type-uniquing ObjC types. llvm-svn: 205277	2014-04-01 03:40:59 +00:00
David Blaikie	f78cbb5b44	Comment to describe the debug_loc.dwo constants Code review feedback from Eric Christopher on r204697 llvm-svn: 205268	2014-03-31 23:50:20 +00:00
Saleem Abdulrasool	d3bafe3e41	MCJIT: ensure that cygwin is identified properly Cygwin is now a proper environment rather than an OS. This updates the MCJIT tests to avoid execution on Cygwin. This fixes native cygwin tests. llvm-svn: 205266	2014-03-31 23:42:23 +00:00
Hal Finkel	86b3064f2b	Move partial/runtime unrolling late in the pipeline The generic (concatenation) loop unroller is currently placed early in the standard optimization pipeline. This is a good place to perform full unrolling, but not the right place to perform partial/runtime unrolling. However, most targets don't enable partial/runtime unrolling, so this never mattered. However, even some x86 cores benefit from partial/runtime unrolling of very small loops, and follow-up commits will enable this. First, we need to move partial/runtime unrolling late in the optimization pipeline (importantly, this is after SLP and loop vectorization, as vectorization can drastically change the size of a loop), while keeping the full unrolling where it is now. This change does just that. llvm-svn: 205264	2014-03-31 23:23:51 +00:00
Duncan P. N. Exon Smith	acb367bd62	lit: Set a base directory for compiler-rt tests Setting this parameter enables llvm-lit to run on source directories for compiler-rt test suites that implement magic in their lit.cfg. <rdar://problem/16458307> llvm-svn: 205262	2014-03-31 23:14:10 +00:00
Arnold Schwaighofer	15262e6703	Revert "SLPVectorizer: Ignore users that are insertelements we can reschedule them" This reverts commit r205018. Conflicts: lib/Transforms/Vectorize/SLPVectorizer.cpp test/Transforms/SLPVectorizer/X86/insert-element-build-vector.ll This is breaking libclc build. llvm-svn: 205260	2014-03-31 23:05:56 +00:00
Joerg Sonnenberger	deb1e4adc1	Shifting into the sign bit is UB as discussed on IRC. Explicitly use the BitWord type for the constants to avoid this. llvm-svn: 205257	2014-03-31 22:53:57 +00:00
Juergen Ributzka	e117992f00	[Stackmaps] Update the stackmap format to use 64-bit relocations for the function address and properly align all entries. This commit updates the stackmap format to version 1 to indicate the reorganization of several fields. This was done in order to align stackmap entries to their natural alignment and to minimize padding. Fixes <rdar://problem/16005902> llvm-svn: 205254	2014-03-31 22:14:04 +00:00
Adam Nemet	10c4ce2584	[X86] Adjust cost of FP_TO_UINT v4f64->v4i32 as well Pretty obvious follow-on to r205159 to also handle conversion from double besides float. Fixes <rdar://problem/16373208> llvm-svn: 205253	2014-03-31 21:54:48 +00:00
Matt Arsenault	d6c4326786	R600/SI: Remove leftover pattern splitting 64-bit ors. It's now matched to the scalar 64-bit or and split later if necessary. llvm-svn: 205252	2014-03-31 21:46:46 +00:00
Manman Ren	63efd8e7e6	Register allocator: set CSRFirstUseCost to 5 for ARM64. A value of 5 means if we have a split or spill option that has a really low cost (1 << 14 is the entry frequency), we will choose to spill or split the really cold path before using a callee-saved register. This gives us the performance benefit on SPECInt2k and is also conservative. rdar://16162005 llvm-svn: 205248	2014-03-31 21:06:36 +00:00
Matt Arsenault	f751d6272d	Change shouldSplitVectorElementType to better match the description. Pass the entire vector type, and not just the element. llvm-svn: 205247	2014-03-31 20:54:58 +00:00
Rui Ueyama	f4d29bc05b	Fix MSVC warning. This patch is to fix the following warning when compiled with MSVC 64 bit. warning C4334: '<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?) llvm-svn: 205245	2014-03-31 20:04:37 +00:00
Matt Arsenault	d7bdcc46a6	R600/SI: Implement shouldConvertConstantLoadToIntImm llvm-svn: 205244	2014-03-31 19:54:27 +00:00
Hal Finkel	b811b6d0d1	Add an optional ability to expand larger BUILD_VECTORs with shuffles This adds the ability to expand large (meaning with more than two unique defined values) BUILD_VECTOR nodes in terms of SCALAR_TO_VECTOR and (legal) vector shuffles. There is now no limit of the size we are capable of expanding this way, although we don't currently do this for vectors with many unique values because of the default implementation of TLI's shouldExpandBuildVectorWithShuffles function. There is currently no functional change to any existing targets because the new capabilities are not used unless some target overrides the TLI shouldExpandBuildVectorWithShuffles function. As a result, I've not included a test case for the new functionality in this commit, but regression tests will (at least) be added soon when I commit support for the PPC QPX vector instruction set. The benefit of committing this now is that it makes the shouldExpandBuildVectorWithShuffles callback, which had to be added for other reasons regardless, fully functional. I suspect that other targets will also benefit from tuning the heuristic. llvm-svn: 205243	2014-03-31 19:42:55 +00:00
Matt Arsenault	378bf9c68b	R600: Compute masked bits for min and max llvm-svn: 205242	2014-03-31 19:35:33 +00:00
Rafael Espindola	ee1c342ef9	Don't relocate with sections if there might be a paired relocation. llvm-svn: 205240	2014-03-31 19:00:23 +00:00
Daniel Sanders	e34a120285	Revert: '[mips] Rewrite MipsAsmParser and MipsOperand.' due to buildbot errors in lld tests. It's currently unable to parse 'sym + imm' without surrounding parenthesis. llvm-svn: 205237	2014-03-31 18:51:43 +00:00
Matt Arsenault	4c53717787	R600: Add BFE, BFI, and BFM intrinsics to help with writing tests. llvm-svn: 205236	2014-03-31 18:21:18 +00:00
Matt Arsenault	b34583661b	R600: Add target nodes for BFM and BFI llvm-svn: 205235	2014-03-31 18:21:13 +00:00
Saleem Abdulrasool	2070088bef	ARM: fix typo llvm-svn: 205233	2014-03-31 18:09:10 +00:00
Rafael Espindola	c627a8750a	Now that this test is assembly, make the checks a bit stronger. This will be used for a followup patch. llvm-svn: 205232	2014-03-31 18:01:50 +00:00
Hal Finkel	b4240ca0f4	[PowerPC] Don't ever expand BUILD_VECTOR of v2i64 with shuffles If we have two unique values for a v2i64 build vector, this will always result in two vector loads if we expand using shuffles. Only one is necessary. llvm-svn: 205231	2014-03-31 17:48:16 +00:00
Hal Finkel	1977514287	Add a TLI hook to control when BUILD_VECTOR might be expanded using shuffles There are two general methods for expanding a BUILD_VECTOR node: 1. Use SCALAR_TO_VECTOR on the defined scalar values and then shuffle them together. 2. Build the vector on the stack and then load it. Currently, we use a fixed heuristic: If there are only one or two unique defined values, then we attempt an expansion in terms of SCALAR_TO_VECTOR and vector shuffles (provided that the required shuffle mask is legal). Otherwise, always expand via the stack. Even when SCALAR_TO_VECTOR is not legal, this can still be a good idea depending on what tricks the target can play when lowering the resulting shuffle. If the target can't do anything special, however, and if SCALAR_TO_VECTOR is expanded via the stack, this heuristic leads to sub-optimal code (two stack loads instead of one). Because only the target knows whether the SCALAR_TO_VECTORs and shuffles for a build vector of a particular type are likely to be optimal, this adds a new TLI function: shouldExpandBuildVectorWithShuffles which takes the vector type and the count of unique defined values. If this function returns true, then method (1) will be used, subject to the constraint that all of the necessary shuffles are legal (as determined by isShuffleMaskLegal). If this function returns false, then method (2) is always used. This commit does not enhance the current code to support expanding a build_vector with more than two unique values using shuffles, but I'll commit an implementation of the more-general case shortly. llvm-svn: 205230	2014-03-31 17:48:10 +00:00
Daniel Sanders	0c648ba5be	[mips] Rewrite MipsAsmParser and MipsOperand. Summary: Highlights: - Registers are resolved much later (by the render method). Prior to that point, GPR32's/GPR64's are GPR's regardless of register size. Similarly FGR32's/FGR64's/AFGR64's are FGR's regardless of register size or FR mode. Numeric registers can be anything. - All registers are parsed the same way everywhere (even when handling symbol aliasing) - One consequence is that all registers can be specified numerically almost anywhere (e.g. $fccX, $wX). The exception is symbol aliasing but that can be easily resolved. - Removes the need for the hasConsumedDollar hack - Parenthesis and Bracket suffixes are handled generically - Micromips instructions are parsed directly instead of going through the standard encodings first. - rdhwr accepts all 32 registers, and the following instructions that previously xfailed now work: ddiv, ddivu, div, divu, cvt.l.[ds], se[bh], wsbh, floor.w.[ds], c.ngl.d, c.sf.s, dsbh, dshd, madd.s, msub.s, nmadd.s, nmsub.s, swxc1 - Diagnostics involving registers point at the correct character (the $) - There's only one kind of immediate in MipsOperand. LSA immediates are handled by the predicate and renderer. Lowlights: - Hardcoded '$zero' in the div patterns is handled with a hack. MipsOperand::isReg() will return true for a k_RegisterIndex token with Index == 0 and getReg() will return ZERO for this case. Note that it doesn't return ZERO_64 on isGP64() targets. - I haven't cleaned up all of the now-unused functions. Some more of the generic parser could be removed too (integers and relocs for example). - insve.df needed a custom decoder to handle the implicit fourth operand that was needed to make it parse correctly. The difficulty was that the matcher expected a Token<'0'> but gets an Imm<0>. Adding an implicit zero solved this. Reviewers: matheusalmeida, vmedic Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3222 llvm-svn: 205229	2014-03-31 17:43:46 +00:00
Paul Robinson	7c99ec5b99	Disable each MachineFunctionPass for 'optnone' functions, unless that pass normally runs at optimization level None, or is part of the register allocation pipeline. llvm-svn: 205228	2014-03-31 17:43:35 +00:00
Yaron Keren	c6a57ea4e9	Two updated tests for MinGW 32 and 64 exception handling code generation. llvm-svn: 205227	2014-03-31 17:34:15 +00:00
Hal Finkel	4c8f634f23	[PowerPC] Correct P7 dispatch unit allocation for vector instructions llvm-svn: 205222	2014-03-31 17:02:10 +00:00
Tom Roeder	ed0e88c31a	This patch fixes LTO's RecordStreamer so that it records symbols in the MCExpr part of an asm .symver directive as being used. This prevents referenced functions from being internalized and deleted. Without the patch to LTOModule.cpp, the test case will produce the error: LLVM ERROR: A @@ version cannot be undefined. llvm-svn: 205221	2014-03-31 16:59:13 +00:00
Saleem Abdulrasool	28b82bc39e	Support: generalise object type handling for Windows This generalises the object file type parsing to all Windows environments. This is used by cygwin as well as MSVC environments for MCJIT. This also makes the triple more similar to Chandler's suggestion of a separate field for the object file format. llvm-svn: 205219	2014-03-31 16:34:41 +00:00
Eli Bendersky	6a0ccfb585	PR19099 - revert r203483 Now that r205212 was committed, r203483 is no longer necessary; it was a temporary workaround that only handled a small number of the problematic cases. llvm-svn: 205216	2014-03-31 16:11:57 +00:00
Christian Pirker	6d301b8ff2	ARM: change parameter names of the ELFARMAsmBackend constructor I removed the underscore at the beginning of the parameter name, because of a comment from Tim. llvm-svn: 205215	2014-03-31 16:06:39 +00:00
Robert Khasanov	ed0b2e9733	Test commit. llvm-svn: 205214	2014-03-31 16:01:38 +00:00
Daniel Sanders	d69adeb8e7	[mips] Fix use of uninitialized value reported by the sanitizer-x86_64-linux-bootstrap buildbot llvm-svn: 205213	2014-03-31 15:58:58 +00:00
Eli Bendersky	264cd4672d	Fix for PR19099 - NVPTX produces invalid symbol names. This is a more thorough fix for the issue than r203483. An IR pass will run before NVPTX codegen to make sure there are no invalid symbol names that can't be consumed by the ptxas assembler. llvm-svn: 205212	2014-03-31 15:56:26 +00:00
Tim Northover	5081cd0f81	ARM64: add extra patterns for scalar shifts llvm-svn: 205209	2014-03-31 15:46:46 +00:00
Tim Northover	e7834c3bbc	ARM64: add extra scalar neg pattern & tests. llvm-svn: 205208	2014-03-31 15:46:42 +00:00
Tim Northover	4468670345	ARM64: add patterns for scalar sqdmlal & sqdmlsl. llvm-svn: 205207	2014-03-31 15:46:38 +00:00
Tim Northover	5731fc75af	ARM64: add more patterns for commuted fmsub operations. llvm-svn: 205206	2014-03-31 15:46:34 +00:00
Tim Northover	290e0698d4	ARM64: shuffle patterns around for fmin/fmax & add tests. llvm-svn: 205205	2014-03-31 15:46:30 +00:00
Tim Northover	903814ccd6	ARM64: add more scalar patterns for usqadd & suqadd. llvm-svn: 205204	2014-03-31 15:46:26 +00:00
Tim Northover	4c9d2c7e3f	ARM64: add more scalar patterns for reciprocal ops. llvm-svn: 205203	2014-03-31 15:46:22 +00:00
Tim Northover	f48103618e	ARM64: add i64 scalar pattern for @llvm.arm64.abs This will be used by the Clang front-end code for vabsd_s64. llvm-svn: 205202	2014-03-31 15:46:17 +00:00
Daniel Sanders	a567da5a36	[mips] Implement missing relocations in the integrated assembler. %got_hi, %got_lo, %call_hi, %call_lo, %higher, and %highest are now recognised by MipsAsmParser::getVariantKind(). To prevent future issues with missing entries in this StringSwitch, I've added an assertion to the default case. llvm-svn: 205200	2014-03-31 15:15:02 +00:00
Daniel Sanders	c991305cc9	[mips] Remove R_MIPS_GOT which isn't used and shares the same number as R_MIPS_GOT16 Unlike my previous commit, don't try to remove the corresponding VK_Mips_GOT yet even though it shares the same assembly text since that is used. llvm-svn: 205196	2014-03-31 14:47:41 +00:00
Daniel Sanders	cefddb2ca6	Revert r205194 - [mips] Removed R_MIPS_GOT. It's identical to R_MIPS_GOT16. There's a couple additional bits I missed. llvm-svn: 205195	2014-03-31 14:34:36 +00:00
Daniel Sanders	a104300dbe	[mips] Removed R_MIPS_GOT. It's identical to R_MIPS_GOT16. llvm-svn: 205194	2014-03-31 14:30:05 +00:00
Rafael Espindola	2378d4c0ce	Capitalize the D in parseDirectiveGpDWord. DWord seems to be the canonical way to camel case dword in llvm. Thanks to Daniel Sanders for noticing. llvm-svn: 205191	2014-03-31 14:15:07 +00:00
Dmitri Gribenko	1c1f487654	Remove unused private typedef llvm-svn: 205190	2014-03-31 14:14:13 +00:00
Tom Stellard	30f59417cf	R600/SI: Implement SIInstrInfo::isTriviallyRematerializable() llvm-svn: 205188	2014-03-31 14:01:56 +00:00
Tom Stellard	7ea3d6d420	R600/SI: Lower i64 SELECT by bitcasting to a vector type This allows us to replace ISD::EXTRACT_ELEMENT, which is lowered using shifts, with ISD::EXTRACT_VECTOR_ELT, which is a no-op. llvm-svn: 205187	2014-03-31 14:01:55 +00:00
Tom Stellard	7277b008ee	R600/SI: Return the correct index for VGPRs in getHWRegIndex() The register index is stored in the low 8-bits of the encoding. llvm-svn: 205186	2014-03-31 14:01:52 +00:00
Zoran Jovanovic	9b05a31f76	Fixed issue with microMIPS JAL instruction. Differential Revision: http://llvm-reviews.chandlerc.com/D3200 llvm-svn: 205185	2014-03-31 14:00:10 +00:00
NAKAMURA Takumi	66f560903f	llvm/test/MC/Mips/mips64r2/valid-xfail.s: This REQUIRES asserts. Seems it doesn't fail with -Asserts. llvm-svn: 205182	2014-03-31 13:30:02 +00:00
Daniel Sanders	8aa19c1392	[mips] Added a full set of instruction test cases for all ISA's (but not ASE's). Summary: Where those ISA's are not currently supported, the test is run with the smallest superset of that ISA. Some instructions are valid but don't pass yet. These have been placed in the valid-xfail.s's which will XPASS if _any_ instruction starts working. The valid.s's do not verify the encoding yet. There are also no tests checking that instructions from neighbouring ISA's are not accepted. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3214 llvm-svn: 205180	2014-03-31 12:13:12 +00:00
Hal Finkel	02807595fb	Look at shuffles of build_vectors in DAGCombiner::visitEXTRACT_VECTOR_ELT When the loop vectorizer vectorizes code that uses the loop induction variable, we often end up with IR like this: %b1 = insertelement <2 x i32> undef, i32 %v, i32 0 %b2 = shufflevector <2 x i32> %b1, <2 x i32> undef, <2 x i32> zeroinitializer %i = add <2 x i32> %b2, <i32 2, i32 3> If the add in this example is not legal (as is the case on PPC with VSX), it will be scalarized, and we'll end up with a number of extract_vector_elt nodes with the vector shuffle as the input operand, and that vector shuffle is fed by one or more build_vector nodes. By the time that vector operations are expanded, visitEXTRACT_VECTOR_ELT will not create new extract_vector_elt by looking through the vector shuffle (to make sure that no illegal operations are created), and so the extract_vector_elt -> vector shuffle -> build_vector is never simplified to an operand of the build vector. By looking at build_vectors through a shuffle we fix this particular situation, preventing a vector from being built, only to be deconstructed again (for the scalarized add) -- an expensive proposition when this all needs to be done via the stack. We probably want a more comprehensive fix here where we look back recursively through any shuffles to any build_vectors or scalar_to_vectors, etc. but that can come later. llvm-svn: 205179	2014-03-31 11:43:19 +00:00
Daniel Sanders	daf7c022c9	[mips] Check emitted code for llvm.bswap.i32 on MIPS16/MIPS64 and llvm.bswap.i64 on MIPS16. While reviewing r204163, I noticed that the MIPS16 test only checked for a .ent directive and didn't actually check the code emitted. Fixed this and added a check for llvm.bswap.i32 on MIPS64 at the same time. llvm-svn: 205177	2014-03-31 11:00:04 +00:00
Tim Northover	241856e5f8	ARM64: fix a couple of signed/unsigned comparison warnings. llvm-svn: 205174	2014-03-31 10:21:36 +00:00
Daniel Sanders	9cf3d3b764	[yaml2obj] Add support for ELF e_flags. Summary: The FileHeader mapping now accepts an optional Flags sequence that accepts the EF_<arch>_<flag> constants. When not given, Flags defaults to zero. Reviewers: atanasyan Reviewed By: atanasyan CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3213 llvm-svn: 205173	2014-03-31 09:44:05 +00:00
Alexey Samsonov	23aaf2a182	Try to fix MSan bootstrap bot: make ARM64Disassembler::getInstruction() always initialize Size argument. llvm-svn: 205171	2014-03-31 07:59:33 +00:00
Yaron Keren	070a752d7e	Correct OS conditionals following r204977 and r204978. Previously, MinGW OS was Triple::MinGW and Cygwin was Triple::Cygwin and now it is Triple::Win32 with Environment being GNU or Cygwin. So, TheTriple.getOS() == Triple::Win32 is replaced by TheTriple.isWindowsMSVCEnvironment() and (TheTriple.getOS() == Triple::MinGW32 || TheTriple.getOS() == Triple::Cygwin) is replaced by TheTriple.isOSCygMing() llvm-svn: 205170	2014-03-31 07:59:14 +00:00
Craig Topper	ec82847a64	[C++11] Mark more classes in the X86 target as 'final'. llvm-svn: 205166	2014-03-31 06:53:13 +00:00
Craig Topper	26eec09d84	Mark a couple of the X86 target classes as final. Allows the compiler to de-virtualize some internal calls. llvm-svn: 205165	2014-03-31 06:22:15 +00:00
NAKAMURA Takumi	82ec13e3d5	ARM64CollectLOH.cpp: Tweak \param. [-Wdocumentation] llvm-svn: 205162	2014-03-31 01:10:26 +00:00
Chandler Carruth	d28515af31	[ARM64] Fix materialization of an fp128 zero immediate. There currently is not a pattern to lower this with clever instructions that zero the register, so restrict the zero immediate legality special case to f64 and f32 (the only two sizes which fmov seems to directly support). Fixes backend errors when building code such as libxml. llvm-svn: 205161	2014-03-31 00:02:10 +00:00
Adam Nemet	6dafe97271	[X86] Adjust cost of FP_TO_UINT v8f32->v8i32 There is no direct AVX instruction to convert to unsigned. I have some ideas how we may be able to do this with three vector instructions but the current backend just bails on this to get it scalarized. See the comment why we need to adjust the cost returned by BasicTTI. The test is a bit roundabout (and checks assembly rather than bit code) because I'd like it to work even if at some point we could vectorize this conversion. Fixes <rdar://problem/16371920> llvm-svn: 205159	2014-03-30 18:07:13 +00:00
Stepan Dyatkovskiy	8baf17fc5f	PR18929: According to ARM assembler language hash symbol is optional before immediates. For example, see here for more details: http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0473j/dom1359731154529.html llvm-svn: 205157	2014-03-30 17:09:54 +00:00
Hal Finkel	90adf0fe06	Make use of previously generated stores in SelectionDAGLegalize::ExpandExtractFromVectorThroughStack When expanding EXTRACT_VECTOR_ELT and EXTRACT_SUBVECTOR using SelectionDAGLegalize::ExpandExtractFromVectorThroughStack, we store the entire vector and then load the piece we want. This is fine in isolation, but generating a new store (and corresponding stack slot) for each extraction ends up producing code of poor quality. When we scalarize a vector operation (using SelectionDAG::UnrollVectorOp for example) we generate one EXTRACT_VECTOR_ELT for each element in the vector. This used to generate one stored copy of the vector for each element in the vector. Now we search the uses of the vector for a suitable store before generating a new one, which results in much more efficient scalarization code. llvm-svn: 205153	2014-03-30 15:10:18 +00:00
NAKAMURA Takumi	411d35e25e	llvm/test/MC/ELF/nocompression.s: Loosen an expression to match "llvm-mc.EXE". llvm-svn: 205148	2014-03-30 14:04:00 +00:00
Hal Finkel	5c0d1454d6	[PowerPC] Handle VSX v2i64 SIGN_EXTEND_INREG sitofp from v2i32 to v2f64 ends up generating a SIGN_EXTEND_INREG v2i64 node (and similarly for v2i16 and v2i8). Even though there are no sign extensions (or algebraic shifts) for v2i64 types, we can handle v2i32 sign extensions by converting to and from v2i64. The small trick necessary here is to shift the i32 elements into the right lanes before the i32 -> f64 step. Because of the big-endian nature of the system, we need the i32 portion in the high word of the i64 elements. For v2i16 and v2i8 we can do the same, but we first use the default Altivec shift-based expansion from v2i16 or v2i8 to v2i32 (by casting to v4i32) and then apply the above procedure. llvm-svn: 205146	2014-03-30 13:22:59 +00:00
Chandler Carruth	9df0fd4018	[Allocator] Lift the slab size and size threshold into template parameters rather than runtime parameters. There is only one user of these parameters and they are compile time for that user. Making these compile time seems to better reflect their intended usage as well. llvm-svn: 205143	2014-03-30 12:07:07 +00:00
Chandler Carruth	a05a221e63	[Allocator] Simplify unittests by using the default size parameters in more places. llvm-svn: 205141	2014-03-30 11:36:32 +00:00
Chandler Carruth	f95623b790	[Allocator] Stop forward-declaring BumpPtrAllocator in a few places. This is a necessary step to lifting some of its configuration into template parameters rather than runtime parameters. llvm-svn: 205140	2014-03-30 11:36:29 +00:00
Chandler Carruth	a48ecb7639	Don't mark the declarations of the TSan annotation functions as weak. That causes references to them to be weak references which can collapse to null if no definition is provided. We call these functions unconditionally, so a definition must be provided. Make the definitions provided in the .cpp file weak by re-declaring them as weak just prior to defining them. This should keep compilers which cannot attach the weak attribute to the definition happy while actually resolving the symbols correctly during the link. You might ask yourself upon reading this commit log: how did any of this work before? Well, fun story. It turns out we have some code in Support (BumpPtrAllocator) which both uses virtual dispatch and has out-of-line vtables used by that virtual dispatch. If you move the virtual dispatch into its header in just the right way, the optimizer gets to devirtualize, and remove all references to the vtable. Then the sad part: the references to this one vtable were the only strong symbol uses in the support library for llvm-tblgen AFAICT. At least, after doing something just like this, these symbols stopped getting their weak definition and random calls to them would segfault instead. Yay software. llvm-svn: 205137	2014-03-30 11:20:25 +00:00
Chandler Carruth	81f7061065	[ARM64] Fix a heap-use-after-free spotted by ASan. StringRef::lower() returns a std::string. Better yet, we can now stop thinking about what it returns and write 'auto'. It does the right thing. =] llvm-svn: 205135	2014-03-30 09:08:07 +00:00
Tim Northover	bf679cec67	ARM64: uncopy/paste helper function It was doing functional but highly suspect operations on bools due to the more limited shifting operands supported by memory instructions. Should fix some MSVC warnings. llvm-svn: 205134	2014-03-30 08:30:28 +00:00
Tim Northover	6b3258f087	ARM64: remove unused variables llvm-svn: 205133	2014-03-30 07:35:48 +00:00
Tim Northover	af6bfb21cd	ARM64: remove -m32/-m64 mapping with ARM. This is causing the ARM build-bots to fail since they only include the ARM backend and can't create an ARM64 target. llvm-svn: 205132	2014-03-30 07:25:23 +00:00
Tim Northover	3e52557212	ARM64: override all the things. Actually, mostly only those in the top-level directory that already had a "virtual" attached. But it's the thought that counts and it's been a long day. llvm-svn: 205131	2014-03-30 07:25:18 +00:00
Saleem Abdulrasool	f80b49b5d2	Support: correct Windows normalisation If the environment is unknown and no object file is provided, then assume an "MSVC" environment, otherwise, set the environment to the object file format. In the case that we have a known environment but a non-native file format for Windows (COFF) which is used for MCJIT, then append the custom file format to the triple as an additional component. This fixes the MCJIT tests on Windows. llvm-svn: 205130	2014-03-30 07:19:31 +00:00
NAKAMURA Takumi	d423225159	Suppress llvm/test/CodeGen/ARM64 for targeting pecoff. ARM64 is unaware of that. FIXME: Could we support them? llvm-svn: 205126	2014-03-30 05:01:17 +00:00
NAKAMURA Takumi	4cf1a3be82	llvm/test/Transforms/LoopStrengthReduce/ARM64/lsr-*.ll: Add explicit triple arm64-unknown for targeting pecoff. llvm-svn: 205125	2014-03-30 05:01:04 +00:00
NAKAMURA Takumi	09717bd1c4	X86Subtarget.h: isTargetWindows() should tell whether he is targeting msvc. FYI, !isWindowsGNUEnvironment() is insufficient. It missed cygwin. FIXME: The name "isTargetWindows" should be fixed. llvm-svn: 205124	2014-03-30 04:35:00 +00:00
Lang Hames	c339840666	[MC] Remove an unused (and broken) variant of the setupForSymbolicDisassembly method in MCDisassembler. llvm-svn: 205123	2014-03-30 04:27:33 +00:00
Lang Hames	652b0a4f3b	[PBQP] Move invalid graph nodeId/edgeId methods into base class. llvm-svn: 205122	2014-03-30 03:47:00 +00:00
Rafael Espindola	5e66a7e699	Add a missing break. Patch by Tobias Güntner. I tried to write a test, but the only difference is the Changed value that gets returned. It can be tested with "opt -debug-pass=Executions -functionattrs", but that doesn't seem worth it. llvm-svn: 205121	2014-03-30 03:26:17 +00:00
Saleem Abdulrasool	ceec2cba64	Support: normalize the default triple on Unix This will fix cross-compiling buildbots (e.g. cygwin). This is in the same vein as SVN r205070. Apply this to fix the cross-compiling scenario, even though the preferred solution is to update the build system to normalize the embedded triple rather than perform this at runtime every time. This is meant to tide us over until that approach is fleshed out and applied. llvm-svn: 205120	2014-03-30 03:22:37 +00:00
Rafael Espindola	986b14c507	Remove dead declarations. Patch by Tobias Güntner. llvm-svn: 205119	2014-03-30 02:33:01 +00:00
Benjamin Kramer	4d951abf1f	Remove outdated comment. llvm-svn: 205117	2014-03-29 20:16:23 +00:00
Dmitri Gribenko	1fd72104ad	Fix a few -Wdocumentation warnings llvm-svn: 205116	2014-03-29 19:40:32 +00:00
Benjamin Kramer	3ad660a515	Detemplatize LOHDirective. The ARM64 backend uses it only as a container to keep an MCLOHType and Arguments around so give it its own little copy. The other functionality isn't used and we had a crazy method specialization hack in place to keep it working. Unfortunately that was incompatible with MSVC. Also range-ify a couple of loops while at it. llvm-svn: 205114	2014-03-29 19:21:20 +00:00
Benjamin Kramer	61e595be4d	ARM64: Remove unused helper function, make others static. llvm-svn: 205112	2014-03-29 18:00:49 +00:00
Benjamin Kramer	48e7e85d29	tblgen: Twinify PrintFatalError. No functionality change. llvm-svn: 205110	2014-03-29 17:17:15 +00:00
Tim Northover	4e55afed7e	TableGen: don't save a StringRef to a local std::string. This caused a failure in some Windows builds. llvm-svn: 205109	2014-03-29 16:59:27 +00:00
Benjamin Kramer	fd719b9551	Avoid storing Twines. While there nested ifs into a helper function. No functionality change. llvm-svn: 205108	2014-03-29 16:54:29 +00:00
Hal Finkel	777c9dd90a	[PowerPC] Handle v2i64 comparisons v2i64 is a legal type under VSX, however we don't have native vector comparisons. We can handle eq/ne by casting it to an Altivec type, but everything else must be expanded. llvm-svn: 205106	2014-03-29 16:04:40 +00:00
Tim Northover	adbd34e045	ARM64: format register strings without creating a local Twine. It was causing horrible failures on some build-bots. llvm-svn: 205105	2014-03-29 15:35:57 +00:00
Logan Chien	8faefa2d31	llvm-mc: Fix build breakage caused by r205050. When LLVM is not built with zlib, nocompression.s will test for the error message. But this test case will cause breakage because the exit code is non-zero. This commit fixes this issue by adding "not" to the command. llvm-svn: 205102	2014-03-29 15:10:22 +00:00
Hal Finkel	e8fba98735	[PowerPC] VSX instruction latency corrections The vector divide and sqrt instructions have high latencies, and the scalar comparisons are like all of the others. On the P7, permutations take an extra cycle over purely-simple vector ops. llvm-svn: 205096	2014-03-29 13:20:31 +00:00
Stepan Dyatkovskiy	df657cc1d5	Recommitted fix for PR18931, with extended tests set. Issue subject: Crash using integrated assembler with immediate arithmetic Fix description: Expressions like 'cmp r0, #(l1 - l2) >> 3' could not be evaluated at the asm parsing stage, since it is impossible to resolve labels at this stage. At the end of the stage we still have an expression (MCExpr). Then, when we want to encode it, we expect it to be an immediate, but it is still an expression. Patch introduces a Fixup (MCFixup instance), that is processed after the main encoding stage. llvm-svn: 205094	2014-03-29 13:12:40 +00:00
Tim Northover	2125374ecf	ARM64: use 64-bit constant even on 32-bit machines Another existing bot failure so no tests. llvm-svn: 205093	2014-03-29 11:51:49 +00:00
Tim Northover	2011df293d	ARM64: change format specifier to work on 32-bit targets Existing tests were failing. llvm-svn: 205092	2014-03-29 11:47:07 +00:00
Chandler Carruth	7b7a67c5c8	[ARM64] Fix 'assert("...")' to be 'assert(0 && "...")'. Otherwise, it is no assert at all. ;] Some of these should probably be switched to llvm_unreachable, but I didn't want to perturb the behavior in this patch. Found by -Wstring-conversion, which I'll try to turn on in CMake builds at least as it is finding useful things. llvm-svn: 205091	2014-03-29 11:07:40 +00:00
Tim Northover	00ed9964c6	ARM64: initial backend import This adds a second implementation of the AArch64 architecture to LLVM, accessible in parallel via the "arm64" triple. The plan over the coming weeks & months is to merge the two into a single backend, during which time thorough code review should naturally occur. Everything will be easier with the target in-tree though, hence this commit. llvm-svn: 205090	2014-03-29 10:18:08 +00:00
Tim Northover	3e38d290c8	TableGen: avoid dereferencing nullptr variable ARM64 ended up reaching odder parts of TableGen alias generation than current backends and caused a segfault. llvm-svn: 205089	2014-03-29 09:03:22 +00:00
Tim Northover	753eca0f78	CodeGen: add sensible defaults for the ISD::FROUND operation Some exotic types didn't know how to handle FROUND, which ARM64 uses. llvm-svn: 205088	2014-03-29 09:03:18 +00:00

101961 Commits