llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	3b684d8359	ARM: use pristine object file while processing relocations Previously we would read-modify-write the target bits when processing relocations for the MCJIT. This had the problem that when relocations were processed multiple times for the same object file (as they can be), the result is not idempotent and the values became corrupted. The solution to this is to take any bits used in the destination from the pristine object file as LLVM emitted it. This should fix PR16013 and remote MCJIT on ARM ELF targets. llvm-svn: 182800	2013-05-28 19:48:19 +00:00
Manman Ren	b5b5453e61	LTO+Debug Info: correctly emit inlined_subroutine when the inlined callee is from a different CU. We used to print out an error message and fail to generate inlined_subroutine. If we use ref_addr in the generated DWARF, the DWARF version should be 3 or above. rdar://13926659 llvm-svn: 182791	2013-05-28 19:01:58 +00:00
Jyotsna Verma	cceafb2d6d	Hexagon: Typo fix. llvm-svn: 182790	2013-05-28 19:01:45 +00:00
Chad Rosier	1bbbb3128a	Remove the MCRegAliasIterator tables and compute the aliases dynamically. The size reduction in the RegDiffLists are rather dramatic. Here are a few size differences for MCTargetDesc.o files (before and after) in bytes: R600 - 36160B - 11184B - 69% reduction ARM - 28480B - 8368B - 71% reduction Mips - 816B - 576B - 29% reduction One side effect of dynamically computing the aliases is that the iterator does not guarantee that the entries are ordered or that duplicates have been removed. The documentation implies this is a safe assumption and I found no clients that requires these attributes (i.e., strict ordering and uniqueness). My local LNT tester results showed no execution-time failures or significant compile-time regressions (i.e., beyond what I would consider noise) for -O0g, -O2 and -O3 runs on x86_64 and i386 configurations. rdar://12906217 llvm-svn: 182783	2013-05-28 18:08:48 +00:00
Benjamin Kramer	262b154247	Simplify code. No functionality change. llvm-svn: 182779	2013-05-28 16:39:36 +00:00
Benjamin Kramer	351d53c225	Remove double semicolons. llvm-svn: 182778	2013-05-28 16:31:26 +00:00
James Molloy	f6f121e277	Extend RemapInstruction and friends to take an optional new parameter, a ValueMaterializer. Extend LinkModules to pass a ValueMaterializer to RemapInstruction and friends to lazily create Functions for lazily linked globals. This is a big win when linking small modules with large (mostly unused) library modules. llvm-svn: 182776	2013-05-28 15:17:05 +00:00
Evgeniy Stepanov	fca012334b	[msan] Fix argument shadow alignment. llvm-svn: 182771	2013-05-28 13:07:43 +00:00
Renato Golin	467e256493	Typo llvm-svn: 182766	2013-05-28 11:28:37 +00:00
Richard Sandiford	0fb90ab0cb	[SystemZ] Register compare-and-branch support This patch adds support for the CRJ and CGRJ instructions. Support for the immediate forms will be a separate patch. The architecture has a large number of comparison instructions. I think it's generally better to concentrate on using the "best" comparison instruction first and foremost, then only use something like CRJ if CR really was the natual choice of comparison instruction. The patch therefore opportunistically converts separate CR and BRC instructions into a single CRJ while emitting instructions in ISelLowering. llvm-svn: 182764	2013-05-28 10:41:11 +00:00
Renato Golin	c08f218b48	Linking ReleaseProcess doc with the world llvm-svn: 182763	2013-05-28 10:32:55 +00:00
Richard Sandiford	53c9efd9c1	[SystemZ] Tweak SystemZInstrInfo::isBranch() interface This is needed for the upcoming compare-and-branch patch. No functional change intended. llvm-svn: 182762	2013-05-28 10:13:54 +00:00
Alexey Samsonov	1eba4e3254	Revert r182715 and r182758 llvm-svn: 182761	2013-05-28 10:08:08 +00:00
Renato Golin	6347551e45	Adding ReleaseProcess doc llvm-svn: 182759	2013-05-28 09:48:52 +00:00
Alexey Samsonov	b262d264d4	Fixup for r182715: provide correct arg to --gtest-filter llvm-svn: 182758	2013-05-28 09:40:42 +00:00
Michael Kuperstein	f3e663af39	Make BasicAliasAnalysis recognize the fact a noalias argument cannot alias another argument, even if the other argument is not itself marked noalias. llvm-svn: 182755	2013-05-28 08:17:48 +00:00
Rafael Espindola	eaf53276f7	Make it explicit that GlobalAlias are ok in llvm.used. No functionality change. llvm-svn: 182747	2013-05-27 22:47:09 +00:00
Rafael Espindola	f30f2cce50	Make helper functions static. And remove header and cpp file that are empty after that. llvm-svn: 182746	2013-05-27 22:34:59 +00:00
Preston Gurd	048f99de11	Convert sqrt functions into sqrt instructions when -ffast-math is in effect. When -ffast-math is in effect (on Linux, at least), clang defines __FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the preprocessor to include <bits/math-finite.h>, which renames the sqrt functions. For instance, "sqrt" is renamed as "__sqrt_finite". This patch adds the 3 new names in such a way that they will be treated as equivalent to their respective original names. llvm-svn: 182739	2013-05-27 15:44:35 +00:00
Rafael Espindola	cca5f562db	Add a cpu to try to bring back the atom bots. llvm-svn: 182734	2013-05-27 13:22:52 +00:00
Hal Finkel	8ebfe6c263	PPC: Add a isConsecutiveLS utility function isConsecutiveLS is a slightly more general form of SelectionDAG::isConsecutiveLoad. Aside from also handling stores, it also does not assume equality of the chain operands is necessary. In the case of the PPC backend, this chain condition is checked in a more general way by the surrounding code. Mostly, this part of the refactoring in preparation for supporting optimized unaligned stores. llvm-svn: 182723	2013-05-27 02:06:39 +00:00
NAKAMURA Takumi	d5c2e60b19	llvm-objdump.cpp: Appease MSC16 x64. utostr(n++) causes internal compiler error. llvm-svn: 182722	2013-05-27 00:02:48 +00:00
Hal Finkel	7d8a691b5d	Prefer to duplicate PPC Altivec loads when expanding unaligned loads When expanding unaligned Altivec loads, we use the decremented offset trick to prevent page faults. Unfortunately, if we have a sequence of consecutive unaligned loads, this leads to suboptimal code generation because the 'extra' load from the first unaligned load can be combined with the base load from the second (but only if the decremented offset trick is not used for the first). Search up and down the chain, through loads and token factors, looking for consecutive loads, and if one is found, don't use the offset reduction trick. These duplicate loads are later combined to yield the desired sequence (in the future, we might want a more-powerful chain search, but that will require some changes to allow the combiner routines to access the AA object). This should complete the initial implementation of the optimized unaligned Altivec load expansion. There is some refactoring that should be done, but that will happen when the unaligned store expansion is added. llvm-svn: 182719	2013-05-26 18:08:30 +00:00
Kai Nacke	4157b371f6	Add LDC compiler to list of external OS projects using LLVM 3.3 llvm-svn: 182718	2013-05-26 17:37:43 +00:00
Andrew Trick	c66d26adf0	Fix PR16143: Insert DEBUG_VALUE before terminator. llvm-svn: 182717	2013-05-26 08:58:50 +00:00
Galina Kistanova	a035f3b2ce	Fixed bug when tests in executable partially used absolute paths. llvm-svn: 182715	2013-05-26 03:58:41 +00:00
Chris Lattner	4093afda9b	Disable the StringMapEntry copy constructor, to make sure we reject things like: "for (auto Entry : SomeStringMap)". Previously this would copy the value but not the tail allocated string data (the key). llvm-svn: 182713	2013-05-25 22:28:22 +00:00
Cameron Zwarich	80cbcd2d11	Add support for DWARF line number table entries for values in the instruction stream. llvm-svn: 182712	2013-05-25 21:56:53 +00:00
Eric Christopher	5bed56d2f5	Add some comments to the stringify function. llvm-svn: 182710	2013-05-25 05:13:17 +00:00
Hal Finkel	bc2ee4c4e6	PPC: Combine duplicate (offset) lvsl Altivec intrinsics The lvsl permutation control instruction is a function only of the alignment of the pointer operand (relative to the 16-byte natural alignment of Altivec vectors). As a result, multiple lvsl intrinsics where the operands differ by a multiple of 16 can be combined. llvm-svn: 182708	2013-05-25 04:05:05 +00:00
Andrew Trick	8972aba193	Track IR ordering of SelectionDAG nodes 4/4. Unit test cases for -pre-RA-sched=source. llvm-svn: 182706	2013-05-25 03:26:51 +00:00
Andrew Trick	e2431c64bc	Track IR ordering of SelectionDAG nodes 3/4. Remove the old IR ordering mechanism and switch to new one. Fix unit test failures. llvm-svn: 182704	2013-05-25 03:08:10 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
Andrew Trick	175143bf88	Track IR ordering of SelectionDAG nodes 1/4. Use a field in the SelectionDAGNode object to track its IR ordering. This adds fields and utility classes without changing existing interfaces or functionality. llvm-svn: 182701	2013-05-25 02:20:36 +00:00
Andrew Trick	fc1c5fe927	Fix RecyclingAllocator::PrintStats to print the underlying allocator's stats. llvm-svn: 182700	2013-05-25 01:47:42 +00:00
Eric Christopher	ba63e07f3a	Add to testsuite. llvm-svn: 182693	2013-05-24 23:20:16 +00:00
Eric Christopher	fcee6f0abc	ArrayRef-ize MD5 and clean up a few variable names. Add a stringize method to make dumping a bit easier, and add a testcase exercising a few different paths. llvm-svn: 182692	2013-05-24 23:08:17 +00:00
Hal Finkel	cf2e908014	PPC: Initial support for permutation-based unaligned Altivec loads Altivec only directly supports aligned loads, but the loads have a strange property: If given an unaligned address, they truncate the address to the next lower aligned address, and load from there. This property, along with an extra load and some special-purpose permutation-control instructions that generate the appropriate permutations from the original unaligned address, allow efficient lowering of aligned loads. This code uses the trick explained in the Apple Velocity Engine optimization overview document to prevent the needed extra load from possibly causing a page fault if the original address happens to be aligned. As noted in the FIXMEs, there are several additional optimizations that can be performed to reduce the cost of these loads even more. These will be implemented in future commits. llvm-svn: 182691	2013-05-24 23:00:14 +00:00
Michael J. Spencer	a8db3f6fa7	[Support] Remove Count{Leading,Trailing}Zeros_{32,64}. llvm-svn: 182690	2013-05-24 22:58:37 +00:00
Jim Grosbach	c161680c47	Tidy up. Whitespace. llvm-svn: 182689	2013-05-24 22:53:06 +00:00
Quentin Colombet	f482805c28	Follow up of the introduction of MCSymbolizer. - Ressurect old MCDisassemble API to soften transition. - Extend MCTargetDesc to set target specific symbolizer. llvm-svn: 182688	2013-05-24 22:51:52 +00:00
Michael Gottesman	410bd52561	clang formatted APFloat.h llvm-svn: 182686	2013-05-24 22:40:37 +00:00
Michael Gottesman	356ead3f36	clang-formatted APInt.h llvm-svn: 182685	2013-05-24 22:38:49 +00:00
Benjamin Kramer	2ce482e628	MathExtras: Return the result of find(First\|Last)Set in the input type. Otherwise ZB_Max returns a wrong result when sizeof(T) > sizeof(size_t). llvm-svn: 182684	2013-05-24 22:25:20 +00:00
Michael J. Spencer	df1ecbd734	Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. llvm-svn: 182680	2013-05-24 22:23:49 +00:00
Michael J. Spencer	795ecd2c43	[Support][MathExtras] Fix literal type issues. llvm-svn: 182679	2013-05-24 22:19:05 +00:00
Michael J. Spencer	4fd69975aa	Add missing header for atexit. llvm-svn: 182672	2013-05-24 20:54:11 +00:00
Michael J. Spencer	0d9d75f2ec	[Support][MathExtras] Add missing include and disable _BitScan{Forward,Reverse}64 on non x64 MSVC systems. llvm-svn: 182671	2013-05-24 20:51:59 +00:00
Michael Gottesman	e67f40c514	[objc-arc] KnownSafe does not imply that it is safe to perform code motion across CFG edges since even if it is safe to remove RR pairs, we may still be able to move a retain/release into a loop. rdar://13949644 llvm-svn: 182670	2013-05-24 20:44:05 +00:00
Michael Gottesman	5a91bbf33a	[objc-arc] Make sure that multiple owners is propogated correctly through the pass via the usage of a global data structure. rdar://13750319 llvm-svn: 182669	2013-05-24 20:44:02 +00:00
Michael J. Spencer	eb91eac9fb	[Support] Add type generic bit utilities to MathExtras.h llvm-svn: 182667	2013-05-24 20:29:47 +00:00
Benjamin Kramer	6ac1e62377	LoopVectorize: LoopSimplify can't canonicalize loops with an indirectbr in it, don't assert on those cases. Fixes PR16139. llvm-svn: 182656	2013-05-24 18:05:35 +00:00
Diego Novillo	c2c4467690	Do not reserve space for the ColdEdges and NormalEdges vectors. Discussion and rationale at http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130520/175698.html llvm-svn: 182653	2013-05-24 17:00:22 +00:00
Richard Sandiford	dc5ed71353	[SystemZ] Improve AsmParser handling of invalid instructions Previously, an invalid instruction like: foo %r1, %r0 would generate the rather odd error message: ....: error: unknown token in expression foo %r1, %r0 ^ We now get the more informative: ....: error: invalid instruction foo %r1, %r0 ^ The same would happen if an address were used where a register was expected. We now get "invalid operand for instruction" instead. llvm-svn: 182644	2013-05-24 14:26:46 +00:00
Richard Sandiford	675f86996a	[SystemZ] Improve AsmParser register parsing The idea is to make sure that: (1) "register expected" is restricted to cases where ParseRegister() is called and the token obviously isn't a register. (2) "invalid register" is restricted to cases where a register-like "%..." sequence is found, but the "..." makes no sense. (3) the generic "invalid operand for instruction" is used in cases where the wrong register type is used (GPR instead of FPR, etc.). (4) the new "invalid register pair" is used if the register has the right type, but is not a valid register pair. Testing of (1)-(3) is now restricted to regs-bad.s. It uses a representative instruction for each register class to make sure that only registers from that class are accepted. (4) is tested by both regs-bad.s (which checks all invalid register pairs) and insn-bad.s (which tests one invalid pair for each instruction that requires a pair). While there, I changed "Number" to "Num" for consistency with the operand class. llvm-svn: 182643	2013-05-24 14:14:38 +00:00
Joey Gouly	b34294d0e4	Run clang-format over the scalarizePHI function. llvm-svn: 182640	2013-05-24 12:33:28 +00:00
Joey Gouly	83699284be	scalarizePHI needs to insert the next ExtractElement in the same block as the BinaryOperator, not in the block where the IRBuilder is currently inserting into. Fixes a bug where scalarizePHI would create instructions that would not dominate all uses. llvm-svn: 182639	2013-05-24 12:29:54 +00:00
Diego Novillo	c63995394d	Add a new function attribute 'cold' to functions. Other than recognizing the attribute, the patch does little else. It changes the branch probability analyzer so that edges into blocks postdominated by a cold function are given low weight. Added analysis and code generation tests. Added documentation for the new attribute. llvm-svn: 182638	2013-05-24 12:26:52 +00:00
Benjamin Kramer	534d3a4670	Remove the Copied parameter from MemoryObject::readBytes. There was exactly one caller using this API right, the others were relying on specific behavior of the default implementation. Since it's too hard to use it right just remove it and standardize on the default behavior. Defines away PR16132. llvm-svn: 182636	2013-05-24 10:54:58 +00:00
Daniel Jasper	01a8079bf2	Fix unused warning in opt builds. In these builds, the asserts() are completely compiled out of the code leaving "End" unused. Directly accessing it, should not have a performance impact, as it is just a data member. llvm-svn: 182634	2013-05-24 06:26:18 +00:00
Ahmed Bougacha	aa79068157	MC: Disassembled CFG reconstruction. This patch builds on some existing code to do CFG reconstruction from a disassembled binary: - MCModule represents the binary, and has a list of MCAtoms. - MCAtom represents either disassembled instructions (MCTextAtom), or contiguous data (MCDataAtom), and covers a specific range of addresses. - MCBasicBlock and MCFunction form the reconstructed CFG. An MCBB is backed by an MCTextAtom, and has the usual successors/predecessors. - MCObjectDisassembler creates a module from an ObjectFile using a disassembler. It first builds an atom for each section. It can also construct the CFG, and this splits the text atoms into basic blocks. MCModule and MCAtom were only sketched out; MCFunction and MCBB were implemented under the experimental "-cfg" llvm-objdump -macho option. This cleans them up for further use; llvm-objdump -d -cfg now generates graphviz files for each function found in the binary. In the future, MCObjectDisassembler may be the right place to do "intelligent" disassembly: for example, handling constant islands is just a matter of splitting the atom, using information that may be available in the ObjectFile. Also, better initial atom formation than just using sections is possible using symbols (and things like Mach-O's function_starts load command). This brings two minor regressions in llvm-objdump -macho -cfg: - The printing of a relocation's referenced symbol. - An annotation on loop BBs, i.e., which are their own successor. Relocation printing is replaced by the MCSymbolizer; the basic CFG annotation will be superseded by more related functionality. llvm-svn: 182628	2013-05-24 01:07:04 +00:00
Ahmed Bougacha	ad1084de84	Add MCSymbolizer for symbolic/annotated disassembly. This is a basic first step towards symbolization of disassembled instructions. This used to be done using externally provided (C API) callbacks. This patch introduces: - the MCSymbolizer class, that mimics the same functions that were used in the X86 and ARM disassemblers to symbolize immediate operands and to annotate loads based off PC (for things like c string literals). - the MCExternalSymbolizer class, which implements the old C API. - the MCRelocationInfo class, which provides a way for targets to translate relocations (either object::RelocationRef, or disassembler C API VariantKinds) to MCExprs. - the MCObjectSymbolizer class, which does symbolization using what it finds in an object::ObjectFile. This makes simple symbolization (with no fancy relocation stuff) work for all object formats! - x86-64 Mach-O and ELF MCRelocationInfos. - A basic ARM Mach-O MCRelocationInfo, that provides just enough to support the C API VariantKinds. Most of what works in otool (the only user of the old symbolization API that I know of) for x86-64 symbolic disassembly (-tvV) works, namely: - symbol references: call _foo; jmp 15 <_foo+50> - relocations: call _foo-_bar; call _foo-4 - __cf?string: leaq 193(%rip), %rax ## literal pool for "hello" Stub support is the main missing part (because libObject doesn't know, among other things, about mach-o indirect symbols). As for the MCSymbolizer API, instead of relying on the disassemblers to call the tryAdding* methods, maybe this could be done automagically using InstrInfo? For instance, even though PC-relative LEAs are used to get the address of string literals in a typical Mach-O file, a MOV would be used in an ELF file. And right now, the explicit symbolization only recognizes PC-relative LEAs. InstrInfo should have already have most of what is needed to know what to symbolize, so this can definitely be improved. I'd also like to remove object::RelocationRef::getValueString (it seems only used by relocation printing in objdump), as simply printing the created MCExpr is definitely enough (and cleaner than string concats). llvm-svn: 182625	2013-05-24 00:39:57 +00:00
Ulrich Weigand	9948546923	[PowerPC] Remove symbolLo/symbolHi instruction operand types Now that there is no longer any distinction between symbolLo and symbolHi operands in either printing, encoding, or parsing, the operand types can be removed in favor of simply using s16imm. This completes the patch series to decouple lo/hi operand part processing from the particular instruction whose operand it is. No change in code generation expected from this patch. llvm-svn: 182618	2013-05-23 22:48:06 +00:00
Daniel Malea	fddddbeab0	Re-implement DebugIR in a way that does not subclass AssemblyWriter: - move AsmWriter.h from public headers into lib - marked all AssemblyWriter functions as non-virtual; no need to override them - DebugIR now "plugs into" AssemblyWriter with an AssemblyAnnotationWriter helper - exposed flags to control hiding of a) debug metadata b) debug intrinsic calls C/R: Paul Redmond llvm-svn: 182617	2013-05-23 22:34:33 +00:00
Ulrich Weigand	41789de165	[PowerPC] Clean up generation of ha16() / lo16() markers When targeting the Darwin assembler, we need to generate markers ha16() and lo16() to designate the high and low parts of a (symbolic) immediate. This is necessary not just for plain symbols, but also for certain symbolic expression, typically along the lines of ha16(A - B). The latter doesn't work when simply using VariantKind flags on the symbol reference. This is why the current back-end uses hacks (explicitly called out as such via multiple FIXMEs) in the symbolLo/symbolHi print methods. This patch uses target-defined MCExpr codes to represent the Darwin ha16/lo16 constructs, following along the lines of the equivalent solution used by the ARM back end to handle their :upper16: / :lower16: markers. This allows us to get rid of special handling both in the symbolLo/symbolHi print method and in the common code MCExpr::print routine. Instead, the ha16 / lo16 markers are printed simply in a custom print routine for the target MCExpr types. (As a result, the symbolLo/symbolHi print methods can now replaced by a single printS16ImmOperand routine that also handles symbolic operands.) The patch also provides a EvaluateAsRelocatableImpl routine to handle ha16/lo16 constructs. This is not actually used at the moment by any in-tree code, but is provided as it makes merging into David Fang's out-of-tree Mach-O object writer simpler. Since there is no longer any need to treat VK_PPC_GAS_HA16 and VK_PPC_DARWIN_HA16 differently, they are merged into a single VK_PPC_ADDR16_HA (and likewise for the _LO16 types). llvm-svn: 182616	2013-05-23 22:26:41 +00:00
Bill Wendling	f44b2a2e2d	The command line options need to be processed before we create the TargetMachine. Move the processing of the command line options to right before we create the TargetMachine instead of after. <rdar://problem/13468287> llvm-svn: 182611	2013-05-23 21:21:50 +00:00
Tim Northover	bc93308489	ARM: implement @llvm.readcyclecounter intrinsic This implements the @llvm.readcyclecounter intrinsic as the specific MRC instruction specified in the ARM manuals for CPUs with the Power Management extensions. Older CPUs had slightly different methods which may also have to be implemented eventually, but this should cover all v7 cases. rdar://problem/13939186 llvm-svn: 182603	2013-05-23 19:11:20 +00:00
Tim Northover	cedd48183f	ARM: Add Performance Monitor Extensions feature Performance monitors, including a basic cycle counter, are an official extension in the ARMv7 specification. This adds support for enabling and disabling them, orthogonally from CPU selection. rdar://problem/13939186 llvm-svn: 182602	2013-05-23 19:11:14 +00:00
Tom Stellard	1b086cbcb8	R600: Fix R600ControlFlowFinalizer not considering VTX_READ 128 bit dst reg Patch by: Vincent Lejeune https://bugs.freedesktop.org/show_bug.cgi?id=64877 NOTE: This is a candidate for the 3.3 branch. llvm-svn: 182600	2013-05-23 18:26:42 +00:00
Benjamin Kramer	d78bb468bd	Move passes from namespace llvm into anonymous namespaces. Sort includes while there. llvm-svn: 182594	2013-05-23 17:10:37 +00:00
Jakob Stoklund Olesen	43711c51ec	Fix PR16110: Handle DBG_VALUE in ConnectedVNInfoEqClasses::Distribute(). Now that the LiveDebugVariables pass is running after register coalescing, the ConnectedVNInfoEqClasses class needs to deal with DBG_VALUE instructions. This only comes up when rematerialization during coalescing causes the remaining live range of a virtual register to separate into two connected components. llvm-svn: 182592	2013-05-23 17:02:23 +00:00
Benjamin Kramer	ad5c24f161	More symbols that should be static. llvm-svn: 182590	2013-05-23 16:09:15 +00:00
Benjamin Kramer	e79beacb32	Hexagon: Make helper functions static. llvm-svn: 182588	2013-05-23 15:43:11 +00:00
Benjamin Kramer	635e368e33	R600: Hide symbols of implementation details. Also removes an unused function. llvm-svn: 182587	2013-05-23 15:43:05 +00:00
Benjamin Kramer	bc6666bedf	InlineSpiller: Store bucket pointers instead of iterators. Lets us use a SetVector instead of an explicit set + vector combination. llvm-svn: 182586	2013-05-23 15:42:57 +00:00
Aaron Ballman	15f193a1a3	Setting the default value (fixes CRT assertions about uninitialized variable use when doing debug MSVC builds), and fixing coding style. llvm-svn: 182585	2013-05-23 14:55:00 +00:00
Rafael Espindola	00345fa97b	Fix 32 bit build in c++11 mode. The error was: error: non-constant-expression cannot be narrowed from type 'long long' to 'long' in initializer list [-Wc++11-narrowing] MI.getOperand(6).getImm() & 0x1F, llvm-svn: 182584	2013-05-23 13:22:30 +00:00
Nick Lewycky	7b431030ac	Add missing test from r175092. llvm-svn: 182564	2013-05-23 07:46:13 +00:00
Rafael Espindola	39aca620db	Fix a leak on the r600 backend. This should bring the valgrind bot back to life. llvm-svn: 182561	2013-05-23 03:31:47 +00:00
Rafael Espindola	bd6847fbea	clang-format this file. llvm-svn: 182560	2013-05-23 03:28:39 +00:00
Rafael Espindola	d02e7e5693	Remove redundant rpath. These are not needed since we added the $ORIGIN based rpath. Fixes pr12517. llvm-svn: 182559	2013-05-23 02:53:22 +00:00
Rafael Espindola	1250a307f6	Fix indentation. llvm-svn: 182558	2013-05-23 02:38:50 +00:00
Michael Gottesman	740db977f6	[objc-arc] Fixed number of prefixing slashes in some comments in a function from 3 to 2 to match the rest of ObjCARCOpts. llvm-svn: 182557	2013-05-23 02:35:21 +00:00
Michael Gottesman	9964db9252	Fixed trailing whitespace. llvm-svn: 182556	2013-05-23 02:03:05 +00:00
Michael Gottesman	ea77dd14de	Updated the comments of APInt.h to match the llvm style guide and be consistent. No functionality change. llvm-svn: 182555	2013-05-23 02:00:03 +00:00
Kevin Enderby	64d934507e	Missed removing one of the assert()'s from the LLVMCreateDisasmCPU() library API with my 176880 revision. If a bad Triple is passed in it can also assert. In this case too it should just return 0 to indicate failure to create the disassembler. rdar://13955214 llvm-svn: 182542	2013-05-23 00:32:34 +00:00
Chad Rosier	3821723c32	Minor fix to comment from my previous commit. llvm-svn: 182536	2013-05-22 23:25:59 +00:00
Chad Rosier	81f43ae23a	Simplify the logic described in the comment. llvm-svn: 182534	2013-05-22 23:23:14 +00:00
David Blaikie	5174c84add	Solidify the assumption that a DW_TAG_subprogram's type is a DW_TAG_subroutine_type There were bits & pieces of code lying around that may've given the impression that debug info metadata supported the possibility that a subprogram's type could be specified by a non-subroutine type describing the return type of a void function. This support was incomplete & unnecessary. Asserts & API have been changed to make the desired usage more clear. llvm-svn: 182532	2013-05-22 23:22:18 +00:00
Chad Rosier	abdb1d69ab	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182531	2013-05-22 23:17:36 +00:00
Chad Rosier	682ae15bb9	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182527	2013-05-22 22:36:55 +00:00
Chad Rosier	c7505ef8ba	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182526	2013-05-22 22:26:05 +00:00
Bill Schmidt	9b703f9c5d	Recognize ValueType operands in source patterns for fast-isel. Currently the fast-isel table generator recognizes registers, register classes, and immediates for source pattern operands. ValueType operands are not recognized. This is not a problem for existing targets with fast-isel support, but will not work for targets like PowerPC and SPARC that use types in source patterns. The proposed patch allows ValueType operands and treats them in the same manner as register classes. There is no convenient way to map from a ValueType to a register class, but there's no need to do so. The table generator already requires that all types in the source pattern be identical, and we know the register class of the output operand already. So we just assign that register class to any ValueType operands we encounter. No functional effect on existing targets. Testing deferred until the PowerPC target implements fast-isel. llvm-svn: 182512	2013-05-22 20:45:11 +00:00
Bill Schmidt	f88571e027	Change some PowerPC PatLeaf definitions to ImmLeaf for fast-isel. Using PatLeaf rather than ImmLeaf when defining immediate predicates prevents simple patterns using those predicates from being recognized for fast instruction selection. This patch replaces the immSExt16 PatLeaf predicate with two ImmLeaf predicates, imm32SExt16 and imm64SExt16, allowing a few more patterns to be recognized (ADDI, ADDIC, MULLI, ADDI8, and ADDIC8). Using the new predicates does not help for LI, LI8, SUBFIC, and SUBFIC8 because these are rejected for other reasons, but I see no reason to retain the PatLeaf predicate. No functional change intended, and thus no test cases yet. This is preliminary work for enabling fast-isel support for PowerPC. When that support is ready, we'll be able to test this function. llvm-svn: 182510	2013-05-22 20:09:24 +00:00
Nadav Rotem	9e00eb38a2	SLPVectorizer: Change the order in which new instructions are added to the function. We are not working on a DAG and I ran into a number of problems when I enabled the vectorizations of 'diamond-trees' (trees that share leafs). * Imroved the numbering API. * Changed the placement of new instructions to the last root. * Fixed a bug with external tree users with non-zero lane. * Fixed a bug in the placement of in-tree users. llvm-svn: 182508	2013-05-22 19:47:32 +00:00
Nadav Rotem	7b66c47051	X86: Fix a bug in EltsFromConsecutiveLoads. We can't generate new loads without chains. llvm-svn: 182507	2013-05-22 19:28:41 +00:00
Reid Kleckner	d082ea15f4	Remove unneeded call to a base default ctor llvm-svn: 182503	2013-05-22 19:07:26 +00:00
Jean-Luc Duprat	0dda6f168c	This is an update to a previous commit (r181216). The earlier change list introduced the following inst combines: B * (uitofp i1 C) —> select C, B, 0 A * (1 - uitofp i1 C) —> select C, 0, A select C, 0, B + select C, A, 0 —> select C, A, B Together these 3 changes would simplify : A * (1 - uitofp i1 C) + B * uitofp i1 C down to : select C, B, A In practice we found that the first two substitutions can have a negative effect on performance, because they reduce opportunities to use FMA contractions; between the two options FMAs are often the better choice. This change list amends the previous one to enable just these inst combines: select C, B, 0 + select C, 0, A —> select C, B, A A * (1 - uitofp i1 C) + B * uitofp i1 C —> select C, B, A llvm-svn: 182499	2013-05-22 18:29:31 +00:00
Rui Ueyama	142736fc64	Fix typo in docs/GettingStarted.rst. llvm-svn: 182496	2013-05-22 18:09:39 +00:00
Adrian Prantl	0d1e5592a6	Unify formatting of debug output. llvm-svn: 182495	2013-05-22 18:02:19 +00:00
Reid Kleckner	ef5f065f88	Fix StringMapIterator compile errors for non-MSVC compilers. llvm-svn: 182493	2013-05-22 17:32:15 +00:00
Chad Rosier	4233f1f268	Add the IncludeSelf parameter to the MCSubRegIterator and MCSuperRegIterator constructors. No functional change. Part of rdar://12906217 llvm-svn: 182490	2013-05-22 17:26:26 +00:00
Reid Kleckner	1fc96a323e	[Support] Add StringMap::swap() and a default ctor for iterators This makes StringMap<> more compatible with std::map<std::string, ...>. Differential Revision: http://llvm-reviews.chandlerc.com/D842 llvm-svn: 182487	2013-05-22 17:10:11 +00:00
Benjamin Kramer	d76cc186fc	X86: When expanding PCMPGTQ to PCMPGTD we always want to compare the lower halves as unsigned. Take #2 on fixing PR15977. llvm-svn: 182486	2013-05-22 17:01:12 +00:00
Arnold Schwaighofer	12b0d1cda0	LoopVectorize: Make Value pointers that could be RAUW'ed a VH The Value pointers we store in the induction variable list can be RAUW'ed by a call to SCEVExpander::expandCodeFor, use a TrackingVH instead. Do the same thing in some other places where we store pointers that could potentially be RAUW'ed. Fixes PR16073. llvm-svn: 182485	2013-05-22 16:54:56 +00:00
Rafael Espindola	e3d83fb8c3	Fix use after free (pr16103). llvm-svn: 182482	2013-05-22 15:31:11 +00:00
Rafael Espindola	ebd8e38849	Check that a function starts with llvm. before using GET_FUNCTION_RECOGNIZER. Fixes a use of uninitialized memory found by asan and valgind. llvm-svn: 182480	2013-05-22 14:57:42 +00:00
Richard Sandiford	14a4449589	[SystemZ] Rename PSW to CC Addresses a review comment from Ulrich Weigand. No functional change intended. I'm not sure whether the old TODO that this patch touches still holds, but that's something we'd get to when adding a targetted scheduling description. llvm-svn: 182474	2013-05-22 13:38:45 +00:00
Rafael Espindola	9ac7b48ddb	sync projects/sample's autohell. llvm-svn: 182464	2013-05-22 12:37:27 +00:00
Richard Sandiford	03528f346a	[SystemZ] Fix thinko in long branch pass The original version of the pass could underestimate the length of a backward branch in cases like: alignment to N bytes or more ... relaxable branch A ... foo: (aligned to M<N bytes) ... bar: (aligned to N bytes) ... relaxable branch B to foo We don't add any misalignment gap for "bar" because N bytes of alignment had already been reached earlier in the function. In this case, assuming that A is relaxed can push "foo" closer to "bar", and make B appear to be in range. Similar problems can occur for forward branches. I don't think it's possible to create blocks with mixed alignments as things stand, not least because we haven't yet defined getPrefLoopAlignment() for SystemZ (that would need benchmarking). So I don't think we can test this yet. Thanks to Rafael Espíndola for spotting the bug. llvm-svn: 182460	2013-05-22 09:57:57 +00:00
David Majnemer	7ea2a52a0c	X86: Remove test instructions proceeding shift by immediate instructions Allow LLVM to take advantage of shift instructions that set the ZF flag, making instructions that test the destination superfluous. llvm-svn: 182454	2013-05-22 08:13:02 +00:00
NAKAMURA Takumi	4f328e1c2f	R600ISelLowering.cpp: Avoid "using namespace Intrinsic;" to appease MSC. Specify namespaces explicitly here. MSC is confused about "memcpy" between <cstring> and llvm::Intrinsic::memcpy, when llvm::Intrinsic were exposed. llvm-svn: 182452	2013-05-22 06:37:31 +00:00
NAKAMURA Takumi	18ca09c1cc	R600: Whitespace and untabify. llvm-svn: 182451	2013-05-22 06:37:25 +00:00
Owen Anderson	616852848a	Create an FPOW SDNode opcode def in the target independent .td file rather than in a specific backend. llvm-svn: 182450	2013-05-22 06:36:09 +00:00
Filip Pizlo	3fdbaff3b9	Expose the RTDyldMemoryManager through the C API. This allows clients of the C API to provide their own way of allocating JIT memory (both code and data) and finalizing memory permissions (page protections, cache flush). llvm-svn: 182448	2013-05-22 02:46:43 +00:00
Rafael Espindola	cf1e6574c2	Allow duplicates in LLVM_TARGETS_TO_BUILD and LLVM_EXPERIMENTAL_TARGETS_TO_BUILD. Should fix the cmake bots that were already building R600. llvm-svn: 182447	2013-05-22 02:45:28 +00:00
Rafael Espindola	21ea01d132	Attempt to fix the mingw32 bot. This should hopefully fix http://lab.llvm.org:8011/builders/clang-x86_64-darwin11-self-mingw32 llvm-svn: 182446	2013-05-22 02:30:47 +00:00
Rafael Espindola	525cf28652	s/u_int32_t/uint32_t/ llvm-svn: 182444	2013-05-22 01:36:19 +00:00
Rafael Espindola	f568827654	Fix warning in non-assert build. llvm-svn: 182443	2013-05-22 01:29:38 +00:00
Rafael Espindola	f6474d2834	Make R600 non-experimental. The r600 backend has been in tree for some time now. Marking it as non-experimental to avoid accidental breakage. llvm-svn: 182442	2013-05-22 00:35:47 +00:00
Reed Kotler	c6c7e4a67c	Mips16 does not use register scavenger from TargetRegisterInfo. It allocates a RegScavenger object on it's own. llvm-svn: 182430	2013-05-21 22:06:02 +00:00
Eric Christopher	cecf828972	Be more specific and capitalize filenames. llvm-svn: 182424	2013-05-21 21:22:34 +00:00
Jakob Stoklund Olesen	23386ed8f2	Define BYTE_ORDER on Solaris. Solaris doesn't have an endian.h header, but SPARC is the only big-endian architecture that runs Solaris, so just use that to detect endianness at compile time. llvm-svn: 182419	2013-05-21 20:36:13 +00:00
Filip Pizlo	1cec8abfe9	Put RTDyldMemoryManager into its own file, and make it linked into libExecutionEngine. Move method implementations that aren't specific to allocation out of SectionMemoryManager and into RTDyldMemoryManager. This is in preparation for exposing RTDyldMemoryManager through the C API. This is a fixed version of r182407 and r182411. That first revision broke builds because I forgot to move the conditional includes of various POSIX headers from SectionMemoryManager into RTDyldMemoryManager. Those includes are necessary because of how getPointerToNamedFunction works around the glibc libc_nonshared.a thing. The latter revision still broke things because I forgot to include llvm/Config/config.h. llvm-svn: 182418	2013-05-21 20:24:07 +00:00
Filip Pizlo	9d801b1084	Roll out r182411 and 182412 because it's still broken. llvm-svn: 182415	2013-05-21 20:17:14 +00:00
Filip Pizlo	76a95062da	Fix busted comment. This conditional include block used to be in SectionMemoryManager, but is now in RTDyldMemoryManager. llvm-svn: 182412	2013-05-21 20:11:01 +00:00
Filip Pizlo	b2a1e19a2d	Put RTDyldMemoryManager into its own file, and make it linked into libExecutionEngine. Move method implementations that aren't specific to allocation out of SectionMemoryManager and into RTDyldMemoryManager. This is in preparation for exposing RTDyldMemoryManager through the C API. This is a fixed version of r182407. That revision broke builds because I forgot to move the conditional includes of various POSIX headers from SectionMemoryManager into RTDyldMemoryManager. Those includes are necessary because of how getPointerToNamedFunction works around the glibc libc_nonshared.a thing. llvm-svn: 182411	2013-05-21 20:07:12 +00:00
Filip Pizlo	5aefb1339c	Roll out r182407 and r182408 because they broke builds. llvm-svn: 182409	2013-05-21 20:03:01 +00:00
Filip Pizlo	e1e3f7cc01	Expose the RTDyldMemoryManager through the C API. This allows clients of the C API to provide their own way of allocating JIT memory (both code and data) and finalizing memory permissions (page protections, cache flush). llvm-svn: 182408	2013-05-21 20:00:56 +00:00
Filip Pizlo	34b9ee6f3b	Put RTDyldMemoryManager into its own file, and make it linked into libExecutionEngine. Move method implementations that aren't specific to allocation out of SectionMemoryManager and into RTDyldMemoryManager. This is in preparation for exposing RTDyldMemoryManager through the C API. llvm-svn: 182407	2013-05-21 19:56:00 +00:00
Rafael Espindola	c823f00ed1	Use std::list so that we have a stable iterator. I will try to avoid creating these std::strings, but for now this gets the tests passing with libc++. llvm-svn: 182405	2013-05-21 18:53:50 +00:00
Benjamin Kramer	298526a97f	Remove duplicated comment. Found by -Wdocumentation. llvm-svn: 182402	2013-05-21 18:06:33 +00:00
Rafael Espindola	e5cf1ba5b9	Regenerate configure. llvm-svn: 182401	2013-05-21 17:59:15 +00:00
Akira Hatanaka	be76cd0b8e	[mips] Rename option to make it compatible with gcc. llvm-svn: 182397	2013-05-21 17:17:59 +00:00
Akira Hatanaka	6871031be9	[mips] Add instruction selection patterns for blez and bgez. llvm-svn: 182396	2013-05-21 17:13:47 +00:00
Justin Holewinski	48f4ad3fc0	[NVPTX] Add @llvm.nvvm.sqrt.f() intrinsic llvm-svn: 182394	2013-05-21 16:51:30 +00:00
Jyotsna Verma	1b056e422c	Hexagon: SelectionDAG should not use MVT::Other to check the legality of BR_CC. llvm-svn: 182390	2013-05-21 15:54:32 +00:00
Justin Holewinski	fff1f5f5e2	Drop @llvm.annotation and @llvm.ptr.annotation intrinsics during codegen. The intrinsic calls are dropped, but the annotated value is propagated. Fixes PR 15253 Original patch by Zeng Bin! llvm-svn: 182387	2013-05-21 14:37:16 +00:00
Hal Finkel	c5211291f1	Fix PPC branch selection for counter-based branches Although I had added some support for the BDZ/BDNZ branches into the selector (in r158204), I had not correctly adjusted the condition at the top of the loop. As a result, these branches were still essentially unsupported. This fixes PR16086. Unfortunately, any test case would be very large (because it would need to force the loop backedge to exceed the range of the 16-bit immediate). llvm-svn: 182385	2013-05-21 14:21:09 +00:00
Elena Demikhovsky	0dd4025ae9	removed commented lines llvm-svn: 182377	2013-05-21 13:27:44 +00:00
Evgeniy Stepanov	ebd7f8e7ef	[msan] A no-op implementation of VarArg handling. This stuff is used on platforms where MSan does not have a proper VarArg implementation (anything other than x86_64 at the moment). llvm-svn: 182375	2013-05-21 12:27:47 +00:00
Elena Demikhovsky	fad029202f	Removed SSEPacked domain from all forms (AVX, SSE, signed, unsigned) scalar compare instructions, like COMISS, COMISD. No functional changes. llvm-svn: 182371	2013-05-21 12:04:22 +00:00
Ulrich Weigand	7c81c7c66b	Alternative fix for problem addressed in r182233 Revision r182233 partially reverted the change in r181200 to simplify JIT unif test #ifdefs, because that change caused a link error on some host operating systems where the export list requires the following symbols to be defined: JITTest_AvailableExternallyFunction JITTest_AvailableExternallyGlobal As discussed on the list, the commit reverts r182233 (and re-installs the full r181200 change), and instead fixes the link problem by moving those two symbols to the top of the file and unconditionally defining them. llvm-svn: 182367	2013-05-21 10:30:59 +00:00
Benjamin Kramer	18ef6b22b9	X86: When emulating unsigned PCMPGTQ with PCMPGTD, fix the sign bit for the smaller type. Otherwise we'll get a mix of signed and unsigned compares. Fixes PR15977. llvm-svn: 182364	2013-05-21 09:58:54 +00:00
Richard Sandiford	586f41777e	[SystemZ] Tighten branch tests After r182274, the branches in these tests must always be short. llvm-svn: 182358	2013-05-21 08:53:17 +00:00
Benjamin Kramer	8aaf197990	DAGCombine: Avoid an edge case where it tried to create an i0 type for (x & 0) == 0. Fixes PR16083. llvm-svn: 182357	2013-05-21 08:51:09 +00:00
Richard Sandiford	3b105a063f	Fix indentation llvm-svn: 182356	2013-05-21 08:48:24 +00:00
Eric Christopher	db142d4e1e	Add cmake bits for md5. llvm-svn: 182349	2013-05-21 01:30:38 +00:00
Eric Christopher	e1dc3c45e6	Add an md5 library derived from a public domain implementation for dwarf4 type signature computation. llvm-svn: 182348	2013-05-21 01:28:35 +00:00
Reed Kotler	75653a0677	Add checks that the proper predeined stubs are being called to the test case. These were accidentally omitted. llvm-svn: 182347	2013-05-21 01:27:36 +00:00
Manman Ren	9d4c735885	Dwarf: use a single line table to generate assembly when .loc is used. This is to fix PR15408 where an undefined symbol Lline_table_start1 is used. Since we do not generate the debug_line section when .loc is used, Lline_table_start1 is not emitted and we can't refer to it when calculating at_stmt_list for a compile unit. llvm-svn: 182344	2013-05-21 00:57:22 +00:00
Reed Kotler	0fed8d4ef7	Add some additional functions to the list of helper functions for pic calls. These need to be there so we don't try and use helper functions when we call those. As part of this, make sure that we properly exclude helper functions in pic mode when indirect calls are involved. llvm-svn: 182343	2013-05-21 00:50:30 +00:00
Richard Smith	a13a12d317	Comment update: these things are called "configuration names" these days, not "triples". Also remove the implication that they're only used for specifying a target. llvm-svn: 182335	2013-05-20 23:55:41 +00:00
Sean Silva	8ca1178f45	LangRef.rst: Clarify how basic blocks without named label are handled. Describe that they are assigned numbered label using the same counter as for unnamed temporaries. Based on http://llvm.org/bugs/show_bug.cgi?id=16043 and mailing list discussion. Patch by Paul Sokolovsky! llvm-svn: 182332	2013-05-20 23:31:12 +00:00
David Blaikie	e63d5d1633	PR14606: Debug Info for namespace aliases/DW_TAG_imported_module This resolves the last of the PR14606 failures in the GDB 7.5 test suite by implementing an optional name field for DW_TAG_imported_modules/DIImportedEntities and using that to implement C++ namespace aliases (eg: "namespace X = Y;"). llvm-svn: 182328	2013-05-20 22:50:35 +00:00
Daniel Dunbar	bf2e7b593e	[docs] Minor doc tweaks. llvm-svn: 182324	2013-05-20 22:39:48 +00:00
Bill Wendling	eda5418e89	The DWARF EH pass doesn't need the TargetMachine, only the TargetLoweringBase like the other EH passes. llvm-svn: 182321	2013-05-20 21:54:18 +00:00
Bill Wendling	47447589c9	No need to store the TargetMachine variable in this class. llvm-svn: 182317	2013-05-20 21:28:28 +00:00
Bill Wendling	5f4740390e	Remove unused #include. llvm-svn: 182315	2013-05-20 20:59:12 +00:00
Hal Finkel	a969df84ab	Rename LoopSimplify.h to LoopUtils.h As discussed, LoopUtils.h is a better name. llvm-svn: 182314	2013-05-20 20:46:30 +00:00
Sebastian Pop	0bfafbaf52	add polly to check-all llvm-svn: 182308	2013-05-20 18:49:15 +00:00
Akira Hatanaka	5de4416962	[mips] Add (setne $lhs, 0) instruction selection pattern. llvm-svn: 182307	2013-05-20 18:18:07 +00:00
Akira Hatanaka	1cb024207f	[mips] Trap on integer division by zero. By default, a teq instruction is inserted after integer divide. No divide-by-zero checks are performed if option "-mnocheck-zero-division" is used. llvm-svn: 182306	2013-05-20 18:07:43 +00:00
Hal Finkel	e6d7c285b3	Remove copied preheader insertion logic from PPCCTRLoops Now that the preheader insertion logic in LoopSimplify is externally exposed, use it, and remove the copy-and-pasted version. No functionality change intended. llvm-svn: 182300	2013-05-20 16:47:10 +00:00
Hal Finkel	a12d82b421	Expose InsertPreheaderForLoop from LoopSimplify to other passes Other passes, PPC counter-loop formation for example, also need to add loop preheaders outside of the regular loop simplification pass. This makes InsertPreheaderForLoop a global function so that it can be used by other passes. No functionality change intended. llvm-svn: 182299	2013-05-20 16:47:07 +00:00
Justin Holewinski	4c47d87ba6	[NVPTX] Fix mis-use of CurrentFnSym in NVPTXAsmPrinter. This was causing a symbol name error in the output PTX. llvm-svn: 182298	2013-05-20 16:42:18 +00:00
Justin Holewinski	18f3a1ffe6	[NVPTX] Add programmatic interface to NVVMReflect pass llvm-svn: 182297	2013-05-20 16:42:16 +00:00
Hal Finkel	0859ef29d5	Rename PPC MTCTRse to MTCTRloop As the pairing of this instruction form with the bdnz/bdz branches is now enforced by the verification pass, make it clear from the name that these are used only for counter-based loops. No functionality change intended. llvm-svn: 182296	2013-05-20 16:08:37 +00:00
Hal Finkel	8ca3884147	Add a PPCCTRLoops verification pass When asserts are enabled, this adds a verification pass for PPC counter-loop formation. Unfortunately, without sacrificing code quality, there is no better way of forming counter-based loops except at the (late) IR level. This means that we need to recognize, at the IR level, anything which might turn into a function call (or indirect branch). Because this is currently a finite set of things, and because SelectionDAG lowering is basic-block local, this can be done. Nevertheless, it is fragile, and failure results in a miscompile. This verification pass checks that all (reachable) counter-based branches are dominated by a loop mtctr instruction, and that no instructions in between clobber the counter register. If these conditions are not satisfied, then an ICE will be triggered. In short, this is to help us sleep better at night. llvm-svn: 182295	2013-05-20 16:08:17 +00:00
Benjamin Kramer	927ca942ce	R600: Fix bug detected by GCC warning. R600TextureIntrinsicsReplacer.cpp:232: warning: the address of ‘ArgsType’ will always evaluate as ‘true’ This doesn't have any effect on the output as a vararg intrinsic behaves the same way as a non-vararg one. llvm-svn: 182293	2013-05-20 15:58:43 +00:00
Tom Stellard	f0de44cc89	R600: Fix rotr.ll on non-asserts builds The -debug-only option is only available on asserts builds. llvm-svn: 182291	2013-05-20 15:28:48 +00:00
Tom Stellard	f1ee716446	R600/SI: Use a multiclass for MUBUF_Load_Helper This will simplify the instructions and also the pattern definitions. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182288	2013-05-20 15:02:31 +00:00
Tom Stellard	b8458f88d6	R600/SI: Add a pattern for S_LOAD_DWORDX2_* instructions Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182287	2013-05-20 15:02:28 +00:00
Tom Stellard	d2eebf001e	R600/SI: Add pattern for rotr Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182286	2013-05-20 15:02:24 +00:00
Tom Stellard	5643c4ac72	R600: Swap the legality of rotl and rotr The hardware supports rotr and not rotl. llvm-svn: 182285	2013-05-20 15:02:19 +00:00
Tom Stellard	1cfd7a50bb	R600/SI: Add patterns for 64-bit shift operations Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182284	2013-05-20 15:02:12 +00:00
Tom Stellard	459a79a81c	R600/SI: Use the same names for VOP3 operands and encoding fields This makes it possible to reorder the operands without breaking the encoding. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182283	2013-05-20 15:02:08 +00:00
Tom Stellard	b35efba4d9	R600/SI: Make fitsRegClass() operands const Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182282	2013-05-20 15:02:01 +00:00
Mihai Popa	f41e3f56a5	VSTn instructions have a number of encoding constraints which are not implemented. I have added these using wrapper methods around the original custom decoder (incidentally - this is a huge poorly written method that should be cleaned up. I have left it as is since the changes would be much to hard to review). llvm-svn: 182281	2013-05-20 14:57:05 +00:00
Mihai Popa	dcf0922720	Q registers are encoded in fields of the same length as D registers. As Q registers are half as many, the ARM reference manual mandates the least significant bit to be zeroed out. Failure to do so should result in an undefined instruction. With this change test/MC/Disassembler/ARM/invalid-VQADD-arm.txt is passing (removed XFAIL). llvm-svn: 182279	2013-05-20 14:42:43 +00:00
Richard Sandiford	312425f32d	[SystemZ] Add long branch pass Before this change, the SystemZ backend would use BRCL for all branches and only consider shortening them to BRC when generating an object file. E.g. a branch on equal would use the JGE alias of BRCL in assembly output, but might be shortened to the JE alias of BRC in ELF output. This was a useful first step, but it had two problems: (1) The z assembler isn't traditionally supposed to perform branch shortening or branch relaxation. We followed this rule by not relaxing branches in assembler input, but that meant that generating assembly code and then assembling it would not produce the same result as going directly to object code; the former would give long branches everywhere, whereas the latter would use short branches where possible. (2) Other useful branches, like COMPARE AND BRANCH, do not have long forms. We would need to do something else before supporting them. (Although COMPARE AND BRANCH does not change the condition codes, the plan is to model COMPARE AND BRANCH as a CC-clobbering instruction during codegen, so that we can safely lower it to a separate compare and long branch where necessary. This is not a valid transformation for the assembler proper to make.) This patch therefore moves branch relaxation to a pre-emit pass. For now, calls are still shortened from BRASL to BRAS by the assembler, although this too is not really the traditional behaviour. The first test takes about 1.5s to run, and there are likely to be more tests in this vein once further branch types are added. The feeling on IRC was that 1.5s is a bit much for a single test, so I've restricted it to SystemZ hosts for now. The patch exposes (and fixes) some typos in the main CodeGen/SystemZ tests. A later patch will remove the {{g}}s from that directory. llvm-svn: 182274	2013-05-20 14:23:08 +00:00
Benjamin Kramer	8e4b20f98d	Enable pod-like optimizations for pred and succ iterators. llvm-svn: 182257	2013-05-20 13:12:58 +00:00
Justin Holewinski	01f89f0428	[NVPTX] Add GenericToNVVM IR converter to better handle idiomatic LLVM IR inputs This converter currently only handles global variables in address space 0. For these variables, they are promoted to address space 1 (global memory), and all uses are updated to point to the result of a cvta.global instruction on the new variable. The motivation for this is address space 0 global variables are illegal since we cannot declare variables in the generic address space. Instead, we place the variables in address space 1 and explicitly convert the pointer to address space 0. This is primarily intended to help new users who expect to be able to place global variables in the default address space. llvm-svn: 182254	2013-05-20 12:13:32 +00:00
Justin Holewinski	700b6fa934	[NVPTX] Fix i1 kernel parameters and global variables. ABI rules say we need to use .u8 for i1 parameters for kernels. llvm-svn: 182253	2013-05-20 12:13:28 +00:00
Stepan Dyatkovskiy	d0e34a200f	PR15868 fix. Introduction: In case when stack alignment is 8 and GPRs parameter part size is not N8: we add padding to GPRs part, so part's last byte must be recovered at address K8-1. We need to do it, since remained (stack) part of parameter starts from address K8, and we need to "attach" "GPRs head" without gaps to it: Stack: \|---- 8 bytes block ----\| \|---- 8 bytes block ----\| \|---- 8 bytes... [ [padding] [GPRs head] ] [ ------ Tail passed via stack ------ ... FIX: Note, once we added padding we need to correct all* Arg offsets that are going after padded one. That's why we need this fix: Arg offsets were never corrected before this patch. See new test-cases included in patch. We also don't need to insert padding for byval parameters that are stored in GPRs only. We need pad only last byval parameter and only in case it outsides GPRs and stack alignment = 8. Though, stack area, allocated for recovered byval params, must satisfy "Size mod 8 = 0" restriction. This patch reduces stack usage for some cases: We can reduce ArgRegsSaveArea since inner N*4 bytes sized byval params my be "packed" with alignment 4 in some cases. llvm-svn: 182237	2013-05-20 08:01:34 +00:00
Renato Golin	9e18922d67	Disable remote MCJIT on pre-v6 ARM llvm-svn: 182235	2013-05-20 07:46:06 +00:00
Bob Wilson	29699c6365	Partially revert change in r181200 that tried to simplify JIT unit test #ifdefs. The export list for this test requires the following symbols to be available: JITTest_AvailableExternallyFunction JITTest_AvailableExternallyGlobal The change in r181200 commented them out, which caused the test to fail to link, at least on Darwin. I have only reverted the change for arm, since I can't test the other targets and since it sounds like that change was fixing real problems for those other targets. It should be possible to rearrange the code to keep those definitions outside the #ifdefs, but that should be done by someone who can reproduce the problems that r181200 was trying to fix. llvm-svn: 182233	2013-05-20 06:13:09 +00:00
Jakob Stoklund Olesen	f927800325	Also expand 64-bit bitcasts. llvm-svn: 182229	2013-05-20 01:01:43 +00:00
Jakob Stoklund Olesen	c7bc5fbc5c	Implement spill and fill of I64Regs. llvm-svn: 182228	2013-05-20 00:53:25 +00:00
Jakob Stoklund Olesen	751e9b8407	Mark i64 SETCC as expand so it is turned into a SELECT_CC. llvm-svn: 182227	2013-05-20 00:28:36 +00:00
Benjamin Kramer	8bad66e586	Replace some bit operations with simpler ones. No functionality change. llvm-svn: 182226	2013-05-19 22:01:57 +00:00
Jakob Stoklund Olesen	86c5469d26	Don't use %g0 to materialize 0 directly. The wired physreg doesn't work on tied operands like on MOVXCC. Add a README note to fix this later. llvm-svn: 182225	2013-05-19 21:47:13 +00:00
Jakob Stoklund Olesen	92ebf1153e	Select i64 values with %icc conditions. llvm-svn: 182224	2013-05-19 20:38:21 +00:00
Bob Wilson	111b0b6da4	Remove declaration of __clear_cache for __APPLE__. <rdar://problem/13924072> This fixes a bootstrapping problem with builds for Apple ARM targets. Clang had the wrong prototype for __clear_cache with ARM targets. Rafael fixed that in clang svn r181784 and r181810, but without those changes, we can't build this code for ARM because clang reports an error about the declaration in Memory.inc not matching the builtin declaration. Some of our buildbots need to use an older compiler that doesn't have the clang fix. Since __clear_cache is never used here when __APPLE__ is defined, I'm just conditionalizing the declaration to match that. I also moved the declaration of sys_icache_invalidate inside the conditional for __APPLE__ while I was at it. llvm-svn: 182223	2013-05-19 20:33:51 +00:00
Jakob Stoklund Olesen	7ca944b9db	Add floating point selects on %xcc predicates. llvm-svn: 182222	2013-05-19 20:33:11 +00:00
Jakob Stoklund Olesen	4a78c86a6a	Implement SPselectfcc for i64 operands. Also clean up the arguments to all the MOVCC instructions so the operands always are (true-val, false-val, cond-code). llvm-svn: 182221	2013-05-19 20:20:54 +00:00
Renato Golin	cf6979d896	SubArch support in MCJIT unittest llvm-svn: 182220	2013-05-19 20:10:10 +00:00
Venkatraman Govindaraju	3320e5a921	[Sparc] Rearrange integer registers' allocation order so that register allocator will use I and G registers before using L and O registers. Also, enable registers %g2-%g4 to be used in application and %g5 in 64 bit mode. llvm-svn: 182219	2013-05-19 20:07:20 +00:00
Tim Northover	c17f3f75c5	AArch64: enable MCJIT unittests llvm-svn: 182217	2013-05-19 19:44:56 +00:00
Jakob Stoklund Olesen	ead983cec9	Handle i64 FrameIndex nodes in SPARC v9 mode. llvm-svn: 182216	2013-05-19 19:14:24 +00:00

... 2 3 4 5 6 ...

92421 Commits