llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Bieneman	adffd01498	Adding llvm-shlib to CMake build system with a few new bells and whistles Summary: This patch adds a new CMake build setting LLVM_BUILD_LLVM_DYLIB, which defaults to OFF. When set to ON, this will generate a shared library containing most of LLVM. The contents of the shared library can be overriden by specifying LLVM_DYLIB_COMPONENTS. LLVM_DYLIB_COMPONENTS can be set to a semi-colon delimited list of any LLVM components that you llvm-config can resolve. On Windows, unless you are using Cygwin, you must specify an explicit symbol export file using LLVM_EXPORTED_SYMBOL_FILE. On Cygwin and all unix-like platforms if you do not specify LLVM_EXPORTED_SYMBOL_FILE, an export file containing only the LLVM C API will be auto-generated from the list of LLVM components specified in LLVM_DYLIB_COMPONENTS. Reviewers: rnk Reviewed By: rnk Subscribers: rnk, llvm-commits Differential Revision: http://reviews.llvm.org/D5890 llvm-svn: 220490	2014-10-23 17:22:14 +00:00
David Blaikie	bcfa8a56b1	Remove explicit (void) use of DwarfFile::DD that was accidentally left in r220452. Caught in post-commit review by Frédéric. llvm-svn: 220487	2014-10-23 16:12:58 +00:00
Renato Golin	6fb9c2ea70	Do not emit intermediate register for zero FP immediate This updates check for double precision zero floating point constant to allow use of instruction with immediate value rather than temporary register. Currently "a == 0.0", where "a" is of "double" type generates: vmov.i32 d16, #0x0 vcmpe.f64 d0, d16 With this change it becomes: vcmpe.f64 d0, #0 Patch by Sergey Dmitrouk. llvm-svn: 220486	2014-10-23 15:31:50 +00:00
Rafael Espindola	1b47a28ff9	clang-format two code snippets to make the next patch easy to read. llvm-svn: 220484	2014-10-23 15:20:05 +00:00
Rafael Espindola	a492fe972d	Add unittest for extreme alignments. llvm-svn: 220483	2014-10-23 14:45:19 +00:00
NAKAMURA Takumi	5b6f789d7a	Hexagon/Disassembler/LLVMBuild.txt: Update libdeps. llvm-svn: 220482	2014-10-23 11:32:16 +00:00
NAKAMURA Takumi	f459febb15	Hexagon/LLVMBuild.txt: Prune CRLF. llvm-svn: 220481	2014-10-23 11:32:03 +00:00
NAKAMURA Takumi	bd20251a4a	[CMake] Prune CRLF in CMakeLists.txt(s). llvm-svn: 220480	2014-10-23 11:31:50 +00:00
NAKAMURA Takumi	ae8d2cc9fe	[CMake] Prune trailing whitespace. llvm-svn: 220479	2014-10-23 11:31:33 +00:00
NAKAMURA Takumi	504bbf91cd	Revert r220427, "[Hexagon] Adding encoding bits for add opcode." It brought cyclic dependecy between HexagonAsmPrinter and HexagonDesc. llvm-svn: 220478	2014-10-23 11:31:22 +00:00
Zoran Jovanovic	42b8444372	[mips][microMIPS] Implement ADDIUR1SP instruction Differential Revision: http://reviews.llvm.org/D5153 llvm-svn: 220477	2014-10-23 11:13:59 +00:00
Zoran Jovanovic	bac3619b29	ps][microMIPS] Implement ADDIUR2 instruction Differential Revision: http://reviews.llvm.org/D5151 llvm-svn: 220476	2014-10-23 11:06:34 +00:00
Zoran Jovanovic	9bda2f1926	ps][microMIPS] Implement LI16 instruction Differential Revision: http://reviews.llvm.org/D5149 llvm-svn: 220475	2014-10-23 10:59:24 +00:00
Zoran Jovanovic	4a00fdc2e3	[mips][microMIPS] Implement CodeGen support for SLL16 and SRL16 instructions Differential Revision: http://reviews.llvm.org/D5774 llvm-svn: 220474	2014-10-23 10:42:01 +00:00
Oliver Stannard	39a85abddf	[Thumb2] Improve disassembly of memory hints Currently, the ARM disassembler will disassemble the Thumb2 memory hint instructions (PLD, PLDW and PLI), even for targets which do not have these instructions. This patch adds the required checks to the disassmebler. llvm-svn: 220472	2014-10-23 08:52:58 +00:00
Akira Hatanaka	2ee0e9e6ee	[ARM, stack protector] If supported, use armv7 instructions. This commit enables using movt/movw to load the stack guard address: movw r0, :lower16:(L_g3$non_lazy_ptr-(LPC0_0+8)) movt r0, :upper16:(L_g3$non_lazy_ptr-(LPC0_0+8)) ldr r0, [pc, r0] Previously a pc-relative load was emitted: ldr r0, LCPI0_0 ldr r0, [pc, r0] rdar://problem/18740489 llvm-svn: 220470	2014-10-23 04:17:05 +00:00
Frederic Riss	c1892e2d48	Assert that ValueHandleBase::ValueIsRAUWd doesn't change the tracked Value type. This invariant is enforced in Value::replaceAllUsesWith, thus it seems logical to apply it also to ValueHandles. This commit fixes InstCombine to not trigger the assertion during the removal of constant bitcasts in call instructions. Differential Revision: http://reviews.llvm.org/D5828 llvm-svn: 220468	2014-10-23 04:08:42 +00:00
Frederic Riss	05ad2e543f	Modernize doxygen comments in Support/Dwarf.h In post-commit review of r219442, Rafael pointed out that the comment style of the newly introduced helper didn't follow LLVM's coding standard. Modernize the whole file to the new standards. Differential Revision: http://reviews.llvm.org/D5918 llvm-svn: 220467	2014-10-23 04:08:38 +00:00
Frederic Riss	e939b43aa4	[dwarfdump] Dump DW_AT_ranges values inline in the debug_info dump. The output looks like that: DW_AT_ranges [FORM_data4] (0x00000000 [0x00000001000024a0 - 0x00000001000024c2) [0x0000000100002505 - 0x000000010000268b)) Differential Revision: http://reviews.llvm.org/D5712 llvm-svn: 220466	2014-10-23 04:08:34 +00:00
Peter Collingbourne	244ecf55bd	Add llvm-go tool. This tool lets us build LLVM components within the tree by setting up a $GOPATH that resembles a tree fetched in the normal way with "go get". It is intended that components such as the Go frontend will be built in-tree using this tool. Differential Revision: http://reviews.llvm.org/D5902 llvm-svn: 220462	2014-10-23 02:33:23 +00:00
Evgeniy Stepanov	7db296eba5	[msan] Emit checks for constant shadow values under an experimental flag. Does not change the default behavior. llvm-svn: 220457	2014-10-23 01:05:46 +00:00
David Blaikie	f299947bfa	[DebugInfo] Sink DwarfDebug::addCurrentFnArgument down into DwarfFile. Variable handling will be sunk into DwarfFile so that abstract variables and the like can be shared across multiple CUs (to handle cross-CU inlining, for example). llvm-svn: 220453	2014-10-23 00:16:05 +00:00
David Blaikie	2b22b1e9a2	[DebugInfo] Add DwarfDebug& to DwarfFile. Use the DwarfDebug in one function that previously took it as a parameter, and lay the foundation for use this for other operations coming soon. llvm-svn: 220452	2014-10-23 00:16:03 +00:00
David Blaikie	263a008525	[DebugInfo] Remove LexicalScopes::isCurrentFunctionScope and CSE a use of LexicalScopes::getCurrentFunctionScope Now that we're sure the only root (non-abstract) scope is the current function scope, there's no need for isCurrentFunctionScope, the property can be tested directly instead. llvm-svn: 220451	2014-10-23 00:06:27 +00:00
Derek Schuff	1fd051bfe8	Fix Mips nacl-mask test for new bundle-aligned label behavior After r220439 the behavior of labels in bundle-align mode changed, and I neglected to update this test. llvm-svn: 220447	2014-10-22 23:32:00 +00:00
Lang Hames	efe7e22673	[MCJIT] Make repeat calls to MCJIT::getPointerToFunction for declarations safe. MCJIT::getPointerForFunction adds the resulting address to the global mapping. This should be done via updateGlobalMapping rather than addGlobalMapping, since the latter asserts if a mapping already exists. MCJIT::getPointerToFunction is actually deprecated - hopefully we can remove it (or more likely re-task it) entirely soon. In the mean time it should at least work as advertised. <rdar://problem/18727946> llvm-svn: 220444	2014-10-22 23:18:42 +00:00
David Majnemer	ac7dc6e701	Attempt to fix the build after r220439 llvm-svn: 220440	2014-10-22 22:46:05 +00:00
Derek Schuff	5f708e5ec8	[MC] Attach labels to existing fragments instead of using a separate fragment Summary: Currently when emitting a label, a new data fragment is created for it if the current fragment isn't a data fragment. This change instead enqueues the label and attaches it to the next fragment (e.g. created for the next instruction) if possible. When bundle alignment is not enabled, this has no functionality change (it just results in fewer extra fragments being created). For bundle alignment, previously labels would point to the beginning of the bundle padding instead of the beginning of the emitted instruction. This was not only less efficient (e.g. jumping to the nops instead of past them) but also led to miscalculation of the address of the GOT (since MC uses a label difference rather than emitting a "." symbol). Fixes https://code.google.com/p/nativeclient/issues/detail?id=3982 Test Plan: regression test attached Reviewers: jvoung, eliben Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D5915 llvm-svn: 220439	2014-10-22 22:38:06 +00:00
Colin LeMahieu	73a51a1a68	[Hexagon] Adding encoding bits for add opcode. Adding llvm-mc tests. Removing unit tests. http://reviews.llvm.org/D5624 llvm-svn: 220427	2014-10-22 20:58:35 +00:00
Chad Rosier	dcd2a3014c	[AArch64] Add support for the .inst directive. This has been implement using the MCTargetStreamer interface as is done in the ARM, Mips and PPC backends. Phabricator: http://reviews.llvm.org/D5891 PR20964 llvm-svn: 220422	2014-10-22 20:35:57 +00:00
Peter Collingbourne	04bca22822	Go: add binding for LLVMSetUnnamedAddr. llvm-svn: 220416	2014-10-22 20:20:27 +00:00
Benjamin Kramer	7ad22403fb	Strength reduce constant-sized vectors into arrays. No functionality change. llvm-svn: 220412	2014-10-22 19:55:26 +00:00
Peter Collingbourne	a7d1751d98	Do not add -gsplit-dwarf to LLVM_DEFINITIONS. This would cause the flag to appear in the output of "llvm-config --cppflags", which should contain only preprocessor flags. The -gsplit-dwarf flag in particular can cause problems with certain downstream users such as cgo. Differential Revision: http://reviews.llvm.org/D5895 llvm-svn: 220410	2014-10-22 19:49:19 +00:00
Benjamin Kramer	26ce8ff637	LoopVectorize: Simplify code. No functionality change. llvm-svn: 220405	2014-10-22 19:13:54 +00:00
Diego Novillo	19e7b7e27c	Shorten auto iterators for function basic blocks. Use consistent naming for basic block instances. No functional changes. llvm-svn: 220404	2014-10-22 18:39:50 +00:00
Matt Arsenault	64313c94ae	Fix number of operands in documentation for minnum / maxnum llvm-svn: 220402	2014-10-22 18:25:02 +00:00
Justin Bogner	72d1f2b61b	test: Make this test runnable in directories with @ in their names Jenkins likes to use directories with names involving the '@' character, which breaks the sed expression in this test. Switch to use '\|' on the assumption that it's less likely to show up in a path. llvm-svn: 220401	2014-10-22 18:18:54 +00:00
Hans Wennborg	db08566588	Fix VS2012 build; C++11 type aliases are not supported. llvm-svn: 220399	2014-10-22 17:47:49 +00:00
Colin LeMahieu	b424cb1e57	Ammending 220393 - Removing unused decoding tables. llvm-svn: 220397	2014-10-22 17:23:01 +00:00
Colin LeMahieu	9950d5c59a	Ammending 220393 - Removing unused functions. llvm-svn: 220396	2014-10-22 17:03:19 +00:00
Bill Schmidt	9c54bbd791	[PATCH] Support select-cc for VSFRC when VSX is enabled A previous patch enabled SELECT_VSRC and SELECT_CC_VSRC for VSX to handle <2 x double> cases. This patch adds SELECT_VSFRC and SELECT_CC_VSFRC to allow use of all 64 vector-scalar registers for the f64 type when VSX is enabled. The changes are analogous to those in the previous patch. I've added a new variant to vsx.ll to test the code generation. (I also cleaned up a little formatting in PPCInstrVSX.td from the previous patch.) llvm-svn: 220395	2014-10-22 16:58:20 +00:00
Diego Novillo	b368b7d558	Use auto iteration in lib/Transforms/Scalar/SampleProfile.cpp. No functional changes. llvm-svn: 220394	2014-10-22 16:51:50 +00:00
Colin LeMahieu	88ebb9e2da	[Hexagon] Adding basic disassembler. Marking all instructions as CodeGenOnly since encoding bits are not set yet. http://reviews.llvm.org/D5829?vs=on&id=15023&whitespace=ignore-all#toc llvm-svn: 220393	2014-10-22 16:49:14 +00:00
Philip Reames	d92c2a7592	Preserving 'nonnull' metadata in SimplifyCFG When we hoist two loads above an if, we can preserve the nonnull metadata. We could also do the same for sinking them, but we appear to not handle metadata at all in that case. Thanks to Hal for the review. Differential Revision: http://reviews.llvm.org/D5910 llvm-svn: 220392	2014-10-22 16:37:13 +00:00
Sanjay Patel	a92fa44740	Shrinkify libcalls: use float versions of double libm functions with fast-math (bug 17850) When a call to a double-precision libm function has fast-math semantics (via function attribute for now because there is no IR-level FMF on calls), we can avoid fpext/fptrunc operations and use the float version of the call if the input and output are both float. We already do this optimization using a command-line option; this patch just adds the ability for fast-math to use the existing functionality. I moved the cl::opt from InstructionCombining into SimplifyLibCalls because it's only ever used internally to that class. Modified the existing test cases to use the unsafe-fp-math attribute rather than repeating all tests. This patch should solve: http://llvm.org/bugs/show_bug.cgi?id=17850 Differential Revision: http://reviews.llvm.org/D5893 llvm-svn: 220390	2014-10-22 15:29:23 +00:00
Rafael Espindola	dbc0416b4b	Make two helper functions static. llvm-svn: 220389	2014-10-22 15:05:51 +00:00
Diego Novillo	a67c0b43e1	Change error to warning when a profile cannot be found. When the profile for a function cannot be applied, we use to emit an error. This seems extreme. The compiler can continue, it's just that the optimization opportunities won't include profile information. llvm-svn: 220386	2014-10-22 13:36:35 +00:00
Bill Schmidt	61e652334f	[PowerPC] Support select-cc for VSX The tests test/CodeGen/Generic/select-cc.ll and test/CodeGen/PowerPC/select-cc.ll both fail with VSX enabled. The problem is that the lowering logic for the SELECT and SELECT_CC operations doesn't currently support the VSX registers. This patch fixes that. In lib/Target/PowerPC/PPCInstrInfo.td, we have pseudos to handle this for other register classes. Similar pseudos are added in PPCInstrVSX.td (they must be there, because the "vsrc" register class definition appears there) for the VSRC register class. The SELECT_VSRC pseudo is then used in pattern matching for SELECT_CC. The rest of the patch just adds logic for SELECT_VSRC wherever similar logic appears for SELECT_VRRC. There are no new test cases because the existing tests above test this, along with a variant in test/CodeGen/PowerPC/vsx.ll. After discussion with Hal, a future patch will add similar _VSFRC variants to override f64 type handling (currently using F8RC). llvm-svn: 220385	2014-10-22 13:13:40 +00:00
Aaron Ballman	4934e4b598	Fixing a -Wsign-compare warning; NFC. I think it might make sense to make COFF::MaxNumberOfSections16 be a uint32_t, however, that may have wider-reaching implications in other projects, which is why I did not change that declaration. llvm-svn: 220384	2014-10-22 13:09:43 +00:00
Diego Novillo	8027b80b41	Support using sample profiles with partial debug info. Summary: When using a profile, we used to require the use -gmlt so that we could get access to the line locations. This is used to match line numbers in the input profile to the line numbers in the function's IR. But this is actually not necessary. The driver can provide source location tracking without the emission of debug information. In these cases, the annotation 'llvm.dbg.cu' is missing from the IR, but the actual line location annotations are still present. This patch adds a new way of looking for the start of the current function. Instead of looking through the compile units in llvm.dbg.cu, we can walk up the scope for the first instruction in the function with a debug loc. If that describes the function, we use it. Otherwise, we keep looking until we find one. If no such instruction is found, we then give up and produce an error. Reviewers: echristo, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5887 llvm-svn: 220382	2014-10-22 12:59:00 +00:00
Arnaud A. de Grandmaison	9b3330546b	[AArch64] Cleanup A57PBQPConstraints And add a long awaited testcase. llvm-svn: 220381	2014-10-22 12:40:20 +00:00
Bruno Cardoso Lopes	c29520c5b3	[InstSimplify] Support constant folding to vector of pointers ConstantFolding crashes when trying to InstSimplify the following load: @a = private unnamed_addr constant %mst { i8* inttoptr (i64 -1 to i8), i8 inttoptr (i64 -1 to i8) }, align 8 %x = load <2 x i8>* bitcast (%mst* @a to <2 x i8>), align 8 This patch fix this by adding support to this type of folding: %x = load <2 x i8> bitcast (%mst* @a to <2 x i8>), align 8 ==> gets folded to: %x = <2 x i8> <i8 inttoptr (i64 -1 to i8), i8 inttoptr (i64 -1 to i8*)> llvm-svn: 220380	2014-10-22 12:18:48 +00:00
Jyoti Allur	3b68607eac	[Thumb/Thumb2] Implement restrictions on SP in register list on LDM, STM variants in thumb mode llvm-svn: 220379	2014-10-22 10:41:14 +00:00
Peter Zotov	d6a644adc6	[OCaml] Fix a typo in documentation. llvm-svn: 220377	2014-10-22 10:24:05 +00:00
Matt Arsenault	0cf39569bf	R600/SI: Add another failing testcase for i1 copies It's not handling phis. llvm-svn: 220371	2014-10-22 05:30:42 +00:00
Matt Arsenault	59102d38fb	R600/SI: Add failing testcase reduced from OpenCV This fails the verifier with: "Expected a VCSrc_32 register, but got a VReg_1 register" llvm-svn: 220368	2014-10-22 04:26:10 +00:00
Rafael Espindola	68bae2c7f6	Handle spaces and quotes in file names in MRI scripts. llvm-svn: 220364	2014-10-22 03:10:56 +00:00
Rafael Espindola	b0fc4cf3b9	Fix a gcc warning. Thanks to Filipe Cabecinhas for the report. llvm-svn: 220361	2014-10-22 02:23:31 +00:00
Filipe Cabecinhas	3954566554	Silence gcc's -Wcomment gcc's (4.7, I think) -Wcomment warning is not "as smart" as clang's and warns even if the line right after the backslash-newline sequence only has a line comment that starts at the beginning of the line. llvm-svn: 220360	2014-10-22 02:16:06 +00:00
Daniel Dunbar	02306e1cdb	[lit] Fix Python-3 compatibility, patch by Dan Liew. llvm-svn: 220357	2014-10-22 01:26:06 +00:00
Daniel Dunbar	21297a532e	[lit] Bump version number. llvm-svn: 220355	2014-10-22 00:48:23 +00:00
Daniel Dunbar	dfbb5c1b76	Fix ShTest parsing error when a keyword line doesn't end with a newline. llvm-svn: 220354	2014-10-22 00:34:31 +00:00
Matt Arsenault	8226ca29e2	Fix typo llvm-svn: 220353	2014-10-22 00:28:59 +00:00
Matt Arsenault	9886b0da3b	Try to fix documentation bot warning llvm-svn: 220352	2014-10-22 00:15:53 +00:00
Evgeniy Stepanov	35eb265421	[msan] Handle param-tls overflow. ParamTLS (shadow for function arguments) is of limited size. This change makes all arguments that do not fit unpoisoned, and avoids writing past the end of a TLS buffer. llvm-svn: 220351	2014-10-22 00:12:40 +00:00
Hans Wennborg	0b39fc0d16	Revert "Teach the load analysis to allow finding available values which require" (r220277) This seems to have caused PR21330. llvm-svn: 220349	2014-10-21 23:49:52 +00:00
Lang Hames	41d95947cf	[MCJIT] Defer application of AArch64 MachO GOT relocations until resolve time. On AArch64, GOT references are page relative (ADRP + LDR), so they can't be applied until we know exactly where, within a page, the GOT entry will be in the target address space. Fixes <rdar://problem/18693976>. llvm-svn: 220347	2014-10-21 23:41:15 +00:00
Rafael Espindola	915fbb3590	MRI scripts: Add addlib support. llvm-svn: 220346	2014-10-21 23:18:51 +00:00
JF Bastien	f42a6ea5ac	LTO: respect command-line options that disable vectorization. Summary: Patches 202051 and 208013 added calls to LTO's PassManager which unconditionally add LoopVectorizePass and SLPVectorizerPass instead of following the logic in PassManagerBuilder::populateModulePassManager and honoring the -vectorize-loops -run-slp-after-loop-vectorization flags. Reviewers: nadav, aschwaighofer, yijiang Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5884 llvm-svn: 220345	2014-10-21 23:18:21 +00:00
Rafael Espindola	ef1c9ad864	Use a range loop. NFC. llvm-svn: 220344	2014-10-21 23:04:55 +00:00
Matt Arsenault	7c93690be0	Add minnum / maxnum codegen llvm-svn: 220342	2014-10-21 23:01:01 +00:00
Matt Arsenault	d6511b49ac	Add minnum / maxnum intrinsics These are named following the IEEE-754 names for these functions, rather than the libm fmin / fmax to avoid possible ambiguities. Some languages may implement something resembling fmin / fmax which return NaN if either operand is to propagate errors. These implement the IEEE-754 semantics of returning the other operand if either is a NaN representing missing data. llvm-svn: 220341	2014-10-21 23:00:20 +00:00
Duncan P. N. Exon Smith	44e5b4e533	IR: Reorder metadata bitcode serialization, NFC Enumerate `MDNode`'s operands before the node itself, so that the reader requires less RAUW. Although this will cause different code paths to be hit in the reader, this should effectively be no functionality change. llvm-svn: 220340	2014-10-21 22:27:47 +00:00
Matt Arsenault	75c658e2cc	R600/SI: Add missing parameter to div_fmas intrinsic llvm-svn: 220338	2014-10-21 22:20:55 +00:00
Duncan P. N. Exon Smith	60d87e7253	IR: Remove dead code in metadata bitcode writing, NFC No one cares how many uses each metadata value has, so don't bother counting. llvm-svn: 220337	2014-10-21 22:13:34 +00:00
Rafael Espindola	8a4635224b	Overwrite instead of adding to archives when creating them in mri scripts. This matches the behavior of GNU ar and also makes it easier to implemnt support for the addlib command. llvm-svn: 220336	2014-10-21 21:56:47 +00:00
Arnaud A. de Grandmaison	5c7fe7e97b	Pacify bots and simplify r220321 llvm-svn: 220335	2014-10-21 21:50:49 +00:00
Rafael Espindola	7661970d2f	Convert a few std::string with StringRef. NFC. This is a micro optimization, but also makes the code a bit more flexible. The MRIMembers variable is a short term hack. It is going away in the next commit. llvm-svn: 220334	2014-10-21 21:47:27 +00:00
Reid Kleckner	0c6fed5716	GCC has supported C++11 ref-qualifiers since 4.8.1 This requires incorporating __GNUC_PATCHLEVEL__ into our prerequisite check, and renaming our __GNUC_PREREQ to LLVM_GNUC_PREREQ, since it is now functionally different. Patch by Chilledheart! Differential Revision: http://reviews.llvm.org/D5879 llvm-svn: 220332	2014-10-21 21:15:45 +00:00
Matt Arsenault	8c4fb7cae0	R600: Use default GlobalDirective The overridden one wasn't inserting a space, so you would end up with .globalfoo llvm-svn: 220329	2014-10-21 21:08:36 +00:00
Rafael Espindola	4c1f801b81	Use a StringRef. No functionality change. llvm-svn: 220327	2014-10-21 21:07:49 +00:00
Philip Reames	d7c21364a9	Teach combineMetadata how to merge 'nonnull' metadata. combineMetadata is used when merging two instructions into one. This change teaches it how to merge 'nonnull' - i.e. only preserve it on the new instruction if it's set on both sources. This isn't actually used yet since I haven't adjusted any of the call sites to pass in nonnull as a 'known metadata'. llvm-svn: 220325	2014-10-21 21:02:19 +00:00
Philip Reames	b2d3f035e2	Preserve 'nonnull' when changing type of the load. When changing the type of a load in Chandler's recent InstCombine changes, we can preserve the new 'nonnull' metadata. I considered adding an assert since 'nonnull' is only valid on pointer types, but casting a pointer to a non-pointer would involve more than a bitcast anyways. If someone extends this transform to handle more than bitcasts, the verifier will report the malformed IR, so a separate assertion isn't needed. Also, the fpmath flags would have the same problem. llvm-svn: 220324	2014-10-21 21:00:03 +00:00
Philip Reames	0ca58b33cf	Extend the verifier to check usage of 'nonnull' metadata. The recently added !nonnull metadata is only valid on loads of pointer type. llvm-svn: 220323	2014-10-21 20:56:29 +00:00
Arnaud A. de Grandmaison	a61262f989	[PBQP] Teach PassConfig to tell if the default register allocator is used. This enables targets to adapt their pass pipeline to the register allocator in use. For example, with the AArch64 backend, using PBQP with the cortex-a57, the FPLoadBalancing pass is no longer necessary. llvm-svn: 220321	2014-10-21 20:47:22 +00:00
Rafael Espindola	ea16d6e2b8	Move code a bit to avoid a few declarations. NFC. llvm-svn: 220317	2014-10-21 20:34:57 +00:00
Arnaud A. de Grandmaison	ece7fe0e16	[PBQP] Add a testcase for r220302: Fix coalescing benefits llvm-svn: 220316	2014-10-21 20:10:21 +00:00
David Majnemer	d205602a0b	InstCombine: Simplify FoldICmpCstShrCst This function was complicated by the fact that it tried to perform canonicalizations that were already preformed by InstSimplify. Remove this extra code and move the tests over to InstSimplify. Add asserts to make sure our preconditions hold before we make any assumptions. llvm-svn: 220314	2014-10-21 19:51:55 +00:00
Rafael Espindola	f03ae4efa7	Drop support for an old version of ld64 (from darwin 9). llvm-svn: 220310	2014-10-21 18:31:09 +00:00
Sanjay Patel	d5aa255146	remove function names from comments; NFC llvm-svn: 220309	2014-10-21 18:26:57 +00:00
Rafael Espindola	4bbdeda8be	Convert two tests to use llvm-readobj. llvm-svn: 220308	2014-10-21 18:24:31 +00:00
Matt Arsenault	e306a32325	R600/SI: Add pattern for bswap llvm-svn: 220304	2014-10-21 16:25:08 +00:00
Arnaud A. de Grandmaison	0dea74b069	[PBQP] Check for out of bound access in DEBUG builds It is just too easy to use a virtual register intead of a NodeId without a compiler warning. This does not fix the fundamental problem, i.e. both have the same underlying types, but increases the likelyhood to detect it. llvm-svn: 220303	2014-10-21 16:24:21 +00:00
Arnaud A. de Grandmaison	d3648d0cbe	[PBQP] Fix coalescing benefits As coalescing registers is a benefit, the cost should be improved (i.e. made smaller) when coalescing is possible. llvm-svn: 220302	2014-10-21 16:24:15 +00:00
NAKAMURA Takumi	9ff272f382	X86AsmInstrumentation.cpp: Dissolve initializer-ranged-for. MSC17 disliked it. llvm-svn: 220301	2014-10-21 16:22:52 +00:00
Aaron Ballman	16f4f78b4b	Silence a -Wcast-qual warning; NFC. llvm-svn: 220300	2014-10-21 16:12:37 +00:00
Colin LeMahieu	7055365e77	Test commit Fixing brief comment. llvm-svn: 220299	2014-10-21 16:03:10 +00:00
Rafael Espindola	90b8570c50	Comment cleanup. NFC. Don't duplicate names in comments and remove useless ones. Hopefully anyone reading this knows what main is. llvm-svn: 220298	2014-10-21 15:49:46 +00:00
Rafael Espindola	c9b33ff9ba	Add support for addmod to mri scripts. llvm-svn: 220294	2014-10-21 14:46:17 +00:00
Bill Schmidt	5c6cb813b6	[PowerPC] Avoid VSX FMA mutate when killed product reg = addend reg With VSX enabled, test/CodeGen/PowerPC/recipest.ll exposes a bug in the FMA mutation pass. If we have a situation where a killed product register is the same register as the FMA target, such as: %vreg5<def,tied1> = XSNMSUBADP %vreg5<tied0>, %vreg11, %vreg5, %RM<imp-use>; VSFRC:%vreg5 F8RC:%vreg11 then the substitution makes no sense. We end up getting a crash when we try to extend the interval associated with the killed product register, as there is already a live range for %vreg5 there. This patch just disables the mutation under those circumstances. Since recipest.ll generates different code with VMX enabled, I've modified that test to use -mattr=-vsx. I've borrowed the code from that test that exposed the bug and placed it in fma-mutate.ll, where it tests several mutation opportunities including the "bad" one. llvm-svn: 220290	2014-10-21 13:02:37 +00:00
Oliver Stannard	cdb8db8d3c	[ARM] NEON 32-bit scalar moves are also available in VFPv2 The 32-bit variants of the NEON scalar<->GPR move instructions are also available in VFPv2. The 8- and 16-bit variants do require NEON. Note that the checks in the test file are all -DAG because they are checking a mixture of stdout and stderr, and the ordering is not guaranteed. llvm-svn: 220288	2014-10-21 11:49:14 +00:00
Yuri Gorshenin	171eb8dbeb	[asan-asm-instrumentation] Fixed memory accesses with rbp as a base or an index register. Summary: Fixed memory accesses with rbp as a base or an index register. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5819 llvm-svn: 220283	2014-10-21 10:22:27 +00:00
Oliver Stannard	38e6d45a46	[Thumb2] LDRS?[BH] cannot load to the PC The Thumb2 LDRS?[BH] instructions are not valid when the destination register is the PC (these encodings are used for preload hints). llvm-svn: 220278	2014-10-21 09:14:15 +00:00
Chandler Carruth	aa72a6dd3b	Teach the load analysis to allow finding available values which require inttoptr or ptrtoint cast provided there is datalayout available. Eventually, the datalayout can just be required but in practice it will always be there today. To go with the ability to expose available values requiring a ptrtoint or inttoptr cast, helpers are added to perform one of these three casts. These smarts are necessary to finish canonicalizing loads and stores to the operational type requirements without regressing fundamental combines. I've added some test cases. These should actually improve as the load combining and store combining improves, but they may fundamentally be highlighting some missing combines for select in addition to exercising the specific added logic to load analysis. llvm-svn: 220277	2014-10-21 09:00:40 +00:00
Zoran Jovanovic	592239d498	[mips][microMIPS] Implement ADDU16 and SUBU16 instructions Differential Revision: http://reviews.llvm.org/D5118 llvm-svn: 220276	2014-10-21 08:44:58 +00:00
Zoran Jovanovic	81ceebc56e	[mips][microMIPS] Implement AND16, NOT16, OR16 and XOR16 instructions Differential Revision: http://reviews.llvm.org/D5117 llvm-svn: 220275	2014-10-21 08:32:40 +00:00
Zoran Jovanovic	b0852e5410	[mips][microMIPS] Implement microMIPS 16-bit instructions registers Differential Revision: http://reviews.llvm.org/D5116 llvm-svn: 220273	2014-10-21 08:23:11 +00:00
Rafael Espindola	c606bfe660	Fix a bit of confusion about .set and produce more readable assembly. Every target we support has support for assembly that looks like a = b - c .long a What is special about MachO is that the above combination suppresses the production of a relocation. With this change we avoid producing the intermediary labels when they don't add any value. llvm-svn: 220256	2014-10-21 01:17:30 +00:00
Paul Robinson	f60e0a160f	Do not attribute static allocas to the call site's DebugLoc. When functions are inlined, instructions without debug information are attributed to the call site's DebugLoc. After inlining, inlined static allocas are moved to the caller's entry block, adjacent to the caller's original static alloca instructions. By retaining the call site's DebugLoc, these instructions could cause instructions that were subsequently inserted at the entry block to pick up the same DebugLoc. Patch by Wolfgang Pieb! llvm-svn: 220255	2014-10-21 01:00:55 +00:00
Rafael Espindola	f16a66973c	Make this test a bit more strict. llvm-svn: 220253	2014-10-21 00:47:49 +00:00
Chandler Carruth	97192421e1	Teach lit to filter the host LDFLAGS down from the build system and into the CGO build environment. This lets things like -rpath propagate down to the C++ code that is built along side the Go bindings when testing them. Patch by Peter Collingbourne, and verified that it works by me. llvm-svn: 220252	2014-10-21 00:36:28 +00:00
David Blaikie	df9515324d	PR21202: Memory leak in Windows RWMutexImpl when using SRWLOCK llvm-svn: 220251	2014-10-21 00:34:39 +00:00
Rafael Espindola	74dd8547db	Make AsmPrinter::EmitLabelOffsetDifference a static helper and simplify. It had exactly one caller in a position where we know hasSetDirective is true. llvm-svn: 220250	2014-10-21 00:25:49 +00:00
Lang Hames	2d0d096bd1	[MCJIT] Temporarily revert r220245 - it broke several bots. (See e.g. http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/17653) llvm-svn: 220249	2014-10-21 00:24:02 +00:00
Philip Reames	5a3f5f751b	Introduce enum values for previously defined metadata types. (NFC) Our metadata scheme lazily assigns IDs to string metadata, but we have a mechanism to preassign them as well. Using a preassigned ID is helpful since we get compile time type checking, and avoid some (minimal) string construction and comparison. This change adds enum value for three existing metadata types: + MD_nontemporal = 9, // "nontemporal" + MD_mem_parallel_loop_access = 10, // "llvm.mem.parallel_loop_access" + MD_nonnull = 11 // "nonnull" I went through an updated various uses as well. I made no attempt to get all uses; I focused on the ones which were easily grepable and easily to translate. For example, there were several items in LoopInfo.cpp I chose not to update. llvm-svn: 220248	2014-10-21 00:13:20 +00:00
Philip Reames	bf9676f7f0	Extend the verifier to validate range metadata on calls and invokes. Range metadata applies to loads, call, and invokes. We were validating that metadata applied to loads was correct according to the LangRef, but we were not validating metadata applied to calls or invokes. This change extracts the checking functionality to a common location, reuses it for all valid locations, and adds a simple test to ensure a misused range on a call gets reported. llvm-svn: 220246	2014-10-20 23:52:07 +00:00
Lang Hames	84801c217c	[MCJIT] Make MCJIT honor symbol visibility settings when populating the global symbol table. Patch by Anthony Pesch. Thanks Anthony! llvm-svn: 220245	2014-10-20 23:39:54 +00:00
Quentin Colombet	06355199f1	[X86] Fix a bug in the lowering of the mask of VSELECT. X86 code to lower VSELECT messed a bit with the bits set in the mask of VSELECT when it knows it can be lowered into BLEND. Indeed, only the high bits need to be set for those and it optimizes those accordingly. However, when the mask is a compile time constant, the lowering will be handled by the generic optimizer and those modifications will generate bad code in the generic optimizer. This patch fixes that by preventing the optimization if the VSELECT will be handled by the generic optimizer. <rdar://problem/18675020> llvm-svn: 220242	2014-10-20 23:13:30 +00:00
Philip Reames	cdb72f369f	Introduce a 'nonnull' metadata on Load instructions. The newly introduced 'nonnull' metadata is analogous to existing 'nonnull' attributes, but applies to load instructions rather than call arguments or returns. Long term, it would be nice to combine these into a single construct. The value of the load is allowed to vary between successive loads, but null is not a valid value to be loaded by any load marked nonnull. Reviewed by: Hal Finkel Differential Revision: http://reviews.llvm.org/D5220 llvm-svn: 220240	2014-10-20 22:40:55 +00:00
Simon Pilgrim	2f9548a3ef	[X86] Memory folding for commutative instructions (updated) This patch improves support for commutative instructions in the x86 memory folding implementation by attempting to fold a commuted version of the instruction if the original folding fails - if that folding fails as well the instruction is 're-commuted' back to its original order before returning. Updated version of r219584 (reverted in r219595) - the commutation attempt now explicitly ensures that neither of the commuted source operands are tied to the destination operand / register, which was the source of all the regressions that occurred with the original patch attempt. Added additional regression test case provided by Joerg Sonnenberger. Differential Revision: http://reviews.llvm.org/D5818 llvm-svn: 220239	2014-10-20 22:14:22 +00:00
Rafael Espindola	c71e005030	Explain why we don't always use --gc-sections. llvm-svn: 220237	2014-10-20 21:37:38 +00:00
Tim Northover	23075ccee7	ARM: rework Thumb1 frame index rewriting The previous code had a few problems, motivating the choices here. 1. It could create instructions clobbering CPSR, but the incoming MachineInstr didn't reflect this. A potential source of corruption. This is why the patch has a new PseudoInst for before lowering. 2. Similarly, there was some code to handle the incoming instruction not being ARMCC::AL, but this would have caused massive problems if it was actually invoked when a complex offset needing more than one instruction was requested. 3. It wasn't designed to handle unaligned pointers (or offsets). These should probably be minimised anyway, but the code needs to deal with them properly regardless. 4. It had some rather dubious ad-hoc code to avoid calling emitThumbRegPlusImmediate, a function which should be designed to do precisely this job. We seem to cover the common cases correctly now, and hopefully can enhance emitThumbRegPlusImmediate to handle any extra optimisations we need to add in future. llvm-svn: 220236	2014-10-20 21:28:41 +00:00
Alexey Samsonov	204607bc90	Try to fix GCC error about invalid use of const_cast in const version of ErrorOr::get() llvm-svn: 220233	2014-10-20 20:41:21 +00:00
Alexey Samsonov	3f3ea33531	Constify getELFDynamicSymbolIterators standalone function. NFC. llvm-svn: 220232	2014-10-20 20:33:20 +00:00
Alexey Samsonov	3c6915fa40	Add const version of OwningBinary::getBinary llvm-svn: 220231	2014-10-20 20:32:47 +00:00
Alexey Samsonov	798ff92c3c	Be more specific about return type of MachOUniversalBinary::getObjectForArch llvm-svn: 220230	2014-10-20 20:30:57 +00:00
Alexey Samsonov	4a7eb380cc	Constify input argument of RelocVisitor and DWARFContext constructors. NFC. llvm-svn: 220228	2014-10-20 20:28:51 +00:00
Dan Liew	2f9e28c84f	Teach Lit to catch OSError exceptions when creating a process during the execution of a shell command. This can happen for example if the ``RUN:`` line calls a python script which can work correctly under Linux/OSX but will not work under Windows. A more useful error message is now shown rather than an unhelpful backtrace. llvm-svn: 220227	2014-10-20 20:14:28 +00:00
Robert Khasanov	65c2756869	Moved out IIT_V64 from common values section. Thanks Juergen Ributzka for notice. llvm-svn: 220224	2014-10-20 19:25:05 +00:00
Gerolf Hoflehner	c47baed0b5	[AArch64] test case for compfail fixed by r219748 llvm-svn: 220206	2014-10-20 16:08:33 +00:00
Steven Wu	d05affcc81	Fix Intrinsic::getType not working with vararg VarArg Intrinsic functions are encoded with "void" type as the last argument. Now Intrinsic::getType can correctly return all the intrinsic function type. llvm-svn: 220205	2014-10-20 15:47:24 +00:00
Oliver Stannard	6672f37ed2	[Thumb2] RFE, SRS and "SUBS pc, lr" are undefined on v7M These instructions are related to the v7[AR] exception model, and are not defined on v7M. llvm-svn: 220204	2014-10-20 15:37:35 +00:00
Sid Manning	c374ac97c8	Remove unnecessary else. llvm-svn: 220200	2014-10-20 13:08:19 +00:00
NAKAMURA Takumi	748627cc2a	Revert r220174, "Always use -Wl,-gc-sections on our build." It dropped required functions for plugins with gnu ld 2.20 and 2.21. Failing Tests (1): LLVM :: Feature/load_module.ll Hello: bin/opt: symbol lookup error: lib/LLVMHello.so: undefined symbol: _ZN4llvm11raw_ostream13write_escapedENS_9StringRefEb Failing Tests (1): Clang :: Frontend/plugins.c error: unable to load plugin 'lib/PrintFunctionNames.so': 'lib/PrintFunctionNames.so: undefined symbol: _ZN5clang15PluginASTAction6anchorEv' I think we should inspect linker's version or behavior to introduce --gc-sections for --export-dynamic. llvm-svn: 220198	2014-10-20 12:12:21 +00:00
Oliver Stannard	e8f63a54b4	[ARM] Do not select SMULW[BT] or SMLAW[BT] The current instruction selection patterns for SMULW[BT] and SMLAW[BT] are incorrect. These instructions multiply a 32-bit and a 16-bit value (both signed) and return the top 32 bits of the 48-bit result. This preserves the 16 bits of overflow, whereas the patterns they currently match truncate the result to 16 bits then sign extend. To select these instructions, we would need to match an ISD::SMUL_LOHI, a sign extend, two shifts and an or. There is no way to match SMUL_LOHI in an instruction pattern as it defines multiple values, so this would have to be done in C++. I have raised http://llvm.org/bugs/show_bug.cgi?id=21297 to cover allowing correct selection of these instructions. This fixes http://llvm.org/bugs/show_bug.cgi?id=19396 llvm-svn: 220196	2014-10-20 11:30:35 +00:00
Oliver Stannard	fce039240a	[Thumb] Fix crash in Thumb1RegisterInfo::rewriteFrameIndex This function can, for some offsets from the SP, split one instruction into two. Since it re-uses the original instruction as the first instruction of the result, we need ensure its result register is not marked as dead before we use it in the second instruction. llvm-svn: 220194	2014-10-20 11:00:18 +00:00
Chandler Carruth	f67321cb26	Switch the default DataLayout to be little endian, and make the variable be BigEndian so the default can continue to be zero-initialized. This is one of the prerequisites to making DataLayout a constant and always available part of every module. llvm-svn: 220193	2014-10-20 10:41:29 +00:00
Chandler Carruth	798f882b68	Remove some completely superfluous trailing comments and clang-format this header to remove numerous formatting inconsistencies that impede making simple changes here without large diffs. llvm-svn: 220192	2014-10-20 10:35:11 +00:00
Chandler Carruth	e1e2c6e219	Clean up the comments and doxygen for DataLayout. llvm-svn: 220191	2014-10-20 10:27:53 +00:00
Chandler Carruth	a32038b006	Fix a miscompile introduced in r220178. The original code had an implicit assumption that if the test for allocas or globals was reached, the two pointers were not equal. With my changes to make the pointer analysis more powerful here, I also had to guard against circumstances where the results weren't useful. That in turn violated the assumption and gave rise to a circumstance in which we could have a store with both the queried pointer and stored pointer rooted at the same alloca. Clearly, we cannot ignore such a store. There are other things we might do in this code to better handle the case of both pointers ending up at the same alloca or global, but it seems best to at least make the test explicit in what it intends to check. I've added tests for both the alloca and global case here. llvm-svn: 220190	2014-10-20 10:03:01 +00:00
David Majnemer	f3cadce84c	IR: Replace DataLayout::RoundUpAlignment with RoundUpToAlignment No functional change intended, just cleaning up some code. llvm-svn: 220187	2014-10-20 06:13:33 +00:00
Chandler Carruth	6665d62117	Fix a somewhat subtle pair of issues with JumpThreading I introduced in r220178. First, the creation routine doesn't insert prior to the terminator of the basic block provided, but really at the end of the basic block. Instead, get the terminator and insert before that. The next issue was that we need to ensure multiple PHI node entries for a single predecessor re-use the same cast instruction rather than creating new ones. All of the logic here was without tests previously. I've reduced and added a test case from the test suite that crashed without both of these fixes. llvm-svn: 220186	2014-10-20 05:34:36 +00:00
Lang Hames	799e434e3f	[PBQP] Use DenseSet rather than std::set for PBQP's PoolCostAllocator implementation. This is good for a ~6% reduction in total compile time on the nightly test suite when running with -regalloc=pbqp. llvm-svn: 220183	2014-10-20 04:26:23 +00:00
Chandler Carruth	eeec35ae1c	Teach the load analysis driving core instcombine logic and other bits of logic to look through pointer casts, making them trivially stronger in the face of loads and stores with intervening pointer casts. I've included a few test cases that demonstrate the kind of folding instcombine can do without pointer casts and then variations which obfuscate the logic through bitcasts. Without this patch, the variations all fail to optimize fully. This is more important now than it has been in the past as I've started moving the load canonicialization to more closely follow the value type requirements rather than the pointer type requirements and thus this needs to be prepared for more pointer casts. When I made the same change to stores several test cases regressed without logic along these lines so I wanted to systematically improve matters first. llvm-svn: 220178	2014-10-20 00:24:14 +00:00
Chandler Carruth	b5f4c32830	Add a datalayout string to this test so that it exercises the full gamut of InstCombine rather than just the bits enabled when datalayout is optional. The primary fixes here are because now things are little endian. In good news, silliness like this seems like it will be going away as we've got pretty stong consensus on dropping optional datalayout entirely. llvm-svn: 220176	2014-10-20 00:11:31 +00:00
Rafael Espindola	c4df33be5a	Always use -Wl,-gc-sections on our build. Both bfd ld and gold correctly handle --export-dynamic, so gc-sections is safe even for binaries that support plugins. llvm-svn: 220174	2014-10-19 23:24:46 +00:00
Bill Schmidt	941a3244ff	[PowerPC] Clean up -mattr=+vsx tests to always specify -mcpu We recently discovered an issue that reinforces what a good idea it is to always specify -mcpu in our code generation tests, particularly for -mattr=+vsx. This patch ensures that all tests that specify -mattr=+vsx also specify -mcpu=pwr7 or -mcpu=pwr8, as appropriate. Some of the uses of -mattr=+vsx added recently don't make much sense (when specified for -mtriple=powerpc-apple-darwin8 or -march=ppc32, for example). For cases like this I've just removed the extra VSX test commands; there's enough coverage without them. llvm-svn: 220173	2014-10-19 21:29:21 +00:00
Bill Schmidt	87982a1e9b	[PowerPC] Temporarily disable VSX for PowerPC fast-isel tests Patch by Bill Seurer; some comment formatting changes by me. There are a few PowerPC test cases for FastISel support that currently fail with VSX support enabled. The temporary workaround under discussion in http://reviews.llvm.org/D5362 helps, but the tests still fail because they specify -fast-isel-abort, and the VSX workaround punts back to SelectionDAG. We have plans to fix FastISel permanently for VSX, but until that's in place these tests are preventing us from enabling VSX by default. Therefore we are adding -mattr=-vsx to these tests until the full support is ready. llvm-svn: 220172	2014-10-19 20:48:47 +00:00
Bill Schmidt	fff860979d	[PowerPC] Re-enable VSX test line for fma.ll with -mcpu=pwr7 The VSX testing variant in test/CodeGen/PowerPC/fma.ll had to be disabled because of unexpected behavior on many of the builders. I tracked this down to a situation that occurs when the VSX attribute is enabled for a target that disables the MI early scheduling pass. This patch adds -mcpu=pwr7 to make this predictable. The other issue will be addressed separately. llvm-svn: 220171	2014-10-19 20:27:56 +00:00
Lang Hames	b27a3b0d43	[ADT] Add a 'find_as' operation to DenseSet. This operation is analogous to its counterpart in DenseMap: It allows lookup via cheap-to-construct keys (provided that getHashValue and isEqual are implemented for the cheap key-type in the DenseMapInfo specialization). Thanks to Chandler for the review. llvm-svn: 220168	2014-10-19 19:36:33 +00:00
Chandler Carruth	bc6378defb	Do a better and more complete job of preserving metadata when combining loads. This handles many more cases than just the AA metadata, some of them suggested by Hal in his review of the AA metadata handling patch. I've tried to test this behavior where tractable to do so. I'll point out that I have specifically not included a test for debuginfo because it was going to require 2 or 3 times as much work to craft some input which would survive the "helpful" stripping of debug info metadata that doesn't match the desired schema. This is another good example of why the current state of write-ability for our debug info metadata is unacceptable. I spent over 30 minutes trying to conjure some test case that would survive, even copying from other debug info tests, but it always failed to survive with no explanation of why or how I might fix it. =[ llvm-svn: 220165	2014-10-19 10:46:46 +00:00
Chandler Carruth	5b8cd2f73c	Move previously dead code to handle computing the known bits of an alias up to where it actually works as intended. The problem is that a GlobalAlias isa GlobalValue and so the prior block handled all of the cases. This allows us to constant fold based on the actual constant expression in the global alias. As an example, see the last function in the newly added test case which explicitly aligns an unaligned pointer using constant expression math. Without this change, we fail to see that and fold an alignment test to zero. llvm-svn: 220164	2014-10-19 09:06:56 +00:00
David Majnemer	312c3e5f39	InstCombine: (sub (or A B) (xor A B)) --> (and A B) The following implements the transformation: (sub (or A B) (xor A B)) --> (and A B). Patch by Ankur Garg! Differential Revision: http://reviews.llvm.org/D5719 llvm-svn: 220163	2014-10-19 08:32:32 +00:00
David Majnemer	59939acd26	InstCombine: Optimize icmp eq/ne (shl Const2, A), Const1 The following implements the optimization for sequences of the form: icmp eq/ne (shl Const2, A), Const1 Such sequences can be transformed to: icmp eq/ne A, (TrailingZeros(Const1) - TrailingZeros(Const2)) This handles only the equality operators for now. Other operators need to be handled. Patch by Ankur Garg! llvm-svn: 220162	2014-10-19 08:23:08 +00:00
Chandler Carruth	a801dd5799	Fix a long-standing miscompile in the load analysis that was uncovered by my refactoring of this code. The method isSafeToLoadUnconditionally assumes that the load will proceed with the preferred type alignment. Given that, it has to ensure that the alloca or global is at least that aligned. It has always done this historically when a datalayout is present, but has never checked it when the datalayout is absent. When I refactored the code in r220156, I exposed this path when datalayout was present and that turned the latent bug into a patent bug. This fixes the issue by just removing the special case which allows folding things without datalayout. This isn't worth the complexity of trying to tease apart when it is or isn't safe without actually knowing the preferred alignment. llvm-svn: 220161	2014-10-19 08:17:50 +00:00
Chandler Carruth	8a99373812	Switch how the datalayout availability test is handled in this code to make much more sense and in theory be more correct. If you trace the code alllll the way back to when it was first introduced, the comments make it slightly more clear what was going on here. At that time, the only way Base != V was if DL (then TD) was non-null. As a consequence, if DL was null, that meant we were loading directly from the alloca or global found above the test. After refactoring, this has become at least terribly subtle and potentially incorrect. There are many forms of pointer manipulation that can be traversed without DataLayout, and some of them would in fact change the size of object being loaded vs. allocated. Rather than this subtlety, I've hoisted the actual 'return true' bits into the code which actually found an alloca or global and based them on the loaded pointer being that alloca or global. This is both more clear and safer. I've also added comments about exactly why this set of predicates is used. I've also corrected a misleading comment about globals -- if overridden they may not just have a different size, they may be null and completely unsafe to load from! Hopefully this confuses the next reader a bit less. I don't have any test cases or anything, the patch is motivated strictly to improve the readability of the code. llvm-svn: 220156	2014-10-19 00:42:16 +00:00
Bob Wilson	1e1f13862e	Use triple predicate functions instead of checking values directly. NFC. llvm-svn: 220155	2014-10-19 00:39:30 +00:00
Chandler Carruth	38e98d5782	Rename 'TD' to 'DL' in this function as the argument is now a DataLayout argument. llvm-svn: 220151	2014-10-18 23:47:22 +00:00
Chandler Carruth	1f27f03849	Fix the other comment to use modern doxygen style and be a bit more direct. Notably, comment on the fact that the loaded type is significant in that it determines how wide of an access must be safe. llvm-svn: 220150	2014-10-18 23:46:17 +00:00
Chandler Carruth	be49df3d2c	More formatting cleanup brought to you by clang-format. llvm-svn: 220149	2014-10-18 23:41:25 +00:00
Chandler Carruth	b56052f44d	Clean up doxygen syntax and reword comments to flow better, have a brief section, and not have unfinished sentence fragments. llvm-svn: 220147	2014-10-18 23:31:55 +00:00
Chandler Carruth	d67244df4e	Clean up the formatting and trailing whitespace of a routine before editting it. llvm-svn: 220146	2014-10-18 23:19:03 +00:00
Lang Hames	9d7f81fff9	[PBQP] Move register-allocation specific PBQP code into RegAllocPBQP.h. Just clean-up - no functional change. llvm-svn: 220145	2014-10-18 22:23:55 +00:00
Lang Hames	ad0962aec5	[PBQP] Replace the interference-constraints algorithm with a faster version loosely based on linear scan. On x86-64 this is good for a ~2% drop in compile time on the nightly test suite. llvm-svn: 220143	2014-10-18 17:26:07 +00:00
Chandler Carruth	be9dccd64d	Preserve AA metadata when combining (cast (load (...))) -> (load (cast (...))). llvm-svn: 220141	2014-10-18 11:00:12 +00:00
Chandler Carruth	2f75fcfef3	[InstCombine] Do an about-face on how LLVM canonicalizes (cast (load ...)) and (load (cast ...)): canonicalize toward the former. Historically, we've tried to load using the type of the pointer, and tried to match that type as closely as possible removing as many pointer casts as we could and trading them for bitcasts of the loaded value. This is deeply and fundamentally wrong. Repeat after me: memory does not have a type! This was a hard lesson for me to learn working on SROA. There is only one thing that should actually drive the type used for a pointer, and that is the type which we need to use to load from that pointer. Matching up pointer types to the loaded value types is very useful because it minimizes the physical size of the IR required for no-op casts. Similarly, the only thing that should drive the type used for a loaded value is how that value is used! Again, this minimizes casts. And in fact, the only thing motivating types in any part of LLVM's IR are the types used by the operations in the IR. We should match them as closely as possible. I've ended up removing some tests here as they were testing bugs or behavior that is no longer present. Mostly though, this is just cleanup to let the tests continue to function as intended. The only fallout I've found so far from this change was SROA and I have fixed it to not be impeded by the different type of load. If you find more places where this change causes optimizations not to fire, those too are likely bugs where we are assuming that the type of pointers is "significant" for optimization purposes. llvm-svn: 220138	2014-10-18 06:36:22 +00:00
Chandler Carruth	71009cad95	Remove a test that was ported from the old llvm-gcc frontend test suite. This test is pretty awesome. It is claiming to test devirtualization. However, the code in question is not in fact devirtualized by LLVM. If you take the original C++ test case and run it through Clang at -O3 we fail to devirtualize it completely. It also isn't a sufficiently focused test case. The reason we fail to devirtualize it isn't because of any missing instcombine though. Instead, it is because we fail to emit an available externally vtable and thus the vtable is just an external and completely opaque. If I cause the vtable to be emitted, we successfully devirtualize things. Anyways, I'm just removing it because it is providing negative value at this point: it isn't representative of the output of Clang really, LLVM isn't doing the transform it claims to be testing, LLVM's failure to do the transform isn't actually an LLVM bug at all and we shouldn't be testing for it here, and finally the test is written in such a way that it will trivially pass even when the point of the test is failing. llvm-svn: 220137	2014-10-18 06:36:18 +00:00
Nick Kledzik	4e1ef9951d	[llvm-objdump] don't test timestamp dump as that is time zone dependent llvm-svn: 220123	2014-10-18 02:28:01 +00:00
Nick Kledzik	600f245b8a	[llvm-objdump] enhance test case for mach-o -private-headers llvm-svn: 220120	2014-10-18 01:50:55 +00:00
Nick Kledzik	3b2aa057e6	[llvm-objdump] Fix mach-o binding decompression error llvm-svn: 220119	2014-10-18 01:21:02 +00:00
Chandler Carruth	2dc9682e59	[SROA] Change how SROA does vector-based promotion of allocas to handle cases where the alloca type, the load types, and the store types used all disagree. Previously, the only way that vector-based promotion occured was if the alloca type was a vector type. This was one of the very few remaining uses of the alloca's type to guide SROA/mem2reg left in LLVM. It turns out it was a bad idea. The alloca type can change very easily based on the mixture of types loaded and stored to that alloca. We shouldn't be relying on it as a signal for very much. Instead, the source of truth should be loads and stores. We should canonicalize the loads and stores as much as possible and then rely on them exclusively in SROA. When looking and loads and stores, we may find many different candidate vector types. This change will let SROA try all of them to find a vector type which is a viable way to promote the entire alloca to a vector register. With this change, it becomes possible to do better canonicalization and optimization of loads and stores without breaking SROA in random ways, and that should allow fixing a core source of performance loss in hot numerical loops such as those in Eigen. llvm-svn: 220116	2014-10-18 00:44:02 +00:00
Aaron Watry	8114437a8f	R600/SI: Add global atomicrmw xchg v2: Add separate offset/no-offset tests Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220110	2014-10-17 23:33:03 +00:00
Aaron Watry	d672ee2a47	R600/SI: Add global atomicrmw xor v2: Add separate offset/no-offset tests Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220109	2014-10-17 23:33:01 +00:00
Aaron Watry	8a911e6926	R600/SI: Add global atomicrmw or v2: Add separate offset/no-offset tests Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220108	2014-10-17 23:32:59 +00:00
Aaron Watry	58c9992f15	R600/SI: Add global atomicrmw min/umin v2: Add separate offset/no-offset tests Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220107	2014-10-17 23:32:57 +00:00
Aaron Watry	29f295d7a5	R600/SI: Add global atomicrmw max/umax v2: Add separate offset/no-offset tests Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220106	2014-10-17 23:32:56 +00:00
Aaron Watry	621278034c	R600/SI: Add global atomicrmw and v2: Add separate offset/no-offset tests Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220105	2014-10-17 23:32:54 +00:00
Aaron Watry	328f1bae8e	R600/SI: Add global atomicrmw sub v2: Add separate offset/no-offset tests Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220104	2014-10-17 23:32:52 +00:00
Aaron Watry	28682cf205	R600/SI: Fix/add tests for atomicrmw add The previous tests claimed to test constant offsets in the function name, but the tests weren't actually testing them. Clone the tests, and do testing of all combinations of the following: 1) with/without constant pointer offset 2) 32/64-bit addressing modes 3) Usage and non-usage of the return value from the atomicrmw Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220103	2014-10-17 23:32:50 +00:00
Aaron Watry	1d13d36520	R600: Rename atomic_load global tests to atomic_add The function name now matches what it's actually testing. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 220102	2014-10-17 23:32:49 +00:00
Evgeniy Stepanov	e08633e900	[msan] Fix handling of byval arguments with large alignment. MSan param-tls slots are 8-byte aligned. This change clips alignment of memcpy into param-tls to 8. llvm-svn: 220101	2014-10-17 23:29:44 +00:00
Pete Cooper	230332f4fe	Check for dynamic alloca's when selecting lifetime intrinsics. TL;DR: Indexing maps with [] creates missing entries. The long version: When selecting lifetime intrinsics, we index the static alloca map with the AllocaInst we find for that lifetime. Trouble is, we don't first check to see if this is a dynamic alloca. On the attached example, this causes a dynamic alloca to create an entry in the static map, and returns 0 (the default) as the frame index for that lifetime. 0 was used for the frame index of the stack protector, which given that it now has a lifetime, is coloured, and merged with other stack slots. PEI would later trigger an assert because it expects the stack protector to not be dead. This fix ensures that we only get frame indices for static allocas, ie, those in the map. Dynamic ones are effectively dropped, which is suboptimal, but at least isn't completely broken. rdar://problem/18672951 llvm-svn: 220099	2014-10-17 22:59:33 +00:00
Bill Schmidt	296bb31f64	[PowerPC] Disable +vsx RUN line for fma.ll due to inconsistency on other builders llvm-svn: 220094	2014-10-17 21:32:22 +00:00
Rafael Espindola	7da1ea83a9	Revert "TRE: make TRE a bit more aggressive" This reverts commit r219899. This also updates byval-tail-call.ll to make it clear what was breaking. Adding r219899 again will cause the load/store to disappear. llvm-svn: 220093	2014-10-17 21:25:48 +00:00
Bill Schmidt	ba637db298	[PowerPC] Change assert to better form llvm-svn: 220092	2014-10-17 21:19:59 +00:00
Matt Arsenault	a708358e93	R600/SI: Remove redundant setting of instruction bits These are all set on the instruction base classes. llvm-svn: 220091	2014-10-17 21:13:11 +00:00
Bill Schmidt	a087d74250	[PowerPC] Change liveness testing in VSX FMA mutation pass With VSX enabled, LLVM crashes when compiling test/CodeGen/PowerPC/fma.ll. I traced this to the liveness test that's revised in this patch. The interval test is designed to only work for virtual registers, but in this case the AddendSrcReg is physical. Since there is already a walk of the MIs between the AddendMI and the FMA, I added a check for def/kill of the AddendSrcReg in that loop. At Hal Finkel's request, I converted the liveness test to an assert restricted to virtual registers. I've changed the fma.ll test to have VSX and non-VSX variants so we can test both kinds of multiply-adds. llvm-svn: 220090	2014-10-17 21:02:44 +00:00
Peter Collingbourne	ce80084446	Disable ccache for go tests. Should fix llvm-clang-lld-x86_64-debian-fast bot. llvm-svn: 220071	2014-10-17 18:32:36 +00:00
Matt Arsenault	933c38df40	Fix typo llvm-svn: 220068	2014-10-17 18:02:31 +00:00
Matt Arsenault	e184482bf8	R600/SI: Also check for FPImm literal constants llvm-svn: 220067	2014-10-17 18:00:50 +00:00
Matt Arsenault	d282ada508	R600/SI: Allow commuting with source modifiers llvm-svn: 220066	2014-10-17 18:00:48 +00:00
Matt Arsenault	8943d24949	R600/SI: Simplify code with hasModifiersSet llvm-svn: 220065	2014-10-17 18:00:45 +00:00
Matt Arsenault	ace5b76739	R600/SI: Fix general commuting breaking src mods The generic code trying to use findCommutedOpIndices won't understand that it needs to swap the modifier operands also, so it should fail if they are set. llvm-svn: 220064	2014-10-17 18:00:43 +00:00
Matt Arsenault	ffc5d5bbf0	R600/SI: Cleanup code with ChangeToFPImmediate llvm-svn: 220063	2014-10-17 18:00:41 +00:00
Matt Arsenault	6d3cd544bb	R600/SI: Allow comuting fp immediates llvm-svn: 220062	2014-10-17 18:00:39 +00:00
Matt Arsenault	aa5ccfb566	R600/SI: Use early return instead of checking condition twice Any commutable instruction will have at least src1. llvm-svn: 220061	2014-10-17 18:00:37 +00:00
Peter Collingbourne	fde87a0cdf	We also need to catch OSError here. llvm-svn: 220058	2014-10-17 17:46:46 +00:00
Matt Arsenault	328b1193b5	R600/SI: Use complex pattern for MUBUF load patterns. This eliminates a use of the SI_ADDR64_RSRC pseudo llvm-svn: 220057	2014-10-17 17:43:00 +00:00
Matt Arsenault	83a535ff6b	R600/SI: Remove SI_BUFFER_RSRC pseudo Just use REG_SEQUENCE directly, so there are fewer instructions to need to deal with later. llvm-svn: 220056	2014-10-17 17:42:56 +00:00
Juergen Ributzka	ad2363f9ee	[Stackmaps] Enable invoking the patchpoint intrinsic. Patch by Kevin Modzelewski Reviewers: atrick, ributzka Reviewed By: ributzka Subscribers: llvm-commits, reames Differential Revision: http://reviews.llvm.org/D5634 llvm-svn: 220055	2014-10-17 17:39:00 +00:00
Andrea Di Biagio	c48cb86f05	[X86] Fix missed selection of non-temporal store of zero vector. When the input to a store instruction was a zero vector, the backend always selected a normal vector store regardless of the non-temporal hint. This is fixed by this patch. This fixes PR19370. llvm-svn: 220054	2014-10-17 17:27:06 +00:00
James Molloy	f497d5511d	[AArch64] Fix a silent codegen fault in BUILD_VECTOR lowering. We should be talking about the number of source elements, not the number of destination elements, given we know at this point that the source and dest element numbers are not the same. While we're at it, avoid writing to std::vector::end()... Bug found with random testing and a lot of coffee. llvm-svn: 220051	2014-10-17 17:06:31 +00:00
Rafael Espindola	44c661b611	Don't crash if find_executable return None. This was crashing when trying to run the tests on Windows. llvm-svn: 220048	2014-10-17 16:07:43 +00:00
Bill Schmidt	2d1128acb2	[PowerPC] Enable use of lxvw4x/stxvw4x in VSX code generation Currently the VSX support enables use of lxvd2x and stxvd2x for 2x64 types, but does not yet use lxvw4x and stxvw4x for 4x32 types. This patch adds that support. As with lxvd2x/stxvd2x, this involves straightforward overriding of the patterns normally recognized for lvx/stvx, with preference given to the VSX patterns when VSX is enabled. In addition, the logic for permitting misaligned memory accesses is modified so that v4r32 and v4i32 are treated the same as v2f64 and v2i64 when VSX is enabled. Finally, the DAG generation for unaligned loads is changed to just use a normal LOAD (which will become lxvw4x) on P8 and later hardware, where unaligned loads are preferred over lvsl/lvx/lvx/vperm. A number of tests now generate the VSX loads/stores instead of lvx/stvx, so this patch adds VSX variants to those tests. I've also added <4 x float> tests to the vsx.ll test case, and created a vsx-p8.ll test case to be used for testing code generation for the P8Vector feature. For now, that simply tests the unaligned load/store behavior. This has been tested along with a temporary patch to enable the VSX and P8Vector features, with no new regressions encountered with or without the temporary patch applied. llvm-svn: 220047	2014-10-17 15:13:38 +00:00
Jan Vesely	54468a5a58	Mips: Only set divrem i64 to custom on 64bit Reviewed-by: Daniel Sanders <daniel.sanders@imgtec.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 220046	2014-10-17 14:45:28 +00:00
Jan Vesely	b535c902e6	R600: Add EG to FMA test Reviewed-by: Tom Stellard <tom@stellard.net> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 220045	2014-10-17 14:45:27 +00:00
Jan Vesely	af62cf4db0	SelectionDAG: Add sext_inreg optimizations v2: use dyn_cast fixup comments v3: use cast Reviewed-by: Matt Arsenault <arsenm2@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 220044	2014-10-17 14:45:25 +00:00
Vasileios Kalintiris	238692beb9	[mips] Add support for COP1's Branch-On-Cond-Likely instructions Summary: Depends on D5782 Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5802 llvm-svn: 220042	2014-10-17 14:08:28 +00:00
Vasileios Kalintiris	6d1e64896d	[mips] Add support for COP0's Branch-On-Cond-Likely instructions Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5782 llvm-svn: 220036	2014-10-17 12:38:35 +00:00
Hal Finkel	dd38c0b876	[DSE] Remove no-data-layout-only type-based overlap checking DSE's overlap checking contained special logic, used only when no DataLayout was available, which inferred a complete overwrite when the pointee types were equal. This logic seems fine for regular loads/stores, but does not work for memcpy and friends. Instead of fixing this, I'm just removing it. Philosophically, transformations should not contain enhanced behavior used only when data layout is lacking (data layout should be strictly additive), and maintaining these rarely-tested code paths seems not worthwhile at this stage. Credit to Aliaksei Zasenka for the bug report and the diagnosis. The test case (slightly reduced from that provided by Aliaksei) replaces the original contents of test/Transforms/DeadStoreElimination/no-targetdata.ll -- a few other tests have been updated to have a data layout. llvm-svn: 220035	2014-10-17 11:56:00 +00:00
Peter Collingbourne	7e350c927c	Fix bashism in build.sh. llvm-svn: 220027	2014-10-17 02:20:40 +00:00
Rafael Espindola	b66130209b	Add back commits r219835 and a fixed version of r219829. The only difference from r219829 is using getOrCreateSectionSymbol(*ELFSec) instead of GetOrCreateSymbol(ELFSec->getSectionName()) in ELFObjectWriter which causes us to use the correct section symbol even if we have multiple sections with the same name. Original messages: r219829: Correctly handle references to section symbols. When processing assembly like .long .text we were creating a new undefined symbol .text. GAS on the other hand would handle that as a reference to the .text section. This patch implements that by creating the section symbols earlier so that they are visible during asm parsing. The patch also updates llvm-readobj to print the symbol number in the relocation dump so that the test can differentiate between two sections with the same name. r219835: Allow forward references to section symbols. llvm-svn: 220021	2014-10-17 01:48:58 +00:00
Bill Schmidt	32e9c6465b	[PPC] Adjust some PowerPC tests to account for presence/absence of VSX Patch by Bill Seurer; committed on his behalf. These test cases generate slightly different code sequences when VSX is activated and thus fail. The update turns off VSX explicitly for the existing checks and then adds a second set of checks for most of them that test the VSX instruction output. llvm-svn: 220019	2014-10-17 01:41:22 +00:00
Rafael Espindola	0eb2cec936	Add a test that would have found the bug in r219829. llvm-svn: 220016	2014-10-17 01:34:23 +00:00
Akira Hatanaka	0d0c78180d	ARM: Fix a bug which was causing convergence failure in constant-island pass. The bug is in ARMConstantIslands::createNewWater where the upper bound of the new water split point is computed: // This could point off the end of the block if we've already got constant // pool entries following this block; only the last one is in the water list. // Back past any possible branches (allow for a conditional and a maximally // long unconditional). if (BaseInsertOffset + 8 >= UserBBI.postOffset()) { BaseInsertOffset = UserBBI.postOffset() - UPad - 8; DEBUG(dbgs() << format("Move inside block: %#x\n", BaseInsertOffset)); } The split point is supposed to be somewhere between the machine instruction that loads from the constant pool entry and the end of the basic block, before branch instructions. The code above is fine if the basic block is large enough and there are a sufficient number of instructions following the machine instruction. However, if the machine instruction is near the end of the basic block, BaseInsertOffset can point to the machine instruction or another instruction that precedes it, and this can lead to convergence failure. This commit fixes this bug by ensuring BaseInsertOffset is larger than the offset of the instruction following the constant-loading instruction. rdar://problem/18581150 llvm-svn: 220015	2014-10-17 01:31:47 +00:00
Rafael Espindola	4544a4062c	Revert commit r219835 and r219829. Revert "Correctly handle references to section symbols." Revert "Allow forward references to section symbols." Rui found a regression I am debugging. llvm-svn: 220010	2014-10-17 01:06:02 +00:00
Peter Zotov	9e40430102	[OCaml] Add Llvm.instr_clone. llvm-svn: 220008	2014-10-17 01:02:40 +00:00
Peter Zotov	aff492c6fd	[LLVM-C] Add LLVMInstructionClone. llvm-svn: 220007	2014-10-17 01:02:34 +00:00
Alexander Potapenko	7aaf514092	[llvm-symbolizer] Introduce the -dsym-hint option. llvm-symbolizer will consult one of the .dSYM paths passed via -dsym-hint if it fails to find the .dSYM bundle at the default location. llvm-svn: 220004	2014-10-17 00:50:19 +00:00
Matt Arsenault	bfaab76f6b	R600/SI: Simplify debug printing llvm-svn: 219999	2014-10-17 00:36:20 +00:00
Peter Collingbourne	659670d933	Add our own copy of the find_executable function to cope with installations that do not have the distutils.spawn package. Should hopefully fix the aarch64 buildbot. llvm-svn: 219991	2014-10-16 23:43:20 +00:00
Matt Arsenault	661a031af6	R600/SI: Remove another VALU pattern llvm-svn: 219988	2014-10-16 23:33:37 +00:00
Peter Collingbourne	82e3e373b3	Initial version of Go bindings. This code is based on the existing LLVM Go bindings project hosted at: https://github.com/go-llvm/llvm Note that all contributors to the gollvm project have agreed to relicense their changes under the LLVM license and submit them to the LLVM project. Differential Revision: http://reviews.llvm.org/D5684 llvm-svn: 219976	2014-10-16 22:48:02 +00:00
Peter Collingbourne	e186319319	Introduce LLVMParseCommandLineOptions C API function. llvm-svn: 219975	2014-10-16 22:47:52 +00:00
Juergen Ributzka	fd4633e1a5	Reduce code duplication between patchpoint and non-patchpoint lowering. NFC. This is in preparation for another patch that makes patchpoints invokable. Reviewers: atrick, ributzka Reviewed By: ributzka Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5657 llvm-svn: 219967	2014-10-16 21:26:35 +00:00
Chandler Carruth	8393406f05	[SROA] Switch the common variable name for the 'AllocaSlices' class to 'AS'. Using 'S' as this was a terrible idea. Arguably, 'AS' is not much better, but it at least follows the idea of using initialisms and removes active confusion about the AllocaSlices variable and a Slice variable. llvm-svn: 219963	2014-10-16 21:11:55 +00:00
Chandler Carruth	61747042c1	[SROA] More range-based cleanups to SROA, these brought to you by clang-modernize. I did have to clean up the variable types and whitespace a bit because the use of auto made the code much less readable here. llvm-svn: 219962	2014-10-16 21:05:14 +00:00
Chandler Carruth	57d4cae202	[SROA] Switch a couple of overly complex iterator accessors to just be ArrayRef accessors. I think this even came up in review that this was over-engineered, and indeed it was. Time to un-build it. llvm-svn: 219958	2014-10-16 20:42:08 +00:00
Robin Morisset	e2de06bef6	Erase fence insertion from SelectionDAGBuilder.cpp (NFC) Summary: Backends can use setInsertFencesForAtomic to signal to the middle-end that montonic is the only memory ordering they can accept for stores/loads/rmws/cmpxchg. The code lowering those accesses with a stronger ordering to fences + monotonic accesses is currently living in SelectionDAGBuilder.cpp. In this patch I propose moving this logic out of it for several reasons: - There is lots of redundancy to avoid: extremely similar logic already exists in AtomicExpand. - The current code in SelectionDAGBuilder does not use any target-hooks, it does the same transformation for every backend that requires it - As a result it is plain unsound, as it was apparently designed for ARM. It happens to mostly work for the other targets because they are extremely conservative, but Power for example had to switch to AtomicExpand to be able to use lwsync safely (see r218331). - Because it produces IR-level fences, it cannot be made sound ! This is noted in the C++11 standard (section 29.3, page 1140): ``` Fences cannot, in general, be used to restore sequential consistency for atomic operations with weaker ordering semantics. ``` It can also be seen by the following example (called IRIW in the litterature): ``` atomic<int> x = y = 0; int r1, r2, r3, r4; Thread 0: x.store(1); Thread 1: y.store(1); Thread 2: r1 = x.load(); r2 = y.load(); Thread 3: r3 = y.load(); r4 = x.load(); ``` r1 = r3 = 1 and r2 = r4 = 0 is impossible as long as the accesses are all seq_cst. But if they are lowered to monotonic accesses, no amount of fences can prevent it.. This patch does three things (I could cut it into parts, but then some of them would not be tested/testable, please tell me if you would prefer that): - it provides a default implementation for emitLeadingFence/emitTrailingFence in terms of IR-level fences, that mimic the original logic of SelectionDAGBuilder. As we saw above, this is unsound, but the best that can be done without knowing the targets well (and there is a comment warning about this risk). - it then switches Mips/Sparc/XCore to use AtomicExpand, relying on this default implementation (that exactly replicates the logic of SelectionDAGBuilder, so no functional change) - it finally erase this logic from SelectionDAGBuilder as it is dead-code. Ideally, each target would define its own override for emitLeading/TrailingFence using target-specific fences, but I do not know the Sparc/Mips/XCore memory model well enough to do this, and they appear to be dealing fine with the ARM-inspired default expansion for now (probably because they are overly conservative, as Power was). If anyone wants to compile fences more agressively on these platforms, the long comment should make it clear why he should first override emitLeading/TrailingFence. Test Plan: make check-all, no functional change Reviewers: jfb, t.p.northover Subscribers: aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D5474 llvm-svn: 219957	2014-10-16 20:34:57 +00:00
Matt Arsenault	70c82173f3	R600/SI: Remove unnecessary VALU patterns These haven't been necessary since allowing selecting SALU instructions in non-entry blocks was enabled. llvm-svn: 219956	2014-10-16 20:31:50 +00:00
Chandler Carruth	c659df9389	[SROA] Start more deeply moving SROA to use ranges rather than just iterators. There are a ton of places where it essentially wants ranges rather than just iterators. This is just the first step that adds the core slice range typedefs and uses them in a couple of places. I still have to explicitly construct them because they've not been punched throughout the entire set of code. More range-based cleanups incoming. llvm-svn: 219955	2014-10-16 20:24:07 +00:00
Matt Arsenault	a3fe7c62d1	R600: Fix nonsensical implementation of computeKnownBits for BFE This was resulting in invalid simplifications of sdiv llvm-svn: 219953	2014-10-16 20:07:40 +00:00
Rafael Espindola	11aaaeebe0	Delete -std-compile-opts. These days -std-compile-opts was just a silly alias for -O3. llvm-svn: 219951	2014-10-16 20:00:02 +00:00
Bjorn Steinbrink	d20816fde9	Allow call-slop optzn for destinations with a suitable dereferenceable attribute Summary: Currently, call slot optimization requires that if the destination is an argument, the argument has the sret attribute. This is to ensure that the memory access won't trap. In addition to sret, we can also allow the optimization to happen for arguments that have the new dereferenceable attribute, which gives the same guarantee. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5832 llvm-svn: 219950	2014-10-16 19:43:08 +00:00
Jonathan Roelofs	ec81c0b40d	Fix lang-ref doc bug: s/icmp lt/icmp slt/ llvm-svn: 219947	2014-10-16 19:28:10 +00:00
Nick Kledzik	15558914ab	[llvm-objdump] Fix -private-headers for mach-o to print all LC_*_DYLIB variants llvm-svn: 219945	2014-10-16 18:58:20 +00:00
Sanjay Patel	c699a6117b	fold: sqrt(x * x * y) -> fabs(x) * sqrt(y) If a square root call has an FP multiplication argument that can be reassociated, then we can hoist a repeated factor out of the square root call and into a fabs(). In the simplest case, this: y = sqrt(x * x); becomes this: y = fabs(x); This patch relies on an earlier optimization in instcombine or reassociate to put the multiplication tree into a canonical form, so we don't have to search over every permutation of the multiplication tree. Because there are no IR-level FastMathFlags for intrinsics (PR21290), we have to use function-level attributes to do this optimization. This needs to be fixed for both the intrinsics and in the backend. Differential Revision: http://reviews.llvm.org/D5787 llvm-svn: 219944	2014-10-16 18:48:17 +00:00
Juergen Ributzka	03a0611061	[AArch64] Fix miscompile of sdiv-by-power-of-2. When the constant divisor was larger than 32bits, then the optimized code generated for the AArch64 backend would emit the wrong code, because the shift was defined as a shift of a 32bit constant '(1<<Lg2(divisor))' and we would loose the upper 32bits. This fixes rdar://problem/18678801. llvm-svn: 219934	2014-10-16 16:41:15 +00:00
Vasileios Kalintiris	167c372118	[mips] Account for endianess when expanding BuildPairF64/ExtractElementF64 nodes. Summary: In order to support big endian targets for the BuildPairF64 nodes we just need to swap the low/high pair registers. Additionally, for the ExtractElementF64 nodes we have to calculate the correct stack offset with respect to the node's register/operand that we want to extract. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5753 llvm-svn: 219931	2014-10-16 15:41:51 +00:00
Vasileios Kalintiris	711028f718	[mips] Marked the DI/EI instruction aliases as MIPS32r2 Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5751 llvm-svn: 219927	2014-10-16 15:23:52 +00:00
Vasileios Kalintiris	f445a56b61	Test commit access: remove extra new line at the end of file llvm-svn: 219925	2014-10-16 14:37:00 +00:00
Benjamin Kramer	0445380f4f	Add missing header guard. llvm-svn: 219922	2014-10-16 10:10:07 +00:00
Akira Hatanaka	5c221ef98f	Reapply r219832 - InstCombine: Narrow switch instructions using known bits. The code committed in r219832 asserted when it attempted to shrink a switch statement whose type was larger than 64-bit. llvm-svn: 219902	2014-10-16 06:00:46 +00:00
Saleem Abdulrasool	7f52921976	TRE: make TRE a bit more aggressive Make tail recursion elimination a bit more aggressive. This allows us to get tail recursion on functions that are just branches to a different function. The fact that the function takes a byval argument does not restrict it from being optimised into just a tail call. llvm-svn: 219899	2014-10-16 03:27:30 +00:00
Akira Hatanaka	40c2cf4afc	Revert r219832. llvm-svn: 219884	2014-10-16 01:17:02 +00:00
Hal Finkel	2400c96cc3	[LVI] Add some additional comments about caching and context instructions Philip Reames and I had a long conversation about this, mostly because it is not obvious why the current logic is correct. Hopefully, these comments will prevent such confusion in the future. llvm-svn: 219882	2014-10-16 00:40:05 +00:00
NAKAMURA Takumi	e870f23389	llvm/Support/Options.h: Use \tparam. [-Wdocumentation] llvm-svn: 219881	2014-10-16 00:14:57 +00:00
Matt Arsenault	f1b34cf6b6	R600: Remove dead function llvm-svn: 219879	2014-10-16 00:08:09 +00:00
Sanjoy Das	360b1ed5f2	Revert "r219834 - Teach ScalarEvolution to sharpen range information" This change breaks the asan buildbots: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/13468 llvm-svn: 219878	2014-10-15 23:46:04 +00:00
Hal Finkel	68dc3c7ab2	Preserve non-byval pointer alignment attributes using @llvm.assume when inlining For pointer-typed function arguments, enhanced alignment can be asserted using the 'align' attribute. When inlining, if this enhanced alignment information is not otherwise available, preserve it using @llvm.assume-based alignment assumptions. llvm-svn: 219876	2014-10-15 23:44:41 +00:00

... 3 4 5 6 7 ...

109208 Commits