llvm-project

Commit Graph

Author	SHA1	Message	Date
Tom Stellard	4ad4177399	R600: Add missing file to CMakeLists.txt llvm-svn: 220998	2014-10-31 20:56:36 +00:00
Tom Stellard	5b2927fe83	R600: Don't promote allocas when one of the users is a ptrtoint instruction We need to figure out how to track ptrtoint values all the way until result is converted back to a pointer in order to correctly rewrite the pointer type. llvm-svn: 220997	2014-10-31 20:52:04 +00:00
Tom Stellard	aa73831757	R600: Make sure to inline all internal functions Function calls aren't supported yet. llvm-svn: 220996	2014-10-31 20:52:02 +00:00
Duncan P. N. Exon Smith	7779b4fd8e	IR: Instruction::setMetadata() should use cast_or_null Not sure why this assertion didn't fire locally [1], but in r220994 `Instruction::setMetadata()` should be using `cast_or_null`. [1]: http://lab.llvm.org:8011/builders/llvm-hexagon-elf/builds/12327 llvm-svn: 220995	2014-10-31 20:28:04 +00:00
Duncan P. N. Exon Smith	e5d641ebca	IR: MDNode => Value: Instruction::setMetadata() Change `Instruction::setMetadata()` API to accept `Value` instead of `MDNode`. Part of PR21433. llvm-svn: 220994	2014-10-31 20:13:11 +00:00
Bill Schmidt	1ca69fa64d	[PowerPC] Initial VSX intrinsic support, with min/max for vector double Now that we have initial support for VSX, we can begin adding intrinsics for programmer access to VSX instructions. This patch adds basic support for VSX intrinsics in general, and tests it by implementing intrinsics for minimum and maximum for the vector double data type. The LLVM portion of this is quite straightforward. There is a companion patch for Clang. llvm-svn: 220988	2014-10-31 19:19:07 +00:00
Chad Rosier	7bb413e3ba	[AArch64] Check Dest Register Liveness in CondOpt pass. Our internal test reveals such case should not be transformed: cmp x17, #3 b.lt .LBB10_15 ... subs x12, x12, #1 b.gt .LBB10_1 where x12 is a liveout, becomes: cmp x17, #2 b.le .LBB10_15 ... subs x12, x12, #2 b.ge .LBB10_1 Unable to provide test case as it's difficult to reproduce on community branch. http://reviews.llvm.org/D6048 Patch by Zhaoshi Zheng <zhaoshiz@codeaurora.org>! llvm-svn: 220987	2014-10-31 19:02:38 +00:00
Kostya Serebryany	ea48bdc702	[asan] do not treat inline asm calls as indirect calls llvm-svn: 220985	2014-10-31 18:38:23 +00:00
Quentin Colombet	c32615dfef	[CodeGenPrepare] Move extractelement close to store if they can be combined. This patch adds an optimization in CodeGenPrepare to move an extractelement right before a store when the target can combine them. The optimization may promote any scalar operations to vector operations in the way to make that possible. Context Some targets use different register files for both vector and scalar operations. This means that transitioning from one domain to another may incur copy from one register file to another. These copies are not coalescable and may be expensive. For example, according to the scheduling model, on cortex-A8 a vector to GPR move is 20 cycles. Motivating Example Let us consider an example: define void @foo(<2 x i32>* %addr1, i32* %dest) { %in1 = load <2 x i32>* %addr1, align 8 %extract = extractelement <2 x i32> %in1, i32 1 %out = or i32 %extract, 1 store i32 %out, i32* %dest, align 4 ret void } As it is, this IR generates the following assembly on armv7: vldr d16, [r0] @vector load vmov.32 r0, d16[1] @ cross-register-file copy: 20 cycles orr r0, r0, #1 @ scalar bitwise or str r0, [r1] @ scalar store bx lr Whereas we could generate much faster code: vldr d16, [r0] @ vector load vorr.i32 d16, #0x1 @ vector bitwise or vst1.32 {d16[1]}, [r1:32] @ vector extract + store bx lr Half of the computation made in the vector is useless, but this allows to get rid of the expensive cross-register-file copy. Proposed Solution To avoid this cross-register-copy penalty, we promote the scalar operations to vector operations. The penalty will be removed if we manage to promote the whole chain of computation in the vector domain. Currently, we do that only when the chain of computation ends by a store and the target is able to combine an extract with a store. Stores are the most likely candidates, because other instructions produce values that would need to be promoted and so, extracted as some point[1]. Moreover, this is customary that targets feature stores that perform a vector extract (see AArch64 and X86 for instance). The proposed implementation relies on the TargetTransformInfo to decide whether or not it is beneficial to promote a chain of computation in the vector domain. Unfortunately, this interface is rather inaccurate for this level of details and although this optimization may be beneficial for X86 and AArch64, the inaccuracy will lead to the optimization being too aggressive. Basically in TargetTransformInfo, everything that is legal has a cost of 1, whereas, even if a vector type is legal, usually a vector operation is slightly more expensive than its scalar counterpart. That will lead to too many promotions that may not be counter balanced by the saving of the cross-register-file copy. For instance, on AArch64 this penalty is just 4 cycles. For now, the optimization is just enabled for ARM prior than v8, since those processors have a larger penalty on cross-register-file copies, and the scope is limited to basic blocks. Because of these two factors, we limit the effects of the inaccuracy. Indeed, I did not want to build up a fancy cost model with block frequency and everything on top of that. [1] We can imagine targets that can combine an extractelement with other instructions than just stores. If we want to go into that direction, the current interfaces must be augmented and, moreover, I think this becomes a global isel problem. Differential Revision: http://reviews.llvm.org/D5921 <rdar://problem/14170854> llvm-svn: 220978	2014-10-31 17:52:53 +00:00
Kostya Serebryany	001ea5fe15	[asan] fix caller-calee instrumentation to emit new cache for every call site llvm-svn: 220973	2014-10-31 17:11:27 +00:00
David Blaikie	626507fab3	Update the non-pthreads fallback for RWMutex on Unix Tested this by #if 0'ing out the pthreads implementation, which indicated that this fallback was not currently compiling successfully and applying this patch resolves that. Patch by Andy Chien. llvm-svn: 220969	2014-10-31 17:02:30 +00:00
Rafael Espindola	67926f10e8	Unify and update link-messages.ll and redefinition.ll. NFC. llvm-svn: 220968	2014-10-31 16:52:30 +00:00
David Blaikie	fb2efde341	Correct assert text from r220923 Noticed in post-commit review by Adrian Prantl. llvm-svn: 220967	2014-10-31 16:45:36 +00:00
Rafael Espindola	3e8bc6a8c3	Mark a few variables const. NFC. llvm-svn: 220964	2014-10-31 16:08:17 +00:00
NAKAMURA Takumi	0ad1c308ea	[CMake] llvm/examples: Update libdeps for unoptimized builds. llvm-svn: 220962	2014-10-31 15:27:16 +00:00
Chad Rosier	a675e550ca	[AArch64] CondOpt pass is missing FCMP instructions when searching backward for a CMP which defines the flags used by B.CC. http://reviews.llvm.org/D6047 Patch by Zhaoshi Zheng <zhaoshiz@codeaurora.org>! llvm-svn: 220961	2014-10-31 15:17:36 +00:00
Bradley Smith	9992b167ae	[SCEV] Improve Scalar Evolution's use of no {un,}signed wrap flags In a case where we have a no {un,}signed wrap flag on the increment, if RHS - Start is constant then we can avoid inserting a max operation bewteen the two, since we can statically determine which is greater. This allows us to unroll loops such as: void testcase3(int v) { for (int i=v; i<=v+1; ++i) f(i); } llvm-svn: 220960	2014-10-31 11:40:32 +00:00
Ulrich Weigand	c8c2ea2854	[PowerPC] Load BlockAddress values from the TOC in 64-bit SVR4 code Since block address values can be larger than 2GB in 64-bit code, they cannot be loaded simply using an @l / @ha pair, but instead must be loaded from the TOC, just like GlobalAddress, ConstantPool, and JumpTable values are. The commit also fixes a bug in PPCLinuxAsmPrinter::doFinalization where temporary labels could not be used as TOC values, since code would attempt (and fail) to use GetOrCreateSymbol to create a symbol of the same name as the temporary label. llvm-svn: 220959	2014-10-31 10:33:14 +00:00
Peter Zotov	e2b8b1431c	[OCaml] Ensure consistent naming. Specifically: * Directories match module names. * Test names match module names. * The language is called "OCaml", not "Ocaml". llvm-svn: 220958	2014-10-31 09:19:03 +00:00
Peter Zotov	b1f54ff42f	[OCaml] Rework Llvm_executionengine using ctypes. Since JIT->MCJIT migration, most of the ExecutionEngine interface became deprecated and/or broken. This especially affected the OCaml bindings, as runFunction is no longer available, and unlike in C, it is not possible to coerce a pointer to a function and call it in OCaml. In practice, LLVM 3.5 shipped completely unusable Llvm_executionengine. The GenericValue interface and runFunction were essentially a poor man's FFI. As such, this interface was removed and instead a dependency on ctypes >=0.3 added, which handled platform-specific aspects of accessing data and calling functions. The new interface does not expose JIT (which is a shim around MCJIT), as well as the interpreter (which can't handle a lot of valid IR). Llvm_executionengine.add_global_mapping is currently unusable due to PR20656. llvm-svn: 220957	2014-10-31 09:05:36 +00:00
Rafael Espindola	5b24759dff	Move an input file to Inputs instead of using RUN: true. llvm-svn: 220953	2014-10-31 05:54:15 +00:00
David Majnemer	c7d7c6fb3a	Object, COFF: Cleanup symbol type code, improve binutils compatibility Do a better job classifying symbols. This increases the consistency between the COFF handling code and the ELF side of things. llvm-svn: 220952	2014-10-31 05:07:00 +00:00
Rafael Espindola	e5204efeaf	merge tests for constant linking. llvm-svn: 220951	2014-10-31 05:04:16 +00:00
Rafael Espindola	0ae225b6b5	Move definition closer to use. NFC. llvm-svn: 220949	2014-10-31 04:46:38 +00:00
Hao Liu	e02b1a068f	PR20557: Fix the bug that bogus cpu parameter crashes llc on AArch64 backend. Initial patch by Oleg Ranevskyy. llvm-svn: 220945	2014-10-31 02:35:34 +00:00
NAKAMURA Takumi	c158884726	Threading.h: Give named parameters to llvm::call_once(flag,UserFn). [-Wdocumentation] llvm-svn: 220941	2014-10-31 00:54:20 +00:00
Ahmed Bougacha	9f336c4ec5	[SelectionDAG] When scalarizing trunc, don't assert for legal operands. r212242 introduced a legalizer hook, originally to let AArch64 widen v1i{32,16,8} rather than scalarize, because the legalizer expected, when scalarizing the result of a conversion operation, to already have scalarized the operands. On AArch64, v1i64 is legal, so that commit ensured operations such as v1i32 = trunc v1i64 wouldn't assert. It did that by choosing to widen v1 types whenever possible. However, v1i1 types, for which there's no legal widened type, would still trigger the assert. This commit fixes that, by only scalarizing a trunc's result when the operand has already been scalarized, and introducing an extract_elt otherwise. This is similar to r205625. Fixes PR20777. llvm-svn: 220937	2014-10-30 23:46:50 +00:00
Hans Wennborg	6cda0d7269	Speculative fix for Windows build after r220932 llvm-svn: 220936	2014-10-30 23:10:01 +00:00
Chris Bieneman	970555bc08	EXPORTED_SYMBOL_FILE using mingw and cmake Summary: This is a fix for the command line syntax error while building LTO when using MinGW. Patch By: jsroemer Reviewers: rnk Reviewed By: rnk Subscribers: rnk, beanz, llvm-commits Differential Revision: http://reviews.llvm.org/D5476 llvm-svn: 220935	2014-10-30 22:37:58 +00:00
NAKAMURA Takumi	d9913e6d35	llvm/test/Transforms/SampleProfile/syntax.ll: Relax MISSING-FILE not to check locale-aware message catalog. llvm-svn: 220934	2014-10-30 22:28:46 +00:00
Louis Gerbarg	e8f9c78247	Fix incorrect invariant check in DAG Combine Earlier this summer I fixed an issue where we were incorrectly combining multiple loads that had different constraints such alignment, invariance, temporality, etc. Apparently in one case I made copt paste error and swapped alignment and invariance. Tests included. rdar://18816719 llvm-svn: 220933	2014-10-30 22:21:03 +00:00
Chris Bieneman	14e2bcccfb	Removing the static initializer in ManagedStatic.cpp by using llvm_call_once to initialize the ManagedStatic mutex. Summary: This patch adds an llvm_call_once which is a wrapper around std::call_once on platforms where it is available and devoid of bugs. The patch also migrates the ManagedStatic mutex to be allocated using llvm_call_once. These changes are philosophically equivalent to the changes added in r219638, which were reverted due to a hang on Win32 which was the result of a bug in the Windows implementation of std::call_once. Reviewers: aaron.ballman, chapuni, chandlerc, rnk Reviewed By: rnk Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D5922 llvm-svn: 220932	2014-10-30 22:07:09 +00:00
Justin Bogner	d249a3b3b2	llvm-cov: Follow LLVM naming conventions This renames a few things that are using an unusual naming convention. llvm-svn: 220929	2014-10-30 20:57:49 +00:00
Justin Bogner	f6c5055764	llvm-cov: Don't manually parse an option for no reason We're using cl::opt here, but for some reason we're reading out one particular option by hand instead. This makes -help and the like behave rather poorly, so let's not do it this way. llvm-svn: 220928	2014-10-30 20:51:24 +00:00
Rafael Espindola	d1e64b1e93	Fix the merging of the constantness of declarations. The langref says: LLVM explicitly allows declarations of global variables to be marked constant, even if the final definition of the global is not. This capability can be used to enable slightly better optimization of the program, but requires the language definition to guarantee that optimizations based on the ‘constantness’ are valid for the translation units that do not include the definition. Given that definition, when merging two declarations, we have to drop constantness if of of them is not marked contant, since the Module without the constant marker might not have the necessary guarantees. llvm-svn: 220927	2014-10-30 20:50:23 +00:00
Justin Bogner	533e26103e	llvm-cov: Very basic top level help llvm-svn: 220926	2014-10-30 20:29:48 +00:00
Philip Reames	4cb4d3e048	Add handling for range metadata in ValueTracking isKnownNonZero If we load from a location with range metadata, we can use information about the ranges of the loaded value for optimization purposes. This helps to remove redundant checks and canonicalize checks for other optimization passes. This particular patch checks whether a value is known to be non-zero from the range metadata. Currently, these tests are against InstCombine. In theory, all of these should be InstSimplify since we're not inserting any new instructions. Moving the code may follow in a separate change. Reviewed by: Hal Differential Revision: http://reviews.llvm.org/D5947 llvm-svn: 220925	2014-10-30 20:25:19 +00:00
Rafael Espindola	f9a94abd4e	Update test to pass .ll to llvm-link and use Inputs. llvm-svn: 220924	2014-10-30 20:23:59 +00:00
David Blaikie	76fd43c653	PR21408: Workaround the appearance of duplicate variables due to problems when inlining two calls to the same function from the same call site. llvm-svn: 220923	2014-10-30 20:20:11 +00:00
Diego Novillo	047d5d1a0a	Fix comment spelling and tidy diagnostic call in profile reader. No functional changes. llvm-svn: 220922	2014-10-30 20:19:19 +00:00
Peter Zotov	cf8f7a10b7	lit: PR21417: don't try to update OCAMLPATH if LibDir is empty. llvm-svn: 220919	2014-10-30 19:26:42 +00:00
Diego Novillo	77a5a5fcda	Fix Twine corruption problem with diagnostics. This fixes the autobuilders I broke with a recent patch. Thanks echristo and dblaikie for beating me with a clue stick. llvm-svn: 220918	2014-10-30 18:48:41 +00:00
Diego Novillo	c572e92c76	Add profile writing capabilities for sampling profiles. Summary: This patch finishes up support for handling sampling profiles in both text and binary formats. The new binary format uses uleb128 encoding to represent numeric values. This makes profiles files about 25% smaller. The profile writer class can write profiles in the existing text and the new binary format. In subsequent patches, I will add the capability to read (and perhaps write) profiles in the gcov format used by GCC. Additionally, I will be adding support in llvm-profdata to manipulate sampling profiles. There was a bit of refactoring needed to separate some code that was in the reader files, but is actually common to both the reader and writer. The new test checks that reading the same profile encoded as text or raw, produces the same results. Reviewers: bogner, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6000 llvm-svn: 220915	2014-10-30 18:00:06 +00:00
Tim Northover	a94cd25188	ARM: test default values for TAG_CPU_unaligned_access attribute. It should be on for every target that supports unaligned accesses (e.g. not v6m). Patch by Charlie Turner. llvm-svn: 220912	2014-10-30 17:05:44 +00:00
Simon Atanasyan	6e3949a35c	[Mips] Add new Mips specific e_flags. No functional changes. llvm-svn: 220910	2014-10-30 14:56:02 +00:00
Robert Khasanov	af318f7073	[AVX512] Added VBROADCAST{SS/SD} encoding for VL subset. Refactored through AVX512_maskable llvm-svn: 220908	2014-10-30 14:21:47 +00:00
Peter Collingbourne	dd3486ece1	[dfsan] New calling convention for custom functions with variadic arguments. Summary: The previous calling convention prevented custom functions from being able to access argument labels unless it knew how many variadic arguments there were, and of which type. This restriction made it impossible to correctly model functions in the printf family, as it is legal to pass more arguments than required to those functions. We now pass arguments in the following order: non-vararg arguments labels for non-vararg arguments [if vararg function, pointer to array of labels for vararg arguments] [if non-void function, pointer to label for return value] vararg arguments Differential Revision: http://reviews.llvm.org/D6028 llvm-svn: 220906	2014-10-30 13:22:57 +00:00
Peter Zotov	4a69306576	[OCaml] Expose LLVMCloneModule. llvm-svn: 220903	2014-10-30 08:30:12 +00:00
Peter Zotov	e75657b727	[OCaml] Expose LLVM{Get,Set}DLLStorageClass. llvm-svn: 220902	2014-10-30 08:30:08 +00:00
Peter Zotov	f481471495	[OCaml] Test code emission in Llvm_target. Prior to this commit, the Llvm_target tests (ab)used the Llvm_executionengine as a mechanism to initialize at least some target. This needlessly restricted tests to builds which can emit code for their host architecture. llvm-svn: 220901	2014-10-30 08:30:01 +00:00
Peter Zotov	d6d49fd0e2	[OCaml] Enable backtraces in tests. llvm-svn: 220900	2014-10-30 08:29:57 +00:00
Peter Zotov	668f9670a6	[OCaml] [autoconf] Migrate to ocamlfind. This commit updates the OCaml bindings and tests to use ocamlfind. The bindings are migrated in order to use ctypes, which are now required for MCJIT-backed Llvm_executionengine. The tests are migrated in order to use OUnit and to verify that the distributed META.llvm allows to build working executables. Every OCaml toolchain invocation is now chained through ocamlfind, which (in theory) allows to cross-compile the OCaml bindings. The configure script now checks for ctypes (>= 0.2.3) and OUnit (>= 2). The code depending on these libraries will be added later. The configure script does not check the package versions in order to keep changes less invasive. Additionally, OCaml bindings will now be automatically enabled if ocamlfind is detected on the system, rather than ocamlc, as it was before. llvm-svn: 220899	2014-10-30 08:29:45 +00:00
Peter Zotov	1b254f9a56	[OCaml] De-duplicate llvm_raise and llvm_string_of_message. llvm-svn: 220898	2014-10-30 08:29:29 +00:00
Rafael Espindola	919fb535a0	Enable the slp vectorizer in the gold plugin. llvm-svn: 220887	2014-10-30 00:38:54 +00:00
Rafael Espindola	8391dbd1c6	Enable the loop vectorizer in the gold plugin. llvm-svn: 220886	2014-10-30 00:11:24 +00:00
Rafael Espindola	4a3b6cf3c1	Replace also-emit-llvm with save-temps. The also-emit-llvm option only supported getting the IR before optimizations. This patch replaces it with a more generic save-temps option that saves the IR both before and after optimizations. llvm-svn: 220885	2014-10-29 23:54:45 +00:00
NAKAMURA Takumi	32c87aced3	Untabify. llvm-svn: 220884	2014-10-29 23:44:35 +00:00
NAKAMURA Takumi	27b6d47f36	llvm/test/Transforms/LoopRotate/nosimplifylatch.ll: Fix possibly mis-repeatedly-pasted test. llvm-svn: 220880	2014-10-29 23:05:12 +00:00
Kevin Enderby	b28ed015ce	Run clang-format on tools/llvm-objdump/MachODump.cpp . No functional change. llvm-svn: 220875	2014-10-29 21:28:24 +00:00
Yi Jiang	323d57336c	Test Case for r220872:Do not simplifyLatch for loops where hoisting increments couldresult in extra live range interferance llvm-svn: 220873	2014-10-29 20:20:33 +00:00
Yi Jiang	ab19fff4d8	Do not simplifyLatch for loops where hoisting increments couldresult in extra live range interferance llvm-svn: 220872	2014-10-29 20:19:47 +00:00
Jan Wen Voung	ce2164f45c	Fix getRelocationValueString to return the symbol name for EM_386. Summary: This helps llvm-objdump -r to print out the symbol name along with the relocation type on x86. Adjust existing tests from checking for "Unknown" to check for the symbol now. Test Plan: Adjusted test/Object tests. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5987 llvm-svn: 220866	2014-10-29 18:37:13 +00:00
Alexander Kornienko	633c814f8f	Enable display of compiler diagnostics in clang-tidy by default. llvm-svn: 220863	2014-10-29 17:29:38 +00:00
Robert Khasanov	595e598869	[AVX512] Implemented AVX512VL FP bnary packed instructions (VADDP, VSUBP, VMULP, VDIVP, VMAXP, VMINP) Refactored through AVX512_maskable Added encoding tests for them. llvm-svn: 220858	2014-10-29 15:43:02 +00:00
NAKAMURA Takumi	f51a34ec1f	Whitespace. llvm-svn: 220857	2014-10-29 15:23:11 +00:00
Michael Kuperstein	f9c3480322	Fix build with CMake if LLVM_USE_INTEL_JITEVENTS option is enabled * Added LLVM libraries required for IntelJITEvents to LLVMBuild.txt. * Removed 'jit' library from llvm-jitlistener. * Added support for OptionalLibraries to llvm-build cmake files generator. Patch by aleksey.a.bader@intel.com Differential Revision: http://reviews.llvm.org/D5646 llvm-svn: 220848	2014-10-29 09:18:49 +00:00
Peter Zotov	f3f23c1f99	[OCaml] Expose Llvm.parse_command_line_options. llvm-svn: 220847	2014-10-29 08:16:18 +00:00
Peter Zotov	f58626d5c9	[OCaml] Expose Llvm_target.TargetMachine.add_analysis_passes. llvm-svn: 220846	2014-10-29 08:16:14 +00:00
Peter Zotov	ad383afad4	[OCaml] If compiled without --enable-shared, hide packages from toplevel. Pretend they do not exist using exists_if. This is better than the current situation, which is the error: Error: The external function `llvm_global_succ' is not available but still somewhat confusing. llvm-svn: 220845	2014-10-29 08:16:06 +00:00
Peter Zotov	5f28729c61	[OCaml] Expose Llvm_bitwriter.write_bitcode_to_memory_buffer. llvm-svn: 220844	2014-10-29 08:16:01 +00:00
Peter Zotov	662538ac40	[OCaml] Drop support for 3.12.1 and earlier. In practice this means: * Always using -g flag. * Embedding -cclib -lstdc++ into the corresponding cma/cmxa file. This also moves -lstdc++ in a single place. * Using caml_named_value instead of a homegrown mechanism. llvm-svn: 220843	2014-10-29 08:15:54 +00:00
Peter Zotov	e447b61c50	[OCaml] Synchronize transformations with LLVM-C. Also, rearrange the functions in a way that allows to quickly compare C headers and .mli/glue files. llvm-svn: 220842	2014-10-29 08:15:21 +00:00
NAKAMURA Takumi	815d752b93	macho-symbolized-disassembly.test: Don't check C++ demangler unconditionally. For example, MS PSDK is not expected to have <cxxabi.h>. You should introduce the new feature in lit.cfg corresponding to HAVE_CXXABI_H if you would like to test demangler. llvm-svn: 220840	2014-10-29 08:08:21 +00:00
Saleem Abdulrasool	56ade2991b	test: tweak inlined-allocs test Remove pointless checks for storage of uninteresting values. Ensure that we perform basic alias analysis to make the test more correct. Finally, apply a stylistic change to the test. llvm-svn: 220839	2014-10-29 06:31:11 +00:00
Seo Sanghyeon	a4fcf71bca	VMCore was renamed to IR long time ago llvm-svn: 220838	2014-10-29 05:20:39 +00:00
Kevin Enderby	04bf6931cc	Update llvm-objdump’s Mach-O symbolizer code to demangle C++ names. llvm-svn: 220833	2014-10-28 23:39:46 +00:00
Peter Zotov	1f351ca15a	[OCaml] PR5595: Pass LDFLAGS to tests via -cclib. llvm-svn: 220832	2014-10-28 23:31:13 +00:00
Peter Zotov	5df08c8938	[OCaml] PR14083, PR9606: Only pick *.odoc files from current build target. When several build targets, e.g. Debug+Asserts and Release+Asserts are present, ocamldoc complains of duplicate interfaces. llvm-svn: 220831	2014-10-28 22:45:25 +00:00
Peter Zotov	796537119b	[OCaml] Fix whitespace. llvm-svn: 220829	2014-10-28 22:39:42 +00:00
Peter Zotov	dacfc64b57	[OCaml] PR9719, PR14727: Make tests run without ocamlopt. Previously, tests hardcoded ocamlopt and cmxa, which broke builds on machines without ocamlopt. Instead, they now fall back to ocamlc. As a side effect this fixes PR14727, which was caused by a crude hack that replaced gcc with g++ everywhere in the ocamlopt native compiler path and passes it back using -cc. Now the tests use the same technique as META, i.e. -cclib -lstdc++. It might be more fragile than using g++ explicitly, but it will break when the installed package will also break, which is good. llvm-svn: 220828	2014-10-28 22:39:36 +00:00
Peter Zotov	1afb7497c7	[OCaml] PR19859: Add functions to query and modify branches. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220818	2014-10-28 19:47:02 +00:00
Peter Zotov	2481c75f8b	[C API] PR19859: Add functions to query and modify branches. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220817	2014-10-28 19:46:56 +00:00
Peter Zotov	6074c344de	[OCaml] PR19859: Add tests for reading the values of numeric constants. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220816	2014-10-28 19:46:52 +00:00
Peter Zotov	fec0486a30	[OCaml] PR19859: Add Llvm.{fcmp_predicate,float_of_const}. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220815	2014-10-28 19:46:48 +00:00
Peter Zotov	1d98e6ddef	[C API] PR19859: Add LLVMGetFCmpPredicate and LLVMConstRealGetDouble. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220814	2014-10-28 19:46:44 +00:00
Saleem Abdulrasool	d178ada55e	Transforms: reapply SVN r219899 This restores the commit from SVN r219899 with an additional change to ensure that the CodeGen is correct for the case that was identified as being incorrect (originally PR7272). In the case that during inlining we need to synthesize a value on the stack (i.e. for passing a value byval), then any function involving that alloca must be stripped of its tailness as the restriction that it does not access the parent's stack no longer holds. Unfortunately, a single alloca can cause a rippling effect through out the inlining as the value may be aliased or may be mutated through an escaped external call. As such, we simply track if an alloca has been introduced in the frame during inlining, and strip any tail calls. llvm-svn: 220811	2014-10-28 18:27:37 +00:00
Robert Khasanov	1cf354c92f	[AVX512] Fix VSQRT packed instructions internal names. No functional change llvm-svn: 220808	2014-10-28 18:22:41 +00:00
Robert Khasanov	eb12639375	[AVX512] Extended avx512_sqrt_packed (sqrt instructions) to VL subset. Refactored through AVX512_maskable llvm-svn: 220806	2014-10-28 18:15:20 +00:00
Robert Khasanov	3e534c93b9	[AVX-512] Expanded rsqrt/rcp instructions to VL subset. Refactored multiclass through AVX512_maskable llvm-svn: 220783	2014-10-28 16:37:13 +00:00
Robert Khasanov	784c3f0d6e	[AVX512] Removed special case for cmp instructions in getVectorMaskingNode. Now cmp intrinsics lower as other intrinsics through VSELECT, and then VSELECT tranforms to AND in PerformSELECTCombine. No functional change. llvm-svn: 220779	2014-10-28 16:17:14 +00:00
Robert Khasanov	4441c4d31b	[x86] Simplify vector selection if condition value type matches vselect value type and true value is all ones or false value is all zeros. This transformation worked if selector is produced by SETCC, however SETCC is needed only if we consider to swap operands. So I replaced SETCC check for this case. Added tests for vselect of <X x i1> values. llvm-svn: 220777	2014-10-28 15:59:40 +00:00
Aaron Ballman	5af8ba49a6	Silencing an "enumeral and non-enumeral type in conditional expression" warning; NFC. llvm-svn: 220775	2014-10-28 13:12:13 +00:00
Robert Khasanov	dd09a8f320	[AVX512] Bring back vector-shuffle lowering support through broadcasts Ffter commit at rev219046 512-bit broadcasts lowering become non-optimal. Most of tests on broadcasting and embedded broadcasting were changed and they doesn’t produce efficient code. Example below is from commit changes (it’s the first test from test/CodeGen/X86/avx512-vbroadcast.ll): define <16 x i32> @_inreg16xi32(i32 %a) { ; CHECK-LABEL: _inreg16xi32: ; CHECK: ## BB#0: -; CHECK-NEXT: vpbroadcastd %edi, %zmm0 +; CHECK-NEXT: vmovd %edi, %xmm0 +; CHECK-NEXT: vpbroadcastd %xmm0, %ymm0 +; CHECK-NEXT: vinserti64x4 $1, %ymm0, %zmm0, %zmm0 ; CHECK-NEXT: retq %b = insertelement <16 x i32> undef, i32 %a, i32 0 %c = shufflevector <16 x i32> %b, <16 x i32> undef, <16 x i32> zeroinitializer ret <16 x i32> %c } Here, 256-bit broadcast was generated instead of 512-bit one. In this patch 1) I added vector-shuffle lowering through broadcasts 2) Removed asserts and branches likes because this is incorrect - assert(Subtarget->hasDQI() && "We can only lower v8i64 with AVX-512-DQI"); 3) Fixed lowering tests llvm-svn: 220774	2014-10-28 12:28:51 +00:00
NAKAMURA Takumi	d0e13af22c	Reformat partially, where I touched for whitespace changes. llvm-svn: 220773	2014-10-28 11:54:52 +00:00
NAKAMURA Takumi	5af50a5470	LoopRerollPass.cpp: Use range-based loop. NFC. llvm-svn: 220772	2014-10-28 11:54:05 +00:00
NAKAMURA Takumi	335a7bcf1e	Untabify and whitespace cleanups. llvm-svn: 220771	2014-10-28 11:53:30 +00:00
Peter Zotov	3ebd0bf24d	[OCaml] Enable -g for debug builds. We don't care about pre-3.12.1 anymore. llvm-svn: 220767	2014-10-28 06:15:41 +00:00
Peter Zotov	110f62912b	[OCaml] Fix whitespace. llvm-svn: 220766	2014-10-28 06:15:18 +00:00
David Blaikie	ff468a5e0b	Minimize the scope of some variables, NFC. llvm-svn: 220759	2014-10-28 02:57:26 +00:00
Reid Kleckner	9ccce99e1d	X86: Implement the vectorcall calling convention This is a Microsoft calling convention that supports both x86 and x86_64 subtargets. It passes vector and floating point arguments in XMM0-XMM5, and passes them indirectly once they are consumed. Homogenous vector aggregates of up to four elements can be passed in sequential vector registers, but this part is not implemented in LLVM and will be handled in Clang. On 32-bit x86, it is similar to fastcall in that it uses ecx:edx as integer register parameters and is callee cleanup. On x86_64, it delegates to the normal win64 calling convention. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D5943 llvm-svn: 220745	2014-10-28 01:29:26 +00:00
Tim Northover	00917897b2	AArch64: enable Cortex-A57 FP balancing on Cortex-A53. Benchmarks have shown that it's harmless to the performance there, and having a unified set of passes between the two cores where possible helps big.LITTLE deployment. Patch by Z. Zheng. llvm-svn: 220744	2014-10-28 01:24:32 +00:00
Rafael Espindola	9f8eff31db	Remove the PreserveSource linker mode. I noticed that it was untested, and forcing it on caused some tests to fail: LLVM :: Linker/metadata-a.ll LLVM :: Linker/prefixdata.ll LLVM :: Linker/type-unique-odr-a.ll LLVM :: Linker/type-unique-simple-a.ll LLVM :: Linker/type-unique-simple2-a.ll LLVM :: Linker/type-unique-simple2.ll LLVM :: Linker/type-unique-type-array-a.ll LLVM :: Linker/unnamed-addr1-a.ll LLVM :: Linker/visibility1.ll If it is to be resurrected, it has to be fixed and we should probably have a -preserve-source command line option in llvm-mc and run tests with and without it. llvm-svn: 220741	2014-10-28 00:24:16 +00:00
NAKAMURA Takumi	949fb6d276	AArch64InstrInfo.h: Fix a warning introduced in clang r220703. [-Winconsistent-missing-override] llvm-svn: 220739	2014-10-27 23:29:27 +00:00
Adam Nemet	cf7a4a2660	[AVX512] Add vpermil variable version This is implemented via a multiclass that derives from the vperm imm multiclass. Fixes <rdar://problem/18426089> llvm-svn: 220737	2014-10-27 23:08:40 +00:00
Adam Nemet	8d85b0cd3e	[AVX512] Clean up avx512_perm_imm to use X86VectorVTInfo No functionality change. No change in X86.td.expanded except that we only set the CD8 attributes for the memory variants. (This shouldn't be used unless we have a memory operand.) llvm-svn: 220736	2014-10-27 23:08:37 +00:00
Adam Nemet	9aad13164e	[AVX512] Derive vpermil* from avx512_perm_imm This used to derive from avx512_pshuf_imm which is confusing. NFC. Compared X86.td.expanded. llvm-svn: 220735	2014-10-27 23:08:34 +00:00
Adam Nemet	c51cee85b7	[AVX512] Fix copy-and-paste bugs in vpermil 1) i512mem -> f512mem (this is the packed FP input being permuted) 2) element size is 64 bits in EVEX_CD8 for PD. (A good illustration why X86VectorVTInfo is useful) llvm-svn: 220734	2014-10-27 23:08:31 +00:00
Rafael Espindola	4160f5d3ac	Make it easier to pass a custom diagnostic handler to the IR linker. llvm-svn: 220732	2014-10-27 23:02:10 +00:00
Pete Cooper	7c801dc90b	Fix a stackmap bug introduced in r220710. For a call to not return in to the stackmap shadow, the shadow must end with the call. To do this, we must insert any required nops before the call, and not after it. llvm-svn: 220728	2014-10-27 22:38:45 +00:00
Jingyue Wu	53c1612ed1	[ScalarEvolution] Guard dump() with #if to be consistent with its definition in ScalarEvolution.cpp llvm-svn: 220721	2014-10-27 21:14:41 +00:00
Rafael Espindola	5f13d6461b	Fix bug where sys::Wait could wait on wrong pid. Setting ChildPid to -1 would cause waitpid to wait for any child process. Patch by Daniel Reynaud! llvm-svn: 220717	2014-10-27 20:30:04 +00:00
Juergen Ributzka	7ccebec668	[FastISel][AArch64] Emit immediate version of icmp (subs) for null pointer check. This is a minor change to use the immediate version when the operand is a null value. This should get rid of an unnecessary 'mov' instruction in debug builds and align the code more with the one generated by SelectionDAG. This fixes rdar://problem/18785125. llvm-svn: 220713	2014-10-27 19:58:36 +00:00
Juergen Ributzka	0190fea941	[FastISel][AArch64] Optimize compare-and-branch for i1 to use 'tbz'. Minor enhancement to use 'tbz' for i1 compare-and-branch to get rid of an 'and' instruction. This fixes rdar://problem/18784953. llvm-svn: 220712	2014-10-27 19:46:23 +00:00
Pete Cooper	3c0af35232	Stackmap shadows should consider call returns a branch target. To avoid emitting too many nops, a stackmap shadow can include emitted instructions in the shadow, but these must not include branch targets. A return from a call should count as a branch target as patching over the instructions after the call would lead to incorrect behaviour for threads currently making that call, when they return. llvm-svn: 220710	2014-10-27 19:40:35 +00:00
Juergen Ributzka	90f741a2ce	[FastISel][AArch64] Use 'cbz' also for null values (pointers). The pattern matching for a 'ConstantInt' value was too restrictive. Checking for a 'Constant' with a bull value is sufficient for using an 'cbz/cbnz' instruction. This fixes rdar://problem/18784732. llvm-svn: 220709	2014-10-27 19:38:05 +00:00
Juergen Ributzka	eae91040d8	[FastISel][AArch64] Don't fold the 'and' instruction into the 'tbz/tbnz' instruction if it is in a different basic block. This fixes a bug where the input register was not defined for the 'tbz/tbnz' instruction. This happened, because we folded the 'and' instruction from a different basic block. This fixes rdar://problem/18784013. llvm-svn: 220704	2014-10-27 19:16:48 +00:00
Juergen Ributzka	6de054a25a	[FastISel][AArch64] Fix load/store with frame indices. At higher optimization levels the LLVM IR may contain more complex patterns for loads/stores from/to frame indices. The 'computeAddress' function wasn't able to handle this and triggered an assertion. This fix extends the possible addressing modes for frame indices. This fixes rdar://problem/18783298. llvm-svn: 220700	2014-10-27 18:21:58 +00:00
Kostya Serebryany	4f8f0c5aa2	[asan] experimental tracing for indirect calls, llvm part. llvm-svn: 220699	2014-10-27 18:13:56 +00:00
Lang Hames	69fa70efb3	[PBQP] Remove a spurious 'typename' keyword. This was causing an error on MSVC. llvm-svn: 220690	2014-10-27 17:59:51 +00:00
Lang Hames	bfd1c3cfa5	[PBQP] Clarify ambiguous-looking typedef. This was causing an error on the hexagon bots. llvm-svn: 220689	2014-10-27 17:52:05 +00:00
Lang Hames	5fe30ca56f	[PBQP] Unique allowed-sets for nodes in the PBQP graph and use pairs of these sets as keys into a cache of interference matrice values in the Interference constraint adder. Creating interference matrices was one of the large remaining time-sinks in PBQP. Caching them reduces the total compile time (when using PBQP) on the nightly test suite by ~10%. llvm-svn: 220688	2014-10-27 17:44:25 +00:00
Michael Gottesman	d71825c3cb	Add MapVector::rbegin(), MapVector::rend() to completment MapVector::begin(), MapVector::end(). These just delegate to the underlying vector type in the MapVector. Also just add in some sanity unittests. llvm-svn: 220687	2014-10-27 17:20:53 +00:00
NAKAMURA Takumi	729be14435	Prune CRLF. llvm-svn: 220678	2014-10-27 12:37:26 +00:00
Oliver Stannard	79efe41a0c	[ARM] Select VMAXNM and VMINNM regardless of operand order Currently, the ARM backend will select the VMAXNM and VMINNM for these C expressions: (a < b) ? a : b (a > b) ? a : b but not these expressions: (a > b) ? b : a (a < b) ? b : a This patch allows all of these expressions to be matched. llvm-svn: 220671	2014-10-27 09:23:02 +00:00
Yuri Gorshenin	3e22bb8c54	[asan-asm-instrumentation] Added comment describing how asm instrumentation works. Summary: [asan-asm-instrumentation] Added comment describing how asm instrumentation works. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5970 llvm-svn: 220670	2014-10-27 08:38:54 +00:00
Rui Ueyama	292fb6d7be	Re-commit r220667. C++ source given to check_cxx_source_compile should have define "main". llvm-svn: 220669	2014-10-27 08:16:18 +00:00
NAKAMURA Takumi	59c74b225a	Fix unicode chars into ascii in comment lines. llvm-svn: 220668	2014-10-27 08:08:18 +00:00
Rui Ueyama	7ef5796b20	Revert "Include stddef.h before including cxxabi.h" to un-break buildbot This reverts commit r220665. llvm-svn: 220667	2014-10-27 08:03:21 +00:00
Rui Ueyama	5f3212137d	Include stddef.h before including cxxabi.h On FreeBSD 10.0, size_t needs to be defined before including cxxabi.h. Currenty HAVE_CXXABI_H is not defined on FreeBSD because of that reason. This patch teaches cmake and configure how to include it. http://reviews.llvm.org/D5940 llvm-svn: 220665	2014-10-27 07:37:57 +00:00
David Majnemer	c8bdd23acf	InstCombine: Fix a combine assuming that icmp operands were integers An icmp may have pointer arguments, it isn't limited to integers or vectors of integers. This fixes PR21388. llvm-svn: 220664	2014-10-27 05:47:49 +00:00
Rafael Espindola	18c89411b8	LinkModules.cpp: don't repeat names in comments. llvm-svn: 220662	2014-10-27 02:35:46 +00:00
David Blaikie	df1cca3a24	Remove some unnecessary casts. llvm-svn: 220658	2014-10-26 23:37:04 +00:00
Lang Hames	5af35a9ee1	[PBQP] Tidying up as per Dave Blaikie's suggesions for r220642. llvm-svn: 220655	2014-10-26 22:12:02 +00:00
Lang Hames	6bac95a33b	[PBQP] Explicitly define copy/move operations for NodeMetadata to keep VS happy. Hopefully this fixes the bug that was introduced in r220642, and not-quite-fixed in r220649. llvm-svn: 220653	2014-10-26 21:55:54 +00:00
Arnold Schwaighofer	eb1a38fa73	Add an option to the LTO code generator to disable vectorization during LTO We used to always vectorize (slp and loop vectorize) in the LTO pass pipeline. r220345 changed it so that we used the PassManager's fields 'LoopVectorize' and 'SLPVectorize' out of the desire to be able to disable vectorization using the cl::opt flags 'vectorize-loops'/'slp-vectorize' which the before mentioned fields default to. Unfortunately, this turns off vectorization because those fields default to false. This commit adds flags to the LTO library to disable lto vectorization which reconciles the desire to optionally disable vectorization during LTO and the desired behavior of defaulting to enabled vectorization. We really want tools to set PassManager flags directly to enable/disable vectorization and not go the route via cl::opt flags in PassManagerBuilder.cpp. llvm-svn: 220652	2014-10-26 21:50:58 +00:00
Lang Hames	37ea9315e6	[PBQP] Re-commit r220642 with a workaround for quirky Visual Studio behavior. Apparently unique_ptr'ifying NodeMetadata exposed an issue in VS where it occasionally tries to synthesize copy constructors instead of moves. Hopefully explicitly deleting the copy constructor and defining the move constructor will fix this. llvm-svn: 220649	2014-10-26 20:57:16 +00:00
Peter Zotov	2ab0d305a4	[OCaml] Expose existing documentation in ocamldoc. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220648	2014-10-26 20:45:22 +00:00
Hans Wennborg	deb10060ab	Revert "[PBQP] Unique-ptrify some PBQP Metadata structures. No functional change." (r220642) It broke the Windows build: [1/19] Building CXX object lib\CodeGen\CMakeFiles\LLVMCodeGen.dir\RegAllocPBQP.cpp.obj C:\bb-win7\ninja-clang-i686-msc17-R\llvm-project\llvm\include\llvm/CodeGen/RegAllocPBQP.h(132) : error C2248: 'std::unique_ptr<_Ty>::unique_ptr' : cannot access private member declared in class 'std::unique_ptr<_Ty>' with [ _Ty=unsigned int [] ] D:\Program Files (x86)\Microsoft Visual Studio 11.0\VC\include\memory(1600) : see declaration of 'std::unique_ptr<_Ty>::unique_ptr' with [ _Ty=unsigned int [] ] This diagnostic occurred in the compiler generated function 'llvm::PBQP::RegAlloc::NodeMetadata::NodeMetadata(const llvm::PBQP::RegAlloc::NodeMetadata &)' llvm-svn: 220645	2014-10-26 19:50:13 +00:00
Lang Hames	2e3e3d94c8	[PBQP] Unique-ptrify some PBQP Metadata structures. No functional change. llvm-svn: 220642	2014-10-26 18:50:52 +00:00
Lang Hames	99405777bc	[PBQP] Tidy up CostAllocator.h: fix variable case, rename CostPool to ValuePool. No functional change. This just brings things more in-line with coding standards, and makes ValuePool's functionality clearer (it's not tied to pooling costs, and we may want to use it to hold other things in the future). llvm-svn: 220641	2014-10-26 18:16:27 +00:00
Elena Demikhovsky	4b01b7306c	AVX-512: Fixed encoding of VPBROADCASTM and added SKX forms of this instruction llvm-svn: 220638	2014-10-26 09:52:24 +00:00
Andrew Trick	dd925ad218	LSR: Minor cleanup after Daniel's patch. Combine the Inserted an Done sets into a Visited set. llvm-svn: 220623	2014-10-25 19:59:30 +00:00
Andrew Trick	9ccbed5a12	Fix LSR compile time. This is a simple fix that brings the compilation time from 5min to 5s on a specific real-world example. It's a large chain of computation in a crypto routine (always a problem for SCEV). A unit test is not feasible and there would be no way to check it. The fix is just basic good practice for dealing with SCEVs, there's no risk of regression. Patch by Daniel Reynaud! llvm-svn: 220622	2014-10-25 19:42:07 +00:00
Peter Zotov	8cdf0425a0	[OCaml] hexagon can't run MCJIT tests, XFAIL it. llvm-svn: 220621	2014-10-25 19:01:14 +00:00
Peter Zotov	3944e6e223	[OCaml] Unbreak Llvm_executionengine.initialize_native_target. First, return true on success, as it is the OCaml convention. Second, also initialize the native assembly printer, which is, despite the name, required for MCJIT operation. Since this function did not initialize the assembly printer earlier and no function to initialize native assembly printer was available elsewhere, it is safe to break its interface: it means that it simply could not be used successfully before. llvm-svn: 220620	2014-10-25 18:50:02 +00:00
Peter Zotov	d1531a2349	[OCaml] Expose Llvm_executionengine.ExecutionEngine.create_mcjit. llvm-svn: 220619	2014-10-25 18:49:56 +00:00
Jingyue Wu	fe72fcebf6	[SeparateConstOffsetFromGEP] Fixed a bug related to unsigned modulo The dividend in "signed % unsigned" is treated as unsigned instead of signed, causing unexpected behavior such as -64 % (uint64_t)24 == 0. Added a regression test in split-gep.ll Patched by Hao Liu. llvm-svn: 220618	2014-10-25 18:34:03 +00:00
Benjamin Kramer	aa139573f6	Unbreak the build. llvm-svn: 220617	2014-10-25 18:20:17 +00:00
Benjamin Kramer	63207bc9c3	Clean up assume intrinsic pattern matching, no need to check that the argument is a value. Also make it const safe and remove superfluous casting. NFC. llvm-svn: 220616	2014-10-25 18:09:01 +00:00
Jingyue Wu	b723152379	[SeparateConstOffsetFromGEP] Fixed a bug in rebuilding OR expressions The two operands of the new OR expression should be NextInChain and TheOther instead of the two original operands. Added a regression test in split-gep.ll. Hao Liu reported this bug, and provded the test case and an initial patch. Thanks! llvm-svn: 220615	2014-10-25 17:36:21 +00:00
Simon Pilgrim	a63672665f	[X86][SSE] Vector integer/float conversion memory folding Tidied up some entries in the folding tables so that they are under the correct comment section (they were categorised as AVX2 instructions when they're AVX1). Minor patch agreed with qcolombet. llvm-svn: 220613	2014-10-25 08:11:20 +00:00
David Majnemer	2abb8183b5	InstCombine: Remove overzealous asserts These asserts can trigger if the worklist iteration order is sufficiently unlucky. Instead of adding special case logic to handle these edge conditions, just bail out on trying to transform them: InstSimplify will get them when it reaches them on the worklist. This fixes PR21378. N.B. No test case is included because any test would rely on the fragile worklist iteration order. llvm-svn: 220612	2014-10-25 07:13:13 +00:00
Rafael Espindola	98ab63ce98	Allow the C API users to keep relying on the OutMessages parameter. Should fix the Ocaml tests. llvm-svn: 220611	2014-10-25 04:31:08 +00:00
Rafael Espindola	8f2d615252	Remove unused variable. llvm-svn: 220610	2014-10-25 04:07:53 +00:00
Rafael Espindola	d12b4a334b	Update the error handling of lib/Linker. Instead of passing a std::string&, use the new diagnostic infrastructure. llvm-svn: 220608	2014-10-25 04:06:10 +00:00
Jingyue Wu	ea51161a94	[NVPTX] aligned byte-buffers for vector return types Summary: Fixes PR21100 which is caused by inconsistency between the declared return type and the expected return type at the call site. The new behavior is consistent with nvcc and the NVPTXTargetLowering::getPrototype function. Test Plan: test/Codegen/NVPTX/vector-return.ll Reviewers: jholewinski Reviewed By: jholewinski Subscribers: llvm-commits, meheff, eliben, jholewinski Differential Revision: http://reviews.llvm.org/D5612 llvm-svn: 220607	2014-10-25 03:46:16 +00:00
Rafael Espindola	6e0b559d4f	Add a test for the -suppress-warnings option. llvm-svn: 220603	2014-10-25 01:14:15 +00:00
Evgeniy Stepanov	d337a59db5	[msan] Make -msan-check-constant-shadow a bit stronger. Allow (under the experimental flag) non-Instructions to participate in MSan checks. llvm-svn: 220601	2014-10-24 23:34:15 +00:00
Rafael Espindola	5a52e6dc9e	Modernize the error handling of the Materialize function. llvm-svn: 220600	2014-10-24 22:50:48 +00:00
Kevin Enderby	2813f496d9	Fix a Mach-O assembler segfault for a subtraction expression with an undefined symbol. In a Mach-O object file a relocatable expression of the form SymbolA - SymbolB + constant is allowed when both symbols are defined in a section. But when either symbol is undefined it is an error. The code was crashing when it had an undefined symbol in this case. And should have printed a error message using the location information in the relocation entry. rdar://18678402 llvm-svn: 220599	2014-10-24 22:39:40 +00:00
Frederic Riss	987fe22789	Sink DwarfUnit::constructImportedEntityDIE into DwarfCompileUnit. So that it has access to getOrCreateGlobalVariableDIE. If we ever support decsribing using directive in C++ classes (thus requiring support in type units), it will certainly use another mechanism anyway. Differential Revision: http://reviews.llvm.org/D5975 llvm-svn: 220594	2014-10-24 21:31:09 +00:00
Simon Pilgrim	fd080af0c5	[X86][SSE] Bitcast assertion in XFormVExtractWithShuffleIntoLoad Minor patch to fix an issue in XFormVExtractWithShuffleIntoLoad where a load is unary shuffled, then bitcast (to a type with the same number of elements) before extracting an element. An undef was created for the second shuffle operand using the original (post-bitcasted) vector type instead of the pre-bitcasted type like the rest of the shuffle node - this was then causing an assertion on the different types later on inside SelectionDAG::getVectorShuffle. Differential Revision: http://reviews.llvm.org/D5917 llvm-svn: 220592	2014-10-24 21:04:41 +00:00
Colin LeMahieu	838307b31f	[Hexagon] Resubmission of 220427 Modified library structure to deal with circular dependency between HexagonInstPrinter and HexagonMCInst. Adding encoding bits for add opcode. Adding llvm-mc tests. Removing unit tests. http://reviews.llvm.org/D5624 llvm-svn: 220584	2014-10-24 19:00:32 +00:00
Matt Arsenault	4b7bd2d52f	Fix copy paste comment llvm-svn: 220581	2014-10-24 18:13:10 +00:00
Rafael Espindola	d4bcefc7d9	Don't ever call materializeAllPermanently during LTO. To do this, change the representation of lazy loaded functions. The previous representation cannot differentiate between a function whose body has been removed and one whose body hasn't been read from the .bc file. That means that in order to drop a function, the entire body had to be read. llvm-svn: 220580	2014-10-24 18:13:04 +00:00
Sanjay Patel	f924e11967	Allow AVX vrsqrtps generation. This is a follow-on to r220570 that allows a 256-bit (v8f32) version of vrsqrtps to be generated. llvm-svn: 220579	2014-10-24 17:59:18 +00:00
David Blaikie	80e5b1ebd1	DebugInfo: Sink DwarfDebug::ScopeVariables down into DwarfFile (part of refactoring to allow subprogram emission in both the skeleton and main units to enable -gmlt-like data to be included in the skeleton for live inlined backtracing purposes) llvm-svn: 220578	2014-10-24 17:57:34 +00:00
Sanjay Patel	514253cf43	remove class/function/variable names from comments; NFC llvm-svn: 220577	2014-10-24 17:55:59 +00:00
David Blaikie	0dc623c232	Remove DwarfDebug::FirstCU as it has no use It was only being used as a flag to identify the lack of debug info from within endModule - use the section labels for that instead. llvm-svn: 220575	2014-10-24 17:53:38 +00:00
Sanjay Patel	957efc23bb	Use rsqrt (X86) to speed up reciprocal square root calcs This is a first step for generating SSE rsqrt instructions for reciprocal square root calcs when fast-math is allowed. For now, be conservative and only enable this for AMD btver2 where performance improves significantly - for example, 29% on llvm/projects/test-suite/SingleSource/Benchmarks/BenchmarkGame/n-body.c (if we convert the data type to single-precision float). This patch adds a two constant version of the Newton-Raphson refinement algorithm to DAGCombiner that can be selected by any target via a parameter returned by getRsqrtEstimate().. See PR20900 for more details: http://llvm.org/bugs/show_bug.cgi?id=20900 Differential Revision: http://reviews.llvm.org/D5658 llvm-svn: 220570	2014-10-24 17:02:16 +00:00
Daniel Sanders	e2e25da4b6	[mips] Replace MipsABIEnum with a MipsABIInfo class. Summary: No functional change yet, it's just an object replacement for an enum. It will allow us to gather ABI information in a single place so that we can start testing for properties of the ABI's instead of the ABI itself. For example we will eventually be able to use: ABI.MinStackAlignmentInBytes() instead of: (isABI_N32() \|\| isABI_N64()) ? 16 : 8 which is clearer and more maintainable. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3341 llvm-svn: 220568	2014-10-24 16:15:27 +00:00
Benjamin Kramer	014601d56e	[Object] Fix MachO's getUuid to return a pointer into the object instead of a dangling ArrayRef. This works because uuid's are always little endian so it's not swapped. Fixes use-after-return reported by asan. llvm-svn: 220567	2014-10-24 15:52:05 +00:00
Aaron Ballman	be63968a5c	These functions are not actually defined for NDEBUG or !LLVM_DUMP_ENABLED, so guarding the declarations as well. NFC, silences MSVC warnings in release builds. llvm-svn: 220565	2014-10-24 15:16:39 +00:00
Daniel Sanders	f815c137e6	[mips] Fix >80-column line llvm-svn: 220564	2014-10-24 14:46:00 +00:00
Daniel Sanders	8d69aedbd5	[mips] Remove redundant code in RetCC_MipsN. NFC. Summary: i32 is always promoted to i64 so it no longer makes sense to assign i32 to registers. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5964 llvm-svn: 220561	2014-10-24 13:49:54 +00:00
Daniel Sanders	19f01658fe	[mips] For N32/N64, structs must be passed in the upper bits of a register. Summary: Most structs were fixed by r218451 but those of between >32-bits and <64-bits remained broken since they were not marked with [ASZ]ExtUpper. This patch fixes the remaining cases by using CCPromoteToUpperBitsInType<i64> on i64's in addition to i32 and smaller. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5963 llvm-svn: 220556	2014-10-24 13:09:19 +00:00
Oliver Stannard	f7a5afc3f2	[AArch64] Fix fast-isel of cbz of i1, i8, i16 This fixes a miscompilation in the AArch64 fast-isel which was triggered when a branch is based on an icmp with condition eq or ne, and type i1, i8 or i16. The cbz instruction compares the whole 32-bit register, so values with the bottom 1, 8 or 16 bits clear would cause the wrong branch to be taken. llvm-svn: 220553	2014-10-24 09:54:41 +00:00
Timur Iskhodzhanov	b320527e39	Update test/MC/ARM/coff-debugging-secrel.ll expectations to fix breakage caused by r220544 llvm-svn: 220548	2014-10-24 06:24:07 +00:00
Marcello Maggioni	2259400f8e	Added reset of LexicalScope in LiveDebugVariables reset function. llvm-svn: 220545	2014-10-24 02:46:50 +00:00
Timur Iskhodzhanov	2bc90fdbdc	Fix PR21189 -- Emit symbol subsection required to debug LLVM-built binaries with VS2012+ Reviewed at http://reviews.llvm.org/D5772 llvm-svn: 220544	2014-10-24 01:27:45 +00:00
David Blaikie	f4c504e03c	DebugInfo: Remove DwarfDebug::addScopeVariable now that it's just a trivial wrapper llvm-svn: 220542	2014-10-24 00:43:47 +00:00
Ahmed Bougacha	26f63f77cd	Make test for r220533 more robust by using GPR pattern. llvm-svn: 220541	2014-10-24 00:03:46 +00:00
Adam Nemet	832ec5e911	[AVX512] FMA support for the 231 variants This is asm/diasm-only support, similar to AVX. For ISeling the register variant, they are no different from 213 other than whether the multiplication or the addition operand is destructed. For ISeling the memory variant, i.e. to fold a load, they are no different than the 132 variant. The addition operand (op3) in both cases can come from memory. Again the ony difference is which operand is destructed. There could be a post-RA pass that would convert a 213 or 132 into a 231. Part of <rdar://problem/17082571> llvm-svn: 220540	2014-10-24 00:03:00 +00:00
Adam Nemet	26371ce131	[AVX512] Introduce fma3p_forms from AVX This multiclass generates the different forms: 213, 231, 132 in AVX. 132 in AVX512 is a separate class but I am planning to use this same multiclass to generate 231 relying on the nice the null_frag trick from AVX to disable codegen pattern for 231. No functionality change, no change in X86.td.expanded except for the different instruction definition names. llvm-svn: 220539	2014-10-24 00:02:55 +00:00
Nick Lewycky	592d84974c	If requested, apply function merging at -O0 too. It's useful there to reduce the time to compile. llvm-svn: 220537	2014-10-23 23:49:31 +00:00
Timur Iskhodzhanov	eb229ca928	Make getDISubprogram(const Function *F) available in LLVM Reviewed at http://reviews.llvm.org/D5950 llvm-svn: 220536	2014-10-23 23:46:28 +00:00
Ahmed Bougacha	7daf3b89f9	[SelectionDAG] Teach the vector scalarizer about FP conversions. This adds support for legalization of instructions of the form: [fp_conv] <1 x i1> %op to <1 x double> where fp_conv is one of fpto[us]i, [us]itofp. This used to assert because they were simply missing from the vector operand scalarizer. A similar problem arose in r190830, with trunc instead. Fixes PR20778. Differential Revision: http://reviews.llvm.org/D5810 llvm-svn: 220533	2014-10-23 22:49:25 +00:00
Ahmed Bougacha	e05afd4d1e	Update comment and fix typos in assert message. (NFC) llvm-svn: 220531	2014-10-23 22:40:34 +00:00
Juergen Ributzka	c91611967f	Update llvm.donothing documentation. llvm.donothing is no longer the only intrinsic that can be invoked. llvm-svn: 220530	2014-10-23 22:36:13 +00:00
Tim Northover	e4c7be56bf	ScheduleDAG: record PhysReg dependencies represented by CopyFromReg nodes x86's CMPXCHG -> EFLAGS consumer wasn't being recorded as a real EFLAGS dependency because it was represented by a pair of CopyFromReg(EFLAGS) -> CopyToReg(EFLAGS) nodes. ScheduleDAG was expecting the source to be an implicit-def on the instruction, where the result numbers in the DAG and the Uses list in TableGen matched up precisely. The Copy notation seems much more robust, so this patch extends ScheduleDAG rather than refactoring x86. Should fix PR20376. llvm-svn: 220529	2014-10-23 22:31:48 +00:00
David Blaikie	1dd573db45	DebugInfo: Remove DwarfDebug::CurrentFnArguments since we have to handle argument ordering of other arguments (abstract arguments) in the same way and already have code for that too. While refactoring this code I was confused by both the name I had introduced (addNonArgumentVariable... but it has all this logic to handle argument numbering and keep things in order?) and by the redundancy. Seems when I fixed the misordered inlined argument handling, I didn't realize it was mostly redundant with the argument ordering code (which I may've also written, I'm not sure). So let's just rely on the more general case. The only oddity in output this produces is that it means when we emit all the variables for the current function, we don't track when we've finished the argument variables and are about to start the local variables and insert DW_AT_unspecified_parameters (for varargs functions) there. Instead it ends up after the local variables, scopes, etc. But this isn't invalid and doesn't cause DWARF consumers problems that I know of... so we'll just go with that because it makes the code nice & simple. (though, let's see what the buildbots have to say about this - crosses fingers) There will be some cleanup commits to follow to remove the now trivial wrappers, etc. llvm-svn: 220527	2014-10-23 22:27:50 +00:00
Timur Iskhodzhanov	56af52f852	PR21189: Teach llvm-readobj to dump bits of COFF symbol subsections required to debug using VS2012+ Reviewed at http://reviews.llvm.org/D5755 Thanks to Andrey Guskov for his help investigating this! llvm-svn: 220526	2014-10-23 22:25:31 +00:00
David Blaikie	3b5c84008d	DebugInfo: Sink DwarfDebug::addNonArgumentScopeVariable into DwarfFile. llvm-svn: 220520	2014-10-23 22:04:30 +00:00
Hans Wennborg	db53e30eba	MachODump.cpp: fix MSVC build llvm-svn: 220518	2014-10-23 21:59:17 +00:00
Ahmed Bougacha	5175bcf43a	[X86] Improve mul w/ overflow codegen, to MUL8+SETO. Currently, @llvm.smul.with.overflow.i8 expands to 9 instructions, where 3 are really needed. This adds X86ISD::UMUL8/SMUL8 SD nodes, and custom lowers them to MUL8/IMUL8 + SETO. i8 is a special case because there is no two/three operand variants of (I)MUL8, so the first operand and return value need to go in AL/AX. Also, we can't write patterns for these instructions: TableGen refuses patterns where output operands don't match SDNode results. In this case, instructions where the output operand is an implicitly defined register. A related special case (and FIXME) exists for MUL8 (X86InstrArith.td): // FIXME: Used for 8-bit mul, ignore result upper 8 bits. // This probably ought to be moved to a def : Pat<> if the // syntax can be accepted. [(set AL, (mul AL, GR8:$src)), (implicit EFLAGS)] Ideally, these go away with UMUL8, but we still need to improve TableGen support of implicit operands in patterns. Before this change: movsbl %sil, %eax movsbl %dil, %ecx imull %eax, %ecx movb %cl, %al sarb $7, %al movzbl %al, %eax movzbl %ch, %esi cmpl %eax, %esi setne %al After: movb %dil, %al imulb %sil seto %al Also, remove a made-redundant testcase for PR19858, and enable more FastISel ALU-overflow tests for SelectionDAG too. Differential Revision: http://reviews.llvm.org/D5809 llvm-svn: 220516	2014-10-23 21:55:31 +00:00
David Blaikie	ff4181adec	DebugInfo: Remove DwarfDebug::addCurrentFnArgument declaration now that it's moved to DwarfFile. llvm-svn: 220515	2014-10-23 21:53:17 +00:00
Sanjay Patel	848309da7c	Handle sqrt() shrinking in SimplifyLibCalls like any other call This patch removes a chunk of special case logic for folding (float)sqrt((double)x) -> sqrtf(x) in InstCombineCasts and handles it in the mainstream path of SimplifyLibCalls. No functional change intended, but I loosened the restriction on the existing sqrt testcases to allow for this optimization even without unsafe-fp-math because that's the existing behavior. I also added a missing test case for not shrinking the llvm.sqrt.f64 intrinsic in case the result is used as a double. Differential Revision: http://reviews.llvm.org/D5919 llvm-svn: 220514	2014-10-23 21:52:45 +00:00
Kevin Enderby	078be601cf	Change the name of the field BindTable to bindtable to not over lap the type. Should fix the build bot issues from commit r220500. llvm-svn: 220504	2014-10-23 19:53:12 +00:00
Peter Collingbourne	dfc98c2e61	Make llvm-go test dependency optional. llvm-svn: 220503	2014-10-23 19:51:40 +00:00
Rafael Espindola	957eae22f4	Make llvm-link behave a bit more like LTO. * Read modules lazily * Don't treat the first file specially, instead merge all inputs into an empty module. llvm-svn: 220501	2014-10-23 19:40:45 +00:00

... 2 3 4 5 6 ...

109365 Commits