llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	b4b09c7a6c	[x86] Remove a test that wasn't doing anything really. We have plenty of better tests for zext of vectors at this point. llvm-svn: 218811	2014-10-01 20:50:58 +00:00
Chandler Carruth	ab5ddea2cb	[x86] Add a 32-bit run to the sext test, and remove a sad vec_sext.ll test file. This old test had a bunch of functions that were never even checked. =/ The only thing it really did was to make sure that we did something reasonable in 32-bit mode with SSE4.1. Adding another run line to the main vector-sext.ll test seems a better way to do that. llvm-svn: 218810	2014-10-01 20:49:54 +00:00
Fariborz Jahanian	77a835bf56	Objective-C Modernizer. Patch to remove dangling space before the semicolon wahen modernizing to use NS_ENUM/NS_OPTIONS macros. rdar://18498539 llvm-svn: 218809	2014-10-01 20:46:32 +00:00
Enrico Granata	e85e84a769	Add a new SBExecutionContext class that wraps an ExecutionContextRef. This class is a convenient way at the API level to package a target,process,thread and frame all together - or just a subset of those llvm-svn: 218808	2014-10-01 20:43:45 +00:00
Chandler Carruth	bbbdb9f0ee	[x86] Teach both sext and zext vector tests to cover a nice wide range of architectures: SSE2, SSSE3, SSE4.1, AVX, and AVX2. Unfortunately, this exposses the absolute horror of the code we generate for many of these patterns. Anyone wanting to familiarize themselves with the x86 backend and improve performance could do a lot of good sitting down and making these test cases not look so terrible. While the new vector shuffle code I'm working on well help some, it won't fix all of the crimes here. llvm-svn: 218807	2014-10-01 20:41:36 +00:00
Adrian Prantl	e6579cd9a6	Update testcase to new intrinsic format llvm-svn: 218806	2014-10-01 20:40:12 +00:00
Eric Christopher	36448af7f5	Rework the PPC TargetMachine so that the non-function specific overrides happen at TargetMachine creation and not on every subtarget creation. llvm-svn: 218805	2014-10-01 20:38:26 +00:00
Eric Christopher	12f4a78581	constify TargetMachine parameter for X86TargetLowering. llvm-svn: 218804	2014-10-01 20:38:22 +00:00
Sanjay Patel	7b2cd9ad86	Make the sqrt intrinsic return undef for a negative input. As discussed here: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140609/220598.html And again here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-September/077168.html The sqrt of a negative number when using the llvm intrinsic is undefined. We should return undef rather than 0.0 to match the definition in the LLVM IR lang ref. This change should not affect any code that isn't using "no-nans-fp-math"; ie, no-nans is a requirement for generating the llvm intrinsic in place of a sqrt function call. Unfortunately, the behavior introduced by this patch will not match current gcc, xlc, icc, and possibly other compilers. The current clang/llvm behavior of returning 0.0 doesn't either. We knowingly approve of this difference with the other compilers in an attempt to flag code that is invoking undefined behavior. A front-end warning should also try to convince the user that the program will fail: http://llvm.org/bugs/show_bug.cgi?id=21093 Differential Revision: http://reviews.llvm.org/D5527 llvm-svn: 218803	2014-10-01 20:36:33 +00:00
Chandler Carruth	b5c9e04b51	[x86] Sort the ISA-specific RUN lines for vector-sext.ll to go from oldest to newest. This makes more sense to me and is more consistent with other tests. llvm-svn: 218802	2014-10-01 20:32:44 +00:00
Tim Northover	6a1ef73140	ARM: yes it can (as of r218789) llvm-svn: 218801	2014-10-01 20:31:58 +00:00
Chandler Carruth	c66ea0fc12	[x86] Rename avx-{s,z}ext.ll to vector-{s,z}ext.ll. These tests are far and away the best sext and zext tests we have for vectors. I'm going to merge the other similar tests into them and expand the ISA coverage. llvm-svn: 218800	2014-10-01 20:30:30 +00:00
Chandler Carruth	011088fc46	[x86] Cleanup and re-generate the checks for avx-zext.ll using the new script. llvm-svn: 218799	2014-10-01 20:27:16 +00:00
Duncan P. N. Exon Smith	f3dc429ac7	DIBuilder: Encapsulate DIExpression's element type Update for corresponding LLVM API change for `DIBuilder::createExpression()`. llvm-svn: 218798	2014-10-01 20:26:18 +00:00
Duncan P. N. Exon Smith	611afb229c	DIBuilder: Encapsulate DIExpression's element type `DIExpression`'s elements are 64-bit integers that are stored as `ConstantInt`. The accessors already encapsulate the storage. This commit updates the `DIBuilder` API to also encapsulate that. llvm-svn: 218797	2014-10-01 20:26:08 +00:00
Nick Kledzik	22c9073ada	Add MachOLinkingContext parameter to MachOFileNode constructor. No functionality change. This removes a down-cast from LinkingContext to MachOLinkingContext. Also, remove const from LinkingContext::createImplicitFiles() to remove the need for another const cast. Seems reasonable for createImplicitFiles() to need to modify the context (MachOLinkingContext does). llvm-svn: 218796	2014-10-01 20:24:30 +00:00
Chandler Carruth	fbba2fa8d9	[x86] Generate the FileCheck assertions for avx-blend.ll with my new script to make them nice and predictable. This will ease updating them for the new vector shuffle lowering and seeing the delta if any. llvm-svn: 218795	2014-10-01 20:19:45 +00:00
Chandler Carruth	1f569b05b6	[x86] Clean up and generate detailed FileCheck assertions for avx-sext.ll using my new script. Also add an AVX2 mode to this test. Part of cleaning up the test suite before enabling the new vector shuffle lowering. This also highlights some of the abysmal failures of the old shuffle lowering. Check out those 'pinsrw' and 'pextrw' sequences! llvm-svn: 218794	2014-10-01 20:19:32 +00:00
Johannes Doerfert	c7b719fc03	Annotate LLVM-IR for all parallel loops This change allows to annotate all parallel loops with loop id metadata. Furthermore, it will annotate memory instructions with llvm.mem.parallel_loop_access metadata for all surrounding parallel loops. This is especially usefull if an external paralleliser is used. This also removes the PollyLoopInfo class and comments the LoopAnnotator. A test case for multiple parallel loops is attached. llvm-svn: 218793	2014-10-01 20:10:44 +00:00
Bruno Cardoso Lopes	e3c513a965	[MemoryDepAnalysis] Fix compile time slowdown - Problem One program takes ~3min to compile under -O2. This happens after a certain function A is inlined ~700 times in a function B, inserting thousands of new BBs. This leads to 80% of the compilation time spent in GVN::processNonLocalLoad and MemoryDependenceAnalysis::getNonLocalPointerDependency, while searching for nonlocal information for basic blocks. Usually, to avoid spending a long time to process nonlocal loads, GVN bails out if it gets more than 100 deps as a result from MD->getNonLocalPointerDependency. However this only happens after all nonlocal information for BBs have been computed, which is the bottleneck in this scenario. For instance, there are 8280 times where getNonLocalPointerDependency returns deps with more than 100 bbs and from those, 600 times it returns more than 1000 blocks. - Solution Bail out early during the nonlocal info computation whenever we reach a specified threshold. This patch proposes a 100 BBs threshold, it also reduces the compile time from 3min to 23s. - Testing The test-suite presented no compile nor execution time regressions. Some numbers from my machine (x86_64 darwin): - 17s under -Oz (which avoids inlining). - 1.3s under -O1. - 2m51s under -O2 ToT *** 23s under -O2 w/ Result.size() > 100 - 1m54s under -O2 w/ Result.size() > 500 With NumResultsLimit = 100, GVN yields the same outcome as in the unlimited 3min version. http://reviews.llvm.org/D5532 rdar://problem/18188041 llvm-svn: 218792	2014-10-01 20:07:13 +00:00
Sanjay Patel	0e4a83e89c	Don't repeat function/variable name in comment. NFC. llvm-svn: 218791	2014-10-01 19:39:32 +00:00
Adam Nemet	fd6a73d70a	[X86 disasm tblegen backend] Clean up numPhysicalOperands asserts No functionality change intended. This implements Elena's idea to put the new additionalOperand outside the switch to cover all cases (http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140929/237763.html). Note only nontrivial change is in MRMSrcMemFrm. This requires an inclusive interval of [2, 4] because we have prefix-dependent optional immediate operand. llvm-svn: 218790	2014-10-01 19:28:11 +00:00
Tim Northover	5d72c5de02	ARM: allow copying of CPSR when all else fails. As with x86 and AArch64, certain situations can arise where we need to spill CPSR in the middle of a calculation. These should be avoided where possible (MRS/MSR is rather expensive), which ARM is actually better at than the other two since it tries to Glue defs to uses, but as a last ditch effort, copying is better than crashing. rdar://problem/18011155 llvm-svn: 218789	2014-10-01 19:21:03 +00:00
Adrian Prantl	2706eb031d	Update CGDebugInfo to the updated API in LLVM. Complex address expressions are no longer part of DIVariable, but rather an extra argument to the debug intrinsics. http://reviews.llvm.org/D4919 rdar://problem/17994491 llvm-svn: 218788	2014-10-01 18:55:34 +00:00
Adrian Prantl	87b7eb9d0f	Move the complex address expression out of DIVariable and into an extra argument of the llvm.dbg.declare/llvm.dbg.value intrinsics. Previously, DIVariable was a variable-length field that has an optional reference to a Metadata array consisting of a variable number of complex address expressions. In the case of OpPiece expressions this is wasting a lot of storage in IR, because when an aggregate type is, e.g., SROA'd into all of its n individual members, the IR will contain n copies of the DIVariable, all alike, only differing in the complex address reference at the end. By making the complex address into an extra argument of the dbg.value/dbg.declare intrinsics, all of the pieces can reference the same variable and the complex address expressions can be uniqued across the CU, too. Down the road, this will allow us to move other flags, such as "indirection" out of the DIVariable, too. The new intrinsics look like this: declare void @llvm.dbg.declare(metadata %storage, metadata %var, metadata %expr) declare void @llvm.dbg.value(metadata %storage, i64 %offset, metadata %var, metadata %expr) This patch adds a new LLVM-local tag to DIExpressions, so we can detect and pretty-print DIExpression metadata nodes. What this patch doesn't do: This patch does not touch the "Indirect" field in DIVariable; but moving that into the expression would be a natural next step. http://reviews.llvm.org/D4919 rdar://problem/17994491 Thanks to dblaikie and dexonsmith for reviewing this patch! Note: I accidentally committed a bogus older version of this patch previously. llvm-svn: 218787	2014-10-01 18:55:02 +00:00
Duncan P. N. Exon Smith	08a83be3ea	LTO: Add missing target triple from r218784 llvm-svn: 218786	2014-10-01 18:49:58 +00:00
Reed Kotler	b9dc248e9e	Add fptrunc to mips fast-sel Summary: Implement conversion of 64 to 32 bit floating point numbers (fptrunc) in mips fast-isel Test Plan: fptrunc.ll checked also with 4 internal mips build bot flavors mip32r1/miprs32r2 and at -O0 and -O2 Reviewers: dsanders Reviewed By: dsanders Subscribers: rfuhler Differential Revision: http://reviews.llvm.org/D5553 llvm-svn: 218785	2014-10-01 18:47:02 +00:00
Duncan P. N. Exon Smith	30c9242caa	LTO: Ignore disabled diagnostic remarks r206400 and r209442 added remarks that are disabled by default. However, if a diagnostic handler is registered, the remarks are sent unfiltered to the handler. This is the right behaviour for clang, since it has its own filters. However, the diagnostic handler exposed in the LTO API receives only the severity and message. It doesn't have the information to filter by pass name. For LTO, disabled remarks should be filtered by the producer. I've changed `LLVMContext::setDiagnosticHandler()` to take a `bool` argument indicating whether to respect the built-in filters. This defaults to `false`, so other consumers don't have a behaviour change, but `LTOCodeGenerator::setDiagnosticHandler()` sets it to `true`. To make this behaviour testable, I added a `-use-diagnostic-handler` command-line option to `llvm-lto`. This fixes PR21108. llvm-svn: 218784	2014-10-01 18:36:03 +00:00
David Blaikie	847b37ec8a	Add an immovable type to test Optional<T>::emplace more rigorously after r218732. llvm-svn: 218783	2014-10-01 18:29:44 +00:00
Adrian Prantl	b458dc2eee	Revert r218778 while investigating buldbot breakage. "Move the complex address expression out of DIVariable and into an extra" llvm-svn: 218782	2014-10-01 18:10:54 +00:00
Adrian Prantl	af11fdba0a	Reverting r218777 while investigating buildbot breakage. "Update CGDebugInfo to the updated API in LLVM." llvm-svn: 218781	2014-10-01 18:10:14 +00:00
Fariborz Jahanian	7bd22e98be	c++ error recovery. Build a valid AST when trying to recover from parse error parsing the default argument. Patch prevents crash after spewing 100s of errors caused by someone who forgot to compile in c++11 mode. So no test. rdar://18508589 llvm-svn: 218780	2014-10-01 18:03:51 +00:00
Samuel Benzaquen	e261aa4311	Do not use delegated constructors. Do not use delegated constructors. It is not supported on all platforms yet. Fixes build broken by r218769. llvm-svn: 218779	2014-10-01 17:58:42 +00:00
Adrian Prantl	25a7174e7a	Move the complex address expression out of DIVariable and into an extra argument of the llvm.dbg.declare/llvm.dbg.value intrinsics. Previously, DIVariable was a variable-length field that has an optional reference to a Metadata array consisting of a variable number of complex address expressions. In the case of OpPiece expressions this is wasting a lot of storage in IR, because when an aggregate type is, e.g., SROA'd into all of its n individual members, the IR will contain n copies of the DIVariable, all alike, only differing in the complex address reference at the end. By making the complex address into an extra argument of the dbg.value/dbg.declare intrinsics, all of the pieces can reference the same variable and the complex address expressions can be uniqued across the CU, too. Down the road, this will allow us to move other flags, such as "indirection" out of the DIVariable, too. The new intrinsics look like this: declare void @llvm.dbg.declare(metadata %storage, metadata %var, metadata %expr) declare void @llvm.dbg.value(metadata %storage, i64 %offset, metadata %var, metadata %expr) This patch adds a new LLVM-local tag to DIExpressions, so we can detect and pretty-print DIExpression metadata nodes. What this patch doesn't do: This patch does not touch the "Indirect" field in DIVariable; but moving that into the expression would be a natural next step. http://reviews.llvm.org/D4919 rdar://problem/17994491 Thanks to dblaikie and dexonsmith for reviewing this patch! llvm-svn: 218778	2014-10-01 17:55:39 +00:00
Adrian Prantl	1400aaf8c8	Update CGDebugInfo to the updated API in LLVM. Complex address expressions are no longer part of DIVariable, but rather an extra argument to the debug intrinsics. http://reviews.llvm.org/D4919 rdar://problem/17994491 llvm-svn: 218777	2014-10-01 17:55:09 +00:00
Tom Stellard	79243d9664	R600: Call EmitFunctionHeader() in the AsmPrinter to populate the ELF symbol table llvm-svn: 218776	2014-10-01 17:15:17 +00:00
Tom Stellard	0a4e9a3b25	C API: Add LLVMCloneModule() llvm-svn: 218775	2014-10-01 17:14:57 +00:00
Fariborz Jahanian	5afc869f96	Adds 'override' to overriding methods. NFC. These were uncoveredby my yet undelivered patch. llvm-svn: 218774	2014-10-01 16:56:40 +00:00
Todd Fiala	e825f44761	thread state coordinator: replaced shortened type name Func suffix with Function. ThreadIDFunc => ThreadIDFunction LogFunc => LogIDFunction We try to avoid abbreviations/shortened names. Adjusted function parameter names as well to replace _func with _function. llvm-svn: 218773	2014-10-01 16:08:20 +00:00
Alexander Kornienko	97e8c3f6b5	[clang-tidy] Clarify a comment. No functional changes. llvm-svn: 218772	2014-10-01 15:50:31 +00:00
Jingyue Wu	fd47fb9976	Revert r216862 due to a performance regression Reported by Alexey Volkov in PR21115 llvm-svn: 218771	2014-10-01 15:22:13 +00:00
Todd Fiala	241ce99503	Minor tweak to Ed's FreeBSD fix. Fall back to including the Linux version if not on __FreeBSD__. Also covers __ANDROID__ case. llvm-svn: 218770	2014-10-01 15:10:37 +00:00
Samuel Benzaquen	f28d997083	Refactor Matcher<T> and DynTypedMatcher to reduce overhead of casts. Summary: This change introduces DynMatcherInterface and changes the internal representation of DynTypedMatcher and Matcher<T> to use a generic interface instead. It removes unnecessary indirections and virtual function calls when converting matchers by implicit and dynamic casts. DynTypedMatcher now remembers the stricter type in the chain of casts and checks it before calling into DynMatcherInterface. This change improves our clang-tidy related benchmark by ~14%. Also, it opens the door for more optimizations of this kind that are coming in future changes. As a side effect of removing these template instantiations, it also speeds up compilation of Dynamic/Registry.cpp by ~17% and reduces the number of symbols generated by ~30%. Reviewers: klimek Subscribers: klimek, cfe-commits Differential Revision: http://reviews.llvm.org/D5542 llvm-svn: 218769	2014-10-01 15:08:07 +00:00
Toma Tabacu	c4c202a9a7	[mips] Rename emit and parse functions for the .cpload assembler directive. NFC. Summary: It's better if we have a consistent name for .cpload-related functions. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5437 llvm-svn: 218768	2014-10-01 14:53:19 +00:00
Tom Stellard	3a35d8f4c2	R600/SI: Add a generic pseudo EXP instruction llvm-svn: 218767	2014-10-01 14:44:45 +00:00
Tom Stellard	0c238c2fbe	R600/SI: Add generic pseudo MTBUF instructions llvm-svn: 218766	2014-10-01 14:44:43 +00:00
Tom Stellard	c470c96e6b	R600/SI: Add generic pseudo SMRD instructions llvm-svn: 218765	2014-10-01 14:44:42 +00:00
Evgeniy Stepanov	d90c20bc26	[asan] Scale back mmap_limit_mb test. There is some strange interaction between mmap limit and unlimited stack (ulimit -s unlimited), which results in this test failing when run with "make". llvm-svn: 218764	2014-10-01 14:21:05 +00:00
Oliver Stannard	d4e0a4fd2c	[ARM] Allow selecting VRINT[APMXZR] and VCVT[BT] instructions for FPv5 Currently, we only codegen the VRINT[APMXZR] and VCVT[BT] instructions when targeting ARMv8, but they are actually present on any target with FP-ARMv8. Note that FP-ARMv8 is called FPv5 when is is part of an M-profile core, but they have the same instructions so we model them both as FPARMv8 in the ARM backend. llvm-svn: 218763	2014-10-01 13:13:18 +00:00
Ed Maste	81f59a09f2	Add a bandaid to fix the FreeBSD build r218568 added an explicit #include of the Linux ProcessMonitor.h to POSIXThread.cpp, rather than including just "ProcessMonitor.h" and relying on the build infrastructure for the appropriate paths. For now add #ifdefs in the source to use the FreeBSD or Linux header as appropriate; a cleaner fix (and perhaps some refactoring of the POSIX classes) should still be done later. llvm-svn: 218762	2014-10-01 12:56:39 +00:00

... 3 4 5 6 7 ...

183864 Commits All Branches Search

183864 Commits

All Branches