llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexei Starovoitov	7e453bb8be	BPF: emit an error message for unsupported signed division operation Signed-off-by: Yonghong Song <yhs@plumgrid.com> Signed-off-by: Alexei Starovoitov <ast@fb.com> llvm-svn: 263842	2016-03-18 22:02:47 +00:00
Chris Bieneman	4c0e45e8f6	Missed a few non-ascii characters llvm-svn: 263841	2016-03-18 21:59:33 +00:00
Chris Bieneman	ebb1dd9381	Updates based on post-commit review of r263834 * Renamed to be camel case, consistent with other docs. * Fixed non-ascii characters (this is what I get for writing docs on an iPad). llvm-svn: 263840	2016-03-18 21:57:51 +00:00
David Majnemer	22b72c1564	[obj2yaml, COFF] Assert that the alignment is not bogus llvm-svn: 263839	2016-03-18 21:51:14 +00:00
Easwaran Raman	26628d3015	Interface to get/set profile summary metadata to module Differential Revision: http://reviews.llvm.org/D17894 llvm-svn: 263835	2016-03-18 21:29:30 +00:00
Chris Bieneman	c5269c0a4b	[Docs] New documentation for advanced build configurations This document covers how to use some of the new complex build configurations CMake supports. Feedback and improvements welcomed! llvm-svn: 263834	2016-03-18 21:16:26 +00:00
Kostya Serebryany	49e409068a	[libFuzzer] add a flag close_fd_mask so that we can silence spammy targets by closing stderr/stdout llvm-svn: 263831	2016-03-18 20:58:29 +00:00
Matthias Braun	0d208fc9f6	MILexer: Add ErrorCallbackType typedef; NFC llvm-svn: 263829	2016-03-18 20:41:11 +00:00
Sanjoy Das	74af78e3b0	[IndVars] Make the fix for PR26973 more obvious; NFCI llvm-svn: 263828	2016-03-18 20:37:11 +00:00
Sanjoy Das	60fb899f28	[IndVars] Pass the right loop to isLoopInvariantPredicate The loop on IVOperand's incoming values assumes IVOperand to be an induction variable on the loop over which `S Pred X` is invariant; otherwise loop invariant incoming values to IVOperand are not guaranteed to dominate the comparision. This fixes PR26973. llvm-svn: 263827	2016-03-18 20:37:07 +00:00
Mike Aizatsky	075ed3eec1	Revert "[sancov] specifying sanitizer coverage dependencies." This fails on arm. This reverts commit 52c8e0f7119d1ea1050c0708565a8c92b73386d2. llvm-svn: 263825	2016-03-18 20:34:58 +00:00
Nicolai Haehnle	fa771811b3	AMDGPU: add missing braces around multi-line if block This fixes an issue with rL263658 pointed out by Tom Stellard. llvm-svn: 263823	2016-03-18 20:32:04 +00:00
Mike Spertus	024f4c17d1	Typesafe visualization of PointerIntPairs in Visual Studio In the <DisplayString> of PointerIntPair , I cast the pointer to the actual type, so VS can leverage it while visualizing, not unlike the recent change to PointerUnion visualization. In the expansion, the current code is casting to the incorrect type (wrong number of stars), so I fixed that as well. llvm-svn: 263821	2016-03-18 20:06:16 +00:00
Chad Rosier	cdfd7e7201	[AArch64] Enable more load clustering in the MI Scheduler. This patch adds unscaled loads and sign-extend loads to the TII getMemOpBaseRegImmOfs API, which is used to control clustering in the MI scheduler. This is done to create more opportunities for load pairing. I've also added the scaled LDRSWui instruction, which was missing from the scaled instructions. Finally, I've added support in shouldClusterLoads for clustering adjacent sext and zext loads that too can be paired by the load/store optimizer. Differential Revision: http://reviews.llvm.org/D18048 llvm-svn: 263819	2016-03-18 19:21:02 +00:00
Reid Kleckner	fbd7787d7e	[codeview] Only emit function ids for inlined functions We aren't referencing any other kind of function currently. Should save a bit on our debug info size. llvm-svn: 263817	2016-03-18 18:54:32 +00:00
Colin LeMahieu	0143146514	[MCParser] Accept uppercase radix variants 0X and 0B Differential Revision: http://reviews.llvm.org/D14781 llvm-svn: 263802	2016-03-18 18:22:07 +00:00
Mike Aizatsky	4f7994c8cb	[sancov] specifying sanitizer coverage dependencies. Summary: These dependencies would be used in the future to reduce the number of instrumented blocks(http://reviews.llvm.org/rL262103) This is submitted as a separate CL because of previous problems with ARM. Subscribers: aemerson Differential Revision: http://reviews.llvm.org/D18227 llvm-svn: 263797	2016-03-18 17:33:21 +00:00
Colin LeMahieu	307a83d76a	[llvm-objdump] Print <unknown> in place of instruction text if it couldn't be disassembled. llvm-svn: 263793	2016-03-18 16:26:48 +00:00
Nicolai Haehnle	95e8ffd398	AMDGPU: Overload return type of llvm.amdgcn.buffer.load.format Summary: Allow the selection of BUFFER_LOAD_FORMAT_x and _XY. Do this now before the frontend patches land in Mesa. Eventually, we may want to automatically reduce the size of loads at the LLVM IR level, which requires such overloads, and in some cases Mesa can generate them directly. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18255 llvm-svn: 263792	2016-03-18 16:24:40 +00:00
Nicolai Haehnle	ad63638f6d	AMDGPU/SI: Add llvm.amdgcn.buffer.atomic.* intrinsics Summary: These intrinsics expose the BUFFER_ATOMIC_* instructions and will be used by Mesa to implement atomics with buffer semantics. The intrinsic interface matches that of buffer.load.format and buffer.store.format, except that the GLC bit is not exposed (it is automatically deduced based on whether the return value is used). The change of hasSideEffects is required for TableGen to accept the pattern that matches the intrinsic. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, rivanvx, llvm-commits Differential Revision: http://reviews.llvm.org/D18151 llvm-svn: 263791	2016-03-18 16:24:31 +00:00
Nicolai Haehnle	3003ba00a3	AMDGPU: use ComplexPattern for offsets in llvm.amdgcn.buffer.load/store.format Summary: We cannot easily deduce that an offset is in an SGPR, but the Mesa frontend cannot easily make use of an explicit soffset parameter either. Furthermore, it is likely that in the future, LLVM will be in a better position than the frontend to choose an SGPR offset if possible. Since there aren't any frontend uses of these intrinsics in upstream repositories yet, I would like to take this opportunity to change the intrinsic signatures to a single offset parameter, which is then selected to immediate offsets or voffsets using a ComplexPattern. Reviewers: arsenm, tstellarAMD, mareko Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18218 llvm-svn: 263790	2016-03-18 16:24:20 +00:00
Sam Kolton	a74cd526e9	[AMDGPU] Assembler: Change dpp_ctrl syntax to match sp3 Review: http://reviews.llvm.org/D18267 llvm-svn: 263789	2016-03-18 15:35:51 +00:00
Benjamin Kramer	d96b0c14fb	[Fuzzer] Guard no_sanitize_memory attributes behind __has_feature. Otherwise GCC fails to build it because it doesn't know the attribute. llvm-svn: 263787	2016-03-18 14:19:19 +00:00
Krzysztof Parzyszek	902b61ed04	Fix some typos; NFC Patch by Dongyuan Liu. Differential Revision: http://reviews.llvm.org/D17977 llvm-svn: 263786	2016-03-18 14:08:42 +00:00
Simon Atanasyan	05f4c803bb	[llvm-objdump] Move test case to the X86 sub-directory because it depends on X86 target supporting. NFC. llvm-svn: 263781	2016-03-18 09:52:12 +00:00
Pete Cooper	64d075ec07	Always cache resolved paths as it even saves on StringMap lookups. Now that the resolved path cache stores the StringRef's, its best to just always cache the results, even when realpath isn't used. This way we'll still avoid the StringMap hashing and lookup. This also conveniently reorganises this code in a way I need for a future patch. llvm-svn: 263777	2016-03-18 05:04:04 +00:00
Ehsan Amiri	631ed04af0	adding another optimization opportunity to readme file llvm-svn: 263775	2016-03-18 04:02:25 +00:00
Pete Cooper	ef4e36ad99	Use StringRef's in resolved path cache to avoid extra internString lookups. NFC. ResolvedPaths was storing std::string's as a cache. We would then take those strings and look them up in the internString pool to get a unique StringRef for each path. This patch changes ResolvedPaths to store the StringRef pointing in to the internString pool itself. This way, when getResolvedPath returns a string, we know we have the StringRef we would find in the pool anyway. We can avoid the duplicate memory of the std::string's, and also the time from the lookup. Unfortunately my profiles show no runtime change here, but it should still save memory allocations which is nice. Reviewed by Frederic Riss. Differential Revision: http://reviews.llvm.org/D18259 llvm-svn: 263774	2016-03-18 03:48:09 +00:00
Kostya Serebryany	c43b584c1c	[libFuzzer] read corpus dirs recursively llvm-svn: 263773	2016-03-18 01:36:00 +00:00
Adam Nemet	709e3046ee	[LoopDataPrefetch] Add TTI to limit the number of iterations to prefetch ahead Summary: It can hurt performance to prefetch ahead too much. Be conservative for now and don't prefetch ahead more than 3 iterations on Cyclone. Reviewers: hfinkel Subscribers: llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D17949 llvm-svn: 263772	2016-03-18 00:27:43 +00:00
Adam Nemet	6d8beeca53	[LoopDataPrefetch/Aarch64] Allow selective prefetching of large-strided accesses Summary: And use this TTI for Cyclone. As it was explained in the original RFC (http://thread.gmane.org/gmane.comp.compilers.llvm.devel/92758), the HW prefetcher work up to 2KB strides. I am also adding tests for this and the previous change (D17943): * Cyclone prefetching accesses with a large stride * Cyclone not prefetching accesses with a small stride * Generic Aarch64 subtarget not prefetching either Reviewers: hfinkel Subscribers: aemerson, rengolin, llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D17945 llvm-svn: 263771	2016-03-18 00:27:38 +00:00
Adam Nemet	53e758fc55	[Aarch64] Add pass LoopDataPrefetch for Cyclone Summary: This wires up the pass for Cyclone but keeps it off for now because we need a few more TTIs. The getPrefetchMinStride value is not very well tuned right now but it works well with CFP2006/433.milc which motivated this. Tests will be added as part of the upcoming large-stride prefetching patch. Reviewers: t.p.northover Subscribers: llvm-commits, aemerson, hfinkel, rengolin Differential Revision: http://reviews.llvm.org/D17943 llvm-svn: 263770	2016-03-18 00:27:29 +00:00
Kostya Serebryany	945761b8c2	[libFuzzer] improve -merge functionality llvm-svn: 263769	2016-03-18 00:23:29 +00:00
Lang Hames	01864a2d01	[Support] Refactor Error unit tests to avoid duplicating work. Suggested by Dave Blaikie in review for r263749. Thanks Dave! llvm-svn: 263768	2016-03-18 00:12:37 +00:00
Mike Aizatsky	363d331f99	[sancov] html report: replacing uncovered functions with function coverage % table. Differential Revision: http://reviews.llvm.org/D18256 llvm-svn: 263767	2016-03-18 00:12:14 +00:00
Peter Collingbourne	a1f8625662	DebugInfo: Add ability to not emit DW_AT_vtable_elem_location for virtual functions. A virtual index of -1u indicates that the subprogram's virtual index is unrepresentable (for example, when using the relative vtable ABI), so do not emit a DW_AT_vtable_elem_location attribute for it. Differential Revision: http://reviews.llvm.org/D18236 llvm-svn: 263765	2016-03-17 23:58:03 +00:00
Lang Hames	64e936f41c	[Support] Address some of dblaikie's feedback for r263749. Fixes some missing std::moves and take Expected<T> by rvalue reference in the call operator. llvm-svn: 263764	2016-03-17 23:43:33 +00:00
Mike Aizatsky	26cc0377bc	Revert "allow lambdas in mapped_iterator" MSVC as usual: C:\Buildbot\Slave\llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast\llvm.src\include\llvm/ADT/STLExtras.h(120): error C2100: illegal indirection C:\Buildbot\Slave\llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast\llvm.src\include\llvm/IR/Instructions.h(3966): note: see reference to class template instantiation 'llvm::mapped_iterator<llvm::User::op_iterator,llvm::CatchSwitchInst::DerefFnTy>' being compiled This reverts commit e091dd63f1f34e043748e28ad160d3bc17731168. llvm-svn: 263760	2016-03-17 23:32:20 +00:00
Mike Aizatsky	35aff03cf2	allow lambdas in mapped_iterator Differential Revision: http://reviews.llvm.org/D17311 llvm-svn: 263759	2016-03-17 23:22:22 +00:00
Alexey Samsonov	c4b18305d7	[Docs] Fix a typo. llvm-svn: 263754	2016-03-17 23:08:01 +00:00
Tim Shen	5cdf75084a	[PPC, FastISel] Fix ordered/unordered fcmp For fcmp, major concern about the following 6 cases is NaN result. The comparison result consists of 4 bits, indicating lt, eq, gt and un (unordered), only one of which will be set. The result is generated by fcmpu instruction. However, bc instruction only inspects one of the first 3 bits, so when un is set, bc instruction may jump to to an undesired place. More specifically, if we expect an unordered comparison and un is set, we expect to always go to true branch; in such case UEQ, UGT and ULT still give false, which are undesired; but UNE, UGE, ULE happen to give true, since they are tested by inspecting !eq, !lt, !gt, respectively. Similarly, for ordered comparison, when un is set, we always expect the result to be false. In such case OGT, OLT and OEQ is good, since they are actually testing GT, LT, and EQ respectively, which are false. OGE, OLE and ONE are tested through !lt, !gt and !eq, and these are true. llvm-svn: 263753	2016-03-17 22:27:58 +00:00
Lang Hames	6935c2d322	[Support] Add ExitOnError utility to support tools that use the exit-on-error idiom. Most LLVM tool code exits immediately when an error is encountered and prints an error message to stderr. The ExitOnError class supports this by providing two call operators - one for Errors, and one for Expected<T>s. Calls to code that can return Errors (or Expected<T>s) can use these calls to bail out on error, and otherwise continue as if the operation had succeeded. E.g. Error foo(); Expected<int> bar(); int main(int argc, char *argv[]) { ExitOnError ExitOnErr; ExitOnErr.setBanner(std::string("Error in ") + argv[0] + ":"); // Exit if foo returns an error. No need to manually check error return. ExitOnErr(foo()); // Exit if bar returns an error, otherwise unwrap the contained int and // continue. int X = ExitOnErr(bar()); // ... return 0; } llvm-svn: 263749	2016-03-17 21:28:49 +00:00
Lang Hames	01a3cf4d31	[Support] Make Error::isA<T>() works on success values. llvm-svn: 263745	2016-03-17 20:35:00 +00:00
Adam Nemet	b0c4eae073	[LoopVectorize] Annotate versioned loop with noalias metadata Summary: Use the new LoopVersioning facility (D16712) to add noalias metadata in the vector loop if we versioned with memchecks. This can enable some optimization opportunities further down the pipeline (see the included test or the benchmark improvement quoted in D16712). The test also covers the bug I had in the initial version in D16712. The vectorizer did not previously use LoopVersioning. The reason is that the vectorizer performs its transformations in single shot. It creates an empty single-block vector loop that it then populates with the widened, if-converted instructions. Thus creating an intermediate versioned scalar loop seems wasteful. So this patch (rather than bringing in LoopVersioning fully) adds a special interface to LoopVersioning to allow the vectorizer to add no-alias annotation while still performing its own versioning. As the vectorizer propagates metadata from the instructions in the original loop to the vector instructions we also check the pointer in the original instruction and see if LoopVersioning can add no-alias metadata based on the issued memchecks. Reviewers: hfinkel, nadav, mzolotukhin Subscribers: mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D17191 llvm-svn: 263744	2016-03-17 20:32:37 +00:00
Adam Nemet	5eccf07df3	[LoopVersioning] Annotate versioned loop with noalias metadata Summary: If we decide to version a loop to benefit a transformation, it makes sense to record the now non-aliasing accesses in the newly versioned loop. This allows non-aliasing information to be used by subsequent passes. One example is 456.hmmer in SPECint2006 where after loop distribution, we vectorize one of the newly distributed loops. To vectorize we version this loop to fully disambiguate may-aliasing accesses. If we add the noalias markers, we can use the same information in a later DSE pass to eliminate some dead stores which amounts to ~25% of the instructions of this hot memory-pipeline-bound loop. The overall performance improves by 18% on our ARM64. The scoped noalias annotation is added in LoopVersioning. The patch then enables this for loop distribution. A follow-on patch will enable it for the vectorizer. Eventually this should be run by default when versioning the loop but first I'd like to get some feedback whether my understanding and application of scoped noalias metadata is correct. Essentially my approach was to have a separate alias domain for each versioning of the loop. For example, if we first version in loop distribution and then in vectorization of the distributed loops, we have a different set of memchecks for each versioning. By keeping the scopes in different domains they can conveniently be defined independently since different alias domains don't affect each other. As written, I also have a separate domain for each loop. This is not necessary and we could save some metadata here by using the same domain across the different loops. I don't think it's a big deal either way. Probably the best is to review the tests first to see if I mapped this problem correctly to scoped noalias markers. I have plenty of comments in the tests. Note that the interface is prepared for the vectorizer which needs the annotateInstWithNoAlias API. The vectorizer does not use LoopVersioning so we need a way to pass in the versioned instructions. This is also why the maps have to become part of the object state. Also currently, we only have an AA-aware DSE after the vectorizer if we also run the LTO pipeline. Depending how widely this triggers we may want to schedule a DSE toward the end of the regular pass pipeline. Reviewers: hfinkel, nadav, ashutosh.nema Subscribers: mssimpso, aemerson, llvm-commits, mcrosier Differential Revision: http://reviews.llvm.org/D16712 llvm-svn: 263743	2016-03-17 20:32:32 +00:00
Justin Bogner	ae341c6e9b	Bitcode: Error out instead of crashing on corrupt metadata I hit a crash in the bitcode reader on some corrupt input where an MDString had somehow been attached to an instruction instead of an MDNode. This input is pretty bogus, but we shouldn't be crashing on bad input here. This change adds error handling in all of the places where we currently have unchecked casts from Metadata to MDNode, which means we'll error out instead of crashing for that sort of input. Unfortunately, I don't have tests. Hitting this requires flipping bits in the input bitcode, and committing corrupt binary files to catch these cases is a bit too opaque and unmaintainable. llvm-svn: 263742	2016-03-17 20:12:06 +00:00
Tim Northover	498c56c240	ARM: stop asserting on weird <3 x Ty> vectors in ISelLowering. llvm-svn: 263741	2016-03-17 20:10:28 +00:00
Kostya Serebryany	c5575aabd6	[libFuzzer] deprecate several flags llvm-svn: 263739	2016-03-17 19:59:39 +00:00
Kostya Serebryany	23dbc390af	[libFuzzer] add __attribute__((no_sanitize_memory)) to two functions that may be called from signal handler(s) or from msan. This will hopefully avoid msan false reports which I can't reproduce llvm-svn: 263737	2016-03-17 19:42:35 +00:00
Michael J. Spencer	6979e74ce0	[msan fix] unitalized variable llvm-svn: 263736	2016-03-17 19:16:54 +00:00
Guozhi Wei	7b390ec4cd	[InstCombine] Combine A->B->A BitCast This patch enhances InstCombine to handle following case: A -> B bitcast PHI B -> A bitcast llvm-svn: 263734	2016-03-17 18:47:20 +00:00
Sanjoy Das	c9058ca9e0	[Statepoints] Export a magic constant into a header; NFC llvm-svn: 263733	2016-03-17 18:42:17 +00:00
Filipe Cabecinhas	c80cb3d0c6	[lit] Enqueue tests on a separate thread to not hit limits on parallel queues Summary: The multiprocessing.Queue.put() call can hang if we try queueing all the tests before starting to take them out of the queue. The current implementation hangs if tests exceed 2^^15, on Mac OS X. This might happen with a ninja check-all if one has a bunch of llvm projects. Reviewers: delcypher, bkramer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17609 llvm-svn: 263731	2016-03-17 18:27:33 +00:00
Valery Pykhtin	c4546ec0cf	[AMDGPU] add VI disassembler tests. NFC. Autogenerated from the corresponding assembler tests with a few FIXME added (will fix soon). Differential Revision: http://reviews.llvm.org/D18249 llvm-svn: 263729	2016-03-17 17:56:33 +00:00
Petar Jovanovic	0b44f24033	[PowerPC] Disable CTR loops optimization for soft float operations This patch prevents CTR loops optimization when using soft float operations inside loop body. Soft float operations use function calls, but function calls are not allowed inside CTR optimized loops. Patch by Aleksandar Beserminji. Differential Revision: http://reviews.llvm.org/D17600 llvm-svn: 263727	2016-03-17 17:11:33 +00:00
Derek Schuff	d4207ba0f6	[WebAssembly] Stackify code emitted by eliminateFrameIndex and SP writeback Summary: MRI::eliminateFrameIndex can emit several instructions to do address calculations; these can usually be stackified. Because instructions with FI operands can have subsequent operands which may be expression trees, find the top of the leftmost tree and insert the code before it, to keep the LIFO property. Also use stackified registers when writing back the SP value to memory in the epilog; it's unnecessary because SP will not be used after the epilog, and it results in better code. Differential Revision: http://reviews.llvm.org/D18234 llvm-svn: 263725	2016-03-17 17:00:29 +00:00
David Majnemer	511391feaa	[COFF] Refactor section alignment calculation Section alignment isn't completely trivial, let it live in one place so that we may reuse it in LLVM. llvm-svn: 263722	2016-03-17 16:55:18 +00:00
David Majnemer	62fed0c354	Forgot to commit this with r263692 llvm-svn: 263721	2016-03-17 16:55:11 +00:00
Changpeng Fang	234fcb81d3	AMDGPU/SI: Do not generate s_waitcnt after ds_permute/ds_bpermute Symmary: ds_permute/ds_bpermute do not read memory so s_waitcnt is not needed. Reviewers arsenm, tstellarAMD Subscribers llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D18197 llvm-svn: 263720	2016-03-17 16:43:50 +00:00
Nicolai Haehnle	79cad857a0	AMDGPU: mark atomic instructions as sources of divergence Summary: As explained by the comment, threads will typically see different values returned by atomic instructions even if the arguments are equal. Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18156 llvm-svn: 263719	2016-03-17 16:21:59 +00:00
Simon Pilgrim	0f37fbac51	[X86][SSE] Simplified blend-with-zero combining We were being too aggressive in trying to combine a shuffle into a blend-with-zero pattern, often resulting in a endless loop of contrasting combines This patch stops the combine if we already have a blend in place (means we miss some domain corrections) llvm-svn: 263717	2016-03-17 15:59:36 +00:00
Sanjay Patel	9e23fedaf0	propagate 'unpredictable' metadata on select instructions This is similar to D18133 where we allowed profile weights on select instructions. This extends that change to also allow the 'unpredictable' attribute of branches to apply to selects. A test to check that 'unpredictable' metadata is preserved when cloning instructions was checked in at: http://reviews.llvm.org/rL263648 Differential Revision: http://reviews.llvm.org/D18220 llvm-svn: 263716	2016-03-17 15:30:52 +00:00
Saleem Abdulrasool	071a099102	ARM: Revert SVN r253865, 254158, fix windows division The two changes together weakened the test and caused a regression with division handling in MSVC mode. They were applied to avoid an assertion being triggered in the block frequency analysis. However, the underlying problem was simply being masked rather than solved properly. Address the actual underlying problem and revert the changes. Rather than analyze the cause of the assertion, the division failure was assumed to be an overflow. The underlying issue was a subtle bug in the BB construction in the emission of the div-by-zero check (WIN__DBZCHK). We did not construct the proper successor information in the basic blocks, nor did we update the PHIs associated with the basic block when we split them. This would result in assertions being triggered in the block frequency analysis pass. Although the original tests are being removed, the tests themselves performed very little in terms of validation but merely tested that we did not assert when generating code. Update this with new tests that actually ensure that we do not regress on the code generation. llvm-svn: 263714	2016-03-17 14:10:49 +00:00
Simon Atanasyan	16f2460575	[llvm-objdump] Add REQUIRES x86 directive to fix buildbots llvm-svn: 263708	2016-03-17 11:09:21 +00:00
Simon Atanasyan	34223a7e5d	[llvm-objdump] Add '0x' prefix to a target displacement number to accent its hex format It might be hard to recognize a hexadecimal number without '0x' prefix. Besides that '0x' prefix corresponds to GNU objdump behaviour. Differential Revision: http://reviews.llvm.org/D18207 llvm-svn: 263705	2016-03-17 10:43:44 +00:00
Simon Atanasyan	58ee875296	[mips] Use `formatImm` call to print immediate value in the `MipsInstPrinter` That allows, for example, to print hex-formatted immediates using llvm-objdump --print-imm-hex command line option. Differential Revision: http://reviews.llvm.org/D18195 llvm-svn: 263704	2016-03-17 10:43:36 +00:00
Scott Egerton	d65377da78	[mips] Eliminate instances of "potentially uninitialised local variable" warnings, NFC Summary: This should eliminate all occurrences of this within LLVMMipsAsmParser. This patch is in response to http://reviews.llvm.org/D17983. I was unable to reproduce the warnings on my machine so please advise if this fixes the warnings. Reviewers: ariccio, vkalintiris, dsanders Subscribers: dblaikie, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18087 llvm-svn: 263703	2016-03-17 10:37:51 +00:00
Wilfred Hughes	1bf1be9933	Remove obselete reference to TypeResolve from the tutorial. TypeResolve went away in r134829 in 2011. llvm-svn: 263702	2016-03-17 10:20:58 +00:00
Wilfred Hughes	b59b488e21	Minor grammar fix in kaleidoscope tutorial. llvm-svn: 263700	2016-03-17 10:18:13 +00:00
Wilfred Hughes	aa1ea69c71	Further typo fixes in kaleidoscope tutorial. llvm-svn: 263697	2016-03-17 09:26:45 +00:00
Wilfred Hughes	2d6b4e568e	Fix typo in kaleidoscope tutorial. llvm-svn: 263696	2016-03-17 09:09:07 +00:00
David Majnemer	6f66f0a343	[yaml2obj, COFF] Correctly handle section alignment The section alignment field was marked optional but not provided a default value: initialize it with 0. While we are here, ensure that the section alignment is plausible. llvm-svn: 263692	2016-03-17 05:43:26 +00:00
Sanjoy Das	312038872d	[Statepoints] Separate out logic for statepoint directives; NFC This splits out the logic that maps the `"statepoint-id"` attribute into the actual statepoint ID, and the `"statepoint-num-patch-bytes"` attribute into the number of patchable bytes the statpeoint is lowered into. The new home of this logic is in IR/Statepoint.cpp, and this refactoring will support similar functionality when lowering calls with deopt operand bundles in the future. llvm-svn: 263685	2016-03-17 01:56:10 +00:00
Sanjoy Das	c291e1fd4f	[Statepoint] Remove unused header; NFC llvm-svn: 263684	2016-03-17 01:15:27 +00:00
Sanjoy Das	d6fc46ea03	[Statepoints] Minor NFC cleanups Mostly code simplifcations, and bringing up IR/Statepoints.cpp up to LLVM coding style. llvm-svn: 263683	2016-03-17 00:47:18 +00:00
Sanjoy Das	3a02019fbc	[SelectionDAG] Remove visitStatepoint; NFC This way we have a single entry point into StatepointLowering. The method was a direct dispatch to LowerStatepoint anyway. llvm-svn: 263682	2016-03-17 00:47:14 +00:00
Pete Cooper	c3b847db4f	Require allocator parameter to YAML traits. The allocator here can still be a nullptr, but this atleast makes the single caller which needed nullptr be explicit about it. Note, lld started always passing a parameter here as of r263680. If anything builds out of sync, that would be why errors may occur. llvm-svn: 263681	2016-03-17 00:34:54 +00:00
Sanjay Patel	3b32ebb97b	use FileCheck for tighter checking llvm-svn: 263679	2016-03-16 23:39:37 +00:00
Pete Cooper	5b78308689	Add optional allocator to YAML code to avoid leaking lld atoms. In lld we allocate atoms on an allocator and so don't run their destructors. This means we also shouldn't allocate memory inside them without that also being on an allocator. Reviewed by Lang Hames and Rafael Espindola. llvm-svn: 263676	2016-03-16 23:29:31 +00:00
Sanjay Patel	b672e792f2	reduce check strings; no need to check IR comments llvm-svn: 263675	2016-03-16 23:22:01 +00:00
Sanjay Patel	355c77e796	use FileCheck for tighter checking llvm-svn: 263674	2016-03-16 23:20:20 +00:00
Chris Bieneman	671d0dda7d	Upgrade TBAA before upgrading intrinsics Summary: If TBAA is on an intrinsic and it gets upgraded and drops the TBAA we hit an odd assert. We should just upgrade the TBAA first because it doesn't have side-effects. Reviewers: reames, apilipenko, manmanren Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18229 llvm-svn: 263673	2016-03-16 23:17:54 +00:00
Sanjoy Das	43e33d61c6	Fix indentation; NFC llvm-svn: 263672	2016-03-16 23:11:21 +00:00
Sanjoy Das	70697ff74d	Extract out a SelectionDAGBuilder::LowerAsStatepoint; NFC Summary: This is a step towards implementing "direct" lowering of calls and invokes with deopt operand bundles into STATEPOINT nodes (as opposed to having them mandatorily pass through RewriteStatepointsForGC, which is the case today). This change extracts out a `SelectionDAGBuilder::LowerAsStatepoint` helper function that is able to lower a "statepoint like thing", and uses it to lower `gc.statepoint` calls. This is an NFC now, but in a later change we will use `LowerAsStatepoint` to directly lower calls and invokes with operand bundles without going through an intermediate `gc.statepoint` IR representation. FYI: I expect `SelectionDAGBuilder::StatepointInfo` will evolve as I add support for lowering non gc.statepoints, right now it is fairly tightly coupled with an IR level `gc.statepoint`. Reviewers: reames, pgavlin, JosephTremoulet Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18106 llvm-svn: 263671	2016-03-16 23:08:00 +00:00
Sanjay Patel	6cec10572f	use FileCheck for tighter checking I'm testing out a script that auto-generates the check lines. It's 98% copied from utils/update_llc_test_checks.py. If others think this is useful, please let me know. llvm-svn: 263668	2016-03-16 22:34:57 +00:00
Sanjay Patel	cb775fcf22	use FileCheck for tighter checking I'm testing out a script that auto-generates the check lines. It's 98% copied from utils/update_llc_test_checks.py. If others think this is useful, please let me know. llvm-svn: 263667	2016-03-16 22:29:07 +00:00
Xinliang David Li	897d2923a2	Variable name cleanup /NFC llvm-svn: 263666	2016-03-16 22:13:41 +00:00
James Y Knight	f44fc5219f	Tweak some atomics functions in preparation for larger changes; NFC. - Rename getATOMIC to getSYNC, as llvm will soon be able to emit both '__sync' libcalls and '__atomic' libcalls, and this function is for the '__sync' ones. - getInsertFencesForAtomic() has been replaced with shouldInsertFencesForAtomic(Instruction), so that the decision can be made per-instruction. This functionality will be used soon. - emitLeadingFence/emitTrailingFence are no longer called if shouldInsertFencesForAtomic returns false, and thus don't need to check the condition themselves. llvm-svn: 263665	2016-03-16 22:12:04 +00:00
Sanjoy Das	19c6159833	[SelectionDAG] Extract out populateCallLoweringInfo; NFC SelectionDAGBuilder::populateCallLoweringInfo is now used instead of SelectionDAGBuilder::lowerCallOperands. The populateCallLoweringInfo interface is more composable in face of design changes like http://reviews.llvm.org/D18106 llvm-svn: 263663	2016-03-16 20:49:31 +00:00
Vedant Kumar	aa0cae6208	[ProfileData] Make a utility method public, NFC The swift frontend needs to be able to look up PGO function name variables based on the original raw function name. That's because it's not possible to create PGO function name variables while emitting swift IR. Instead, we have to create the name variables while lowering swift IR to llvm IR, at which point we fix up all calls to the increment intrinsic to point to the right name variable. llvm-svn: 263662	2016-03-16 20:49:26 +00:00
Nicolai Haehnle	ef160de3e5	AMDGPU: Prevent uniform loops from becoming infinite Summary: Uniform loops where the branch leaving the loop is predicated on VCCNZ must be skipped if EXEC = 0, otherwise they will be infinite. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18137 llvm-svn: 263658	2016-03-16 20:14:33 +00:00
Colin LeMahieu	bb0cdfb9f7	[Hexagon] Adding missing break in switch statement. Extra operands would have been appended to the end. llvm-svn: 263657	2016-03-16 20:00:38 +00:00
Chad Rosier	fea398188c	[SLP] Make DataLayout a member variable. llvm-svn: 263656	2016-03-16 19:48:42 +00:00
Geoff Berry	56fabf9b55	Revert "[LSR] Create fewer redundant instructions." This reverts commit r263644. Investigating bootstrap failures. llvm-svn: 263655	2016-03-16 19:21:47 +00:00
Simon Pilgrim	b5a20f0fec	Removed trailing whitespace llvm-svn: 263650	2016-03-16 18:37:44 +00:00
Simon Pilgrim	53f27fc00f	[X86] Reduced alignment of widened vector load/stores to better match PR26953 cases llvm-svn: 263649	2016-03-16 18:32:44 +00:00
Sanjay Patel	fd24fb1b2b	add checks for 'unpredictable' metadata preservation llvm-svn: 263648	2016-03-16 18:15:34 +00:00
Sanjay Patel	be37e62e0c	fix function names; NFC llvm-svn: 263646	2016-03-16 18:00:09 +00:00
Evgeniy Stepanov	4b96ed693a	[msan] Add a comment with a bug link. llvm-svn: 263645	2016-03-16 17:39:17 +00:00
Geoff Berry	459b750871	[LSR] Create fewer redundant instructions. Summary: Fix LSRInstance::HoistInsertPosition() to check the original insert position block first for a canonical insertion point that is dominated by all inputs. This leads to SCEV being able to reuse more instructions since it currently tracks the instructions it creates for reuse by keeping a table of <Value, insert point> pairs. Reviewers: atrick Subscribers: mcrosier, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D18001 llvm-svn: 263644	2016-03-16 17:29:49 +00:00
Simon Pilgrim	60228bdb80	[X86] Regenerated + extended widened vector conversion tests - Ensure we test X86 + X64 - sitopfp / uitofp requires testing for SSE2 and SSE42 as well (part of the fix for PR26953) llvm-svn: 263640	2016-03-16 15:33:43 +00:00
Michel Danzer	302f83ac4e	AMDGPU: Verify instructions in non-debug builds as well And emit an error if it fails. This prevents illegal instructions from getting sent to the GPU, which would potentially result in a hang. This is a candidate for the stable branch(es). Reviewed-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 263627	2016-03-16 09:10:42 +00:00
Michel Danzer	beb79ceb19	AMDGPU/SI: Clean up indentation in SIInstrInfo::getDefaultRsrcDataFormat Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 263626	2016-03-16 09:10:35 +00:00
Igor Breger	0ba7b04f5f	AVX512BW: Fix SRA v64i8 lowering. Use PCMPGTM (cmp result in k register) for 512bit vector because PCMPGT supported only for 128/256bit. Differential Revision: http://reviews.llvm.org/D18204 llvm-svn: 263624	2016-03-16 08:48:26 +00:00
Vedant Kumar	ddfec8cb55	[Bitcode] Add compatibility test for the 3.8 release Fork off compatibility.ll for the 3.8 release. The *.bc file in this commit was produced using a Release build of the release_38 branch. llvm-svn: 263620	2016-03-16 05:43:03 +00:00
Haicheng Wu	7873857a88	[JumpThreading] See through Cast Instructions To capture more jump-thread opportunity. llvm-svn: 263618	2016-03-16 04:52:52 +00:00
Lang Hames	9fb8e1b2a5	[Support] Update Error unit test to remove implementation specific behaviour. llvm-svn: 263610	2016-03-16 01:20:54 +00:00
Lang Hames	f7f6d3e93f	[Support] Add the 'Error' class for structured error handling. This patch introduces the Error classs for lightweight, structured, recoverable error handling. It includes utilities for creating, manipulating and handling errors. The scheme is similar to exceptions, in that errors are described with user-defined types. Unlike exceptions however, errors are represented as ordinary return types in the API (similar to the way std::error_code is used). For usage notes see the LLVM programmer's manual, and the Error.h header. Usage examples can be found in unittests/Support/ErrorTest.cpp. Many thanks to David Blaikie, Mehdi Amini, Kevin Enderby and others on the llvm-dev and llvm-commits lists for lots of discussion and review. llvm-svn: 263609	2016-03-16 01:02:46 +00:00
Simon Pilgrim	39d2411c3b	[X86] Regenerated widen load tests llvm-svn: 263608	2016-03-16 00:41:21 +00:00
Simon Pilgrim	f7cb16f7de	[X86][SSE41] Additional tests for extracting zeroable shuffle elements We can currently only match zeroable vector elements of the same size as the shuffle type - these tests demonstrate the problem and a solution will be shortly added in an updated D14261 llvm-svn: 263606	2016-03-16 00:13:36 +00:00
Haicheng Wu	64d9d7c3f7	Revert "[JumpThreading] Simplify Instructions first in ComputeValueKnownInPredecessors()" Not sure it handles undef properly. llvm-svn: 263605	2016-03-15 23:38:47 +00:00
Justin Bogner	68b28d0054	llvm-bcanalyzer: Fix handling of attribute group entries These were printing as "UnknownCode3", since we were looking for them inside PARAMATTR blocks instead of PARAMATTR_GROUP blocks. llvm-svn: 263597	2016-03-15 22:37:25 +00:00
Adam Nemet	c979c6e123	Turn LoopLoadElimination on again The latent bug that LLE exposed in the LoopVectorizer was resolved (PR26952). The pass can be disabled with -mllvm -enable-loop-load-elim=0 llvm-svn: 263595	2016-03-15 22:26:12 +00:00
Mike Aizatsky	298516ffa9	[libfuzzer] speeding up corpus load llvm-svn: 263591	2016-03-15 21:47:21 +00:00
Bjorn Steinbrink	37ca462508	Also handle the new Rust pers fn to isCatchAll() llvm-svn: 263585	2016-03-15 20:57:07 +00:00
Bjorn Steinbrink	59fdec673d	Add Rust's personality function to the list of known personality functions Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18192 llvm-svn: 263581	2016-03-15 20:35:45 +00:00
Evgeniy Stepanov	d6e91369d8	[msan] Don't put module constructors in comdats. There is something strange going on with debug info (.eh_frame_hdr) disappearing when msan.module_ctor are placed in comdat sections. Moving this functionality under flag, disabled by default. llvm-svn: 263579	2016-03-15 20:25:47 +00:00
Mehdi Amini	ef156c6701	Add missing error handling in llvm-lto Annoyingly, ErrorOr allows to not check the error when things go well. It will crash badly when there is an error though. It should runtime assert when it is used without being checked! From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263577	2016-03-15 20:17:55 +00:00
Teresa Johnson	1396809b17	[ThinLTO] Record all global variable defs in the summary Record all variable defs with a summary record to aid in building a complete reference graph and locating constant variable defs to import. llvm-svn: 263576	2016-03-15 19:35:45 +00:00
Quentin Colombet	fdc838e97f	[MIR] Add a test case for the diagnostic of a wrongly typed generic instruction llvm-svn: 263573	2016-03-15 18:31:29 +00:00
Quentin Colombet	a3f2abad55	[AArch64] Move GlobalISel test cases into a GlobalISel subdirectory llvm-svn: 263572	2016-03-15 18:30:00 +00:00
Chris Bieneman	ef43d448d4	[CMake] Add PACKAGE_VENDOR for customizing version output Summary: This change adds a PACKAGE_VENDOR variable. When set it makes the version output more closely resemble the clang version output. Reviewers: aprantl, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18159 llvm-svn: 263566	2016-03-15 18:07:46 +00:00
Adam Nemet	fdb20595a1	[LV] Preserve LoopInfo when store predication is used This was a latent bug that got exposed by the change to add LoopSimplify as a dependence to LoopLoadElimination. Since LoopInfo was corrupted after LV, LoopSimplify mis-compiled nbench in the test-suite (more details in the PR). The problem was that when we create the blocks for predicated stores we didn't add those to any loops. The original testcase for store predication provides coverage for this assuming we verify LI on the way out of LV. Fixes PR26952. llvm-svn: 263565	2016-03-15 18:06:20 +00:00
Davide Italiano	dfdf278ebf	[MC] Rename TLSDESC as it's not ARM specific. Similarly to what was done for TLSCALL in r263515. llvm-svn: 263564	2016-03-15 17:29:52 +00:00
Changpeng Fang	01f6062227	AMDGPU/SI: Implement GroupStaticSize Intrinsic for Dynamic LDS Summary: Static LDS size is saved in MachineFunctionInfo::LDSSize, We define a pseudo instruction with usesCustomInserter bit set. Then, in EmitInstrWithCustomInserter, we replace this pseudo instruction with a mov of MachineFunctionInfo::LDSSize. Reviewers: arsenm tstellarAMD Subscribers llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D18064 llvm-svn: 263563	2016-03-15 17:28:44 +00:00
Hemant Kulkarni	c030f23b8b	[llvm-readobj] Impl GNU style printing of sections and relocations Differential Revision: http://reviews.llvm.org/D17523 llvm-svn: 263561	2016-03-15 17:25:31 +00:00
Douglas Katzman	708eeb0519	Myriad: Add new sparc CPU kinds. llvm-svn: 263557	2016-03-15 16:41:47 +00:00
Benjamin Kramer	96f4b12880	[GlobalOpt] Don't look through aliases when sorting names of globals. If both are different aliases to the same value the sorting becomes non-deterministic as array_pod_sort is not stable. llvm-svn: 263550	2016-03-15 14:18:26 +00:00
Chad Rosier	ebe559019b	[SLP] Update comment to reflect reality. NFC. llvm-svn: 263548	2016-03-15 13:27:58 +00:00
Nikolay Haustov	cb9dddb1d7	[AMDGPU] Assembler: Update SOP* tests Add VI encodings. Reformat sopp.s to match style of other files. Differential Revision: http://reviews.llvm.org/D18084 llvm-svn: 263540	2016-03-15 07:44:57 +00:00
David Majnemer	0ab61bfb37	[llvm-objdump] Add support for dumping the PE TLS directory The PE TLS directory contains information about where the TLS data resides in the image, what functions should be executed when threads are created, etc. llvm-svn: 263537	2016-03-15 06:14:01 +00:00
Andrew Wilkins	36761bf924	bindings/go: reinstate TargetMachine.TargetData Summary: LLVMGetTargetDataLayout was removed from the C API, and then TargetMachine.TargetData was removed. Later, LLVMCreateTargetMachineData was added to the C API, and we now expose this via the Go API. Reviewers: deadalnix, pcc Subscribers: cierniak, llvm-commits, axw Differential Revision: http://reviews.llvm.org/D18173 llvm-svn: 263530	2016-03-15 05:04:06 +00:00
Lang Hames	abda4d2526	[MachO] Extend the alt_entry support for aliases added in r263521 to expressions of the form 'a = .' and 'a = Ltmp'. llvm-svn: 263528	2016-03-15 04:20:49 +00:00
Eric Christopher	257338ff0f	Use some braces to format this a little better. llvm-svn: 263527	2016-03-15 03:01:31 +00:00
Teresa Johnson	2794f71575	BitcodeWriter dyn_cast cleanup for r263275 (NFC) Address review suggestions from dblaikie: change a few dyn_cast to cast and fold a cast into if condition. llvm-svn: 263526	2016-03-15 02:41:29 +00:00
Eric Christopher	ee00abe5e6	Fix llvm/llvm/lib/Transforms/Utils/LoopUnroll.cpp:285:53: error: suggest parentheses around '&&' within '\|\|' [-Werror=parentheses]. llvm-svn: 263525	2016-03-15 02:19:06 +00:00
Teresa Johnson	b43027d1e0	Move global ID computation from Function to GlobalValue (NFC) Since the static getGlobalIdentifier and getGUID methods are now called for global values other than functions, reflect that by moving these methods to the GlobalValue class. llvm-svn: 263524	2016-03-15 02:13:19 +00:00
Fiona Glaser	a4b1ace461	DenseMap: make .resize() do the intuitive thing In some places, like InstCombine, we resize a DenseMap to fit the elements we intend to put in it, then insert those elements (to avoid continual reallocations as it grows). But .resize(foo) doesn't actually do what people think; it resizes to foo buckets (which is really an implementation detail the user of DenseMap probably shouldn't care about), not the space required to fit foo elements. DenseMap grows if 3/4 of its buckets are full, so this actually causes one forced reallocation every time instead of avoiding a reallocation. This patch makes .resize(foo) do the intuitive thing: it grows to the size necessary to fit foo elements without new allocations. Also include a test to verify that .resize() actually does what we think it does. llvm-svn: 263522	2016-03-15 01:50:46 +00:00
Lang Hames	1b640e05ba	[MachO] Add MachO alt-entry directive support. This patch adds support for the MachO .alt_entry assembly directive, and uses it for global aliases with non-zero GEP offsets. The alt_entry flag indicates that a symbol should be layed out immediately after the preceding symbol. Conceptually it introduces an alternate entry point for a function or data structure. E.g.: safe_foo: // check preconditions for foo .alt_entry fast_foo fast_foo: // body of foo, can assume preconditions. The .alt_entry flag is also implicitly set on assembly aliases of the form: a = b + C where C is a non-zero constant, since these have the same effect as an alt_entry symbol: they introduce a label that cannot be moved relative to the preceding one. Setting the alt_entry flag on aliases of this form fixes http://llvm.org/PR25381. llvm-svn: 263521	2016-03-15 01:43:05 +00:00
Kostya Serebryany	0c5e3af862	[libFuzzer] use max_len exactly equal to the max size of input. Fix 32-bit build llvm-svn: 263518	2016-03-15 01:28:00 +00:00
Sanjoy Das	c11460e051	[StatepointLowering] Move an assertion; NFCI Instead of running an explicit loop over `gc.relocate` calls hanging off of a `gc.statepoint`, assert the validity of the type of the value being relocated in `visitRelocate`. llvm-svn: 263516	2016-03-15 01:16:31 +00:00
Davide Italiano	249c45d92e	[MC] Rename TLSCALL as it's not ARM specific. `MCSymbolRefExpr` variant kind for TLSCALL is prefixed with _ARM_ since this is how it was originally implemented. The X86_64 version is exactly the same so there's no reason to create a new variant, we can just rename the existing one to be machine-independent. This generalization is the first step to implement support for GNU2 TLS dialect in MC. Differential Revision: http://reviews.llvm.org/D18160 llvm-svn: 263515	2016-03-15 00:25:22 +00:00
Teresa Johnson	26ab5772b0	[ThinLTO] Renaming of function index to module summary index (NFC) (Resubmitting after fixing missing file issue) With the changes in r263275, there are now more than just functions in the summary. Completed the renaming of data structures (started in r263275) to reflect the wider scope. In particular, changed the FunctionIndex* data structures to ModuleIndex*, and renamed related variables and comments. Also renamed the files to reflect the changes. A companion clang patch will immediately succeed this patch to reflect this renaming. llvm-svn: 263513	2016-03-15 00:04:37 +00:00
Eric Christopher	da8b3f1914	Temporarily Revert "[X86][SSE] Simplify vector LOAD + EXTEND on pre-SSE41 hardware" as it seems to be causing crashes during code generation in halide. PR forthcoming. This reverts commit r263303. llvm-svn: 263512	2016-03-14 23:59:57 +00:00
Justin Lebar	6827de19b2	[LoopUnroll] Respect the convergent attribute. Summary: Specifically, when we perform runtime loop unrolling of a loop that contains a convergent op, we can only unroll k times, where k divides the loop trip multiple. Without this change, we'll happily unroll e.g. the following loop for (int i = 0; i < N; ++i) { if (i == 0) convergent_op(); foo(); } into int i = 0; if (N % 2 == 1) { convergent_op(); foo(); ++i; } for (; i < N - 1; i += 2) { if (i == 0) convergent_op(); foo(); foo(); }. This is unsafe, because we've just added a control-flow dependency to the convergent op in the prelude. In general, runtime unrolling loops that contain convergent ops is safe only if we don't have emit a prelude, which occurs when the unroll count divides the trip multiple. Reviewers: resistor Subscribers: llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D17526 llvm-svn: 263509	2016-03-14 23:15:34 +00:00
NAKAMURA Takumi	fa99425667	MathExtrasTest.cpp: Use EXPECT_DOUBLE_EQ here, instead of EXPECT_FLOAT_EQ. llvm-svn: 263508	2016-03-14 23:11:28 +00:00
Rui Ueyama	113819bf8b	Object: Add ELF types to ELFType. These types are defined in ELFFile, so in order to use them, you have to write ELFFile<ELFT>::SomeType. But there seems to be no reason to have ELFFile have these types. This patch allows you to write ELFT::SomeType instead. This simplifies libObject users. This is an example: http://reviews.llvm.org/D18129 http://reviews.llvm.org/D18130 llvm-svn: 263504	2016-03-14 22:58:21 +00:00
Amaury Sechet	bdb261b4c0	Imporove load to store => memcpy Summary: This now try to reorder instructions in order to help create the optimizable pattern. Reviewers: craig.topper, spatel, dexonsmith, Prazek, chandlerc, joker.eph, majnemer Differential Revision: http://reviews.llvm.org/D16523 llvm-svn: 263503	2016-03-14 22:52:27 +00:00
Manuel Jacob	6be355961e	Re-add ConstantFoldInstOperands form taking opcode and return type. Summary: This form was replaced by a form taking an instruction instead of opcode and return type in r258391. After committing this change (and some depending, follow-up changes) it turned out in the review thread to be controversial. The discussion didn't come to a conclusion yet. I'm re-adding the old form to fix the API regression and to provide a better base for discussion, possibly on llvm-dev. A difference to the original function is that it can't be called with GEPs (similarly to how it was already the case for compares). In order to support opaque pointers in the future, folding GEPs needs to be passed the source element type, which is not possible with the current API. Reviewers: dberlin, reames Subscribers: dblaikie, eddyb Differential Revision: http://reviews.llvm.org/D17901 llvm-svn: 263501	2016-03-14 22:34:17 +00:00
Easwaran Raman	f9709ee001	Remove code added for debugging purposes. NFC. llvm-svn: 263500	2016-03-14 22:23:28 +00:00
Justin Bogner	2414c5d46b	llvm-shlib: Remove the option to override __cxa_atexit If anybody is actually using this, it probably doesn't do what they think it does. This actually causes the dylib to export a __cxa_atexit symbol, so anything that links it probably loses their exit time destructors as well as disabling LLVM's. This just removes the option entirely. If somebody does need this behaviour we should figure out a more principled way to do it. This is effectively a revert of r223805. llvm-svn: 263498	2016-03-14 21:54:45 +00:00
Reid Kleckner	ecb4090716	llvm-config: fix --libs on Linux Summary: llvm-config --libs does not produce correct output since commit r260263 (llvm-config: Add preliminary Windows support) changed naming format of the libraries. This patch updates llvm-config to recognize new naming format and output correct linker flags. Ref: https://llvm.org/bugs/show_bug.cgi?id=26581 Patch by Vedran Miletić Reviewers: ehsan, rnk, pxli168 Subscribers: pxli168 Differential Revision: http://reviews.llvm.org/D17300 llvm-svn: 263497	2016-03-14 21:39:58 +00:00
Amaury Sechet	eae09c2c2a	Factor out MachineBlockPlacement::fillWorkLists. NFC Summary: There are places in MachineBlockPlacement where a worklist is filled in pretty much identical way. The code is duplicated. This refactor it so that the same code is used in both scenarii. Reviewers: chandlerc, majnemer, rafael, MatzeB, escha, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18077 llvm-svn: 263495	2016-03-14 21:24:11 +00:00
Teresa Johnson	cec0cae313	Revert "[ThinLTO] Renaming of function index to module summary index (NFC)" This reverts commit r263490. Missed a file. llvm-svn: 263493	2016-03-14 21:18:10 +00:00
Teresa Johnson	892920b358	[ThinLTO] Renaming of function index to module summary index (NFC) With the changes in r263275, there are now more than just functions in the summary. Completed the renaming of data structures (started in r263275) to reflect the wider scope. In particular, changed the FunctionIndex* data structures to ModuleIndex*, and renamed related variables and comments. Also renamed the files to reflect the changes. A companion clang patch will immediately succeed this patch to reflect this renaming. llvm-svn: 263490	2016-03-14 21:05:56 +00:00
Keno Fischer	71b5a81e77	[JITEventListener] Fix some -Wundef warnings Summary: This check was added in rL152620, and has started causing downstream warnings in Julia: ``` In file included from /home/tkelman/Julia/julia-0.5/src/codegen.cpp:22:0: /home/tkelman/Julia/julia-0.5/usr/include/llvm/ExecutionEngine/JITEventListener.h:84:5: warning: "LLVM_USE_INTEL_JITEVENTS" is not defined [-Wundef] #if LLVM_USE_INTEL_JITEVENTS ^ /home/tkelman/Julia/julia-0.5/usr/include/llvm/ExecutionEngine/JITEventListener.h💯5: warning: "LLVM_USE_OPROFILE" is not defined [-Wundef] #if LLVM_USE_OPROFILE ^ ``` Patch by Tony Kelman. Reviewers: loladiro Differential Revision: http://reviews.llvm.org/D17254 llvm-svn: 263487	2016-03-14 20:49:06 +00:00
Adam Nemet	bb45810e4f	Revert "Turn LoopLoadElimination on again" This reverts commit r263472. There is an LNT failure on clang-ppc64be-linux-lnt. Turn this off, while I am investigating. llvm-svn: 263485	2016-03-14 20:38:55 +00:00
Sanjay Patel	ee52b6e77d	allow branch weight metadata on select instructions (PR26636) As noted in: https://llvm.org/bugs/show_bug.cgi?id=26636 This doesn't accomplish anything on its own. It's the first step towards preserving and using branch weights with selects. The next step would be to make sure we're propagating the info in all of the other places where we create selects (SimplifyCFG, InstCombine, etc). I don't think there's an easy fix to make this happen; we have to look at each transform individually to determine how to correctly propagate the weights. Along with that step, we need to then use the weights when making subsequent transform decisions such as discussed in http://reviews.llvm.org/D16836. The inliner test is independent but closely related. It verifies that metadata is preserved when both branches and selects are cloned. Differential Revision: http://reviews.llvm.org/D18133 llvm-svn: 263482	2016-03-14 20:18:59 +00:00
Justin Lebar	9d94397859	[attrs] Handle convergent CallSites. Summary: Previously we had a notion of convergent functions but not of convergent calls. This is insufficient to correctly analyze calls where the target is unknown, e.g. indirect calls. Now a call is convergent if it targets a known-convergent function, or if it's explicitly marked as convergent. As usual, we can remove convergent where we can prove that no convergent operations are performed in the call. Originally landed as r261544, then reverted in r261544 for (incidental) build breakage. Re-landed here with no changes. Reviewers: chandlerc, jingyue Subscribers: llvm-commits, tra, jhen, hfinkel Differential Revision: http://reviews.llvm.org/D17739 llvm-svn: 263481	2016-03-14 20:18:54 +00:00
Justin Lebar	ca35b090f8	[docs] "Straightforward" is one word. llvm-svn: 263480	2016-03-14 20:18:51 +00:00
Justin Lebar	20adbdd2c0	[docs] Fix typo in docs/CodeGenerator.rst. llvm-svn: 263479	2016-03-14 20:17:08 +00:00
Ulrich Weigand	52aa7fba3f	[SystemZ] Add missing isBranch flags to certain instruction Some instructions were missing isBranch, isCall, or isTerminator flags. This didn't really affect code generation since most of the affected patterns were used only for the AsmParser and/or disassembler. However, it could affect tools using the MC layer to disassemble and parse binary code (e.g. via MCInstrDesc::mayAffectControlFlow). llvm-svn: 263478	2016-03-14 20:16:30 +00:00
Keno Fischer	a91ae8336b	[SLPVectorizer] Fix dependency list Summary: DemandedBits was added to the requirements of SLPVectorizer in rL261212 (and various earlier version of it), but the appropriate initialization statement was accidentally forgotten. Ref [[ https://github.com/JuliaLang/julia/issues/14998 \| JuliaLang/julia#14998 ]]. Patch by Yichao Yu. Reviewers: mssimpso Differential Revision: http://reviews.llvm.org/D18152 llvm-svn: 263476	2016-03-14 20:04:24 +00:00
Aaron Ballman	be57dfa75d	Reverting r263474; it turns out there are uses of the public interface, but MSVC wasn't caring about them. http://lab.llvm.org:8080/green//job/clang-stage1-cmake-RA-incremental_build/21475/consoleFull#-45876453249ba4694-19c4-4d7e-bec5-911270d8a58c http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/8879 llvm-svn: 263475	2016-03-14 20:03:15 +00:00
Aaron Ballman	1a57daf314	The file header of TrailingObjects.h suggests that TrailingObjects be inherited privately because the public APIs it exposes are not meant as part of the interface. There's no functional change because nothing currently relied on these being inherited publicly. llvm-svn: 263474	2016-03-14 19:57:48 +00:00
Adam Nemet	5a19ae917b	Turn LoopLoadElimination on again The two issues that were discovered got fixed (r263058, r263173). The pass can be disabled with -mllvm -enable-loop-load-elim=0 llvm-svn: 263472	2016-03-14 19:40:25 +00:00
Michael Kuperstein	b7860fedd4	[AliasSetTracker] Do not strip pointer casts when processing MemSetInst This fixes PR26843. llvm-svn: 263462	2016-03-14 18:34:29 +00:00
Chad Rosier	27c352d26d	[AArch64] Refactor AArch64FrameLowering::emitPrologue. NFC. http://reviews.llvm.org/D18125 Patch by Aditya Kumar. llvm-svn: 263461	2016-03-14 18:24:34 +00:00
Quentin Colombet	40ce25b68b	[SpillPlacement] Fix a quadratic behavior in spill placement. The bad behavior happens when we have a function with a long linear chain of basic blocks, and have a live range spanning most of this chain, but with very few uses. Let say we have only 2 uses. The Hopfield network is only seeded with two active blocks where the uses are, and each iteration of the outer loop in `RAGreedy::growRegion()` only adds two new nodes to the network due to the completely linear shape of the CFG. Meanwhile, `SpillPlacer->iterate()` visits the whole set of discovered nodes, which adds up to a quadratic algorithm. This is an historical accident effect from r129188. When the Hopfield network is expanding, most of the action is happening on the frontier where new nodes are being added. The internal nodes in the network are not likely to be flip-flopping much, or they will at least settle down very quickly. This means that while `SpillPlacer->iterate()` is recomputing all the nodes in the network, it is probably only the two frontier nodes that are changing their output. Instead of recomputing the whole network on each iteration, we can maintain a SparseSet of nodes that need to be updated: - `SpillPlacement::activate()` adds the node to the todo list. - When a node changes value (i.e., `update()` returns true), its neighbors are added to the todo list. - `SpillPlacement::iterate()` only updates the nodes in the list. The result of Hopfield iterations is not necessarily exact. It should converge to a local minimum, but there is no guarantee that it will find a global minimum. It is possible that updating nodes in a different order will cause us to switch to a different local minimum. In other words, this is not NFC, but although I saw a few runtime improvements and regressions when I benchmarked this change, those were side effects and actually the performance change is in the noise as expected. Huge thanks to Jakob Stoklund Olesen <stoklund@2pi.dk> for his feedbacks, guidance and time for the review. llvm-svn: 263460	2016-03-14 18:21:25 +00:00
Chad Rosier	6d98655070	[AArch64] Break the dependency between FP and SP when possible. When the SP in not changed because of realignment/VLAs etc., we restore the SP by using the previous value of SP and not the FP. Breaking the dependency will help in cases when the epilog of a callee is close to the epilog of the caller; for then "sub sp, fp, #" depends on the load restoring the FP in the epilog of the callee. http://reviews.llvm.org/D18060 Patch by Aditya Kumar and Evandro Menezes. llvm-svn: 263458	2016-03-14 18:17:41 +00:00
Quentin Colombet	de89d5f81f	[ADT] Add a pop_back_val method to the SparseSet container. The next commit will use it. llvm-svn: 263455	2016-03-14 18:10:41 +00:00
Chad Rosier	7a21bb196b	[Mips] Fix -Wunused-private-field warning after r263444. llvm-svn: 263454	2016-03-14 18:10:20 +00:00
Sanjay Patel	7506852709	[DAG] use !isUndef() ; NFCI llvm-svn: 263453	2016-03-14 18:09:43 +00:00
Sanjay Patel	5719584129	[DAG] use isUndef() ; NFCI llvm-svn: 263448	2016-03-14 17:28:46 +00:00
Tom Stellard	331f981cc9	AMDGPU/SI: Handle wait states required for DPP instructions Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17543 llvm-svn: 263447	2016-03-14 17:05:56 +00:00
Sanjay Patel	62d707c8d9	[x86, AVX] replace masked load with full vector load when possible Converting masked vector loads to regular vector loads for x86 AVX should always be a win. I raised the legality issue of reading the extra memory bytes on llvm-dev. I did not see any objections. 1. x86 already does this kind of optimization for multiple scalar loads -> vector load. 2. If other targets have the same flexibility, we could move this transform up to CGP or DAGCombiner. Differential Revision: http://reviews.llvm.org/D18094 llvm-svn: 263446	2016-03-14 16:54:43 +00:00
Daniel Sanders	e8efff373a	[mips] MIPS32R6 compact branch support Summary: MIPSR6 introduces a class of branches called compact branches. Unlike the traditional MIPS branches which have a delay slot, compact branches do not have a delay slot. The instruction following the compact branch is only executed if the branch is not taken and must not be a branch. It works by generating compact branches for MIPS32R6 when the delay slot filler cannot fill a delay slot. Then, inspecting the generated code for forbidden slot hazards (a compact branch with an adjacent branch or other CTI) and inserting nops to clear this hazard. Patch by Simon Dardis. Reviewers: vkalintiris, dsanders Subscribers: MatzeB, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D16353 llvm-svn: 263444	2016-03-14 16:24:05 +00:00
Marek Olsak	ed2213e6ef	AMDGPU/SI: Incomplete shader binaries need to finish execution at the end Reviewers: tstellarAMD, arsenm Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D18058 llvm-svn: 263441	2016-03-14 15:57:14 +00:00
Nicolai Haehnle	74127fe8d7	AMDGPU: mark llvm.amdgcn.image.atomic.* as a source of divergence Summary: When multiple threads perform an atomic op with the same arguments, they will usually see different return values. Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18101 llvm-svn: 263440	2016-03-14 15:37:18 +00:00
Vasileios Kalintiris	42db3ff47f	[mips] Use range-based for loops. NFC. llvm-svn: 263438	2016-03-14 15:05:30 +00:00
Benjamin Kramer	1082fa66a5	Revert "Recommitted r261633 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26715 at r263379." This reverts commit r263424. Breaks self-host. llvm-svn: 263437	2016-03-14 14:58:36 +00:00
Ulrich Weigand	cdce026b4d	[SystemZ] Avoid LER on z13 due to partial register dependencies On the z13, it turns out to be more efficient to access a full floating-point register than just the upper half (as done e.g. by the LE and LER instructions). Current code already takes this into account when loading from memory by using the LDE instruction in place of LE. However, we still generate LER, which shows the same performance issues as LE in certain circumstances. This patch changes the back-end to emit LDR instead of LER to implement FP32 register-to-register copies on z13. llvm-svn: 263431	2016-03-14 13:50:03 +00:00
Chad Rosier	00bd82cade	[CVP] Replace nonnegative with positive, per Philip's request. NFC. llvm-svn: 263430	2016-03-14 13:48:00 +00:00
Zlatko Buljan	fba68931ed	[mips] Fix an issue with long double when function roundl is defined Differential Revision: http://reviews.llvm.org/D17760 llvm-svn: 263428	2016-03-14 12:50:23 +00:00
Daniel Sanders	127d2d2b46	[mips] Range check uimm16_64 Summary: Reviewers: vkalintiris Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D17725 llvm-svn: 263427	2016-03-14 12:44:44 +00:00
Amjad Aboud	ab0378b16c	Recommitted r261633 "Supporting all entities declared in lexical scope in LLVM debug info." After fixing PR26715 at r263379. llvm-svn: 263424	2016-03-14 12:03:20 +00:00
Daniel Sanders	cfa3483c8e	[mips] Simplify ordering of range checked immediate classes. Summary: With the addition of checks to ensure that operands have a strict ordering it has become tricky to manage the order in the way I originally intended. This patch linearizes the ordering which simplifies the implementation but requires an order that is arbitrary in places. Here are some examples: * uimm4 < uimm5 < uimm6 * simm4 < uimm4 < simm5 < uimm5 * uimm5 < uimm5_plus1 (1..32) < uimm5_plus32 (32..63) < uimm6 The term 'superset' starts to break down here since the _plus classes are not true supersets of uimm5 (but they are still subsets of uimm6). * uimm5 < uimm5_64, and uimm5 < vsplat_uimm5 This is entirely arbitrary. We need an ordering and what we pick is unimportant since only one is possible for a given mnemonic. Reviewers: vkalintiris Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D17723 llvm-svn: 263423	2016-03-14 11:46:30 +00:00
Nikolay Haustov	79af6b33e0	[AMDGPU] Assembler: SOP* instruction fixes s_bitset0_b64, s_bitset1_b64 has 32-bit src0, not 64-bit. s_rfe_b64 has just one destination operand and no source. Uncomment S_BITCMP* and S_SETVSKIP, adjust SOPC_* classes for that. Add s_memrealtime test and change comments in smem.s to follow common style. Change test for s_memtime to use non-zero register to make it really test encoding. Add tests for s_buffer_load*. Add tests for SOPC instructions (same for SI and VI) Differential Revision: http://reviews.llvm.org/D18040 llvm-svn: 263420	2016-03-14 11:17:19 +00:00
Daniel Sanders	19b7f76afa	[mips] Range check uimm6_lsl2. Summary: Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D17291 llvm-svn: 263419	2016-03-14 11:16:56 +00:00
Hans Wennborg	369ebfe4c9	Try to fix build of WebAssemblyRegStackify.cpp on Windows It's failing to build on VS2015 with: C:\b\build\slave\ClangToTWin\build\src\third_party\llvm\lib\Target\WebAssembly\WebAssemblyRegStackify.cpp(520): error C2668: 'llvm::make_reverse_iterator': ambiguous call to overloaded function C:\b\build\slave\ClangToTWin\build\src\third_party\llvm\include\llvm/ADT/STLExtras.h(217): note: could be 'std::reverse_iterator<llvm::MachineBasicBlock::iterator> llvm::make_reverse_iterator<llvm::MachineInstrBundleIterator<llvm::MachineInstr>>(IteratorTy)' with [ IteratorTy=llvm::MachineInstrBundleIterator<llvm::MachineInstr> ] C:\b\depot_tools\win_toolchain\vs_files\391bbf1220d3edcd3cc3fccdb56224181e3b13a7\win_sdk\bin\..\..\VC\include\xutility(1217): note: or 'std::reverse_iterator<llvm::MachineBasicBlock::iterator> std::make_reverse_iterator<llvm::MachineInstrBundleIterator<llvm::MachineInstr>>(_RanIt)' [found using argument-dependent lookup] with [ _RanIt=llvm::MachineInstrBundleIterator<llvm::MachineInstr> ] I don't have VS2015 locally at the moment, but hopefully this will help. llvm-svn: 263418	2016-03-14 11:04:15 +00:00
Igor Breger	a949100532	AVX512: icmp operation should be always lowered to CMPM (AVX-512) instruction on SKX. implemented by delena Differential Revision: http://reviews.llvm.org/D18054 llvm-svn: 263417	2016-03-14 10:26:39 +00:00
Valery Pykhtin	0f97f17152	[AMDGPU] AsmParser: Factor out parseRegister. NFC. llvm-svn: 263411	2016-03-14 07:43:42 +00:00
Valery Pykhtin	9e33c7f5d3	[AMDGPU] AsmParser: refactor post push_back vector access. NFC. llvm-svn: 263409	2016-03-14 05:25:44 +00:00
David Majnemer	b9456a5eb3	[CodeView] Consistently handle overly large symbol names Overly large symbol names weren't correctly handled for leaf function records. llvm-svn: 263408	2016-03-14 05:15:09 +00:00
Valery Pykhtin	f91911c3ae	[AMDGPU] AsmParser: remove redundant isReg checks. NFC. llvm-svn: 263407	2016-03-14 05:01:45 +00:00
Haicheng Wu	d60ae33d29	[CVP] Convert an SDiv to a UDiv if both operands are known to be nonnegative The motivating example is this for (j = n; j > 1; j = i) { i = j / 2; } The signed division is safely to be changed to an unsigned division (j is known to be larger than 1 from the loop guard) and later turned into a single shift without considering the sign bit. llvm-svn: 263406	2016-03-14 03:24:28 +00:00
Amaury Sechet	7b05a4c2cb	Add facility to add/remove/check attribute on function and arguments. Summary: This comes from work to make attribute manipulable via the C API. Reviewers: gottesmm, hfinkel, baldrick, echristo, tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18128 llvm-svn: 263404	2016-03-14 01:37:29 +00:00
Junmo Park	917cceb6fa	[MCSchedule] Remove comments about MinLatency. NFC Summary: There is no definition about MinLatency any more. Reviewers: mcrosier, spatel, hfinkel Differential Revision: http://reviews.llvm.org/D18079 llvm-svn: 263403	2016-03-14 00:36:19 +00:00
Simon Pilgrim	9b7aaafc6a	[X86][XOP] Added target shuffle combine tests for XOP's VPPERM 2-op shuffle Actual combing support will be added in a future patch llvm-svn: 263402	2016-03-14 00:18:26 +00:00
David Blaikie	a757f76681	Remove some unused variables llvm-svn: 263396	2016-03-13 22:00:18 +00:00
Mehdi Amini	ba9fba81d6	Remove PreserveNames template parameter from IRBuilder This reapplies r263258, which was reverted in r263321 because of issues on Clang side. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263393	2016-03-13 21:05:13 +00:00
Simon Pilgrim	5f1326f8cc	[X86][SSE] Added truncated vector arithmetic tests. For cases where we are truncating an integer vector arithmetic result, it may be better to pre-truncate the input operands - no code to support this yet (scalar is done with SimplifyDemandedBits but adding vector support could be a lot of work) but these tests represent the current codegen status. Example bugs: PR14666, PR22703 llvm-svn: 263384	2016-03-13 19:08:01 +00:00
Simon Pilgrim	035b19ecf5	[X86][SSE41] Avoid variable blend for constant v8i16 shifts The SSE41 v8i16 shift lowering using (v)pblendvb is great for non-constant shift amounts, but if it is constant then we can efficiently reduce the VSELECT to shuffles with the pre-SSE41 lowering. llvm-svn: 263383	2016-03-13 18:35:59 +00:00
Amjad Aboud	62f6f5cc80	Fixed DIBuilder to verify that same imported entity will not be added twice to the "imports" list of the DICompileUnit. Differential Revision: http://reviews.llvm.org/D17884 llvm-svn: 263379	2016-03-13 11:11:39 +00:00
David Majnemer	1256125fb7	[CodeView] Truncate display names Fundamentally, the length of a variable or function name is bound by the maximum size of a record: 0xffff. However, the name doesn't live in a vacuum; other data is associated with the name, lowering the bound further. We would naively attempt to emit the name, causing us to assert because the record would no-longer fit in 16-bits. Instead, truncate the name but preserve as much as we can. While I have tested this locally, I've decided to not commit it due to the test's size. N.B. While this behavior is undesirable, it is better than MSVC's behavior. They seem to truncate to ~4000 characters. llvm-svn: 263378	2016-03-13 10:53:30 +00:00
David Majnemer	90a021fb02	[Bitcode] Make writeComdats less strange It had a weird artificial limitation on the write side: the comdat name couldn't be bigger than 2**16. However, the reader had no such limitation. Make the reader and the writer agree. llvm-svn: 263377	2016-03-13 08:01:03 +00:00
Fiona Glaser	2e5c0c2858	ConstantFoldInstruction: avoid wasted calls to ConstantFoldConstantExpression Check to see if all operands are constant before calling simplify on them so that we don't perform wasted simplifications. llvm-svn: 263374	2016-03-13 05:36:15 +00:00
Matt Arsenault	cb5af451b3	Fix build llvm-svn: 263372	2016-03-13 05:22:08 +00:00
Matt Arsenault	69fdf9b2e4	APFloat: Fix ilogb for denormals llvm-svn: 263370	2016-03-13 05:12:32 +00:00
Matt Arsenault	afa31cf4cc	APFloat: Fix scalbn handling of denormals This was incorrect for denormals, and also failed on longer exponent ranges. llvm-svn: 263369	2016-03-13 05:11:51 +00:00
Rui Ueyama	cb1d961385	Define IsRela static const member to Elf_Rel type. So that we can write RelTy::IsRela to query its type. llvm-svn: 263367	2016-03-13 04:55:44 +00:00
Craig Topper	955308fbee	[X86] Remove many operands that represent memory stores from outs to ins. These operands are the registers and immediates that specify the memory address not the memory itself thus they are inputs. llvm-svn: 263354	2016-03-13 02:56:31 +00:00
Amaury Sechet	b325686764	Add echo test for constant data arrays in the LLVM C API llvm-svn: 263350	2016-03-13 00:58:25 +00:00
Amaury Sechet	006ce6327e	Use templated version of unwrap instead of cats in the Core.cpp. NFC llvm-svn: 263349	2016-03-13 00:54:40 +00:00
Amaury Sechet	c78768f17d	Move LLVMConstStructInContext so that declarationa nd definition order match. NFC llvm-svn: 263348	2016-03-13 00:40:12 +00:00
Sanjay Patel	610da4fbaf	update test to use FileCheck llvm-svn: 263347	2016-03-12 21:09:26 +00:00
Sanjay Patel	9da9c76627	fix documentation comments; NFC llvm-svn: 263346	2016-03-12 20:44:58 +00:00
Sanjay Patel	17c2690146	fix documentation comments; NFC llvm-svn: 263345	2016-03-12 20:44:30 +00:00
Sanjay Patel	5658b58936	remove unnecessary cast; NFC llvm-svn: 263343	2016-03-12 18:17:41 +00:00
Sanjay Patel	2e0027706a	fix formatting; NFC llvm-svn: 263342	2016-03-12 18:05:53 +00:00
Sanjay Patel	5781d840dd	use range loops; NFCI llvm-svn: 263341	2016-03-12 16:52:17 +00:00
Sanjay Patel	c4acbae63f	[x86, InstCombine] delete x86 SSE2 masked store with zero mask This follows up on the related AVX instruction transforms, but this one is too strange to do anything more with. Intel's behavioral description of this instruction in its Software Developer's Manual is tragi-comic. llvm-svn: 263340	2016-03-12 15:16:59 +00:00
Nemanja Ivanovic	bd56e4e25a	Fix for PR 26378 This patch corresponds to review: http://reviews.llvm.org/D17712 We were not clearing the TOC vector in PPCAsmPrinter when initializing it. This caused duplicate definition asserts when the pass is reused on the module (i.e. with -compile-twice or in JIT contexts). llvm-svn: 263338	2016-03-12 10:23:07 +00:00
Teresa Johnson	8fc20ade65	Use default destructor and remove unnecessary virtual destructor Only the virtual destructor in the base class is needed, and can use the default. llvm-svn: 263335	2016-03-12 05:38:22 +00:00
Kostya Serebryany	241fb61fdb	[libFuzzer] refresh docs more llvm-svn: 263332	2016-03-12 03:23:02 +00:00
Kostya Serebryany	7ead926582	[libFuzzer] refresh docs more llvm-svn: 263331	2016-03-12 03:11:27 +00:00
Kostya Serebryany	abca88e481	[libFuzzer] refresh docs more llvm-svn: 263330	2016-03-12 03:05:37 +00:00
Chandler Carruth	4a543cc7a1	[lit] Hack lit to allow a test suite to request that it is run "early". This lets us for example start running the unit test suite early. For 'check-llvm' on my machine, this drops the tim e from 44s to 32s!!!!! It's pretty ugly. I barely know how to write Python, so feel free to just tell me how I should write it instead. =D Thanks to Filipe and others for help. Differential Revision: http://reviews.llvm.org/D18089 llvm-svn: 263329	2016-03-12 03:03:31 +00:00
Kostya Serebryany	d11dc176b1	[libFuzzer] refresh docs llvm-svn: 263328	2016-03-12 02:56:25 +00:00
Sanjoy Das	ecf96c9516	Make gc relocates more strongly typed; NFC Don't use a `Value ` where we can use a stronger `GCRelocateInst ` type. llvm-svn: 263327	2016-03-12 02:54:27 +00:00
Quentin Colombet	cf9732b417	[X86] Make sure we do not clobber RBX with cmpxchg when used as a base pointer. cmpxchg[8\|16]b uses RBX as one of its argument. In other words, using this instruction clobbers RBX as it is defined to hold one the input. When the backend uses dynamically allocated stack, RBX is used as a reserved register for the base pointer. Reserved registers have special semantic that only the target understands and enforces, because of that, the register allocator don’t use them, but also, don’t try to make sure they are used properly (remember it does not know how they are supposed to be used). Therefore, when RBX is used as a reserved register but defined by something that is not compatible with that use, the register allocator will not fix the surrounding code to make sure it gets saved and restored properly around the broken code. This is the responsibility of the target to do the right thing with its reserved register. To fix that, when the base pointer needs to be preserved, we use a different pseudo instruction for cmpxchg that save rbx. That pseudo takes two more arguments than the regular instruction: - One is the value to be copied into RBX to set the proper value for the comparison. - The other is the virtual register holding the save of the value of RBX as the base pointer. This saving is done as part of isel (i.e., we emit a copy from rbx). cmpxchg_save_rbx <regular cmpxchg args>, input_for_rbx_reg, save_of_rbx_as_bp This gets expanded into: rbx = copy input_for_rbx_reg cmpxchg <regular cmpxchg args> rbx = save_of_rbx_as_bp Note: The actual modeling of the pseudo is a bit more complicated to make sure the interferes that appears after the pseudo gets expanded are properly modeled before that expansion. This fixes PR26883. llvm-svn: 263325	2016-03-12 02:25:27 +00:00
Kostya Serebryany	64d24578d8	[libFuzzer] try to use max_len based on the items of the corpus instead of blindly defaulting to 64 bytes. llvm-svn: 263323	2016-03-12 01:57:04 +00:00
Eric Christopher	35abd051c0	Temporarily revert: commit ae14bf6488e8441f0f6d74f00455555f6f3943ac Author: Mehdi Amini <mehdi.amini@apple.com> Date: Fri Mar 11 17:15:50 2016 +0000 Remove PreserveNames template parameter from IRBuilder Summary: Following r263086, we are now relying on a flag on the Context to discard Value names in release builds. Reviewers: chandlerc Subscribers: mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D18023 From: Mehdi Amini <mehdi.amini@apple.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@263258 91177308-0d34-0410-b5e6-96231b3b80d8 until we can figure out what to do about clang and Release build testing. This reverts commit 263258. llvm-svn: 263321	2016-03-12 01:47:22 +00:00
Eric Christopher	b3965fef56	Add Lanai.def to the module map. llvm-svn: 263319	2016-03-12 01:31:53 +00:00
Michael Zolotukhin	789ac4531c	[LoopUnroll] Convert some existing tests to unit-tests. Summary: As we now have unit-tests for UnrollAnalyzer, we can convert some existing tests to this format. It should make the tests more robust. Reviewers: chandlerc, sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17904 llvm-svn: 263318	2016-03-12 01:28:56 +00:00
Teresa Johnson	071509c22a	Fix a memory leak due to missing virtual destructors Caused a couple of sanitizer bot failures in ThinLTO tests due to r263275. llvm-svn: 263317	2016-03-12 01:23:57 +00:00
Wilfred Hughes	73a0dac96d	Updating source languages entry in FAQ. Dragonegg is no longer actively maintained[1], and the Pypy team is not actively pursuing LLVM[2]. 1: http://reviews.llvm.org/D9331 2: http://rpython.readthedocs.org/en/latest/faq.html#could-we-use-llvm llvm-svn: 263314	2016-03-12 00:43:26 +00:00
Chris Matthews	7d0ec128c9	Fix the docs I broke llvm-svn: 263309	2016-03-11 23:31:02 +00:00
Mike Aizatsky	037fee594b	[sancov] using md5 for anchors in attempt to reduce file size. Differential Revision: http://reviews.llvm.org/D18102 llvm-svn: 263308	2016-03-11 23:28:28 +00:00
Mike Aizatsky	22973fd00e	Don't crash sancov if file is unreadable. Summary: Caller can provides the list of .so files where some files are unreadable (e.g linux-vdso.so.1). It's more convenient to handler this in sancov with warning then making all callers to check files. Reviewers: aizatsky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18103 llvm-svn: 263307	2016-03-11 23:26:41 +00:00
Chris Matthews	5c605d7943	Extend test-suite docs to describe how to run test-suite with cmake+lit llvm-svn: 263305	2016-03-11 22:33:36 +00:00
Mehdi Amini	33661070c5	Minor cleanup and documentation to IRMover (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263304	2016-03-11 22:19:06 +00:00
Simon Pilgrim	33d57c7547	[X86][SSE] Simplify vector LOAD + EXTEND on pre-SSE41 hardware Improve vector extension of vectors on hardware without dedicated VSEXT/VZEXT instructions. We already convert these to SIGN_EXTEND_VECTOR_INREG/ZERO_EXTEND_VECTOR_INREG but can further improve this by using the legalizer instead of prematurely splitting into legal vectors in the combine as this only properly helps for lowering to VSEXT/VZEXT. Removes a lot of unnecessary any_extend + mask pattern - (Fix for PR25718). Differential Revision: http://reviews.llvm.org/D17932 llvm-svn: 263303	2016-03-11 22:18:05 +00:00
Ahmed Bougacha	171f7b9986	[AArch64] Don't blindly lower f16/f128 FCCMPs. Instead, extend f16 (like we do when lowering a standalone SETCC), and let f128 be legalized to the RT calls. Fixes PR26803. llvm-svn: 263301	2016-03-11 22:02:58 +00:00
Teresa Johnson	f2bc850ee8	Initialize CalleeInfo to fix bot after r263275 Hopefully will fix garbage output in http://lab.llvm.org:8011/builders/clang-x86-win2008-selfhost/builds/6901 llvm-svn: 263297	2016-03-11 21:34:45 +00:00
Rui Ueyama	a8b64fe64f	Define NT_GNU_BUILD_ID constant. llvm-svn: 263288	2016-03-11 20:20:25 +00:00
Dan Gohman	da323e88ea	[WebAssembly] Add `final` keywords to a few more subclasses, for consistency. llvm-svn: 263287	2016-03-11 19:45:37 +00:00
George Burgess IV	b42b762bca	[MemorySSA] Make a return type reflect reality. NFC. llvm-svn: 263286	2016-03-11 19:34:03 +00:00
Sanjoy Das	b51325dbdb	Introduce @llvm.experimental.deoptimize Summary: This intrinsic, together with deoptimization operand bundles, allow frontends to express transfer of control and frame-local state from one (typically more specialized, hence faster) version of a function into another (typically more generic, hence slower) version. In languages with a fully integrated managed runtime this intrinsic can be used to implement "uncommon trap" like functionality. In unmanaged languages like C and C++, this intrinsic can be used to represent the slow paths of specialized functions. Note: this change does not address how `@llvm.experimental_deoptimize` is lowered. That will be done in a later change. Reviewers: chandlerc, rnk, atrick, reames Subscribers: llvm-commits, kmod, mjacob, maksfb, mcrosier, JosephTremoulet Differential Revision: http://reviews.llvm.org/D17732 llvm-svn: 263281	2016-03-11 19:08:34 +00:00
Vedant Kumar	e5a9a275d3	[PGO] Skip value profile instrumentation of inline asm Value profile instrumentation treats inline asm calls like they are indirect calls. This causes problems when the 'Callee' is passed to a ptrtoint cast -- the verifier rightly claims that this is bogus and crashes opt. llvm-svn: 263278	2016-03-11 18:57:48 +00:00

... 3 4 5 6 7 ...

129002 Commits