llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Corringham	702fe45bcd	[AMDGPU] add __builtin_amdgcn_s_getpc Summary: Added the builtin corresponding to the s_getpc intrinsic added in llvm D32862 Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D33276 llvm-svn: 303861	2017-05-25 14:16:11 +00:00
Sanjay Patel	5150612012	[InstCombine] make icmp-mul fold more efficient There's probably a lot more like this (see also comments in D33338 about responsibility), but I suspect we don't usually get a visible manifestation. Given the recent interest in improving InstCombine efficiency, another potential micro-opt that could be repeated several times in this function: morph the existing icmp pred/operands instead of creating a new instruction. llvm-svn: 303860	2017-05-25 14:13:57 +00:00
Tim Corringham	32d0d38679	[AMDGPU] add intrinsic for s_getpc Summary: The s_getpc instruction is exposed as intrinsic llvm.amdgcn.s.getpc. Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D32862 llvm-svn: 303859	2017-05-25 14:04:14 +00:00
Oren Ben Simhon	7bf27f03f2	[X86] Adding vpopcntd and vpopcntq instructions AVX512_VPOPCNTDQ is a new feature set that was published by Intel. The patch represents the LLVM side of the addition of two new intrinsic based instructions (vpopcntd and vpopcntq). Differential Revision: https://reviews.llvm.org/D33169 llvm-svn: 303858	2017-05-25 13:45:23 +00:00
Oren Ben Simhon	140c1fb9ec	[X86] Adding avx512_vpopcntdq feature set and its intrinsics AVX512_VPOPCNTDQ is a new feature set that was published by Intel. The patch represents the Clang side of the addition of six intrinsics for two new machine instructions (vpopcntd and vpopcntq). It also includes the addition of the new feature set. Differential Revision: https://reviews.llvm.org/D33170 llvm-svn: 303857	2017-05-25 13:44:11 +00:00
Marshall Clow	1d02996d28	Make for_each_n only available on C++17 llvm-svn: 303856	2017-05-25 13:40:57 +00:00
James Molloy	dc2d64bc35	[GVNSink] Pacify MSVC Don't convert an unsigned to a pointer for a sentinel, use a size_t instead. llvm-svn: 303855	2017-05-25 13:14:10 +00:00
Pavel Labath	e8cd2cca91	Revert "Fix FDE indexing while scan debug_info section." This reverts commit r303847 as it introduces a number of regressions. Investigation has shown that we are parsing the CIE entries in the debug_frame section incorrectly -- we are parsing them the same way as eh_frame, but the entries in debug_frame have a couple of extra entries which have not been taken into account. llvm-svn: 303854	2017-05-25 13:13:12 +00:00
James Molloy	2a237f19f1	[GVNSink] Don't define operator<< in NDEBUG Without debug macros enabled, the raw_ostream operator<< overload is unused. llvm-svn: 303852	2017-05-25 13:11:18 +00:00
Krzysztof Parzyszek	5960a57ef7	[CodeGen] Pessimize aliasing for member unions (and may-alias) objects Use the TBAA info of the omnipotent char for these objects. Differential Revision: https://reviews.llvm.org/D33328 llvm-svn: 303851	2017-05-25 12:55:47 +00:00
James Molloy	a929063233	[GVNSink] GVNSink pass This patch provides an initial prototype for a pass that sinks instructions based on GVN information, similar to GVNHoist. It is not yet ready for committing but I've uploaded it to gather some initial thoughts. This pass attempts to sink instructions into successors, reducing static instruction count and enabling if-conversion. We use a variant of global value numbering to decide what can be sunk. Consider: [ %a1 = add i32 %b, 1 ] [ %c1 = add i32 %d, 1 ] [ %a2 = xor i32 %a1, 1 ] [ %c2 = xor i32 %c1, 1 ] \ / [ %e = phi i32 %a2, %c2 ] [ add i32 %e, 4 ] GVN would number %a1 and %c1 differently because they compute different results - the VN of an instruction is a function of its opcode and the transitive closure of its operands. This is the key property for hoisting and CSE. What we want when sinking, however, is a numbering that is a function of the uses of an instruction, which allows us to answer the question "if I replace %a1 with %c1, will it contribute in an equivalent way to all successive instructions?". The (new) PostValueTable class in GVN provides this mapping. This pass has already shown some really impressive improvements, especially for codesize, on internal benchmarks, so I have high hopes it can replace all the sinking logic in SimplifyCFG. Differential revision: https://reviews.llvm.org/D24805 llvm-svn: 303850	2017-05-25 12:51:11 +00:00
Florian Gross	7ce3a83c50	(no commit message) llvm-svn: 303849	2017-05-25 11:43:06 +00:00
Pavel Labath	45dde23756	Recommit "RunThreadPlan: Fix halting logic in IgnoreBreakpoints = false" This is a resubmit of r303732, which was reverted due to a regression. The original patch caused a regression in TestLoadUnload, which has only showed up when running the remote test suite. The problem there was that we interrupted the target just as it has hit the rendezvous breakpoint in the dlopen call. This meant that the stop reason was set to "breakpoint" even though the event would not have been broadcast if we had not stopped the process. I fix this by checking StopInfo->ShouldNotify() before stopping. I also add a new test for the handling of conditional breakpoints in expressions, which I noticed to be broken (pr33164) Differential Revision: https://reviews.llvm.org/D33283 llvm-svn: 303848	2017-05-25 10:50:06 +00:00
Hafiz Abid Qadeer	0b5d6e5d0e	Fix FDE indexing while scan debug_info section. There are some differences between eh_frame and debug_frame formats that are not considered by DWARFCallFrameInfo::GetFDEIndex. An FDE entry contains CIE_pointer in debug_frame in the same place as cie_id in eh_frame. As described in the DWARF standard (section 6.4.1), CIE_pointer is an "offset into the .debug_frame section". So the variable cie_offset should be equal to cie_id for debug_frame. FDE entries with a zero CIE pointer (which is actually stored in the cie_id variable) shouldn't be ignored either. I have also added a small change that allows using the debug_info section when eh_frame is absent. This case can really occur on some platforms. Patch from tatyana-krasnukha. https://reviews.llvm.org/D33504 llvm-svn: 303847	2017-05-25 10:21:29 +00:00
Egor Churaev	1db4c88a9a	[OpenCL] reserve_id_t cannot be used as argument to kernel function Reviewers: Anastasia Reviewed By: Anastasia Subscribers: yaxunl, cfe-commits, bader Differential Revision: https://reviews.llvm.org/D33483 llvm-svn: 303846	2017-05-25 07:18:37 +00:00
Chandler Carruth	f4d62c480c	[PM] Teach the PGO instrumentation passes to run GlobalDCE before instrumenting code. This is important in the new pass manager. The old pass manager's inliner has a small DCE routine embedded within it. The new pass manager relies on the actual GlobalDCE pass for this. Without this patch, instrumentation profiling with the new PM results in massive code bloat in the object files because the instrumentation itself ends up preventing DCE from working to remove the code. We should probably change the instrumentation (and/or DCE) so that we can eliminate dead code even if instrumented, but we shouldn't even spend the time generating instrumentation for that code so this still seems like a good patch. Differential Revision: https://reviews.llvm.org/D33535 llvm-svn: 303845	2017-05-25 07:15:09 +00:00
Egor Churaev	c1e4611754	[OpenCL] Added regression test on invalid vector initialization. Summary: This patch increases code coverage. Reviewers: Anastasia Reviewed By: Anastasia Subscribers: cfe-commits, bader, yaxunl Differential Revision: https://reviews.llvm.org/D33489 llvm-svn: 303844	2017-05-25 06:55:02 +00:00
Chandler Carruth	dd2e275a47	[PM/Unswitch] Fix a bug in the domtree update logic for the new unswitch pass. The original logic only considered direct successors of the hoisted domtree nodes, but that isn't really enough. If there are other basic blocks that are completely within the subtree, their successors could just as easily be impacted by the hoisting. The more I think about it, the more I think the correct update here is to hoist every block on the dominance frontier which has an idom in the chain we hoist across. However, this is subtle enough that I'd definitely appreciate some more eyes on it. Sadly, if this is the correct algorithm, it requires computing a (highly localized) dominance frontier. I've done this in the simplest (IE, least code) way I could come up with, but that may be too naive. Suggestions welcome here, dominance update algorithms are not an area I've studied much, so I don't have strong opinions. In good news, with this patch, turning on simple unswitch passes the LLVM test suite for me with asserts enabled. Differential Revision: https://reviews.llvm.org/D32740 llvm-svn: 303843	2017-05-25 06:33:36 +00:00
Vitaly Buka	4974f108ac	[compiler-rt] Change default of allow_user_segv_handler to true Reviewers: eugenis Subscribers: srhines, kubamracek, llvm-commits Differential Revision: https://reviews.llvm.org/D32443 llvm-svn: 303842	2017-05-25 06:29:30 +00:00
Craig Topper	ae066a0d47	[MVT] Fix the indentation of the start of the MVT class. NFC llvm-svn: 303841	2017-05-25 06:15:05 +00:00
Craig Topper	37e46bfbf5	[SelectionDAG] Fix off by one in a compare in getOperationAction. If Op is equal to array_lengthof, the lookup would be out of bounds, but we were only checking for greater than. I suspect nothing ever passes in the equal value because it's a sentinel to mark the end of the builtin opcodes and not a real opcode. So really this fix is just so that the code looks right and makes sense. llvm-svn: 303840	2017-05-25 05:38:40 +00:00
Tobias Grosser	6e770813c9	Drop newline in docs builder to see if Polly docs are updated llvm-svn: 303839	2017-05-25 05:38:05 +00:00
Eric Fiselier	39b56d80a1	Remove <experimental/coroutine> from the module map for now. It doesn't work unless modules are enabled llvm-svn: 303838	2017-05-25 05:30:05 +00:00
Eric Fiselier	d791d4ea3c	Disable the coroutines tests until Clang bumps __cpp_coroutines to reflect recent changes llvm-svn: 303837	2017-05-25 05:11:40 +00:00
Eric Fiselier	3ca9185073	Add <experimental/coroutine> This patch adds the library portions of the coroutines PDTS, which should now be supported by Clang. llvm-svn: 303836	2017-05-25 04:36:24 +00:00
Eric Fiselier	c81c8cbe77	Fix broken links on C++1z status page llvm-svn: 303835	2017-05-25 04:09:07 +00:00
Chandler Carruth	29c22d2835	[LegacyPM] Make the 'addLoop' method accept a loop to add rather than having it internally allocate the loop. This is a much more flexible API and necessary in the new loop unswitch to reasonably support both new and old PMs in common code. It also just seems like a cleaner separation of concerns. NFC, this should just be a pure refactoring. Differential Revision: https://reviews.llvm.org/D33528 llvm-svn: 303834	2017-05-25 03:01:31 +00:00
Marshall Clow	d5c65ffa8d	Add non-parallel version of for_each_n (+tests) from the Parallelism TS llvm-svn: 303833	2017-05-25 02:29:54 +00:00
Jim Ingham	d2a7e8538b	Fix the warning when you pass -c to step/next/si/ni. During some cleanup the test for whether the thread plan accepted an iteration count was reversed, so we give a warning when it will actually work, and don't when it won't. <rdar://problem/32379280> llvm-svn: 303832	2017-05-25 02:24:18 +00:00
Eric Fiselier	da8f9b5b1b	[coroutines] Fix fallthrough diagnostics for coroutines Summary: This patch fixes a number of issues with the analysis warnings emitted when a coroutine may reach the end of the function w/o returning. * Fix bug where coroutines with `return_value` are incorrectly diagnosed as missing `co_return`'s. * Rework diagnostic message to no longer say "non-void coroutine", because that implies the coroutine doesn't have a void return type, which it might. In this case a non-void coroutine is one whose promise type does not contain `return_void()` As a side-effect of this patch, coroutine bodies that contain an invalid coroutine promise object are marked as invalid. Reviewers: GorNishanov, rsmith, aaron.ballman, majnemer Reviewed By: GorNishanov Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D33532 llvm-svn: 303831	2017-05-25 02:16:53 +00:00
Galina Kistanova	1754fee864	Fixed nondeterminism in RuleMatcher::emit. llvm-svn: 303829	2017-05-25 01:51:53 +00:00
Vitaly Buka	bf40f1b6dd	[libFuzzer] Don't replace custom signal handlers. Summary: This allows keeping handlers installed by sanitizers. In other cases third-party code can replace handlers after libFuzzer initialization anyway. Reviewers: kcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33522 llvm-svn: 303828	2017-05-25 01:43:13 +00:00
George Karpenkov	a1c532784d	Fix coverage check for full post-dominator basic blocks. Coverage instrumentation which does not instrument full post-dominators and full-dominators may skip valid paths, as the reasoning for skipping blocks may become circular. This patch fixes that, by only skipping full post-dominators with multiple predecessors, as such predecessors by definition can not be full-dominators. llvm-svn: 303827	2017-05-25 01:41:46 +00:00
Gor Nishanov	1fbc01f70f	[coroutines] CoroFrame.cpp conform to coding convention (s/repeat/Repeat) (NFC) llvm-svn: 303826	2017-05-25 01:07:10 +00:00
Gor Nishanov	0ea1863b27	[coroutines] Relocate instructions that maybe spilled after coro.begin Summary: Frontend generates store instructions after allocas, for example: ``` define i8* @f(i64 %this) "coroutine.presplit"="1" personality i32 0 { entry: %this.addr = alloca i64 store i64 %this, i64* %this.addr .. %hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc) ``` Such instructions may require spilling into coro.frame, but, coro-frame address is only available after coro.begin and thus needs to be moved after coro.begin. The only instructions that should not be moved are the arguments of coro.begin and all of their operands. Reviewers: GorNishanov, majnemer Reviewed By: GorNishanov Subscribers: llvm-commits, EricWF Differential Revision: https://reviews.llvm.org/D33527 llvm-svn: 303825	2017-05-25 00:46:20 +00:00
Marshall Clow	29b75d6986	Add some constexpr tests for optional's move/copy ctor llvm-svn: 303824	2017-05-25 00:22:33 +00:00
Kamil Rytarowski	269eec03d6	Correct compiler warnings and Debug build of the NetBSD target Correct files present only in the NetBSD build. llvm-svn: 303823	2017-05-24 23:59:50 +00:00
Tony Jiang	0a429f040e	[PowerPC] Fix a performance bug for PPC::XXSLDWI. There are some VectorShuffle Nodes in SDAG which can be selected to XXSLDWI instruction, this patch recognizes them and does the selection to improve the PPC performance. llvm-svn: 303822	2017-05-24 23:48:29 +00:00
Rafael Espindola	8b78185e00	Print symbols from COFF import libraries. This change allows llvm-nm to print symbols found in import libraries, in part by allowing COFFImportFiles to be cast to SymbolicFiles. Patch by Dave Lee! llvm-svn: 303821	2017-05-24 23:40:36 +00:00
Eugene Zelenko	75480cce12	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 303820	2017-05-24 23:10:29 +00:00
Gor Nishanov	1f72d75714	[coroutines] Allow rematerialization up to 4 times. Remove incorrect assert Reviewers: majnemer Subscribers: EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D33524 llvm-svn: 303819	2017-05-24 23:01:02 +00:00
Sanjay Patel	07b1ba54b5	[InstCombine] use m_APInt to allow icmp-mul-mul vector fold The swapped operands in the first test are a manifestation of an inefficiency for vectors that doesn't exist for scalars because the IRBuilder checks for an all-ones mask for scalars, but not vectors. llvm-svn: 303818	2017-05-24 22:58:17 +00:00
Jonathan Roelofs	3c8f953f61	Allow builds to set COMPILER_RT_OS_DIR differently from CMAKE_SYSTEM_NAME llvm-svn: 303817	2017-05-24 22:41:49 +00:00
Sanjay Patel	a8ac360a0c	[InstCombine] add tests for icmp eq (mul X, C), (mul Y, C); NFC llvm-svn: 303816	2017-05-24 22:36:14 +00:00
Rui Ueyama	0e8521c05a	Reduce indentation. NFC. llvm-svn: 303815	2017-05-24 22:36:11 +00:00
Rui Ueyama	9aa82f76ac	Garbage collect dllimported symbols. This is a different implementation than r303225 (which was reverted in r303270, re-submitted in r303304 and then re-reverted in r303527). In the previous patch, I tried to add Live bit to each dllimported symbol. It turned out that it didn't work with "oldnames.lib" which contains a lot of weak aliases to dllimported symbols. The way we handle weak aliases is to check if undefined symbols can be resolved using weak aliases, and if so, memcpy the Defined symbols to weak Undefined symbols, so that any references to weak aliases automatically see defined symbols instead of undefined ones. This memcpy happens before MarkLive kicks in. That means we may have multiple copies of dllimported symbols. So turning on one instance's Live bit is not enough. This patch moves the Live bit to dllimport file. Since multiple copies of dllsymbols still point to the same file, we can use it as the central repository to keep track of liveness. Differential Revision: https://reviews.llvm.org/D33520 llvm-svn: 303814	2017-05-24 22:30:06 +00:00
Tim Northover	9d891185ad	Revert "Sema: allow imaginary constants via GNU extension if UDL overloads not present." This reverts commit r303697. It broke libc++ tests that were specifically checking incompatibility in C++14 mode. llvm-svn: 303813	2017-05-24 22:18:35 +00:00
Rafael Espindola	a28414d7ec	Simplify MipsRldMapSection::writeTo. It is not clear why a synthetic section wants to use padding defined in the linker script. The padding is for the space between sections. It was also missing a test. llvm-svn: 303812	2017-05-24 22:04:32 +00:00
Hans Wennborg	0eec1f0b96	Fix negate-overflow.cpp test on Windows after r303440 lit would interpret the exit code as failure. llvm-svn: 303809	2017-05-24 21:52:40 +00:00
Sanjay Patel	3e8935bdc5	[InstCombine] move tests and use FileCheck; NFC llvm-svn: 303808	2017-05-24 21:48:25 +00:00

263214 Commits

All Branches