llvm-project

Commit Graph

Author	SHA1	Message	Date
Richard Smith	5d10289639	Factor out repeated code for deducing a non-type template parameter as a given argument value. No functionality change intended. llvm-svn: 290576	2016-12-27 03:59:58 +00:00
Craig Topper	6ad5bcc8ac	[AVX-512] Replace masked 512-bit pmuldq and pmuludq builtins with the newly added unmasked versions and selects. llvm-svn: 290575	2016-12-27 03:46:16 +00:00
Craig Topper	39b9e32493	foo llvm-svn: 290574	2016-12-27 03:46:13 +00:00
Craig Topper	89b3e0223f	[AVX-512] Add 512-bit unmasked intrinsics for pmuldq and pmuludq so we can add them to InstCombine with the 128 and 256 bit versions. The 128 and 256 bit masked intrinsics are currently unused by clang. The sse and avx2 unmasked intrinsics are used instead. The new 512-bit intrinsic will be used to do the same. Then all masked versions will removed and autoupgraded. llvm-svn: 290573	2016-12-27 03:46:05 +00:00
Chandler Carruth	03130d981c	[PM] Teach the inliner in the new PM to merge attributes after inlining. Also enable the new PM in the attributes test case which caught this issue. llvm-svn: 290572	2016-12-27 03:39:54 +00:00
Saleem Abdulrasool	c1861d86e1	test: filter out -fPIE/-pie on Windows Use some new substitutions to avoid duplicating the tests for just dropped flags. -fPIC/-fPIE/-fpic/-fpie do not make sense on Windows as they can cause ELF-style PIC. Substitute away the flag on Windows. This should repair the windows buildbots. llvm-svn: 290571	2016-12-27 03:16:20 +00:00
Chandler Carruth	62c8b81ea8	[Inliner] Modernize all of the inliner tests that were using grep. This mostly involved converting from grep to FileCheck and tidying up the IR used. In one case (invoke_test-3.ll) the test had become completely pointless as we use 'resume' rather than 'unwind' now, and even then it did not occur at the end of the line. llvm-svn: 290570	2016-12-27 02:47:37 +00:00
Saleem Abdulrasool	a5ad591918	Driver: switch Windows to static RelocModel Windows uses PE/COFF which is inherently position independent. The use of the PIC model is unnecessary. In fact, we would generate invalid code using the ELF PIC model when PIC was enabled previously. Now that we no longer accept -fPIC and -fpoc, this switches the internal representation to the static model to permit us to make PIC modules invalid when targeting Windows. This should not change the code generation, only the internal state management. llvm-svn: 290569	2016-12-27 02:20:35 +00:00
Vitaly Buka	b67bfd2fd7	[tsan] Fix string to bool conversion warnings in tsan tests llvm-svn: 290568	2016-12-27 02:05:24 +00:00
Richard Smith	01bfa68fec	Check and build conversion sequences for non-type template arguments in dependent contexts when processing the template in C++11 and C++14, just like we do in C++98 and C++1z. This allows us to diagnose invalid templates earlier. llvm-svn: 290567	2016-12-27 02:02:09 +00:00
Craig Topper	7f8540b5e7	[AVX-512][InstCombine] Teach InstCombine to turn masked scalar add/sub/mul/div with rounding intrinsics into normal IR operations if the rounding mode is CUR_DIRECTION. An earlier commit added support for unmasked scalar operations. At that time isel wouldn't generate an optimal sequence for masked operations, but that has now been fixed. llvm-svn: 290566	2016-12-27 01:56:30 +00:00
Craig Topper	a0439377e6	[InstCombine][AVX-512] Add masked scalar add/sub/mul/div intrinsic test cases that don't have a CUR_DIRECTION rounding mode. The CUR_DIRECTION case will be optimized in a future commit so this provides coverage for the other cases. llvm-svn: 290565	2016-12-27 01:56:27 +00:00
Craig Topper	83f2145c18	[AVX-512] Add isel patterns to turn native masked scalar add/sub/mul/div into masked instructions. llvm-svn: 290564	2016-12-27 01:56:24 +00:00
Craig Topper	5035b1212b	[AVX-512] Add tests to show missed opportunities for combining masking with scalar arithmetic operations. These particular sequences will be generated after a future change to teach InstCombine to turn masked scalar arithmetic intrinsics into native IR. llvm-svn: 290563	2016-12-27 01:56:22 +00:00
Chandler Carruth	0ee8bb11c3	[PM] Move the collection of call sites to a more appropriate place inside of `InlineFunction`. Prior to this, call instructions are specifically being rewritten and replaced within the inlined region, invalidating some of the call sites. Several of these regions are using the same technique to walk the inlined region so this seems clearly safe up to this point. I've also added a short circuit to the scan for call sites based on what other code is doing. With this, the most common crash I've found in the new inliner code is fixed. I've turned it on for another test case that covers this scenario. I'll make my way through most of the other inliner test cases just to get some easy coverage next. llvm-svn: 290562	2016-12-27 01:24:50 +00:00
Ekaterina Romanova	dffe45b3e6	[DOXYGEN] Improved doxygen comments for x86 intrinsics. Improved doxygen comments for the following intrinsics headers: __wmmintrin_pclmul.h, bmiintrin.h, emmintrin.h, f16cintrin.h, immintrin.h, mmintrin.h, pmmintrin.h, tmmintrin.h Added \n commands to insert a line breaks where necessary, since one long line of documentation is nearly unreadable. Formatted comments to fit into 80 chars. In some cases added \a command in front of the parameter names to display them in italics. llvm-svn: 290561	2016-12-27 00:49:38 +00:00
Chandler Carruth	88c4ffb4e0	[PM] The new pass manager requires a registered target for these, and given that they hard code specific triples that seems reasonable so add the REQUIRES. llvm-svn: 290560	2016-12-27 00:31:34 +00:00
Craig Topper	020b228155	[AVX-512][InstCombine] Teach InstCombine to turn packed add/sub/mul/div with rounding intrinsics into normal IR operations if the rounding mode is CUR_DIRECTION. llvm-svn: 290559	2016-12-27 00:23:16 +00:00
Chandler Carruth	6d1b83ef87	[PH] Teach the new PM code path to support -disable-llvm-passes. This is kind of funny because I specifically did work to make this easy and then it didn't actually get implemented. I've also ported a set of tests that rely on this functionality to run with the new PM as well as the old PM so that we don't mess this up in the future. llvm-svn: 290558	2016-12-27 00:13:09 +00:00
Chandler Carruth	6e9bb7e064	[PM] Teach the always inliner in the new pass manager to support removing fully-dead comdats without removing dead entries in comdats with live members. This factors the core logic out of the current inliner's internals to a reusable utility and leverages that in both places. The factored out code should also be (minorly) more efficient in cases where we have very few dead functions or dead comdats to consider. I've added a test case to cover this behavior of the always inliner. This is the last significant bug in the new PM's always inliner I've found (so far). llvm-svn: 290557	2016-12-26 23:43:27 +00:00
Mehdi Amini	4506e447c1	[doc] Add mention of the difference in optimization level between Release and RelWithDebInfo in Cmake.rst This is surprising to many people. llvm-svn: 290556	2016-12-26 23:42:12 +00:00
Chandler Carruth	cc44ab63b6	[ADT] Add an llvm::erase_if utility to make the standard erase+remove_if pattern easier to write. Differential Revision: https://reviews.llvm.org/D28120 llvm-svn: 290555	2016-12-26 23:30:44 +00:00
Simon Pilgrim	c9cf7fc7a4	[InstCombine][X86] Add DemandedElts support for PMULDQ/PMULUDQ instructions PMULDQ/PMULUDQ vXi64 instructions only use the even numbered v2Xi32 input elements which SimplifyDemandedVectorElts should try and use. Differential Revision: https://reviews.llvm.org/D28119 llvm-svn: 290554	2016-12-26 23:28:17 +00:00
Chandler Carruth	d9eaa54ef4	[ADT] Add a boring std::partition wrapper similar to our std::remove_if wrapper. llvm-svn: 290553	2016-12-26 23:10:40 +00:00
Richard Smith	4f9b3f4a5b	Update comment to match dr1770. llvm-svn: 290552	2016-12-26 22:28:29 +00:00
Daniel Berlin	85f91b0ec3	clang-format NewGVN files llvm-svn: 290551	2016-12-26 20:06:58 +00:00
Daniel Berlin	85cbc8c097	Misc cleanups and simplifications for NewGVN. Mostly use a bit more idiomatic C++ where we can, so we can combine some things later. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28111 llvm-svn: 290550	2016-12-26 19:57:25 +00:00
Daniel Berlin	d59e8010c5	Don't use our own incorrect version of isTriviallyDeadInstruction in NewGVN. Fixes PR/31472 llvm-svn: 290549	2016-12-26 18:44:36 +00:00
Davide Italiano	fe7a3ee51e	[NewGVN] Add a flag to enable the pass via `-mllvm`. NewGVN can be tested passing `-mllvm -enable-newgvn` to clang. Differential Revision: https://reviews.llvm.org/D28059 llvm-svn: 290548	2016-12-26 18:26:19 +00:00
Simon Pilgrim	6f3e1ea403	Wdocumentation fix llvm-svn: 290547	2016-12-26 18:11:49 +00:00
Davide Italiano	8ea5e4fcae	[NewGVN] Change test to reflect difference between GVN and NewGVN. The current GVN algorithm folds unconditional branches to, it claims, expose more PRE oportunities. The folding, if really needed, (which is not sure, as it's not really proved it improves analysis) can be done by an earlier cleanup pass instead of GVN itself. Ack'ed/SGTM'd by Daniel Berlin. Differential Revision: https://reviews.llvm.org/D28117 llvm-svn: 290546	2016-12-26 18:10:09 +00:00
Simon Pilgrim	cd9d729461	Wdocumentation fix llvm-svn: 290545	2016-12-26 17:48:19 +00:00
Simon Pilgrim	e8a5ab35ca	[X86][AVX512] Added v64i8 reverse shuffle test (PR31470) llvm-svn: 290544	2016-12-26 17:38:58 +00:00
Davide Italiano	a312ca845c	[NewGVN] Fold lookupOperandLeader() when there's only one use. NFCI. llvm-svn: 290543	2016-12-26 16:19:34 +00:00
Bryant Wong	b5e03b61e2	[InstCombiner] Simplify lib calls to `round{,f}` Differential Revision: https://reviews.llvm.org/D28110 llvm-svn: 290542	2016-12-26 14:29:29 +00:00
Marina Yatsina	c5cf7a8b00	Fix build error caused by r290539. llvm-svn: 290541	2016-12-26 13:16:40 +00:00
Marina Yatsina	168b954611	[inline-asm]No error for conflict between inputs\outputs and clobber list Updated test according to commit 290539: According to extended asm syntax, a case where the clobber list includes a variable from the inputs or outputs should be an error - conflict. for example: const long double a = 0.0; int main() { char b; double t1 = a; __asm__ ("fucompp": "=a" (b) : "u" (t1), "t" (t1) : "cc", "st", "st(1)"); return 0; } This should conflict with the output - t1 which is st, and st which is st aswell. The patch fixes it. Commit on behald of Ziv Izhar. Differential Revision: https://reviews.llvm.org/D15075 llvm-svn: 290540	2016-12-26 12:24:49 +00:00
Marina Yatsina	c42fd03bf8	[inline-asm]No error for conflict between inputs\outputs and clobber list According to extended asm syntax, a case where the clobber list includes a variable from the inputs or outputs should be an error - conflict. for example: const long double a = 0.0; int main() { char b; double t1 = a; __asm__ ("fucompp": "=a" (b) : "u" (t1), "t" (t1) : "cc", "st", "st(1)"); return 0; } This should conflict with the output - t1 which is st, and st which is st aswell. The patch fixes it. Commit on behald of Ziv Izhar. Differential Revision: https://reviews.llvm.org/D15075 llvm-svn: 290539	2016-12-26 12:23:42 +00:00
Tobias Grosser	600941351e	Update to isl-0.18-17-g2844ebf This update improves isl's ability to coalesce different convex sets/maps, especially when the contain existentially quantified variables. llvm-svn: 290538	2016-12-26 12:11:40 +00:00
Chandler Carruth	80db76d556	Test the different scenarios of GlobalDCE and comdats more systematically and document in the test what all is going on. This replaces the PR-named test that was the only coverage for GlobalDCE and comdats previously. I wrote this because I wasn't certain how comdat DCE was supposed to work and wanted to step through what GlobalDCE did to fully understand it. After talking to folks and reading the code and really staring at things it all makes sense but it seemed good to help write down some of this in a more explicit and fully covering test case. For example, it seemed like a bug that GlobalDCE didn't consider comdat participation of ifuncs. Specifically it seemed like an accident because testing didn't really cover that case. But in fact, ifuncs specifically cannot participate in a comdat despite having that API. The new test case covers this and explicitly documents that DCE gets to fire here even though there are comdats involved. Also, we didn't have any positive tests for the challenging cases such as usage cycles between comdat participants that might make them seem alive except that there is no external edge into the cycle. llvm-svn: 290537	2016-12-26 08:54:01 +00:00
Craig Topper	5ef13ba18b	[AVX-512] Fix some patterns to use extended register classes. llvm-svn: 290536	2016-12-26 07:26:07 +00:00
Craig Topper	7b788ada2d	[AVX-512][InstCombine] Teach InstCombine to turn scalar add/sub/mul/div with rounding intrinsics into normal IR operations if the rounding mode is CUR_DIRECTION. Summary: I only do this for unmasked cases for now because isel is failing to fold the mask. I'll try to fix that soon. I'll do the same thing for packed add/sub/mul/div in a future patch. Reviewers: delena, RKSimon, zvi, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27879 llvm-svn: 290535	2016-12-26 06:33:19 +00:00
Saleem Abdulrasool	c47e1aab1c	test: add explicit triples to the invocation llvm-svn: 290534	2016-12-26 04:00:54 +00:00
Saleem Abdulrasool	d133dc226f	Driver: warn on -fPIC/-fpic/-fPIE/-fpie on Windows Use of these flags would result in the use of ELF-style PIE/PIC code which is incorrect on Windows. Windows is inherently PIC by means of the DLL slide that occurs at load. This also mirrors the behaviour on GCC for MinGW. Currently, the Windows x86_64 forces the relocation model to PIC (Level 2). This is unchanged for now, though we should remove any assumptions on that and change it to a static relocation model. llvm-svn: 290533	2016-12-26 03:35:24 +00:00
Craig Topper	f56d985f77	[AVX-512] Don't assume that the rounding mode argument to intrinsics is a constant. While clang will guarantee this, nothing in the backend will. A non-constant value will now result in an isel error instead of just asserting or crashing due to a bad cast during lowering. llvm-svn: 290532	2016-12-26 01:40:17 +00:00
Chandler Carruth	0cf829c171	Fix some bad indentation that I or another introduced somehow. llvm-svn: 290531	2016-12-26 01:20:59 +00:00
Craig Topper	e328045711	[AVX-512][InstCombine] Teach InstCombine to converted masked vpermv intrinsics into shufflevector instructions Summary: This patch adds support for converting the masked vpermv intrinsics into shufflevector instructions if the indices are constants. We also need to wrap a select instruction around the shuffle to take care of the masking part. InstCombine will take care of optimizing the select if the mask is constant so I didn't bother checking for that. Reviewers: zvi, delena, spatel, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27825 llvm-svn: 290530	2016-12-25 23:58:57 +00:00
Bryant Wong	c6b46d80c8	Fix `update_test_checks.py` bug that incorrectly truncates IR body. Differential Revision: https://reviews.llvm.org/D26619 llvm-svn: 290529	2016-12-25 23:46:55 +00:00
Chandler Carruth	cb22b89f3f	[ADT] Add a generic concatenating iterator and range (take 2). This recommits r290512 that was reverted when MSVC failed to compile it. Since then I've played with various approaches using rextester.com (where I was able to reproduce the failure) and think that I have a solution thanks in part to the help of Dave Blaikie! It seems MSVC just has a defective `decltype` in this version. Manually writing out the type seems to do the trick, even though it is .... quite complicated. Original commit message: This allows both defining convenience iterator/range accessors on types which walk across N different independent ranges within the object, and more direct and simple usages with range based for loops such as shown in the unittest. The same facilities are used for both. They end up quite small and simple as it happens. I've also switched an iterator on `Module` to use this. I would like to add another convenience iterator that includes even more sequences as part of it and seeing this one already present motivated me to actually abstract it away and introduce a general utility. Differential Revision: https://reviews.llvm.org/D28093 llvm-svn: 290528	2016-12-25 23:41:14 +00:00
Bryant Wong	4213d94142	[MemorySSA] Define a restricted upward AccessList splice. Differential Revision: https://reviews.llvm.org/D26661 llvm-svn: 290527	2016-12-25 23:34:07 +00:00

1 2 3 4 5 ...

250605 Commits All Branches Search

250605 Commits

All Branches