llvm-project

Commit Graph

Author	SHA1	Message	Date
Bryant Wong	b5e03b61e2	[InstCombiner] Simplify lib calls to `round{,f}` Differential Revision: https://reviews.llvm.org/D28110 llvm-svn: 290542	2016-12-26 14:29:29 +00:00
Marina Yatsina	c5cf7a8b00	Fix build error caused by r290539. llvm-svn: 290541	2016-12-26 13:16:40 +00:00
Marina Yatsina	168b954611	[inline-asm]No error for conflict between inputs\outputs and clobber list Updated test according to commit 290539: According to extended asm syntax, a case where the clobber list includes a variable from the inputs or outputs should be an error - conflict. for example: const long double a = 0.0; int main() { char b; double t1 = a; __asm__ ("fucompp": "=a" (b) : "u" (t1), "t" (t1) : "cc", "st", "st(1)"); return 0; } This should conflict with the output - t1 which is st, and st which is st aswell. The patch fixes it. Commit on behald of Ziv Izhar. Differential Revision: https://reviews.llvm.org/D15075 llvm-svn: 290540	2016-12-26 12:24:49 +00:00
Marina Yatsina	c42fd03bf8	[inline-asm]No error for conflict between inputs\outputs and clobber list According to extended asm syntax, a case where the clobber list includes a variable from the inputs or outputs should be an error - conflict. for example: const long double a = 0.0; int main() { char b; double t1 = a; __asm__ ("fucompp": "=a" (b) : "u" (t1), "t" (t1) : "cc", "st", "st(1)"); return 0; } This should conflict with the output - t1 which is st, and st which is st aswell. The patch fixes it. Commit on behald of Ziv Izhar. Differential Revision: https://reviews.llvm.org/D15075 llvm-svn: 290539	2016-12-26 12:23:42 +00:00
Tobias Grosser	600941351e	Update to isl-0.18-17-g2844ebf This update improves isl's ability to coalesce different convex sets/maps, especially when the contain existentially quantified variables. llvm-svn: 290538	2016-12-26 12:11:40 +00:00
Chandler Carruth	80db76d556	Test the different scenarios of GlobalDCE and comdats more systematically and document in the test what all is going on. This replaces the PR-named test that was the only coverage for GlobalDCE and comdats previously. I wrote this because I wasn't certain how comdat DCE was supposed to work and wanted to step through what GlobalDCE did to fully understand it. After talking to folks and reading the code and really staring at things it all makes sense but it seemed good to help write down some of this in a more explicit and fully covering test case. For example, it seemed like a bug that GlobalDCE didn't consider comdat participation of ifuncs. Specifically it seemed like an accident because testing didn't really cover that case. But in fact, ifuncs specifically cannot participate in a comdat despite having that API. The new test case covers this and explicitly documents that DCE gets to fire here even though there are comdats involved. Also, we didn't have any positive tests for the challenging cases such as usage cycles between comdat participants that might make them seem alive except that there is no external edge into the cycle. llvm-svn: 290537	2016-12-26 08:54:01 +00:00
Craig Topper	5ef13ba18b	[AVX-512] Fix some patterns to use extended register classes. llvm-svn: 290536	2016-12-26 07:26:07 +00:00
Craig Topper	7b788ada2d	[AVX-512][InstCombine] Teach InstCombine to turn scalar add/sub/mul/div with rounding intrinsics into normal IR operations if the rounding mode is CUR_DIRECTION. Summary: I only do this for unmasked cases for now because isel is failing to fold the mask. I'll try to fix that soon. I'll do the same thing for packed add/sub/mul/div in a future patch. Reviewers: delena, RKSimon, zvi, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27879 llvm-svn: 290535	2016-12-26 06:33:19 +00:00
Saleem Abdulrasool	c47e1aab1c	test: add explicit triples to the invocation llvm-svn: 290534	2016-12-26 04:00:54 +00:00
Saleem Abdulrasool	d133dc226f	Driver: warn on -fPIC/-fpic/-fPIE/-fpie on Windows Use of these flags would result in the use of ELF-style PIE/PIC code which is incorrect on Windows. Windows is inherently PIC by means of the DLL slide that occurs at load. This also mirrors the behaviour on GCC for MinGW. Currently, the Windows x86_64 forces the relocation model to PIC (Level 2). This is unchanged for now, though we should remove any assumptions on that and change it to a static relocation model. llvm-svn: 290533	2016-12-26 03:35:24 +00:00
Craig Topper	f56d985f77	[AVX-512] Don't assume that the rounding mode argument to intrinsics is a constant. While clang will guarantee this, nothing in the backend will. A non-constant value will now result in an isel error instead of just asserting or crashing due to a bad cast during lowering. llvm-svn: 290532	2016-12-26 01:40:17 +00:00
Chandler Carruth	0cf829c171	Fix some bad indentation that I or another introduced somehow. llvm-svn: 290531	2016-12-26 01:20:59 +00:00
Craig Topper	e328045711	[AVX-512][InstCombine] Teach InstCombine to converted masked vpermv intrinsics into shufflevector instructions Summary: This patch adds support for converting the masked vpermv intrinsics into shufflevector instructions if the indices are constants. We also need to wrap a select instruction around the shuffle to take care of the masking part. InstCombine will take care of optimizing the select if the mask is constant so I didn't bother checking for that. Reviewers: zvi, delena, spatel, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27825 llvm-svn: 290530	2016-12-25 23:58:57 +00:00
Bryant Wong	c6b46d80c8	Fix `update_test_checks.py` bug that incorrectly truncates IR body. Differential Revision: https://reviews.llvm.org/D26619 llvm-svn: 290529	2016-12-25 23:46:55 +00:00
Chandler Carruth	cb22b89f3f	[ADT] Add a generic concatenating iterator and range (take 2). This recommits r290512 that was reverted when MSVC failed to compile it. Since then I've played with various approaches using rextester.com (where I was able to reproduce the failure) and think that I have a solution thanks in part to the help of Dave Blaikie! It seems MSVC just has a defective `decltype` in this version. Manually writing out the type seems to do the trick, even though it is .... quite complicated. Original commit message: This allows both defining convenience iterator/range accessors on types which walk across N different independent ranges within the object, and more direct and simple usages with range based for loops such as shown in the unittest. The same facilities are used for both. They end up quite small and simple as it happens. I've also switched an iterator on `Module` to use this. I would like to add another convenience iterator that includes even more sequences as part of it and seeing this one already present motivated me to actually abstract it away and introduce a general utility. Differential Revision: https://reviews.llvm.org/D28093 llvm-svn: 290528	2016-12-25 23:41:14 +00:00
Bryant Wong	4213d94142	[MemorySSA] Define a restricted upward AccessList splice. Differential Revision: https://reviews.llvm.org/D26661 llvm-svn: 290527	2016-12-25 23:34:07 +00:00
Bryant Wong	a07d9b1460	[AliasAnalysis] Teach BasicAA about memcpy. Differential Revision: https://reviews.llvm.org/D27034 llvm-svn: 290526	2016-12-25 22:42:27 +00:00
Daniel Berlin	d7c12ee54c	Value number stores and memory states so we can detect when memory states are equivalent (IE store of same value to memory). Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28084 llvm-svn: 290525	2016-12-25 22:23:49 +00:00
Daniel Berlin	65f5f0d728	Rename GVNExpression ops_ members to op_* to match conventions in the rest of LLVM llvm-svn: 290524	2016-12-25 22:10:37 +00:00
Lang Hames	c9d0ff1302	[Orc][RPC] Add a ParallelCallGroup utility for dispatching and waiting on multiple asynchronous RPC calls. ParallelCallGroup allows multiple asynchronous calls to be dispatched, and provides a wait method that blocks until all asynchronous calls have been executed on the remote and all return value handlers run on the local machine. This will allow, for example, the JIT client to issue memory allocation calls for all sections in parallel, then block until all memory has been allocated on the remote and the allocated addresses registered with the client, at which point the JIT client can proceed to applying relocations. llvm-svn: 290523	2016-12-25 21:55:05 +00:00
Richard Smith	993f203278	Fix assertion failure when deducing an auto-typed argument against a different-width int. llvm-svn: 290522	2016-12-25 20:21:12 +00:00
Kuba Mracek	a6a177389c	[sanitizer] Define some CPU type symbols (like CPU_SUBTYPE_X86_64_H) when they're not available. This allows compiler-rt to be built on older macOS SDKs, where there symbols are not defined. Patch by Jeremy Huddleston Sequoia <jeremyhu@apple.com>. llvm-svn: 290521	2016-12-25 20:03:40 +00:00
Lang Hames	aac390ee85	[Orc][RPC] Clang-format RPCUtils header. Some of the recent RPC call type-checking changes weren't formatted prior to commit. llvm-svn: 290520	2016-12-25 19:55:59 +00:00
Greg Clayton	1eb0bca178	Add newline to end of file to quiet warnings. llvm-svn: 290519	2016-12-25 18:41:47 +00:00
Roman Gareev	1c2927b209	Specify the default values of the cache parameters If the parameters of the target cache (i.e., cache level sizes, cache level associativities) are not specified or have wrong values, we use ones for parameters of the macro-kernel and do not perform data-layout optimizations of the matrix multiplication. In this patch we specify the default values of the cache parameters to be able to apply the pattern matching optimizations even in this case. Since there is no typical values of this parameters, we use the parameters of Intel Core i7-3820 SandyBridge that also help to attain the high-performance on IBM POWER System S822 and IBM Power 730 Express server. Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D28090 llvm-svn: 290518	2016-12-25 16:32:28 +00:00
Michael Zuckerman	86602e85dd	revert commit 290516 llvm-svn: 290517	2016-12-25 12:45:18 +00:00
Michael Zuckerman	45aa420640	Commit try added new empty line llvm-svn: 290516	2016-12-25 12:01:34 +00:00
Amjad Aboud	e2aab8c30c	[DebugInfo] Added support for Checksum debug info feature. Differential Revision: https://reviews.llvm.org/D27641 llvm-svn: 290515	2016-12-25 10:12:27 +00:00
Amjad Aboud	7faeecc8f7	[DebugInfo] Added support for Checksum debug info feature. Differential Revision: https://reviews.llvm.org/D27642 llvm-svn: 290514	2016-12-25 10:12:09 +00:00
Chandler Carruth	5dc0bba4e4	Revert r290512: [ADT] Add a generic concatenating iterator and range. This code doesn't work on MSVC for reasons that elude me and I've not yet covinced a workaround to compile cleanly so reverting for now while I play with it. llvm-svn: 290513	2016-12-25 09:36:24 +00:00
Chandler Carruth	fba73aec72	[ADT] Add a generic concatenating iterator and range. This allows both defining convenience iterator/range accessors on types which walk across N different independent ranges within the object, and more direct and simple usages with range based for loops such as shown in the unittest. The same facilities are used for both. They end up quite small and simple as it happens. I've also switched an iterator on `Module` to use this. I would like to add another convenience iterator that includes even more sequences as part of it and seeing this one already present motivated me to actually abstract it away and introduce a general utility. Differential Revision: https://reviews.llvm.org/D28093 llvm-svn: 290512	2016-12-25 08:22:50 +00:00
Richard Smith	87d263e870	Fix some subtle wrong partial ordering bugs particularly with C++1z auto-typed non-type template parameters. During partial ordering, when checking the substituted deduced template arguments match the original, check the types of non-type template arguments match even if they're dependent. The only way we get dependent types here is if they really represent types of the other template (which are supposed to be modeled as being substituted for unique, non-dependent types). In order to make this work for auto-typed non-type template arguments, we need to be able to perform auto deduction even when the initializer and (potentially) the auto type are dependent, support for which is the bulk of this patch. (Note that this requires the ability to deduce only a single level of a multi-level dependent type.) llvm-svn: 290511	2016-12-25 08:05:23 +00:00
George Rimar	31a46b4835	[ELF] - Fix mistype in comment. NFC. llvm-svn: 290510	2016-12-25 06:49:17 +00:00
David Majnemer	a5cfddc367	[MS ABI] Mangle unnamed enums correctly Unnamed enums take the name of the first enumerator they define. llvm-svn: 290509	2016-12-25 05:26:02 +00:00
Kelvin Li	83c451e998	[OpenMP] Sema and parsing for 'target teams distribute' pragma This patch is to implement sema and parsing for 'target teams distribute' pragma. Differential Revision: https://reviews.llvm.org/D28015 llvm-svn: 290508	2016-12-25 04:52:54 +00:00
Mehdi Amini	690952d15e	MetadataLoader: replace the tracking of ForwardReferences and UnresolvedNodes with a set-based solution (NFC) This makes it explicit what is the exact list to handle, and it looks much more easy to manipulate and understand that the previous custom tracking of min/max to express the range where to look for. Differential Revision: https://reviews.llvm.org/D28089 llvm-svn: 290507	2016-12-25 04:22:54 +00:00
Mehdi Amini	4f90ee0010	MetadataLoader: add an extra assertion in Placeholders flush (NFC) We don't expect any forward reference at this point. llvm-svn: 290506	2016-12-25 03:55:53 +00:00
Anton Yartsev	5ac3720620	Fix for PR15623 (corrected r290413 reverted at 290415). The patch eliminates unwanted ProgramState checker data propagation from an operand of the logical operation to operation result. The patch also simplifies an assume of a constraint of the form: "(exp comparison_op expr) != 0" to true into an assume of "exp comparison_op expr" to true. (And similarly, an assume of the form "(exp comparison_op expr) == 0" to true as an assume of exp comparison_op expr to false.) which improves precision overall. https://reviews.llvm.org/D22862 llvm-svn: 290505	2016-12-25 00:57:51 +00:00
Daniel Berlin	a7b624ec6a	Add range iterator for blocks in MemoryPhi llvm-svn: 290504	2016-12-24 21:52:10 +00:00
Shoaib Meenai	fe1aacd014	[libc++] Make __num_get_float hidden It's an internal function and shouldn't be exported. It's also a source of discrepancy in the published ABI list; these symbols aren't exported for me on CentOS 7 or Ubuntu 16.04, leading to spurious check-cxx-abilist failures. Differential Revision: https://reviews.llvm.org/D27153 llvm-svn: 290503	2016-12-24 18:05:32 +00:00
Simon Pilgrim	3265d951b6	[InstCombine][X86] Add tests showing missed opportunities to simplify PMULUDQ/PMULDQ inputs. PMULUDQ/PMULDQ - only the even elements (0, 2, 4, 6) of the vXi32 inputs are required. llvm-svn: 290502	2016-12-24 17:30:19 +00:00
Bryant Wong	430f98a58b	Test commit. llvm-svn: 290501	2016-12-24 17:26:38 +00:00
Marshall Clow	da520dcbeb	Fix bug #31387 - not checking end iterator when parsing decimal escape. Thanks to Karen for the report. llvm-svn: 290500	2016-12-24 17:21:03 +00:00
Davide Italiano	463c32eaf6	[NewGVN] Prefer `auto` to explicit type when the latter is obvious. llvm-svn: 290499	2016-12-24 17:17:21 +00:00
Davide Italiano	4f84764e32	[NewGVN] Simplify several equals() member functions. NFCI. llvm-svn: 290498	2016-12-24 17:14:19 +00:00
Richard Smith	0da6dc47d1	Factor out duplication between partial ordering for class template partial specializations and variable template partial specializations. llvm-svn: 290497	2016-12-24 16:40:51 +00:00
Davide Italiano	d42deb4014	[PM] Remove vestiges of NoAA. NFCI. llvm-svn: 290496	2016-12-24 16:14:05 +00:00
Yaron Keren	1c4bbc9a41	Deduplicate several GD.getDecl() calls into Decl * local variable. llvm-svn: 290495	2016-12-24 15:32:39 +00:00
Ed Maste	178a4e5f8d	llvm-objdump: sort phdr type strings in advance of adding new ones llvm-svn: 290494	2016-12-24 14:53:45 +00:00
Malcolm Parsons	0cc3051d8e	[clang-tidy] Remove local hasInClassInitializer matcher. NFC llvm-svn: 290493	2016-12-24 14:30:29 +00:00

1 2 3 4 5 ...

250571 Commits All Branches Search

250571 Commits

All Branches