llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Berlin	85cbc8c097	Misc cleanups and simplifications for NewGVN. Mostly use a bit more idiomatic C++ where we can, so we can combine some things later. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28111 llvm-svn: 290550	2016-12-26 19:57:25 +00:00
Daniel Berlin	d59e8010c5	Don't use our own incorrect version of isTriviallyDeadInstruction in NewGVN. Fixes PR/31472 llvm-svn: 290549	2016-12-26 18:44:36 +00:00
Davide Italiano	fe7a3ee51e	[NewGVN] Add a flag to enable the pass via `-mllvm`. NewGVN can be tested passing `-mllvm -enable-newgvn` to clang. Differential Revision: https://reviews.llvm.org/D28059 llvm-svn: 290548	2016-12-26 18:26:19 +00:00
Davide Italiano	8ea5e4fcae	[NewGVN] Change test to reflect difference between GVN and NewGVN. The current GVN algorithm folds unconditional branches to, it claims, expose more PRE oportunities. The folding, if really needed, (which is not sure, as it's not really proved it improves analysis) can be done by an earlier cleanup pass instead of GVN itself. Ack'ed/SGTM'd by Daniel Berlin. Differential Revision: https://reviews.llvm.org/D28117 llvm-svn: 290546	2016-12-26 18:10:09 +00:00
Simon Pilgrim	cd9d729461	Wdocumentation fix llvm-svn: 290545	2016-12-26 17:48:19 +00:00
Simon Pilgrim	e8a5ab35ca	[X86][AVX512] Added v64i8 reverse shuffle test (PR31470) llvm-svn: 290544	2016-12-26 17:38:58 +00:00
Davide Italiano	a312ca845c	[NewGVN] Fold lookupOperandLeader() when there's only one use. NFCI. llvm-svn: 290543	2016-12-26 16:19:34 +00:00
Bryant Wong	b5e03b61e2	[InstCombiner] Simplify lib calls to `round{,f}` Differential Revision: https://reviews.llvm.org/D28110 llvm-svn: 290542	2016-12-26 14:29:29 +00:00
Chandler Carruth	80db76d556	Test the different scenarios of GlobalDCE and comdats more systematically and document in the test what all is going on. This replaces the PR-named test that was the only coverage for GlobalDCE and comdats previously. I wrote this because I wasn't certain how comdat DCE was supposed to work and wanted to step through what GlobalDCE did to fully understand it. After talking to folks and reading the code and really staring at things it all makes sense but it seemed good to help write down some of this in a more explicit and fully covering test case. For example, it seemed like a bug that GlobalDCE didn't consider comdat participation of ifuncs. Specifically it seemed like an accident because testing didn't really cover that case. But in fact, ifuncs specifically cannot participate in a comdat despite having that API. The new test case covers this and explicitly documents that DCE gets to fire here even though there are comdats involved. Also, we didn't have any positive tests for the challenging cases such as usage cycles between comdat participants that might make them seem alive except that there is no external edge into the cycle. llvm-svn: 290537	2016-12-26 08:54:01 +00:00
Craig Topper	5ef13ba18b	[AVX-512] Fix some patterns to use extended register classes. llvm-svn: 290536	2016-12-26 07:26:07 +00:00
Craig Topper	7b788ada2d	[AVX-512][InstCombine] Teach InstCombine to turn scalar add/sub/mul/div with rounding intrinsics into normal IR operations if the rounding mode is CUR_DIRECTION. Summary: I only do this for unmasked cases for now because isel is failing to fold the mask. I'll try to fix that soon. I'll do the same thing for packed add/sub/mul/div in a future patch. Reviewers: delena, RKSimon, zvi, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27879 llvm-svn: 290535	2016-12-26 06:33:19 +00:00
Craig Topper	f56d985f77	[AVX-512] Don't assume that the rounding mode argument to intrinsics is a constant. While clang will guarantee this, nothing in the backend will. A non-constant value will now result in an isel error instead of just asserting or crashing due to a bad cast during lowering. llvm-svn: 290532	2016-12-26 01:40:17 +00:00
Chandler Carruth	0cf829c171	Fix some bad indentation that I or another introduced somehow. llvm-svn: 290531	2016-12-26 01:20:59 +00:00
Craig Topper	e328045711	[AVX-512][InstCombine] Teach InstCombine to converted masked vpermv intrinsics into shufflevector instructions Summary: This patch adds support for converting the masked vpermv intrinsics into shufflevector instructions if the indices are constants. We also need to wrap a select instruction around the shuffle to take care of the masking part. InstCombine will take care of optimizing the select if the mask is constant so I didn't bother checking for that. Reviewers: zvi, delena, spatel, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27825 llvm-svn: 290530	2016-12-25 23:58:57 +00:00
Bryant Wong	c6b46d80c8	Fix `update_test_checks.py` bug that incorrectly truncates IR body. Differential Revision: https://reviews.llvm.org/D26619 llvm-svn: 290529	2016-12-25 23:46:55 +00:00
Chandler Carruth	cb22b89f3f	[ADT] Add a generic concatenating iterator and range (take 2). This recommits r290512 that was reverted when MSVC failed to compile it. Since then I've played with various approaches using rextester.com (where I was able to reproduce the failure) and think that I have a solution thanks in part to the help of Dave Blaikie! It seems MSVC just has a defective `decltype` in this version. Manually writing out the type seems to do the trick, even though it is .... quite complicated. Original commit message: This allows both defining convenience iterator/range accessors on types which walk across N different independent ranges within the object, and more direct and simple usages with range based for loops such as shown in the unittest. The same facilities are used for both. They end up quite small and simple as it happens. I've also switched an iterator on `Module` to use this. I would like to add another convenience iterator that includes even more sequences as part of it and seeing this one already present motivated me to actually abstract it away and introduce a general utility. Differential Revision: https://reviews.llvm.org/D28093 llvm-svn: 290528	2016-12-25 23:41:14 +00:00
Bryant Wong	4213d94142	[MemorySSA] Define a restricted upward AccessList splice. Differential Revision: https://reviews.llvm.org/D26661 llvm-svn: 290527	2016-12-25 23:34:07 +00:00
Bryant Wong	a07d9b1460	[AliasAnalysis] Teach BasicAA about memcpy. Differential Revision: https://reviews.llvm.org/D27034 llvm-svn: 290526	2016-12-25 22:42:27 +00:00
Daniel Berlin	d7c12ee54c	Value number stores and memory states so we can detect when memory states are equivalent (IE store of same value to memory). Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28084 llvm-svn: 290525	2016-12-25 22:23:49 +00:00
Daniel Berlin	65f5f0d728	Rename GVNExpression ops_ members to op_* to match conventions in the rest of LLVM llvm-svn: 290524	2016-12-25 22:10:37 +00:00
Lang Hames	c9d0ff1302	[Orc][RPC] Add a ParallelCallGroup utility for dispatching and waiting on multiple asynchronous RPC calls. ParallelCallGroup allows multiple asynchronous calls to be dispatched, and provides a wait method that blocks until all asynchronous calls have been executed on the remote and all return value handlers run on the local machine. This will allow, for example, the JIT client to issue memory allocation calls for all sections in parallel, then block until all memory has been allocated on the remote and the allocated addresses registered with the client, at which point the JIT client can proceed to applying relocations. llvm-svn: 290523	2016-12-25 21:55:05 +00:00
Lang Hames	aac390ee85	[Orc][RPC] Clang-format RPCUtils header. Some of the recent RPC call type-checking changes weren't formatted prior to commit. llvm-svn: 290520	2016-12-25 19:55:59 +00:00
Greg Clayton	1eb0bca178	Add newline to end of file to quiet warnings. llvm-svn: 290519	2016-12-25 18:41:47 +00:00
Michael Zuckerman	86602e85dd	revert commit 290516 llvm-svn: 290517	2016-12-25 12:45:18 +00:00
Michael Zuckerman	45aa420640	Commit try added new empty line llvm-svn: 290516	2016-12-25 12:01:34 +00:00
Amjad Aboud	7faeecc8f7	[DebugInfo] Added support for Checksum debug info feature. Differential Revision: https://reviews.llvm.org/D27642 llvm-svn: 290514	2016-12-25 10:12:09 +00:00
Chandler Carruth	5dc0bba4e4	Revert r290512: [ADT] Add a generic concatenating iterator and range. This code doesn't work on MSVC for reasons that elude me and I've not yet covinced a workaround to compile cleanly so reverting for now while I play with it. llvm-svn: 290513	2016-12-25 09:36:24 +00:00
Chandler Carruth	fba73aec72	[ADT] Add a generic concatenating iterator and range. This allows both defining convenience iterator/range accessors on types which walk across N different independent ranges within the object, and more direct and simple usages with range based for loops such as shown in the unittest. The same facilities are used for both. They end up quite small and simple as it happens. I've also switched an iterator on `Module` to use this. I would like to add another convenience iterator that includes even more sequences as part of it and seeing this one already present motivated me to actually abstract it away and introduce a general utility. Differential Revision: https://reviews.llvm.org/D28093 llvm-svn: 290512	2016-12-25 08:22:50 +00:00
Mehdi Amini	690952d15e	MetadataLoader: replace the tracking of ForwardReferences and UnresolvedNodes with a set-based solution (NFC) This makes it explicit what is the exact list to handle, and it looks much more easy to manipulate and understand that the previous custom tracking of min/max to express the range where to look for. Differential Revision: https://reviews.llvm.org/D28089 llvm-svn: 290507	2016-12-25 04:22:54 +00:00
Mehdi Amini	4f90ee0010	MetadataLoader: add an extra assertion in Placeholders flush (NFC) We don't expect any forward reference at this point. llvm-svn: 290506	2016-12-25 03:55:53 +00:00
Daniel Berlin	a7b624ec6a	Add range iterator for blocks in MemoryPhi llvm-svn: 290504	2016-12-24 21:52:10 +00:00
Simon Pilgrim	3265d951b6	[InstCombine][X86] Add tests showing missed opportunities to simplify PMULUDQ/PMULDQ inputs. PMULUDQ/PMULDQ - only the even elements (0, 2, 4, 6) of the vXi32 inputs are required. llvm-svn: 290502	2016-12-24 17:30:19 +00:00
Bryant Wong	430f98a58b	Test commit. llvm-svn: 290501	2016-12-24 17:26:38 +00:00
Davide Italiano	463c32eaf6	[NewGVN] Prefer `auto` to explicit type when the latter is obvious. llvm-svn: 290499	2016-12-24 17:17:21 +00:00
Davide Italiano	4f84764e32	[NewGVN] Simplify several equals() member functions. NFCI. llvm-svn: 290498	2016-12-24 17:14:19 +00:00
Davide Italiano	d42deb4014	[PM] Remove vestiges of NoAA. NFCI. llvm-svn: 290496	2016-12-24 16:14:05 +00:00
Ed Maste	178a4e5f8d	llvm-objdump: sort phdr type strings in advance of adding new ones llvm-svn: 290494	2016-12-24 14:53:45 +00:00
Simon Pilgrim	0d66d29678	[SelectionDAG] Early out from computeKnownBits when we know we will have no common bits. Avoid extra (recursive) calls to computeKnownBits if we already know that there are no common known bits. llvm-svn: 290490	2016-12-24 12:59:35 +00:00
Chandler Carruth	534d644b86	[PM] Try to improve the comments here to make what's going on more clear. Based on post-commit review suggestion from Sean. (Thanks!) llvm-svn: 290488	2016-12-24 05:11:17 +00:00
Daniel Berlin	8a6a86146c	Mark isOnlyReachableViaThisEdge as const llvm-svn: 290468	2016-12-24 00:04:07 +00:00
Mehdi Amini	4fe6a8c826	Add an assertion for cl::opt names: they can't start with '-' llvm-svn: 290467	2016-12-23 23:55:26 +00:00
Mehdi Amini	70a02b0923	llvm-size: remove leading dash in '-radix' option cl::opt does not accept such option llvm-svn: 290466	2016-12-23 23:55:08 +00:00
Mehdi Amini	a9d7aacd4d	llvm-readobj: remove leading dash in '-a' option (ARMAttributesShort) cl::opt does not accept such option llvm-svn: 290465	2016-12-23 23:54:52 +00:00
Mehdi Amini	95c1f91545	llvm-lto2: remove leading '-' for cl::opt declaration llvm-svn: 290464	2016-12-23 23:54:34 +00:00
Mehdi Amini	14f19bd012	llvm-lto2: Print diagnostics before exiting (NFC) llvm-svn: 290463	2016-12-23 23:54:17 +00:00
Mehdi Amini	4c80946850	llvm-lto: pass errs() to the module verifier (NFC) It is more friendly to have the actual diagnostic when the verifier fails. llvm-svn: 290462	2016-12-23 23:53:57 +00:00
Chandler Carruth	cdfdd4330a	[PM] Remove a bunch of junk that snuck in when I failed at manipulating my editor to close and commit the patch. Sorry for the noise. llvm-svn: 290460	2016-12-23 23:39:31 +00:00
Chandler Carruth	4eaff12ba2	[PM] Teach the always inlining test case to be much more strict about whether functions are removed, and fix the new PM's always inliner to actually pass this test. Without this, the new PM's always inliner leaves all the functions kicking around which won't work out very well given the semantics of always inline. Doing this really highlights how frustrating the current alwaysinline semantic contract is though -- why can we put it on external functions, etc? Also I've added a number of tricky and interesting test cases for removing functions with the always inliner. There is one remaining case not handled -- fully removing comdats -- and I've left a FIXME about this. llvm-svn: 290457	2016-12-23 23:33:35 +00:00
Chandler Carruth	f32f63f222	[PM] Clean up test case and comments a bit. NFC. llvm-svn: 290456	2016-12-23 23:33:32 +00:00
Chandler Carruth	060ad61fbe	[PM] Add support for building a default AA pipeline to the PassBuilder. Pretty boring and lame as-is but necessary. This is definitely a place we'll end up with extension hooks longer term. =] Differential Revision: https://reviews.llvm.org/D28076 llvm-svn: 290449	2016-12-23 20:38:19 +00:00
Mehdi Amini	94f86ad4e0	Function-import: Disable IRVerifier on lazy-loaded modules: the ODR TypeUniquing generates invalid debug info. llvm-svn: 290442	2016-12-23 19:19:44 +00:00
Mehdi Amini	fc06b83ee7	Fix build after r290437 (missing include) llvm-svn: 290438	2016-12-23 18:04:51 +00:00
Mehdi Amini	9a9077fdad	FunctionImport: fix typo '#ifndef NDEBUG' instead of '#ifndef DEBUG' llvm-svn: 290437	2016-12-23 17:59:24 +00:00
Jan Vesely	206a510e54	AMDGPU: split ret/noret patterns for global atomics Differential Revision: https://reviews.llvm.org/D27989 llvm-svn: 290435	2016-12-23 15:34:51 +00:00
Davide Italiano	b9ff23a402	[LICM] Plug a leak freeing the ASTs before clearing the map. llvm-svn: 290433	2016-12-23 15:02:35 +00:00
Piotr Padlewski	383edba1fd	[MemDep] NFC changes llvm-svn: 290428	2016-12-23 13:13:32 +00:00
Davide Italiano	34f94384a5	[LICM] Work around LICM needs to maintain state across loops. The pass creates some state which expects to be cleaned up by a later instance of the same pass. opt-bisect happens to expose this not ideal design because calling skipLoop() will result in this state not being cleaned up at times and an assertion firing in `doFinalization()`. Chandler tells me the new pass manager will give us options to avoid these design traps, but until it's not ready, we need a workaround for the current pass infrastructure. Fix provided by Andy Kaylor, see the review for a complete discussion. Differential Revision: https://reviews.llvm.org/D25848 llvm-svn: 290427	2016-12-23 13:12:50 +00:00
Renato Golin	21da340f7a	[AArch64] Cortex-A57 FDIV/FSQRT scheduling fix (W-unit) According to the Cortex-A57 doc, FDIV/FSQRT instructions should use F0 unit (W-unit in AArch64SchedA57.td, the same as cryptography instructions), not F1 unit (X-unit in td, like ASIMD absolute diff accum SABA/UABA). This patch changes FDIV/FSQRT scheduling declarations to use A57UnitW instead of A57UnitX. Also, latencies for those instructions are corrected. Patch by Andrew Zhogin. llvm-svn: 290426	2016-12-23 12:51:41 +00:00
Florian Hahn	898127fe36	Revert r290423 because it broke the sanitizer-x86_64-linux-autoconf buildbot. llvm-svn: 290425	2016-12-23 12:26:11 +00:00
Florian Hahn	1d6b1a7b79	[framelowering] Skip dbg values when getting next/previous instruction. Summary: In mergeSPUpdates, debug values need to be ignored when getting the previous element, otherwise debug data could have an impact on codegen. In eliminateCallFramePseudoInstr, debug values after the erased element could have an impact on codegen and should be skipped. Closes PR31319 (https://llvm.org/bugs/show_bug.cgi?id=31319) Reviewers: mkuper, MatzeB, aprantl Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D27688 llvm-svn: 290423	2016-12-23 11:35:00 +00:00
Davide Italiano	0ff941620c	[NewGVN] Remove (for now) unused code. NFCI. llvm-svn: 290420	2016-12-23 10:28:30 +00:00
Mehdi Amini	96cdc49305	[ThinLTO] Verify lazy-loaded source module for function importing when assertions are enabled (NFC) llvm-svn: 290416	2016-12-23 05:16:19 +00:00
Mehdi Amini	9f926f70c1	MetadataLoader: split the creation of a single metadata out of a Record into its own function (NFC) This is pure code motion, will just make it more reusable when I'll attempt to lazy-load Metadats on-demand. llvm-svn: 290414	2016-12-23 03:59:18 +00:00
Dan Gohman	00d734d89b	[WebAssembly] Annotate call and load/store immediates. These will be used to guide the binary encoding of these immediates. llvm-svn: 290412	2016-12-23 03:23:52 +00:00
Zijiao Ma	bf6007bd1b	Make the canonicalisation on shifts benifit to more case. 1.Fix pessimized case in FIXME. 2.Add tests for it. 3.The canonicalisation on shifts results in different sequence for tests of machine-licm.Correct some check lines. Differential Revision: https://reviews.llvm.org/D27916 llvm-svn: 290410	2016-12-23 02:56:07 +00:00
Mehdi Amini	37c178b6f5	MetadataLoader: Reinitialize MinFwdRef/MaxFwdRef after resolving cycles (NFC) This put the Loader back in a consistent state. llvm-svn: 290409	2016-12-23 02:20:12 +00:00
Mehdi Amini	5ae6170fc2	MetadataLoader: Add an assertion for the implicit invariant of PlaceHolder while loading Metadata (NFC) llvm-svn: 290408	2016-12-23 02:20:09 +00:00
Mehdi Amini	70a9cd4cbe	MetadataLoader: Make sure every member of MetadataLoader are initialized (NFC) llvm-svn: 290407	2016-12-23 02:20:07 +00:00
Mehdi Amini	ec68dd49bf	MetadataLoader: Refactor "IsImporting" into the Pimpl for the MetadataLoader (NFC) Keeping all the state together will make it easier to handle. llvm-svn: 290406	2016-12-23 02:20:02 +00:00
Chandler Carruth	eb119ece4a	Fix some DOS-style line endings that I suspect snuck in from one of the frustrating Subversion clients that fails to do line ending translation of text files. llvm-svn: 290404	2016-12-23 02:02:26 +00:00
NAKAMURA Takumi	f931d66e0d	KillTheDoctor.cpp: Appease cases on case-senstitive host, like mingw on linux. llvm-svn: 290402	2016-12-23 01:39:26 +00:00
NAKAMURA Takumi	3166854827	KillTheDoctor: Add a required system lib, psapi. KillTheDoctor itself uses Win32 API directly. llvm-svn: 290401	2016-12-23 01:39:20 +00:00
Chandler Carruth	ee08676102	Enable '-Wstring-conversion' and fix some bad asserts that it helped find. Notable is the assert in NewGVN which had no effect because of the bug. llvm-svn: 290400	2016-12-23 01:38:06 +00:00
George Burgess IV	ccae43a247	Don't consider allocsize functions to be allocation functions. This patch fixes some ASAN unittest failures on FreeBSD. See the cfe-commits email thread for r290169 for more on those. According to the LangRef, the allocsize attribute only tells us about the number of bytes that exist at the memory location pointed to by the return value of a function. It does not necessarily mean that the function will only ever allocate. So, we need to be very careful about treating functions with allocsize as general allocation functions. This patch makes us fully conservative in this regard, though I suspect that we have room to be a bit more aggressive if we want. This has a FIXME that can be fixed by a relatively straightforward refactor; I just wanted to keep this patch minimal. If this sticks, I'll come back and fix it in a few days. llvm-svn: 290397	2016-12-23 01:18:09 +00:00
Sanjoy Das	50fef4321b	NFC code motion in ImplicitNullChecks Extract out two large lambdas into top level member functions. llvm-svn: 290395	2016-12-23 00:41:24 +00:00
Sanjoy Das	9a129807f3	Reimplement depedency tracking in the ImplicitNullChecks pass Summary: This change rewrites a core component in the ImplicitNullChecks pass for greater simplicity since the original design was over-complicated for no good reason. Please review this as essentially a new pass. The change is almost NFC and I've added a test case for a scenario that this new code handles that wasn't handled earlier. The implicit null check pass, at its core, is a code hoisting transform. It differs from "normal" code transforms in that it speculates potentially faulting instructions (by design), but a lot of the usual hazard detection logic (register read-after-write etc.) still applies. We previously detected hazards by keeping track of registers defined and used by machine instructions over an instruction range, but that was unwieldy and did not actually confer any performance benefits. The intent was to have linear time complexity over the number of machine instructions considered, but it ended up being N^2 is practice. This new version is more obviously O(N^2) (with N capped to 8 by default) in hazard detection. It does not attempt to be clever in tracking register uses or defs (the previous cleverness here was a source of bugs). Once this is checked in, I'll extract out the `IsSuitableMemoryOp` and `CanHoistLoadInst` lambda into member functions (they're too complicated to be inline lambdas) and do some other related NFC cleanups. Reviewers: reames, anna, atrick Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D27592 llvm-svn: 290394	2016-12-23 00:41:21 +00:00
Chris Bieneman	7e98468f1e	[ObjectYAML] Fixing a compiler warning Accidentally re-defined the variable instead of setting it. Oops! llvm-svn: 290388	2016-12-22 22:58:07 +00:00
Quentin Colombet	3749f33888	[GlobalISel] More fix for the size vs. type typo. NFC. I missed those in my previous commit (r290378). llvm-svn: 290387	2016-12-22 22:50:34 +00:00
Chris Bieneman	e0e451d927	[ObjectYAML] Support for DWARF debug_info section This patch adds support for YAML<->DWARF for debug_info sections. This re-lands r290147, reverted in 290148, re-landed in r290204 after fixing the issue that caused bots to fail (thank you UBSan!), and reverted again in r290209 due to failures on big endian systems. After adding support for preserving endianness, this should be good now. llvm-svn: 290386	2016-12-22 22:44:27 +00:00
Ahmed Bougacha	1277833aa6	[AArch64] Simplify indexed-memory testcase. NFC. We're only testing the addressing mode on the stores; we don't need to load/store pointers we can simply pass/return. llvm-svn: 290385	2016-12-22 22:27:05 +00:00
Evgeniy Stepanov	27d4c9b71b	[cfi] Emit jump tables as a function-level inline asm. Use a dummy private function with inline asm calls instead of module level asm blocks for CFI jumptables. The main advantage is that now jumptable codegen can be affected by the function attributes (like target_cpu on ARM). Module level asm gets the default subtarget based on the target triple, which is often not good enough. This change also uses asm constraints/arguments to reference jumptable targets and aliases directly. We no longer do asm name mangling in an IR pass. Differential Revision: https://reviews.llvm.org/D28012 llvm-svn: 290384	2016-12-22 22:22:35 +00:00
Chris Bieneman	e477fb9591	[ObjectYAML] Fixing big endian bots from r290381 Bot URL: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/2505 llvm-svn: 290383	2016-12-22 22:16:04 +00:00
Chris Bieneman	55de3a2449	[ObjectYAML] MachO support for endianness This patch adds support to the macho<->yaml tools for preserving endianness in MachO structures and DWARF data. llvm-svn: 290381	2016-12-22 21:58:03 +00:00
Quentin Colombet	fa5960a28b	[MachineVerifier] Check that even generic vregs comply to regclass constraints. We used to not check generic vregs, but that is actually a mistake given nothing in the GlobalISel pipeline is going to fix the constraints on target specific instructions. Therefore, the target has to have them right from the start. llvm-svn: 290380	2016-12-22 21:56:39 +00:00
Quentin Colombet	f372150f73	[AArch64] Change a test to use a generic instr instead of a target specific one. Target specific instructions have requirements that are not compatible with what we want to test here. Namely, target specific instructions must have their operands properly mapped on register classes. llvm-svn: 290379	2016-12-22 21:56:37 +00:00
Quentin Colombet	e08cc599b8	[MIRParser] Fix a typo in comment and error message. We have long switched from size to type. llvm-svn: 290378	2016-12-22 21:56:35 +00:00
Quentin Colombet	f38015e5fe	[AArch64][CallLowering] Constraint registers on target specific instruction The InstructionSelect pass will not look at target specific instructions since they are already selected. As a result, the operands of target specific instructions must be properly constrained, because it is not going to fix them. This fixes invalid register classes on call instruction. llvm-svn: 290377	2016-12-22 21:56:31 +00:00
Quentin Colombet	9751e61fe1	[MIRParser] Non-generic virtual register may have a type. When generic virtual registers get constrained, because of a use on a target specific operation for instance, we end up with regular virtual registers with a type and that's perfectly fine. llvm-svn: 290376	2016-12-22 21:56:29 +00:00
Quentin Colombet	7e1f66d6f5	[RegisterBankInfo] Allow to set a register class when nothing else is set This is going to be needed to be able to constraint register class on target specific instruction while the RegBankSelect pass did not run yet. llvm-svn: 290375	2016-12-22 21:56:26 +00:00
Quentin Colombet	b4e71185b2	[GlobalISel] Refactor the logic to constraint registers. Move the logic to constraint register from InstructionSelector to a utility function. It will be required by other passes in the GlobalISel pipeline. llvm-svn: 290374	2016-12-22 21:56:19 +00:00
Matt Arsenault	0b26e47345	AMDGPU: Invert cmp + select with constant Canonicalize a select with a constant to the false side. This enables more instruction shrinking opportunities since an inline immediate can be used for the false side of v_cndmask_b32_e32. This seems to usually be better but causes some code size regressions in some tests. llvm-svn: 290372	2016-12-22 21:40:08 +00:00
Tim Shen	53ddc1d0f4	[PowerPC] Add ppc support to update_llc_test_checks.py, and ppc tests. NFC. Reviewers: chandlerc, hfinkel, echristo, iteratee Subscribers: mehdi_amini, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D28036 llvm-svn: 290370	2016-12-22 20:59:39 +00:00
Krzysztof Parzyszek	3885d87c60	[Hexagon] Add DAG mutations for machine pipeliner llvm-svn: 290366	2016-12-22 19:44:55 +00:00
Wei Mi	a2f0b594c2	Redo store splitting in CodeGenPrepare. This is a succeeding patch of https://reviews.llvm.org/D22840 to address the issue when a value to be merged into an int64 pair is in a different BB. Redoing the store splitting in CodeGenPrepare so we can match the pattern across multiple BBs and move some instructions into the same BB. We still keep the code in dag combine so that we can catch cases that show up after DAG combining runs. Differential Revision: https://reviews.llvm.org/D25914 llvm-svn: 290365	2016-12-22 19:44:45 +00:00
Wei Mi	f3f01aba48	Change the interface of TLI.isMultiStoresCheaperThanBitsMerge. This is for splitMergedValStore in DAG Combine to share the target query interface with similar logic in CodeGenPrepare. Differential Revision: https://reviews.llvm.org/D24707 llvm-svn: 290363	2016-12-22 19:38:22 +00:00
Petar Jovanovic	8a4e63994e	[mips] Fix compact branch hazard detection, part 2 Follow up to D27209 fix, this patch now properly handles single transient instruction in basic block. Patch by Aleksandar Beserminji. Differential Revision: https://reviews.llvm.org/D27856 llvm-svn: 290361	2016-12-22 19:29:50 +00:00
Krzysztof Parzyszek	8839124848	Add the DAG mutation interface to the software pipeliner llvm-svn: 290360	2016-12-22 19:21:20 +00:00
Reid Kleckner	c2b56634cf	Pass -Wa,-mbig-obj in 64-bit mingw builds COFF has a 2**16 section limit, and on Win64, every COMDAT function creates at least 3 sections: .text, .pdata, and .xdata. For MSVC, we enable bigobj on a file-by-file basis, but GCC appears to hit the limit on different files. Fixes PR25953 llvm-svn: 290358	2016-12-22 19:12:14 +00:00
Reid Kleckner	143a937f79	Build KillTheDoctor with mingw-w64 compiler-rt uses it in its lit tests. llvm-svn: 290357	2016-12-22 19:11:42 +00:00
Krzysztof Parzyszek	df24da221e	Fix two bugs in the pipeliner in renaming phis in the prolog and epilog When the pipeliner is renaming phi values, it may need to iterate through the phi operands to check for other phis. However, the pipeliner should stop once it reaches a phi that is outside the pipelined loop. Also, when the generateExistingPhis code is unable to reuse an existing phi, the default code that computes the PhiOp2 is only to be used when the pipeliner is generating the kernel. Otherwise, the phi may be a value computed earlier in the same epilog. Patch by Brendon Cahoon. llvm-svn: 290355	2016-12-22 18:49:55 +00:00
Matt Arsenault	941632839f	AMDGPU: Use i16 for i16 shift amount llvm-svn: 290351	2016-12-22 16:36:25 +00:00
Davide Italiano	e05e3306a3	[NewGVN] Add the pass to PassRegistry.def. We need to hook up here to get it working with the new PM. Add a test while here (and remove a typo). llvm-svn: 290350	2016-12-22 16:35:02 +00:00
Matt Arsenault	3c97e2030a	AMDGPU: Fix missing 16-bit cmpx instructions llvm-svn: 290349	2016-12-22 16:27:14 +00:00
Matt Arsenault	18f56be3d2	AMDGPU: Use i16 comparison instructions llvm-svn: 290348	2016-12-22 16:27:11 +00:00
Matt Arsenault	fef7beb6a6	AMDGPU: Fixed '!NodePtr->isKnownSentinel()' assert Caused by dereferencing end iterator when trying to const cast the iterator. Patch by Martin Sherburn llvm-svn: 290347	2016-12-22 16:06:32 +00:00
Davide Italiano	7e274e02ae	[GVN] Initial check-in of a new global value numbering algorithm. The code have been developed by Daniel Berlin over the years, and the new implementation goal is that of addressing shortcomings of the current GVN infrastructure, i.e. long compile time for large testcases, lack of phi predication, no load/store value numbering etc... The current code just implements the "core" GVN algorithm, although other pieces (load coercion, phi handling, predicate system) are already implemented in a branch out of tree. Once the core is stable, we'll start adding pieces on top of the base framework. The test currently living in test/Transform/NewGVN are a copy of the ones in GVN, with proper `XFAIL` (missing features in NewGVN). A flag will be added in a future commit to enable NewGVN, so that interested parties can exercise this code easily. Differential Revision: https://reviews.llvm.org/D26224 llvm-svn: 290346	2016-12-22 16:03:48 +00:00
Dan Gohman	8b4340a5dd	[WebAssembly] Add an "explicit" keyword to a constructor. llvm-svn: 290345	2016-12-22 16:03:02 +00:00
Dan Gohman	207ed22660	[WebAssembly] Don't use variadic operand indices in the MCOperandInfo array. llvm-svn: 290344	2016-12-22 16:00:55 +00:00
Dan Gohman	728926ac59	[WebAssembly] Don't old negative load/store offsets in fast-isel. WebAssembly's load/store offsets are unsigned and don't wrap, so it's not valid to fold in a negative offset. llvm-svn: 290342	2016-12-22 15:15:10 +00:00
Sam Kolton	a568e3dde7	[AMDGPU] Add pseudo SDWA instructions Summary: This is needed for later SDWA support in CodeGen. Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27412 llvm-svn: 290338	2016-12-22 12:57:41 +00:00
Sam Kolton	a6792a39c4	[AMDGPU] Disassembler: fix for disaasembling v_mac_f32/16_dpp/sdwa Summary: Real instruction should copy constraints from real instruction. This allows auto-generated disassembler to correctly process tied operands. Reviewers: nhaustov, vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27847 llvm-svn: 290336	2016-12-22 11:30:48 +00:00
Ayman Musa	9ff608cdc6	[X86][AVX2] Passing the appropriate memory operand class to VPMADDWD instruction. Replacing the memory operand in the ymm version of VPMADDWD from i128mem to i256mem. Differential Revision: https://reviews.llvm.org/D28024 llvm-svn: 290333	2016-12-22 08:42:46 +00:00
Chandler Carruth	0d1d49507b	[PM] Loosen the check ever so slightly -- MSVC appears to not include a space after the comma in template arguments with our hacky type name system. llvm-svn: 290331	2016-12-22 07:53:20 +00:00
Chandler Carruth	ee6865f425	[PM] Make a couple of CHECK lines a bit more precise, NFC. I was staring at these and didn't realize these were module-layer proxies as opposed to some other layer. Justin and I have a plan to rename things to make the names themselves much easier to reason about, but I at least want the CHECK lines to be precise for now. llvm-svn: 290328	2016-12-22 07:14:35 +00:00
Chandler Carruth	9c36c922d9	[PM] Remove now-dead extern template and explicit instantiation declarations. We're using a custom class here instead of the helper template, these bits just didn't get deleted when the other bits did get deleted. This was found by a really nice MSVC warning about explicitly instantiating a template where some member functions aren't defined and thus can't be instantiatied. llvm-svn: 290327	2016-12-22 07:14:33 +00:00
Chandler Carruth	e3f5064b72	[PM] Introduce a reasonable port of the main per-module pass pipeline from the old pass manager in the new one. I'm not trying to support (initially) the numerous options that are currently available to customize the pass pipeline. If we end up really wanting them, we can add them later, but I suspect many are no longer interesting. The simplicity of omitting them will help a lot as we sort out what the pipeline should look like in the new PM. I've also documented to the best of my ability why each pass or group of passes is used so that reading the pipeline is more helpful. In many cases I think we have some questionable choices of ordering and I've left FIXME comments in place so we know what to come back and revisit going forward. But for now, I've left it as similar to the current pipeline as I could. Lastly, I've had to comment out several places where passes are not ported to the new pass manager or where the loop pass infrastructure is not yet ready. I did at least fix a few bugs in the loop pass infrastructure uncovered by running the full pipeline, but I didn't want to go too far in this patch -- I'll come back and re-enable these as the infrastructure comes online. But I'd like to keep the comments in place because I don't want to lose track of which passes need to be enabled and where they go. One thing that seemed like a significant API improvement was to require that we don't build pipelines for O0. It seems to have no real benefit. I've also switched back to returning pass managers by value as at this API layer it feels much more natural to me for composition. But if others disagree, I'm happy to go back to an output parameter. I'm not 100% happy with the testing strategy currently, but it seems at least OK. I may come back and try to refactor or otherwise improve this in subsequent patches but I wanted to at least get a good starting point in place. Differential Revision: https://reviews.llvm.org/D28042 llvm-svn: 290325	2016-12-22 06:59:15 +00:00
Adrian Prantl	5542da4bbc	Fix an assertion in DwarfExpression when emitting fragments in vector registers When DwarfExpression is emitting a fragment that is located in a register and that fragment is smaller than the register, and the register must be composed from sub-registers (are you still with me?) the last DW_OP_piece operation must not be larger than the size of the fragment itself, since the last piece of the fragment could be smaller than the last subregister that is being emitted. rdar://problem/29779065 llvm-svn: 290324	2016-12-22 06:10:41 +00:00
Adrian Prantl	49797ca6be	Refactor the DIExpression fragment query interface (NFC) ... so it becomes available to DIExpressionCursor. llvm-svn: 290322	2016-12-22 05:27:12 +00:00
Matt Arsenault	485dacd90c	DAG: Add helper for testing constant values There are helpers for testing for constant or constant build_vector, and for splat ConstantFP vectors, but not for a constantfp or non-splat ConstantFP vector. llvm-svn: 290317	2016-12-22 04:39:45 +00:00
Matt Arsenault	3de76b9dc8	AMDGPU: Fix missing commute table entries for cmpx No tests because these aren't currently used anywhere. llvm-svn: 290316	2016-12-22 04:39:41 +00:00
Mehdi Amini	9d3248b765	[ThinLTO] Save 8B per summary entry by rearranging the fields (NFC) Size goes from 72B to 64B per entry. Differential Revision: https://reviews.llvm.org/D27970 llvm-svn: 290314	2016-12-22 04:09:29 +00:00
Matt Arsenault	e7d8ed32f9	AMDGPU: Swap order of operands in fadd/fsub combine FMA is canonicalized to constant in the middle operand. Do the same so fmad matches and avoid an extra combine step. llvm-svn: 290313	2016-12-22 04:03:40 +00:00
Matt Arsenault	46e6b7adef	AMDGPU: Check fast math flags in fadd/fsub combines llvm-svn: 290312	2016-12-22 04:03:35 +00:00
Matt Arsenault	770ec8680a	AMDGPU: Form more FMAs if fusion is allowed Extend the existing fadd/fsub->fmad combines to produce FMA if allowed. llvm-svn: 290311	2016-12-22 03:55:35 +00:00
Matt Arsenault	d8b73d5304	AMDGPU: Move combines into separate functions llvm-svn: 290309	2016-12-22 03:44:42 +00:00
Matt Arsenault	ef82ad94ea	AMDGPU: Enable some f32 fadd/fsub combines for f16 llvm-svn: 290308	2016-12-22 03:40:39 +00:00
Matt Arsenault	9e22bc2cd3	AMDGPU: Implement isFMAFasterThanFMulAndFAdd for f16 llvm-svn: 290307	2016-12-22 03:21:48 +00:00
Matt Arsenault	2920f62423	AMDGPU: setcc test cleanup llvm-svn: 290306	2016-12-22 03:21:45 +00:00
Matt Arsenault	cdff21b14e	AMDGPU: Allow rcp and rsq usage with f16 llvm-svn: 290302	2016-12-22 03:05:44 +00:00
Matt Arsenault	4052a576c0	AMDGPU: Custom lower f16 fdiv llvm-svn: 290301	2016-12-22 03:05:41 +00:00
Matt Arsenault	ce84130f85	AMDGPU: Implement f16 fcanonicalize llvm-svn: 290300	2016-12-22 03:05:37 +00:00
Matt Arsenault	4e55c1ec11	AMDGPU: Update isFPImmLegal for f16 I don't think this matters because ConstantFP is legal. llvm-svn: 290299	2016-12-22 03:05:30 +00:00
Peter Collingbourne	704f814a5e	Clear the PendingTypeTests vector after moving from it. This is to put the vector into a well defined state. Apparently the state of a vector after being moved from is valid but unspecified. Found with clang-tidy. llvm-svn: 290298	2016-12-22 02:52:23 +00:00
Haicheng Wu	9ac20a1e10	[AArch64] Correct the check of signed 9-bit imm in getIndexedAddressParts(). -256 is a legal indexed address part. Differential Revision: https://reviews.llvm.org/D27537 llvm-svn: 290296	2016-12-22 01:39:24 +00:00
Easwaran Raman	180bd9f6b3	Pass GetAssumptionCache to InlineFunctionInfo constructor Differential revision: https://reviews.llvm.org/D28038 llvm-svn: 290295	2016-12-22 01:07:01 +00:00
David Majnemer	5fa7d48bb8	[NVVMIntrRange] Only set range metadata if none is already present The range metadata inserted by NVVMIntrRange is pessimistic, range metadata already present could be more precise. llvm-svn: 290294	2016-12-22 00:51:59 +00:00
Adrian Prantl	1eadba1c8c	Renumber testcase metadata nodes after r290153. This patch renumbers the metadata nodes in debug info testcases after https://reviews.llvm.org/D26769. This is a separate patch because it causes so much churn. This was implemented with a python script that pipes the testcases through llvm-as - \| llvm-dis - and then goes through the original and new output side-by side to insert all comments at a close-enough location. Differential Revision: https://reviews.llvm.org/D27765 llvm-svn: 290292	2016-12-22 00:45:21 +00:00
Adrian Prantl	58c1910642	[LLParser] Make the line field of DIMacro(File) optional. Otherwise these records do not survive roundtrips. llvm-svn: 290291	2016-12-22 00:29:00 +00:00
Adrian Prantl	ec9ebba778	Legalize metadata in legacy testcases llvm-svn: 290288	2016-12-21 23:38:17 +00:00
Adrian Prantl	762e4b72c6	Legalize metadata in legacy testcases llvm-svn: 290287	2016-12-21 23:36:06 +00:00
Adrian Prantl	aad5df484c	Legalize metadata in legacy testcases llvm-svn: 290286	2016-12-21 23:30:35 +00:00
Adrian Prantl	b767f31290	Legalize metadata in legacy testcases llvm-svn: 290285	2016-12-21 23:28:49 +00:00
Ahmed Bougacha	36f7035bd7	[GlobalISel] Add basic Selector-emitter tblgen backend. This adds a basic tablegen backend that analyzes the SelectionDAG patterns to find simple ones that are eligible for GlobalISel-emission. That's similar to FastISel, with one notable difference: we're not fed ISD opcodes, so we need to map the SDNode operators to generic opcodes. That's done using GINodeEquiv in TargetGlobalISel.td. Otherwise, this is mostly boilerplate, and lots of filtering of any kind of "complicated" pattern. On AArch64, this is sufficient to match G_ADD up to s64 (to ADDWrr/ADDXrr) and G_BR (to B). Differential Revision: https://reviews.llvm.org/D26878 llvm-svn: 290284	2016-12-21 23:26:20 +00:00
Ahmed Bougacha	aa9fe53278	[AsmWriter] Remove redundant cast<>s. NFC. llvm-svn: 290283	2016-12-21 23:26:13 +00:00
Dan Gohman	a2b9b349e7	[WebAssembly] Fix the opcode value for i64.rotr. llvm-svn: 290281	2016-12-21 23:09:42 +00:00
Peter Collingbourne	1b4137a7f9	IR: Function summary representation for type tests. Each function summary has an attached list of type identifier GUIDs. The idea is that during the regular LTO phase we would match these GUIDs to type identifiers defined by the regular LTO module and store the resolutions in a top-level "type identifier summary" (which will be implemented separately). Differential Revision: https://reviews.llvm.org/D27967 llvm-svn: 290280	2016-12-21 23:03:45 +00:00
Mike Aizatsky	bfe5045b9c	[sancov] skip duplicated points llvm-svn: 290278	2016-12-21 22:10:01 +00:00
Mike Aizatsky	987f6420ac	[sancov] hash prefix results in huge merge files, use shorter prefix llvm-svn: 290277	2016-12-21 22:09:57 +00:00
Haicheng Wu	6bb0e39321	[AArch64] Remove a redundant check. NFC. The case AM.Scale == 0 is already handled by the code right above. Differential Revision: https://reviews.llvm.org/D28003 llvm-svn: 290275	2016-12-21 21:40:47 +00:00
Greg Clayton	78a07bfa66	Add the ability for DWARFDie objects to get the parent DWARFDie. In order for the llvm DWARF parser to be used in LLDB we will need to be able to get the parent of a DIE. This patch adds that functionality by changing the DWARFDebugInfoEntry class to store a depth field instead of a sibling index. Using a depth field allows us to easily calculate the sibling and the parent without increasing the size of DWARFDebugInfoEntry. I tested llvm-dsymutil on a debug version of clang where this fully parses DWARF in over 1200 .o files to verify there was no serious regression in performance. Added a full suite of unit tests to test this functionality. Differential Revision: https://reviews.llvm.org/D27995 llvm-svn: 290274	2016-12-21 21:37:06 +00:00

1 2 3 4 5 ...

142538 Commits