llvm-project

Commit Graph

Author	SHA1	Message	Date
Peter Collingbourne	1f03422610	ThinLTOBitcodeWriter: Try harder to discard unused references to the merged module. If the thin module has no references to an internal global in the merged module, we need to make sure to preserve that property if the global is a member of a comdat group, as otherwise promotion can end up adding global symbols to the comdat, which is not allowed. This situation can arise if the external global in the thin module has dead constant users, which would cause use_empty() to return false and would cause us to try to promote it. To prevent this from happening, discard the dead constant users before asking whether a global is empty. Differential Revision: https://reviews.llvm.org/D40593 llvm-svn: 319494	2017-11-30 23:05:52 +00:00
Zachary Turner	f0e4c6a819	Simplify the DenseSet used for hashing CodeView records. This was storing the hash alongside the key so that the hash doesn't need to be re-computed every time, but in doing so it was allocating a structure to keep the key size small in the DenseMap. This is a noble goal, but it also leads to a pointer indirection on every probe, and this cost of this pointer indirection ends up being higher than the cost of having a slightly larger entry in the hash table. Removing this not only simplifies the code, but yields a small but noticeable performance improvement in the type merging algorithm. llvm-svn: 319493	2017-11-30 23:00:30 +00:00
Matt Arsenault	84445dd13c	AMDGPU: Use gfx9 carry-less add/sub instructions llvm-svn: 319491	2017-11-30 22:51:26 +00:00
Reid Kleckner	ba4014e9dc	XOR the frame pointer with the stack cookie when protecting the stack Summary: This strengthens the guard and matches MSVC. Reviewers: hans, etienneb Subscribers: hiraditya, JDevlieghere, vlad.tsyrklevich, llvm-commits Differential Revision: https://reviews.llvm.org/D40622 llvm-svn: 319490	2017-11-30 22:41:21 +00:00
Sam Clegg	9138b7b005	Add visibility flag to Wasm symbol flags The LLVM "hidden" flag needs to be passed through the Wasm intermediate objects in order for the linker to apply it to the final Wasm object. The corresponding change in LLD is here: https://github.com/WebAssembly/lld/pull/14 Patch by Nicholas Wilson Differential Revision: https://reviews.llvm.org/D40442 llvm-svn: 319488	2017-11-30 22:34:58 +00:00
Dan Gohman	d6b165341d	[memcpyopt] Commit file missed in r319482. This change was meant to be included with r319482 but was accidentally omitted. llvm-svn: 319483	2017-11-30 22:13:13 +00:00
Dan Gohman	59e4c0b938	[memcpyopt] Teach memcpyopt to optimize across basic blocks This teaches memcpyopt to make a non-local memdep query when a local query indicates that the dependency is non-local. This notably allows it to eliminate many more llvm.memcpy calls in common Rust code, often by 20-30%. Fixes PR28958. Differential Revision: https://reviews.llvm.org/D38374 llvm-svn: 319482	2017-11-30 22:10:53 +00:00
Davide Italiano	9d939c8f19	[InlineCost] Prefer getFunction() to two calls to getParent(). Improves clarity, also slightly cheaper. NFCI. llvm-svn: 319481	2017-11-30 22:10:35 +00:00
Shoaib Meenai	a7ac2cb6fe	[llvm] Add stripped installation targets CMake's generated installation scripts support `CMAKE_INSTALL_DO_STRIP` to enable stripping the installed binaries. LLVM's build system doesn't expose this option to the `install-` targets, but it's useful in conjunction with `install-distribution`. Add a new function to create the install targets, which creates both the regular install target and a second install target that strips during installation. Change the creation of all installation targets to use this new function. Stripping doesn't make a whole lot of sense for some installation targets (e.g. the LLVM headers), but consistency doesn't hurt. I'll make other repositories (e.g. clang, compiler-rt) use this in a follow-up, and then add an `install-distribution-stripped` target to actually accomplish the end goal of creating a stripped distribution. I don't want to do that step yet because the creation of that target would depend on the presence of the `install-*-stripped` target for each distribution component, and the distribution components from other repositories will be missing that target right now. Differential Revision: https://reviews.llvm.org/D40620 llvm-svn: 319480	2017-11-30 21:48:26 +00:00
Krzysztof Parzyszek	d76814200b	[Hexagon] Implement HexagonSubtarget::useAA() llvm-svn: 319477	2017-11-30 21:25:28 +00:00
Krzysztof Parzyszek	2dddd6004f	[Hexagon] Fix wrong check in test/CodeGen/Hexagon/newvaluejump-solo.mir llvm-svn: 319476	2017-11-30 21:23:19 +00:00
Daniel Sanders	0c43b3a023	[globalisel][tablegen] Add support for relative AtomicOrderings No test yet because the relevant rules are blocked on the atomic_load, and atomic_store nodes. llvm-svn: 319475	2017-11-30 21:05:59 +00:00
Krzysztof Parzyszek	8c67461859	[Hexagon] Fix wrong pass in testcase llvm-svn: 319471	2017-11-30 20:39:15 +00:00
Krzysztof Parzyszek	44555225a6	[Hexagon] Solo instructions cannot be used with new value jumps llvm-svn: 319470	2017-11-30 20:32:54 +00:00
Yaxun Liu	1db0f718b5	[AMDGPU] Convert test/tools/llvm-objdump/AMDGPU/source-lines.ll to amdgiz Differential Revision: https://reviews.llvm.org/D40653 llvm-svn: 319469	2017-11-30 20:27:56 +00:00
Craig Topper	d4257565cf	[X86] Promote i8 CTPOP to i32 instead of i16 when we have the POPCNT instruction. The 32-bit version is shorter to encode and the zext we emit for the promotion is likely going to be a 32-bit zero extend anyway. llvm-svn: 319468	2017-11-30 20:15:31 +00:00
Jake Ehrlich	ef3b80c57b	[llvm-objcopy] Add support for --only-keep/-j and --keep This change adds support for the --only-keep option and the -j alias as well. A common use case for these being used together is to dump a specific section's data. Additionally the --keep option is added (GNU objcopy doesn't have this) to avoid removing a bunch of things. This allows people to err on the side of stripping aggressively and then to keep the specific bits that they need for their application. Differential Revision: https://reviews.llvm.org/D39021 llvm-svn: 319467	2017-11-30 20:14:53 +00:00
Daniel Sanders	aef1dfc690	[aarch64][globalisel] Legalize G_ATOMIC_CMPXCHG_WITH_SUCCESS and G_ATOMICRMW_* G_ATOMICRMW_* is generally legal on AArch64. The exception is G_ATOMICRMW_NAND. G_ATOMIC_CMPXCHG_WITH_SUCCESS needs to be lowered to G_ATOMIC_CMPXCHG with an external comparison. Note that IRTranslator doesn't generate these instructions yet. llvm-svn: 319466	2017-11-30 20:11:42 +00:00
Amara Emerson	d78d65c2a4	[GlobalISel][IRTranslator] Fix crash during translation of zero sized loads/stores/args/returns. This fixes PR35358. rdar://35619533 Differential Revision: https://reviews.llvm.org/D40604 llvm-svn: 319465	2017-11-30 20:06:02 +00:00
Xinliang David Li	c23d2c6883	[PGO] Skip counter promotion for infinite loops Differential Revision: http://reviews.llvm.org/D40662 llvm-svn: 319462	2017-11-30 19:16:25 +00:00
Michal Gorny	b661757cd1	[cmake] Include project name in Sphinx doctree dir to fix race conditions Modify add_sphinx_target() to include the project name alongside builder in Sphinx doctree directory. This aims to avoid crashes due to race conditions between multiple Sphinx instances running in parallel that attempt to create or read that directory simultaneously. This problem has originally been addressed in r283188. However, that commit presumed that there will be only one target per builder being run. However, r314863 introduced a second manpage target, reintroducing the race condition. Differential Revision: https://reviews.llvm.org/D40656 llvm-svn: 319461	2017-11-30 19:09:22 +00:00
Daniel Sanders	f499b2bf1f	[globalisel][tablegen] Add support for specific immediates in the match pattern This enables a few rules such as ARM's uxtb instruction. llvm-svn: 319457	2017-11-30 18:48:35 +00:00
Zachary Turner	ca6dbf1440	Split TypeTableBuilder into two classes. llvm-svn: 319456	2017-11-30 18:39:50 +00:00
Zachary Turner	123ef6355f	[llvm-readobj] Fix mismatched line endings llvm-svn: 319453	2017-11-30 18:33:34 +00:00
Dan Gohman	78c19d60a9	[WebAssembly] Revert r319186 "Support bitcasted function addresses with varargs." The patch broke Emscripten's EM_ASM macros, which utiltize unprototyped functions. See https://bugs.llvm.org/show_bug.cgi?id=35385 for details. llvm-svn: 319452	2017-11-30 18:16:49 +00:00
Francis Visoiu Mistrih	c7832045d5	[MIR] Fix DebugInfo tests after r319445 llvm-svn: 319447	2017-11-30 16:48:53 +00:00
Francis Visoiu Mistrih	c71cced0aa	[CodeGen] Always use `printReg` to print registers in both MIR and debug output As part of the unification of the debug format and the MIR format, always use `printReg` to print all kinds of registers. Updated the tests using '_' instead of '%noreg' until we decide which one we want to be the default one. Differential Revision: https://reviews.llvm.org/D40421 llvm-svn: 319445	2017-11-30 16:12:24 +00:00
Igor Laevsky	0cdf7fdc48	[FuzzMutate] Bailout from injecting into empty basic blocks. In rare cases we can receive request to inject into completelly empty basic block. In the normal case all basic blocks contain at least terminator instruction, but it is possible that the only instruction is catchpad instruction which is not part of the instruction iterator. This case seems rare enough to not care about it. Submiting without review, since it seems almost NFC. I couldn't come up with any reasonable way to test this. llvm-svn: 319444	2017-11-30 15:41:58 +00:00
Igor Laevsky	33031926b6	[FuzzMutate] Correctly handle vector types in the insertvalue operation Differential Revision: https://reviews.llvm.org/D40397 llvm-svn: 319442	2017-11-30 15:31:13 +00:00
Igor Laevsky	65902db279	[FuzzMutate] Don't use index operands as sinks Differential Revision: https://reviews.llvm.org/D40396 llvm-svn: 319441	2017-11-30 15:29:16 +00:00
Igor Laevsky	48147d012b	[FuzzMutate] Pick correct index for the insertvalue instruction Differential Revision: https://reviews.llvm.org/D40395 llvm-svn: 319440	2017-11-30 15:26:48 +00:00
Igor Laevsky	faacdf8d54	[FuzzMutate] Don't create load as a new source if it doesn't match with the descriptor Differential Revision: https://reviews.llvm.org/D40394 llvm-svn: 319439	2017-11-30 15:24:41 +00:00
Igor Laevsky	444afc82c0	[FuzzMutate] Don't crash when we can't remove instruction from empty function Differential Revision: https://reviews.llvm.org/D40393 llvm-svn: 319438	2017-11-30 15:07:38 +00:00
Sanjay Patel	7fb231202c	[LangRef] clarify semantics of the frem instruction As noted in D40594, the frem instruction corresponds to fmod() except that it can't set errno. I modified the text that we currently use for intrinsics that map to libm functions and applied it to frem. Differential Revision: https://reviews.llvm.org/D40629 llvm-svn: 319437	2017-11-30 14:59:03 +00:00
Alexey Bataev	d60250dd9b	[InstCombine] Additional test for PR35354, NFC. llvm-svn: 319436	2017-11-30 14:33:58 +00:00
Nemanja Ivanovic	db7e77047c	[PowerPC] Recommit r314244 with refactoring and off by default This re-commits everything that was pulled in r314244. The transformation is off by default (patch to enable it to follow). The code is refactored to have a single entry-point and provide fine-grained control over patterns that it selects. This patch also fixes the bugs in the original code. Everything that failed with the original patch has been re-tested with this patch (with the transformation turned on). So the patch to turn this on is soon to follow. Differential Revision: https://reviews.llvm.org/D38575 llvm-svn: 319434	2017-11-30 13:39:10 +00:00
Simon Pilgrim	bb791b3dbd	[X86][AVX512] Tag fcmp/ptest/ternlog instructions scheduler classes llvm-svn: 319433	2017-11-30 13:18:06 +00:00
Simon Pilgrim	1c7556fb29	[X86][AVX512] Regenerate avx512 schedule tests llvm-svn: 319432	2017-11-30 13:09:21 +00:00
Sean Eveson	a6bcd53d52	[MC] Function stack size section. Re applying after fixing issues in the diff, sorry for any painful conflicts/merges! Original RFC: http://lists.llvm.org/pipermail/llvm-dev/2017-August/117028.html This change adds a '.stack-size' section containing metadata on function stack sizes to output ELF files behind the new -stack-size-section flag. The section contains pairs of function symbol references (8 byte) and stack sizes (unsigned LEB128). The contents of this section can be used to measure changes to stack sizes between different versions of the compiler or a source base. The advantage of having a section is that we can extract this information when examining binaries that we didn't build, and it allows users and tools easy access to that information just by referencing the binary. There is a follow up change to add an option to clang. Thanks. Reviewers: hfinkel, MatzeB Reviewed By: MatzeB Subscribers: thegameg, asb, llvm-commits Differential Revision: https://reviews.llvm.org/D39788 llvm-svn: 319430	2017-11-30 13:05:14 +00:00
Sean Eveson	661e4fbf83	Revert r319423: [MC] Function stack size section. I messed up the diff. llvm-svn: 319429	2017-11-30 12:43:25 +00:00
Diana Picus	f003d9ff95	[ARM GlobalISel] Bail out for byval Fallback if we have a byval parameter or argument since we don't support them yet. llvm-svn: 319428	2017-11-30 12:23:44 +00:00
Francis Visoiu Mistrih	93ef145862	[CodeGen] Print "%vreg0" as "%0" in both MIR and debug output As part of the unification of the debug format and the MIR format, avoid printing "vreg" for virtual registers (which is one of the current MIR possibilities). Basically: * find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" $ -type f -print0 \| xargs -0 sed -i '' -E "s/%vreg([0-9]+)/%\1/g" * grep -nr '%vreg' . and fix if needed * find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" $ -type f -print0 \| xargs -0 sed -i '' -E "s/ vreg([0-9]+)/ %\1/g" * grep -nr 'vreg[0-9]\+' . and fix if needed Differential Revision: https://reviews.llvm.org/D40420 llvm-svn: 319427	2017-11-30 12:12:19 +00:00
Simon Pilgrim	d1a7d0c3f1	[X86][AVX512] Tag binop/rounding/sae instructions scheduler classes llvm-svn: 319424	2017-11-30 12:01:52 +00:00
Sean Eveson	f77b4d2f38	[MC] Function stack size section. Summary: Original RFC: http://lists.llvm.org/pipermail/llvm-dev/2017-August/117028.html I wasn't sure who to put as reviewers, so please add/remove people as appropriate. This change adds a '.stack-size' section containing metadata on function stack sizes to output ELF files behind the new -stack-size-section flag. The section contains pairs of function symbol references (8 byte) and stack sizes (unsigned LEB128). The contents of this section can be used to measure changes to stack sizes between different versions of the compiler or a source base. The advantage of having a section is that we can extract this information when examining binaries that we didn't build, and it allows users and tools easy access to that information just by referencing the binary. There is a follow up change to add an option to clang. Thanks. Reviewers: hfinkel, MatzeB Reviewed By: MatzeB Subscribers: thegameg, asb, llvm-commits Differential Revision: https://reviews.llvm.org/D39788 llvm-svn: 319423	2017-11-30 12:01:16 +00:00
Sam Parker	4bd776e001	[DAGCombine] Refactor ReduceLoadWidth visitAND attempts to narrow the width of extending loads that are then masked off. ReduceLoadWidth already exists for a similar purpose and handles shifts, so I've moved the code to handle AND nodes there. Differential Revision: https://reviews.llvm.org/D39595 llvm-svn: 319421	2017-11-30 11:49:11 +00:00
Serge Guelton	24386867b8	Support generic lowering of vector bswap llvm-svn: 319419	2017-11-30 11:06:22 +00:00
Simon Pilgrim	3e5987cf8d	[X86][AVX512] Tag RCP/RSQRT/GETEXP instructions scheduler classes llvm-svn: 319418	2017-11-30 10:48:47 +00:00
Jonas Devlieghere	1c223018ef	[dsymutil] Exclude namespace from ifdef in CFBundle Should fix build failure introduced by r319416 on non-darwin hosts. llvm-svn: 319417	2017-11-30 10:41:31 +00:00
Jonas Devlieghere	c635376d7c	[dsymutil] Upstream getBundleInfo implementation This patch implements `getBundleInfo`, which uses CoreFoundation to obtain information about the CFBundle. This information is needed to populate the Plist in the dSYM bundle. This change only applies to darwin and is an NFC as far as other platforms are concerned. Differential revision: https://reviews.llvm.org/D40244 llvm-svn: 319416	2017-11-30 10:25:28 +00:00
Hiroshi Inoue	21e8ded4d2	Revert rL319407: [SROA] enable splitting for non-whole-alloca loads and stores This reverts commit rL319407 due to failures in some buildbot. llvm-svn: 319410	2017-11-30 08:29:51 +00:00
Jonas Paulsson	b9a2467501	[SystemZ] Bugfix in adjustSubwordCmp. Csmith generated a program where a store after load to the same address did not get chained after the new load created during DAG legalizing, and so performed an illegal overwrite of the expected value. When the new zero-extending load is created, the chain users of the original load must be updated, which was not done previously. A similar case was also found and handled in lowerBITCAST. Review: Ulrich Weigand https://reviews.llvm.org/D40542 llvm-svn: 319409	2017-11-30 08:18:50 +00:00
Hiroshi Inoue	422e80aee2	[SROA] enable splitting for non-whole-alloca loads and stores Currently, SROA splits loads and stores only when they are accessing the whole alloca. This patch relaxes this limitation to allow splitting a load/store if all other loads and stores to the alloca are disjoint to or fully included in the current load/store. If there is no other load or store that crosses the boundary of the current load/store, the current splitting implementation works as is. The whole-alloca loads and stores meet this new condition and so they are still splittable. Here is a simplified motivating example. struct record { long long a; int b; int c; }; int func(struct record r) { for (int i = 0; i < r.c; i++) r.b++; return r.b; } When updating r.b (or r.c as well), LLVM generates redundant instructions on some platforms (such as x86_64, ppc64); here, r.b and r.c are packed into one 64-bit GPR when the struct is passed as a method argument. With this patch, the above example is compiled into only few instructions without loop. Without the patch, unnecessary loop-carried dependency is introduced by SROA and the loop cannot be eliminated by the later optimizers. Differential Revision: https://reviews.llvm.org/D32998 llvm-svn: 319407	2017-11-30 07:44:46 +00:00
Craig Topper	a495744d2c	[X86] Optimize avx2 vgatherqps for v2f32 with v2i64 index type. Normal type legalization will widen everything. This requires forcing 0s into the mask register. We can instead choose the form that only reads 2 elements without zeroing the mask. llvm-svn: 319406	2017-11-30 07:01:40 +00:00
Craig Topper	321a8b9b63	[X86] Make sure we don't remove sign extends of masks with AVX2 masked gathers. We don't use k-registers and instead use the MSB so we need to make sure we sign extend the mask to the msb. llvm-svn: 319405	2017-11-30 06:31:31 +00:00
Dean Michael Berris	9850276267	[XRay][docs] Update documentation on new default for xray_naive_log= We've recently changed the default for `xray_naive_log=` to be `false` instead of `true` to make it more consistent with the FDR mode logging implementation. This means we will now ask users to explicitly choose which version of the XRay logging is being used. llvm-svn: 319400	2017-11-30 05:35:51 +00:00
Graham Yiu	70293fa27a	- Removed unused lamba (IsReturnBlock) causing build bots to fail for r319398 - Added lit testcases that were supposed to be part of r319398 llvm-svn: 319399	2017-11-30 03:36:57 +00:00
Graham Yiu	8b1882c186	With PGO information, we can do more aggressive outlining of cold regions in the inline candidate function. This contrasts with the scheme of keeping only the 'early return' portion of the inline candidate and outlining the rest of the function as a single function call. Support for outlining multiple regions of each function is added, as well as some basic heuristics to determine which regions are good to outline. Outline candidates limited to regions that are single-entry & single-exit. We also avoid outlining regions that produce live-exit variables, which may inhibit some forms of code motion (like commoning). Fallback to the regular partial inlining scheme is retained when either i) no regions are identified for outlining in the function, or ii) the outlined function could not be inlined in any of its callers. Differential Revision: https://reviews.llvm.org/D38190 llvm-svn: 319398	2017-11-30 02:41:36 +00:00
Kostya Serebryany	954cfd56c7	[libFuzzer] mention one more trophie in the Linux Kernel llvm-svn: 319397	2017-11-30 02:26:47 +00:00
Matt Arsenault	caf0ed4d74	AMDGPU: Allow negative MUBUF vaddr for gfx9 GFX9 does not enable bounds checking for the resource descriptors used for private access, so it should be OK to use vaddr with a potentially negative value. llvm-svn: 319393	2017-11-30 00:52:40 +00:00
Rafael Espindola	a4e418f713	Check alignment in getSectionContentsAsArray. While the ArrayRef can technically have unaligned data, it would be extremely surprising if iterating over it caused undefined behavior when a reference to the underlying type was bound. llvm-svn: 319392	2017-11-30 00:44:22 +00:00
Vedant Kumar	80fbb85555	[Coverage] Use the most-recent completed region count (PR35437) This is a fix for the coverage segment builder. If multiple regions must be popped off the active stack at once, and more than one of them end at the same location, emit a segment using the count from the most-recent completed region. Fixes PR35437, rdar://35760630 Testing: invoked llvm-cov on a stage2 build of clang, additional unit tests, check-profile llvm-svn: 319391	2017-11-30 00:28:23 +00:00
Peter Collingbourne	9e3175bb6b	LowerTypeTests: Deduplicate code. NFC. llvm-svn: 319390	2017-11-30 00:27:08 +00:00
Peter Collingbourne	943aca3c27	LowerTypeTests: Remove unnecessary cast. NFC. llvm-svn: 319387	2017-11-30 00:02:55 +00:00
Craig Topper	56a41d4b3a	[X86] Remove some questionable looking code that seems to be looking through a VZEXT to create a larger VSEXT. If the input the vzext was signed this would do the wrong thing. Not sure how to test this. llvm-svn: 319382	2017-11-29 23:08:25 +00:00
Joerg Sonnenberger	4b1acff9b3	First step towards more human-friendly PPC assembler output: - add -ppc-reg-with-percent-prefix option to use %r3 etc as register names - split off logic for Darwinish verbose conditional codes into a helper function - be explicit about Darwin vs AIX vs GNUish assembler flavors Based on the patch from Alexandre Yukio Yamashita Differential Revision: https://reviews.llvm.org/D39016 llvm-svn: 319381	2017-11-29 23:05:56 +00:00
Sam Clegg	da8d83f911	[WebAssembly] Update test expectations for gcc torture tests I believe these were recently fixed by: https://reviews.llvm.org/rL319186 Differential Revision: https://reviews.llvm.org/D40619 llvm-svn: 319380	2017-11-29 23:05:50 +00:00
Zachary Turner	52d036e693	[CodeView] Factor some code out of TypeTableBuilder. This class had some code that would automatically remap type indices before hashing and serializing. The only caller of this method was the TypeStreamMerger anyway, and the method doesn't make general sense, and prevents making certain future improvements to the class. So, factoring this up one level into the TypeStreamMerger where it belongs. llvm-svn: 319377	2017-11-29 22:41:56 +00:00
Craig Topper	cf461a0a32	[SelectionDAG][X86] Teach promotion legalization for fp_to_sint/fp_to_uint to insert an assertsext/assertzext based on the original type If we put in an assertsext/zext here, we're able to generate better truncate code using pack on pre-avx512 targets. Similar is already done during type legalization. This is the equivalent for op legalization Differential Revision: https://reviews.llvm.org/D40591 llvm-svn: 319368	2017-11-29 22:15:43 +00:00
Dan Gohman	580c102ab8	[WebAssembly] Fix fptoui lowering bounds To fully avoid trapping on wasm, fptoui needs a second check to ensure that the operand isn't below the supported range. llvm-svn: 319354	2017-11-29 20:20:11 +00:00
Sam Clegg	51d90c8c6b	Add libstd++-4.8 exceptions to ubsan_blacklist.txt Differential Revision: https://reviews.llvm.org/D40589 llvm-svn: 319353	2017-11-29 20:10:14 +00:00
Krzysztof Parzyszek	f4dcc42e7b	[Hexagon] Remove HexagonISD::PACKHL llvm-svn: 319352	2017-11-29 19:59:29 +00:00
Krzysztof Parzyszek	6a8e5f4b0f	[Hexagon] Create helpers extractVector and insertVector in lowering llvm-svn: 319351	2017-11-29 19:58:10 +00:00
Simon Pilgrim	4d2c703492	[X86][AVX512] Tag RCP/RSQRT/GETEXP instructions scheduler classes (REVERSION) Accidental commit of incomplete patch llvm-svn: 319346	2017-11-29 19:37:38 +00:00
Zachary Turner	3e3936da93	Make TypeTableBuilder inherit from TypeCollection. A couple of places in LLD were passing references to TypeTableCollections around, which makes it hard to change the implementation at runtime. However, these cases only needed to iterate over the types in the collection, and TypeCollection already provides a handy abstract interface for this purpose. By implementing this interface, we can get rid of the need to pass TypeTableBuilder references around, which should allow us to swap the implementation at runtime in subsequent patches. llvm-svn: 319345	2017-11-29 19:35:21 +00:00
Zachary Turner	85082013e6	Fix line endings in llvm-pdbutil.cpp llvm-svn: 319340	2017-11-29 19:29:25 +00:00
Simon Pilgrim	87034cb498	[X86][AVX512] Tag RCP/RSQRT/GETEXP instructions scheduler classes llvm-svn: 319338	2017-11-29 19:19:59 +00:00
Simon Pilgrim	36be852cee	[X86][AVX512] Tag 3OP (shuffles, double-shifts and GFNI) instructions scheduler classes llvm-svn: 319337	2017-11-29 18:52:20 +00:00
Nirav Dave	bafaa53c4d	[ARM][DAG] Revert Disable post-legalization store merge for ARM Partially reverting enabling of post-legalization store merge (r319036) for just ARM backend as it is causing incorrect code in some Thumb2 cases. llvm-svn: 319331	2017-11-29 18:06:13 +00:00
Greg Bedwell	5764997ff2	[cmake] Replace -Wall with /W4 in clang-cl options now that -Wall aliases -Weverything Instead, reuse the code-path for cl.exe that adds /W4 , which for clang-cl aliases clang's "-Wall -Wextra" which matches what clang-cl's /Wall previously aliased. This should restore the verbosity of a Windows selfhost build back to its previous levels. Differential Revision: https://reviews.llvm.org/D40603 llvm-svn: 319330	2017-11-29 18:05:32 +00:00
Greg Bedwell	34a83f0faf	Make check-lit tests respect LLVM_LIT_TOOLS_DIR Differential Revision: https://reviews.llvm.org/D40520 llvm-svn: 319329	2017-11-29 18:05:26 +00:00
Zaara Syeda	76fe100696	[Power9] add more tests for D38287; NFC llvm-svn: 319328	2017-11-29 17:26:20 +00:00
Sanjay Patel	e0f906c915	[InstCombine] add tests for select-of-constants; NFC These are variants of a test that was originally added in: https://reviews.llvm.org/rL75531 ...but removed with: https://reviews.llvm.org/rL159230 llvm-svn: 319327	2017-11-29 17:21:39 +00:00
Simon Pilgrim	6a00970ade	[X86][AVX512] Add itinerary argument to all AVX512_maskable_* wrappers. NFCI All default to NoItinerary llvm-svn: 319326	2017-11-29 17:21:15 +00:00
Adam Nemet	95e0c5fc6c	Add opt-viewer testing Detects whether we have the Python modules (pygments, yaml) required by opt-viewer and hooks this up to REQUIRES. This fixes https://bugs.llvm.org/show_bug.cgi?id=34129 (the lack of opt-viewer testing). It's also related to https://github.com/apple/swift/pull/12938 and the idea is to expose LLVM_HAVE_OPT_VIEWER_MODULES to the Swift cmake. Differential Revision: https://reviews.llvm.org/D40202 Fixes since the first commit: 1. Disable syntax highlighting as different versions of pygments generate different HTML 2. Use llvm-cxxfilt from the build llvm-svn: 319324	2017-11-29 17:07:41 +00:00
Sander de Smalen	6a3bf1f84a	Reverted r319315 because of unused functions (due to PPR not yet being used by any instructions). llvm-svn: 319321	2017-11-29 15:14:39 +00:00
Simon Pilgrim	1401a75341	[X86][AVX512] Tag VPERMILV instruction scheduler class llvm-svn: 319316	2017-11-29 14:58:34 +00:00
Sander de Smalen	2b6338b2bc	[AArch64][SVE] Asm: Add SVE predicate register definitions and parsing support Summary: Patch [1/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. Reviewers: rengolin, kristof.beyls, fhahn, mcrosier, evandro, echristo, efriedma Reviewed By: fhahn Subscribers: aemerson, javed.absar, llvm-commits, tschuett Differential Revision: https://reviews.llvm.org/D40360 llvm-svn: 319315	2017-11-29 14:34:18 +00:00
Diana Picus	863b5b05f1	[ARM GlobalISel] Fix selecting G_BRCOND When lowering a G_BRCOND, we generate a TSTri of the condition against 1, which sets the flags, and then a Bcc which branches based on the value of the flags. Unfortunately, we were using the wrong condition code to check whether we need to branch (EQ instead of NE), which caused all our branches to do the opposite of what they were intended to do. This patch fixes the issue by using the correct condition code. llvm-svn: 319313	2017-11-29 14:20:06 +00:00
Simon Pilgrim	756348c1c9	[X86][AVX512] Setup unary (PABS/VPLZCNT/VPOPCNT/VPCONFLICT/VMOV*DUP) instruction scheduler classes llvm-svn: 319312	2017-11-29 13:49:51 +00:00
Dmitry Preobrazhensky	1ac7177abb	[AMDGPU][MC][GFX9] Corrected mapping of GFX9 v_add/sub/subrev_u32 When translating pseudo to MC, v_add/sub/subrev_u32 shall be mapped via a separate table as GFX8 has opcodes with the same names. These instructions shall also be labelled as renamed for pseudoToMCOpcode to handle them correctly. Reviewers: arsenm Differential Revision: https://reviews.llvm.org/D40550 llvm-svn: 319311	2017-11-29 13:33:40 +00:00
Simon Pilgrim	e3291de2b8	[X86][SSE] Merged sse2_unpack and sse2_unpack PUNPCK instruction templates. NFCI. llvm-svn: 319310	2017-11-29 12:12:27 +00:00
Simon Pilgrim	da95772230	[X86][SSE] Merged sse2_pack and sse2_pack_y PACKSS/PACKUS instruction templates. NFCI. llvm-svn: 319308	2017-11-29 11:35:45 +00:00
Max Kazantsev	9545a408b6	[SCEV][NFC] Break from loop after we found first non-Phi in getAddRecExprPHILiterally llvm-svn: 319306	2017-11-29 10:54:16 +00:00
Oliver Stannard	9ea2eaeb50	[ARM] Add support for armv7e-m to the .arch directive This will allow compilation of assembly files targeting armv7e-m without having to specify the Tag_CPU_arch attribute as a workaround. Differential revision: https://reviews.llvm.org/D40370 Patch by Ian Tessier! llvm-svn: 319303	2017-11-29 10:12:15 +00:00
Serguei Katkov	d4df744434	[CGP] Enable complex addr mode Enable complex addr modes after two critical fixes: rL319109 and rL319292 llvm-svn: 319302	2017-11-29 09:48:50 +00:00
Jonas Paulsson	66c54414e3	Comment fix in SelectionDAG.h /// Replace any uses of From with To, leaving - /// uses of other values produced by From.Val alone. + /// uses of other values produced by From.getNode() alone. void ReplaceAllUsesOfValueWith(SDValue From, SDValue To); (this is what it says in the .cpp file above this method) llvm-svn: 319301	2017-11-29 09:16:37 +00:00
Craig Topper	e3515001b9	[X86] Remove setOperationAction Promote for ISD::SINT_TO_FP MVT::v8i16/v16i8/v16i16. A DAG combine ensures these ops are always promoted to vXi32. llvm-svn: 319298	2017-11-29 08:19:36 +00:00
Max Kazantsev	1c3b622820	[SCEV][NFC] Remove condition that can never happen due to check few lines above llvm-svn: 319293	2017-11-29 06:10:36 +00:00
Serguei Katkov	5036459ae3	[CGP] Fix common type handling in optimizeMemoryInst If common type is different we should bail out due to we will not be able to create a select or Phi of these values. Basically it is done in ExtAddrMode::compare however it does not work if we handle the null first and then two values of different types. so add a check in initializeMap as well. The check in ExtAddrMode::compare is used as earlier bail out. Reviewers: reames, john.brawn Reviewed By: john.brawn Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40479 llvm-svn: 319292	2017-11-29 05:51:26 +00:00
Sean Fertile	aab3ef76d9	[PowerPC] Relax the checking on AND/AND8 in isSignOrZeroExtended. Separate the handling of AND/AND8 out from PHI/OR/ISEL checking. The reasoning is the others need all their operands to be sign/zero extended for their output to also be sign/zero extended. This is true for AND and sign-extension, but for zero-extension we only need at least one of the input operands to be zero extended for the result to also be zero extended. Differential Revision: https://reviews.llvm.org/D39078 llvm-svn: 319289	2017-11-29 04:09:29 +00:00
Matt Arsenault	9a7e29ae91	AMDGPU: Use stricter regexes for add instructions Match the entire _co as one optional piece rather than a set of characters to match multiple times. llvm-svn: 319275	2017-11-29 02:25:14 +00:00
Bruno Cardoso Lopes	b448206cb6	[Modules] Add textual headers for recently added .def files Keep module.modulemap up to date and get rid of -Wincomplete-umbrella warnings rdar://problem/35711925 llvm-svn: 319273	2017-11-29 01:53:49 +00:00
Matt Arsenault	b655fa9ce2	DAG: Add nuw when splitting loads and stores The object can't straddle the address space wrap around, so I think it's OK to assume any offsets added to the base object pointer can't overflow. Similar logic already appears to be applied in SelectionDAGBuilder when lowering aggregate returns. llvm-svn: 319272	2017-11-29 01:25:12 +00:00
Adrian Prantl	5da51f435a	llvm-dwarfdump: honor the --show-children option when dumping a specific DIE. llvm-svn: 319271	2017-11-29 01:12:22 +00:00
Matt Arsenault	3f71c0e3ee	AMDGPU: Select DS insts without m0 initialization GFX9 stopped using m0 for most DS instructions. Select a different instruction without the use. I think this will be less error prone than trying to manually maintain m0 uses as needed. llvm-svn: 319270	2017-11-29 00:55:57 +00:00
Don Hinton	5fb3ad71a2	Rollback r319176. The ';' separators in LLVM_TARGETS_TO_BUILD disappear when list variables are evaluated in custom commands. llvm-svn: 319268	2017-11-29 00:47:16 +00:00
Craig Topper	fbf7b3bf3e	[X86] Promote fp_to_sint v16f32->v16i16/v16i8 to avoid scalarization. llvm-svn: 319266	2017-11-29 00:32:09 +00:00
Zachary Turner	4c1fa68590	Fix a warning. llvm-svn: 319263	2017-11-29 00:13:44 +00:00
Adam Nemet	90e8c122ee	Revert "Add opt-viewer testing" This reverts commit r319188. Breaks when c++filt is not available. llvm-svn: 319262	2017-11-29 00:10:48 +00:00
Craig Topper	8261c8c066	[X86] Add test cases for fptosi v16f32->v16i8/v16i16 to show scalarization. llvm-svn: 319261	2017-11-29 00:02:22 +00:00
Zachary Turner	29b081dcd1	[NFC] Minor cleanups in CodeView TypeTableBuilder. llvm-svn: 319260	2017-11-28 23:57:13 +00:00
Craig Topper	88ffb5d4d5	[X86] Mark ISD::FP_TO_UINT v16i8/v16i16 as Promote under AVX512 instead of legal. Fix infinite loop in op legalization when promotion requires 2 steps. Previously we had an isel pattern to add the truncate. Instead use Promote to add the truncate to the DAG before isel. The Promote legalization code had to be updated to prevent an infinite loop if promotion took multiple steps because it wasn't remembering the previously tried value. llvm-svn: 319259	2017-11-28 23:56:02 +00:00
Craig Topper	3f749c2d4b	[X86] Regenerate avx512-schedule test. For some reason some sqrt instructions were missing the scheduling comments. llvm-svn: 319258	2017-11-28 23:55:59 +00:00
Matt Arsenault	607a756651	AMDGPU: Enable IPRA llvm-svn: 319256	2017-11-28 23:40:12 +00:00
Simon Pilgrim	b9aa93cb93	[X86] Tag CLFLUSHOPT with same scheduling behaviour as CLFLUSH llvm-svn: 319253	2017-11-28 23:25:42 +00:00
Daniel Sanders	40c5cbfb08	[globalisel][tablegen] Fix PR35375 by sign-extending the table value to match getConstantVRegVal() Summary: From the bug report: > The problem is that it fails when trying to compare -65536 (or 4294901760) to 0xFFFF,0000. This is because the > constant in the instruction is sign extended to 64 bits (0xFFFF,FFFF,FFFF,0000) and then compared to the non > extended 64 bit version expected by TableGen. > > In contrast, the DAGISelEmitter generates special code for AND immediates (OPC_CheckAndImm), which does not > sign extend. This patch doesn't introduce the special case for AND (and OR) immediates since the majority of it is related to handling known bits that have no effect on the result and GlobalISel doesn't detect known-bits at this time. Instead this patch just ensures that the immediate is extended consistently on both sides of the check. Thanks to Diana Picus for the detailed bug report. Reviewers: rovka Reviewed By: rovka Subscribers: kristof.beyls, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D40532 llvm-svn: 319252	2017-11-28 23:18:54 +00:00
Simon Pilgrim	a675071b7a	[X86] Add CLFLUSHOPT schedule tests llvm-svn: 319250	2017-11-28 23:12:12 +00:00
Simon Pilgrim	f490c6efee	[X86][SSE] Add SSE_SHUFP OpndItins Update multi-classes to take the scheduling OpndItins instead of hard coding it. Will be reused in the AVX512 equivalents. llvm-svn: 319249	2017-11-28 23:09:18 +00:00
Simon Pilgrim	c6c2103e1b	[X86] Test clflushopt intrinsic on 32 and 64-bit targets llvm-svn: 319247	2017-11-28 23:04:42 +00:00
Simon Pilgrim	8f62394751	[X86][SSE] Add SSE_UNPCK/SSE_PUNPCK OpndItins Update multi-classes to take the scheduling OpndItins instead of hard coding it. Will be reused in the AVX512 equivalents. llvm-svn: 319245	2017-11-28 22:55:08 +00:00
Simon Pilgrim	1bc7b0e148	[X86][SSE] Use SSE_PACK OpndItins in PACKSS/PACKUS instruction definitions Update multi-classes to take the scheduling OpndItins instead of hard coding it. SSE_PACK will be reused in the AVX512 equivalents. llvm-svn: 319243	2017-11-28 22:47:45 +00:00
Adam Nemet	80fb55625b	Remove this test After r319235, we no longer generate this remark. llvm-svn: 319242	2017-11-28 22:39:38 +00:00
Simon Pilgrim	14d3fd29f8	Fix VS2017 narrowing conversion warning. NFCI llvm-svn: 319240	2017-11-28 22:32:43 +00:00
Craig Topper	ab9bfc904b	[X86] Remove unused variable. llvm-svn: 319239	2017-11-28 22:28:23 +00:00
Adam Nemet	2e92289014	Demote this opt remark to DEBUG. From a random opt-stat output: Top 10 remarks: tailcallelim/tailcall 53% inline/AlwaysInline 13% gvn/LoadClobbered 13% inline/Inlined 8% inline/TooCostly 2% inline/NoDefinition 2% licm/LoadWithLoopInvariantAddressInvalidated 2% licm/Hoisted 1% asm-printer/InstructionCount 1% prologepilog/StackSize 1% llvm-svn: 319235	2017-11-28 22:11:00 +00:00
Craig Topper	a27f1e675a	[X86] Remove code from combineUIntToFP that tried to favor UINT_TO_FP if legal when zero extending from vXi8/vX816. The UINT_TO_FP is immediately converted to SINT_TO_FP when the node is re-evaluated because we'll detect that the sign bit is zero. llvm-svn: 319234	2017-11-28 22:08:51 +00:00
Craig Topper	3aaa71f222	[X86] Remove custom lowering for uint_to_fp from vXi8/vXi16. We have a DAG combine that uses a zero extend that should prevent this from ever occurring now. llvm-svn: 319233	2017-11-28 22:08:48 +00:00
Daniel Sanders	766646517f	[globalisel][tablegen] Add support for importing G_ATOMIC_CMPXCHG, G_ATOMICRMW_* rules from SelectionDAG. GIM_CheckNonAtomic has been replaced by GIM_CheckAtomicOrdering to allow it to support a wider range of orderings. This has then been used to import patterns using nodes such as atomic_cmp_swap, atomic_swap, and atomic_load_*. llvm-svn: 319232	2017-11-28 22:07:05 +00:00
Adrian Prantl	77d90b0c39	SROA: Don't create variable fragments that are outside of the variable. An alloca may be larger than a variable that is described to be stored there. Don't create a dbg.value for fragments that are outside of the variable. This fixes PR35447. https://bugs.llvm.org/show_bug.cgi?id=35447 llvm-svn: 319230	2017-11-28 21:30:38 +00:00
Don Hinton	f5aab5454e	[cmake] Pass LLVM_USE_LINKER flag when building host tools, e.g., LLVM_OPTIMIZED_TABLEGEN=ON, and not crosscompiling. Differential Revision: https://reviews.llvm.org/D39734 llvm-svn: 319228	2017-11-28 21:23:30 +00:00
Alexey Bataev	ab5f3f2b33	[SLP] Additional test for PR35354, NFC. llvm-svn: 319224	2017-11-28 20:48:24 +00:00
Mandeep Singh Grang	e0173664e9	[Hexagon] Use stable sort for HexagonShuffler to remove non-deterministic ordering Summary: This fixes failures in the following tests uncovered by D39245: LLVM :: CodeGen/Hexagon/args.ll LLVM :: CodeGen/Hexagon/constp-extract.ll LLVM :: CodeGen/Hexagon/expand-condsets-basic.ll LLVM :: CodeGen/Hexagon/gp-rel.ll LLVM :: CodeGen/Hexagon/packetize_cond_inst.ll LLVM :: CodeGen/Hexagon/simple_addend.ll LLVM :: CodeGen/Hexagon/swp-stages4.ll LLVM :: CodeGen/Hexagon/swp-vmult.ll LLVM :: CodeGen/Hexagon/swp-vsum.ll LLVM :: MC/Hexagon/align.s LLVM :: MC/Hexagon/asmMap.s LLVM :: MC/Hexagon/dis-duplex-p0.s LLVM :: MC/Hexagon/double-vector-producer.s LLVM :: MC/Hexagon/inst_select.ll LLVM :: MC/Hexagon/instructions/j.s Reviewers: colinl, kparzysz, adasgupt, slarin Reviewed By: kparzysz Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40227 llvm-svn: 319223	2017-11-28 20:48:10 +00:00
Daniel Sanders	7b361b50d8	[aarch64][globalisel] Add missing tests from r319216 llvm-svn: 319220	2017-11-28 20:27:59 +00:00
Sean Fertile	e200016ea9	[PowerPC] Allow tail calls of fastcc functions from C CallingConv functions. Allow fastcc callees to be tail-called from ccc callers. Differential Revision: https://reviews.llvm.org/D40355 llvm-svn: 319218	2017-11-28 20:25:58 +00:00
Daniel Sanders	7fe7acc6b1	[aarch64][globalisel] Define G_ATOMIC_CMPXCHG and G_ATOMICRMW_* and make them legal The IRTranslator cannot generate these instructions at the moment so there's no issue with not having implemented ISel for them yet. D40092 will add G_ATOMIC_CMPXCHG_WITH_SUCCESS and G_ATOMICRMW_* to the IRTranslator and a further patch will add support for lowering G_ATOMIC_CMPXCHG_WITH_SUCCESS into G_ATOMIC_CMPXCHG with an external success check via the `Lower` action. The separation of G_ATOMIC_CMPXCHG_WITH_SUCCESS and G_ATOMIC_CMPXCHG is to import SelectionDAG rules while still supporting targets that prefer to custom lower the original LLVM-IR-like operation. llvm-svn: 319216	2017-11-28 20:21:15 +00:00
Mandeep Singh Grang	230b0a1477	[SelectionDAG] Make sorting predicate stronger to remove non-deterministic ordering Summary: Recommitting this with the correct sorting predicate. The Low field of Clusters is a ConstantInt and cannot be directly compared. So we needed to invoke slt (signed less than) to compare correctly. This fixes failures in the following tests uncovered by D39245: LLVM :: CodeGen/ARM/ifcvt3.ll LLVM :: CodeGen/ARM/switch-minsize.ll LLVM :: CodeGen/X86/switch.ll LLVM :: CodeGen/X86/switch-bt.ll LLVM :: CodeGen/X86/switch-density.ll Reviewers: hans, fhahn Reviewed By: hans Subscribers: aemerson, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D40541 llvm-svn: 319210	2017-11-28 19:55:54 +00:00
Simon Pilgrim	d49bd0cd87	[X86][SSE] Add SSE_HADDSUB/SSE_PABS/SSE_PALIGN OpndItins Update multi-classes to take the scheduling OpndItins instead of hard coding it. Will be reused in the AVX512 equivalents. llvm-svn: 319209	2017-11-28 19:39:47 +00:00
Craig Topper	dd4295626b	[X86] In lowerVectorShuffleAsElementInsertion, if were able to find a scalar i8 or i16 and need to zero extend it, make sure we use a vXi32 type of the full vector width. Previously, this was hardcoded to v4i32, but if the input type is 256 bits we need to use v8i32. Fixes PR35443 llvm-svn: 319208	2017-11-28 19:25:45 +00:00
Francis Visoiu Mistrih	3aa8eaa951	[CodeGen] Fix doxygen \file comment style llvm-svn: 319207	2017-11-28 19:23:39 +00:00
Francis Visoiu Mistrih	d4b340b460	[CodeGen] Fix doxygen llvm-svn: 319206	2017-11-28 19:15:46 +00:00
Sanjay Patel	1a72f67006	[InstCombine] auto-generate complete test checks; NFC llvm-svn: 319205	2017-11-28 19:13:23 +00:00
Krzysztof Parzyszek	081e458e90	[Hexagon] Make sure to zero-extend bytes before building a vector llvm-svn: 319204	2017-11-28 19:13:17 +00:00
Sanjay Patel	b1a97d3774	[InstCombine] auto-generate complete test checks; NFC llvm-svn: 319203	2017-11-28 19:07:28 +00:00
Daniel Sanders	17d277b734	[mir] Print/Parse both MOLoad and MOStore when they occur together. Summary: They're not always mutually exclusive. read-modify-write atomics are both at the same time. One example of this is the SWP instructions on AArch64. Another example is GlobalISel's G_ATOMICRMW_* generic instructions which will be added in a later patch. Reviewers: arphaman, aemerson Reviewed By: aemerson Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D40157 llvm-svn: 319202	2017-11-28 18:57:02 +00:00
Rafael Espindola	bba7f862d8	Fix non assert build warnings. llvm-svn: 319200	2017-11-28 18:50:08 +00:00
Hans Wennborg	ca46db957d	EntryExitInstrumenter: set DebugLocs on the inserted call instructions (PR35412) Apparently the verifier requires that inlineable calls in a function with debug info have debug locations. llvm-svn: 319199	2017-11-28 18:44:26 +00:00
Zachary Turner	6900de1dfb	[CodeView] Refactor / Rewrite TypeSerializer and TypeTableBuilder. The motivation behind this patch is that future directions require us to be able to compute the hash value of records independently of actually using them for de-duplication. The current structure of TypeSerializer / TypeTableBuilder being a single entry point that takes an unserialized type record, and then hashes and de-duplicates it is not flexible enough to allow this. At the same time, the existing TypeSerializer is already extremely complex for this very reason -- it tries to be too many things. In addition to serializing, hashing, and de-duplicating, ti also supports splitting up field list records and adding continuations. All of this functionality crammed into this one class makes it very complicated to work with and hard to maintain. To solve all of these problems, I've re-written everything from scratch and split the functionality into separate pieces that can easily be reused. The end result is that one class TypeSerializer is turned into 3 new classes SimpleTypeSerializer, ContinuationRecordBuilder, and TypeTableBuilder, each of which in isolation is simple and straightforward. A quick summary of these new classes and their responsibilities are: - SimpleTypeSerializer : Turns a non-FieldList leaf type into a series of bytes. Does not do any hashing. Every time you call it, it will re-serialize and return bytes again. The same instance can be re-used over and over to avoid re-allocations, and in exchange for this optimization the bytes returned by the serializer only live until the caller attempts to serialize a new record. - ContinuationRecordBuilder : Turns a FieldList-like record into a series of fragments. Does not do any hashing. Like SimpleTypeSerializer, returns references to privately owned bytes, so the storage is invalidated as soon as the caller tries to re-use the instance. Works equally well for LF_FIELDLIST as it does for LF_METHODLIST, solving a long-standing theoretical limitation of the previous implementation. - TypeTableBuilder : Accepts sequences of bytes that the user has already serialized, and inserts them by de-duplicating with a hash table. For the sake of convenience and efficiency, this class internally stores a SimpleTypeSerializer so that it can accept unserialized records. The same is not true of ContinuationRecordBuilder. The user is required to create their own instance of ContinuationRecordBuilder. Differential Revision: https://reviews.llvm.org/D40518 llvm-svn: 319198	2017-11-28 18:33:17 +00:00
Simon Pilgrim	4fecbd8871	[X86][X87] Tag FP_TO_INT_IN_MEM pseudos with hasNoSchedulingInfo We don't need scheduling info for pseudos llvm-svn: 319197	2017-11-28 18:10:29 +00:00
Francis Visoiu Mistrih	aa739695a4	[CodeGen] Separate MachineOperand implementation from MachineInstr Move the implementation to its own file. Differential Revision: https://reviews.llvm.org/D40419 llvm-svn: 319194	2017-11-28 17:58:43 +00:00
Francis Visoiu Mistrih	946e394e33	[CodeGen] Cleanup MachineOperand * clang-format * move doxygen from the implementation to headers * remove duplicate doxygen llvm-svn: 319193	2017-11-28 17:58:38 +00:00
Konstantin Zhuravlyov	06ae4ec78e	AMDGPU: Add num spilled s/vgprs to metadata This was requested by tools. Differential Revision: https://reviews.llvm.org/D40321 llvm-svn: 319192	2017-11-28 17:51:08 +00:00
Adam Nemet	353f7cbc21	Add opt-viewer testing Detects whether we have the Python modules (pygments, yaml) required by opt-viewer and hooks this up to REQUIRES. This fixes https://bugs.llvm.org/show_bug.cgi?id=34129 (the lack of opt-viewer testing). It's also related to https://github.com/apple/swift/pull/12938 and the idea is to expose LLVM_HAVE_OPT_VIEWER_MODULES to the Swift cmake. Differential Revision: https://reviews.llvm.org/D40202 llvm-svn: 319188	2017-11-28 17:26:28 +00:00
Francis Visoiu Mistrih	9d7bb0cb40	[CodeGen] Print register names in lowercase in both MIR and debug output As part of the unification of the debug format and the MIR format, always print registers as lowercase. * Only debug printing is affected. It now follows MIR. Differential Revision: https://reviews.llvm.org/D40417 llvm-svn: 319187	2017-11-28 17:15:09 +00:00
Dan Gohman	2803bfaf00	[WebAssembly] Support bitcasted function addresses with varargs. Generalize FixFunctionBitcasts to handle varargs functions. This in particular fixes the case where clang bitcasts away a varargs when calling a K&R-style function. This avoids interacting with tricky ABI details because it operates at the LLVM IR level before varargs ABI details are exposed. This fixes PR35385. llvm-svn: 319186	2017-11-28 17:15:03 +00:00
Matt Arsenault	e123aba94e	DAG: Legalize truncstores to illegal int types Truncate to a legal int type, and produce a new truncstore from a narrower type. llvm-svn: 319185	2017-11-28 17:11:30 +00:00
Simon Pilgrim	ece5bc358a	[X86][X87] Tag FTST x87 instruction scheduler class Looking through Agner, FTST is very similar to generic float compare behaviour, so I've added them to the existing IIC_FCOMI (WriteFAdd) tags. llvm-svn: 319184	2017-11-28 16:57:20 +00:00
Sanjay Patel	14230e02ff	[InstCombine] add tests from D39421 to show current transforms; NFC llvm-svn: 319182	2017-11-28 16:40:30 +00:00
Francis Visoiu Mistrih	14bd3b9f21	[Support] Add unit test for printLowerCase Add test case for the function added in r319171. llvm-svn: 319177	2017-11-28 16:11:56 +00:00
Don Hinton	17fdf32cc1	[cmake] Remove redundant call to cmake when building host tools. Summary: Remove the redundant, config-time call to cmake when building host tools for cross compiles or optimized tablegen.. The config-time call to cmake is redundant because it will always get called again when the CONFIGURE_LLVM_${target_name} target fires at build-time. This speeds up initial configuration, but has no affect on build behavior. Reviewers: beanz Reviewed By: beanz Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D40229 llvm-svn: 319176	2017-11-28 16:08:57 +00:00
Simon Pilgrim	0747a7e8c3	[X86][X87] Tag FABS/FCHS/FSQRT/FSIN/FCOS x87 instruction scheduler classes Atom's FABS/FCHS/FSQRT latencies taken from Agner. Note: I just added FSIN and FCOS to the existing IIC_FSINCOS itinerary, which is actually a more costly instruction. llvm-svn: 319175	2017-11-28 15:03:42 +00:00
Jonas Paulsson	f0ff20f1f0	Use getStoreSize() in various places instead of 'BitSize >> 3'. This is needed for cases when the memory access is not as big as the width of the data type. For instance, storing i1 (1 bit) would be done in a byte (8 bits). Using 'BitSize >> 3' (or '/ 8') would e.g. give the memory access of an i1 a size of 0, which for instance makes alias analysis return NoAlias even when it shouldn't. There are no tests as this was done as a follow-up to the bugfix for the case where this was discovered (r318824). This handles more similar cases. Review: Björn Petterson https://reviews.llvm.org/D40339 llvm-svn: 319173	2017-11-28 14:44:32 +00:00
Simon Pilgrim	b843dc26e4	[X86][X86] Add some x87 schedule tests Still missing some instructions: mainly loads/stores/system ops, all flagged as TODO. llvm-svn: 319172	2017-11-28 14:35:52 +00:00
Francis Visoiu Mistrih	26d6fc1f0e	[Support] Merge toLower / toUpper implementations Merge the ones from StringRef and StringExtras. llvm-svn: 319171	2017-11-28 14:22:27 +00:00
Francis Visoiu Mistrih	9d419d3b0c	[CodeGen] Rename functions PrintReg* to printReg* LLVM Coding Standards: Function names should be verb phrases (as they represent actions), and command-like function should be imperative. The name should be camel case, and start with a lower case letter (e.g. openFile() or isFoo()). Differential Revision: https://reviews.llvm.org/D40416 llvm-svn: 319168	2017-11-28 12:42:37 +00:00
Simon Pilgrim	8dc603b031	[X86][3DNow] Add instruction itinerary and scheduling classes for femms/prefetch/prefetchw llvm-svn: 319167	2017-11-28 12:37:35 +00:00
Peter Smith	a939257a42	[ARM][AArch64] Workaround ARM/AArch64 peculiarity in clearing icache. Certain ARM implementations treat icache clear instruction as a memory read, and CPU segfaults on trying to clear cache on !PROT_READ page. We workaround this in Memory::protectMappedMemory by adding PROT_READ to affected pages, clearing the cache, and then setting desired protection. This fixes "AllocationTests/MappedMemoryTest.***/3" unit-tests on affected hardware. Reviewers: psmith, zatrazz, kristof.beyls, lhames Reviewed By: lhames Subscribers: llvm-commits, krytarowski, peter.smith, jgreenhalgh, aemerson, rengolin Patch by maxim-kuvrykov! Differential Revision: https://reviews.llvm.org/D40423 llvm-svn: 319166	2017-11-28 12:34:05 +00:00
Chandler Carruth	c34f789e38	Add a new pass to speculate around PHI nodes with constant (integer) operands when profitable. The core idea is to (re-)introduce some redundancies where their cost is hidden by the cost of materializing immediates for constant operands of PHI nodes. When the cost of the redundancies is covered by this, avoiding materializing the immediate has numerous benefits: 1) Less register pressure 2) Potential for further folding / combining 3) Potential for more efficient instructions due to immediate operand As a motivating example, consider the remarkably different cost on x86 of a SHL instruction with an immediate operand versus a register operand. This pattern turns up surprisingly frequently, but is somewhat rarely obvious as a significant performance problem. The pass is entirely target independent, but it does rely on the target cost model in TTI to decide when to speculate things around the PHI node. I've included x86-focused tests, but any target that sets up its immediate cost model should benefit from this pass. There is probably more that can be done in this space, but the pass as-is is enough to get some important performance on our internal benchmarks, and should be generally performance neutral, but help with more extensive benchmarking is always welcome. One awkward part is that this pass has to be scheduled after everything that can eliminate these kinds of redundancies. This includes SimplifyCFG, GVN, etc. I'm open to suggestions about better places to put this. We could in theory make it part of the codegen pass pipeline, but there doesn't really seem to be a good reason for that -- it isn't "lowering" in any sense and only relies on pretty standard cost model based TTI queries, so it seems to fit well with the "optimization" pipeline model. Still, further thoughts on the pipeline position are welcome. I've also only implemented this in the new pass manager. If folks are very interested, I can try to add it to the old PM as well, but I didn't really see much point (my use case is already switched over to the new PM). I've tested this pretty heavily without issue. A wide range of benchmarks internally show no change outside the noise, and I don't see any significant changes in SPEC either. However, the size class computation in tcmalloc is substantially improved by this, which turns into a 2% to 4% win on the hottest path through tcmalloc for us, so there are definitely important cases where this is going to make a substantial difference. Differential revision: https://reviews.llvm.org/D37467 llvm-svn: 319164	2017-11-28 11:32:31 +00:00
Florian Hahn	25ea91a838	[TailRecursionElimination] Skip debug intrinsics. Summary: I think we do not need to analyze debug intrinsics here, as they should not impact codegen. This has 2 benefits: 1) slightly less work to do and 2) avoiding generating optimization remarks for converting calls to debug intrinsics to tail calls, which are not really helpful for users. Based on work by Sander de Smalen. Reviewers: davide, trentxintong, aprantl Reviewed By: aprantl Subscribers: llvm-commits, JDevlieghere Tags: #debug-info Differential Revision: https://reviews.llvm.org/D40440 llvm-svn: 319158	2017-11-28 09:32:25 +00:00
Nicolai Haehnle	b4f28deda0	AMDGPU: Re-organize the outer loop of SILoadStoreOptimizer Summary: The entire algorithm operates per basic-block, so for cache locality it should be better to re-optimize a basic-block immediately rather than in a separate loop. I don't have performance measurements. Change-Id: I85106570bd623c4ff277faaa50ee43258e1ddcc5 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D40344 llvm-svn: 319156	2017-11-28 08:42:46 +00:00
Nicolai Haehnle	39980dac0b	AMDGPU: Consistently check for immediates in SIInstrInfo::FoldImmediate Summary: The PeepholeOptimizer pass calls this function solely based on checking DefMI->isMoveImmediate(), which only checks the MoveImm bit of the instruction description. So it's up to FoldImmediate itself to properly check that DefMI actually moves from an immediate. I don't have a separate test case for this, but the next patch introduces a test case which happens to crash without this change. This error is caught by the assertion in MachineOperand::getImm(). Change-Id: I88e7cdbcf54d75e1a296822e6fe5f9a5f095bbf8 Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D40342 llvm-svn: 319155	2017-11-28 08:41:50 +00:00
Max Kazantsev	6e78ad35cc	[SCEV][NFC] More efficient caching in CompareValueComplexity Currently, we use a set of pairs to cache responces like `CompareValueComplexity(X, Y) == 0`. If we had proved that `CompareValueComplexity(S1, S2) == 0` and `CompareValueComplexity(S2, S3) == 0`, this cache does not allow us to prove that `CompareValueComplexity(S1, S3)` is also `0`. This patch replaces this set with `EquivalenceClasses` that merges Values into equivalence sets so that any two values from the same set are equal from point of `CompareValueComplexity`. This, in particular, allows us to prove the fact from example above. Differential Revision: https://reviews.llvm.org/D40429 llvm-svn: 319153	2017-11-28 08:26:43 +00:00
Martin Storsjo	04b68446eb	[COFF] Implement constructor priorities The priorities in the section name suffixes are zero padded, allowing the linker to just do a lexical sort. Add zero padding for .ctors sections in ELF as well. Differential Revision: https://reviews.llvm.org/D40407 llvm-svn: 319150	2017-11-28 08:07:18 +00:00
Max Kazantsev	cf9b1b24ce	[SCEV][NFC] More efficient caching in CompareSCEVComplexity Currently, we use a set of pairs to cache responces like `CompareSCEVComplexity(X, Y) == 0`. If we had proved that `CompareSCEVComplexity(S1, S2) == 0` and `CompareSCEVComplexity(S2, S3) == 0`, this cache does not allow us to prove that `CompareSCEVComplexity(S1, S3)` is also `0`. This patch replaces this set with `EquivalenceClasses` any two values from the same set are equal from point of `CompareSCEVComplexity`. This, in particular, allows us to prove the fact from example above. Differential Revision: https://reviews.llvm.org/D40428 llvm-svn: 319149	2017-11-28 07:48:12 +00:00
Max Kazantsev	115607226a	[GVN] Prevent ScalarPRE from hoisting across instructions that don't pass control flow to successors This is to address a problem similar to those in D37460 for Scalar PRE. We should not PRE across an instruction that may not pass execution to its successor unless it is safe to speculatively execute it. Differential Revision: https://reviews.llvm.org/D38619 llvm-svn: 319147	2017-11-28 07:07:55 +00:00
Adam Nemet	bf74f64e67	Revert "Add opt-viewer testing" This reverts commit r319073. Bot fails with a mismatch that looks like pygments-generated HTML. llvm-svn: 319146	2017-11-28 06:22:29 +00:00
Dan Gohman	3ff73cfbcd	[WebAssembly] Handle errors better in fast-isel. Fast-isel routines need to bail out in the case that fast-isel fails on the operands. This fixes https://bugs.llvm.org/show_bug.cgi?id=35064 llvm-svn: 319144	2017-11-28 05:36:42 +00:00
Craig Topper	640a3c1e2a	[X86] Remove some unused pattern fragments from td file. NFC llvm-svn: 319143	2017-11-28 05:23:57 +00:00
Simon Dardis	3aeb1a5404	[DAGCombine] Disable finding better chains for stores at O0 Unoptimized IR can have linear sequences of stores to an array, where the initial GEP for the first store is formed from the pointer to the array, and the GEP for each store after the first is formed from the previous GEP with some offset in an inductive fashion. The (large) resulting DAG when analyzed by DAGCombine undergoes an excessive number of combines as each store node is examined every time its' offset node is combined with any child of the offset. One of the transformations is findBetterNeighborChains which assists MergeConsecutiveStores. The former relies on repeated chain walking to do its' work, however MergeConsecutiveStores is disabled at O0 which makes the transformation redundant. Any optimization level other than O0 would invoke InstCombine which would resolve the chain of GEPs into flat base + offset GEP for each store which does not exhibit the repeated examination of each store to the array. Disabling this optimization fixes an excessive compile time issue (30~ minutes for the test case provided) at O0. Reviewers: niravd, craig.topper, t.p.northover Differential Revision: https://reviews.llvm.org/D40193 llvm-svn: 319142	2017-11-28 04:07:59 +00:00
Matthias Braun	eca985847c	MachineVerifier: Improve register operand checks This fixes cases where we wouldn't perform various register operand checks just because we didn't happen to have a definition in the MCInstrDesc. This changes the code to only skip the tests that actually depend on the MCInstrDesc definition. This makes the machine verifier spot the problem from https://llvm.org/PR33071 after the pass that actually caused it. llvm-svn: 319141	2017-11-28 03:54:20 +00:00
Matthias Braun	a6d5374ee6	MachineVerifier: Improve PHI operand checking Additional checks for phi operands: - first operand should be a virtual register def. It should not be tied, implicit, internalread, earlyclobber or a read. - The other operands should be register/mbb operands next to each other - The register operands should not be implicit, internalread, earlyclobber, debug or tied. - We can perform most of the PHI checks even for unreachable blocks. llvm-svn: 319140	2017-11-28 03:54:19 +00:00
Matthias Braun	adf7582d14	lit: Bring back -Dtool=xxx feature lost in r313928 llvm-svn: 319139	2017-11-28 03:23:07 +00:00
Rafael Espindola	3ecd20430c	Use FILE_FLAG_DELETE_ON_CLOSE for TempFile on windows. We won't see the temp file no more. llvm-svn: 319137	2017-11-28 01:41:22 +00:00
Craig Topper	ddbc340c20	[X86] Make zero extend from v16i1/v8i1 to v16i8/v8i16/v16i16 not scalarize under AVX512. llvm-svn: 319136	2017-11-28 01:36:33 +00:00
Craig Topper	5befc5bfce	[X86] Add command line without AVX512BW/AVX512VL to bitcast-int-to-vector-bool-zext.ll. llvm-svn: 319135	2017-11-28 01:36:31 +00:00
Rafael Espindola	2c4e920f0c	Move code. NFC. This moves the TempFile implementation so that it can use system specific code. llvm-svn: 319134	2017-11-28 01:34:20 +00:00
Peter Collingbourne	1621c20ffc	Reland r319090, "COFF: Do not create SectionChunks for discarded comdat sections." with a fix for debug sections. If /debug was not specified, readSection will return a null pointer for debug sections. If the debug section is associative with another section, we need to make sure that the section returned from readSection is not a null pointer before adding it as an associative section. Differential Revision: https://reviews.llvm.org/D40533 llvm-svn: 319133	2017-11-28 01:30:07 +00:00
Rafael Espindola	c06f55e1e8	This reverts commit r319096 and r319097. Revert "[SROA] Propagate !range metadata when moving loads." Revert "[Mem2Reg] Clang-format unformatted parts of this file. NFCI." Davide says they broke a bot. llvm-svn: 319131	2017-11-28 01:25:38 +00:00
Matthias Braun	5d01e708e1	ARM: Fix PR32578 https://llvm.org/PR32578 I simplified and converted the reproducer into a lit test. Patch by Vedant Kumar! llvm-svn: 319130	2017-11-28 01:17:52 +00:00
Dan Gohman	cdd48b8a6b	[WebAssembly] Fix trapping behavior in fptosi/fptoui. This adds code to protect WebAssembly's `trunc_s` family of opcodes from values outside their domain. Even though such conversions have full undefined behavior in C/C++, LLVM IR's `fptosi` and `fptoui` do not, and only return undef. This also implements the proposed non-trapping float-to-int conversion feature and uses that instead when available. llvm-svn: 319128	2017-11-28 01:13:40 +00:00
Adrian Prantl	d7f6f1636d	SROA: Avoid creating a fragment expression that covers the entire variable. Fixes PR35416. https://bugs.llvm.org/show_bug.cgi?id=35416 llvm-svn: 319126	2017-11-28 00:57:53 +00:00
Adrian Prantl	3e0e1d0934	Move getVariableSize from Verifier.cpp into DIVariable::getSize() (NFC) llvm-svn: 319125	2017-11-28 00:57:51 +00:00
Craig Topper	8b9cd03824	[X86] Remove unnecessary fp<->int setOperationAction lines from a hasVLX block. NFCI These lines all exist identically either under SSE2, AVX2 or AVX512. Given that VLX implies all of those, these aren't providing anything new. llvm-svn: 319124	2017-11-28 00:41:12 +00:00
Craig Topper	ce732e7c30	[X86] Remove duplicate calls to setOperationAction. NFCI These same calls exist a few lines down. llvm-svn: 319122	2017-11-28 00:16:42 +00:00
Rafael Espindola	bce112c9e9	Add an F_Delete flag. For now this only changes the handle Access. llvm-svn: 319121	2017-11-28 00:12:44 +00:00
Craig Topper	dbd4a7fecc	[DAGCombiner] Don't combine aext(setcc) if the setcc is already using the target's preferred result type. With AVX512 vXi1 types are legal so we shouldn't be extending them. This change is similar to existing code in the zext(setcc) combine. llvm-svn: 319120	2017-11-27 23:51:40 +00:00
Craig Topper	57c02d18b9	[DAGCombiner] Use EVT::changeVectorElementTypeToInteger() instead of implementing manually. llvm-svn: 319119	2017-11-27 23:51:31 +00:00
Rafael Espindola	d19c2e8126	Add OpenFlags to the create(Unique\|Temporary)File interfaces. This will allow a future F_Delete flag to be specified when we want the file to be automatically deleted on close. llvm-svn: 319117	2017-11-27 23:44:11 +00:00
Craig Topper	256cc48df6	[X86] Teach getSetCCResultType to handle more than just SimpleVTs when looking at larger than 512-bit vectors. Which VTs are considered simple is determined by the superset of the legal types of all targets in LLVM. If we're looking at VTs that are going to be split down to 512-bits we should allow any VT not just simple ones since the simple list changes over time as new targets are added. llvm-svn: 319110	2017-11-27 22:56:10 +00:00
Petr Hosek	6163329caa	[CMake] Pass LLVM_HOST_TRIPLE to external projects LLVM runtimes rely on LLVM_HOST_TRIPLE being set in their builds and tests so make sure it's being passed down. Differential Revision: https://reviews.llvm.org/D40515 llvm-svn: 319109	2017-11-27 22:50:48 +00:00
Petr Hosek	a08d65ded2	[CMake][runtimes] Support monorepo layout with runtimes build We introduce a new variable LLVM_ENABLE_RUNTIMES which works similarly to LLVM_ENABLE_PROJECTS and allows specifying runtimes that will be enabled in the runtimes build. Differential Revision: https://reviews.llvm.org/D40233 llvm-svn: 319107	2017-11-27 22:31:11 +00:00

... 2 3 4 5 6 ...

157419 Commits