llvm-project

Commit Graph

Author	SHA1	Message	Date
Artem Belevich	c2cd5d5ce0	[Split GEP] handle trunc() in separate-const-offset-from-gep pass. Let separate-const-offset-from-gep pass handle trunc() when it calculates constant offset relative to base. The pass itself may insert trunc() instructions when it canonicalises array indices to pointer-size integers and needs to handle trunc() in order to evaluate the offset. Differential Revision: https://reviews.llvm.org/D46732 llvm-svn: 332142	2018-05-11 21:13:19 +00:00
Yaxun Liu	deba150c27	[AMDGPU] Fix compilation failure when IR contains comdat Remove a useless SwitchSection which also causes compilation failure when IR contains comdat. The SwitchSection is useless because the current section is already correct text section for the function therefore no need to switch. It causes compilation failure for comdat because functions with comdat has specific text section, not the default .text section. Since HIP uses comdat, this bug caused failures for HIP. Differential Revision: https://reviews.llvm.org/D46770 llvm-svn: 332137	2018-05-11 20:40:14 +00:00
Sanjay Patel	0ddf09a36c	[DAG] reduce code duplication; NFCI llvm-svn: 332133	2018-05-11 20:08:23 +00:00
Daniel Neilson	f6651d4d94	[InstCombine] Handle atomic memset in the same way as regular memset Summary: This change adds handling of the atomic memset intrinsic to the code path that simplifies the regular memset. In practice this means that we will now also expand a small constant-length atomic memset into a single unordered atomic store. Reviewers: apilipenko, skatkov, mkazantsev, anna, reames Reviewed By: reames Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D46660 llvm-svn: 332132	2018-05-11 20:04:50 +00:00
David Blaikie	c30365ce1d	Move standard library inclusions to after internal inclusions. llvm-svn: 332124	2018-05-11 19:21:40 +00:00
Vedant Kumar	99d5c072f0	[DAGCombiner] Set the right SDLoc on extended SETCC uses (7/N) ExtendSetCCUses updates SETCC nodes which use a load (OriginalLoad) to reflect a simplification to the load (ExtLoad). Based on my reading, ExtendSetCCUses may create new nodes to extend a constant attached to a SETCC. It also creates fresh SETCC nodes which refer to any updated operands. ISTM that the location applied to the new constant and SETCC nodes should be the same as the location of the ExtLoad. This was suggested by Adrian in https://reviews.llvm.org/D45995. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46216 llvm-svn: 332119	2018-05-11 18:40:10 +00:00
Vedant Kumar	fd340a4047	[DAGCombiner] Set the right SDLoc on a newly-created sextload (6/N) This teaches tryToFoldExtOfLoad to set the right location on a newly-created extload. With that in place, the logic for performing a certain ([s\|z]ext (load ...)) combine becomes identical for sexts and zexts, and we can get rid of one copy of the logic. The test case churn is due to dependencies on IROrders inherited from the wrong SDLoc. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46158 llvm-svn: 332118	2018-05-11 18:40:08 +00:00
Vedant Kumar	f0e5f7c45e	[DAGCombiner] Factor out duplicated logic for an extload combine, NFC (5/N) Part of the logic for combining (zext (load ...)) and (sext (load ...)) is duplicated. This creates problems because bugs in one version have to be fixed again in the other version. To address this, as a first step, I've extracted the duplicate logic into a helper. I'll fix the debug location bug in the helper and eliminate the copy of its logic in a followup. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46157 llvm-svn: 332117	2018-05-11 18:40:02 +00:00
Chris Matthews	d1386a88dd	[LIT] replace output escapes wit a cdata block CDATA blocks don't need to have XML stuff escaped. Makes sense to wrap output in them instead of escaping. llvm-svn: 332116	2018-05-11 18:38:02 +00:00
Nico Weber	06b0c2fe2c	make add_llvm_fuzzer calls slightly more consisten with other cmake llvm-svn: 332112	2018-05-11 17:58:52 +00:00
David Bolvansky	cd93c4ef1a	[InstCombine] snprintf optimizations Reviewers: spatel, efriedma, majnemer, rja, bkramer Reviewed By: rja, bkramer Subscribers: mstorsjo, rja, llvm-commits Differential Revision: https://reviews.llvm.org/D46285 llvm-svn: 332110	2018-05-11 17:50:49 +00:00
Simon Pilgrim	661ae7778d	[X86][BtVer2] Model ymm move as double pumped instructions We still need to handle mmx/xmm moves as 'decode-only' no-pipe instructions llvm-svn: 332109	2018-05-11 17:38:36 +00:00
Alex Bradbury	bca0c3cdb6	[RISCV] Support .option rvc and norvc assembler directives These directives allow the 'C' (compressed) extension to be enabled/disabled within a single file. Differential Revision: https://reviews.llvm.org/D45864 Patch by Kito Cheng llvm-svn: 332107	2018-05-11 17:30:28 +00:00
Martin Storsjo	0d7c37756b	[Analysis] Validate the return type of s(n)printf like libcalls If the sprintf function is static (as on mingw-w64, where many stdio functions are static inline wrappers), earlier optimization passes could optimize out the return value altogether, and make it void, which could break optimizations of this libcall that touch the return value. This fixes the issue discussed in PR37408 for the sprintf function. Differential Revision: https://reviews.llvm.org/D46752 llvm-svn: 332106	2018-05-11 16:53:56 +00:00
Simon Pilgrim	706403bab8	[X86][MMX] Tag MMX Move/Load/Store as WriteVec schedule classes Fixes an issue on SLM/Btver2 where we had instructions were being treated as scalar loads/stores llvm-svn: 332104	2018-05-11 16:38:59 +00:00
Geoff Berry	60460268c0	[AArch64] Fix performPostLD1Combine to check for constant lane index. Summary: performPostLD1Combine in AArch64ISelLowering looks for vector insert_vector_elt of a loaded value which it can optimize into a single LD1LANE instruction. The code checking for the pattern was not checking if the lane index was a constant which could cause two problems: - an assert when lowering the LD1LANE ISD node since it assumes an constant operand - an assert in isel if the lane index value depends on the post-incremented base register Both of these issues are avoided by simply checking that the lane index is a constant. Fixes bug 35822. Reviewers: t.p.northover, javed.absar Subscribers: rengolin, kristof.beyls, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D46591 llvm-svn: 332103	2018-05-11 16:25:06 +00:00
Simon Dardis	ae2010813f	[mips] Rename Filler to MipsDelaySlotFiller and initialize the pass llvm-svn: 332102	2018-05-11 16:13:53 +00:00
Sanjoy Das	82105e2a7d	Use iteration instead of recursion in CFIInserter Summary: This recursive step can overflow the stack. Reviewers: djokov, petarj Subscribers: mcrosier, jlebar, bixia, llvm-commits Differential Revision: https://reviews.llvm.org/D46671 llvm-svn: 332101	2018-05-11 15:54:46 +00:00
Davide Italiano	6e1f7bf316	[Reassociate] Prevent infinite loops when processing PHIs. Phi nodes can reside in live blocks but one of their incoming arguments can come from a dead block. Dead blocks and reassociate don't play nice together. In fact, reassociate performs an RPO as a first step to avoid processing dead blocks. The reason why Reassociate might not fixpoint when examining dead blocks is that the following: %xor0 = xor i16 %xor1, undef %xor1 = xor i16 %xor0, undef is perfectly valid LLVM IR (if it appears in a dead block), so the worklist algorithm keeps pushing the two instructions for reexamination. Note that this is not Reassociate fault, at least not entirely. It's llvm that has a weird definition of dominance. Fixes PR37390. llvm-svn: 332100	2018-05-11 15:45:36 +00:00
Nico Weber	0f2a48c1ae	Remove unused SyncExecutor and make it clearer that the whole file is only used if LLVM_ENABLE_THREADS llvm-svn: 332098	2018-05-11 15:25:38 +00:00
Simon Dardis	d4169ad7c1	[mips] Enable disassembly of fused (negative) multiply add/sub instructions Reviewers: atanasyan, smaksimovic, abeserminji Differential Revision: https://reviews.llvm.org/D46392 llvm-svn: 332097	2018-05-11 15:21:40 +00:00
Simon Pilgrim	032a01f74a	[X86][SLM] Vector stores only use the MEC port. Confirmed by both Agner and Intel's AOM - the IEC/FPC are not required for pure load/stores (even if its a partial update). Can't fix WriteStore until all RMW instructions are cleaned up though.... llvm-svn: 332096	2018-05-11 15:16:15 +00:00
Simon Pilgrim	22dd72b995	[X86] Split WriteF/WriteVec Move/Load/Store scheduler classes by vector width Fixes a SNB issue that was missing vlddqu/vmovntdqa ymm instructions llvm-svn: 332094	2018-05-11 14:30:54 +00:00
Daniel Neilson	8f30ec65b0	[InstCombine] Unify handling of atomic memtransfer with non-atomic memtransfer Summary: This change reworks the handling of atomic memcpy within the instcombine pass. Previously, a constant length atomic memcpy would be lowered into loads & stores as long as no more than 16 load/store pairs are created. This is quite different from the lowering done for a non-atomic memcpy; which only ever lowers into a single load/store pair of no more than 8 bytes. Larger constant-sized memcpy calls are expanded to load/stores in later passes, such as SelectionDAG lowering. In this change the behaviour for atomic memcpy is unified with non-atomic memcpy; atomic memcpy is now treated in the same was as non-atomic memcpy has always been. We leave it to later passes to lower longer-length atomic memcpy calls. Due to the structure of the pass's handling of memtransfer intrinsics, this change also gives us handling of atomic memmove that we did not previously have. Reviewers: apilipenko, skatkov, mkazantsev, anna, reames Reviewed By: reames Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D46658 llvm-svn: 332093	2018-05-11 14:30:02 +00:00
Simon Pilgrim	ead11e4d4b	[X86] Added scheduler helper classes to split move/load/store by size Nothing uses this yet but this will allow us to specialize MMX/XMM/YMM/ZMM vector moves. llvm-svn: 332090	2018-05-11 12:46:54 +00:00
Sven van Haastregt	5ad5c3c7ed	[APFloat] Set losesInfo on no-op convert losesInfo would be left unset when no conversion needs to be done. A caller such as InstCombine's fitsInFPType would then branch on an uninitialized value. Caught using valgrind on an out-of-tree target. Differential Revision: https://reviews.llvm.org/D46645 llvm-svn: 332087	2018-05-11 09:45:42 +00:00
Jakub Kuderski	0cbc1b0de4	[IRTests] Verify PDT instead of DT Summary: Fix two typos which result in verifying wrong data structures (DT) instead of PDT in DominatorTreeBatchUpdatesTest. Reviewers: davide, kuhar, grosser, dberlin Reviewed By: davide, kuhar, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46696 llvm-svn: 332086	2018-05-11 09:30:29 +00:00
Tom Stellard	dcc95e9385	AMDGPU/GlobalISel: Implement select() for 32-bit G_FPTOUI Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45883 llvm-svn: 332082	2018-05-11 05:44:16 +00:00
Alexander Shaposhnikov	18b5fb7b84	[llvm-strip] Add support for -remove-section This diff adds support for -remove-section to llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D46567 llvm-svn: 332081	2018-05-11 05:27:06 +00:00
Craig Topper	9968af4a2a	[X86] Remove and autoupgrade the avx512.mask.store.ss intrinsic. llvm-svn: 332079	2018-05-11 04:33:18 +00:00
Alexander Shaposhnikov	191913e3e7	[llvm-objcopy] Update remove-section.test Verify that the input binary is not getting modified and add an invocation which uses -remove-section instead of -R. Test plan: make check-all llvm-svn: 332078	2018-05-11 04:30:57 +00:00
Brian Gesiak	c651113439	[Coroutines] PR34897: Fix incorrect elisions Summary: https://bugs.llvm.org/show_bug.cgi?id=34897 demonstrates an incorrect coroutine frame allocation elision in the coro-elide pass. The elision is performed on the basis that the SSA variables from all llvm.coro.begin are directly referenced in subsequent llvm.coro.destroy instructions. However, this ignores the fact that the function may exit through paths that do not run these destroy instructions. In the sample program from PR34897, for example, the llvm.coro.destroy instruction is only executed in exception handling code. When the coroutine function exits normally, llvm.coro.destroy is not called. Eliding the allocation in this case causes a subsequent reference to the coroutine handle from outside of the function to access freed memory. To fix the issue, when finding an llvm.coro.destroy for each llvm.coro.begin, only consider llvm.coro.destroy that are executed along non-exceptional paths. Test Plan: 1. Download the sample program from https://bugs.llvm.org/show_bug.cgi?id=34897, compile it with `clang++ -fcoroutines-ts -stdlib=libc++ -std=c++1z -O2`, and run it. It should print `"run1\ncheck1\nrun2\ncheck2"` and then exit successfully. 2. Compile https://godbolt.org/g/mCKfnr and confirm it is still optimized to a single instruction, 'return 1190'. 3. `check-llvm` Reviewers: rsmith, GorNishanov, eric_niebler Reviewed By: GorNishanov Subscribers: andrewrk, lewissbaker, EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D43242 llvm-svn: 332077	2018-05-11 03:12:28 +00:00
Brian Gesiak	82de4e6b93	[Support] Add docs for 'openFileFor{Write,Read}' Summary: Add documentation for the LLVM Support functions `openFileForWrite` and `openFileForRead`. The `openFileForRead` parameter `RealPath`, in particular, I think warranted some explanation. In addition, make the behavior of the functions more consistent across platforms. Prior to this patch, Windows would set or not set the result file descriptor based on the nature of the error, whereas Unix would consistently set it to `-1` if the open failed. Make Windows consistently set it to `-1` as well. Test Plan: 1. `ninja check-llvm` 2. `ninja docs-llvm-html` Reviewers: zturner, rnk, danielmartin, scanon Reviewed By: danielmartin, scanon Subscribers: scanon, danielmartin, llvm-commits Differential Revision: https://reviews.llvm.org/D46499 llvm-svn: 332075	2018-05-11 01:47:27 +00:00
Kostya Serebryany	a2759327fd	[sanitizer-coverage] don't instrument a function if it's entry block ends with 'unreachable' llvm-svn: 332072	2018-05-11 01:09:39 +00:00
Kamil Rytarowski	02c432a72b	Register NetBSD/i386 in AddressSanitizer.cpp Summary: Ship kNetBSD_ShadowOffset32 set to 1ULL << 30. This is prepared for the amd64 kernel runtime. Sponsored by <The NetBSD Foundation> Reviewers: vitalybuka, joerg, kcc Reviewed By: vitalybuka Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46724 llvm-svn: 332069	2018-05-11 00:58:01 +00:00
Craig Topper	4b026e5ebd	[InstCombine] Add tests for cases where we don't recognize type promoted rotate idioms. These rotates take the form (x << (n & mask)) \| (x >> (-n & mask)) where mask is bitwidth - 1. If x has been promoted to a wider type than its original bit width due to type promotion we fail to narrower it and therefore don't recognize it as a rotate. llvm-svn: 332068	2018-05-11 00:46:09 +00:00
Chris Matthews	7d6224d2fa	Support Unsupported Tests in xunit output We were reporting "Unsupported" tests in xunit as passes, however since they are not run, it make more sense to mark them as skipped. The Junit xml standard has support for that, so lets use it. llvm-svn: 332065	2018-05-11 00:25:43 +00:00
Chris Matthews	5f74591847	Refactor xunit test case builder to not use as much str addition String concatenation in python is slow. Refactor to not concatenate the possibly large strings of test output and instead write them directly to the output file. llvm-svn: 332064	2018-05-11 00:25:42 +00:00
Wei Mi	0c2f6be662	[SampleFDO] Don't treat warm callsite with inline instance in the profile as cold We found current sampleFDO had a performance issue when triaging a regression. For a callsite with inline instance in the profile, even if hot callsite inliner cannot inline it, it may still execute enough times and should not be treated as cold in regular inliner later. However, currently if such callsite is not inlined by hot callsite inliner, and the BB where the callsite locates doesn't get samples from other instructions inside of it, the callsite will have no profile metadata annotated. In regular inliner cost analysis, if the callsite has no profile annotated and its caller has profile information, it will be treated as cold. The fix changes the isCallsiteHot check and chooses to compare CallsiteTotalSamples with hot cutoff value computed by ProfileSummaryInfo. Differential Revision: https://reviews.llvm.org/D45377 llvm-svn: 332058	2018-05-10 23:02:27 +00:00
Vedant Kumar	e0b5f86b30	[STLExtras] Add distance() for ranges, pred_size(), and succ_size() This commit adds a wrapper for std::distance() which works with ranges. As it would be a common case to write `distance(predecessors(BB))`, this also introduces `pred_size()` and `succ_size()` helpers to make that easier to write. Differential Revision: https://reviews.llvm.org/D46668 llvm-svn: 332057	2018-05-10 23:01:54 +00:00
Chris Matthews	4855c5f717	[LIT] Move xunit tests tests into their own location, and and add failures Failures will increase coverage. llvm-svn: 332056	2018-05-10 22:51:28 +00:00
Craig Topper	ea78a261de	[InstCombine] Replace an 'if' that should always be true with an assert. The bitwidth of the operation should always be wider than the result width of the truncate since we don't recurse through any width changing operations. llvm-svn: 332055	2018-05-10 22:45:28 +00:00
Sam Clegg	16c16827bc	[WebAssembly] Initial Disassembler. This implements a new table-gen emitter to create tables for a wasm disassembler, and a dissassembler to use them. Comes with 2 tests, that tests a few instructions manually. Is also able to disassemble large .wasm files with objdump reasonably. Not working so well, to be addressed in followups: - objdump appears to be passing an incorrect starting point. - since the disassembler works an instruction at a time, and it is disassembling stack instruction, it has no idea of pseudo register assignments. These registers are required for the instruction printing code that follows. For now, all such registers appear in the output as $0. Patch by Wouter van Oortmerssen Differential Revision: https://reviews.llvm.org/D45848 llvm-svn: 332052	2018-05-10 22:16:44 +00:00
Craig Topper	1ee19ae126	[X86] Add new patterns for masked scalar load/store to match clang's codegen from r331958. Clang's codegen now uses 128-bit masked load/store intrinsics in IR. The backend will widen to 512-bits on AVX512F targets. So this patch adds patterns to detect codegen's widening and patterns for AVX512VL that don't get widened. We may be able to drop some of the old patterns, but I leave that for a future patch. llvm-svn: 332049	2018-05-10 21:49:16 +00:00
Martin Storsjo	86e6742c17	Revert "[InstCombine] snprintf optimizations" This reverts commit SVN r331889, which could trigger failed assertions for cases where the snprintf function is declared with a vaguely differing signature (e.g. being defined as static inline), see PR37408. llvm-svn: 332043	2018-05-10 21:23:36 +00:00
Tom Stellard	1e0edad4bb	AMDGPU/GlobalISel: Implement select() for G_BITCAST s32 <--> <2 x s16> Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45881 llvm-svn: 332042	2018-05-10 21:20:10 +00:00
Robert Widmann	4b0084bfcf	[LLVM-C] Consolidate llgo's DIBuilder Bindings Summary: Move and correct LLVMDIBuilderCreateTypedef. This is the last API in DIBuilderBindings.h, so it is being removed and the C API will now be re-exported from IRBindings.h. Reviewers: whitequark, harlanhaskins, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46725 llvm-svn: 332041	2018-05-10 21:10:06 +00:00
Tom Stellard	1dc90204bf	AMDGPU/GlobalISel: Enable TableGen'd instruction selector Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, mgorny, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45994 llvm-svn: 332039	2018-05-10 20:53:06 +00:00
Sanjay Patel	c7bb14301a	[InstCombine] add folds for minnum(-a, -b) --> -maxnum(a, b) This is similar to what we do for integer min/max with 'not' ops (rL321882). This should fix: https://bugs.llvm.org/show_bug.cgi?id=37404 https://bugs.llvm.org/show_bug.cgi?id=37405 llvm-svn: 332031	2018-05-10 20:03:13 +00:00
Wolfgang Pieb	f2b6915ed4	[DWARF] Fixing a bug in DWARF v5 string offsets tables where the length encoded the contribution length excluding the table header. Instead it must encode the contribution length minus the length field itself. Reviewer: JDevliegehere Differential Revision: https://reviews.llvm.org/D45922 llvm-svn: 332030	2018-05-10 20:02:34 +00:00

1 2 3 4 5 ...

163984 Commits