llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	893aea58ea	[LoopUnroll] Allow unrolling if the unrolled size does not exceed loop size. Summary: In the following cases, unrolling can be beneficial, even when optimizing for code size: 1) very low trip counts 2) potential to constant fold most instructions after fully unrolling. We can unroll in those cases, by setting the unrolling threshold to the loop size. This might highlight some cost modeling issues and fixing them will have a positive impact in general. Reviewers: vsk, efriedma, dmgreen, paquette Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D60265 llvm-svn: 358586	2019-04-17 15:57:43 +00:00
Roman Lebedev	0080645846	[CVP] processOverflowIntrinsic(): don't crash if constant-holding happened As reported by Mikael Holmén in post-commit review in https://reviews.llvm.org/D60791#1469765 llvm-svn: 358559	2019-04-17 06:35:07 +00:00
Eric Christopher	e29874eaa0	Revert "Add basic loop fusion pass." Per request. This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358553	2019-04-17 04:55:24 +00:00
Eric Christopher	cee313d288	Revert "Temporarily Revert "Add basic loop fusion pass."" The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552	2019-04-17 04:52:47 +00:00
Eric Christopher	a863435128	Temporarily Revert "Add basic loop fusion pass." As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546	2019-04-17 02:12:23 +00:00
Kit Barton	ab70da0728	Add basic loop fusion pass. This patch adds a basic loop fusion pass. It will fuse loops that conform to the following 4 conditions: 1. Adjacent (no code between them) 2. Control flow equivalent (if one loop executes, the other loop executes) 3. Identical bounds (both loops iterate the same number of iterations) 4. No negative distance dependencies between the loop bodies. The pass does not make any changes to the IR to create opportunities for fusion. Instead, it checks if the necessary conditions are met and if so it fuses two loops together. The pass has not been added to the pass pipeline yet, and thus is not enabled by default. It can be run stand alone using the -loop-fusion option. Phabricator: https://reviews.llvm.org/D55851 llvm-svn: 358543	2019-04-17 01:37:00 +00:00
Sanjay Patel	e08783e2f5	[EarlyCSE] detect equivalence of selects with inverse conditions and commuted operands (PR41101) This is 1 of the problems discussed in the post-commit thread for: rL355741 / http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190311/635516.html and filed as: https://bugs.llvm.org/show_bug.cgi?id=41101 Instcombine tries to canonicalize some of these cases (and there's room for improvement there independently of this patch), but it can't always do that because of extra uses. So we need to recognize these commuted operand patterns here in EarlyCSE. This is similar to how we detect commuted compares and commuted min/max/abs. Differential Revision: https://reviews.llvm.org/D60723 llvm-svn: 358523	2019-04-16 20:41:20 +00:00
Nikita Popov	52b24ee932	[CVP] Simplify umulo and smulo that cannot overflow If a umul.with.overflow or smul.with.overflow operation cannot overflow, simplify it to a simple mul nuw / mul nsw. After the refactoring in D60668 this is just a matter of removing an explicit check against multiplications. Differential Revision: https://reviews.llvm.org/D60791 llvm-svn: 358521	2019-04-16 20:31:41 +00:00
Simon Pilgrim	82ffa88a04	[SLP] Refactoring of the operand reordering code. This is a refactoring patch which should have all the functionality of the current code. Its goal is twofold: i. Cleanup and simplify the reordering code, and ii. Generalize reordering so that it will work for an arbitrary number of operands, not just 2. This is the second patch in a series of patches that will enable operand reordering across chains of operations. An example of this was presented in EuroLLVM'18 https://www.youtube.com/watch?v=gIEn34LvyNo . Committed on behalf of @vporpo (Vasileios Porpodas) Differential Revision: https://reviews.llvm.org/D59973 llvm-svn: 358519	2019-04-16 19:27:00 +00:00
Nikita Popov	5a30177906	[CVP] Add tests for non-overflowing mulo; NFC Should be simplified to simple mul. llvm-svn: 358517	2019-04-16 19:25:35 +00:00
Nikita Popov	5ecd6a48b9	[InstCombine] Prune fshl/fshr with masked operands If a constant shift amount is used, then only some of the LHS/RHS operand bits are demanded and we may be able to simplify based on that. InstCombineSimplifyDemanded already had the necessary support for that, we just weren't calling it with fshl/fshr as root. In particular, this allows us to relax some masked funnel shifts into simple shifts, as shown in the tests. Patch by Shawn Landden. Differential Revision: https://reviews.llvm.org/D60660 llvm-svn: 358515	2019-04-16 19:05:49 +00:00
Nikita Popov	f700081a7d	[InstCombine] Add tests for fshl/fshr with masked operands; NFC Baseline tests for D60660. Patch by Shawn Landden. Differential Revision: https://reviews.llvm.org/D60688 llvm-svn: 358514	2019-04-16 19:05:40 +00:00
Philip Reames	c44b68e2b7	[Tests] Add branch_weights to latches so that test is not effected by future profitability patch to LoopPredication llvm-svn: 358506	2019-04-16 16:32:59 +00:00
Hans Wennborg	21eb771dcb	Re-commit r357452: SimplifyCFG SinkCommonCodeFromPredecessors: Also sink function calls without used results (PR41259) The original commit caused false positives from AddressSanitizer's use-after-scope checks, which have now been fixed in r358478. > The code was previously checking that candidates for sinking had exactly > one use or were a store instruction (which can't have uses). This meant > we could sink call instructions only if they had a use. > > That limitation seemed a bit arbitrary, so this patch changes it to > "instruction has zero or one use" which seems more natural and removes > the need to special-case stores. > > Differential revision: https://reviews.llvm.org/D59936 llvm-svn: 358483	2019-04-16 12:13:25 +00:00
Quentin Colombet	fda0426888	[LSR] Rewrite misses some fixup locations if it splits critical edge If LSR split critical edge during rewriting phi operands and phi node has other pending fixup operands, we need to update those pending fixups. Otherwise formulae will not be implemented completely and some instructions will not be eliminated. llvm.org/PR41445 Differential Revision: https://reviews.llvm.org/D60645 Patch by: Denis Bakhvalov <denis.bakhvalov@intel.com> llvm-svn: 358457	2019-04-15 22:23:46 +00:00
Sanjay Patel	800a0c3e4b	[EarlyCSE] add more tests for double-negated select condition; NFC llvm-svn: 358454	2019-04-15 21:51:51 +00:00
Sanjay Patel	5ae05d810c	[EarlyCSE] add test for select condition double-negation; NFC llvm-svn: 358444	2019-04-15 20:25:31 +00:00
Philip Reames	af808ee2ee	[Tests] Add a few more tests for LoopPredication w/invariant loads Making sure to cover an important legality cornercase. llvm-svn: 358439	2019-04-15 19:45:27 +00:00
Wolfgang Pieb	4fe42214e2	[DEBUGINFO] Prevent Instcombine from dropping debuginfo when removing zexts Zexts can be treated like no-op casts when it comes to assessing whether their removal affects debug info. Reviewer: aprantl Differential Revision: https://reviews.llvm.org/D60641 llvm-svn: 358431	2019-04-15 17:36:29 +00:00
Hiroshi Yamauchi	09e539fcae	[PGO] Profile guided code size optimization. Summary: Enable some of the existing size optimizations for cold code under PGO. A ~5% code size saving in big internal app under PGO. The way it gets BFI/PSI is discussed in the RFC thread http://lists.llvm.org/pipermail/llvm-dev/2019-March/130894.html Note it doesn't currently touch loop passes. Reviewers: davidxl, eraman Reviewed By: eraman Subscribers: mgorny, javed.absar, smeenai, mehdi_amini, eraman, zzheng, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59514 llvm-svn: 358422	2019-04-15 16:49:00 +00:00
Sanjay Patel	0e0bb0e24a	[EarlyCSE] add tests for selects with commuted operands (PR41101); NFC llvm-svn: 358420	2019-04-15 16:01:05 +00:00
Philip Reames	fbe64a2cfb	[LoopPred] Hoist and of predicated checks where legal If we have multiple range checks which can be predicated, hoist the and of the results outside the loop. This minorly cleans up the resulting IR, but the main motivation is as a building block for D60093. llvm-svn: 358419	2019-04-15 15:53:25 +00:00
Sanjay Patel	c71433335a	[EarlyCSE] regenerate test checks; NFC llvm-svn: 358407	2019-04-15 14:02:37 +00:00
Sanjay Patel	5e13cd2e61	[InstCombine] canonicalize fdiv after fmul if reassociation is allowed (X / Y) * Z --> (X * Z) / Y This can allow other optimizations/reassociations as shown in the test diffs. llvm-svn: 358404	2019-04-15 13:23:38 +00:00
Serguei Katkov	f54328372b	[NewPM] Add Option handling for SimplifyCFG This patch enables passing options to SimplifyCFGPass via the passes pipeline. Reviewers: chandlerc, fedor.sergeev, leonardchan, philip.pfaffe Reviewed By: fedor.sergeev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D60675 llvm-svn: 358379	2019-04-15 08:57:53 +00:00
Philip Reames	0eeb2cd491	[Tests] Add tests for D60659, and make adjustments to others to make diff clear Three related changes: 1) auto-gen several test files 2) Add the new tests at the bottom of said files 3) Adjust a couple of other test files not to use stores to constants when trying to test constexpr address handling llvm-svn: 358344	2019-04-13 22:12:56 +00:00
Nikita Popov	040871db48	[CVP] Add tests for range of with.overflow result; NFC Test range of with.overflow result in the no-overflow branch. llvm-svn: 358341	2019-04-13 19:43:51 +00:00
Nikita Popov	41e284b9c3	[CVP] Fix inverted predicates in test; NFC Checked the wrong direction in the umul tests... fix predicated to line up with the test name. llvm-svn: 358331	2019-04-13 11:47:36 +00:00
Nikita Popov	25c1aa15a7	[CVP] Add tests for with.overflow used as condition; NFC llvm-svn: 358330	2019-04-13 11:40:16 +00:00
Chen Zheng	87dd0e06dc	[InstCombine] Canonicalize (-X srem Y) to -(X srem Y). Differential Revision: https://reviews.llvm.org/D60647 llvm-svn: 358328	2019-04-13 09:21:22 +00:00
Chen Zheng	fc59a0326b	[InstCombine] [NFC] add testcases for canonicalizing (-X srem Y) to -(X srem Y). llvm-svn: 358327	2019-04-13 07:34:55 +00:00
Philip Reames	b091cc081d	[InstCombine] Fix a nasty miscompile introduced w/masked.gather demanded elts This fixes a miscompile which was introduced in r356510 (https://reviews.llvm.org/D57372). The problem is that the original patch removed pointer operands where the load results we're demanded, but without considering the legality of the load itself. If the masked.gather had active, but undemanded, lanes, then we could end up creating a load which loaded from an undef address. The result could be a segfault, or, in theory, an arbitrary read from a random memory location into an used register. llvm-svn: 358299	2019-04-12 18:26:56 +00:00
Nikita Popov	00a0d5d1de	[CVP] Set NSW/NUW flags when simplifying with.overflow When CVP determines that a with.overflow intrinsic cannot overflow, it currently inserts a simple add/sub. As we already determined that there can be no overflow, we should add the appropriate NUW/NSW flag. Differential Revision: https://reviews.llvm.org/D60585 llvm-svn: 358298	2019-04-12 18:18:17 +00:00
Philip Reames	7a60cd38af	[Tests] Checkin a test demonstrating a miscompile so that patch which fixes it shows a clear diff llvm-svn: 358296	2019-04-12 18:11:58 +00:00
Jeremy Morse	32afe6a1f8	[DebugInfo] Fix pr41175 Dead Store Elimination missing debug loc Bug: https://bugs.llvm.org/show_bug.cgi?id=41175 In the bug test case the DSE pass is shortening the range of memory that a memset is working on. A getelementptr is generated so that the new starting address can be passed to memset. This instruction was not given a DebugLoc. To fix the bug, copy the DebugLoc from the memset instruction. Patch by Orlando Cazalet-Hyams! Differential Revision: https://reviews.llvm.org/D60556 llvm-svn: 358270	2019-04-12 09:47:35 +00:00
Fangrui Song	d5c404246f	[ConstantFold] Don't evaluate FP or FP vector casts or truncations when simplifying icmp Fix PR41476 llvm-svn: 358262	2019-04-12 07:34:30 +00:00
Nikita Popov	6ffa1511ea	[CVP] Generate full test checks for overflows.ll; NFC llvm-svn: 358229	2019-04-11 21:10:39 +00:00
Rong Xu	959ef16859	[PGO] Better handling of profile hash mismatch We currently assume profile hash conflicts will be caught by an upfront check and we assert for the cases that escape the check. The assumption is not always true as there are chances of conflict. This patch prints a warning and skips annotating the function for the escaped cases,. Differential Revision: https://reviews.llvm.org/D60154 llvm-svn: 358225	2019-04-11 20:54:17 +00:00
Simon Pilgrim	8d083c5e0b	[ConstantFold] ExtractConstantBytes - handle shifts on large integer types Use APInt instead of getZExtValue from the ConstantInt until we can confirm that the shift amount is in range. Reduced from OSS-Fuzz #14169 - https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=14169 llvm-svn: 358192	2019-04-11 16:39:31 +00:00
Erik Pilkington	cb5c7bd9eb	Fix a hang when lowering __builtin_dynamic_object_size If the ObjectSizeOffsetEvaluator fails to fold the object size call, then it may litter some unused instructions in the function. When done repeatably in InstCombine, this results in an infinite loop. Fix this by tracking the set of instructions that were inserted, then removing them on failure. rdar://49172227 Differential revision: https://reviews.llvm.org/D60298 llvm-svn: 358146	2019-04-10 23:42:11 +00:00
Nikita Popov	0a8228fd28	[InstCombine] Handle ssubo always overflow Following D60483 and D60497, this adds support for AlwaysOverflows handling for ssubo. This is the last case we can handle right now. Differential Revision: https://reviews.llvm.org/D60518 llvm-svn: 358100	2019-04-10 16:32:15 +00:00
Nikita Popov	7a543c3758	[InstCombine] ssubo X, C -> saddo X, -C ssubo X, C is equivalent to saddo X, -C. Make the transformation in InstCombine and allow the logic implemented for saddo to fold prior usages of add nsw or sub nsw with constants. Patch by Dan Robertson. Differential Revision: https://reviews.llvm.org/D60061 llvm-svn: 358099	2019-04-10 16:27:36 +00:00
Nikita Popov	ef23e88480	[InstCombine] Handle saddo always overflow Followup to D60483: Handle AlwaysOverflow conditions for saddo as well. Differential Revision: https://reviews.llvm.org/D60497 llvm-svn: 358095	2019-04-10 16:18:01 +00:00
David Stenberg	fab4bdf4b9	Add REQUIRES: asserts to test using -debug-only llvm-svn: 358057	2019-04-10 08:44:57 +00:00
Florian Hahn	db1a69c250	[VPLAN] Minor improvement to testing and debug messages. 1. Use computed VF for stress testing. 2. If the computed VF does not produce vector code (VF smaller than 2), force VF to be 4. 3. Test vectorization of i64 data on AArch64 to make sure we generate VF != 4 (on X86 that was already tested on AVX). Patch by Francesco Petrogalli <francesco.petrogalli@arm.com> Differential Revision: https://reviews.llvm.org/D59952 llvm-svn: 358056	2019-04-10 08:17:28 +00:00
Nikita Popov	09020ec2a7	[InstCombine] Handle usubo always overflow Check AlwaysOverflow condition for usubo. The implementation is the same as the existing handling for uaddo and umulo. Handling for saddo and ssubo will follow (smulo doesn't have the necessary ValueTracking support). Differential Revision: https://reviews.llvm.org/D60483 llvm-svn: 358052	2019-04-10 07:10:53 +00:00
Chen Zheng	5e13ff1da2	[InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y). Differential Revision: https://reviews.llvm.org/D60395 llvm-svn: 358050	2019-04-10 06:52:09 +00:00
Akira Hatanaka	9ca9d32b6b	[ObjC][ARC] Convert the retainRV marker that is passed as a named metadata into a module flag in the auto-upgrader and make the ARC contract pass read the marker as a module flag. This is needed to fix a bug where ARC contract wasn't inserting the retainRV marker when LTO was enabled, which caused objects returned from a function to be auto-released. rdar://problem/49464214 Differential Revision: https://reviews.llvm.org/D60303 llvm-svn: 358047	2019-04-10 06:20:20 +00:00
Nikita Popov	c176b708e4	[InstCombine] Add with.overflow always overflow tests; NFC The uadd and umul cases are currently handled, the usub, sadd, ssub and smul cases are not. usub, sadd and ssub already have the necessary ValueTracking support, smul doesn't. llvm-svn: 358031	2019-04-09 20:02:23 +00:00
Nikita Popov	2f5e9de8d1	Revert "[InstCombine] [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y)." This reverts commit `1383a91689`. sdiv-canonicalize.ll fails after this revision. The fold needs to be moved outside the branch handling constant operands. However when this is done there are further test changes, so I'm reverting this in the meantime. llvm-svn: 358026	2019-04-09 18:32:38 +00:00

1 2 3 4 5 ...

12453 Commits