llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Berlin	8a00270838	MemorySSA: Add support for caching clobbering access in stores Summary: This enables us to cache the clobbering access for stores, despite the fact that we can't rewrite the use-def chains themselves. Early testing shows that, after this change, for larger testcases, it will be a significant net positive (memory and time) to remove the walker caching. Reviewers: george.burgess.iv, davide Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31567 llvm-svn: 299322	2017-04-02 05:09:15 +00:00
Daniel Berlin	9a9c9ff260	NewGVN: Don't try to kill off the stored value of stores when processing the congruence class of the store. Because we use the stored value of a store as the def, it isn't dead just because it appears as a def when it comes from a store. Note: I have not hit any cases with the memory code as it is where this breaks anything, just because of what memory congruences we actually allow. In a followup that improves memory congruence, this bug actually breaks real stuff (but the verifier catches it). llvm-svn: 299300	2017-04-01 09:44:33 +00:00
Daniel Berlin	9b4984926c	NewGVN: Clean up GVNExpression memory hierarchy, restructure hash computation a bit so we don't have to redefine it for loads, stores, and calls llvm-svn: 299299	2017-04-01 09:44:29 +00:00
Daniel Berlin	871ecd90ca	NewGVN: Use def_chain iterator in singleReachablePhiPath instead of recursion llvm-svn: 299298	2017-04-01 09:44:24 +00:00
Daniel Berlin	07275c3065	Move def_chain iterator to MemorySSA.h so it can be reused llvm-svn: 299297	2017-04-01 09:44:19 +00:00
Daniel Berlin	d042031f0f	MemorySSA: Push const correctness further. llvm-svn: 299295	2017-04-01 09:01:12 +00:00
Daniel Berlin	7500c5641e	MemorySSA: Kill the WalkTargetCache now that we have getBlockDefs. llvm-svn: 299294	2017-04-01 08:59:45 +00:00
Craig Topper	47fd2de304	[APInt] Fix bugs in isShiftedMask to match behavior of the similar function in MathExtras.h This removes a parameter from the routine that was responsible for a lot of the issue. It was a bit count that had to be set to the BitWidth of the APInt and would get passed to getLowBitsSet. This guaranteed the call to getLowBitsSet would create an all ones value. This was then compared to (V \| (V-1)). So the only shifted masks we detected had to have the MSB set. The one in tree user is a transform in InstCombine that never fires due to earlier transforms covering the case better. I've submitted a patch to remove it completely, but for now I've just adapted it to the new interface for isShiftedMask. llvm-svn: 299273	2017-03-31 22:23:42 +00:00
Craig Topper	e625d74271	[InstCombine] When adding an Instruction and its Users to the worklist at the same time, make sure we put the Users in first. Then put in the instruction. This way we ensure we immediately revisit the instruction and do any additional optimizations before visiting the users. Otherwise we might visit the users, then the instruction, then users again, then instruction again. llvm-svn: 299267	2017-03-31 21:35:30 +00:00
Craig Topper	885fa12e8a	[APInt] Remove shift functions from APIntOps namespace. Replace the few users with the APInt class methods. NFCI llvm-svn: 299248	2017-03-31 20:01:16 +00:00
Joerg Sonnenberger	28bed106e0	Do not translate rint into nearbyint, but truncate it like nearbyint. A common way to implement nearbyint is by fiddling with the floating point environment and calling rint. This is used at least by the BSD libm and musl. As such, canonicalizing the latter to the former will create infinite loops for libm and generally pessimize performance, at least when the generic C versions are used. This change preserves the rint in the libcall translation and also handles the domain truncation logic, so that rint with float argument will be reduced to rintf etc. llvm-svn: 299247	2017-03-31 19:58:07 +00:00
Dehao Chen	fed890ea3a	Fix the InstCombine to reserve the VP metadata and sets correct call count. Summary: Currently the VP metadata was dropped when InstCombine converts a call to direct call. This patch converts the VP metadata to branch_weights so that its hotness is recorded. Reviewers: eraman, davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31344 llvm-svn: 299228	2017-03-31 15:59:52 +00:00
Mikael Holmen	79235bd4d8	[Scalarizer] Handle scalar arguments in vector GEP Summary: Triggered by commit r298620: "[LV] Vectorize GEPs". If we encounter a vector GEP with scalar arguments, we splat the scalar into a vector of appropriate size before we scatter the argument. Reviewers: arsenm, mehdi_amini, bkramer Reviewed By: arsenm Subscribers: bjope, mssimpso, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D31416 llvm-svn: 299186	2017-03-31 06:29:49 +00:00
Peter Collingbourne	6b193966ac	ThinLTOBitcodeWriter: Use Module::global_values(). NFCI. llvm-svn: 299132	2017-03-30 23:43:08 +00:00
Craig Topper	79e5bc528d	[InstCombine] Fix typo last->least. NFC llvm-svn: 299123	2017-03-30 22:28:55 +00:00
Matt Arsenault	79f837c254	AMDGPU: Add all atomicrmw fields to atomic.inc/dec Add scope, order, isVolatile llvm-svn: 299122	2017-03-30 22:21:40 +00:00
Hongbin Zheng	bfd7c38de7	[SimplifyIndvar] Replace the sdiv used by IV if we can prove both of its operands are non-negative Since there is no sdiv in SCEV, an 'udiv' is a better canonical form than an 'sdiv' as the user of induction variable Differential Revision: https://reviews.llvm.org/D31488 llvm-svn: 299118	2017-03-30 21:56:56 +00:00
Simon Pilgrim	68168d17b9	Spelling mistakes in comments. NFCI. Based on corrections mentioned in patch for clang for PR27635 llvm-svn: 299072	2017-03-30 12:59:53 +00:00
Matthew Simpson	c8f0aeccda	[InstCombine] Correct the check for vector GEPs Some of the GEP combines (e.g., descaling) can't handle vector GEPs. We have an existing check that attempts to bail out if given a vector GEP. However, the check only tests the GEP's pointer operand. A GEP results in a vector of pointers if at least one of its operands is vector-typed (e.g., its pointer operand could be a scalar, but its index could be a vector). We should just check the type of the GEP itself. This should fix PR32414. Reference: https://bugs.llvm.org/show_bug.cgi?id=32414 Differential Revision: https://reviews.llvm.org/D31470 llvm-svn: 299017	2017-03-29 18:23:08 +00:00
Filipe Cabecinhas	8b94273fe6	Cleanup in preparation for D30703. NFCI Make the enumerators follow the coding convention and start with OW_... llvm-svn: 298996	2017-03-29 14:42:27 +00:00
Anna Thomas	923e574bff	[InstCombine] For select rule, use positive check of constant int for select operand. NFCI llvm-svn: 298906	2017-03-28 09:32:24 +00:00
Alex Shlyapnikov	bbd5cc63d7	Revert "[asan] Delay creation of asan ctor." Speculative revert. Some libfuzzer tests are affected. This reverts commit r298731. llvm-svn: 298890	2017-03-27 23:11:50 +00:00
Alex Shlyapnikov	09171aa31f	Revert "[asan] Put ctor/dtor in comdat." Speculative revert, some libfuzzer tests are affected. This reverts commit r298756. llvm-svn: 298889	2017-03-27 23:11:47 +00:00
Matthew Simpson	b8ff4a4a70	[LV] Transform truncations of non-primary induction variables The vectorizer tries to replace truncations of induction variables with new induction variables having the smaller type. After r295063, this optimization was applied to all integer induction variables, including non-primary ones. When optimizing the truncation of a non-primary induction variable, we still need to transform the new induction so that it has the correct start value. This should fix PR32419. Reference: https://bugs.llvm.org/show_bug.cgi?id=32419 llvm-svn: 298882	2017-03-27 20:07:38 +00:00
Anna Thomas	f57ae33381	[InstCombine] Avoid incorrect folding of select into phi nodes when incoming element is a vector type Summary: We are incorrectly folding selects into phi nodes when the incoming value of a phi node is a constant vector. This optimization is done in `FoldOpIntoPhi` when the select condition is a phi node with constant incoming values. Without the fix, we are miscompiling (i.e. incorrectly folding the select into the phi node) when the vector contains non-zero elements. This patch fixes the miscompile and we will correctly fold based on the select vector operand (see added test cases). Reviewers: majnemer, sanjoy, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31189 llvm-svn: 298845	2017-03-27 13:52:51 +00:00
Serge Pavlov	b71bb80c2d	[LoopUnroll] Remap references in peeled iteration References in cloned blocks must be remapped prior to dominator calculation. Differential Revision: https://reviews.llvm.org/D31281 llvm-svn: 298811	2017-03-26 16:46:53 +00:00
Joerg Sonnenberger	fa7367428a	Split the SimplifyCFG pass into two variants. The first variant contains all current transformations except transforming switches into lookup tables. The second variant contains all current transformations. The switch-to-lookup-table conversion results in code that is more difficult to analyze and optimize by other passes. Most importantly, it can inhibit Dead Code Elimination. As such it is often beneficial to only apply this transformation very late. A common example is inlining, which can often result in range restrictions for the switch expression. Changes in execution time according to LNT: SingleSource/Benchmarks/Misc/fp-convert +3.03% MultiSource/Benchmarks/ASC_Sequoia/CrystalMk/CrystalMk -11.20% MultiSource/Benchmarks/Olden/perimeter/perimeter -10.43% and a couple of smaller changes. For perimeter it also results 2.6% a smaller binary. Differential Revision: https://reviews.llvm.org/D30333 llvm-svn: 298799	2017-03-26 06:44:08 +00:00
Chandler Carruth	0d256c0f5d	[IR] Make SwitchInst::CaseIt almost a normal iterator. This moves it to the iterator facade utilities giving it full random access semantics, etc. It can also now be used with standard algorithms like std::all_of and std::any_of and range adaptors like llvm::reverse. Also make the semantics of iterating match what every other iterator uses and forbid decrementing past the begin iterator. This was used as a hacky way to work around iterator invalidation. However, every instance trying to do this failed to actually avoid touching invalid iterators despite the clear documentation that the removed and all subsequent iterators become invalid including the end iterator. So I've added a return of the next iterator to removeCase and rewritten the loops that were doing this to correctly follow the iterator pattern of either incremneting or removing and assigning fresh values to the iterator and the end. In one case we were trying to go backwards to make this cleaner but it doesn't actually work. I've made that code match the code we use everywhere else to remove cases as we iterate. This changes the order of cases in one test output and I moved that test to CHECK-DAG so it wouldn't care -- the order isn't semantically meaningful anyways. llvm-svn: 298791	2017-03-26 02:49:23 +00:00
Craig Topper	47596dd4cc	[InstCombine] Change the interface of SimplifyDemandedBits so that it takes the instruction and operand instead of the Use. The first thing it did was get the User for the Use to get the instruction back. This requires looking through the Uses for the User using the waymarking walk. That's pretty fast, but its probably still better to just pass the Instruction we already had. llvm-svn: 298772	2017-03-25 06:52:52 +00:00
Davide Italiano	e9781e7b2f	[NewGVN] Adjust NDEBUG markers. This avoids 'used but not defined' warnings in Release builds with GCC. llvm-svn: 298760	2017-03-25 02:40:02 +00:00
Evgeniy Stepanov	71bb8f1ad0	[asan] Put ctor/dtor in comdat. When possible, put ASan ctor/dtor in comdat. The only reason not to is global registration, which can be TU-specific. This is not the case when there are no instrumented globals. This is also limited to ELF targets, because MachO does not have comdat, and COFF linkers may GC comdat constructors. The benefit of this is a lot less __asan_init() calls: one per DSO instead of one per TU. It's also necessary for the upcoming gc-sections-for-globals change on Linux, where multiple references to section start symbols trigger quadratic behaviour in gold linker. llvm-svn: 298756	2017-03-25 01:01:11 +00:00
Craig Topper	8fbb74b5b2	Revert r298711 "[InstCombine] Provide a way to calculate KnownZero/One for Add/Sub in SimplifyDemandedUseBits without recursing into ComputeKnownBits" Tsan bot is failing. llvm-svn: 298745	2017-03-24 22:12:10 +00:00
Ivan Krasin	c2124e185c	Revert r298620: [LV] Vectorize GEPs Reason: breaks linking Chromium with LLD + ThinLTO (a pass crashes) LLVM bug: https://bugs.llvm.org//show_bug.cgi?id=32413 Original change description: [LV] Vectorize GEPs This patch adds support for vectorizing GEPs. Previously, we only generated vector GEPs on-demand when creating gather or scatter operations. All GEPs from the original loop were scalarized by default, and if a pointer was to be stored to memory, we would have to build up the pointer vector with insertelement instructions. With this patch, we will vectorize all GEPs that haven't already been marked for scalarization. The patch refines collectLoopScalars to more exactly identify the scalar GEPs. The function now more closely resembles collectLoopUniforms. And the patch moves vector GEP creation out of vectorizeMemoryInstruction and into the main vectorization loop. The vector GEPs needed for gather and scatter operations will have already been generated before vectoring the memory accesses. Original Differential Revision: https://reviews.llvm.org/D30710 llvm-svn: 298735	2017-03-24 20:49:43 +00:00
Evgeniy Stepanov	64e872a91f	[asan] Delay creation of asan ctor. Create the constructor in the module pass. This in needed for the GC-friendly globals change, where the constructor can be put in a comdat in some cases, but we don't know about that in the function pass. llvm-svn: 298731	2017-03-24 20:42:15 +00:00
Matt Arsenault	4c7795dd31	AMDGPU: Fold rcp/rsq of undef to undef llvm-svn: 298725	2017-03-24 19:04:57 +00:00
Matt Arsenault	18bb24a1be	TTI: Split IsSimple in MemIntrinsicInfo All this did before was assert in EarlyCSE. llvm-svn: 298724	2017-03-24 18:56:43 +00:00
Teresa Johnson	428b9e0627	[ThinLTO] Correct counting of functions in inliner stats Summary: Declarations need to be filtered out when counting functions. Reviewers: eraman Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31336 llvm-svn: 298720	2017-03-24 17:59:06 +00:00
Craig Topper	d4521c2fc2	[InstCombine] Provide a way to calculate KnownZero/One for Add/Sub in SimplifyDemandedUseBits without recursing into ComputeKnownBits SimplifyDemandedUseBits for Add/Sub already recursed down LHS and RHS for simplifying bits. If that didn't provide any simplifications we fall back to calling computeKnownBits which will recurse again. Instead just take the known bits for LHS and RHS we already have and call into a new function in ValueTracking that can calculate the known bits given the LHS/RHS bits. llvm-svn: 298711	2017-03-24 16:56:51 +00:00
Benjamin Kramer	46f5e2c47b	Make GCC happy again. llvm-svn: 298702	2017-03-24 14:15:35 +00:00
Daniel Berlin	ffc30781f4	NewGVN: Small cleanup of two dominance related functions to make them easier to understand. llvm-svn: 298692	2017-03-24 06:33:51 +00:00
Daniel Berlin	0e9001131d	NewGVN: Small cleanup of useless expression deletion, and don't uselessly create two expressions in symbolic store evaluation. llvm-svn: 298691	2017-03-24 06:33:48 +00:00
Daniel Berlin	9d0796e5d0	NewGVN: Fix PR32403 - Handling of undef in phis was not quite correct due to LLVM's view of phi nodes. It would cause NewGVN not to fixpoint in some interesting edge cases. llvm-svn: 298687	2017-03-24 05:30:34 +00:00
Craig Topper	36f2e0eee8	[InstCombine] Use range-based for loop. NFC llvm-svn: 298680	2017-03-24 02:58:02 +00:00
Craig Topper	df73e7c5b7	[InstCombine] Fix 80 column violation I accidentally introduced. NFC llvm-svn: 298679	2017-03-24 02:57:59 +00:00
Reid Kleckner	392f062675	[sancov] Don't instrument blocks with no insertion point This prevents crashes when attempting to instrument functions containing C++ try. Sanitizer coverage will still fail at runtime when an exception is thrown through a sancov instrumented function, but that seems marginally better than what we have now. The full solution is to color the blocks in LLVM IR and only instrument blocks that have an unambiguous color, using the appropriate token. llvm-svn: 298662	2017-03-23 23:30:41 +00:00
Dehao Chen	722e94061b	Set the prof weight correctly for call instructions in DeadArgumentElimination. Summary: In DeadArgumentElimination, the call instructions will be replaced. We also need to set the prof weights so that function inlining can find the correct profile. Reviewers: eraman Reviewed By: eraman Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31143 llvm-svn: 298660	2017-03-23 23:26:00 +00:00
Bryant Wong	def79b21e4	[MetaRenamer] Don't rename library functions. Library functions can have specific semantics that affect the behavior of certain passes. DSE, for instance, gives special treatment to malloc-ed pointers but not to pointers returned from an equivalently typed (but differently named) function. MetaRenamer ought not to alter program semantics, so library functions must remain untouched. Reviewers: mehdi_amini, majnemer, chandlerc, davide Reviewed By: davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D31304 llvm-svn: 298659	2017-03-23 23:21:07 +00:00
Dehao Chen	8c88671985	Disable loop unrolling and icp in SamplePGO ThinLTO compile phase Summary: loop unrolling and icp will make the sample profile annotation much harder in the backend. So disable these 2 optimization in the ThinLTO compile phase. Will add a test in cfe in a separate patch. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: mehdi_amini, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31217 llvm-svn: 298646	2017-03-23 21:20:05 +00:00
Craig Topper	74494d0179	[InstCombine] Remove some code from visitAnd that dealt with trying to reduce the LHS of a sub to 0. This should now be fully handled by SimplifyDemandedInstructionBits now. Now that we call ShrinkDemandedConstant on the RHS of sub this should be taken care of. This code doesn't trigger on any in tree regressions, but did before ShrinkDemandedConstant was added to the RHS. llvm-svn: 298644	2017-03-23 21:00:13 +00:00
Teresa Johnson	0c6a4ff8dc	[ThinLTO] Add support for emitting minimized bitcode for thin link Summary: The cumulative size of the bitcode files for a very large application can be huge, particularly with -g. In a distributed build environment, all of these files must be sent to the remote build node that performs the thin link step, and this can exceed size limits. The thin link actually only needs the summary along with a bitcode symbol table. Until we have a proper bitcode symbol table, simply stripping the debug metadata results in significant size reduction. Add support for an option to additionally emit minimized bitcode modules, just for use in the thin link step, which for now just strips all debug metadata. I plan to add a cc1 option so this can be invoked easily during the compile step. However, care must be taken to ensure that these minimized thin link bitcode files produce the same index as with the original bitcode files, as these original bitcode files will be used in the backends. Specifically: 1) The module hash used for caching is typically produced by hashing the written bitcode, and we want to include the hash that would correspond to the original bitcode file. This is because we want to ensure that changes in the stripped portions affect caching. Added plumbing to emit the same module hash in the minimized thin link bitcode file. 2) The module paths in the index are constructed from the module ID of each thin linked bitcode, and typically is automatically generated from the input file path. This is the path used for finding the modules to import from, and obviously we need this to point to the original bitcode files. Added gold-plugin support to take a suffix replacement during the thin link that is used to override the identifier on the MemoryBufferRef constructed from the loaded thin link bitcode file. The assumption is that the build system can specify that the minimized bitcode file has a name that is similar but uses a different suffix (e.g. out.thinlink.bc instead of out.o). Added various tests to ensure that we get identical index files out of the thin link step. Reviewers: mehdi_amini, pcc Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31027 llvm-svn: 298638	2017-03-23 19:47:39 +00:00

1 2 3 4 5 ...

17522 Commits