llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Ray	ba3741cb2b	[X86] Fixing flag usage for RCL and RCR Summary: The RCL and RCR instructions use the carry flag. Reviewers: craig.topper Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29237 llvm-svn: 293441	2017-01-29 20:05:30 +00:00
Matthias Braun	a4976c6166	MachineInstr: Remove parameter from dump() The primary use of the dump() functions in LLVM is for use in a debugger. Unfortunately lldb does not seem to handle default arguments so using `p SomeMI.dump()` fails and you have to type the longer `p SomeMI.dump(nullptr)`. Remove the paramter to make the most common use easy. (You can always construct something like `p SomeMI.print(dbgs(),MyTII)` if you need more features). Differential Revision: https://reviews.llvm.org/D29241 llvm-svn: 293440	2017-01-29 18:20:42 +00:00
Simon Pilgrim	76073f8d22	[X86][SSE] Lower scalar_to_vector(0) to zero vector Replaces an xor+movd/movq with an xorps which will be shorter in codesize, avoid an int-fpu transfer, allow modern cores to fast path the result during decode and helps other combines recognise an all-zero vector. The only reason I can think of that we'd want to keep scalar_to_vector in this case is to help recognise the upper elts are undef but this doesn't seem to be a problem. Differential Revision: https://reviews.llvm.org/D29097 llvm-svn: 293438	2017-01-29 18:13:37 +00:00
Matthias Braun	de58b61b5d	llvm-c: Keep LLVMDumpModule() even in release builds While this probably should be considered a dump debugger utility, the C API currently has no other ways to print a module to stderr for error reporting purposes, so keep it even in release builds. llvm-svn: 293436	2017-01-29 17:52:03 +00:00
Sanjay Patel	062adaab83	[InstCombine] enable (X >>?,exact C1) << C2 --> X << (C2 - C1) for vectors with splats llvm-svn: 293435	2017-01-29 17:11:18 +00:00
Saleem Abdulrasool	5282eed06c	ARM: support `-mlong-calls` with AEABI TLS on ELF Support lowering AEABI TLS access (__aeabi_read_tp) with long calls. This requires adjusting the call sequence to use an indirect call to get full addressability. Resolves PR31769! llvm-svn: 293433	2017-01-29 16:46:22 +00:00
Sanjay Patel	14a4b8185f	[ValueTracking] clean up lookThroughCast; NFCI 1. Use auto with dyn_cast. 2. Don't use else after return. 3. Convert chain of 'else if' to switch. 4. Improve variable names. llvm-svn: 293432	2017-01-29 16:34:57 +00:00
Elena Demikhovsky	17fe27f1f2	[X86 Codegen] Fixed a bug in unsigned saturation PACKUSWB converts Signed word to Unsigned byte, (the same about DW) and it can't be used for umin+truncate pattern. AVX-512 VPMOVUS* instructions fit the pattern since they convert Unsigned to Unsigned. See https://llvm.org/bugs/show_bug.cgi?id=31773 Differential Revision: https://reviews.llvm.org/D29196 llvm-svn: 293431	2017-01-29 13:18:30 +00:00
Daniel Berlin	9f376b7b37	NewGVN: Fix where newline is printed in debug printing of memory equivalence llvm-svn: 293428	2017-01-29 10:26:03 +00:00
Igor Breger	9ea154d4ad	[X86][GlobalISel] Add limited argument lowering support to the IRTranslator. Summary: Add limited (i8/i16/i32/i64) argument lowering support to the IRTranslator. Inspired by commit 289940. Reviewers: t.p.northover, qcolombet, ab, zvi, rovka Reviewed By: rovka Subscribers: dberris, rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D28987 llvm-svn: 293427	2017-01-29 08:35:42 +00:00
Chandler Carruth	8e9c0a8472	[ArgPromote] Move static helpers to modern LLVM naming conventions while here. NFC. Simple refactoring while prepping a port to the new PM. Differential Revision: https://reviews.llvm.org/D29249 llvm-svn: 293426	2017-01-29 08:03:21 +00:00
Chandler Carruth	ae9ce3d402	[ArgPromote] Run clang-format to normalize remarkably idiosyncratic formatting that has evolved here over the past years prior to making somewhat invasive changes to thread new PM support through the business logic. Differential Revision: https://reviews.llvm.org/D29248 llvm-svn: 293425	2017-01-29 08:03:19 +00:00
Chandler Carruth	cd836cd4ee	[ArgPromote] Re-arrange the code in a more typical, logical way. This arranges the static helpers in an order where they are defined prior to their use to avoid the need of forward declarations, and collect the core pass components at the bottom below their helpers. This also folds one trivial function into the pass itself. Factoring this 'runImpl' was an attempt to help porting to the new pass manager, however in my attempt to begin this port in earnest it turned out to not be a substantial help. I think it will be easier to factor things without it. This is an NFC change and does a minimal amount of edits over all. Subsequent NFC cleanups will normalize the formatting with clang-format and improve the basic doxygen commenting. Differential Revision: https://reviews.llvm.org/D29247 llvm-svn: 293424	2017-01-29 08:03:16 +00:00
Craig Topper	135da1faf5	[SelectionDAG] Make SDNode::getConstantOperandVal an inline method. It's operation already exists manually in many places without using the method. llvm-svn: 293421	2017-01-29 06:08:02 +00:00
Justin Hibbits	10b6147e23	Add some Book-E instructions to the asm parser and printer. Summary: Adds the following instructions: * mfpmr * mtpmr * icblc * icblq * icbtls Fix the scheduling for mtspr on e5500, which uses CFX0, instead of SFX0/SFX1 as on e500mc. Addresses PR 31538. Differential Revision: https://reviews.llvm.org/D29002 llvm-svn: 293417	2017-01-29 04:55:57 +00:00
Craig Topper	4753736abf	[DAGCombiner] Use unsigned for a constant vector index instead of APInt. The type system requires that the number of vector elements should fit in 32-bits so this should be safe. llvm-svn: 293414	2017-01-29 04:38:21 +00:00
Craig Topper	d15730902b	[DAGCombiner] Remove unnecessary check on the size of the type of the index of EXTRACT_SUBVECTOR. The type system already requires that the number of vector elements must fit in 32-bits so an index should as well. Even if the type of the index were larger all we care about is that the constant index can fit in 64-bits so that we can call getZExtValue. llvm-svn: 293413	2017-01-29 04:38:19 +00:00
Craig Topper	24cdbe8fa6	[DAGCombiner] Make sure index of EXTRACT_SUBVECTOR is a constant before trying to use getConstantOperandVal. llvm-svn: 293412	2017-01-29 04:38:16 +00:00
Xinliang David Li	fd3f645f9d	Add support to dump dot graph block layout after MBP Differential Revision: https://reviews.llvm.org/D29141 llvm-svn: 293408	2017-01-29 01:57:02 +00:00
Davide Italiano	9d8f6f8a45	Remove inclusion of SSAUpdater from several passes. It is, in fact, unused. Found while reviewing Danny's new SSAUpdater and porting passes to it to see how the new API looked like. llvm-svn: 293407	2017-01-29 01:55:24 +00:00
Craig Topper	6533e40e9d	[X86] Fix vector ANDN matching to work correctly when both inputs to the AND are XORs. llvm-svn: 293403	2017-01-28 23:52:09 +00:00
Davide Italiano	9b8738d7c8	[PM] MLSM has been enabled for a way. Reclaim a cl::opt. llvm-svn: 293401	2017-01-28 23:45:37 +00:00
Kostya Serebryany	ac2a633467	[libfuzzer] include errno.h. On Ubuntu 14.04 we got away w/o it, but other systems seem to require it llvm-svn: 293389	2017-01-28 18:56:05 +00:00
Will Dietz	f47d26ac2b	RuntimeDyldELF: Don't abort on R_X86_64_NONE, it's a no-oop. llvm-svn: 293388	2017-01-28 18:39:01 +00:00
Will Dietz	10294b932c	AMDGPU: Add GlobalISel to required_libraries. llvm-svn: 293387	2017-01-28 18:13:08 +00:00
Mohammad Shahid	3121334d32	[SLP] Vectorize loads of consecutive memory accesses, accessed in non-consecutive (jumbled) way. The jumbled scalar loads will be sorted while building the tree and these accesses will be marked to generate shufflevector after the vectorized load with proper mask. Reviewers: hfinkel, mssimpso, mkuper Differential Revision: https://reviews.llvm.org/D26905 Change-Id: I9c0c8e6f91a00076a7ee1465440a3f6ae092f7ad llvm-svn: 293386	2017-01-28 17:59:44 +00:00
Arpith Chacko Jacob	2b156edf56	[NVPTX] Add intrinsics to support named barriers. Support for barrier synchronization between a subset of threads in a CTA through one of sixteen explicitly specified barriers. These intrinsics are not directly exposed in CUDA but are critical for forthcoming support of OpenMP on NVPTX GPUs. The intrinsics allow the synchronization of an arbitrary (multiple of 32) number of threads in a CTA at one of 16 distinct barriers. The two intrinsics added are as follows: call void @llvm.nvvm.barrier.n(i32 10) waits for all threads in a CTA to arrive at named barrier #10. call void @llvm.nvvm.barrier(i32 15, i32 992) waits for 992 threads in a CTA to arrive at barrier #15. Detailed description of these intrinsics are available in the PTX manual. http://docs.nvidia.com/cuda/parallel-thread-execution/#parallel-synchronization-and-communication-instructions Reviewers: hfinkel, jlebar Differential Revision: https://reviews.llvm.org/D17657 llvm-svn: 293384	2017-01-28 16:38:15 +00:00
Daniel Sanders	b96a945bf5	stripDebugInfo() should remove DILocation's found in !llvm.loop metadata Summary: Patch by Michele Scandale (with a small tweak to 'CHECK-NOT' the last DILocation in the test) Subscribers: bogner, llvm-commits Differential Revision: https://reviews.llvm.org/D27980 llvm-svn: 293377	2017-01-28 11:22:05 +00:00
Taewook Oh	505a25aec5	[InstCombine] Merge DebugLoc when speculatively hoisting store instruction Summary: Along with https://reviews.llvm.org/D27804, debug locations need to be merged when hoisting store instructions as well. Not sure if just dropping debug locations would make more sense for this case, but as the branch instruction will have at least different discriminator with the hoisted store instruction, I think there will be no difference in practice. Reviewers: aprantl, andreadb, danielcdh Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29062 llvm-svn: 293372	2017-01-28 07:05:43 +00:00
Matthias Braun	194ded551c	Use print() instead of dump() in code llvm-svn: 293371	2017-01-28 06:53:55 +00:00
Richard Trieu	3de487b2e8	[WebAssembly] Use print instead of dump method. This fixes non-debug non-assert builds after r293359. llvm-svn: 293368	2017-01-28 03:23:49 +00:00
Matthias Braun	25bcaba50e	Use print() instead of dump() in code The dump() functions are meant to be used in a debugger, code should typically use something like print(errs()); llvm-svn: 293365	2017-01-28 02:47:46 +00:00
Daniel Berlin	ee6e3a598a	MemorySSA: Allow movement to arbitrary places Summary: Extend the MemorySSAUpdater API to allow movement to arbitrary places Reviewers: davide, george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29239 llvm-svn: 293363	2017-01-28 02:26:39 +00:00
Quentin Colombet	8cf1163c4f	[RegisterBankInfo] Emit proper type for remapped registers. When the OperandsMapper creates virtual registers, it used to just create plain scalar register with the right size. This may confuse the instruction selector because we lose the information of the instruction using those registers what supposed to do. The MachineVerifier complains about that already. With this patch, the OperandsMapper still creates plain scalar register, but the expectation is for the mapping function to remap the type properly. The default mapping function has been updated to do that. rdar://problem/30231850 llvm-svn: 293362	2017-01-28 02:23:48 +00:00
Daniel Berlin	2f1ab4ba79	MemorySSA: Fix block numbering invalidation and replacement bugs discovered by updater llvm-svn: 293361	2017-01-28 02:22:52 +00:00
Matthias Braun	8c209aa877	Cleanup dump() functions. We had various variants of defining dump() functions in LLVM. Normalize them (this should just consistently implement the things discussed in http://lists.llvm.org/pipermail/cfe-dev/2014-January/034323.html For reference: - Public headers should just declare the dump() method but not use LLVM_DUMP_METHOD or #if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP) - The definition of a dump method should look like this: #if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP) LLVM_DUMP_METHOD void MyClass::dump() { // print stuff to dbgs()... } #endif llvm-svn: 293359	2017-01-28 02:02:38 +00:00
Daniel Berlin	ae6b8b6933	MemorySSA: Move updater to its own file llvm-svn: 293357	2017-01-28 01:35:02 +00:00
Daniel Berlin	60ead05f80	Introduce a basic MemorySSA updater, that supports insertDef, insertUse, moveBefore and moveAfter operations. Summary: This creates a basic MemorySSA updater that handles arbitrary insertion of uses and defs into MemorySSA, as well as arbitrary movement around the CFG. It replaces the current splice API. It can be made to handle arbitrary control flow changes. Currently, it uses the same updater algorithm from D28934. The main difference is because MemorySSA is single variable, we have the complete def and use list, and don't need anyone to give it to us as part of the API. We also have to rename stores below us in some cases. If we go that direction in that patch, i will merge all the updater implementations (using an updater_traits or something to provide the get* functions we use, called read/write in that patch). Sadly, the current SSAUpdater algorithm is way too slow to use for what we are doing here. I have updated the tests we have to basically build memoryssa incrementally using the updater api, and make sure it still comes out the same. Reviewers: george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29047 llvm-svn: 293356	2017-01-28 01:23:13 +00:00
Quentin Colombet	351099022a	[RegisterCoalescing] Recommit the patch "Remove partial redundent copy". In r292621, the recommit fixes a bug related with live interval update after the partial redundent copy is moved. This recommit solves an additional bug related to the lack of update of subranges. The original patch is to solve the performance problem described in PR27827. Register coalescing sometimes cannot remove a copy because of interference. But if we can find a reverse copy in one of the predecessor block of the copy, the copy is partially redundent and we may remove the copy partially by moving it to the predecessor block without the reverse copy. Differential Revision: https://reviews.llvm.org/D28585 Re-apply r292621 Revert "Revert rL292621. Caused some internal build bot failures in apple." This reverts commit r292984. Original patch: Wei Mi <wmi@google.com> Subrange fix: Mostly Matthias Braun <matze@braunis.de> llvm-svn: 293353	2017-01-28 01:05:27 +00:00
Evgeniy Stepanov	d0852873e5	Fix memory leak in globalisel. #0 0x89cdeb in operator new[](unsigned long) /code/llvm/projects/compiler-rt/lib/asan/asan_new_delete.cc:84:37 #1 0x4ec87c4 in llvm::RegisterBankInfo::ValueMapping const* llvm::RegisterBankInfo::getOperandsMapping<llvm::RegisterBankInfo::ValueMapping const* const>(llvm::RegisterBankInfo::ValueMapping const const, llvm::RegisterBankInfo::ValueMapping const const) const /code/llvm/lib/CodeGen/GlobalISel/RegisterBankInfo.cpp:297:9 #2 0x9327ee in llvm::AArch64RegisterBankInfo::getInstrMapping(llvm::MachineInstr const&) const /code/llvm/lib/Target/AArch64/AArch64RegisterBankInfo.cpp:540:30 #3 0x4eb8d07 in llvm::RegBankSelect::assignInstr(llvm::MachineInstr&) /code/llvm/lib/CodeGen/GlobalISel/RegBankSelect.cpp:546:24 #4 0x4eb9dd2 in llvm::RegBankSelect::runOnMachineFunction(llvm::MachineFunction&) /code/llvm/lib/CodeGen/GlobalISel/RegBankSelect.cpp:624:12 #5 0x3141875 in llvm::MachineFunctionPass::runOnFunction(llvm::Function&) /code/llvm/lib/CodeGen/MachineFunctionPass.cpp:62:13 #6 0x396128d in llvm::FPPassManager::runOnFunction(llvm::Function&) /code/llvm/lib/IR/LegacyPassManager.cpp:1513:27 #7 0x3961832 in llvm::FPPassManager::runOnModule(llvm::Module&) /code/llvm/lib/IR/LegacyPassManager.cpp:1534:16 #8 0x3962540 in runOnModule /code/llvm/lib/IR/LegacyPassManager.cpp:1590:27 #9 0x3962540 in llvm::legacy::PassManagerImpl::run(llvm::Module&) /code/llvm/lib/IR/LegacyPassManager.cpp:1693 #10 0x8ae368 in compileModule(char*, llvm::LLVMContext&) /code/llvm/tools/llc/llc.cpp:562:8 #11 0x8a7a1b in main /code/llvm/tools/llc/llc.cpp:316:22 llvm-svn: 293351	2017-01-28 00:46:30 +00:00
Eugene Zelenko	e79c077ef9	[ARM] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 293348	2017-01-27 23:58:02 +00:00
Tim Northover	12bd22fbee	GlobalISel: don't leak super-entry BB when merging with IR-level one. We have to delete the block manually or it leaks. That triggers failures in -fsanitize=leak bots (unsurprisingly), which should be fixed by this patch. llvm-svn: 293347	2017-01-27 23:54:31 +00:00
Sanjay Patel	febcb9ce54	[InstCombine] move icmp transforms that might be recognized as min/max and inf-loop (PR31751) This is a minimal patch to avoid the infinite loop in: https://llvm.org/bugs/show_bug.cgi?id=31751 But the general problem is bigger: we're not canonicalizing all of the min/max forms reported by value tracking's matchSelectPattern(), and we don't define min/max consistently. Some code uses matchSelectPattern(), other code uses matchers like m_Umax, and others have their own inline definitions which may be subtly different from any of the above. The reason that the test cases in this patch need a cast op to trigger is because we don't (yet) canonicalize all min/max forms based on matchSelectPattern() in canonicalizeMinMaxWithConstant(), but we do make min/max+cast transforms based on matchSelectPattern() in visitSelectInst(). The location of the icmp transforms that trigger the inf-loop seems arbitrary at best, so I'm moving those behind the min/max fence in visitICmpInst() as the quick fix. llvm-svn: 293345	2017-01-27 23:26:27 +00:00
Peter Collingbourne	5ad775f2e8	Analysis: Add appropriate const qualification to functions in TypeMetadataUtils.cpp. NFC. llvm-svn: 293341	2017-01-27 22:55:30 +00:00
Kostya Serebryany	6d58dbb62f	[libFuzzer] make shmem more robust in the presence of signals llvm-svn: 293339	2017-01-27 22:41:30 +00:00
Artem Tamazov	33b01e9cfe	[AMDGPU][mc] Fix memory corruption uncovered by AddressSanitizer during coverage/smoke Gfx7/8 testing. Coverage/smoke Gfx7/8 tests were committed r292922 but then reverted by r292974 due to AddressSanitizer failure, which is fixed by this patch. Tests to be re-committed soon. llvm-svn: 293338	2017-01-27 22:19:42 +00:00
Tim Northover	d8b85584f2	GlobalISel: set correct regclass for LOAD_STACK_GUARD. Since it's not actually a generic MI, its register operands need a RegClass, which is conveniently the target's pointer RegClass. llvm-svn: 293335	2017-01-27 21:31:24 +00:00
Tim Northover	c9bc8a5580	GlobalISel: mark incoming landing-pad registers as live. Should fix machine verifier failures. llvm-svn: 293334	2017-01-27 21:31:17 +00:00
Krzysztof Parzyszek	35ce5dac7f	[Hexagon] Remove unused variable (and silence a warning) llvm-svn: 293331	2017-01-27 20:40:14 +00:00
Mehdi Amini	453ab3522b	Fix ASAN failure in cxa_demangle Found with ASAN + libFuzzer by Kostya Serebryany <kcc@google.com> llvm-svn: 293330	2017-01-27 20:32:16 +00:00
Mehdi Amini	888dee444b	Global DCE performance improvement Change the original algorithm so that it scales better when meeting very large bitcode where every instruction does not implies a global. The target query is "how to you get all the globals referenced by another global"? Before this patch, it was doing this by walking the body (or the initializer) and collecting the references. What this patch is doing, it precomputing the answer to this query for the whole module by walking the use-list of every global instead. Patch by: Serge Guelton <serge.guelton@telecom-bretagne.eu> Differential Revision: https://reviews.llvm.org/D28549 llvm-svn: 293328	2017-01-27 19:48:57 +00:00
Xinliang David Li	d289e4541f	[PGO] add debug option to view raw count after prof use annotation Differential Revision: https://reviews.llvm.org/D29045 llvm-svn: 293325	2017-01-27 19:06:25 +00:00
Matthias Braun	c91e28af4b	ScheduleDAGInstrs: Do not try to toggle kill flags on debug uses Preparation for upcoming changes. No testcase as none of the public targets bundles early enough and has a post machine scheduler enabled at the same time. The error is also easily catched by asserts. llvm-svn: 293324	2017-01-27 18:53:07 +00:00
Matthias Braun	26e8c350f9	ScheduleDAGInstrs: Cleanup toggleKillFlag(); NFC llvm-svn: 293323	2017-01-27 18:53:05 +00:00
Matthias Braun	bd7d91838e	ScheduleDAGInstrs: Cleanup; NFC Comment, doxygen and a bit of whitespace cleanup. llvm-svn: 293322	2017-01-27 18:53:00 +00:00
Tom Stellard	08efb7ebf6	AMDGPU/SI: Move some ISel helpers into utils so they can be shared with GISel Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D29068 llvm-svn: 293321	2017-01-27 18:41:14 +00:00
Konstantin Zhuravlyov	a304c83608	[AMDGPU] Grab MCSubtargetInfo from TargetMachine instead of constructing it Differential Revision: https://reviews.llvm.org/D29224 llvm-svn: 293318	2017-01-27 18:32:40 +00:00
Chris Ray	535e7d1547	[X86] Adding FFREEP instruction. Summary: Small change to get the FREEP instruction to decode properly. Reviewers: craig.topper Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29193 llvm-svn: 293314	2017-01-27 18:02:53 +00:00
Anna Thomas	e7d865e34e	NFC: Add debug tracing for more cases where loop unrolling fails. llvm-svn: 293313	2017-01-27 17:57:05 +00:00
Matt Arsenault	d8f7ea381f	AMDGPU: Enable FeatureFlatForGlobal on Volcanic Islands Accomplishes what r292982 was supposed to, which ended up only really making the necessary test changes. This should be applied to the 4.0 branch. Patch by Vedran Miletić <vedran@miletic.net> llvm-svn: 293310	2017-01-27 17:42:26 +00:00
Matt Arsenault	32b9600a7e	NVPTX: Make NVPTXInferAddressSpaces preserve CFG llvm-svn: 293308	2017-01-27 17:30:39 +00:00
Jun Bum Lim	b99a06b7c9	[CodeGenPrep]No negative cost in the ExtLd promotion Summary: This change prevent the signed value of cost from being negative as the value is passed as an unsigned argument. Reviewers: mcrosier, jmolloy, qcolombet, javed.absar Reviewed By: mcrosier, qcolombet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28871 llvm-svn: 293307	2017-01-27 17:16:37 +00:00
Stanislav Mekhanoshin	f6c1feb8c3	[AMDGPU] Turn AMDGPUUnifyMetadata back into module pass With the adjustPassManager interface that is now possible to use custom early module passes. Differential Revision: https://reviews.llvm.org/D29189 llvm-svn: 293300	2017-01-27 16:38:10 +00:00
Mehdi Amini	1726fc698c	Fix BasicAA incorrect assumption on GEP This is fixing pr31761: BasicAA is deducing NoAlias on the result of the GEP if the base pointer is itself NoAlias. This is possible only if the NoAlias on the base pointer is deduced with a non-sized query: this should guarantee that the pointers are belonging to different memory allocation and that the GEP can't legally jump from one to another. Differential Revision: https://reviews.llvm.org/D29216 llvm-svn: 293293	2017-01-27 16:12:22 +00:00
Ivan Krasin	c05c9db364	Avoid using unspecified ordering in MetadataLoader::MetadataLoaderImpl::parseOneMetadata. Summary: MetadataLoader::MetadataLoaderImpl::parseOneMetadata uses the following construct in a number of places: ``` MetadataList.assignValue(<...>, NextMetadataNo++); ``` There, NextMetadataNo gets incremented, and since the order of arguments evaluation is not specified, that can happen before or after other arguments are evaluated. In a few cases the other arguments indirectly use NextMetadataNo. For instance, it's ``` MetadataList.assignValue( GET_OR_DISTINCT(DIModule, (Context, getMDOrNull(Record[1]), getMDString(Record[2]), getMDString(Record[3]), getMDString(Record[4]), getMDString(Record[5]))), NextMetadataNo++); ``` getMDOrNull calls getMD that uses NextMetadataNo: ``` MetadataList.getMetadataFwdRef(NextMetadataNo); ``` Therefore, the order of evaluation becomes important. That caused a very subtle LLD crash that only happens if compiled with GCC or if LLD is built with LTO. In the case if LLD is compiled with Clang and regular linking mode, everything worked as intended. This change extracts incrementing of NextMetadataNo outside of the arguments list to guarantee the correct order of evaluation. For the record, this has taken 3 days to track to the origin. It all started with a ThinLTO bot in Chrome not being able to link a target if debug info is enabled. Reviewers: pcc, mehdi_amini Reviewed By: mehdi_amini Subscribers: aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D29204 llvm-svn: 293291	2017-01-27 15:54:49 +00:00
Simon Dardis	ca74dd79e9	[mips] Recommit: "N64 static relocation model support" This patch makes one change to GOT handling and two changes to N64's relocation model handling. Furthermore, the jumptable encodings have been corrected for static N64. Big GOT handling is now done via a new SDNode MipsGotHi - this node is unconditionally lowered to an lui instruction. The first change to N64's relocation handling is the lifting of the restriction that N64 always uses PIC. Now it is possible to target static environments. The second change adds support for 64 bit symbols and enables them by default. Previously N64 had patterns for sym32 mode only. In this mode all symbols are assumed to have 32 bit addresses. sym32 mode support is selectable with attribute 'sym32'. A follow on patch for clang will add the necessary frontend parameter. This partially resolves PR/23485. Thanks to Brooks Davis for reporting the issue! This version corrects a "Conditional jump or move depends on uninitialised value(s)" error detected by valgrind present in the original commit. Reviewers: dsanders, seanbruno, zoran.jovanovic, vkalintiris Differential Revision: https://reviews.llvm.org/D23652 llvm-svn: 293279	2017-01-27 11:36:52 +00:00
Alexey Bataev	4015bf8372	[SLP] Refactoring of horizontal reduction analysis, NFC. Some checks in SLP horizontal reduction analysis function are performed several times, though it is enough to perform these checks only once during an initial attempt at adding candidate for the reduction instruction/reduced value. Differential Revision: https://reviews.llvm.org/D29175 llvm-svn: 293274	2017-01-27 10:54:04 +00:00
Chandler Carruth	fd2d7c72fc	[LICM] When we are recomputing the alias sets for a subloop, we cannot skip sub-subloops. The logic to skip subloops dated from when this code was shared with the cached case. Once it was factored out to only run in the case of recomputed subloops it became a dangerous bug. If a subsubloop contained an interfering instruction it would be silently skipped from the alias sets for LICM. With the old pass manager this was extremely hard to trigger as it would require failing to visit these subloops with the LICM pass but then visiting the outer loop somehow. I've not yet contrived any test case that actually manages to trigger this. But with the new pass manager we don't do the cross-loop caching hack that the old PM does and so we recompute alias set information from first principles. While this seems much cleaner and simpler it exposed this bug and would subtly miscompile code due to failing to correctly model the aliasing constraints of deeply nested loops. llvm-svn: 293273	2017-01-27 10:27:32 +00:00
Jonas Paulsson	bb0ed3e732	[DAGTypeLegalizer] Handle SIGN/ZERO_EXTEND in WidenVecRes_Convert(). In case of a SIGN/ZERO_EXTEND of an incomplete vector type (using only a partial number of available vector elements), WidenVecRes_Convert() used to resort to scalarization. This patch adds a handling of the (common) case where an input vector can be found of same width as the widened result vector, by converting the node to SIGN/ZERO_EXTEND_VECTOR_INREG. Review: Eli Friedman llvm-svn: 293268	2017-01-27 07:46:26 +00:00
Richard Trieu	0b79aa3373	Fix unused variable warning. llvm-svn: 293260	2017-01-27 06:06:05 +00:00
Saleem Abdulrasool	26c00e3700	ARM: fix vectorized division on WoA The Windows on ARM target uses custom division for normal division as the backend needs to insert division-by-zero checks. However, it is designed to only handle non-vectorized division. ARM has custom lowering for vectorized division as that can avoid loading registers with the values and invoke a division routine for each one, preferring to lower using NEON instructions. Fall back to the custom lowering for the NEON instructions if we encounter a vectorized division. Resolves PR31778! llvm-svn: 293259	2017-01-27 03:41:53 +00:00
Daniel Berlin	c479686af2	NewGVN: Add basic dead and redundant store elimination Summary: This adds basic dead and redundant store elimination to NewGVN. Unlike our current DSE, it will happily do cross-block DSE if it meets our requirements. We get a bunch of DSE's simple.ll cases, and some stuff it doesn't. Unlike DSE, however, we only try to eliminate stores of the same value to the same memory location, not just general stores to the same memory location. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29149 llvm-svn: 293258	2017-01-27 02:37:11 +00:00
NAKAMURA Takumi	0d299191d0	NVPTXCodeGen: Add IPO to libdeps, since r293189. llvm-svn: 293256	2017-01-27 02:11:10 +00:00
Tim Shen	601ba8c583	[APFloat] Reduce some dispatch boilerplates. NFC. Summary: This is an attempt to reduce the verbose manual dispatching code in APFloat. This doesn't handle multiple dispatch on single discriminator (e.g. APFloat::add(const APFloat&)), nor handles multiple dispatch on multiple discriminators (e.g. APFloat::convert()). Reviewers: hfinkel, echristo, jlebar Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D29161 llvm-svn: 293255	2017-01-27 02:11:07 +00:00
Justin Lebar	25ebe2d767	[NVPTX] [InstCombine] Add llvm_unreachable to appease MSVC. llvm-svn: 293253	2017-01-27 02:04:07 +00:00
Justin Lebar	e3ac0fb948	[NVPTX] Fix use-after-stack-free bug in InstCombineCalls. Introduced in r293244. llvm-svn: 293251	2017-01-27 01:49:39 +00:00
Xin Tong	e5f8d643d4	Constant fold switch inst when looking for trivial conditions to unswitch on. Summary: Constant fold switch inst when looking for trivial conditions to unswitch on. Reviewers: sanjoy, chenli, hfinkel, efriedma Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D29037 llvm-svn: 293250	2017-01-27 01:42:20 +00:00
Chandler Carruth	baabda9317	[PM] Port LoopLoadElimination to the new pass manager and wire it into the main pipeline. This is a very straight forward port. Nothing weird or surprising. This brings the number of missing passes from the new PM's pipeline down to three. llvm-svn: 293249	2017-01-27 01:32:26 +00:00
Quentin Colombet	89dbea06f1	[ARM][LegalizerInfo] Specify the type of the opcode. This is to fix the win7 bot that does not seem to be very good at infering the type when it gets used in an initiliazer list. llvm-svn: 293248	2017-01-27 01:30:46 +00:00
Quentin Colombet	24203cf997	[AArch64][LegalizerInfo] Specify the type of the opcode. This is an attempt to fix the win7 bot that does not seem to be very good at infering the type when it gets used in an initiliazer list. llvm-svn: 293246	2017-01-27 01:13:30 +00:00
Quentin Colombet	e15e460c05	Revert "[AArch64][LegalizerInfo] Specify the type of the initialization list." This reverts commit r293238. Even with that the win7 bot is still failing: http://lab.llvm.org:8011/builders/lld-x86_64-win7/builds/3862 llvm-svn: 293245	2017-01-27 01:13:25 +00:00
Justin Lebar	698c31b8db	[NVPTX] Upgrade NVVM intrinsics in InstCombineCalls. Summary: There are many NVVM intrinsics that we can't entirely get rid of, but that nonetheless often correspond to target-generic LLVM intrinsics. For example, if flush denormals to zero (ftz) is enabled, we can convert @llvm.nvvm.ceil.ftz.f to @llvm.ceil.f32. On the other hand, if ftz is disabled, we can't do this, because @llvm.ceil.f32 will be lowered to a non-ftz PTX instruction. In this case, we can, however, simplify the non-ftz nvvm ceil intrinsic, @llvm.nvvm.ceil.f, to @llvm.ceil.f32. These transformations are particularly useful because they let us constant fold instructions that appear in libdevice, the bitcode library that ships with CUDA and essentially functions as its libm. Reviewers: tra Subscribers: hfinkel, majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D28794 llvm-svn: 293244	2017-01-27 00:58:58 +00:00
Justin Lebar	322c127bee	[ValueTracking] Add comment that CannotBeOrderedLessThanZero does the wrong thing for powi. Summary: CannotBeOrderedLessThanZero(powi(x, exp)) returns true if CannotBeOrderedLessThanZero(x). But powi(-0, exp) is negative if exp is odd, so we actually want to return SignBitMustBeZero(x). Except that also isn't right, because we want to return true if x is NaN, even if x has a negative sign bit. What we really need in order to fix this is a consistent approach in this function to handling the sign bit of NaNs. Without this it's very difficult to say what the correct behavior here is. Reviewers: hfinkel, efriedma, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28927 llvm-svn: 293243	2017-01-27 00:58:34 +00:00
Justin Lebar	cb9b41dd76	[LangRef] Make @llvm.sqrt(x) return undef, rather than have UB, for negative x. Summary: Some frontends emit a speculate-and-select idiom for sqrt, wherein they compute sqrt(x), check if x is negative, and select NaN if it is: %cmp = fcmp olt double %a, -0.000000e+00 %sqrt = call double @llvm.sqrt.f64(double %a) %ret = select i1 %cmp, double 0x7FF8000000000000, double %sqrt This is technically UB as the LangRef is written today if %a is ever less than -0. But emitting code that's compliant with the current definition of sqrt would require a branch, which would then prevent us from matching this idiom in SelectionDAG (which we do today -- ISD::FSQRT has defined behavior on negative inputs), because SelectionDAG looks at one BB at a time. Nothing in LLVM takes advantage of this undefined behavior, as far as we can tell, and the fact that llvm.sqrt has UB dates from its initial addition to the LangRef. Reviewers: arsenm, mehdi_amini, hfinkel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D28797 llvm-svn: 293242	2017-01-27 00:58:03 +00:00
Chandler Carruth	a95ff38924	[PM] Flesh out almost all of the late loop passes. With this the per-module pass pipeline is extremely close to the legacy PM. The missing pieces are: - PruneEH (or some equivalent) - ArgumentPromotion - LoopLoadElimination - LoopUnswitch I'm going to work through those in essentially that order but this seems like a worthwhile incremental step toward the end state. One difference in what I have here from the legacy PM is that I've consolidated some of the per-function passes at the very end of the pipeline into the main optimization function pipeline. The intervening passes are really uninteresting and so this seems very likely to have any effect other than minor improvement to locality. Note that there are still some failures in the test suite, but the compiler doesn't crash or assert. Differential Revision: https://reviews.llvm.org/D29114 llvm-svn: 293241	2017-01-27 00:50:21 +00:00
Kostya Serebryany	70182deaae	[libFuzzer] simplify the value profiling callback further: don't use (idx MOD prime) on the hot path where it is useless anyway llvm-svn: 293239	2017-01-27 00:39:12 +00:00
Quentin Colombet	86fc8305ec	[AArch64][LegalizerInfo] Specify the type of the initialization list. This is an attempt to fix the win7 bot that does not seem to be very good at infering the type. llvm-svn: 293238	2017-01-27 00:39:03 +00:00
Kostya Serebryany	8e9ac42742	[libFuzzer] make sure (again) that __builtin_popcountl is compiled into popcnt llvm-svn: 293237	2017-01-27 00:20:55 +00:00
Kostya Serebryany	7f058972ee	[libFuzzer] simplify the value profile code and disable asan/msan on it llvm-svn: 293236	2017-01-27 00:09:59 +00:00
Adrian McCarthy	8f713190e7	NFC: Rename PDB_ReaderType::Raw to Native for consistency with the NativeSession rename. llvm-svn: 293235	2017-01-27 00:01:55 +00:00
Eugene Zelenko	e6cf4374b0	[ARM] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 293229	2017-01-26 23:40:06 +00:00
Tim Northover	09aac4ad2a	GlobalISel: support debug intrinsics. The translation scheme is mostly cribbed from FastISel, and it's not entirely convincing semantically. But it does seem to work in the common cases and allow variables to be printed so it can't be all wrong. llvm-svn: 293228	2017-01-26 23:39:14 +00:00
Sanjoy Das	7516192a71	Revert a couple of InstCombine/Guard checkins This change reverts: r293061: "[InstCombine] Canonicalize guards for NOT OR condition" r293058: "[InstCombine] Canonicalize guards for AND condition" They miscompile cases like: ``` declare void @llvm.experimental.guard(i1, ...) define void @test_guard_not_or(i1 %A, i1 %B) { %C = or i1 %A, %B %D = xor i1 %C, true call void(i1, ...) @llvm.experimental.guard(i1 %D, i32 20, i32 30)[ "deopt"() ] ret void } ``` because they do transfer the `i32 20, i32 30` parameters to newly created guard instructions. llvm-svn: 293227	2017-01-26 23:38:11 +00:00
Andrew Kaylor	a0a1164ce4	Add intrinsics for constrained floating point operations This commit introduces a set of experimental intrinsics intended to prevent optimizations that make assumptions about the rounding mode and floating point exception behavior. These intrinsics will later be extended to specify flush-to-zero behavior. More work is also required to model instruction dependencies in machine code and to generate these instructions from clang (when required by pragmas and/or command line options that are not currently supported). Differential Revision: https://reviews.llvm.org/D27028 llvm-svn: 293226	2017-01-26 23:27:59 +00:00
Chandler Carruth	79b733bc6b	[PM] Enable the main loop pass pipelines with everything but loop-unswitch in the main pipelines for the new PM. All of these now work, and Clang built using this pipeline can build the test suite and SPEC without hitting any asserts of ASan failures. There are still some bugs hiding though -- 7 tests regress with the new PM. I'm going to be investigating these, but it seems worthwhile to at least get the pipelines in place so that others can play with them, and they aren't completely broken. Differential Revision: https://reviews.llvm.org/D29113 llvm-svn: 293225	2017-01-26 23:21:17 +00:00
Krzysztof Parzyszek	d6c8e3c9ce	[Hexagon] Require IPO library in Hexagon build This should unbreak the Hexagon build bots. llvm-svn: 293221	2017-01-26 23:03:22 +00:00
Daniel Berlin	1ea5f324bd	NewGVN: Fix bug exposed by PR31761 Summary: This does not actually fix the testcase in PR31761 (discussion is ongoing on the testcase), but does fix a bug it exposes, where stores were not properly clobbering loads. We accomplish this by unifying the memory equivalence infratructure back into the normal congruence infrastructure, and then properly destroying congruence classes when memory state leaders disappear. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29195 llvm-svn: 293216	2017-01-26 22:21:48 +00:00
Sanjay Patel	50753f02c2	[InstCombine] fold (X >>u C) << C --> X & (-1 << C) We already have this fold when the lshr has one use, but it doesn't need that restriction. We may be able to remove some code from foldShiftedShift(). Also, move the similar: (X << C) >>u C --> X & (-1 >>u C) ...directly into visitLShr to help clean up foldShiftByConstOfShiftByConst(). That whole function seems questionable since it is called by commonShiftTransforms(), but there's really not much in common if we're checking the shift opcodes for every fold. llvm-svn: 293215	2017-01-26 22:08:10 +00:00
Krzysztof Parzyszek	c8b943860f	[Hexagon] Add Hexagon-specific loop idiom recognition pass llvm-svn: 293213	2017-01-26 21:41:10 +00:00
Daniel Berlin	db3c7be069	NewGVN: Add algorithm overview llvm-svn: 293212	2017-01-26 21:39:49 +00:00

1 2 3 4 5 ...

99053 Commits