llvm-project

Commit Graph

Author	SHA1	Message	Date
Jeremy Morse	6d3cd3b4ec	[DebugInfo][DAG] Refactor dbg.value lowering into its own method This is a pure copy-and-paste job, moving the logic for lowering dbg.value intrinsics to SDDbgValues into its own function. This is ahead of adding some more users of this logic. Differential Revision: https://reviews.llvm.org/D57697 llvm-svn: 353950	2019-02-13 15:53:10 +00:00
Andrea Di Biagio	245163ffd0	[MCA] Store a bitmask of used groups in the instruction descriptor. This is to speedup 'checkAvailability' queries in class ResourceManager. No functional change intended. llvm-svn: 353949	2019-02-13 14:56:06 +00:00
Jeremy Morse	a9a11aac0f	[DebugInfo][DAG] Limit special-casing of dbg.values for Arguments SelectionDAGBuilder has special handling for dbg.value intrinsics that are understood to define the location of function parameters on entry to the function. To enable this, we avoid recording a dbg.value as a virtual register reference if it might be such a parameter, so that it later hits EmitFuncArgumentDbgValue. This patch reduces the set of circumstances where we avoid recording a dbg.value as a virtual register reference, to allow more "normal" variables to be recorded that way. We now only bypass for potential parameters if: * The dbg.value operand is an Argument, * The Variable is a parameter, and * The Variable is not inlined. meaning it's very likely that the dbg.value is a function-entry parameter location. Differential Revision: https://reviews.llvm.org/D57584 llvm-svn: 353948	2019-02-13 13:37:33 +00:00
Max Kazantsev	3fe9ad7a9f	[NFC] Add const qualifiers where possible llvm-svn: 353941	2019-02-13 11:54:45 +00:00
Serge Guelton	699c22839a	Revert r353927 llvm-svn: 353940	2019-02-13 11:35:45 +00:00
Diana Picus	aa4118a873	[ARM GlobalISel] Support G_SELECT for Thumb2 Same as arm mode, but slightly different opcodes. llvm-svn: 353938	2019-02-13 11:25:32 +00:00
Andrea Di Biagio	318f990aee	[MCA][Scheduler] Use latency information to further classify busy instructions. This patch introduces a new instruction stage named 'IS_PENDING'. An instruction transitions from the IS_DISPATCHED to the IS_PENDING stage if input registers are not available, but their latency is known. This patch also adds a new set of instructions named 'PendingSet' to class Scheduler. The idea is that the PendingSet will only contain instructions that have reached the IS_PENDING stage. By construction, an instruction in the PendingSet is only dependent on instructions that have already reached the execution stage. The plan is to use this knowledge to identify bottlenecks caused by data dependencies (see PR37494). Differential Revision: https://reviews.llvm.org/D58066 llvm-svn: 353937	2019-02-13 11:02:42 +00:00
Jeremy Morse	f10af3f134	[DebugInfo][InstCombine] Prefer to salvage debuginfo over sinking it When instcombine sinks an instruction between two basic blocks, it sinks any dbg.value users in the source block with it, to prevent debug use-before-free. However we can do better by attempting to salvage the debug users, which would avoid moving where the variable location changes. If we successfully salvage, still sink a (cloned) dbg.value with the sunk instruction, as the sunk instruction is more likely to be "live" later in the compilation process. If we can't salvage dbg.value users of a sunk instruction, mark the dbg.values in the original block as being undef. This terminates any earlier variable location range, and represents the fact that we've optimized out the variable location for a portion of the program. Differential Revision: https://reviews.llvm.org/D56788 llvm-svn: 353936	2019-02-13 10:54:53 +00:00
Serge Guelton	f8ffb926e2	Missing header llvm-svn: 353933	2019-02-13 10:19:06 +00:00
Max Kazantsev	2bb95e7c76	[GuardWidening] Support widening of explicitly expressed guards This patch adds support of guards expressed in explicit form via `widenable_condition` in Guard Widening pass. Differential Revision: https://reviews.llvm.org/D56075 Reviewed By: reames llvm-svn: 353932	2019-02-13 09:56:30 +00:00
David Stenberg	9dbeca3d77	[DebugInfo] Stop changing labels for register-described parameter DBG_VALUEs Summary: This is a follow-up to D57510. This patch stops DebugHandlerBase from changing the starting label for the first non-overlapping, register-described parameter DBG_VALUEs to the beginning of the function. That code did not consider what defined the registers, which could result in the ranges for the debug values starting before their defining instructions. We currently do not emit debug values for constant values directly at the start of the function, so this code is still useful for such values, but my intention is to remove the code from DebugHandlerBase completely when we get there. One reason for removing it is that the code violates the history map's ranges, which I think can make it quite confusing when troubleshooting. In D57510, PrologEpilogInserter was amended so that parameter DBG_VALUEs now are kept at the start of the entry block, even after emission of prologue code. That was done to reduce the degradation of debug completeness from this patch. PR40638 is another example, where the lexical-scope trimming that LDV does, in combination with scheduling, results in instructions after the prologue being left without locations. There might be other cases where the DBG_VALUEs are pushed further down, for which the DebugHandlerBase code may be helpful, but as it now quite often result in incorrect locations, even after the prologue, it seems better to remove that code, and try to work our way up with accurate locations. In the long run we should maybe not aim to provide accurate locations inside the prologue. Some single location descriptions, at least those referring to stack values, generate inaccurate values inside the epilogue, so we maybe should not aim to achieve accuracy for location lists. However, it seems that we now emit line number programs that can result in GDB and LLDB stopping inside the prologue when doing line number stepping into functions. See PR40188 for more information. A summary of some of the changed test cases is available in PR40188#c2. Reviewers: aprantl, dblaikie, rnk, jmorse Reviewed By: aprantl Subscribers: jdoerfert, jholewinski, jvesely, javed.absar, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D57511 llvm-svn: 353928	2019-02-13 09:34:07 +00:00
Serge Guelton	ab061d351e	Make llvm::Optional<T> trivially copyable when T is trivially copyable This is an ever-recurring issue (see https://bugs.llvm.org/show_bug.cgi?id=39427 and https://bugs.llvm.org/show_bug.cgi?id=35978) but I believe that thanks to https://reviews.llvm.org/D54472 we can now ship a decent implementation of this. Basically the fact that llvm::is_trivially_copyable has a consistent behavior across compilers should prevent any ABI issue, and using in-place new instead of memcpy should keep compiler bugs away. Differential Revision: https://reviews.llvm.org/D57097 llvm-svn: 353927	2019-02-13 09:31:22 +00:00
Michal Gorny	5590548f6b	[llvm] [cmake] Provide split include paths in LLVMConfig Modify LLVMConfig to provide split variables for in-source and generated include paths. Currently, it uses a single value for both LLVM_INCLUDE_DIRS and LLVM_INCLUDE_DIR which works for install tree but fails hard at build tree (where LLVM_INCLUDE_DIR incorrectly contains multiple values). Instead, put the generated directory in LLVM_INCLUDE_DIR, and the source tree in LLVM_MAIN_INCLUDE_DIR which is consistent with in-LLVM builds. For install tree, both variables will have the same value. Differential Revision: https://reviews.llvm.org/D58109 llvm-svn: 353924	2019-02-13 08:34:40 +00:00
Anton Afanasyev	ca9aff9353	[X86][SLP] Enable SLP vectorization for 128-bit horizontal X86 instructions (add, sub) Try to use 64-bit SLP vectorization. In addition to horizontal instrs this change triggers optimizations for partial vector operations (for instance, using low halfs of 128-bit registers xmm0 and xmm1 to multiply <2 x float> by <2 x float>). Fixes llvm.org/PR32433 llvm-svn: 353923	2019-02-13 08:26:43 +00:00
Craig Topper	9b61f48e4b	[X86] Use default expansion for (i64 fp_to_uint f80) when avx512 is enabled on 64-bit targets to match what happens without avx512. In 64-bit mode prior to avx512 we use Expand, but with avx512 we need to make f32/f64 conversions Legal so we use Custom and then do our own expansion for f80. But this seems to produce codegen differences relative to avx2. This patch corrects this. llvm-svn: 353921	2019-02-13 07:42:34 +00:00
Craig Topper	3099e442a6	[X86] Refactor the FP_TO_INTHelper interface. NFCI -Pull the final stack load creation from the two callers into the helper. -Return a single SDValue instead of a std::pair. -Remove the Replace flag which isn't really needed. llvm-svn: 353920	2019-02-13 07:42:31 +00:00
Eugene Leviant	2db1062906	[llvm-objcopy] Add --strip-unneeded-symbol(s) Differential revision: https://reviews.llvm.org/D58027 llvm-svn: 353919	2019-02-13 07:34:54 +00:00
Max Kazantsev	5cf777e413	[LoopSimplifyCFG] Re-enable const branch folding by default Known underlying bugs have been fixed, intensive fuzz testing did not find any new problems. Re-enabling by default. Feel free to revert if it causes any functional failures. llvm-svn: 353911	2019-02-13 06:12:48 +00:00
Fangrui Song	12d5599000	[llvm-readobj] Dump GNU_PROPERTY_X86_FEATURE_2_{NEEDED,USED} notes in .note.gnu.property Summary: And change the output ("X86 features" -> "x86 feature") a bit. Reviewers: grimar, xiangzhangllvm, hjl.tools, rupprecht Reviewed By: rupprecht Subscribers: rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58112 llvm-svn: 353908	2019-02-13 01:51:45 +00:00
Reid Kleckner	afe1e3e669	[MC] Make symbol version errors non-fatal We stil don't have a source location, which is pretty lame, but at least we won't tell the user to file a clang bug report anymore. Fixes PR40712 llvm-svn: 353907	2019-02-13 01:39:32 +00:00
Jonas Devlieghere	9ea90acfeb	[dsymutil] Improve readability of cloneAllCompileUnits (NFC) Add some newlines and improve consistency between the two loops. llvm-svn: 353904	2019-02-13 00:32:21 +00:00
Jonas Devlieghere	1bf1b9857f	[dsymutil] Don't clone empty CUs The DWARF standard says that an empty compile unit is not valid: > Each such contribution consists of a compilation unit header (see > Section 7.5.1.1 on page 200) followed by a single DW_TAG_compile_unit or > DW_TAG_partial_unit debugging information entry, together with its > children. Therefore we shouldn't clone them in dsymutil. Differential revision: https://reviews.llvm.org/D57979 llvm-svn: 353903	2019-02-13 00:32:06 +00:00
Alina Sbirlea	0a8bc14ad7	[MemorySSA & LoopPassManager] Add remaining book keeping [NFCI]. Add plumbing to get MemorySSA in the remaining loop passes. Also update unit test to add the dependency. [EnableMSSALoopDependency remains disabled]. llvm-svn: 353901	2019-02-12 23:48:02 +00:00
Matt Arsenault	4cd9509e1d	AMDGPU: Try to use function specific ST Subtargets are a function level property, so ideally we would eliminate everywhere that needs to check the global one. Rename the function to try avoiding confusion. llvm-svn: 353900	2019-02-12 23:44:13 +00:00
Matt Arsenault	d24296e282	AMDGPU: Ignore CodeObjectV3 when inlining This was inhibiting inlining of library functions when clang was invoking the inliner directly. This is covering a bit of a mess with subtarget feature handling, and this shouldn't be a subtarget feature. The behavior is different depending on whether you are using a -mattr flag in clang, or llc, opt. llvm-svn: 353899	2019-02-12 23:30:11 +00:00
Jonas Paulsson	749dc51e45	[SystemZ] Remember to cast value to void to disable warning. Hopefully fixes buildbot problems. llvm-svn: 353898	2019-02-12 23:13:18 +00:00
Alina Sbirlea	8567ff0c34	[LICM] Cap the clobbering calls in LICM. Summary: Unlimitted number of calls to getClobberingAccess can lead to high compile times in pathological cases. Switching EnableLicmCap flag from bool to int, and enabling to default 100. (tested to be appropriate for current bechmarks) We can revisit this value when enabling MemorySSA. Reviewers: sanjoy, chandlerc, george.burgess.iv Subscribers: jlebar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57968 llvm-svn: 353897	2019-02-12 23:05:40 +00:00
Philip Reames	3908221356	[Tests] A few more live-in deopt lowering tests Nothing super interesting, just making sure obvious cases work. llvm-svn: 353895	2019-02-12 23:00:07 +00:00
Konstantin Zhuravlyov	6220d62e5c	AMDGPU/NFC: Remove SubtargetFeatureISAVersion since it is not used anywhere llvm-svn: 353892	2019-02-12 22:49:49 +00:00
Konstantin Zhuravlyov	acb231c8d8	AMDGPU: Remove duplicate processor (gfx900) llvm-svn: 353889	2019-02-12 22:29:25 +00:00
David Major	5b07e30408	[gn build] Separate debug and optimization settings This patch adds an `is_optimized` variable, orthogonal to `is_debug`, to allow for a gn analogue to `RelWithDebInfo` builds. As part of this we'll want to explicitly enable GC+ICF, for the sake of `is_debug && is_optimized` builds. The flags normally default to true except that if you pass `/DEBUG` they default to false. Differential Revision: https://reviews.llvm.org/D58075 llvm-svn: 353888	2019-02-12 22:24:45 +00:00
Bjorn Pettersson	ecd0960718	[SelectionDAG] Clean up comments in SelectionDAGBuilder.h. NFC Remove redundant function/variable names from doxygen comments (as suggested in https://reviews.llvm.org/D57697). llvm-svn: 353886	2019-02-12 22:11:20 +00:00
Erik Pilkington	4ecd7a90a6	Fix auto-upgrade for the new parameter to llvm.objectsize r352664 added a 'dynamic' parameter to objectsize, but the AutoUpgrade changes were incomplete. Also, fix an off-by-one error I made in the upgrade logic that is now no longer unreachable. Differential revision: https://reviews.llvm.org/D58071 llvm-svn: 353884	2019-02-12 21:55:38 +00:00
Sanjay Patel	cf3a906fb4	[ConstProp] add test for miscompile from bitcast transform; NFC This problem goes with the fix in D51215. llvm-svn: 353883	2019-02-12 21:49:56 +00:00
Jordan Rupprecht	08c3841b21	[llvm-dwp] Use color-formatted error reporting llvm-svn: 353876	2019-02-12 20:37:33 +00:00
Sean Fertile	9850a48275	Fix undefined behaviour in PPCInstPrinter::printBranchOperand. Fix the undefined behaviour introduced by my previous patch r353865 (left shifting a potentially negative value), which was caught by the bots that run UBSan. llvm-svn: 353874	2019-02-12 20:03:04 +00:00
Jordan Rupprecht	706a965295	[llvm-dwp] Avoid writing the output dwp file when there is an error Summary: Use ToolOutputFile to clean up the output file unless dwp actually finishes successfully. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58130 llvm-svn: 353873	2019-02-12 20:00:51 +00:00
Nikita Popov	a3be17ea1c	[AArch64] Expand v8i8 cttz (PR39729) Fix for https://bugs.llvm.org/show_bug.cgi?id=39729. Rather than adding just a case for v8i8 I'm setting cttz to expand for all vector types. Differential Revision: https://reviews.llvm.org/D58008 llvm-svn: 353872	2019-02-12 18:55:53 +00:00
Philip Reames	7403fac3a8	[InlineSpiller] Fix a crash due to lack of forward progress from remat (try 2) This is a recommit of r335091 Add more test cases for deopt-operands via regalloc, and r335077 [InlineSpiller] Fix a crash due to lack of forward progress from remat specifically for STATEPOINT. They were reverted due to a crash. This change includes the text of both original changes, but also includes three aditional pieces: 1) A bug fix for the observed crash. I had failed to record the failed remat value as live which resulted in an instruction being deleted which still had uses. With the machine verifier, this is caught quickly. Without it, we fail in StackSlotColoring due to an empty live interval from LiveStack. 2) A test case which demonstrates the fix for (1). See @test11. 3) A control flag which defaults to disabling this for the moment. Once I've run more extensive validaton, I will switch the default and then remove this flag. llvm-svn: 353871	2019-02-12 18:33:01 +00:00
Jonas Paulsson	34bead750c	[SystemZ] Use VGM whenever possible to load FP immediates. isFPImmLegal() has been extended to recognize certain FP immediates that can be built with VGM (Vector Generate Mask). These scalar FP immediates (that were previously loaded from the constant pool) are now selected as VGMF/VGMG in Select(). Review: Ulrich Weigand https://reviews.llvm.org/D58003 llvm-svn: 353867	2019-02-12 18:06:06 +00:00
Sean Fertile	c069452027	[PowerPC] Fix printing of negative offsets in call instruction dissasembly. llvm-svn: 353865	2019-02-12 17:48:22 +00:00
Jessica Paquette	acbb7ca26c	[GlobalISel][NFC] Gardening: Make translateSimpleUnaryIntrinsic general Instead of only having this code work for unary intrinsics, have it work for an arbitrary number of parameters. Factor out the cases that fall under this (fma, pow). This makes it a bit easier to add more intrinsics which don't require any special work. Differential Revision: https://reviews.llvm.org/D58079 llvm-svn: 353863	2019-02-12 17:38:34 +00:00
Daniel Sanders	dff673bb52	[tablegen] Add locations to many PrintFatalError() calls Summary: While working on the GISel Combiner, I noticed I was producing location-less error messages fairly often and set about fixing this. In the process, I noticed quite a few places elsewhere in TableGen that also neglected to include a relevant location. This patch adds locations to errors that relate to a specific record (or a field within it) and also have easy access to the relevant location. This is particularly useful when multiclasses are involved as many of these errors refer to the full name of a record and it's difficult to guess which substring is grep-able. Unfortunately, tablegen currently only supports Record granularity so it's not currently possible to point at a specific Init so these sometimes point at the record that caused the error rather than the precise origin of the error. Reviewers: bogner, aditya_nandakumar, volkan, aemerson, paquette, nhaehnle Reviewed By: nhaehnle Subscribers: jdoerfert, nhaehnle, asb, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, jrtc27, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58077 llvm-svn: 353862	2019-02-12 17:36:57 +00:00
Jessica Paquette	0e71e73faa	[GlobalISel][AArch64] Select llvm.bswap* for non-vector types This teaches the IRTranslator to emit G_BSWAP when it runs into Intrinsic::bswap. This allows us to select G_BSWAP for non-vector types in AArch64. Add a select-bswap.mir test, and add global isel checks to a couple existing tests in test/CodeGen/AArch64. This doesn't handle every bswap case, since some of these rely on known bits stuff. This just lets us handle the naive case. Differential Revision: https://reviews.llvm.org/D58081 llvm-svn: 353861	2019-02-12 17:28:17 +00:00
Simon Pilgrim	5338f41ced	[X86][AVX] Enable shuffle combining support for zero_extend A more limited version of rL352997 that had to be disabled in rL353198 - allow extension of any 128/256/512 bit vector that at least uses byte sized scalars. llvm-svn: 353860	2019-02-12 17:22:35 +00:00
Sanjay Patel	86fac11d5a	[DAGCombiner] convert logic-of-setcc into bit magic (PR40611) If we're comparing some value for equality against 2 constants and those constants have an absolute difference of just 1 bit, then we can offset and mask off that 1 bit and reduce to a single compare against zero: and/or (setcc X, C0, ne), (setcc X, C1, ne/eq) --> setcc ((add X, -C1), ~(C0 - C1)), 0, ne/eq https://rise4fun.com/Alive/XslKj This transform is disabled by default using a TLI hook ("convertSetCCLogicToBitwiseLogic()"). That should be overridden for AArch64, MIPS, Sparc and possibly others based on the asm shown in: https://bugs.llvm.org/show_bug.cgi?id=40611 llvm-svn: 353859	2019-02-12 17:07:47 +00:00
Sanjay Patel	ab7e26a2de	[x86] add negative tests for setcc folds; NFC llvm-svn: 353855	2019-02-12 16:44:37 +00:00
whitequark	77ccc2eba4	[SelectionDAG] Fix return calling convention in expansion of ?MULO Summary: The SMULO/UMULO DAG nodes, when not directly supported by the target, expand to a multiplication twice as wide. In case that the resulting type is not legal, the legalizer cannot directly call the intrinsic with the wide arguments; instead, it "pre-lowers" them by splitting them in halves. rL283203 made sure that on big endian targets, the legalizer passes the argument halves in the correct order. It did not do the same for the return value halves because the existing code used a hack; it put an illegal type into DAG and hoped that nothing would break and it would be correctly lowered elsewhere. rL307207 fixed this, handling return value halves similar to how argument handles are handled, but did not take big-endian targets into account. This commit fixes the expansion on big-endian targets, such as the out-of-tree OR1K target. Reviewers: eli.friedman, vadimcn Subscribers: george-hopkins, efriedma, llvm-commits Differential Revision: https://reviews.llvm.org/D45355 llvm-svn: 353854	2019-02-12 16:41:50 +00:00
Andrea Di Biagio	d30fff9a90	[MCA] Improved debug prints. NFC llvm-svn: 353852	2019-02-12 16:18:57 +00:00
Simon Pilgrim	015cc0f0fa	[PowerPC] Regenerate test llvm-svn: 353851	2019-02-12 16:10:50 +00:00
Matt Arsenault	a180554020	AMDGPU/GlobalISel: Add more insert/extract testcases llvm-svn: 353848	2019-02-12 15:04:03 +00:00
David Green	c93c6f3274	[Codegen] Make sure kill flags are not incorrect from removed machine phi's We need to clear the kill flags on both SingleValReg and OldReg, to ensure they remain conservatively correct. Differential Revision: https://reviews.llvm.org/D58114 llvm-svn: 353847	2019-02-12 15:02:57 +00:00
Jordan Rupprecht	4b78d4f347	[llvm-dwp] Abort when dwo_id is unset Summary: An empty dwo_id indicates a degenerate .dwo file that should not have been generated in the first place. Instead of discovering this error later when merging with another degenerate .dwo file, print an error immediately when noticing an unset dwo_id, including the filename of the offending file. Test case created by compiling a trivial file w/ `-fno-split-dwarf-inlining -gmlt -gsplit-dwarf -c` prior to r353771 Reviewers: dblaikie Reviewed By: dblaikie Subscribers: jdoerfert, aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58085 llvm-svn: 353846	2019-02-12 15:01:07 +00:00
Matt Arsenault	00ccd13c73	AMDGPU/GlobalISel: Only make f16 constants legal on f16 targets We could deal with it, but there's no real point. llvm-svn: 353845	2019-02-12 14:54:55 +00:00
Matt Arsenault	996c66620e	GlobalISel: Use default rounding mode when extending fconstant I don't think this matters since the values should all be exactly representable. llvm-svn: 353844	2019-02-12 14:54:54 +00:00
Matt Arsenault	1cf713664d	GlobalISel: Move some more legalize cases into functions llvm-svn: 353843	2019-02-12 14:54:52 +00:00
Sam McCall	a860219c5e	[LoopSimplifyCFG] Fix test broken in release mode in r353813 llvm-svn: 353842	2019-02-12 14:43:30 +00:00
Max Kazantsev	4a1c02987e	[NFC] Simplify code & reduce nest slightly llvm-svn: 353832	2019-02-12 11:31:46 +00:00
Jeremy Morse	b33a5c7347	[DebugInfo] Don't salvage load operations (PR40628). Salvaging a redundant load instruction into a debug expression hides a memory read from optimisation passes. Passes that alter memory behaviour (such as LICM promoting memory to a register) aren't aware of these debug memory reads and leave them unaltered, making the debug variable location point somewhere unsafe. Teaching passes to know about these debug memory reads would be challenging and probably incomplete. Finding dbg.value instructions that need to be fixed would likely be computationally expensive too, as more analysis would be required. It's better to not generate debug-memory-reads instead, alas. Changed tests: * DeadStoreElim: test for salvaging of intermediate operations contributing to the dead store, instead of salvaging of the redundant load, * GVN: remove debuginfo behaviour checks completely, this behaviour is still covered by other tests, * InstCombine: don't test for salvaged loads, we're removing that behaviour. Differential Revision: https://reviews.llvm.org/D57962 llvm-svn: 353824	2019-02-12 10:54:30 +00:00
David Stenberg	bbd2f97293	[DebugInfo] Keep parameter DBG_VALUEs before prologue code Summary: This is a preparatory change for removing the code from DebugHandlerBase::beginFunction() which changes the starting label for the first non-overlapping DBG_VALUEs of parameters to the beginning of the function. It does that to be able to show parameters when entering a function. However, that code does not consider what defines the values, which can result in the ranges for the debug values starting before their defining instructions. That code is removed in a follow-up patch. When prologue code is inserted, it leads to DBG_VALUEs that start directly in the entry block being moved down after the prologue instructions. This patch fixes that by stashing away DBG_VALUEs for parameters before emitting the prologue, and then reinserts them at the start of the block. This assumes that there is no target that somehow clobbers parameter registers in the frame setup; there is no such case in the lit tests at least. See PR40188 for more information. Reviewers: aprantl, dblaikie, rnk, jmorse Reviewed By: aprantl Subscribers: bjope, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D57510 llvm-svn: 353823	2019-02-12 10:51:27 +00:00
Max Kazantsev	2a184af221	[IndVars] Fix corner case with unreachable Phi inputs. PR40454 Logic in `getInsertPointForUses` doesn't account for a corner case when `Def` only comes to a Phi user from unreachable blocks. In this case, the incoming value may be arbitrary (and not even available in the input block) and break the loop-related invariants that are asserted below. In fact, if we encounter this situation, no IR modification is needed. This Phi will be simplified away with nearest cleanup. Differential Revision: https://reviews.llvm.org/D58045 Reviewed By: spatel llvm-svn: 353816	2019-02-12 09:59:44 +00:00
Fangrui Song	8e0d5ac715	[llvm-readobj] Only allow 4-byte pr_data Summary: AMD64 psABI says: "The pr_data field of each property contains a 4-byte unsigned integer." Thus we don't need to handle 8-byte pr_data. Reviewers: mike.dvoretsky, grimar, craig.topper, xiangzhangllvm, hjl.tools Reviewed By: grimar Subscribers: rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58103 llvm-svn: 353815	2019-02-12 09:56:01 +00:00
George Rimar	b1d6f52005	[llvm-readobj] - Simplify .gnu.version_r dumping a bit. Current implementation takes "Number of needed versions" from DT_VERNEEDNUM dynamic tag entry. Though it would be a bit simpler to take it from sh_info section header field directly: https://docs.oracle.com/cd/E19683-01/816-1386/chapter6-94076/index.html Differential revision: https://reviews.llvm.org/D58048 llvm-svn: 353814	2019-02-12 09:50:04 +00:00
Max Kazantsev	bf6af8fbf0	[LoopSimplifyCFG] Change logic of dead loops removal to avoid hitting asserts The function `LI.erase` has some invariants that need to be preserved when it tries to remove a loop which is not the top-level loop. In particular, it requires loop's preheader to be strictly in loop's parent. Our current logic of deletion of dead blocks may erase the information about preheader before we handle the loop, and therefore we may hit this assertion. This patch changes the logic of loop deletion: we make them top-level loops before we actually erase them. This allows us to trigger the simple branch of `erase` logic which just detatches blocks from the loop and does not try to do some complex stuff that need this invariant. Thanks to @uabelho for reporting this! Differential Revision: https://reviews.llvm.org/D57221 Reviewed By: fedor.sergeev llvm-svn: 353813	2019-02-12 09:37:00 +00:00
George Rimar	b87ea73706	[yaml2obj/obj2yaml] - Move `Info` field out from `Section` class. ELFYAML.h contains a `Section` class which is a base for a few other sections classes that are used for mapping different section types. `Section` has a `StringRef Info` field used for storing sh_info. At the same time, sh_info has very different meanings for sections and cannot be processed in a similar way generally, for example ELFDumper does not handle it in `dumpCommonSection` but do that in `dumpGroup` and `dumpCommonRelocationSection` respectively. At this moment, we have and handle it as a string, because that was possible for the current use case. But also it can simply be a number: For SHT_GNU_verdef is "The number of version definitions within the section." The patch moves `Info` field out to be able to have it as a number. With that change, each class will be able to decide what type and purpose of the sh_info field it wants to use. I also had to edit 2 test cases. This is because patch fixes a bug. Previously we accepted yaml files with Info fields for all sections (for example, for SHT_DYNSYM too). But we do not handle it and the resulting objects had zero sh_info fields set for such sections. Now it is accepted only for sections that supports it. Differential revision: https://reviews.llvm.org/D58054 llvm-svn: 353810	2019-02-12 09:08:59 +00:00
Hans Wennborg	be4c0ff00a	LibFuzzer.rst: double backticks llvm-svn: 353809	2019-02-12 09:08:52 +00:00
Max Kazantsev	9aae9da947	Delete blocks from DTU to avoid dangling pointers llvm-svn: 353804	2019-02-12 08:10:29 +00:00
Max Kazantsev	6bf861597c	[LoopSimplifyCFG] Pay respect to LCSSA when removing dead blocks Utility function that we use for blocks deletion always unconditionally removes one-input Phis. In LoopSimplifyCFG, it can lead to breach of LCSSA form. This patch alters this function to keep them if needed. Differential Revision: https://reviews.llvm.org/D57231 Reviewed By: fedor.sergeev llvm-svn: 353803	2019-02-12 07:48:07 +00:00
Max Kazantsev	20b9189975	[NFC] Rename DontDeleteUselessPHIs --> KeepOneInputPHIs llvm-svn: 353801	2019-02-12 07:09:29 +00:00
Philip Reames	b6dc6eb8bb	[Statepoint Lowering] Update misleading comments about chains llvm-svn: 353800	2019-02-12 06:25:58 +00:00
Max Kazantsev	0686d1ae41	[NFC] Add parameter for keeping one-input Phis in DeleteDeadBlock(s) llvm-svn: 353799	2019-02-12 06:14:27 +00:00
Craig Topper	7670ede434	[X86] Collapse FP_TO_INT16_IN_MEM/FP_TO_INT32_IN_MEM/FP_TO_INT64_IN_MEM into a single opcode using memory VT to distinquish. NFC llvm-svn: 353798	2019-02-12 06:14:18 +00:00
Craig Topper	d7303ecd0b	[X86] Remove the value type operand from the floating point load/store MemIntrinsicSDNodes. Use the MemoryVT instead. NFCI We already have the memory VT, we can just match from that during isel. llvm-svn: 353797	2019-02-12 06:14:16 +00:00
Shoaib Meenai	9e624d5410	[build] Remove a stray comment. NFC The CMake change associated with this comment was removed but the comment got left behind. Add a newline instead. llvm-svn: 353793	2019-02-12 02:25:27 +00:00
Petr Hosek	d3ebe7126b	[CMake] Don't override required compiler flags in the runtimes build Ensure that HandleLLVMOptions adds all necessary required flags, including -Wno-error when building with LLVM_ENABLE_WERROR enabled. Differential Revision: https://reviews.llvm.org/D58092 llvm-svn: 353790	2019-02-12 02:11:25 +00:00
Sanjay Patel	093b896dcb	[x86] add tests for logic of setcc (PR40611); NFC llvm-svn: 353789	2019-02-12 01:46:30 +00:00
Sanjay Patel	14fb86310f	[PowerPC] add tests for logic of setcc (PR40611); NFC llvm-svn: 353788	2019-02-12 01:46:26 +00:00
David Blaikie	43d6122f73	Fix r353771 to target linux only (split-dwarf isn't supported on macho) llvm-svn: 353785	2019-02-12 01:19:00 +00:00
Eli Friedman	806136f8ef	[LoopReroll] Fix reroll root legality checking. The code checked that the first root was an appropriate distance from the base value, but skipped checking the other roots. This could lead to rerolling a loop that can't be legally rerolled (at least, not without rewriting the loop in a non-trivial way). Differential Revision: https://reviews.llvm.org/D56812 llvm-svn: 353779	2019-02-12 00:33:25 +00:00
Philip Reames	5292a3b6aa	[Test] Use autogenerated checks for more statepoint tests llvm-svn: 353776	2019-02-12 00:12:46 +00:00
Philip Reames	8663b00ce1	[Tests] Fill out a few tests around gc relocation uniquing llvm-svn: 353773	2019-02-12 00:01:39 +00:00
David Blaikie	104dcb348f	DebugInfo: Split DWARF + gmlt + no-split-dwarf-inlining shouldn't emit anything to the .dwo file This configuration (due to r349207) was intended not to emit any DWO CU, but a degenerate CU was still being emitted - containing a header and a DW_TAG_compile_unit with no attributes. Under that situation, emit nothing to the .dwo file. (since this is a dynamic property of the input the .dwo file is still emitted, just with nothing in it (so a valid, but empty, ELF file) - if some other CU didn't satisfy this criteria, its DWO CU would still go there, etc) llvm-svn: 353771	2019-02-12 00:00:38 +00:00
Philip Reames	6a3862e3c2	[Test] Autogenerate a statepoint test and actual show the reload llvm-svn: 353770	2019-02-11 23:55:24 +00:00
Philip Reames	5906a6591c	Be conservative about unordered accesses for the moment Background: As described in https://reviews.llvm.org/D57601, I'm working towards separating volatile and atomic in the MMO uses for atomic instructions. In https://reviews.llvm.org/D57593, I fixed a bug where isUnordered was returning the wrong result, but didn't account for the fact I was getting slightly ahead of myself. While both uses of isUnordered are correct (as far as I can tell), we don't have tests to demonstrate this and being aggressive gets in the way of having the removal of volatile truly be non-functional. Once D57601 lands, I will return to these call sites, revert this patch, and add the appropriate tests to show the expected behaviour. Differential Revision: https://reviews.llvm.org/D57802 llvm-svn: 353766	2019-02-11 23:34:33 +00:00
Daniel Sanders	6cbc92915a	[tblgen] Add a timer covering the time spent reading the Instruction defs This patch adds a -time-regions option to tablegen that can enable timers (currently only one) that assess the performance of tablegen itself. This can be useful for identifying scaling problems with tablegen backends. This particular timer has allowed me to ignore time that is not attributed the GISel combiner pass. It's useful by itself but it is particularly useful in combination with https://reviews.llvm.org/D52954 which causes this period of time to be annotated within Xcode Instruments which in turn allows profile samples and recorded allocations attributed to reading instructions to be filtered out. llvm-svn: 353763	2019-02-11 23:02:02 +00:00
Matt Arsenault	b2d245771f	GlobalISel: Verify G_EXTRACT llvm-svn: 353759	2019-02-11 22:12:43 +00:00
Evandro Menezes	f4a369596f	[TargetLibraryInfo] Update run time support for Windows It seems that, since VC19, the `float` C99 math functions are supported for all targets, unlike the C89 ones. According to the discussion at https://reviews.llvm.org/D57625. llvm-svn: 353758	2019-02-11 22:12:01 +00:00
Ana Pazos	9a3dc3e60b	[LegalizeTypes] Expand FNEG to bitwise op for IEEE FP types Summary: Except for custom floating point types x86_fp80 and ppc_fp128, expand Y = FNEG(X) to Y = X ^ sign mask to avoid library call. Using bitwise operation can improve code size and performance. Reviewers: efriedma Reviewed By: efriedma Subscribers: efriedma, kpn, arsenm, eli.friedman, javed.absar, rbar, johnrusso, simoncook, sabuasal, niosHD, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, asb, llvm-commits Differential Revision: https://reviews.llvm.org/D57875 llvm-svn: 353757	2019-02-11 22:10:08 +00:00
Scott Linder	72a0f4e8db	[IRReader] Expose getLazyIRModule Currently there is no way to lazy-load an in-memory IR module without first writing it to disk. This patch just exposes the existing implementation of getLazyIRModule. This is effectively a revert of rL212364 Differential Revision: https://reviews.llvm.org/D56203 llvm-svn: 353755	2019-02-11 22:01:13 +00:00
Matt Arsenault	18ec382698	GlobalISel: Implement moreElementsVector for implicit_def llvm-svn: 353754	2019-02-11 22:00:39 +00:00
Matt Arsenault	68fc38ce80	GlobalISel: Fix not calling the observer when legalizing G_EXTRACT llvm-svn: 353750	2019-02-11 21:33:54 +00:00
Daniel Sanders	24e0af6906	[globalisel] Correct string emitted by GISelChangeObserver::erasingInstr() The API indicates that the MI is about to be erased rather than it has been erased. llvm-svn: 353746	2019-02-11 20:45:19 +00:00
Craig Topper	75eb0af874	[X86] Correct the memory operand for the FLD emitted in FP_TO_INTHelper for 32-bit SSE targets. We were using DstTy, but that represents the integer type we are converting to which is i64 in this case. The FLD is part of an intermediate step to get from the SSE registers to the x87 registers. If the floating point type is f32, the memory operand should reflect a 4 byte access not an 8 byte access. The store we used to get from SSE to the stack is using the corect size. While there, consistenly use TheVT in place of Op.getOperand(0).getValueType() throughout the function. llvm-svn: 353745	2019-02-11 20:38:10 +00:00
Matt Davis	22c21934ce	[llvm-cxxfilt] Split and demangle stdin input Summary: Originally, llvm-cxxfilt would treat a line as a single mangled item to be demangled. If a mangled name appears in the middle of that string, that name would not be demangled. GNU c++filt splits and demangles every word in a string that is piped to it via stdin. Prior to this patch llvm-cxxfilt would never split strings piped to it. This patch replicates the GNU behavior and splits strings that are piped to it via stdin. This fixes PR39990 Reviewers: compnerd, jhenderson, davide Reviewed By: compnerd, jhenderson Subscribers: erik.pilkington, jhenderson, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57350 llvm-svn: 353743	2019-02-11 20:30:53 +00:00
Daniel Sanders	b31180d0de	[globalisel] Restore comment explaining the nits of GISelChangeObserver::createdInstr() llvm-svn: 353741	2019-02-11 20:05:49 +00:00
Alina Sbirlea	d77edc00a8	[MemorySSA] Remove verifyClobberSanity. Summary: This verification may fail after certain transformations due to BasicAA's fragility. Added a small explanation and a testcase that triggers the assert in checkClobberSanity (before its removal). Addresses PR40509. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, llvm-commits, Prazek Tags: #llvm Differential Revision: https://reviews.llvm.org/D57973 llvm-svn: 353739	2019-02-11 19:51:21 +00:00
Michael Kruse	77a614a6e1	Refactor setAlreadyUnrolled() and setAlreadyVectorized(). Loop::setAlreadyUnrolled() and LoopVectorizeHints::setLoopAlreadyUnrolled() both add loop metadata that stops the same loop from being transformed multiple times. This patch merges both implementations. In doing so we fix 3 potential issues: * setLoopAlreadyUnrolled() kept the llvm.loop.vectorize/interleave.* metadata even though it will not be used anymore. This already caused problems such as http://llvm.org/PR40546. Change the behavior to the one of setAlreadyUnrolled which deletes this loop metadata. * setAlreadyUnrolled() used to create a new LoopID by calling MDNode::get with nullptr as the first operand, then replacing it by the returned references using replaceOperandWith. It is possible that MDNode::get would instead return an existing node (due to de-duplication) that then gets modified. To avoid, use a fresh TempMDNode that does not get uniqued with anything else before replacing it with replaceOperandWith. * LoopVectorizeHints::matchesHintMetadataName() only compares the suffix of the attribute to set the new value for. That is, when called with "enable", would erase attributes such as "llvm.loop.unroll.enable", "llvm.loop.vectorize.enable" and "llvm.loop.distribute.enable" instead of the one to replace. Fortunately, function was only called with "isvectorized". Differential Revision: https://reviews.llvm.org/D57566 llvm-svn: 353738	2019-02-11 19:45:44 +00:00
Sanjay Patel	587fd849f0	[InstCombine] Fix matchRotate bug when one operand is a ConstantExpr shift This bug seems to be harmless in release builds, but will cause an error in UBSAN builds or an assertion failure in debug builds. When it gets to this opcode comparison, it assumes both of the operands are BinaryOperators, but the prior m_LogicalShift will also match a ConstantExpr. The cast<BinaryOperator> will assert in a debug build, or reading an invalid value for BinaryOp from memory with ((BinaryOperator*)constantExpr)->getOpcode() will cause an error in a UBSAN build. The test I added will fail without this change in debug/UBSAN builds, but not in release. Patch by: @AndrewScheidecker (Andrew Scheidecker) Differential Revision: https://reviews.llvm.org/D58049 llvm-svn: 353736	2019-02-11 19:26:27 +00:00
Bjorn Pettersson	4892f06e06	[SelectionDAGBuilder] Add restrictions to EmitFuncArgumentDbgValue Summary: This patch fixes PR40587. When a dbg.value instrinsic is emitted to the DAG by using EmitFuncArgumentDbgValue the resulting DBG_VALUE is hoisted to the beginning of the entry block. I think the idea is to be able to locate a formal argument already from the start of the function. However, EmitFuncArgumentDbgValue only checked that the value that was used to describe a variable was originating from a function parameter, not that the variable itself actually was an argument to the function. So when for example assigning a local variable "local" the value from an argument "a", the assocated DBG_VALUE instruction would be hoisted to the beginning of the function, even if the scope for "local" started somewhere else (or if "local" was mapped to other values earlier in the function). This patch adds some logic to EmitFuncArgumentDbgValue to check that the variable being described actually is an argument to the function. And that the dbg.value being lowered already is in the entry block. Otherwise we bail out, and the dbg.value will be handled as an ordinary dbg.value (not as a "FuncArgumentDbgValue"). A tricky situation is when both the variable and the value is related to function arguments, but not neccessarily the same argument. We make sure that we do not describe the same argument more than once as a "FuncArgumentDbgValue". This solution works as long as opt has injected a "first" dbg.value that corresponds to the formal argument at the function entry. Reviewers: jmorse, aprantl Subscribers: jyknight, hiraditya, fedor.sergeev, dstenb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57702 llvm-svn: 353735	2019-02-11 19:23:30 +00:00
Alina Sbirlea	605b21739d	[LICM&MSSA] Limit store hoisting. Summary: If there is no clobbering access for a store inside the loop, that store can only be hoisted if there are no interfearing loads. A more general verification introduced here: there are no loads that are not optimized to an access outside the loop. Addresses PR40586. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, Prazek, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57967 llvm-svn: 353734	2019-02-11 19:07:15 +00:00
Evandro Menezes	4b86c474ff	[TargetLibraryInfo] Update run time support for Windows It seems that the run time for Windows has changed and supports more math functions than it used to, especially on AArch64, ARM, and AMD64. Fixes PR40541. Differential revision: https://reviews.llvm.org/D57625 llvm-svn: 353733	2019-02-11 19:02:28 +00:00
Jessica Paquette	e57fe23f70	[AArch64][GlobalISel] Add isel support for a couple vector exts/truncs Add support for - v4s16 <-> v4s32 - v2s64 <-> v2s32 And update tests that use them to show that we generate the correct instructions. Differential Revision: https://reviews.llvm.org/D57832 llvm-svn: 353732	2019-02-11 18:56:39 +00:00
Jessica Paquette	828de9fc4b	[GlobalISel][AArch64] NFC: Remove unnecessary IR from select-fp-casts.mir The IR section in this test doesn't do anything, so there's no point in it being there. Since it's redundant, just remove it. llvm-svn: 353731	2019-02-11 18:41:22 +00:00
Jordan Rupprecht	5b7ad42729	[DebugInfo] Fix /usr/lib/debug llvm-symbolizer lookup with relative paths Summary: rL189250 added a realpath call, and rL352916 because realpath breaks assumptions with some build systems. However, the /usr/lib/debug case has been clarified, falling back to /usr/lib/debug is currently broken if the obj passed in is a relative path. Adding a call to use absolute paths when falling back to /usr/lib/debug fixes that while still not making any realpath assumptions. This also adds a --fallback-debug-path command line flag for testing (since we probably can't write to /usr/lib/debug from buildbot environments), but was also verified manually: ``` $ rm -f path/to/dwarfdump-test.elf-x86-64 $ strace llvm-symbolizer --obj=relative/path/to/dwarfdump-test.elf-x86-64.debuglink 0x40113f \|& grep dwarfdump ``` Lookups went to relative/path/to/dwarfdump-test.elf-x86-64, relative/path/to/.debug/dwarfdump-test.elf-x86-64, and then finally /usr/lib/debug/absolute/path/to/dwarfdump-test.elf-x86-64. Reviewers: dblaikie, samsonov Reviewed By: dblaikie Subscribers: krytarowski, aprantl, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57916 llvm-svn: 353730	2019-02-11 18:05:48 +00:00
Andrea Di Biagio	23ff2aa47c	[MCA][Scheduler] Track resources that were found busy when issuing an instruction. This is a follow up of r353706. When the scheduler fails to issue a ready instruction to the underlying pipelines, it now updates a mask of 'busy resource units'. That information will be used in future to obtain the set of "problematic" resources in the case of bottlenecks caused by resource pressure. No functional change intended. llvm-svn: 353728	2019-02-11 17:55:47 +00:00
Roland Froese	732fe22454	[PowerPC] Avoid scalarization of vector truncate The PowerPC code generator currently scalarizes vector truncates that would fit in a vector register, resulting in vector extracts, scalar operations, and vector merges. This patch custom lowers a vector truncate that would fit in a register to a vector shuffle instead. Differential Revision: https://reviews.llvm.org/D56507 llvm-svn: 353724	2019-02-11 17:29:14 +00:00
Jessica Paquette	ebdb021031	[GlobalISel][AArch64] Select G_FFLOOR This teaches the legalizer about G_FFLOOR, and lets us select G_FFLOOR in AArch64. It updates the existing floating point tests, and adds a select-floor.mir test. Differential Revision: https://reviews.llvm.org/D57486 llvm-svn: 353722	2019-02-11 17:22:58 +00:00
Jessica Paquette	f472f31876	Recommit "[GlobalISel] Add IRTranslator support for G_FFLOOR" After the changes introduced in r353586, this instruction doesn't cause any issues for any backend. Original review: https://reviews.llvm.org/D57485 llvm-svn: 353720	2019-02-11 17:16:32 +00:00
Matt Arsenault	9dba67f431	GlobalISel: Add G_FCANONICALIZE instruction llvm-svn: 353719	2019-02-11 17:05:20 +00:00
Valery Pykhtin	e1c338e527	[AMDGPU] fix atomic_optimizations_buffer.ll test after DPP combiner was enabled by default. Related commits: rL353691, rL353703. llvm-svn: 353717	2019-02-11 16:28:42 +00:00
Simon Pilgrim	9ea8f49a83	[X86] Regenerate insertelement tests Add common X86/X64 prefixes (and use X86 instead of X32) llvm-svn: 353716	2019-02-11 16:16:09 +00:00
David Greene	5caa550649	Add recipes for migrating downstream branches of git mirrors Add some common recipes for downstream users developing on top of the existing git mirrors. These instructions show how to migrate local branches to the monorepo. Differential Revision: https://reviews.llvm.org/D56550 llvm-svn: 353713	2019-02-11 15:40:02 +00:00
Benjamin Kramer	711950c116	Move some classes into anonymous namespaces. NFC. llvm-svn: 353710	2019-02-11 15:16:21 +00:00
Andrea Di Biagio	83e68854d5	[MCA] Return a mask of busy resources from method ResourceManager::checkAvailability(). NFCI In case of bottlenecks caused by pipeline pressure, we want to be able to correctly report the set of problematic pipelines. This is a first step towards adding support for bottleneck hints in llvm-mca (see PR37494). No functional change intended. llvm-svn: 353706	2019-02-11 14:53:04 +00:00
Benjamin Kramer	582c16013d	[AMDGPU] Remove unused variable llvm-svn: 353704	2019-02-11 14:49:54 +00:00
Neil Henning	8c10fa1a90	[AMDGPU] Fix DPP sequence in atomic optimizer. This commit fixes the DPP sequence in the atomic optimizer (which was previously missing the row_shr:3 step), and works around a read_register exec bug by using a ballot instead. Differential Revision: https://reviews.llvm.org/D57737 llvm-svn: 353703	2019-02-11 14:44:14 +00:00
Sam McCall	e825ba9165	Revert "[X86][SSE] Generalize X86ISD::BLENDI support to more value types" This reverts commit r353610. It causes a miscompile visible in macro expansion in a bootstrapped clang. http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190211/626590.html llvm-svn: 353699	2019-02-11 14:05:36 +00:00
Max Kazantsev	0136e7a246	[TEST] Add missing opportunity test for PR39673 llvm-svn: 353693	2019-02-11 12:58:18 +00:00
Sam Parker	8ff143033a	[ARM] Add v8m.base pattern for add negative imm The v8m.base ISA contains movw, which can operate on an unsigned 16-bit value. Add the pattern that converts an add with a negative value, that could fit into 16-bits when negated, into a sub with that positive value. Differential Revision: https://reviews.llvm.org/D57942 llvm-svn: 353692	2019-02-11 11:35:42 +00:00
Valery Pykhtin	ded96df01e	[AMDGPU] Enable DPP combiner pass by default. Related revisions: https://reviews.llvm.org/D55444, https://reviews.llvm.org/D55314 llvm-svn: 353691	2019-02-11 11:15:03 +00:00
Sam Parker	3fbacd4964	[NFC][ARM] Simplify loop-indexing codegen test Remove unnecessary offset checks, CHECK-BASE checks and add some extra -NOT checks and TODO comments. llvm-svn: 353689	2019-02-11 10:52:49 +00:00
Max Kazantsev	8ec0c5e02f	[TEST] Add failing test from PR40454 llvm-svn: 353688	2019-02-11 10:44:57 +00:00
Diana Picus	2d29cc311b	test-release.sh: Add option to use ninja Allow the use of ninja instead of make. This is useful on some platforms where we'd like to be able to limit the number of link jobs without slowing down the other steps of the release. This patch adds a -use-ninja command line option, which sets the generator to Ninja both for LLVM and the test-suite. It also deals with some differences between make and ninja: * DESTDIR handling - ninja doesn't like this to be listed after the target, but both make and ninja can handle it before the command * Verbose mode - ninja uses -v, make uses VERBOSE=1 * Keep going mode - make has a -k mode, which builds as much as possible even when failures are encountered; for ninja we need to set a hard limit (we use 100 since most people won't look at 100 failures anyway) I haven't tested with gmake. llvm-svn: 353685	2019-02-11 10:30:22 +00:00
Eugene Leviant	6bcf6358eb	Attempt to fix buildbot after r353679 #2 llvm-svn: 353683	2019-02-11 10:17:17 +00:00
Carlos Alberto Enciso	e848d426a7	[DWARF] LLVM ERROR: Broken function found, while removing Debug Intrinsics. Check that when SimplifyCFG is flattening a 'br', all their debug intrinsic instructions are removed, including any dbg.label referencing a label associated with the basic blocks being removed. As the test case involves a CFG transformation, move it to the correct location. Differential Revision: https://reviews.llvm.org/D57444 llvm-svn: 353682	2019-02-11 10:16:38 +00:00
Eugene Leviant	6aaa8bfef8	Attempt to fix buildbot after r353679 llvm-svn: 353681	2019-02-11 10:12:19 +00:00
Eugene Leviant	317f9e7ae7	Small refactoring of FileError. NFC. Differential revision: https://reviews.llvm.org/D57945 llvm-svn: 353679	2019-02-11 09:49:37 +00:00
Sjoerd Meijer	150ccb889e	[ARM] LoadStoreOptimizer: reoder limit The whole design of generating LDMs/STMs is fragile and unreliable: it depends on rescheduling here in the LoadStoreOptimizer that isn't register pressure aware and regalloc that isn't aware of generating LDMs/STMs. This patch adds a (hidden) option to control the total number of instructions that can be re-ordered. I appreciate this looks only a tiny bit better than a hard-coded constant, but at least it allows more easy experimentation with different values for now. Ideally we calculate this reorder limit based on some heuristics, and take register pressure into account. I might be looking into that next. Differential Revision: https://reviews.llvm.org/D57954 llvm-svn: 353678	2019-02-11 09:37:42 +00:00
Chandler Carruth	9beadff6a5	Move CFLGraph and the AA summary code over to the new `CallBase` instruction base class rather than the `CallSite` wrapper. llvm-svn: 353676	2019-02-11 09:25:41 +00:00
Michal Gorny	6e54cda2ac	[llvm] [cmake] Use current directory in GenerateVersionFromVCS Find dependent scripts of GenerateVersionFromVCS in current directory rather than ../../cmake/modules. I do not see any reason why the former would not work and The latter is incorrect when GenerateVersionFromVCS is used from install directory (i.e. in stand-alone builds). Differential Revision: https://reviews.llvm.org/D57996 llvm-svn: 353674	2019-02-11 09:07:07 +00:00
Chandler Carruth	2d2a4359a2	Remove `CallSite` from the CodeMetrics analysis, moving it to the new `CallBase` and simpler APIs therein. llvm-svn: 353673	2019-02-11 09:03:32 +00:00
Chandler Carruth	73634358a1	Remove a declaration that is dead, and not even implemented any longer. llvm-svn: 353672	2019-02-11 09:03:26 +00:00
Sjoerd Meijer	0cc50c6b87	[ARM] LoadStoreOptimizer: just a clean-up. NFC. Differential Revision: https://reviews.llvm.org/D57955 llvm-svn: 353670	2019-02-11 08:47:59 +00:00
Chandler Carruth	1f5550326f	Update more files added with the old header to the new one. llvm-svn: 353667	2019-02-11 08:25:56 +00:00
Chandler Carruth	127252b7d9	Update new files added to llvm-objcopy to use the new file header. llvm-svn: 353666	2019-02-11 08:25:19 +00:00
Chandler Carruth	b53f0e1145	Update files that were mistakenly added with the old file header to the new one. llvm-svn: 353665	2019-02-11 08:07:38 +00:00
Chandler Carruth	3b387a7e3c	Update files that were mistakenly added with the old file header. llvm-svn: 353664	2019-02-11 08:07:32 +00:00
Chandler Carruth	dac20a8254	[CallSite removal] Port InstSimplify over to use `CallBase` both in its interface and implementation. Port code with: `cast<CallBase>(CS.getInstruction())`. llvm-svn: 353662	2019-02-11 07:54:10 +00:00
Chandler Carruth	751d95fb9b	[CallSite removal] Migrate ConstantFolding APIs and implementation to `CallBase`. Users have been updated. You can see how to update any out-of-tree usages: pass `cast<CallBase>(CS.getInstruction())`. llvm-svn: 353661	2019-02-11 07:51:44 +00:00
Chandler Carruth	3160734af1	[CallSite removal] Migrate the statepoint GC infrastructure to use the `CallBase` class rather than `CallSite` wrappers. I pushed this change down through most of the statepoint infrastructure, completely removing the use of CallSite where I could reasonably do so. I ended up making a couple of cut-points: generic call handling (instcombine, TLI, SDAG). As soon as it hit truly generic handling with users outside the immediate code, I simply transitioned into or out of a `CallSite` to make this a reasonable sized chunk. Differential Revision: https://reviews.llvm.org/D56122 llvm-svn: 353660	2019-02-11 07:42:30 +00:00
Craig Topper	5b1beda001	[X86] Removed unused SDTypeProfile. NFC llvm-svn: 353659	2019-02-11 07:30:48 +00:00
Nico Weber	fd6bf97b6f	gn build: Fix clang-tidy dep on ClangSACheckers. Patch by Mirko Bonadei <mbonadei@webrtc.org>! Differential Revision: https://reviews.llvm.org/D57998 llvm-svn: 353657	2019-02-11 03:09:57 +00:00
Simon Pilgrim	f6e6c369c0	[X86] EltsFromConsecutiveLoads - replace SmallBitVector with APInt (NFC). Minor refactor to simplify some incoming patches to improve broadcast loads. llvm-svn: 353655	2019-02-10 22:45:48 +00:00
Mandeep Singh Grang	ea246114bb	[GlobalISel] Regex the opcodes in unit test to fix non-deterministic ordering Differential Revision: https://reviews.llvm.org/D57988 llvm-svn: 353652	2019-02-10 19:53:43 +00:00
Nikita Popov	a0e96bd56d	[CodeGen][X86] Don't scalarize vector saturating add/sub Now that we have vector support for [US](ADD\|SUB)O we no longer need to scalarize when expanding [US](ADD\|SUB)SAT. This matches what the cost model already does. Differential Revision: https://reviews.llvm.org/D57348 llvm-svn: 353651	2019-02-10 19:06:38 +00:00
Simon Pilgrim	a303186ef3	[AArch64] Regenerate bswap tests llvm-svn: 353648	2019-02-10 18:27:37 +00:00
Simon Pilgrim	ce10312986	[X86] Add basic bitreverse/bswap combine tests Shows missing SimplifyDemandedBits support llvm-svn: 353647	2019-02-10 18:07:03 +00:00
Simon Pilgrim	c5744d4d69	[DAG] Add optional AllowUndefs to isNullOrNullSplat No change in default behaviour (AllowUndefs = false) llvm-svn: 353646	2019-02-10 17:42:15 +00:00
Simon Pilgrim	5a82a788a2	[DAGCombine] Simplify funnel shifts with undef/zero args to bitshifts Now that we have SimplifyDemandedBits support for funnel shifts (rL353539), we need to simplify funnel shifts back to bitshifts in cases where either argument has been folded to undef/zero. Differential Revision: https://reviews.llvm.org/D58009 llvm-svn: 353645	2019-02-10 17:04:00 +00:00
Simon Pilgrim	06a61b0b2b	[X86] Add masked variable tests for funnel undef/zero argument combines I've avoided 'modulo' masks as we'll SimplifyDemandedBits those in the future, and we just need to check that the shift variable is 'in range' llvm-svn: 353644	2019-02-10 15:46:32 +00:00
Sanjay Patel	833550fc74	[x86] narrow 256-bit horizontal ops via demanded elements 256-bit horizontal math ops are an x86 monstrosity (and thankfully have not been extended to 512-bit AFAIK). The two 128-bit halves operate on separate halves of the inputs. So if we don't demand anything in the upper half of the result, we can extract the low halves of the inputs, do the math, and then insert that result into a 256-bit output. All of the extract/insert is free (ymm<-->xmm), so we're left with a narrower (cheaper) version of the original op. In the affected tests based on: https://bugs.llvm.org/show_bug.cgi?id=33758 https://bugs.llvm.org/show_bug.cgi?id=38971 ...we see that the h-op narrowing can result in further narrowing of other math via existing generic transforms. I originally drafted this patch as an exact pattern match starting from extract_vector_elt, but I thought we might see diffs starting from extract_subvector too, so I changed it to a more general demanded elements solution. There are no extra existing regression test improvements from that switch though, so we could go back. Differential Revision: https://reviews.llvm.org/D57841 llvm-svn: 353641	2019-02-10 15:22:06 +00:00
Simon Pilgrim	76683e7b58	[X86] Add additional tests for funnel undef/zero argument combines As suggested on D58009 llvm-svn: 353640	2019-02-10 14:54:57 +00:00
Sanjay Patel	2f319420f9	[TargetLowering] refactor setcc folds to fix another miscompile (PR40657) SimplifySetCC still has much room for improvement, but this should fix the remaining problem examples from: https://bugs.llvm.org/show_bug.cgi?id=40657 The initial fix for this problem was rL353615. llvm-svn: 353639	2019-02-10 14:29:57 +00:00
Simon Pilgrim	fd541e9a5b	[X86][SSE] Add SimplifyDemandedBits test for BLENDVPD llvm-svn: 353638	2019-02-10 12:55:44 +00:00
Fangrui Song	709a3e7488	[Local] Delete a redundant check. NFC isInstructionTriviallyDead also performs the use_empty() check. llvm-svn: 353637	2019-02-10 09:25:56 +00:00
George Rimar	5cb317315c	[yaml2obj] - Fix .dynamic section entries writing for 32bit targets. This was introduced by me in r353613. I tried to fix Big-endian bot and replaced uintX_t -> ELFT::Xword. But ELFT::Xword is a packed<uint64_t>, so it is always 8 bytes and that was obviously incorrect. My intention was to use something like packed<uint> actually, which size is target dependent. Patch fixes this bug and adds a test case, since no bots seems reported this. llvm-svn: 353636	2019-02-10 08:35:38 +00:00
Craig Topper	f37ea96922	[X86] Move some vector InstAliases out from under unnecessary 'let Predicates'. NFCI We don't have any assembler predicates for vector ISAs so this isn't necessary. It just adds extra lines and identation. llvm-svn: 353631	2019-02-10 02:34:31 +00:00
Craig Topper	a97857b5b5	[InstCombine] Fix an unused variable warning. llvm-svn: 353630	2019-02-10 02:21:29 +00:00
Simon Pilgrim	a561d46633	[X86] Add tests for funnel undef argument combines If one of the shifted arguments is undef we should be folding to a regular shift. llvm-svn: 353628	2019-02-09 22:21:09 +00:00
Simon Pilgrim	6bf7b30b10	[X86] CombineOr - fold to generic funnel shifts As discussed on D57389, this is a first step towards moving the SHLD/SHRD matching code to DAGCombiner using FSHL/FSHR instead. There's a bit of work to do before I can do that, so this just folds to FSHL/FSHR in the existing code (handling the different SHRD/FSHR argument ordering), which fixes the issue we had with i16 shift amounts not being correctly masked. llvm-svn: 353626	2019-02-09 20:34:59 +00:00
Sanjay Patel	586ad01fb6	[x86] add another test for setcc miscompile (PR40657); NFC llvm-svn: 353625	2019-02-09 20:06:11 +00:00
Nico Weber	89a4deea96	gn build: Merge r353590 llvm-svn: 353621	2019-02-09 17:58:16 +00:00
Nico Weber	a2f60933e5	llvm-lib: Implement /list flag Differential Revision: https://reviews.llvm.org/D57952 llvm-svn: 353620	2019-02-09 17:33:04 +00:00
Sanjay Patel	7467510453	[TargetLowering] add tests to show effect of setcc sub->shift; NFC There's effectively no difference for the cases with variables. We just trade a sub for an add on those. But the case with a subtract from constant would require an extra move instruction on x86, so this looks like a reasonable generic combine. llvm-svn: 353619	2019-02-09 17:03:59 +00:00
Sanjay Patel	f31cf49c58	[x86] add test for setcc sub->shift transform; NFC llvm-svn: 353618	2019-02-09 16:41:20 +00:00
Simon Pilgrim	ab28321768	[X86] Regenerate test. llvm-svn: 353616	2019-02-09 16:27:19 +00:00
Sanjay Patel	887ac1b38c	[TargetLowering] avoid miscompile in setcc transform (PR40657) llvm-svn: 353615	2019-02-09 15:59:02 +00:00
George Rimar	6404af8646	[yaml2elf.cpp] - Fix compilation under linux. Fixes errors like: /home/ssglocal/clang-cmake-x86_64-sde-avx512-linux/clang-cmake-x86_64-sde-avx512-linux/llvm/tools/yaml2obj/yaml2elf.cpp:597:5: error: need ‘typename’ before ‘ELFT:: Xword’ because ‘ELFT’ is a dependent scope ELFT::Xword Tag = (ELFT::Xword)DE.Tag; llvm-svn: 353614	2019-02-09 15:18:52 +00:00
George Rimar	291bbe5e2c	[yaml2elf] - An attemp to fix s390x BB after r353607. s390x is big-endian and seems r353607 had an issue with endianess, Bot was unhappy: http://lab.llvm.org:8011/builders/clang-s390x-linux-lnt/builds/11168/steps/ninja%20check%201/logs/stdio This should fix it. llvm-svn: 353613	2019-02-09 15:03:19 +00:00
Nikita Popov	37bce93e36	Revert "[SelectionDAG] Extract [US]MULO expansion into TL method; NFC" This reverts commit r353611. Triggers an assertion during the libcall expansion on ARM. llvm-svn: 353612	2019-02-09 13:54:02 +00:00
Nikita Popov	7de44ed945	[SelectionDAG] Extract [US]MULO expansion into TL method; NFC In preparation for supporting vector expansion. Also drop a variant of ExpandLibCall, of which the MULO expansions were the only user. llvm-svn: 353611	2019-02-09 13:29:22 +00:00
Simon Pilgrim	690a2889d8	[X86][SSE] Generalize X86ISD::BLENDI support to more value types D42042 introduced the ability for the ExecutionDomainFixPass to more easily change between BLENDPD/BLENDPS/PBLENDW as the domains required. With this ability, we can avoid most bitcasts/scaling in the DAG that was occurring with X86ISD::BLENDI lowering/combining, blend with the vXi32/vXi64 vectors directly and use isel patterns to lower to the float vector equivalent vectors. This helps the shuffle combining and SimplifyDemandedVectorElts be more aggressive as we lose track of fewer UNDEF elements than when we go up/down through bitcasts. I've introduced a basic blend(bitcast(x),bitcast(y)) -> bitcast(blend(x,y)) fold, there are more generalizations I can do there (e.g. widening/scaling and handling the tricky v16i16 repeated mask case). The vector-reduce-smin/smax regressions will be fixed in a future improvement to SimplifyDemandedBits to peek through bitcasts and support X86ISD::BLENDV. Differential Revision: https://reviews.llvm.org/D57888 llvm-svn: 353610	2019-02-09 13:13:59 +00:00
George Rimar	0745ca7830	[lib/ObjectYAML] - Fix BB after r353607 [2]. NFC. The second and the last place it seems. Error was: [ 4%] Building CXX object lib/Support/CMakeFiles/LLVMSupport.dir/Error.cpp.o /Users/buildslave/as-bldslv9_new/lld-x86_64-darwin13/llvm.src/lib/ObjectYAML/ELFYAML.cpp:993:15: error: unused variable 'Object' [-Werror,-Wunused-variable] const auto Object = static_cast<ELFYAML::Object >(IO.getContext()); llvm-svn: 353609	2019-02-09 12:14:20 +00:00
George Rimar	cc22d887ac	[lib/ObjectYAML] - Fix BB after r353607. NFC. Error was: [ 4%] Building CXX object lib/Support/CMakeFiles/LLVMSupport.dir/DAGDeltaAlgorithm.cpp.o /Users/buildslave/as-bldslv9_new/lld-x86_64-darwin13/llvm.src/lib/ObjectYAML/ELFYAML.cpp:666:15: error: unused variable 'Object' [-Werror,-Wunused-variable] const auto Object = static_cast<ELFYAML::Object >(IO.getContext()); (http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/29920) llvm-svn: 353608	2019-02-09 12:04:39 +00:00
George Rimar	0e7ed91264	[yaml2obj][obj2yaml] - Add support for dumping/parsing .dynamic sections. This teaches the tools to parse and dump the .dynamic section and its dynamic tags. Differential revision: https://reviews.llvm.org/D57691 llvm-svn: 353606	2019-02-09 11:34:28 +00:00
Fangrui Song	6e679f8ba5	[GlobalOpt] Simplify __cxa_atexit elimination cxxDtorIsEmpty checks callers recursively to determine if the __cxa_atexit-registered function is empty, and eliminates the __cxa_atexit call accordingly. This recursive check is unnecessary as redundant instructions and function calls can be removed by early-cse and inliner. In addition, cxxDtorIsEmpty does not mark visited function and it may visit a function exponential times (multiplication principle). llvm-svn: 353603	2019-02-09 09:18:37 +00:00
Petr Hosek	3ef9918d25	[CMake] Don't set <PROJECT>_STANDALONE_BUILD We shouldn't be treating runtimes builds as standalone builds since we have enough of the context loaded into the runtimes environment. Differential Revision: https://reviews.llvm.org/D57992 llvm-svn: 353601	2019-02-09 03:06:56 +00:00
Hubert Tong	8c2a236358	[MC] Clean up unused inline function and non-anchor defaulted destructors; NFCI Summary: Take care of some missing clean-ups that belong with r249548 and some other copy/paste that had happened. In particular, the destructors are no longer vtable anchors after r249548; and `setSectionName` in `MCSectionWasm` is private and unused since r313058 culled its only caller. The destructors are now implicitly defined, and the unused function is removed. Reviewers: nemanjai, jasonliu, grosbach Reviewed By: nemanjai Subscribers: sbc100, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57182 llvm-svn: 353597	2019-02-09 02:11:51 +00:00
Gabor Buella	53980b24b7	Extra processing for BitCast + PHI in InstCombine For some specific cases with bitcast A->B->A with intervening PHI nodes InstCombiner::optimizeBitCastFromPhi transformation creates extra PHI nodes, which are actually a copy of already created PHI or in another words, they are redundant. These extra PHI nodes could lead to extra move instructions generated after DeSSA transformation. This happens when several conditions are met - SROA kicks in and creates new alloca; - there is a simple assignment L = R, which falls under 'canonicalize loads' done by combineLoadToOperationType (this transformation is by default). Exactly this transformation is the reason of bitcasts generated; - the alloca is then used in A->B->A + PHI chain; - there is a loop unrolling. As a result optimizeBitCastFromPhi creates as many of PHI nodes for each new SROA alloca as loop unrolling factor is. These new extra PHI nodes are redundant actually except of one and should not be created. Moreover the idea of optimizeBitCastFromPhi is to get rid of the cast (when possible) but that doesn't happen in these conditions. The proposed fix is to do the cast replacement for the whole calculated/accumulated PHI closure not for one cast only, which is an argument to the optimizeBitCastFromPhi. These will help to accomplish several things: 1) avoid extra PHI nodes generated as all casts which may trigger optimizeBitCastFromPhi transformation will be replaced, 3) bitcasts will be replaced, and 3) create more opportunities to remove dead code, which appears after the replacement. A new test case shows that it's possible to get rid of all bitcasts completely and get quite good code reduction. Author: Igor Tsimbalist <igor.v.tsimbalist@intel.com> Reviewed By: Carrot Differential Revision: https://reviews.llvm.org/D57053 llvm-svn: 353595	2019-02-09 01:44:28 +00:00
Stanislav Mekhanoshin	344968fdb4	[AMDGPU] Split idot4/8 signed and unsigned tests. NFC. llvm-svn: 353593	2019-02-09 01:02:28 +00:00
Mikhail R. Gadelha	3289ccd848	This reverts commit 1440a848a635849b97f7a5cfa0ecc40d37451f5b. and commit a1853e834c65751f92521f7481b15cf0365e796b. They broke arm and aarch64 llvm-svn: 353590	2019-02-09 00:46:12 +00:00
Jessica Paquette	c230c13d4b	Recommit "[GlobalISel] Introduce a generic floating point floor opcode, G_FFLOOR"" After r353586, we won't fail on the AMDGPU floor pattern that was killing the importer before. llvm-svn: 353589	2019-02-09 00:37:31 +00:00
Stanislav Mekhanoshin	0e858b028d	[AMDGPU] Split dot-insts feature Differential Revision: https://reviews.llvm.org/D57971 llvm-svn: 353587	2019-02-09 00:34:21 +00:00
Jessica Paquette	1ed1dd6d95	[GlobalISel] Skip patterns that define complex suboperands twice instead of dying If we run into a pattern that looks like this: add (complex $x, $y) (complex $x, $z) We should skip the pattern instead of asserting/doing something unpredictable. This makes us return an Error in that case, and adds a testcase for skipped patterns. Differential Revision: https://reviews.llvm.org/D57980 llvm-svn: 353586	2019-02-09 00:29:13 +00:00
Nico Weber	760fee27fe	gn build: Merge r353566 llvm-svn: 353585	2019-02-09 00:21:06 +00:00
Sergey Dmitriev	afd612ece9	[NFC] Avoid passing blocks vector to the OutlineRegionInfo constructor by value. Reviewers: vsk, fhahn, davidxl Reviewed By: vsk Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57957 llvm-svn: 353582	2019-02-08 23:52:15 +00:00
Sanjay Patel	1386d99c77	[x86] add test for miscompiling setcc transform (PR40657); NFC llvm-svn: 353580	2019-02-08 23:34:57 +00:00
Francis Visoiu Mistrih	8bc57953b7	Re-apply r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" With a fix after r353563 that adds some more opcodes. llvm-svn: 353579	2019-02-08 23:34:11 +00:00
Francis Visoiu Mistrih	decba8aa06	Revert r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" This reverts commit r353553. This breaks CodeGen/AArch64/GlobalISel/legalize-ext-csedebug-output.mir: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/57963/console llvm-svn: 353575	2019-02-08 22:49:43 +00:00
Craig Topper	e08e2b6067	[Docs] Use code-block:: text for part of the callbr documentation to attempt to make the bot happy. llvm-svn: 353567	2019-02-08 21:09:33 +00:00
Craig Topper	fcb63c4c6c	[X86] Add FPCW as an implicit use on floating point load instructions. These instructions can generate a stack overflow exception so technically they read the stack overflow exception mask bit. llvm-svn: 353564	2019-02-08 20:50:09 +00:00
Craig Topper	784929d045	Implementation of asm-goto support in LLVM This patch accompanies the RFC posted here: http://lists.llvm.org/pipermail/llvm-dev/2018-October/127239.html This patch adds a new CallBr IR instruction to support asm-goto inline assembly like gcc as used by the linux kernel. This instruction is both a call instruction and a terminator instruction with multiple successors. Only inline assembly usage is supported today. This also adds a new INLINEASM_BR opcode to SelectionDAG and MachineIR to represent an INLINEASM block that is also considered a terminator instruction. There will likely be more bug fixes and optimizations to follow this, but we felt it had reached a point where we would like to switch to an incremental development model. Patch by Craig Topper, Alexander Ivchenko, Mikhail Dvoretckii Differential Revision: https://reviews.llvm.org/D53765 llvm-svn: 353563	2019-02-08 20:48:56 +00:00
Vedant Kumar	0e5dd512aa	[CodeExtractor] Restore outputs after creating exit stubs When CodeExtractor saves the result of InvokeInst at the first insertion point of the 'normal destination' basic block, this block can be omitted in the outlined region, so store is placed outside of the function. The suggested solution is to process saving outputs after creating exit stubs for new function, and stores will be placed in that blocks before return in this case. Patch by Sergei Kachkov! Fixes llvm.org/PR40455. Differential Revision: https://reviews.llvm.org/D57919 llvm-svn: 353562	2019-02-08 20:48:04 +00:00
Matt Arsenault	ca9583a70a	AMDGPU/GlobalISel: Fix broken tests llvm-svn: 353559	2019-02-08 19:59:39 +00:00
Matt Arsenault	564f0f832c	AMDGPU: Eliminate GPU specific SubtargetFeatures Inline compatability is determined from the individual feature bits. These are just sets of the separate features, but will always be treated as incompatible unless they are specifically ignored. Defining the ISA version number here in tablegen would be nice, but it turns out this wasn't actually used. llvm-svn: 353558	2019-02-08 19:59:32 +00:00
Nemanja Ivanovic	92a8c36735	[DAGCombine] Optimize pow(X, 0.75) to sqrt(X) * sqrt(sqrt(X)) The sqrt case is faster and we already do this for the case where the exponent is 0.25. This adds the 0.75 case which is also not sensitive to signed zeros. Patch by Whitney Tsang (Whitney) Differential revision: https://reviews.llvm.org/D57434 llvm-svn: 353557	2019-02-08 19:50:58 +00:00
Aditya Nandakumar	01e818a97d	[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder https://reviews.llvm.org/D57932 Add some logging + tests to make sure CSEInfo prints debug output. reviewed by: arsenm llvm-svn: 353553	2019-02-08 19:41:13 +00:00
Jonathan Metzman	b98fea9c11	Document libFuzzer on Windows. Summary: Document that libFuzzer supports Windows, how to get it, and its limitations. Reviewers: kcc, morehouse, rnk, metzman Reviewed By: kcc, rnk, metzman Subscribers: hans, rnk Differential Revision: https://reviews.llvm.org/D57597 llvm-svn: 353551	2019-02-08 19:35:04 +00:00
Rong Xu	017bbd96cf	[Cmake] Add an option to build LLVM using the experimental new pass manager Add LLVM_USE_NEWPM to build LLVM using the experimental new pass manager. Differential Revision: http://reviews.llvm.org/D57068 llvm-svn: 353550	2019-02-08 19:31:03 +00:00
Matt Arsenault	d7047276ec	AMDGPU: Remove GCN features and predicates These are no longer necessary since the R600 tablegen files are split out now. llvm-svn: 353548	2019-02-08 19:18:01 +00:00
Reid Kleckner	987d331fab	[InstrProf] Implement static profdata registration Summary: The motivating use case is eliminating duplicate profile data registered for the same inline function in two object files. Before this change, users would observe multiple symbol definition errors with VC link, but links with LLD would succeed. Users (Mozilla) have reported that PGO works well with clang-cl and LLD, but when using LLD without this static registration, we would get into a "relocation against a discarded section" situation. I'm not sure what happens in that situation, but I suspect that duplicate, unused profile information was retained. If so, this change will reduce the size of such binaries with LLD. Now, Windows uses static registration and is in line with all the other platforms. Reviewers: davidxl, wmi, inglorion, void, calixte Subscribers: mgorny, krytarowski, eraman, fedor.sergeev, hiraditya, #sanitizers, dmajor, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D57929 llvm-svn: 353547	2019-02-08 19:03:50 +00:00
Simon Pilgrim	eb6a47a462	[TargetLowering] Use ISD::FSHR in expandFixedPointMul Replace OR(SHL,SRL) pattern with ISD::FSHR (legalization expands this later if necessary) - this helps with the scale == 0 'undefined' drop-through case that was discussed on D55720. llvm-svn: 353546	2019-02-08 18:57:38 +00:00
Jonas Devlieghere	3d0213e483	[test] Run the verifier for dsymutil module tests Dsymutil has an option "verify" that runs the dwarf verifier on the generated dSYM. This patch enables this for the module tests. llvm-svn: 353544	2019-02-08 18:43:11 +00:00
Simon Pilgrim	478bb90779	[TargetLowering] Add SimplifyDemandedBits funnel shift support llvm-svn: 353539	2019-02-08 17:19:01 +00:00
Teresa Johnson	3ce8112dad	ArgumentPromotion should copy all metadata to new Function Summary: ArgumentPromotion had code to specifically move the dbg metadata over to the new function, but other metadata such as the function_entry_count !prof metadata was not. Replace code that moved dbg metadata with a call to copyMetadata. The old metadata is automatically removed when the old Function is removed. Reviewers: davidxl Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57846 llvm-svn: 353537	2019-02-08 17:08:27 +00:00
Craig Topper	41a1792b15	[X86] Remove isReMaterializable from X87 floating point constant loads and constant pool loads. Summary: These instructions update FPSW so they aren't generically safe to rematerialize into any location if FPSW is live for a comparison result. They also use FPCW for exception masking control. Though the only exception they can generate is stack overflow and we manage the stack ourselves so that's not really going to occur. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57934 llvm-svn: 353536	2019-02-08 17:07:54 +00:00
Simon Pilgrim	68457c1e52	[X86] Add basic funnel shift demanded bits tests llvm-svn: 353534	2019-02-08 16:51:16 +00:00
Sanjay Patel	e9cc26a56a	[x86] fix formatting; NFC (test commit #2 migrating to git) llvm-svn: 353533	2019-02-08 16:48:40 +00:00
Carl Ritson	494b8ac95a	[AMDGPU] Fix CS scratch setup on pre-GCN3 ASICs Summary: Prior to GCN3 s_load_dword offsets are in dwords rather than bytes. Thus the scratch buffer descriptor offset must be adjusted for pre-GCN3 ASICs. Reviewers: nhaehnle, tpr Reviewed By: nhaehnle Subscribers: sheredom, arsenm, kzhuravl, jvesely, wdng, yaxunl, dstuttard, t-tye, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D56496 llvm-svn: 353530	2019-02-08 15:41:11 +00:00
Nirav Dave	97011ccce0	Revert r353416 "[DAG] Cleanup unused nodes on failed store-to-load forward combine." This cleanup causes out-of-tree crashes. llvm-svn: 353527	2019-02-08 15:21:13 +00:00
Matt Arsenault	b0a227049f	AMDGPU/GlobalISel: Fix shift legalization for non-power-of-2 clampScalar doesn't do anything for non-power-of-2 in range. There should probably be a combination rule to reduce the number of matching rules. llvm-svn: 353526	2019-02-08 15:06:24 +00:00
Dmitry Preobrazhensky	942c273d64	[AMDGPU][MC] Added support of lds_direct operand See bug 39293: https://bugs.llvm.org/show_bug.cgi?id=39293 Reviewers: artem.tamazov, rampitec Differential Revision: https://reviews.llvm.org/D57889 llvm-svn: 353524	2019-02-08 14:57:37 +00:00
Matt Arsenault	0f2debb1c2	AMDGPU/GlobalISel: Fix non-power-of-2 implicit_def llvm-svn: 353522	2019-02-08 14:46:27 +00:00
Eugene Leviant	e08fe35d79	[llvm-objcopy] Add few file processing directives Differential revision: https://reviews.llvm.org/D57877 llvm-svn: 353521	2019-02-08 14:37:54 +00:00
Petar Avramovic	c98b26d326	[MIPS GlobalISel] Select any extending load and truncating store Make behavior of G_LOAD in widenScalar same as for G_ZEXTLOAD and G_SEXTLOAD. That is perform widenScalarDst to size given by the target and avoid additional checks in common code. Targets can reorder or add additional rules in LegalizeRuleSet for the opcode to achieve desired behavior. Select extending load that does not have specified type of extension into zero extending load. Select truncating store that stores number of bytes indicated by size in MachineMemoperand. Differential Revision: https://reviews.llvm.org/D57454 llvm-svn: 353520	2019-02-08 14:27:23 +00:00
Nico Weber	e44c21f5a4	gn build: Merge r353471, r353373. llvm-svn: 353518	2019-02-08 14:19:54 +00:00
Matt Arsenault	dc88a2ce35	AMDGPU/GlobalISel: Don't use a copy in addrspacecast lowering llvm-svn: 353516	2019-02-08 14:16:11 +00:00
Dmitry Preobrazhensky	62a0318dff	[AMDGPU][MC][CODEOBJECT] Added predefined symbols to access GPU minor and stepping numbers Added the following Code Object v3 symbols: .amdgcn.gfx_generation_minor .amdgcn.gfx_generation_stepping Reviewers: artem.tamazov, kzhuravl Differential Revision: https://reviews.llvm.org/D57826 llvm-svn: 353515	2019-02-08 13:51:31 +00:00
Valery Pykhtin	7fe97f8c7c	[AMDGPU] Fix DPP combiner Differential revision: https://reviews.llvm.org/D55444 dpp move with uses and old reg initializer should be in the same BB. bound_ctrl:0 is only considered when bank_mask and row_mask are fully enabled (0xF). Otherwise the old register value is checked for identity. Added add, subrev, and, or instructions to the old folding function. Kill flag is cleared for the src0 (DPP register) as it may be copied into more than one user. The pass is still disabled by default. llvm-svn: 353513	2019-02-08 11:59:48 +00:00
Carlos Alberto Enciso	08dc50f2fb	[DWARF] LLVM ERROR: Broken function found, while removing Debug Intrinsics. Check that when SimplifyCFG is flattening a 'br', all their debug intrinsic instructions are removed, including any dbg.label referencing a label associated with the basic blocks being removed. Differential Revision: https://reviews.llvm.org/D57444 llvm-svn: 353511	2019-02-08 10:57:26 +00:00
Eugene Leviant	fc6d29dff9	Attempt to fix build bot after r353509 llvm-svn: 353510	2019-02-08 10:51:08 +00:00
Eugene Leviant	340cb87e83	[llvm-objcopy] Add --redefine-syms Differential revision: https://reviews.llvm.org/D57738 llvm-svn: 353509	2019-02-08 10:33:16 +00:00
Hans Wennborg	f5db715862	Revert r353424 "[llvm-ar][libObject] Fix relative paths when nesting thin archives." This broke the Chromium build on Windows, see https://crbug.com/930058 > Summary: > When adding one thin archive to another, we currently chop off the relative path to the flattened members. For instance, when adding `foo/child.a` (which contains `x.txt`) to `parent.a`, whe > lattening it we should add it as `foo/x.txt` (which exists) instead of `x.txt` (which does not exist). > > As a note, this also undoes the `IsNew` parameter of handling relative paths in r288280. The unit test there still passes. > > This was reported as part of testing the kernel build with llvm-ar: https://patchwork.kernel.org/patch/10767545/ (see the second point). > > Reviewers: mstorsjo, pcc, ruiu, davide, david2050 > > Subscribers: hiraditya, llvm-commits > > Tags: #llvm > > Differential Revision: https://reviews.llvm.org/D57842 This reverts commit `bf990ab5aa`. llvm-svn: 353507	2019-02-08 10:16:45 +00:00
Petar Avramovic	56dc218dc1	[MIPS GlobalISel] Select mul Legalize and select G_MUL for s32 and smaller types for MIPS32. Differential Revision: https://reviews.llvm.org/D57816 llvm-svn: 353506	2019-02-08 10:11:33 +00:00
Max Kazantsev	6b63d3a277	[LoopSimplifyCFG] Use DTU.applyUpdates instead of insert/deleteEdge `insert/deleteEdge` methods in DTU can make updates incorrectly in some cases (see https://bugs.llvm.org/show_bug.cgi?id=40528), and it is recommended to use `applyUpdates` methods instead when it is needed to make a mass update in CFG. Differential Revision: https://reviews.llvm.org/D57316 Reviewed By: kuhar llvm-svn: 353502	2019-02-08 08:12:41 +00:00
Sam Parker	5b09834bc3	[ARM] Add OptMinSize to ARMSubtarget In many places in the backend, we like to know whether we're optimising for code size and this is performed by checking the current machine function attributes. A subtarget is created on a per-function basis, so it's possible to know when we're compiling for code size on construction so record this in the new object. Differential Revision: https://reviews.llvm.org/D57812 llvm-svn: 353501	2019-02-08 07:57:42 +00:00
Sergey Dmitriev	807960e6ef	[CodeExtractor] Update function's assumption cache after extracting blocks from it Summary: Assumption cache's self-updating mechanism does not correctly handle the case when blocks are extracted from the function by the CodeExtractor. As a result function's assumption cache may have stale references to the llvm.assume calls that were moved to the outlined function. This patch fixes this problem by removing extracted llvm.assume calls from the function’s assumption cache. Reviewers: hfinkel, vsk, fhahn, davidxl, sanjoy Reviewed By: hfinkel, vsk Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57215 llvm-svn: 353500	2019-02-08 06:55:18 +00:00
Heejin Ahn	df6770f0c9	[WebAssembly] Fix parseImmediate's memory alignment requirement This fixes the current failure in the x86-64 ubsan bot caused by r353496. llvm-svn: 353499	2019-02-08 04:06:56 +00:00
Aditya Nandakumar	c771675688	[GISel]: While constructing the GISelWorklist make sure we reserve at least the required size to the underlying dense map. https://reviews.llvm.org/D57931 This should save some unnecessary growing of the DenseMap. llvm-svn: 353498	2019-02-08 03:32:46 +00:00
Matt Arsenault	a8b4339c2f	AMDGPU/GlobalISel: Legalize addrspacecast Use a placeholder constant for now on targets that need the load from the queue ptr. llvm-svn: 353497	2019-02-08 02:40:47 +00:00
Wouter van Oortmerssen	0d9f3f7f95	[WebAssembly] Fixed Disassembler ignoring endian swap on big endian. Summary: This fixes: https://bugs.llvm.org/show_bug.cgi?id=40620 Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57933 llvm-svn: 353496	2019-02-08 01:43:23 +00:00
Craig Topper	738180cc7f	Fix the lowering issue of intrinsics llvm.localaddress on X86 Patch by Yuanke Luo Reviewers: craig.topper, annita.zhang, smaslov, rnk, wxiao3 Reviewed By: rnk Subscribers: efriedma, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57501 llvm-svn: 353492	2019-02-08 01:14:12 +00:00
Caroline Tice	cef4c29417	lvm-dwarfdump: Stop counting out-of-line subprogram in the "inlined functions" statistic. DW_TAG_subprogram DIEs should not be counted in the inlined function statistic. This also addresses the source variables count, as that uses the inlined function count in its calculations. Differential revision: https://reviews.llvm.org/D57849 llvm-svn: 353491	2019-02-08 00:51:33 +00:00
Craig Topper	c782f18835	[X86] Add FPCW as a register and start using it as an implicit use on floating point instructions. Summary: FPCW contains the rounding mode control which we manipulate to implement fp to integer conversion by changing the roudning mode, storing the value to the stack, and then changing the rounding mode back. Because we didn't model FPCW and its dependency chain, other instructions could be scheduled into the middle of the sequence. This patch introduces the register and adds it as an implciit def of FLDCW and implicit use of the FP binary arithmetic instructions and store instructions. There are more instructions that need to be updated, but this is a good start. I believe this fixes at least the reduced test case from PR40529. Reviewers: RKSimon, spatel, rnk, efriedma, andrew.w.kaylor Subscribers: dim, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57735 llvm-svn: 353489	2019-02-08 00:44:39 +00:00
Eli Friedman	29c0609301	[AArch64] Fix condition for "high-vector" DUP optimizations. AArch64 NEON has a bunch of instructions with a "2" suffix that extract the top half of the source vectors, instead of the bottom half. We have some DAGCombines to try to take advantage of that. However, they assumed that any EXTRACT_VECTOR was extracting the high half of the vector in question. This issue has apparently existed since the AArch64 backend was merged. Fixes https://bugs.llvm.org/show_bug.cgi?id=40632 . Differential Revision: https://reviews.llvm.org/D57862 llvm-svn: 353486	2019-02-08 00:23:35 +00:00
Petar Jovanovic	3cfcd75453	[mips][micromips] Fix how values in .gcc_except_table are calculated When a landing pad is calculated in a program that is compiled for micromips with -fPIC flag, it will point to an even address. Such an error will cause a segmentation fault, as the instructions in micromips are aligned on odd addresses. This patch sets the last bit of the offset where a landing pad is, to 1, which will effectively be an odd address and point to the instruction exactly. r344591 fixed this issue for -static compilation. Patch by Aleksandar Beserminji. Differential Revision: https://reviews.llvm.org/D57677 llvm-svn: 353480	2019-02-07 22:57:33 +00:00
Sanjay Patel	81f859d169	[x86] fix formatting; NFC llvm-svn: 353477	2019-02-07 22:36:55 +00:00
Dan Gohman	e086afa7a3	[WebAssembly] Update test output after rL353474. NFC. llvm-svn: 353476	2019-02-07 22:33:50 +00:00
Dan Gohman	29874cea31	[WebAssembly] Fix imported function symbol names that differ from their import names in the .o format Add a flag to allow symbols to have a wasm import name which differs from the linker symbol name, allowing the linker to link code using the import_module attribute. This is the MC/Object portion of the patch. Differential Revision: https://reviews.llvm.org/D57632 llvm-svn: 353474	2019-02-07 22:03:32 +00:00
Quentin Colombet	96f54de8ff	[InstCombine] Optimize `atomicrmw <op>, 0` into `load atomic` when possible This commit teaches InstCombine how to replace an atomicrmw operation into a simple load atomic. For a given `atomicrmw <op>`, this is possible when: 1. The ordering of that operation is compatible with a load (i.e., anything that doesn't have a release semantic). 2. <op> does not modify the value being stored Differential Revision: https://reviews.llvm.org/D57854 llvm-svn: 353471	2019-02-07 21:27:23 +00:00
Peter Collingbourne	82bf8e82c9	gn build: Make check-{clang,lld,llvm} pass on FreeBSD. Mostly achieved by assuming that anything that isn't Win or Mac is ELF, which seems reasonable enough for now. Differential Revision: https://reviews.llvm.org/D57870 llvm-svn: 353470	2019-02-07 21:24:30 +00:00
Florian Hahn	f557a94aa3	[LV] Remove unnecessary assignment to UserIC. llvm-svn: 353469	2019-02-07 21:23:37 +00:00
Sanjay Patel	781d883862	[InstCombine] Fix crashing from (icmp (bitcast ([su]itofp X)), Y) This fixes a class of bugs introduced by D44367, which transforms various cases of icmp (bitcast ([su]itofp X)), Y to icmp X, Y. If the bitcast is between vector types with a different number of elements, the current code will produce bad IR along the lines of: icmp <N x i32> ..., <M x i32> <...>. This patch suppresses the transform if the bitcast changes the number of vector elements. Patch by: @AndrewScheidecker (Andrew Scheidecker) Differential Revision: https://reviews.llvm.org/D57871 llvm-svn: 353467	2019-02-07 21:12:01 +00:00
Adrian Prantl	e794db8817	Move SMTSolver dump() methods out-of-line. This broke modularized non-local-submodule-visibility builds because the function bodies pulled in extra dependencies. llvm-svn: 353465	2019-02-07 21:03:18 +00:00
Nikita Popov	9d7e86a978	[CodeGen] Handle vector UADDO, SADDO, USUBO, SSUBO This is part of https://bugs.llvm.org/show_bug.cgi?id=40442. Vector legalization is implemented for the add/sub overflow opcodes. UMULO/SMULO are also handled as far as legalization is concerned, but they don't support vector expansion yet (so no tests for them). The vector result widening implementation is suboptimal, because it could result in a legalization loop. Differential Revision: https://reviews.llvm.org/D57639 llvm-svn: 353464	2019-02-07 21:02:22 +00:00
Shoaib Meenai	be9b65d89d	[cmake] Pass LLVM_TEMPORARILY_ALLOW_OLD_TOOLCHAIN to NATIVE configure We should propagate this down to host builds so that e.g. people using an optimized tablegen can do the sub-configure successfully. llvm-svn: 353463	2019-02-07 20:58:04 +00:00
Sanjay Patel	e7f46c3db3	[InstCombine] refactor folds for (icmp (bitcast X), Y); NFCI llvm-svn: 353462	2019-02-07 20:54:09 +00:00
Florian Hahn	ba5acbc4fe	[LV] Prevent interleaving if computeMaxVF returned None. As discussed in D57382, interleaving should be avoided if computeMaxVF returns None, same as we currently do for vectorization. Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=6477 Reviewers: Ayal, dcaballe, hsaito, mkuper, rengolin Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D57837 llvm-svn: 353461	2019-02-07 20:49:10 +00:00
Matt Arsenault	e98cab11d7	GlobalISel: Try to fix bot failures Don't rely on order of evaluation of function arguments. llvm-svn: 353460	2019-02-07 20:44:08 +00:00
Simon Pilgrim	fe3ac70b18	[DAGCombiner] (add (umax X, C), -C) --> (usubsat X, C) (PR40111) Move the (add (umax X, C), -C) --> (usubsat X, C) X86 combine into generic DAGCombiner First of a number of saturated arithmetic folds that can be moved out of X86-specific code for PR40111. Differential Revision: https://reviews.llvm.org/D57754 llvm-svn: 353457	2019-02-07 20:14:43 +00:00
Matt Arsenault	fbec8fe93b	GlobalISel: Implement narrowScalar for shift main type This is pretty much directly ported from SelectionDAG. Doesn't include the shift by non-constant but known bits version, since there isn't a globalisel version of computeKnownBits yet. This shows a disadvantage of targets not specifically which type should be used for the shift amount. If type 0 is legalized before type 1, the operations on the shift amount type use the wider type (which are also less likely to legalize). This can be avoided by targets specifying legalization actions on type 1 earlier than for type 0. llvm-svn: 353455	2019-02-07 19:37:44 +00:00
Matt Arsenault	d914189a2e	AMDGPU/GlobalISel: Restrict g_implicit_def legality llvm-svn: 353452	2019-02-07 19:10:15 +00:00
Matt Arsenault	d6212f9f1b	GlobalISel: Fix artifact combiner constant legality checks for vectors Since G_CONSTANT is illegal for vectors, this needs to check what buildConstant will produce for a splat vector. llvm-svn: 353449	2019-02-07 18:58:28 +00:00
Matt Arsenault	60b33fb6fc	AMDGPU/GlobalISel: Don't use g_implicit_def in a few tests llvm-svn: 353443	2019-02-07 18:33:22 +00:00
Nirav Dave	9332fc2e19	Revert "[DAG] Cleanup of unused node in SimplifySelectCC." Causes ASAN use-after-poison errors. llvm-svn: 353442	2019-02-07 18:31:05 +00:00
Reid Kleckner	f21c022380	[InstrProf] Avoid reconstructing Triple, NFC llvm-svn: 353439	2019-02-07 18:16:22 +00:00
Matt Arsenault	c0f7569aab	AMDGPU/GlobalISel: Legalize fsqrt llvm-svn: 353438	2019-02-07 18:14:39 +00:00
Matt Arsenault	93fdec739b	AMDGPU/GlobalISel: Legalize some f16 operations llvm-svn: 353436	2019-02-07 18:03:11 +00:00
Teresa Johnson	c36c10ddfb	[HotColdSplit] With PGO add profile entry metadata to split cold function Summary: When compiling with profile data, ensure the split cold function gets cold function_entry_count metadata (just use 0 since it should be cold). Otherwise with function sections it will not be placed in the unlikely text section with other cold code. Reviewers: vsk Subscribers: sebpop, hiraditya, davidxl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57900 llvm-svn: 353434	2019-02-07 17:50:35 +00:00
Sanjay Patel	2d4b186844	[DAGCombiner] fold add/sub with bool operand based on target's boolean contents I noticed that we are missing this canonicalization in IR: rL352515 ...and then realized that we don't get this right in SDAG either, so this has to be fixed first regardless of what we choose to do in IR. The existing fold was limited to scalars and using the wrong predicate to guard the transform. We have a boolean contents TLI query that can be used to decide which direction to fold. This may eventually lead back to the problems/question in: https://bugs.llvm.org/show_bug.cgi?id=40486 ...but it makes no difference to that yet. Differential Revision: https://reviews.llvm.org/D57401 llvm-svn: 353433	2019-02-07 17:43:34 +00:00
Matt Arsenault	c83b82363c	GlobalISel: Implement fewerElementsVector for shifts Introduce a new function which handles instructions with multiple type indices, but have the same number of vector elements. Also legalize v2s16 shifts when applicable. llvm-svn: 353432	2019-02-07 17:38:00 +00:00
Matt Arsenault	91be65be65	GlobalISel: Try to make legalize rules more useful for vectors Mostly keep the existing functions on scalars, but add versions which also operate based on the vector element size. llvm-svn: 353430	2019-02-07 17:25:51 +00:00
Nirav Dave	24e60819f6	[DAG] Cleanup of unused node in SimplifySelectCC. llvm-svn: 353428	2019-02-07 17:13:55 +00:00
Sanjay Patel	a5c4a5e958	[x86] split more 256/512-bit shuffles in lowering This is intentionally a small step because it's hard to know exactly where we might introduce a conflicting transform with the code that tries to form wider shuffles. But I think this is safe - if we have a wide shuffle with 2 operands, then we should do better with an extract + narrow shuffle. Differential Revision: https://reviews.llvm.org/D57867 llvm-svn: 353427	2019-02-07 17:10:49 +00:00
Nirav Dave	4b12236f7d	[DAG] Cleanup unused node on failed SELECT Combine. llvm-svn: 353426	2019-02-07 16:57:50 +00:00
Jordan Rupprecht	bf990ab5aa	[llvm-ar][libObject] Fix relative paths when nesting thin archives. Summary: When adding one thin archive to another, we currently chop off the relative path to the flattened members. For instance, when adding `foo/child.a` (which contains `x.txt`) to `parent.a`, when flattening it we should add it as `foo/x.txt` (which exists) instead of `x.txt` (which does not exist). As a note, this also undoes the `IsNew` parameter of handling relative paths in r288280. The unit test there still passes. This was reported as part of testing the kernel build with llvm-ar: https://patchwork.kernel.org/patch/10767545/ (see the second point). Reviewers: mstorsjo, pcc, ruiu, davide, david2050 Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57842 llvm-svn: 353424	2019-02-07 16:41:06 +00:00
Nirav Dave	84e5bf0c95	[X86] Simplify casing. NFC. llvm-svn: 353417	2019-02-07 15:43:40 +00:00
Nirav Dave	724b81087d	[DAG] Cleanup unused nodes on failed store-to-load forward combine. llvm-svn: 353416	2019-02-07 15:38:14 +00:00
Alexandre Ganea	120366edc7	[CodeView] Fix cycles in debug info when merging Types with global hashes When type streams with forward references were merged using GHashes, cycles were introduced in the debug info. This was caused by GlobalTypeTableBuilder::insertRecordAs() not inserting the record on the second pass, thus yielding an empty ArrayRef at that record slot. Later on, upon PDB emission, TpiStreamBuilder::commit() would skip that empty record, thus offseting all indices that came after in the stream. This solution comes in two steps: 1. Fix the hash calculation, by doing a multiple-step resolution, iff there are forward references in the input stream. 2. Fix merge by resolving with multiple passes, therefore moving records with forward references at the end of the stream. This patch also adds support for llvm-readoj --codeview-ghash. Finally, fix dumpCodeViewMergedTypes() which previously could reference deleted memory. Fixes PR40221 Differential Revision: https://reviews.llvm.org/D57790 llvm-svn: 353412	2019-02-07 15:24:18 +00:00
Fangrui Song	e39b57386b	Fix misspelled filenames in file headers llvm-svn: 353408	2019-02-07 14:38:25 +00:00
Sam Parker	67756c09f2	[LSR] Generate cross iteration indexes Modify GenerateConstantOffsetsImpl to create offsets that can be used by indexed addressing modes. If formulae can be generated which result in the constant offset being the same size as the recurrence, we can generate a pre-indexed access. This allows the pointer to be updated via the single pre-indexed access so that (hopefully) no add/subs are required to update it for the next iteration. For small cores, this can significantly improve performance DSP-like loops. Differential Revision: https://reviews.llvm.org/D55373 llvm-svn: 353403	2019-02-07 13:32:54 +00:00
Diana Picus	75a04e2a77	[ARM GlobalISel] Support G_ICMP for Thumb2 Mark as legal and use the t2* equivalents of the arm mode instructions, e.g. t2CMPrr instead of plain CMPrr. llvm-svn: 353392	2019-02-07 11:05:33 +00:00
David Green	7e6da81633	[ARM] Reformat isRedundantFlagInstr for D57833. NFC llvm-svn: 353386	2019-02-07 10:51:04 +00:00
Jiong Wang	66b18e5755	[BPF] add code-gen support for JMP32 instructions JMP32 instructions has been added to eBPF ISA. They are 32-bit variants of existing BPF conditional jump instructions, but the comparison happens on low 32-bit sub-register only, therefore some unnecessary extensions could be saved. JMP32 instructions will only be available for -mcpu=v3. Host probe hook has been updated accordingly. JMP32 instructions will only be enabled in code-gen when -mattr=+alu32 enabled, meaning compiling the program using sub-register mode. For JMP32 encoding, it is a new instruction class, and is using the reserved eBPF class number 0x6. This patch has been tested by compiling and running kernel bpf selftests with JMP32 enabled. Acked-by: Yonghong Song <yhs@fb.com> Signed-off-by: Jiong Wang <jiong.wang@netronome.com> llvm-svn: 353384	2019-02-07 10:43:09 +00:00
Tim Northover	638110a208	AArch64: implement copy for paired GPR registers. When doing 128-bit atomics using CASP we might need to copy a GPRPair to a different register, but that was unimplemented up to now. llvm-svn: 353383	2019-02-07 10:35:34 +00:00
Craig Topper	428c14d1db	[BranchFolding] Remove dead code for handling EHPad blocks Summary: This code tries to handle the case where IBB is an EHPad, but there's an earlier check that uses PBB->hasEHPadSuccessor(). Where PBB is a predecessor of IBB. The hasEHPadSuccessor function would have visited IBB and seen that it was an EHPad and returned false. This would prevent us from reaching this code with IBB as an EHPad. Looks like this code was originally added in rL37427 (ancient) and made dead in rL143001. Reviewers: rnk, void, efriedma Reviewed By: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D57358 llvm-svn: 353375	2019-02-07 06:21:28 +00:00
JF Bastien	388cefa78d	Bump minimum toolchain version Summary: The RFC on moving past C++11 got good traction: http://lists.llvm.org/pipermail/llvm-dev/2019-January/129452.html This patch therefore bumps the toolchain versions according to our policy: llvm.org/docs/DeveloperPolicy.html#toolchain Subscribers: mgorny, jkorous, dexonsmith, llvm-commits, mehdi_amini, jyknight, rsmith, chandlerc, smeenai, hans, reames, lattner, lhames, erichkeane Differential Revision: https://reviews.llvm.org/D57264 llvm-svn: 353374	2019-02-07 05:20:00 +00:00
Mikhail R. Gadelha	eac500f0c3	Move the SMT API to LLVM Moved everything SMT-related to LLVM and updated the cmake scripts. Differential Revision: https://reviews.llvm.org/D54978 llvm-svn: 353373	2019-02-07 03:19:45 +00:00
Peter Collingbourne	c449409533	gn build: Merge the test part of r353237. llvm-svn: 353369	2019-02-07 02:40:49 +00:00
Sam Clegg	847b92947e	[WebAssembly] Update test output after rL353357. NFC. llvm-svn: 353368	2019-02-07 02:35:22 +00:00
Brad Smith	01227fea9e	Add OpenBSD support to be able to get the thread name llvm-svn: 353367	2019-02-07 02:06:58 +00:00
Sam Clegg	d6ef8da317	[WebAssembly] Add symbol flag to the binary format llvm.used Summary: Rather than add a new attribute See https://github.com/WebAssembly/tool-conventions/issues/64 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57864 llvm-svn: 353360	2019-02-07 01:24:44 +00:00
Eric Christopher	40b1c07462	Fix a minor grammar thinko. llvm-svn: 353359	2019-02-07 01:22:07 +00:00
Sam Clegg	e450bd7a9d	[WebAssembly] Expand symbol flags shown by llvm-objdump --symbols Differential Revision: https://reviews.llvm.org/D57861 llvm-svn: 353357	2019-02-07 01:17:34 +00:00
Shoaib Meenai	18f0bd78e2	[cmake] Drop clang-tools-extra from LLVM_ALL_PROJECTS We iterate over the list and only enable projects from that list that are present in LLVM_ENABLE_PROJECTS and disable all other projects. Most users will only specify clang in LLVM_ENABLE_PROJECTS and expect clang-tools-extra to be implicitly enabled, so remove clang-tools-extra from LLVM_ALL_PROJECTS so that it doesn't get disabled instead. llvm-svn: 353354	2019-02-07 01:12:56 +00:00
Sam Clegg	1e71b04af6	Remove reference to non-existent function. NFC. This comment is old. The code in question was removed in rL203174 Differential Revision: https://reviews.llvm.org/D57856 llvm-svn: 353352	2019-02-07 00:11:43 +00:00
Jordan Rupprecht	db5036504e	[llvm-ar] Remove leading slash when printing thin archive members Reviewers: ruiu Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57845 llvm-svn: 353347	2019-02-06 21:50:45 +00:00
Shoaib Meenai	351314a14f	[cmake] Add all subprojects to LLVM_ALL_PROJECTS Make LLVM_ALL_PROJECTS reflect all top-level directories in the monorepo rather than an arbitrary subset. clang-tools-extra is technically unnecessary since it gets enabled by clang, but having it there for consistency shouldn't hurt either. Differential Revision: https://reviews.llvm.org/D57843 llvm-svn: 353346	2019-02-06 21:49:47 +00:00
Roland Froese	42f58498c5	[PowerPC] Add vector truncate test to prep for D56507 NFC llvm-svn: 353344	2019-02-06 21:34:44 +00:00
Shoaib Meenai	af8eadd94e	[cmake] Add openmp to LLVM_ALL_PROJECTS It'll get ignored in LLVM_ENABLE_PROJECTS after r353148 otherwise. llvm-svn: 353343	2019-02-06 21:08:17 +00:00
Jordan Rupprecht	d3a7e9d153	[libObject][NFC] Include filename in error message llvm-svn: 353341	2019-02-06 20:51:04 +00:00
Alina Sbirlea	6cba96ed52	[LICM/MSSA] Add promotion to scalars by building an AliasSetTracker with MemorySSA. Summary: Experimentally we found that promotion to scalars carries less benefits than sinking and hoisting in LICM. When using MemorySSA, we build an AliasSetTracker on demand in order to reuse the current infrastructure. We only build it if less than AccessCapForMSSAPromotion exist in the loop, a cap that is by default set to 250. This value ensures there are no runtime regressions, and there are small compile time gains for pathological cases. A much lower value (20) was found to yield a single regression in the llvm-test-suite and much higher benefits for compile times. Conservatively we set the current cap to a high value, but we will explore lowering it when MemorySSA is enabled by default. Reviewers: sanjoy, chandlerc Subscribers: nemanjai, jlebar, Prazek, george.burgess.iv, jfb, jsji, llvm-commits Differential Revision: https://reviews.llvm.org/D56625 llvm-svn: 353339	2019-02-06 20:25:17 +00:00
Nirav Dave	b3506bf985	[DAG] Immediately cleanup unused nodes from extend-based combines. llvm-svn: 353338	2019-02-06 20:12:03 +00:00
Michael Berg	f0d81a31b6	Move IR flag handling directly into builder calls for cases translated from Instructions in GlobalIsel Reviewers: aditya_nandakumar, volkan Reviewed By: aditya_nandakumar Subscribers: rovka, kristof.beyls, volkan, Petar.Avramovic Differential Revision: https://reviews.llvm.org/D57630 llvm-svn: 353336	2019-02-06 19:57:06 +00:00
Alina Sbirlea	910c6bef3e	[AliasSetTracker] Pass MustAlias to addPointer more often. Summary: Pass the alias info to addPointer when available. Will save an alias() call for must sets when adding a known Must or May alias. [Part of a series of cleanup patches] Reviewers: reames, mkazantsev Subscribers: sanjoy, jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D56613 llvm-svn: 353335	2019-02-06 19:55:12 +00:00
Craig Topper	1c7ee20819	[X86] Change the CPU on the test case for pr40529.ll to really show the bug. NFC llvm-svn: 353334	2019-02-06 19:50:59 +00:00
Nirav Dave	c6bfa103a5	[X86][DAG] Avoid creating dangling bitcast. combineExtractWithShuffle may leave a dangling bitcast which may prevent further optimization in later passes. Avoid constructing it unless it is used. llvm-svn: 353333	2019-02-06 19:45:47 +00:00
Sanjay Patel	29a710be6a	[x86] add tests for horizontal ops (PR38971, PR33758); NFC llvm-svn: 353332	2019-02-06 19:40:11 +00:00
Jonas Paulsson	b21dde0530	[SystemZ] Improved handling of the @llvm.ctlz intrinsic. Since SystemZ supports counting of leading zeros with the FLOGR instruction, isCheapToSpeculateCtlz() should return true, which it now does. ISD::CTLZ_ZERO_UNDEF i32 is now handled the same way as ISD::CTLZ is, which is needed since promotion to i64 is required and CTLZ_ZERO_UNDEF is only expanded to CTLZ if it is Legal or Custom. Review: Ulrich Weigand https://reviews.llvm.org/D57710 llvm-svn: 353330	2019-02-06 19:23:31 +00:00
Peter Collingbourne	02fc3c696c	build: Remove the cmake check for malloc.h. As far as I can tell, malloc.h is only being used here to provide a definition of mallinfo (malloc itself is declared in stdlib.h via cstdlib). We already have a macro for whether mallinfo is available, so switch to using that instead. Differential Revision: https://reviews.llvm.org/D57807 llvm-svn: 353329	2019-02-06 19:20:47 +00:00

... 4 5 6 7 8 ...

175390 Commits