llvm-project

Commit Graph

Author	SHA1	Message	Date
Teresa Johnson	a4ce3bfdda	[PGO] Function section hotness prefix should look at all blocks Summary: The function section prefix for PGO based layout (e.g. hot/unlikely) should look at the hotness of all blocks not just the entry BB. A function with a cold entry but a very hot loop should be placed in the hot section, for example, so that it is located close to other hot functions it may call. For SamplePGO it was already looking at the branch weights on calls, and I made that code conditional on whether this is SamplePGO since it was essentially a noop for instrumentation PGO anyway. Reviewers: davidxl Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D41395 llvm-svn: 321197	2017-12-20 17:53:10 +00:00
Florian Hahn	012c8f97b2	[InstCombine] Add debug location to new caller. Reviewers: rnk, aprantl, majnemer Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D414 llvm-svn: 321191	2017-12-20 17:16:59 +00:00
Nemanja Ivanovic	b55b0ac160	[JumpTables] Let targets decide which switch instructions are suitable This commits the non-controversial part of https://reviews.llvm.org/D41029 (making the queries virtual). The PPC-specific portion of this will be committed in a subsequent patch once some of the finer points are ironed out. llvm-svn: 321182	2017-12-20 15:44:32 +00:00
Mohammad Shahid	3a934d6ab9	Revert r320548:[SLP] Vectorize jumbled memory loads llvm-svn: 321181	2017-12-20 15:26:59 +00:00
Krzysztof Parzyszek	3257e44c66	Add optional SelectionDAG* parameter to SDValue::dump and SDValue::dumpr These functions simply call their counterparts in the associated SDNode, which do take an optional SelectionDAG. This change makes the legalization debug trace a little easier to read, since target-specific nodes will now have their names shown instead of "Unknown node #123". llvm-svn: 321180	2017-12-20 15:15:04 +00:00
Javed Absar	deca635e45	[SCEV] Fix Typo. NFC. llvm-svn: 321179	2017-12-20 15:06:26 +00:00
Alexey Bataev	88fb980a7c	[NVPTX] Initial adaptation of MCAsmStreamer/MCTargetStreamer for debug info in Cuda. Summary: Initial changes in interfaces of MCAsmStreamer/MCTargetStreamer for correct debug info emission for Cuda. 1. PTX format does not support `.ascii` directives. Added the ability to nullify it. 2. The initial function label must follow the first debug `.loc` directive, not precede it. 3. DWARF sections must be enclosed in braces. Reviewers: hfinkel, probinson, jlebar, rafael, echristo Subscribers: sdardis, nemanjai, llvm-commits, aprantl Differential Revision: https://reviews.llvm.org/D40033 llvm-svn: 321178	2017-12-20 14:55:10 +00:00
Krzysztof Parzyszek	8f6b0c850a	[Hexagon] Adjust the value type for BCvt in LowerFormalArguments llvm-svn: 321177	2017-12-20 14:44:05 +00:00
Daniel Sanders	32de8bbd30	[globalisel][tablegen] Allow ImmLeaf predicates to use InstructionSelector members NFC for currently supported targets. This resolves a problem encountered by targets such as RISCV that reference `Subtarget` in ImmLeaf predicates. llvm-svn: 321176	2017-12-20 14:41:51 +00:00
Ilya Biryukov	75db08124c	Allow to apply cherry-picks when building Docker images. Reviewers: mehdi_amini, ioeric, klimek Reviewed By: ioeric Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41393 llvm-svn: 321175	2017-12-20 14:39:07 +00:00
Florian Hahn	467abe3e4f	[LV] Remove unnecessary DoExtraAnalysis guard (silent bug) canVectorize is only checking if the loop has a normalized pre-header if DoExtraAnalysis is true. This doesn't make sense to me because reporting analysis information shouldn't alter legality checks. This is probably the result of a last minute minor change before committing (?). Patch by Diego Caballero. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D40973 llvm-svn: 321172	2017-12-20 13:28:38 +00:00
Simon Pilgrim	a50eec0293	[X86][AVX2] Split more shuffle tests into 'slow' and 'fast' variable shuffles llvm-svn: 321171	2017-12-20 13:12:34 +00:00
Sander de Smalen	a2f3bed642	Trivial commit to force LLVM to run TableGen for Mips target after a change to the AsmMatcherEmitter, and should fix the buildbot failure on llvm-clang-x86_64-expensive-checks-win. The issue is also described here: http://lists.llvm.org/pipermail/llvm-dev/2017-December/119617.html llvm-svn: 321170	2017-12-20 12:45:40 +00:00
Florian Hahn	3cfdaa30e2	[TargetParser] Check size before accessing architecture version. Summary: This fixes a crash when invalid -march options like `armv` are provided. Based on a patch by Will Lovett. Reviewers: rengolin, samparker, mcrosier Reviewed By: samparker Subscribers: aemerson, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D41429 llvm-svn: 321166	2017-12-20 11:32:43 +00:00
Diana Picus	75ce852abe	[ARM GlobalISel] Fix assertion in RegBankSelect We get an assertion in RegBankSelect for code along the lines of my_32_bit_int = my_64_bit_int, which tends to translate into a 64-bit load, followed by a G_TRUNC, followed by a 32-bit store. This appears in a couple of places in the test-suite. At the moment, the legalizer doesn't distinguish between integer and floating point scalars, so a 64-bit load will be marked as legal for targets with VFP, and so will the rest of the sequence, leading to a slightly bizarre G_TRUNC reaching RegBankSelect. Since the current support for 64-bit integers is rather immature, this patch works around the issue by explicitly handling this case in RegBankSelect and InstructionSelect. In the future, we may want to revisit this decision and make sure 64-bit integer loads are narrowed before reaching RegBankSelect. llvm-svn: 321165	2017-12-20 11:27:10 +00:00
Florian Hahn	c3aa6d83fd	[ARM] Lower unsigned saturation to USAT Summary: Implement lowering of unsigned saturation on an interval [0, k] where k + 1 is a power of two using the USAT instruction, in a similar way to how [~k, k] is lowered using SSAT on ARM models that support it. Patch by Marten Svanfeldt Reviewers: t.p.northover, pbarrio, eastig, SjoerdMeijer, javed.absar, fhahn Reviewed By: fhahn Subscribers: fhahn, aemerson, javed.absar, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D41348 llvm-svn: 321164	2017-12-20 11:13:57 +00:00
Sander de Smalen	cd6be960ce	[AArch64][SVE] Re-submit patch series for ZIP1/ZIP2 This patch resubmits the SVE ZIP1/ZIP2 patch series consisting of r320992, r320986, r320973, and r320970 by reverting https://reviews.llvm.org/rL321024. The issue that caused r321024 has been addressed in https://reviews.llvm.org/rL321158, so this patch series should be safe to resubmit. llvm-svn: 321163	2017-12-20 11:02:42 +00:00
Tim Northover	6db5d027c6	AArch64: fix one more place movi.2d could be created. Somehow got missed out of r320965. llvm-svn: 321162	2017-12-20 10:45:39 +00:00
Bjorn Steinbrink	030123e8e8	Give up on array allocas in getPointerDereferenceableBytes Summary: As suggested by Eli Friedman, don't try to handle array allocas here, because of possible overflows, instead rely on instcombine converting them to allocations of array types. Reviewers: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41398 llvm-svn: 321159	2017-12-20 10:01:30 +00:00
Sander de Smalen	c067c30d9e	[AArch64] Asm: Fix parsing of register aliases that have a name starting with 'z' Summary: This fixes an issue as identified by @rnk in https://reviews.llvm.org/rL321029. Reviewers: rnk, fhahn, rengolin, efriedma, echristo, olista01 Reviewed By: rnk, fhahn Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits, rnk Differential Revision: https://reviews.llvm.org/D41382 llvm-svn: 321158	2017-12-20 09:45:45 +00:00
Sam Parker	daed9de622	[AArch64] CCSIDR2 system register Implement the 'Current Cache Size' register that has been introduced as part of the Armv8.3 architecture. I originally missed this, and (hopefully) should be the final patch for assembler support. Differential Revision: https://reviews.llvm.org/D41396 llvm-svn: 321155	2017-12-20 08:56:41 +00:00
Gadi Haber	0ae485c581	[X86][CLFLUSH]: Adding full coverage of MC encoding for the CLFLUSH isa sets.<NFC> NFC. Adding MC regressions tests to cover the CLFLSH and CLFLUSHOPT isa sets. This patch is part of a larger task to cover MC encoding of all X86 isa sets started in revision: https://reviews.llvm.org/D39952 Reviewers: zvi, RKSimon, craig.topper, m_zuckerman Differential Revision: https://reviews.llvm.org/D41331 Change-Id: Ifa643dd52f1b7184c52bc1806038dc74b234fc65 llvm-svn: 321153	2017-12-20 08:28:24 +00:00
Craig Topper	abed821c36	[X86] Optimize sign extends on index operand to gather/scatter to not sign extend past i32. The gather instruction will implicitly sign extend to the pointer width, we don't need to further extend it. This can prevent unnecessary splitting in some cases. There's still an issue that lowering on non-VLX can introduce another sign extend that doesn't get combined with shifts from a lowered sign_extend_inreg. llvm-svn: 321152	2017-12-20 07:36:59 +00:00
Martin Storsjo	2778fd0b59	[AArch64] Implement stack probing for windows Differential Revision: https://reviews.llvm.org/D41131 llvm-svn: 321150	2017-12-20 06:51:45 +00:00
Craig Topper	158d54d954	[X86] Add a missing return to combineGatherScatter after successful combine. Not sure how to test this because I think the worst that happens is that we don't revisit the node a second time to look for additional combines. We used UpdateNodeOperands so the work of updating the DAG was already done. llvm-svn: 321148	2017-12-20 06:44:50 +00:00
Hiroshi Inoue	11e571e0c6	[PowerPC] fix a bug in redundant compare elimination This patch fixes a bug in the redundant compare elimination reported in https://reviews.llvm.org/rL320786 and re-enables the optimization. The redundant compare elimination assumes that we can replace signed comparison with unsigned comparison for the equality check. But due to the difference in the sign extension behavior we cannot change the opcode if the comparison is against an immediate and the most significant bit of the immediate is one. Differential Revision: https://reviews.llvm.org/D41385 llvm-svn: 321147	2017-12-20 05:18:19 +00:00
Dan Gohman	aa3922819e	[memcpyopt] Teach memcpyopt to optimize across basic blocks This teaches memcpyopt to make a non-local memdep query when a local query indicates that the dependency is non-local. This notably allows it to eliminate many more llvm.memcpy calls in common Rust code, often by 20-30%. This is r319482 and r319483, along with fixes for PR35519: fix the optimization that merges stores into memsets to preserve cached memdep info, and fix memdep's non-local caching strategy to not assume that larger queries are always more conservative than smaller ones. Fixes PR28958 and PR35519. Differential Revision: https://reviews.llvm.org/D40802 llvm-svn: 321138	2017-12-20 01:36:25 +00:00
Craig Topper	b1ae03fd61	[X86] Improve coverage of fma negations. llvm-svn: 321137	2017-12-20 01:26:36 +00:00
Craig Topper	171fb15786	[X86] Fix probable typo in fma fneg test. llvm-svn: 321136	2017-12-20 01:26:35 +00:00
Craig Topper	aee3acb9a8	[X86] Remove code from combineSext that looks for MVT::i1 after operation legalization which can never happen. Type legalization guarantees this to be impossible since MVT::i1 isn't a legal type. llvm-svn: 321132	2017-12-20 01:00:01 +00:00
Dan Gohman	b5f53449e4	[WebAssembly] Disable tee_local optimizations when targeting the ELF ABI. These optimizations depend on the ExplicitLocals pass to lower TEE instructions, which is disabled in the ELF ABI, so disable them too. llvm-svn: 321131	2017-12-20 00:59:28 +00:00
Dan Gohman	83b162269f	[WebAssembly] Remove an obsolete comment. llvm-svn: 321127	2017-12-20 00:10:28 +00:00
Adrian McCarthy	86795d9166	Revert "Fix faulty assertion in debug info" This reverts commit e32def3f7ebe1136b7038336eff56a415a962bf2. llvm-svn: 321125	2017-12-19 23:34:37 +00:00
Adrian McCarthy	2ed8f36834	Fix faulty assertion in debug info It appears the code uses nullptr to represent a void type in debug metadata, which led to an assertion failure when building DeltaAlgorithm.cpp with a self-hosted clang on Windows. I'm not sure why/if the problem was Windows-specific. Fixes bug https://bugs.llvm.org/show_bug.cgi?id=35543 Differential Revision: https://reviews.llvm.org/D41264 llvm-svn: 321122	2017-12-19 23:01:17 +00:00
Craig Topper	fbdb236a8a	[X86] Add an assert to indicate that there is only one specific VT allowed at a certain point in LowerMULH. Helps with code readability a little. llvm-svn: 321118	2017-12-19 22:38:09 +00:00
Adrian Prantl	0e6694d111	Silence a bunch of implicit fallthrough warnings llvm-svn: 321114	2017-12-19 22:05:25 +00:00
Francis Visoiu Mistrih	f81727d138	[CodeGen] Move printing MO_BlockAddress operands to MachineOperand::print Work towards the unification of MIR and debug output by refactoring the interfaces. llvm-svn: 321113	2017-12-19 21:47:14 +00:00
Francis Visoiu Mistrih	cb2683d46a	[CodeGen] Move printing MO_IntrinsicID operands to MachineOperand::print Work towards the unification of MIR and debug output by refactoring the interfaces. llvm-svn: 321112	2017-12-19 21:47:10 +00:00
Francis Visoiu Mistrih	bbd610ae92	[CodeGen] Move printing MO_IntrinsicID operands to MachineOperand::print Work towards the unification of MIR and debug output by refactoring the interfaces. Also add support for printing with a null TargetIntrinsicInfo and no MachineFunction. llvm-svn: 321111	2017-12-19 21:47:05 +00:00
Francis Visoiu Mistrih	3b265c8fcf	[CodeGen] Move printing MO_FPImmediate operands to MachineOperand::print Work towards the unification of MIR and debug output by refactoring the interfaces. llvm-svn: 321110	2017-12-19 21:47:00 +00:00
Francis Visoiu Mistrih	8122660226	[CodeGen] Refactor printOffset from MO and MIRPrinter llvm-svn: 321109	2017-12-19 21:46:55 +00:00
Haicheng Wu	0be8825146	[CGP] Format. NFC Clang-format. llvm-svn: 321107	2017-12-19 20:53:32 +00:00
Matthias Braun	d2d7fb63f7	TargetLoweringBase: Fix darwinHasSinCos() Another followup to my refactoring in r321036: Turns out we can end up with an x86 darwin target that is not macos (simulator triples can look like i386-apple-ios) so we need the x86/32bit check in all cases. llvm-svn: 321104	2017-12-19 20:24:12 +00:00
Jonas Devlieghere	b2a03193dd	[dwarfdump][test] Add test case for r321064 Verify that -lookup takes a 64-bit address. llvm-svn: 321101	2017-12-19 19:42:32 +00:00
Mark Searles	e4f067ebe2	[AMDGPU] Turn off MergeConsecutiveStores() before Instruction Selection for AMDGPU. Commit dbbb6c5fc3642987430866dffdf710df4f616ac7 turned on MergeConsecutiveStores() before Instruction Selection for all targets. Enough AMDGPU compiles go into an infinite loop ( MergeConsecutiveStores() merges two stores; LegalizeStoreOps() un-merges; MergeConsecutiveStores() re-merges, etc. ) to warrant turning it off until the issues can be addressed. Differential Revision: https://reviews.llvm.org/D41377 llvm-svn: 321100	2017-12-19 19:26:23 +00:00
Haicheng Wu	5b106ef92e	[SeparateConstOffsetFromGEP] Fix a typo. NFC. do CSE for to => do CSE to llvm-svn: 321098	2017-12-19 18:49:21 +00:00
Simon Pilgrim	7cabb4c384	[X86] Regenerate popcnt tests llvm-svn: 321093	2017-12-19 18:05:13 +00:00
Amara Emerson	b6ddbef673	[GlobalISel][Legalizer] Fix crash when trying to lower G_FNEG of fp128 types. This doesn't add legalizer support, just prevents crashing so that we can gracefully fall back to SDAG. Fixes PR35690. llvm-svn: 321091	2017-12-19 17:21:35 +00:00
Nirav Dave	51425fa5ba	[DAG] Elide overlapping store Summary: Extend overlapping store elision to handle overwrites of stores by larger stores. Nontemporal tests have been modified to add memory dependencies to prevent store elision. Reviewers: craig.topper, rnk, t.p.northover Subscribers: javed.absar, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40969 llvm-svn: 321089	2017-12-19 17:10:56 +00:00
Simon Pilgrim	d873b6f6ba	[X86][AVX512] Attempt target shuffle combining to different types instead of early-out We try to prevent shuffle combining to value types that would stop the folding of masked operations, but by just returning early, we were failing to try different shuffle types. The TODOs are all still relevant here to improve codegen but we're lacking test examples. llvm-svn: 321085	2017-12-19 16:54:07 +00:00
Francis Visoiu Mistrih	874ae6faa5	[CodeGen] Move printing MO_CFIIndex operands to MachineOperand::print Work towards the unification of MIR and debug output by refactoring the interfaces. Before this patch we printed "<call frame instruction>" in the debug output. llvm-svn: 321084	2017-12-19 16:51:52 +00:00
Francis Visoiu Mistrih	348a4208e3	[CFGVPrinter] Fix -dot-cfg-only The refactoring in r281640 made -dot-cfg-only ignore the "-only" part. llvm-svn: 321079	2017-12-19 15:20:18 +00:00
Ben Dunbobbin	688669ad8a	[ThinLTO][C-API] Correct api comments Negative values never disabled the pruning - they simply set high values for the pruning interval. The behaviour now is that negative values set the maximum pruning interval (which appears to have been the intention from the start) see https://reviews.llvm.org/D41231. I have adjusted the comments to reflect this, removed any inaccurate statements, and corrected any typos I spotted in the English. Differential Revision: https://reviews.llvm.org/D41279 llvm-svn: 321078	2017-12-19 14:49:33 +00:00
Ben Dunbobbin	9ecb8b548c	[Support][CachePruning] Disable cache pruning regression fix borked by: rL284966 (see: https://reviews.llvm.org/D25730). Previously, Interval was unsigned (see: CachePruning.h), replacing the type with std::chrono::seconds (which is signed) causes a regression in behaviour because the c-api intends negative values to translate to large positive intervals to effectively disable the pruning (see comments on: setCachePruningInterval()). Differential Revision: https://reviews.llvm.org/D41231 llvm-svn: 321077	2017-12-19 14:42:38 +00:00
Simon Pilgrim	3feaf2a207	[X86] Fix uninitialized variable sanitizer warning from rL321074 llvm-svn: 321076	2017-12-19 14:34:35 +00:00
Haicheng Wu	b3689cabda	[InlineCost] Skip volatile loads when looking for repeated loads This is a follow-up fix of r320814. A test case is also added. llvm-svn: 321075	2017-12-19 13:42:58 +00:00
Simon Pilgrim	fd5df639a3	[X86][SSE] Add cpu feature for aggressive combining to variable shuffles As mentioned in D38318 and D40865, modern Intel processors prefer to combine multiple shuffles to a variable shuffle mask (PSHUFB/VPERMPS etc.) instead of having multiple stage 'fixed' shuffles which put more pressure on Port 5 (at the expense of extra shuffle mask loads). This patch provides a FeatureFastVariableShuffle target flag for Haswell+ CPUs that prefers combining 2 or more fixed shuffles to a single variable shuffle (default is 3 shuffles). The long term aim is to drive more of this from schedule data (probably via the MC) but we're not close to being ready for that yet. Differential Revision: https://reviews.llvm.org/D41323 llvm-svn: 321074	2017-12-19 13:16:43 +00:00
David Green	110844d21c	[ARM] Register the Thumb2SizeReducePass. NFC Also adds a simple test case. llvm-svn: 321072	2017-12-19 12:19:08 +00:00
Pavel Labath	605636d872	[Support] Add WritableMemoryBuffer class Summary: The motivation here is LLDB, where we need to fixup relocations in mmapped files before their contents can be read correctly. The MemoryBuffer class does exactly what we need, except that it maps the file in read-only mode. WritableMemoryBuffer reuses the existing machinery for opening and mmapping a file. The only difference is in the argument to the mapped_file_region constructor -- we create a private copy-on-write mapping, so that we can make changes to the mapped data, but the changes aren't carried over to the underlying file. This patch is based on an initial version by Zachary Turner. Reviewers: mehdi_amini, rnk, rafael, dblaikie, zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40291 llvm-svn: 321071	2017-12-19 12:15:50 +00:00
Simon Pilgrim	f6d4ab6daf	[X86][SSE] Use (V)PHMINPOSUW for vXi8 SMAX/SMIN/UMAX/UMIN horizontal reductions (PR32841) Extension to D39729 which performed this for vXi16, with the same bit flipping to handle SMAX/SMIN/UMAX cases, vXi8 UMIN horizontal reductions can be performed. This makes use of the fact that by performing a pair-wise i8 SHUFFLE/UMIN before PHMINPOSUW, we both get the UMIN of each pair but also zero-extend the upper bits ready for v8i16. Differential Revision: https://reviews.llvm.org/D41294 llvm-svn: 321070	2017-12-19 12:02:40 +00:00
Francis Visoiu Mistrih	2130e6a080	Fix: [YAML] Always double quote UTF-8 characters llvm-svn: 321069	2017-12-19 11:59:28 +00:00
Francis Visoiu Mistrih	f34eea5aa1	[YAML] Always double quote UTF-8 characters llvm-svn: 321068	2017-12-19 11:51:05 +00:00
Simon Dardis	1ade566c45	[mips] Handle the emission of microMIPSr6 sll instruction when used as a nop. This instruction is encoded as zero, so we have to handle that case when checking for unimplemented opcodes when producing the encoding for an instruction. llvm-svn: 321066	2017-12-19 11:16:22 +00:00
Jonas Devlieghere	efb06387b7	[dwarfdump] Lookup needs to be an unsigned long long parameter. Before this patch, dwarfdump's lookup parameter only accepts unsigned. Given that for many current platforms the load address already exceeds unsigned (e.g. arm64 w/ 0x100000000), dwarfdump needs an unsigned long long parameter. Patch by: Dr. Michael 'Mickey' Lauer <mickey@vanille-media.de> llvm-svn: 321064	2017-12-19 09:45:26 +00:00
Max Kazantsev	fd95ee0c9a	[JumpThreading] Restrict PRE across instructions that don't pass control to successors PRE in JumpThreading should not be able to hoist copy of non-speculable loads across instructions that don't always transfer execution to their successors, otherwise they may introduce an unsafe load which otherwise would not be executed. The same problem for GVN was fixed as rL316975. Differential Revision: https://reviews.llvm.org/D40347 llvm-svn: 321063	2017-12-19 09:10:21 +00:00
Igor Laevsky	ce6f2d0190	[FuzzMutate] Don't crash when mutator is unable to find operation Differential Revision: https://reviews.llvm.org/D41009 llvm-svn: 321062	2017-12-19 08:52:51 +00:00
Bjorn Steinbrink	2da4d9d86d	Treat sret arguments as being dereferenceable in getPointerDereferenceableBytes() Reviewers: rnk, hfinkel, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41355 llvm-svn: 321061	2017-12-19 08:46:46 +00:00
Craig Topper	13142b10d5	[X86] Don't extend v16i8 non-uniform shifts to v16i32 if we have BWI. Use v16i16 instead. BWI supports shifting by word amounts. Even if VLX isn't supported we can still widen to v32i16 and extract the lower half. For SKX it's preferable to not use 512-bit vectors if we can. llvm-svn: 321059	2017-12-19 06:59:10 +00:00
Craig Topper	6e3091c265	[X86] Use a specific list of MVTs in combineShiftRightArithmetic instead of iterating over every integer VT and checking their size. Previously, we were checking for MVTs with sizes between 8 and 64 which only includes i8, i16, i32, and i64 today. But I don't think we should assume that and should list the types that are legal for x86. I also don't think we need i64 since type legalization is guaranteed to split those up. llvm-svn: 321058	2017-12-19 06:29:00 +00:00
Craig Topper	eb13a418e1	[X86] Remove unnecessary check for integer VT from combineShiftRightArithmetic. I doubt there's any way to create a ashr for an FP type. llvm-svn: 321057	2017-12-19 06:28:58 +00:00
Craig Topper	da853a9c2f	[X86] Remove dead code for turning vector shifts by large amounts into a zero vector. Pretty sure these are handled by a target independent DAG combine that turns them into undef these days. llvm-svn: 321056	2017-12-19 05:21:50 +00:00
Craig Topper	ad3a554889	[X86] Use ZERO_EXTEND instead of ANY_EXTEND when extending the shift amount for a non-uniform shift. My reading of the SDM says that all bits of the shift amount are used. If the value of the element is larger than the number of bits, the shift result is zero. So I think we need to zero_extend here to avoid garbage in the upper bits. In reality we lower any_extend as zero_extend so in most cases it would be hard to hit this. llvm-svn: 321055	2017-12-19 04:52:04 +00:00
Serguei Katkov	768d6dd087	Fix APFloat from string conversion for Inf The method IEEEFloat::convertFromStringSpecials() does not recognize the "+Inf" and "-Inf" strings but these strings are printed for the double Infinities by the IEEEFloat::toString(). This patch adds the "+Inf" and "-Inf" strings to the list of recognized patterns in IEEEFloat::convertFromStringSpecials(). Re-landing after fix. Reviewers: sberg, bogner, majnemer, timshen, rnk, skatkov, gottesmm, bkramer, scanon, anna Reviewed By: anna Subscribers: mkazantsev, FlameTop, llvm-commits, reames, apilipenko Differential Revision: https://reviews.llvm.org/D38030 llvm-svn: 321054	2017-12-19 04:27:39 +00:00
Quentin Colombet	63a328c30c	[TableGen][GlobalISel] Reset the internal map of RuleMatchers just before the emission Between the creation of the last InstructionMatcher and the first emission of the related Rule, we need to clear the internal map of IDs. We used to do that right after the creation of the main InstructionMatcher when building the rule and although that worked, this is fragile because if for some reason some later code decides to create more InstructionMatchers before the final call to emit, then the IDs would be completely messed up. Move that to the beginning of "emit" so that the IDs are guaranteed to be consistent. NFC. llvm-svn: 321053	2017-12-19 02:57:23 +00:00
Reid Kleckner	73177e71bf	Fix Wasm as a follow up to r321035 and the other one This array is tightly coupled with the .def file. Someone should look into fixing that. llvm-svn: 321050	2017-12-19 01:08:53 +00:00
Justin Bogner	4314f3adc2	update_mir_test_checks: Accept IR as input as well as MIR We need to handle IR for tests that want to do lowering (or just -stop-after with IR as input). I've run this on one AArch64 test to demonstrate what it looks like. llvm-svn: 321048	2017-12-19 00:49:04 +00:00
Jake Ehrlich	e8437de727	[llvm-objcopy] Add option to add a progbits section from a file This change adds support for adding progbits sections with contents from a file Differential Revision: https://reviews.llvm.org/D41212 llvm-svn: 321047	2017-12-19 00:47:30 +00:00
Matthias Braun	e29c0b8862	TargetLoweringBase: Followup to r321035 I missed some prefixes and the fact that on AArch64 we use "bzero" instead of "__bzero" as on X86 when doing my refactoring in r321035. Improve tests for bzero. llvm-svn: 321046	2017-12-19 00:43:00 +00:00
Matthias Braun	92de8b2405	TargetLowering: Fix InitLibcallCallingConvs() overriding things set in InitLibcalls() I missed the fact that the later called InitLibcallCallingConvs() overrides some things set in InitLibcalls() when I did the refactoring in r321036. Fix by merging InitLibcallCallingConvs() into InitLibcalls() and doing the initialization earlier. llvm-svn: 321045	2017-12-19 00:20:33 +00:00
Matthias Braun	a942d62983	TargetLowering: Fix off-by-one error This problem was present for a while, but somehow asan didn't catch it before the refactoring in r321036. llvm-svn: 321043	2017-12-19 00:05:10 +00:00
Sam Clegg	b23a20179a	[llvm-readobj] Dump wasm init functions llvm-svn: 321042	2017-12-19 00:04:41 +00:00
Matthias Braun	0282091c9f	TargetLoweringBase: Remove unnecessary watchos exception; NFC WatchOS isn't reported as iOS (as opposed to tvos) so the exception I added in my last commit wasn't necessary after all. llvm-svn: 321041	2017-12-18 23:33:28 +00:00
Justin Bogner	930a95c269	update_mir_test_checks: Add "mir" to some states and regex names For tests that do lowering we need to support IR as input, so here we clarify some names to avoid ambiguity in upcoming commits. llvm-svn: 321039	2017-12-18 23:31:55 +00:00
Craig Topper	f19121d647	[X86] Don't use NOPL when the assembler is passed an empty CPU string. This recommits the change from r321026. I have a fix for the lld test now. llvm-svn: 321038	2017-12-18 23:31:43 +00:00
Matthias Braun	ef95969e5b	LiveStacks: Rename LiveStack.{h|cpp} to LiveStacks.{h|cpp}; NFC Filenames should match the name of the class they contain. llvm-svn: 321037	2017-12-18 23:19:44 +00:00
Matthias Braun	a4852d2c19	X86/AArch64/ARM: Factor out common sincos_stret logic; NFCI Note: - X86ISelLowering: setLibcallName(SINCOS) was superfluous as InitLibcalls() already does it. - ARMISelLowering: Setting libcallnames for sincos/sincosf seemed superfluous as in the darwin case it wouldn't be used while for all other cases InitLibcalls already does it. llvm-svn: 321036	2017-12-18 23:19:42 +00:00
Matthias Braun	a92cecfbda	AArch64/X86: Factor out common bzero logic; NFC llvm-svn: 321035	2017-12-18 23:14:28 +00:00
Krzysztof Parzyszek	e704583f23	[Hexagon] Cache loads to select to avoid traversing mutating DAG llvm-svn: 321034	2017-12-18 23:13:27 +00:00
Craig Topper	46832126e1	Revert part of r321026 "[X86] Don't use NOPL when the assembler is passed an empty CPU string." while I investigate how to fix an lld test failure. Looks like lld also needs to pass a -mcpu in some of its tests llvm-svn: 321033	2017-12-18 22:20:10 +00:00
Evandro Menezes	687df6380e	[AArch64] Expand test coverage of vector element shuffling to Exynos Make sure that all test cases are run for Exynos as well. Otherwise, NFC. llvm-svn: 321032	2017-12-18 22:17:39 +00:00
Quentin Colombet	eba10cbc88	[TableGen][GlobalISel] Make the arguments of the Instruction and Operand Matchers consistent Move InsnVarID and OpIdx at the beginning of the list of arguments for all the constructors of the OperandMatcher subclasses. This matches what we do for the InstructionMatcher. NFC. llvm-svn: 321031	2017-12-18 22:12:13 +00:00
Bob Haarman	ea5ff9fa6b	Fix buffer overrun in WindowsResourceCOFFWriter::writeSymbolTable() Summary: We were using sprintf(..., "$R%06X", <some uint32_t>) to create strings that are expected to be exactly length 8, but this results in longer strings if the uint32_t is greater than 0xffffff. This change modifies the behavior as follows: - Uses the loop counter instead of the data offset. This gives us sequential symbol names, avoiding collisions as much as possible. - Masks the value to 0xffffff to avoid generating names longer than 8 bytes. - Uses formatv instead of sprintf. Fixes PR35581. Reviewers: ruiu, zturner Reviewed By: ruiu Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D41270 llvm-svn: 321030	2017-12-18 22:10:14 +00:00
Reid Kleckner	8f3c351aa3	Add test for .req directive starting with 'p' Reduced test case from libjpeg_turbo. llvm-svn: 321029	2017-12-18 22:01:18 +00:00
Jessica Paquette	8565d3af84	[MachineOutliner][NFC] Gardening: use std::any_of instead of bool + loop River Riddle suggested to use std::any_of instead of the bool + loop thing on r320229. This commit does that. llvm-svn: 321028	2017-12-18 21:44:52 +00:00
Craig Topper	4802d4e23e	[X86] Don't use NOPL when the assembler is passed an empty CPU string. Update tests to force a CPU with NOPL Empty string should be equivalent to "generic" which doesn't allow NOPL. Force tests to specify 'pentiumpro' to guarantee NOPL. Fixes PR35686 llvm-svn: 321026	2017-12-18 21:37:27 +00:00
Quentin Colombet	34688b9e38	[TableGen][GlobalISel] Refactor optimizeRules related bit to allow code reuse In theory, reapplying optimizeRules on each group matchers should give us a second nesting level on the matching table. In practice, we need more work to make that happen because all the predicates are actually not directly available through the predicate matchers list. NFC. llvm-svn: 321025	2017-12-18 21:25:53 +00:00
Reid Kleckner	37517a2ddd	Revert "[AArch64][SVE] Asm" changes, they broke libjpeg_turbo This reverts changes r320992, r320986, r320973, and r320970. r320970 by itself breaks the test case, and the rest depend on it. Test case will land soon. llvm-svn: 321024	2017-12-18 20:58:25 +00:00
Ivan A. Kosarev	a80c79b5bf	[Analysis] Generate more precise TBAA tags when one access encloses the other There are cases when two tags with different base types denote accesses to the same direct or indirect member of a structure type. Currently, merging of such tags results in a tag that represents an access to an object that has the type of that member. This patch changes this so that if one of the accesses encloses the other, then the generic tag is the one of the enclosed access. Differential Revision: https://reviews.llvm.org/D39557 llvm-svn: 321019	2017-12-18 20:05:20 +00:00
Teresa Johnson	915897e21b	[PGO] Fix handling of cold entry count for instrumented PGO Summary: In r277849, getEntryCount was changed to return None when the entry count was 0, specifically for SamplePGO where it means no samples were recorded. However, for instrumentation PGO a 0 entry count should be returned directly, since it does mean that the function was completely cold. Otherwise we end up treating these functions conservatively in isFunctionEntryCold() and isColdBB(). Instead, for SamplePGO use -1 when there are no samples, and change getEntryCount to return None when the value is -1. Reviewers: danielcdh, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41307 llvm-svn: 321018	2017-12-18 20:02:43 +00:00
Quentin Colombet	ec76d9c47f	[TableGen][GlobalISel] Optimize MatchTable for faster instruction selection * Context * Prior to this patch, the table generated for matching instructions was straightforward but highly inefficient. Basically, each pattern generates its own set of self-contained checks and actions. E.g., TableGen generated: // First pattern CheckNumOperand 3 CheckOpcode G_ADD ... Build ADDrr // Second pattern CheckNumOperand 3 CheckOpcode G_ADD ... Build ADDri // Third pattern CheckNumOperand 3 CheckOpcode G_SUB ... Build SUBrr * Problem * Because of that generation, a lot of checks were redundant between each pattern and were checked every single time until we reach the pattern that matches. E.g., Taking the previous table, let's say we are matching a G_SUB, that means we were going to check all the rules for G_ADD before looking at the G_SUB rule. In particular we are going to do: check 3 operands; PASS check G_ADD; FAIL ; Next rule check 3 operands; PASS (but we already knew that!) check G_ADD; FAIL (well it is still not true) ; Next rule check 3 operands; PASS (really!!) check G_SUB; PASS (at last :P) * Proposed Solution * This patch introduces a concept of group of rules (GroupMatcher) that share some predicates and only get checked once for the whole group. This patch only creates groups with one nesting level. Conceptually there is nothing preventing us from having deeper nesting levels. However, the current implementation is not smart enough to share the recording (aka capturing) of values. That limits its ability to do more sharing. For the given example the current patch will generate: // First group CheckOpcode G_ADD // First pattern CheckNumOperand 3 ... Build ADDrr // Second pattern CheckNumOperand 3 ... Build ADDri // Second group CheckOpcode G_SUB // Third pattern CheckNumOperand 3 ... Build SUBrr But if we allowed several nesting levels, it could create a sub group for the CheckNumOperand 3. 
(We would need to call optimizeRules on the rules within a group.) * Result * With only one level of nesting, the instruction selection pass is up to 4x faster. For instance, one instruction now takes 500 checks, instead of 24k! With more nesting we could get in the tens I believe. Differential Revision: https://reviews.llvm.org/D39034 rdar://problem/34670699 llvm-svn: 321017	2017-12-18 19:47:41 +00:00
Dimitry Andric	e4f5d01033	Fix more inconsistent line endings. NFC. llvm-svn: 321016	2017-12-18 19:46:56 +00:00
Craig Topper	48176a5fb6	[X86] Minor formatting fix to getHostCPUFeatures. NFC llvm-svn: 321015	2017-12-18 19:40:11 +00:00
Jessica Paquette	02c124d644	[MachineOutliner] Recommit r320229 LR was undefined entering outlined functions that contain calls. This made the machine verifier unhappy when expensive checks were enabled. This fixes that. llvm-svn: 321014	2017-12-18 19:33:21 +00:00
Benjamin Kramer	efc7c88ea8	[PPC] Also disable the pre-emit version of reg+reg to reg+imm transformation. This has the same issue as the early pass disabled in r321010. llvm-svn: 321013	2017-12-18 19:21:56 +00:00
Don Hinton	0fa52c7db1	[cmake] Update experimental target error message Summary: Update this error message to indicate that this test only ensures experimental targets were passed via LLVM_EXPERIMENTAL_TARGETS_TO_BUILD. Originally, this test validated all targets, but in r184923, it was moved after the LLVMBUILDTOOL test, which also validates all targets, making that part of the test redundant. Differential Revision: https://reviews.llvm.org/D41273 llvm-svn: 321012	2017-12-18 19:15:15 +00:00
Paul Robinson	a06f8dcca6	Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header." Adds missing support for DW_FORM_data16. Update of r320852/r320886, fixing the unittest again, this time use a raw char string for the test data. Differential Revision: https://reviews.llvm.org/D41090 llvm-svn: 321011	2017-12-18 19:08:35 +00:00
Benjamin Kramer	f4cc67acb6	[PPC] Disable reg+reg to reg+imm transformation. It creates invalid instructions. PR35688. llvm-svn: 321010	2017-12-18 18:56:57 +00:00
Dimitry Andric	e44dea9f6b	Fix inconsistent line endings in HexagonVectorLoopCarriedReuse.cpp. NFC. llvm-svn: 321009	2017-12-18 18:56:00 +00:00
Krzysztof Parzyszek	eba8c0c61b	[Hexagon] Higher versions of HVX imply presence of lower versions The code in Hexagon_MC::completeHVXFeatures wasn't setting all HVX- related features correctly. llvm-svn: 321008	2017-12-18 18:51:57 +00:00
Ivan A. Kosarev	422a380a3e	[IR] Support the new TBAA metadata format in IR verifier Differential Revision: https://reviews.llvm.org/D40438 llvm-svn: 321007	2017-12-18 18:46:44 +00:00
Dimitry Andric	ca5b0f3f12	Fix inconsistent line endings in ARCDisassembler.cpp. NFC. llvm-svn: 321006	2017-12-18 18:45:37 +00:00
Krzysztof Parzyszek	7259263790	[Hexagon] ANY_EXTEND_VECTOR_INREG should be Custom, not Legal in r321004 llvm-svn: 321005	2017-12-18 18:41:52 +00:00
Krzysztof Parzyszek	6b589e593d	[Hexagon] Generate HVX code for vector sign-, zero- and any-extends Implement any-extend as zero-extend. llvm-svn: 321004	2017-12-18 18:32:27 +00:00
Simon Pilgrim	f947137ed0	[X86] Regenerate test to improve codegen testing for D41350 llvm-svn: 321003	2017-12-18 18:31:02 +00:00
Krzysztof Parzyszek	5439a70d97	[Hexagon] Prefer to widen HVX vectors instead of promoting llvm-svn: 321002	2017-12-18 18:21:01 +00:00
Matt Arsenault	d89d0b6494	Removed unused DominanceFrontier llvm-svn: 321001	2017-12-18 18:01:13 +00:00
Teresa Johnson	9ecaaff251	[ThinLTO] Make distributed indexes test more robust Modify test so that it passes in the reverse-iteration bot. We use DenseMap instead of std::map for the summaries to emit into distributed index files. The iteration order is not defined, but it is deterministic, which is good enough. llvm-svn: 321000	2017-12-18 18:00:32 +00:00
Xinliang David Li	19fb5b467b	[PGO] add MST min edge selection heuristic to ensure non-zero entry count Differential Revision: http://reviews.llvm.org/D41059 llvm-svn: 320998	2017-12-18 17:56:19 +00:00
Francis Visoiu Mistrih	b213b27ee3	[YAML] Add support for non-printable characters LLVM IR function names which disable mangling start with '\01' (https://www.llvm.org/docs/LangRef.html#identifiers). When an identifier like "\01@abc@" gets dumped to MIR, it is quoted, but only with single quotes. http://www.yaml.org/spec/1.2/spec.html#id2770814: "The allowed character range explicitly excludes the C0 control block allowed), the surrogate block #xD800-#xDFFF, #xFFFE, and #xFFFF." http://www.yaml.org/spec/1.2/spec.html#id2776092: "All non-printable characters must be escaped. [...] Note that escape sequences are only interpreted in double-quoted scalars." This patch adds support for printing escaped non-printable characters between double quotes if needed. Should also fix PR31743. Differential Revision: https://reviews.llvm.org/D41290 llvm-svn: 320996	2017-12-18 17:38:03 +00:00
Ivan A. Kosarev	04e1d01736	[IR] Add MDBuilder helpers for the new TBAA metadata format The new helpers are supposed to be used in clang to generate TBAA information in the new format proposed in this thread: http://lists.llvm.org/pipermail/llvm-dev/2017-November/118748.html Differential Revision: https://reviews.llvm.org/D39956 llvm-svn: 320993	2017-12-18 16:49:39 +00:00
Sander de Smalen	09f56a54d0	[AArch64][SVE] Asm: Improve diagnostics further when +sve is not specified Summary: Patch [4/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. This patch further improves diagnostic messages for when the SVE feature is not specified. Reviewers: rengolin, fhahn, olista01, echristo, efriedma Reviewed By: fhahn Subscribers: sdardis, aemerson, javed.absar, tschuett, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D40363 llvm-svn: 320992	2017-12-18 16:48:53 +00:00
Simon Dardis	fd8c65e868	Reland "[mips] Fix the target specific instruction verifier" Fix an off by one error in the bounds checking for 'dinsu' and update the ranges in the test comments so that they are accurate. This version has the correct commit message. Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D41183 llvm-svn: 320991	2017-12-18 15:56:40 +00:00
Sean Fertile	5fb624a3b8	[Memcpy Loop Lowering] Remove the fixed int8 lowering. Switch over to the lowering that uses target supplied operand types. Differential Revision: https://reviews.llvm.org/D41201 llvm-svn: 320989	2017-12-18 15:31:14 +00:00
Sander de Smalen	190979189a	[TableGen][AsmMatcherEmitter] Only choose specific diagnostic for enabled instruction Summary: When emitting a diagnostic for an invalid operand, a specific diagnostic should only be reported when the instruction being matched is actually enabled by the feature flags. Patch [3/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. This patch fixes bogus diagnostic messages for when the SVE feature is not specified. Reviewers: rengolin, craig.topper, olista01, sdardis, stoklund Reviewed By: olista01, sdardis Subscribers: fhahn, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D40362 llvm-svn: 320986	2017-12-18 14:34:24 +00:00
Max Kazantsev	1acab00229	[LVI] Support for ashr in LVI Enhance LVI to analyze the ‘ashr’ binary operation. This leverages the infrastructure in ConstantRange for the ashr operation. Patch by Surya Kumari Jangala! Differential Revision: https://reviews.llvm.org/D40886 llvm-svn: 320983	2017-12-18 14:23:30 +00:00
Diana Picus	8ee540c01a	[ARM GlobalISel] Fix G_(UN)MERGE_VALUES handling after r319524 r319524 has made more G_MERGE_VALUES/G_UNMERGE_VALUES pairs legal than are supported by the rest of the pipeline. Restrict that to only the cases that we can currently handle: packing 32-bit values into 64-bit ones, when we have hardware FP. llvm-svn: 320980	2017-12-18 13:22:28 +00:00
Benjamin Kramer	bc8fdaaf60	Constexprify LaneBitmask factory methods. This avoids global constructors when they're used in a global constant. llvm-svn: 320979	2017-12-18 13:20:26 +00:00
Max Kazantsev	d792171efb	[ConstantRange] Support for ashr in ConstantRange computation Extend the ConstantRange implementation to compute the range of possible values resulting from an arithmetic right shift operation. There will be a follow up patch to leverage this constant range infrastructure in LazyValueInfo. Patch by Surya Kumari Jangala! Differential Revision: https://reviews.llvm.org/D40881 llvm-svn: 320976	2017-12-18 13:01:32 +00:00
Simon Dardis	f70af977af	Revert "[mips] Fix the target specific instruction verifier" This reverts commit r320974. The commit message lacked the Differential Revision: line. llvm-svn: 320975	2017-12-18 12:30:34 +00:00
Simon Dardis	c3c0d4590b	[mips] Fix the target specific instruction verifier Fix an off by one error in the bounds checking for 'dinsu' and update the ranges in the test comments so that they are accurate. Reviewers: atanasyan https://reviews.llvm.org/D41183 llvm-svn: 320974	2017-12-18 12:24:17 +00:00
Sander de Smalen	fce0c1c45b	[AArch64][SVE] Asm: Add ZIP1/ZIP2 instructions (predicate/data vectors) Summary: Patch [2/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. Reviewers: rengolin, kristof.beyls, fhahn, mcrosier, evandro Reviewed By: fhahn Subscribers: aemerson, javed.absar, llvm-commits, tschuett Differential Revision: https://reviews.llvm.org/D40361 llvm-svn: 320973	2017-12-18 11:29:59 +00:00
Sander de Smalen	ce1e0975f4	[AArch64][SVE] Asm: Add SVE predicate register definitions and parsing support Summary: Patch [1/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. Reviewers: rengolin, kristof.beyls, fhahn, mcrosier, evandro, echristo, efriedma Reviewed By: fhahn Subscribers: aemerson, javed.absar, llvm-commits, tschuett Differential Revision: https://reviews.llvm.org/D40360 llvm-svn: 320970	2017-12-18 11:26:34 +00:00
Eugene Leviant	c95b49603e	[ThinLTO] Remove unused code This is a re-commit of r320464, after patch for gold plugin was landed. llvm-svn: 320968	2017-12-18 10:53:45 +00:00
Tim Northover	9097a07e4e	AArch64: work around how Cyclone handles "movi.2d vD, #0". For Cyclone, the instruction "movi.2d vD, #0" is executed incorrectly in some rare circumstances. Work around the issue conservatively by avoiding the instruction entirely. This patch changes CodeGen so that problematic instructions are never generated, and the AsmParser so that an equivalent instruction is used (with a warning). llvm-svn: 320965	2017-12-18 10:36:00 +00:00
Igor Laevsky	7bd3fb15e1	[TargetLibraryInfo] Discard library functions with incorrectly sized integers Differential Revision: https://reviews.llvm.org/D41184 llvm-svn: 320964	2017-12-18 10:31:58 +00:00
Sam Parker	fd967f2f7a	[ARM] Adjust test checks Correct the CHECK-LABELS of a couple of dag combine tests. llvm-svn: 320963	2017-12-18 10:08:03 +00:00
Sam Parker	00804efd72	[DAGCombine] Move AND nodes to multiple load leaves Search from AND nodes to find whether they can be propagated back to loads, so that the AND and load can be combined into a narrow load. We search through OR, XOR and other AND nodes and all bar one of the leaves are required to be loads or constants. The exception node then needs to be masked off, meaning that the 'and' isn't removed, but the load(s) are still narrowed. Differential Revision: https://reviews.llvm.org/D41177 llvm-svn: 320962	2017-12-18 10:04:27 +00:00
Clement Courbet	6f42de3062	[NFC][CodeGen][ExpandMemCmp] Fix documentation. llvm-svn: 320960	2017-12-18 07:32:48 +00:00
Craig Topper	7034d401f8	[X86] Use mattr instead of mcpu in some of the cost model tests. Based on the names of the check lines, features seem more appropriate than cpu. Spotted while prototyping my patch to make 512-bit vectors illegal on SKX sometimes. llvm-svn: 320959	2017-12-18 07:21:58 +00:00
Hiroshi Inoue	c6faf15459	[SROA] Disable non-whole-alloca splits by default This patch introduces a switch to control splitting of non-whole-alloca slices with default off. The switch will be default on again after fixing an issue reported in PR35657. llvm-svn: 320958	2017-12-18 06:47:37 +00:00
Craig Topper	8e2837cc6e	[X86] Fix mistake that I made when splitting up the setOperationAction calls recently. The block into which I moved things that need BWI and 512-bit or VLX is incorrectly qualified with just hasBWI || hasVLX. Here I've qualified it with hasBWI && (hasAVX512 || hasVLX) where the hasAVX512 will be replaced with allowing 512-bit vectors in an upcoming patch. llvm-svn: 320957	2017-12-18 04:50:05 +00:00
Serguei Katkov	b0b67a8d38	[CGP] Fix the handling of select inst in complex addressing mode When we put the value in a select placeholder we must pass the value through the simplification tracker because the value might already be simplified and erased. This is a fix for PR35658. Reviewers: john.brawn, uabelho Reviewed By: john.brawn Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41251 llvm-svn: 320956	2017-12-18 04:25:07 +00:00
Sanjay Patel	9da049fa8a	[x86] add tests for finite libcall lowering (PR35672); NFC llvm-svn: 320955	2017-12-18 00:38:45 +00:00
Bjorn Steinbrink	3603de2fa2	Re-commit "Properly handle multi-element and dynamically sized allocas in getPointerDereferenceableBytes()" llvm-clang-x86_64-expensive-checks-win is still broken, so the failure seems unrelated. llvm-svn: 320953	2017-12-17 21:20:16 +00:00
Craig Topper	255a76d6d1	[X86] Add test cases that show cases where buildvector of extract and inserts should be turned into fmsubadd. This is a follow up to the fmaddsub support added in r320950. Hopefully in the future we can fix lowering to handle this fmsubadd too. llvm-svn: 320951	2017-12-17 18:31:36 +00:00
Craig Topper	fd8d040820	[X86] Make the code that creates fmaddsub from build_vector of extracts and inserts functional and add tests. Summary: We had no tests for this and we couldn't do the optimization because of a bad use count check. We need to know how many non-undef pieces of the build vector were filled in and ensure our use count is equal to that. But on the shuffle combine version we need the use count to be 2. The missing coverage was noticed during the review of D40335. Reviewers: RKSimon, zvi, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41133 llvm-svn: 320950	2017-12-17 18:23:45 +00:00
Simon Pilgrim	406d04a916	[X86] Regenerate truncated rotation tests + add missing 32-bit checks llvm-svn: 320949	2017-12-17 18:20:42 +00:00
Sam Clegg	b07a016ed1	use uint32_t llvm-svn: 320947	2017-12-17 17:50:07 +00:00
Sam Clegg	c551522d25	[WebAssembly] Export some more info on wasm functions Summary: These fields are useful for lld's gc-sections support Also remove an unused field. Subscribers: jfb, dschuff, jgravelle-google, aheejin, sunfish Differential Revision: https://reviews.llvm.org/D41320 llvm-svn: 320946	2017-12-17 17:50:07 +00:00
Bjorn Steinbrink	6f7bbf349f	Revert "Properly handle multi-element and dynamically sized allocas in getPointerDereferenceableBytes()" This reverts commit 217067d5179882de9deb60d2e866befea4c126e7. Fails on llvm-clang-x86_64-expensive-checks-win llvm-svn: 320945	2017-12-17 15:16:58 +00:00
Bjorn Steinbrink	e880f262e5	Revert "Treat sret arguments as being dereferenceable in getPointerDereferenceableBytes()" This reverts commit 8b7a7660a3904b2088bc594311bcea2c651def08. I didn't mean to commit this. llvm-svn: 320944	2017-12-17 15:16:51 +00:00
Bjorn Steinbrink	7afcb71a42	Treat sret arguments as being dereferenceable in getPointerDereferenceableBytes() llvm-svn: 320943	2017-12-17 15:11:52 +00:00
Simon Pilgrim	b1b30286bf	Remove superfluous break after a return. NFCI. llvm-svn: 320941	2017-12-17 11:01:33 +00:00
Craig Topper	5992535e1a	[X86DomainReassignment] Store legal domains in a std::bitset instead of using a SmallVector that really only ever has one element as a set. llvm-svn: 320940	2017-12-17 03:16:23 +00:00
Bjorn Steinbrink	c27f81b92b	Properly handle byval arguments in getPointerDereferenceableBytes() Summary: For byval arguments, the number of dereferenceable bytes is equal to the size of the pointee, not the pointer. Reviewers: hfinkel, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41305 llvm-svn: 320939	2017-12-17 02:37:42 +00:00
Bjorn Steinbrink	5d86532467	Properly handle multi-element and dynamically sized allocas in getPointerDereferenceableBytes() Reviewers: hfinkel, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41288 llvm-svn: 320938	2017-12-17 01:54:25 +00:00
Craig Topper	ee1e71e576	[X86] Use extract_vector_elt instead of X86ISD::VEXTRACT for isel of vXi1 extractions. llvm-svn: 320937	2017-12-17 01:35:48 +00:00
Craig Topper	c0c2d19e08	[X86] Canonicalize extract_vector_elt from vXi1 to always return MVT::i32. This allows us to remove some isel patterns that allowed MVT::i8 result type. llvm-svn: 320936	2017-12-17 01:35:47 +00:00
Craig Topper	c609dc8f55	[X86] Don't create X86ISD::VEXTRACT nodes directly. Use EXTRACT_VECTOR_ELT and allow that to be legalized to VEXTRACT. I think we can remove the VEXTRACT node completely and use a canonicalized EXTRACT_VECTOR_ELT instead. This is a first step. llvm-svn: 320935	2017-12-17 01:35:44 +00:00
Simon Pilgrim	5c0c93ed4c	Fix unused variable warning. llvm-svn: 320934	2017-12-16 23:37:51 +00:00
Simon Pilgrim	4c9e8215e9	[X86][AVX] lowerVectorShuffleAsBroadcast - aggressively peek through BITCASTs Assuming we can safely adjust the broadcast index for the new type to keep it suitably aligned, then peek through BITCASTs when looking for the broadcast source. Fixes PR32007 llvm-svn: 320933	2017-12-16 23:32:18 +00:00
Simon Pilgrim	88c10bc969	[X86][AVX] Use extract128BitVector helper. NFCI. llvm-svn: 320932	2017-12-16 23:09:57 +00:00
Simon Pilgrim	f3b6da00f5	[X86][AVX] Fix failed broadcast fold Strip excess BITCASTs from EXTRACT_SUBVECTOR input llvm-svn: 320930	2017-12-16 22:57:17 +00:00
Sean Fertile	68d7f9da76	[Memcpy Loop Lowering] Only calculate residual size/bytes copied when needed. If the loop operand type is int8 then there will be no residual loop for the unknown size expansion. Don't create the residual-size and bytes-copied values when they are not needed. llvm-svn: 320929	2017-12-16 22:41:39 +00:00
Craig Topper	849b717c86	[X86] Don't pass a zero input to the passthru operand of getVectorMaskingNode/getScalarMaskingNode when it's going to emit an ISD::OR/ISD::AND. NFCI In those cases, the pass thru operand of the methods isn't used. The calls to the scalar version were passing a MVT::i1 zero, which is an illegal type at the stage this code runs. llvm-svn: 320928	2017-12-16 21:12:24 +00:00
Craig Topper	93253e189c	[X86] Have getVectorMaskingNode return an ISD::AND for X86ISD::VPSHUFBITQMB instead of creating a select with one input being 0. llvm-svn: 320927	2017-12-16 21:12:23 +00:00
Craig Topper	1260a4e826	[X86] When using vpopcntdq for ctpop of v8i16 vectors, only promote to v8i32. Previously we promoted to v8i64, but we don't need to go all the way to 512-bits. If we have VLX we can use the 256-bit instruction. And even if we don't have VLX we can widen v8i32 to v16i32 and drop the upper half. llvm-svn: 320926	2017-12-16 19:31:36 +00:00
Craig Topper	a42a2ba221	[X86] Combine some more scheduler model entries using regular expressions. We had a lot of separate 32 and 64 instructions that had the same scheduling data. This merges them into the same regular expression. This is pretty consistent with a lot of other instructions. llvm-svn: 320924	2017-12-16 18:35:31 +00:00
Craig Topper	17a311831c	[X86] Use instrs instead of instregex for gather/scatter instructions in the scheduler models. Combine into single InstrRW entries. This reduces the number of scheduler groups in subtarget info. llvm-svn: 320923	2017-12-16 18:35:29 +00:00
Simon Pilgrim	5f022d278b	[InstCombine] Regenerate FMUL/FMA combine tests with update_test_checks.py llvm-svn: 320922	2017-12-16 17:18:15 +00:00
Sanjay Patel	5a0cdac174	[InstCombine] canonicalize shifty abs(): ashr+add+xor --> cmp+neg+sel We want to do this for 2 reasons: 1. Value tracking does not recognize the ashr variant, so it would fail to match for cases like D39766. 2. DAGCombiner does better at producing optimal codegen when we have the cmp+sel pattern. More detail about what happens in the backend: 1. DAGCombiner has a generic transform for all targets to convert the scalar cmp+sel variant of abs into the shift variant. That is the opposite of this IR canonicalization. 2. DAGCombiner has a generic transform for all targets to convert the vector cmp+sel variant of abs into either an ABS node or the shift variant. That is again the opposite of this IR canonicalization. 3. DAGCombiner has a generic transform for all targets to convert the exact shift variants produced by #1 or #2 into an ISD::ABS node. Note: It would be an efficiency improvement if we had #1 go directly to an ABS node when that's legal/custom. 4. The pattern matching above is incomplete, so it is possible to escape the intended/optimal codegen in a variety of ways. a. For #2, the vector path is missing the case for setlt with a '1' constant. b. For #3, we are missing a match for commuted versions of the shift variants. 5. Therefore, this IR canonicalization can only help get us to the optimal codegen. The version of cmp+sel produced by this patch will be recognized in the DAG and converted to an ABS node when possible or the shift sequence when not. 6. In the following examples with this patch applied, we may get conditional moves rather than the shift produced by the generic DAGCombiner transforms. The conditional move is created using a target-specific decision for any given target. Whether it is optimal or not for a particular subtarget may be up for debate. 
define i32 @abs_shifty(i32 %x) { %signbit = ashr i32 %x, 31 %add = add i32 %signbit, %x %abs = xor i32 %signbit, %add ret i32 %abs } define i32 @abs_cmpsubsel(i32 %x) { %cmp = icmp slt i32 %x, zeroinitializer %sub = sub i32 zeroinitializer, %x %abs = select i1 %cmp, i32 %sub, i32 %x ret i32 %abs } define <4 x i32> @abs_shifty_vec(<4 x i32> %x) { %signbit = ashr <4 x i32> %x, <i32 31, i32 31, i32 31, i32 31> %add = add <4 x i32> %signbit, %x %abs = xor <4 x i32> %signbit, %add ret <4 x i32> %abs } define <4 x i32> @abs_cmpsubsel_vec(<4 x i32> %x) { %cmp = icmp slt <4 x i32> %x, zeroinitializer %sub = sub <4 x i32> zeroinitializer, %x %abs = select <4 x i1> %cmp, <4 x i32> %sub, <4 x i32> %x ret <4 x i32> %abs } > $ ./opt -instcombine shiftyabs.ll -S | ./llc -o - -mtriple=x86_64 -mattr=avx > abs_shifty: > movl %edi, %eax > negl %eax > cmovll %edi, %eax > retq > > abs_cmpsubsel: > movl %edi, %eax > negl %eax > cmovll %edi, %eax > retq > > abs_shifty_vec: > vpabsd %xmm0, %xmm0 > retq > > abs_cmpsubsel_vec: > vpabsd %xmm0, %xmm0 > retq > > $ ./opt -instcombine shiftyabs.ll -S | ./llc -o - -mtriple=aarch64 > abs_shifty: > cmp w0, #0 // =0 > cneg w0, w0, mi > ret > > abs_cmpsubsel: > cmp w0, #0 // =0 > cneg w0, w0, mi > ret > > abs_shifty_vec: > abs v0.4s, v0.4s > ret > > abs_cmpsubsel_vec: > abs v0.4s, v0.4s > ret > > $ ./opt -instcombine shiftyabs.ll -S | ./llc -o - -mtriple=powerpc64le > abs_shifty: > srawi 4, 3, 31 > add 3, 3, 4 > xor 3, 3, 4 > blr > > abs_cmpsubsel: > srawi 4, 3, 31 > add 3, 3, 4 > xor 3, 3, 4 > blr > > abs_shifty_vec: > vspltisw 3, -16 > vspltisw 4, 15 > vsubuwm 3, 4, 3 > vsraw 3, 2, 3 > vadduwm 2, 2, 3 > xxlxor 34, 34, 35 > blr > > abs_cmpsubsel_vec: > vspltisw 3, -16 > vspltisw 4, 15 > vsubuwm 3, 4, 3 > vsraw 3, 2, 3 > vadduwm 2, 2, 3 > xxlxor 34, 34, 35 > blr > Differential Revision: https://reviews.llvm.org/D40984 llvm-svn: 320921	2017-12-16 16:41:17 +00:00
Craig Topper	d2a2a39c93	[X86] Remove GCCBuiltin from kand/kandn/kor/kxor/kxnor/knot intrinsics so clang can implement with native IR. llvm-svn: 320918	2017-12-16 08:25:30 +00:00
Craig Topper	1c7d07c601	[X86] Remove unneeded code for handling the old kunpck intrinsics. llvm-svn: 320917	2017-12-16 06:58:30 +00:00
Hal Finkel	92ea8acbcd	Move Transforms/LoopVectorize/consecutive-ptr-cg-bug.ll into the X86 subdirectory This test depends on X86's TTI; move into the X86 subdirectory. llvm-svn: 320914	2017-12-16 05:10:20 +00:00
Hal Finkel	5444f40965	[LV] Extend InstWidening with CM_Widen_Reverse Changes to the original scalar loop during LV code gen cause the return value of Legal->isConsecutivePtr() to be inconsistent with the return value during legal/cost phases (further analysis and information of the bug is in D39346). This patch is an alternative fix to PR34965 following the CM_Widen approach proposed by Ayal and Gil in D39346. It extends InstWidening enum with CM_Widen_Reverse to properly record the widening decision for consecutive reverse memory accesses and, consequently, get rid of the Legal->isConsecutivePtr() call in LV code gen. I think this is a simpler/cleaner solution to PR34965 than the one in D39346. Fixes PR34965. Patch by Diego Caballero, thanks! Differential Revision: https://reviews.llvm.org/D40742 llvm-svn: 320913	2017-12-16 02:55:24 +00:00
Galina Kistanova	5f8c84c5be	Fixed warning 'function declaration isn’t a prototype [-Werror=strict-prototypes]' llvm-svn: 320912	2017-12-16 02:54:17 +00:00
Hal Finkel	e86a8b79b5	[PowerPC, AsmParser] Enable the mnemonic spell corrector r307148 added assembly mnemonic spelling correction support and enabled it on ARM. This enables that support on PowerPC as well. Patch by Dmitry Venikov, thanks! Differential Revision: https://reviews.llvm.org/D40552 llvm-svn: 320911	2017-12-16 02:42:18 +00:00
Craig Topper	c08960597c	[X86] Add 128 and 256-bit VPOPCNTDQ instructions. Adjust some tablegen classes LZCNT/POPCNT. I think when this instruction was first published it was only for a Knights CPU and thus VLX version was missing. llvm-svn: 320910	2017-12-16 02:40:28 +00:00
Vitaly Buka	12f9b8cf24	[LTO] Update tests for r320905 llvm-svn: 320909	2017-12-16 02:40:20 +00:00
Vitaly Buka	fd563a0352	Remove trailing whitespace llvm-svn: 320907	2017-12-16 02:12:35 +00:00
Sam Clegg	731a76646f	[WebAssembly] Return ArrayRef's rather than const std::vector& From working on lld I've learned this is generally the preferred way for several reasons (e.g. more concise, improves encapsulation). Differential Revision: https://reviews.llvm.org/D41265 llvm-svn: 320906	2017-12-16 02:10:16 +00:00
Vitaly Buka	a5376f393e	[LTO] Make processing of combined module more consistent Summary: 1. Use stream 0 only for combined module. Previously, if the combined module was not processed, ThinLTO used the stream for its own output. However, small changes in input could trigger the combined module and shuffle outputs, making life of llvm::LTO harder. 2. Always process combined module and write output to stream 0. Processing empty combined module is cheap and allows llvm::LTO users to avoid implementing processing which is already done in llvm::LTO. Subscribers: mehdi_amini, inglorion, eraman, hiraditya Differential Revision: https://reviews.llvm.org/D41267 llvm-svn: 320905	2017-12-16 02:10:00 +00:00
Teresa Johnson	160f4bb803	Add another missing -enable-import-metadata to test r320895 modified a test so that it needs -enable-import-metadata which is false by default for NDEBUG, found another place that needs this added. llvm-svn: 320903	2017-12-16 01:35:36 +00:00
Hal Finkel	2ff24731bb	[SimplifyLibCalls] Inline calls to cabs when it's safe to do so When unsafe algebra is allowed, calls to cabs(r) can be replaced by: sqrt(creal(r)*creal(r) + cimag(r)*cimag(r)) Patch by Paul Walker, thanks! Differential Revision: https://reviews.llvm.org/D40069 llvm-svn: 320901	2017-12-16 01:26:25 +00:00
Hal Finkel	7333aa9f16	[LV] NFC patch for moving VPRecipe class definitions from LoopVectorize.cpp to VPlan.h This is a small step forward to move VPlan stuff to where it should belong (i.e., VPlan.): 1. VPRecipe classes in LoopVectorize.cpp are moved to VPlan.h. 2. Many of VPRecipe::print() and execute() definitions are still left in LoopVectorize.cpp since they refer to things declared in LoopVectorize.cpp. To be moved to VPlan.cpp at a later time. 3. InterleaveGroup class is moved from anonymous namespace to llvm namespace. Referencing it in anonymous namespace from VPlan.h ended up in warning. Patch by Hideki Saito, thanks! Differential Revision: https://reviews.llvm.org/D41045 llvm-svn: 320900	2017-12-16 01:12:50 +00:00
Teresa Johnson	4358a40345	Add -enable-import-metadata to test r320895 modified a test so that it needs -enable-import-metadata which is false by default for NDEBUG. llvm-svn: 320899	2017-12-16 01:00:48 +00:00
Craig Topper	6b129fde5a	[X86] Add back the assert from r320830 that was reverted in r320850 Hopefully r320864 has fixed the offending case that failed the assert. llvm-svn: 320898	2017-12-16 00:33:16 +00:00
Teresa Johnson	69b2de8466	Fix NDEBUG build problem in r320895 Fix incorrect placement of #endif causing NDEBUG build failures. llvm-svn: 320897	2017-12-16 00:29:31 +00:00
Teresa Johnson	81bbf74265	[ThinLTO] Enable importing of aliases as copy of aliasee Summary: This implements a missing feature to allow importing of aliases, which was previously disabled because alias cannot be available_externally. We instead import an alias as a copy of its aliasee. Some additional work was required in the IndexBitcodeWriter for the distributed build case, to ensure that the aliasee has a value id in the distributed index file (i.e. even when it is not being imported directly). This is a performance win in codes that have many aliases, e.g. C++ applications that have many constructor and destructor aliases. Reviewers: pcc Subscribers: mehdi_amini, inglorion, eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D40747 llvm-svn: 320895	2017-12-16 00:18:12 +00:00
David Blaikie	2110924909	Fix WebAssembly backend for some LLVM API changes llvm-svn: 320893	2017-12-15 23:52:06 +00:00
Quentin Colombet	893e0f15e2	[TableGen][GlobalISel] Make the different Matcher comparable This opens refactoring opportunities in the match table now that we can check that two predicates are the same. NFC. llvm-svn: 320890	2017-12-15 23:24:39 +00:00
Quentin Colombet	a646ef08e8	[TableGen][GlobalISel] Fix unused variable warning in release mode Introduced in r320887. NFC. llvm-svn: 320889	2017-12-15 23:24:36 +00:00
Paul Robinson	6d0484f2b6	Revert "Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header."" This reverts commit 0afef672f63f0e4e91938656bc73424a8c058bfc. Still failing at runtime on bots. llvm-svn: 320888	2017-12-15 23:21:52 +00:00
Quentin Colombet	aad20be6ca	[TableGen][GlobalISel] Have the predicate directly know which data they are dealing with Prior to this patch, a predicate wouldn't make sense outside of its rule. Indeed, it was only during emitting a rule that a predicate would be made aware of the IDs of the data it is checking. Because of that, predicates could not be moved around or compared between each other. NFC. llvm-svn: 320887	2017-12-15 23:07:42 +00:00
Paul Robinson	5c8f7d7de4	Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header." Adds missing support for DW_FORM_data16. Update of r320852, fixing the unittest to use a hand-coded struct instead of std::array to guarantee data layout. Differential Revision: https://reviews.llvm.org/D41090 llvm-svn: 320886	2017-12-15 22:57:17 +00:00
Matthias Braun	042fed54fb	Fix unused variable in non-assert builds llvm-svn: 320885	2017-12-15 22:53:33 +00:00
Matthias Braun	f1caa2833f	MachineFunction: Return reference from getFunction(); NFC The Function can never be nullptr so we can return a reference. llvm-svn: 320884	2017-12-15 22:22:58 +00:00
Matthias Braun	4684033a2f	MachineFunction: Slight refactoring; NFC Slight cleanup/refactor in preparation for upcoming commit. llvm-svn: 320882	2017-12-15 22:22:46 +00:00
Matthias Braun	89488fffdd	MachineModuleInfo: Remove unused function; NFC Remove the unused setModule() function; it would be dangerous if someone actually used it as it wouldn't reset/recompute various other module related data. llvm-svn: 320881	2017-12-15 22:22:42 +00:00
Galina Kistanova	6532b3b9d2	Fixed the gcc 'enumeral and non-enumeral type in conditional expression [-Werror=extra]' warning introduced by r320750 llvm-svn: 320868	2017-12-15 22:15:29 +00:00

158393 Commits