llvm-project

Commit Graph

Author	SHA1	Message	Date
Petr Hosek	c348f9fcd5	Revert "Reland "[LLVM][llvm-objcopy] Added basic plumbing to get things started"" This reverts commit 2b52298eb28ba4d3eca113353a348c02a6ef1f93. llvm-svn: 308822	2017-07-22 02:43:50 +00:00
Petr Hosek	badc76623c	Reland "[LLVM][llvm-objcopy] Added basic plumbing to get things started" As discussed on llvm-dev I've implemented the first basic steps towards llvm-objcopy/llvm-objtool (name pending). This change adds the ability to copy (without modification) 64-bit little endian ELF executables that have SHT_PROGBITS, SHT_NOBITS, SHT_NULL and SHT_STRTAB sections. Patch by Jake Ehrlich Differential Revision: https://reviews.llvm.org/D33964 llvm-svn: 308821	2017-07-22 02:33:45 +00:00
Craig Topper	abfe380f9a	[X86] Add nopq instruction which is a rex encoded version of nopl for gas compatibility. llvm-svn: 308818	2017-07-22 01:30:53 +00:00
Craig Topper	e88aef4b5f	[X86] Add register form of NOPL and NOPW for assembler/disassembler. Fixes PR32805. llvm-svn: 308817	2017-07-22 01:30:51 +00:00
David Blaikie	2bee7c68af	Commit missing/empty test file from r308789 llvm-svn: 308814	2017-07-22 00:24:20 +00:00
Matt Arsenault	c5d1e503e1	RA: Remove another assert on empty intervals This case is similar to the one fixed in r308808, except when rematerializing. Fixes bug 33884. llvm-svn: 308813	2017-07-22 00:24:01 +00:00
Matt Arsenault	6a963f76ca	RA: Remove assert on empty live intervals This is possible if there is an undef use when splitting the vreg during spilling. Fixes bug 33620. llvm-svn: 308808	2017-07-21 23:56:13 +00:00
Petr Hosek	b13608f5e8	Revert "[LLVM][llvm-objcopy] Added basic plumbing to get things started" This reverts commit 2f423248e140b94b8377660d4d2fe9364f30febe. llvm-svn: 308806	2017-07-21 23:39:39 +00:00
Petr Hosek	0b746d3417	Reland "[LLVM][llvm-objcopy] Added basic plumbing to get things started" As discussed on llvm-dev I've implemented the first basic steps towards llvm-objcopy/llvm-objtool (name pending). This change adds the ability to copy (without modification) 64-bit little endian ELF executables that have SHT_PROGBITS, SHT_NOBITS, SHT_NULL and SHT_STRTAB sections. Patch by Jake Ehrlich Differential Revision: https://reviews.llvm.org/D33964 llvm-svn: 308803	2017-07-21 23:27:40 +00:00
Erich Keane	d8f61f8f7e	Remove Bitrig: LLVM Changes Bitrig code has been merged back to OpenBSD, thus the OS has been abandoned. Differential Revision: https://reviews.llvm.org/D35707 llvm-svn: 308799	2017-07-21 22:48:47 +00:00
David Blaikie	b8cc0544d2	[ProfData] Detect if zlib is available As discussed on [1], if the profile is compressed and llvm-profdata is not built with zlib support, the error message is not informative. Give a better error message if zlib is not available. [1] http://lists.llvm.org/pipermail/llvm-dev/2017-July/115571.html Reviewers: davidxl, dblaikie Differential Revision: https://reviews.llvm.org/D35586 llvm-svn: 308789	2017-07-21 21:41:15 +00:00
Farhana Aleen	e4a89a6462	X86InterleaveAccess: A fix for bug33826 Reviewers: DavidKreitzer Differential Revision: https://reviews.llvm.org/D35638 llvm-svn: 308784	2017-07-21 21:35:00 +00:00
Konstantin Zhuravlyov	e9a5a77ee3	AMDGPU: Implement memory model llvm-svn: 308781	2017-07-21 21:19:23 +00:00
Krzysztof Parzyszek	3ad0d01e9e	[Hexagon] Add inline-asm constraint 'a' for modifier register class For example asm ("memw(%0++%1) = %2" : : "r"(addr),"a"(mod),"r"(val) : "memory") llvm-svn: 308761	2017-07-21 17:51:27 +00:00
Haojie Wang	1dec57d5b0	ThinLTO Minimized Bitcode File Size Reduction Summary: Currently the ThinLTO minimized bitcode file only strip the debug info, but there is still a lot of information in the minimized bit code file that will be not used for thin linker. In this patch, most of the extra information is striped to reduce the minimized bitcode file. Now only ModuleVersion, ModuleInfo, ModuleGlobalValueSummary, ModuleHash, Symtab and Strtab are left. Now the minimized bitcode file size is reduced to 15%-30% of the debug info stripped bitcode file size. Reviewers: danielcdh, tejohnson, pcc Reviewed By: pcc Subscribers: mehdi_amini, aprantl, inglorion, eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D35334 llvm-svn: 308760	2017-07-21 17:25:20 +00:00
Simon Dardis	0310eb7a67	[mips] Support -membedded-data and fix a related bug -membedded-data changes the location of constant data from the .sdata to the .rodata section. Previously it was (incorrectly) always located in the .rodata section. Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D35686 llvm-svn: 308758	2017-07-21 17:19:00 +00:00
Matt Arsenault	ca7b0a1777	AMDGPU: Add instruction definitions for some scratch_* instructions Omit atomics for now since they probably aren't useful. llvm-svn: 308747	2017-07-21 15:36:16 +00:00
Dmitry Preobrazhensky	abf2839478	[AMDGPU][MC][GFX9] Added support of VOP3 'op_sel' modifier See bug 33591: https://bugs.llvm.org//show_bug.cgi?id=33591 Reviewers: vpykhtin, artem.tamazov, SamWot, arsenm Differential Revision: https://reviews.llvm.org/D35424 llvm-svn: 308740	2017-07-21 13:54:11 +00:00
Jonas Paulsson	be7a7e4979	[SystemZ] test update test/CodeGen/SystemZ/loop-01.ll was incorrectly updated by r308729. llvm-svn: 308736	2017-07-21 13:14:17 +00:00
Jonas Paulsson	024e319489	[SystemZ, LoopStrengthReduce] This patch makes LSR generate better code for SystemZ in the cases of memory intrinsics, Load->Store pairs or comparison of immediate with memory. In order to achieve this, the following common code changes were made: * New TTI hook: LSRWithInstrQueries(), which defaults to false. Controls if LSR should do instruction-based addressing evaluations by calling isLegalAddressingMode() with the Instruction pointers. * In LoopStrengthReduce: handle address operands of memset, memmove and memcpy as address uses, and call isFoldableMemAccessOffset() for any LSRUse::Address, not just loads or stores. SystemZ changes: * isLSRCostLess() implemented with Insns first, and without ImmCost. * New function supportedAddressingMode() that is a helper for TTI methods looking at Instructions passed via pointers. Review: Ulrich Weigand, Quentin Colombet https://reviews.llvm.org/D35262 https://reviews.llvm.org/D35049 llvm-svn: 308729	2017-07-21 11:59:37 +00:00
Simon Pilgrim	84cbd8e750	[X86][SSE] Add extra (sra (sra x, c1), c2) -> (sra x, (add c1, c2)) test case We should be able to handle the case where some c1+c2 elements exceed max shift and some don't by performing a clamp after the sum llvm-svn: 308724	2017-07-21 10:22:49 +00:00
Simon Pilgrim	32c377a1cf	[X86][SSE] Add pre-AVX2 support for (i32 bitcast(v32i1)) -> 2xMOVMSK Currently we only support (i32 bitcast(v32i1)) using the AVX2 VPMOVMSKB ymm instruction. This patch adds support for splitting pre-AVX2 targets into 2 x (V)PMOVMSKB xmm instructions and merging the integer results. In future we could probably generalize this to handle more cases. Differential Revision: https://reviews.llvm.org/D35303 llvm-svn: 308723	2017-07-21 09:58:50 +00:00
Adrian Prantl	98dc599790	dsymutil: strip unused types from imported DW_TAG_modules This patch teaches dsymutil to strip types from the imported DW_TAG_module inside of an object file (not inside the PCM) if they can be resolved to the full definition inside the PCM. This reduces the size of the .dSYM from WebCore from webkit.org by almost 2/3. <rdar://problem/33047213> llvm-svn: 308710	2017-07-21 02:07:33 +00:00
Adrian Prantl	65e7ca995d	Debug Info: Don't strip clang module skeleton CUs. This corrects a (hopefully :-) accidental side-effect of r304020. rdar://problem/33442618 llvm-svn: 308708	2017-07-21 01:24:05 +00:00
Spyridoula Gravani	c6ef9873ac	[DWARF] Generalized verification of .debug_abbrev to be applicable to both .debug_abbrev and .debug_abbrev.dwo sections. Differential Revision: https://reviews.llvm.org/D35698 llvm-svn: 308703	2017-07-21 00:51:32 +00:00
Craig Topper	31140ade70	[AVX-512] Fix a bug that prevented some non-temporal loads from using the movntdqa instruction. The bitconverts here had an input type of 128-bits and an output type of 256 bits. The input type should also have been 256 bits. llvm-svn: 308702	2017-07-21 00:40:42 +00:00
Kevin Enderby	3e95bd2239	Add error handling to the dyld compact export entries in libObject. lld needs a matching change for this will be my next commit. Expect it to fail build until that matching commit is picked up by the bots. Like the changes in r296527 for dyld bind entires and the changes in r298883 for lazy bind, weak bind and rebase entries the export entries are the last of the dyld compact info to have error handling added. This follows the model of iterators that can fail that Lang Hanes designed when fixing the problem for bad archives r275316 (or r275361). So that iterating through the exports now terminates if there is an error and returns an llvm::Error with an error message in all cases for malformed input. This change provides the plumbing for the error handling, all the needed testing of error conditions and test cases for all of the unique error messages. llvm-svn: 308690	2017-07-20 23:08:41 +00:00
Tim Northover	7b6d66c0c9	Recommit: GlobalISel: select G_EXTRACT and G_INSERT instructions on AArch64. It revealed a bug in the Localizer pass which has now been fixed. This includes the fix for SUBREG_TO_REG committed separately last time. llvm-svn: 308688	2017-07-20 22:58:38 +00:00
Tim Northover	071d77a51f	GlobalISel: stop localizer putting constants before EH_LABELs If the localizer pass puts one of its constants before the label that tells the unwinder "jump here to handle your exception" then control-flow will skip it, leaving uninitialized registers at runtime. That's bad. llvm-svn: 308687	2017-07-20 22:58:26 +00:00
Rui Ueyama	1b5353b662	Make a file end with n and remove trailing whitespace. llvm-svn: 308685	2017-07-20 22:39:20 +00:00
Eric Beckmann	c35c34a7f9	Make tests always depend on llvm-mt, because some tests are run even w/o xml2. llvm-svn: 308682	2017-07-20 22:13:16 +00:00
Eric Beckmann	7d50c389c4	Implement parsing and writing of a single xml manifest file. Summary: Implement parsing and writing of a single xml manifest file. Subscribers: mgorny, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D35425 llvm-svn: 308679	2017-07-20 21:42:04 +00:00
Artem Belevich	d7a73824e4	[NVPTX] Add lowering of i128 params. The patch adds support of i128 params lowering. The changes are quite trivial to support i128 as a "special case" of integer type. With this patch, we lower i128 params the same way as aggregates of size 16 bytes: .param .b8 _ [16]. Currently, NVPTX can't deal with the 128 bit integers: * in some cases because of failed assertions like ValVTs.size() == OutVals.size() && "Bad return value decomposition" * in other cases emitting PTX with .i128 or .u128 types (which are not valid [1]) [1] http://docs.nvidia.com/cuda/parallel-thread-execution/index.html#fundamental-types Differential Revision: https://reviews.llvm.org/D34555 Patch by: Denys Zariaiev (denys.zariaiev@gmail.com) llvm-svn: 308675	2017-07-20 21:16:03 +00:00
Matt Arsenault	db78273b6e	Add an ID field to StackObjects On AMDGPU SGPR spills are really spilled to another register. The spiller creates the spills to new frame index objects, which is used as a placeholder. This will eventually be replaced with a reference to a position in a VGPR to write to and the frame index deleted. It is most likely not a real stack location that can be shared with another stack object. This is a problem when StackSlotColoring decides it should combine a frame index used for a normal VGPR spill with a real stack location and a frame index used for an SGPR. Add an ID field so that StackSlotColoring has a way of knowing the different frame index types are incompatible. llvm-svn: 308673	2017-07-20 21:03:45 +00:00
Zvi Rackover	eac8e7c08a	[X86] Adding ISel tests for strided-shuffles with non-zero offset. NFC. llvm-svn: 308672	2017-07-20 21:03:36 +00:00
Davide Italiano	0c8d26c312	[PGO] Move the PGOInstrumentation pass to new OptRemark API. This fixes PR33791. llvm-svn: 308668	2017-07-20 20:43:05 +00:00
Mandeep Singh Grang	d41ac895bb	[COFF, ARM64, CodeView] Add support to emit CodeView debug info for ARM64 COFF Reviewers: compnerd, ruiu, rnk, zturner Reviewed By: rnk Subscribers: majnemer, aemerson, aprantl, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D35518 llvm-svn: 308665	2017-07-20 20:20:00 +00:00
James Y Knight	bb76d48d59	[SPARC] Clean up the support for disabling fsmuld and fmuls instructions. Summary: Also enable no-fsmuld for sparcv7 (which doesn't have the instruction). The previous code which used a post-processing pass to do this was unnecessary; disabling the instruction is entirely sufficient. Reviewers: jacob_hansen, ekedaigle Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35576 llvm-svn: 308661	2017-07-20 20:09:11 +00:00
Craig Topper	27c12e088e	[X86] Allow masks with more than 6 bits set on the x << (y & mask) optimization for the 64-bit memory shifts. llvm-svn: 308657	2017-07-20 19:29:58 +00:00
Craig Topper	02959b3d05	[X86] Add test case to demonstrate that we don't allow masks wider than 6 bits in the (shift x, (and y, mask)) patterns for the 64-bit memory form. We allow wider than 5 bits in the 16 and 32 bit store forms. And we allow wider than 6 bits on the 64-bit regsiter form.:w I'm assuming this was a mistake made back in r148024. llvm-svn: 308656	2017-07-20 19:29:56 +00:00
Peter Collingbourne	6f6788b99c	LowerTypeTests: Drop function type metadata only if we're going to replace it. Previously we were (mis)handling jump table members with a prevailing definition in a full LTO module and a non-prevailing definition in a ThinLTO module by dropping type metadata on those functions entirely, which would cause type tests involving such functions to fail. This patch causes us to drop metadata only if we are about to replace it with metadata from cfi.functions. We also want to replace metadata for available_externally functions, which can arise in the opposite scenario (prevailing ThinLTO definition, non-prevailing full LTO definition). The simplest way to handle that is to remove the definition; there's little value in keeping it around at this point (i.e. after most optimization passes have already run) and later code will try to use the function's linkage to create an alias, which would result in invalid IR if the function is available_externally. Fixes PR33832. Differential Revision: https://reviews.llvm.org/D35604 llvm-svn: 308642	2017-07-20 18:02:05 +00:00
Matt Arsenault	c37fe66ec5	AMDGPU: Add encoding for carryless add/sub instructions llvm-svn: 308639	2017-07-20 17:42:47 +00:00
Matt Arsenault	f65c5ac9c9	AMDGPU: Add encodings for global atomics llvm-svn: 308638	2017-07-20 17:31:56 +00:00
David Blaikie	79c316a192	Remove unnecessary prefix from comment lines in a .test file. llvm-svn: 308636	2017-07-20 17:20:36 +00:00
Simon Pilgrim	ba7b22cdb9	revert: [llvm] r308609 - This patch added some test cases to demonsrate the issues described in Bug 33848 - X86 Asm does not support symbolic names inside address calculation. llvm-svn: 308622	2017-07-20 15:17:15 +00:00
Nirav Dave	df86d2d008	[DAG] Handle missing transform in fold of value extension case. Summary: When pushing an extension of a constant bitwise operator on a load into the load, change other uses of the load value if they exist to prevent the old load from persisting. Reviewers: spatel, RKSimon, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35030 llvm-svn: 308618	2017-07-20 13:57:32 +00:00
Nirav Dave	77cc6f23b9	[DAG] Optimize away degenerate INSERT_VECTOR_ELT nodes. Summary: Add missing vector write of vector read reduction, i.e.: (insert_vector_elt x (extract_vector_elt x idx) idx) to x Reviewers: spatel, RKSimon, efriedma Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35563 llvm-svn: 308617	2017-07-20 13:48:17 +00:00
Stefan Maksimovic	be0bc71e02	Reland r308585 Builder clang-x86_64-linux-abi-test apparently failed due to a spurious error unrelated to the changes r308585 introduced. llvm-svn: 308612	2017-07-20 13:08:18 +00:00
Simon Pilgrim	b6485252aa	[X86][AVX512] Improve vector rotation constant folding tests Test constant folding both on node creation (which already works) and once the input nodes have been folded themselves (not working yet). llvm-svn: 308611	2017-07-20 13:07:37 +00:00
Andrew V. Tischenko	6431962792	This patch added some test cases to demonsrate the issues described in Bug 33848 - X86 Asm does not support symbolic names inside address calculation. llvm-svn: 308609	2017-07-20 12:46:02 +00:00
Simon Atanasyan	fb953926b1	[mips] Support `long_call/far/near` attributes passed by front-end This patch adds handling of the `long_call`, `far`, and `near` attributes passed by front-end. The patch depends on D35479. Differential revision: https://reviews.llvm.org/D35480. llvm-svn: 308606	2017-07-20 12:19:26 +00:00
Diana Picus	7534b28291	Revert "GlobalISel: select G_EXTRACT and G_INSERT instructions on AArch64." This reverts commit 36c6a2ea9669bc3bb695928529a85d12d1d3e3f9 because it broke the test-suite on the GlobalISel bot. llvm-svn: 308603	2017-07-20 11:36:03 +00:00
Simon Pilgrim	2911296f10	[DAGCombiner] Match ISD::SRL non-uniform constant vectors patterns using predicates. Use predicate matchers introduced in D35492 to match more ISD::SRL constant folds llvm-svn: 308602	2017-07-20 11:03:30 +00:00
Simon Pilgrim	7ff0e49d8c	[DAGCombiner] Match ISD::SRA non-uniform constant vectors patterns using predicates. Use predicate matchers introduced in D35492 to match more ISD::SRA constant folds llvm-svn: 308600	2017-07-20 10:43:05 +00:00
Simon Pilgrim	9d7863b935	[DAGCombiner] Match non-uniform constant vectors using predicates. Most combines currently recognise scalar and splat-vector constants, but not non-uniform vector constants. This patch introduces a matching mechanism that uses predicates to check against BUILD_VECTOR of ConstantSDNode, as well as scalar ConstantSDNode cases. I've changed a couple of predicates to demonstrate - the combine-shl changes add currently unsupported cases, while the MatchRotate replaces an existing mechanism. Differential Revision: https://reviews.llvm.org/D35492 llvm-svn: 308598	2017-07-20 10:13:40 +00:00
Stefan Maksimovic	3793a82b28	Revert r308585 Builder clang-x86_64-linux-abi-test seems to fail after this change llvm-svn: 308597	2017-07-20 09:57:14 +00:00
Daniel Sanders	7aac7cc57f	[globalisel][tablegen] Add control-flow to the MatchTable. Summary: This will allow us to merge the various sub-tables into a single table. This is a compile-time saving at this point. However, this will also enable the optimization of a table so that similar instructions can be tested together, reducing the time spent on the matching the code. The bulk of this patch is a mechanical conversion to the new MatchTable object which is responsible for tracking label definitions and filling in the index of the jump targets. It is also responsible for nicely formatting the table. This was necessary to support the new GIM_Try opcode which takes the index to jump to if the match should fail. This value is unknown during table construction and is filled in during emission. To support nesting try-blocks (although we currently don't emit tables with nested try-blocks), GIM_Reject has been re-introduced to explicitly exit a try-block or fail the overall match if there are no active try-blocks. Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar Reviewed By: rovka Subscribers: kristof.beyls, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D35117 llvm-svn: 308596	2017-07-20 09:25:44 +00:00
Stefan Maksimovic	8539f77bc3	[mips] Fix fp select machine verifier errors Introduced FSELECT node necesary when lowering ISD::SELECT which has i32, f64, f64 as its operands. SEL_D instruction required that its output and first operand of a SELECT node, which it used, have matching types. MTC1_D64 node introduced to aid FSELECT lowering. This fixes machine verifier errors on following tests: CodeGen/Mips/llvm-ir/select-dbl.ll CodeGen/Mips/llvm-ir/select-flt.ll CodeGen/Mips/select.ll Differential Revision: https://reviews.llvm.org/D35408 llvm-svn: 308595	2017-07-20 09:21:10 +00:00
Craig Topper	33225ef314	[X86] Use SARX/SHLX/SHLX instructions for (shift x (and y, (BitWidth-1))) Fixes PR33841. llvm-svn: 308591	2017-07-20 06:19:55 +00:00
Craig Topper	bdd114ef9d	[X86] Add test cases for (shift x (and y, (BitWidth-1))) to the BMI2 shift test. We should use SHLX and similar instructions for these patterns, but we currently don't. llvm-svn: 308590	2017-07-20 06:19:54 +00:00
Craig Topper	a774ecc7f5	[X86] Regenerate shift-and.ll and shift-bmi2.ll using update_llc_test_checks.py. I've stripped the checks for 64-bit types in 32-bit mode to match the existing tests. llvm-svn: 308589	2017-07-20 06:19:53 +00:00
Craig Topper	01d4ca3916	[X86] Remove outdated bug comment from a test. The test issue was fixed and the test was updated in r244577, but the comment wasn't removed. llvm-svn: 308588	2017-07-20 06:19:52 +00:00
Matt Arsenault	04004716ff	AMDGPU: Correct encoding for global instructions The soffset field needs to be be set to 0x7f to disable it, not 0. 0 is interpreted as an SGPR offset. This should be enough to get basic usage of the global instructions working. Technically it is possible to use an SGPR_32 offset, but I'm not sure if it's correct with 64-bit pointers, but that is not handled now. This should also be cleaned up to be more similar to how different MUBUF modes are handled, and to have InstrMappings between the different types. llvm-svn: 308583	2017-07-20 05:17:54 +00:00
Spyridoula Gravani	364b535234	[DWARF] Added check that verifies that no abbreviation declaration has more than one attribute with the same name. SUMMARY This patch adds a verification check on the abbreviation declarations in the .debug_abbrev section. The check makes sure that no abbreviation declaration has more than one attributes with the same name. Differential Revision: https://reviews.llvm.org/D35643 llvm-svn: 308579	2017-07-20 02:06:52 +00:00
Petr Hosek	d00e47fd71	Revert "[LLVM][llvm-objcopy] Added basic plumbing to get things started" This reverts commit 98f9792e7ca5bbd9eb43bda72bf497957cfb6eb8. llvm-svn: 308569	2017-07-20 00:13:32 +00:00
Petr Hosek	77da6f167c	[LLVM][llvm-objcopy] Added basic plumbing to get things started As discussed on llvm-dev I've implemented the first basic steps towards llvm-objcopy/llvm-objtool (name pending). This change adds the ability to copy (without modification) 64-bit little endian ELF executables that have SHT_PROGBITS, SHT_NOBITS, SHT_NULL and SHT_STRTAB sections. Patch by Jake Ehrlich Differential Revision: https://reviews.llvm.org/D33964 llvm-svn: 308559	2017-07-19 23:51:13 +00:00
Francis Visoiu Mistrih	52042aa21e	[PEI] Add basic opt-remarks support Add optimization remarks support to the PrologueEpilogueInserter. For now, emit the stack size as an analysis remark, but more additions wrt shrink-wrapping may be added. https://reviews.llvm.org/D35645 llvm-svn: 308556	2017-07-19 23:47:32 +00:00
Tim Northover	0e0b3c97dd	GlobalISel: fix SUBREG_TO_REG implementation. The first argument needs to be an immediate rather than a register. Should fix some crashes in the verifier bot. llvm-svn: 308540	2017-07-19 22:08:08 +00:00
Wolfgang Pieb	3610942c12	Forgot to add triple to test in r308513. llvm-svn: 308527	2017-07-19 21:45:21 +00:00
Davide Italiano	1985e9b7e1	[TRE] Add another test for OptRemark. This shows we emit a remark for tail recursion -> loop. llvm-svn: 308525	2017-07-19 21:39:51 +00:00
Davide Italiano	4b8c8eae32	[TRE] Move to the new OptRemark API. Fixes PR33788. Differential Revision: https://reviews.llvm.org/D35570 llvm-svn: 308524	2017-07-19 21:13:22 +00:00
Petr Hosek	eb04da3a56	[yaml2obj][ELF] Add support for program headers This change adds basic support for program headers. I need to do some testing which requires generating program headers but I can't use ld.lld or clang to produce programs that have headers. I'd also like to test some strange things that those programs may never produce. Patch by Jake Ehrlich Differential Revision: https://reviews.llvm.org/D35276 llvm-svn: 308520	2017-07-19 20:38:46 +00:00
Martin Storsjo	b2e9fcfca4	[AArch64] Force relocations for all ADRP instructions This generalizes an existing fix from ELF to MachO and COFF. Test that an ADRP to a local symbol whose offset is known at assembly time still produces relocations, both for MachO and COFF. Test that an ADRP without a @page modifier on MachO fails (previously it didn't). Differential Revision: https://reviews.llvm.org/D35544 llvm-svn: 308518	2017-07-19 20:14:32 +00:00
Martin Storsjo	2ff5f5d681	[AArch64, COFF] Interpret .align as power of two for COFF as well Differential Revision: https://reviews.llvm.org/D35545 llvm-svn: 308517	2017-07-19 20:14:24 +00:00
Wolfgang Pieb	e018bbd835	Fixing an issue with the initialization of LexicalScopes objects when mixing debug and non-debug units. Patch by Andrea DiBiagio. Differential Revision: https://reviews.llvm.org/D35637 llvm-svn: 308513	2017-07-19 19:36:40 +00:00
Krzysztof Parzyszek	ac01994db9	[Hexagon] Fix a bug in r308502: post-inc offset is always 0 llvm-svn: 308510	2017-07-19 19:17:32 +00:00
Peter Collingbourne	e776dd9ca2	LTO: Export functions referenced by the CFI jump table. If the LowerTypeTests pass decides to add a function to a jump table for CFI, it will add its name to the set cfiFunctionDefs, which among other things will cause the function to be renamed in the ThinLTO backend. One other thing that we must do with such functions is to not internalize them, because the jump table in the full LTO object will contain a reference to the actual function body in the ThinLTO object. This patch handles that by ensuring that we export any functions whose names appear in the cfiFunctionDefs set. Fixes PR33831. Differential Revision: https://reviews.llvm.org/D35605 llvm-svn: 308504	2017-07-19 18:18:19 +00:00
Davide Italiano	5fc5d0a406	[X86] Don't try to scale down if that exceeds the bitwidth. Fixes the crash reported in PR33844. llvm-svn: 308503	2017-07-19 18:09:46 +00:00
Peter Collingbourne	93fdaca5ac	ThinLTOBitcodeWriter: Do not rewrite intrinsic functions when splitting modules. Changing the type of an intrinsic may invalidate the IR. Differential Revision: https://reviews.llvm.org/D35593 llvm-svn: 308500	2017-07-19 17:54:29 +00:00
Tim Northover	d59fbec8e2	GlobalISel: select G_EXTRACT and G_INSERT instructions on AArch64. llvm-svn: 308493	2017-07-19 16:47:07 +00:00
Javed Absar	2cb0c95031	[ARM] Unify handling of M-Class system registers This patch cleans up and fixes issues in the M-Class system register handling: 1. It defines the system registers and the encoding (SYSm values) in one place: a new ARMSystemRegister.td using SearchableTable, thereby removing the hand-coded values which existed in multiple places. 2. Some system registers e.g. BASEPRI_MAX_NS which do not exist were being allowed! Ref: ARMv6/7/8M architecture reference manual. Reviewed by: @t.p.northover, @olist01, @john.brawn Differential Revision: https://reviews.llvm.org/D35209 llvm-svn: 308456	2017-07-19 12:57:16 +00:00
Simon Pilgrim	e5c7925c5e	[X86][XOP] Use default AVX2 lowering for v4i64 ashr by splat constants XOP shifts only support 128-bit vectors, so we were ending up with less optimal codegen requiring constants llvm-svn: 308430	2017-07-19 10:29:31 +00:00
Balaram Makam	b05a55787a	[SimplifyCFG] Defer folding unconditional branches to LateSimplifyCFG if it can destroy canonical loop structure. Summary: When simplifying unconditional branches from empty blocks, we pre-test if the BB belongs to a set of loop headers and keep the block to prevent passes from destroying canonical loop structure. However, the current algorithm fails if the destination of the branch is a loop header. Especially when such a loop's latch block is folded into loop header it results in additional backedges and LoopSimplify turns it into a nested loop which prevent later optimizations from being applied (e.g., loop unrolling and loop interleaving). This patch augments the existing algorithm by further checking if the destination of the branch belongs to a set of loop headers and defer eliminating it if yes to LateSimplifyCFG. Fixes PR33605: https://bugs.llvm.org/show_bug.cgi?id=33605 Reviewers: efriedma, mcrosier, pacxx, hsung, davidxl Reviewed By: efriedma Subscribers: ashutosh.nema, gberry, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D35411 llvm-svn: 308422	2017-07-19 08:53:34 +00:00
Ayal Zaks	8c452d76ed	[LV] Test once if vector trip count is zero, instead of twice Generate a single test to decide if there are enough iterations to jump to the vectorized loop, or else go to the scalar remainder loop. This test compares the Scalar Trip Count: if STC < VF * UF go to the scalar loop. If requiresScalarEpilogue() holds, at-least one iteration must remain scalar; the rest can be used to form vector iterations. So in this case the test checks instead if (STC - 1) < VF * UF by comparing STC <= VF * UF, and going to the scalar loop if so. Otherwise the vector loop is entered for at-least one vector iteration. This test covers the case where incrementing the backedge-taken count will overflow leading to an incorrect trip count of zero. In this (rare) case we will also avoid the vector loop and jump to the scalar loop. This patch simplifies the existing tests and effectively removes the basic-block originally named "min.iters.checked", leaving the single test in block "vector.ph". Original observation and initial patch by Evgeny Stupachenko. Differential Revision: https://reviews.llvm.org/D34150 llvm-svn: 308421	2017-07-19 05:16:39 +00:00
Serguei Katkov	4ea855ebe5	[CGP] Allow cycles during Phi traversal in OptimizaMemoryInst Allowing cycles in Phi traversal increases the scope of optimize memory instruction in case we are in loop. The added test shows an example of enabling optimization inside a loop. Reviewers: loladiro, spatel, efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35294 llvm-svn: 308419	2017-07-19 04:49:17 +00:00
Chandler Carruth	bb83558f00	Revert r308273 to reinstate part of r308100. That part was reverted because the underlying change necessitating it (r308025) was reverted in r308271. Nirav re-landed r308025 again in r308350, so re-landing this fix. llvm-svn: 308418	2017-07-19 04:15:30 +00:00
Chandler Carruth	06a86301a1	[PM/LCG] Follow-up fix to r308088 to handle deletion of library functions. In the prior commit, we provide ordering to the LCG between functions and library function definitions that they might begin to call through transformations. But we still would delete these library functions from the call graph if they became dead during inlining. While this immediately crashed, it also exposed a loss of information. We shouldn't remove definitions of library functions that can still usefully participate in the LCG-powered CGSCC optimization process. If new call edges are formed, we want to have definitions to be called. We can still remove these functions if truly dead using global-dce, etc, but removing them during the CGSCC walk is premature. This fixes a crash in the new PM when optimizing some unusual libraries that end up with "internal" lib functions such as the code in the "R" language's libraries. llvm-svn: 308417	2017-07-19 04:12:25 +00:00
Craig Topper	106b5b6856	AMD znver1 Initial Scheduler model Summary: This patch adds the following 1. Adds a skeleton scheduler model for AMD Znver1. 2. Introduces the znver1 execution units and pipes. 3. Caters the instructions based on the generic scheduler classes. 4. Further additions to the scheduler model with instruction itineraries will be carried out incrementally based on a. Instructions types b. Registers used 5. Since itineraries are not added based on instructions, throughput information are bound to change when incremental changes are added. 6. Scheduler testcases are modified accordingly to suit the new model. Patch by Ganesh Gopalasubramanian. With minor formatting tweaks from me. Reviewers: craig.topper, RKSimon Subscribers: javed.absar, shivaram, ddibyend, vprasad Differential Revision: https://reviews.llvm.org/D35293 llvm-svn: 308411	2017-07-19 02:45:14 +00:00
Petr Hosek	bf5155015c	[llvm-readobj] Install llvm-readelf alias Install an llvm-readelf symlink to llvm-readobj. When invoked as readelf, default to -elf-output-style=GNU. Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D33869 llvm-svn: 308408	2017-07-19 02:09:37 +00:00
Weiming Zhao	984f1dc338	Fix DebugLoc propagation for unreachable LoadInst Summary: Currently, when GVN creates a load and when InstCombine creates a new store for unreachable Load, the DebugLoc info gets lost. Reviewers: dberlin, davide, aprantl Reviewed By: aprantl Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D34639 llvm-svn: 308404	2017-07-19 01:27:24 +00:00
Adrian Prantl	d63bfd218b	Debug Info: Add a file: field to DIImportedEntity. DIImportedEntity has a line number, but not a file field. To determine the decl_line/decl_file we combine the line number from the DIImportedEntity with the file from the DIImportedEntity's scope. This does not work correctly when the parent scope is a DINamespace or a DIModule, both of which do not have a source file. This patch adds a file field to DIImportedEntity to unambiguously identify the source location of the using/import declaration. Most testcase updates are mechanical, the interesting one is the removal of the FIXME in test/DebugInfo/Generic/namespace.ll. This fixes PR33822. See https://bugs.llvm.org/show_bug.cgi?id=33822 for more context. <rdar://problem/33357889> https://bugs.llvm.org/show_bug.cgi?id=33822 Differential Revision: https://reviews.llvm.org/D35583 llvm-svn: 308398	2017-07-19 00:09:54 +00:00
Petr Hosek	ef54b49d16	[llvm-readobj] Accept (and ignore) --wide (-W) command-line switch Accept and ignore --wide/-W. In GNU readelf this switch is necessary to get the output format that's consistent between 32-bit and 64-bit targets. llvm-readobj always produces that output format. Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D33873 llvm-svn: 308396	2017-07-18 23:58:15 +00:00
Petr Hosek	4565de8fc3	[llvm-readobj] Accept -S as an alias for --sections In GNU readelf, the short option for --sections is upper-case -S. Note that GNU uses lower-case -s to mean --symbols, while LLVM uses -s to mean --sections and -t to mean --symbols (-t has yet a different meaning in GNU). So command-line uses with -S can now be compatible, but uses with -s or -t are still incompatible. Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D33872 llvm-svn: 308392	2017-07-18 23:35:22 +00:00
Vitaly Buka	74443f0778	[asan] Copy arguments passed by value into explicit allocas for ASan Summary: ASan determines the stack layout from alloca instructions. Since arguments marked as "byval" do not have an explicit alloca instruction, ASan does not produce red zones for them. This commit produces an explicit alloca instruction and copies the byval argument into the allocated memory so that red zones are produced. Submitted on behalf of @morehouse (Matt Morehouse) Reviewers: eugenis, vitalybuka Reviewed By: eugenis Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D34789 llvm-svn: 308387	2017-07-18 22:28:03 +00:00
Martell Malone	1079ef8dfe	llvm: add llvm-dlltool support to the archiver A PE COFF spec compliant import library generator. Intended to be used with mingw-w64. Supports: PE COFF spec (section 8, Import Library Format) PE COFF spec (Aux Format 3: Weak Externals) Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D29892 This reapplies rL308329, which was reverted in rL308374 llvm-svn: 308379	2017-07-18 21:26:38 +00:00
Rui Ueyama	6db83a3af3	Revert r308329: llvm: add llvm-dlltool support to the archiver This reverts commit r308329 because it broke buildbots. llvm-svn: 308374	2017-07-18 21:07:13 +00:00
Mandeep Singh Grang	d857b4ca98	[COFF, ARM64] Reserve X18 register by default Reviewers: compnerd, rnk, ruiu, mstorsjo Reviewed By: mstorsjo Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D35531 llvm-svn: 308358	2017-07-18 20:41:33 +00:00
Nirav Dave	d839749ae8	[DAG] Improve Aliasing of operations to static alloca Re-recommiting after landing DAG extension-crash fix. Recommiting after adding check to avoid miscomputing alias information on addresses of the same base but different subindices. Memory accesses offset from frame indices may alias, e.g., we may merge write from function arguments passed on the stack when they are contiguous. As a result, when checking aliasing, we consider the underlying frame index's offset from the stack pointer. Static allocs are realized as stack objects in SelectionDAG, but its offset is not set until post-DAG causing DAGCombiner's alias check to consider access to static allocas to frequently alias. Modify isAlias to consider access between static allocas and access from other frame objects to be considered aliasing. Many test changes are included here. Most are fixes for tests which indirectly relied on our aliasing ability and needed to be modified to preserve their original intent. The remaining tests have minor improvements due to relaxed ordering. The exception is CodeGen/X86/2011-10-19-widen_vselect.ll which has a minor degradation dispite though the pre-legalized DAG is improved. Reviewers: rnk, mkuper, jonpa, hfinkel, uweigand Reviewed By: rnk Subscribers: sdardis, nemanjai, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33345 llvm-svn: 308350	2017-07-18 20:06:24 +00:00
James Y Knight	dda87cab7d	[Sparc] Added software multiplication/division feature Added a feature to the Sparc back-end that replaces the integer multiply and divide instructions with calls to .mul/.sdiv/.udiv. This is a step towards having full v7 support. Patch by: Eric Kedaigle Differential Revision: https://reviews.llvm.org/D35500 llvm-svn: 308343	2017-07-18 19:08:38 +00:00
Nirav Dave	07871007aa	[DAG] Avoid deleting nodes before combining them. When replacing a node and it's operand, replacing the operand node may cause the deletion of the original node leading to an assertion failure. Case around these replacements to avoid this without relying on inspecting the DELETED_NODE opcode in various extend dagcombiner cases. Fixes PR32515. Reviewers: dbabokin, RKSimon, davide, chandlerc Subscribers: chandlerc, llvm-commits Differential Revision: https://reviews.llvm.org/D34095 llvm-svn: 308330	2017-07-18 17:39:15 +00:00
Martell Malone	afe8549269	llvm: add llvm-dlltool support to the archiver A PE COFF spec compliant import library generator. Intended to be used with mingw-w64. Supports: PE COFF spec (section 8, Import Library Format) PE COFF spec (Aux Format 3: Weak Externals) Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D29892 llvm-svn: 308329	2017-07-18 17:39:11 +00:00
Matt Arsenault	254ad3de5c	AMDGPU: Annotate necessity of flat-scratch-init As an approximation of the existing handling to avoid regressions. Fixes using too many registers with calls on subtargets with the SGPR allocation bug. llvm-svn: 308326	2017-07-18 16:44:58 +00:00
Matt Arsenault	1cc47f8413	AMDGPU: Figure out private memory regs after lowering Introduce pseudo-registers for registers needed for stack access, which are replaced during finalizeLowering. Note these pseudo-registers are currently only used for the used register location, and not for determining their input argument register. This is better because it avoids the need to try to predict whether a call will be emitted from the IR, and also detects stack objects introduced by legalization. Test changes are from the HasStackObjects check being more accurate since stack objects introduced during legalization are now known. llvm-svn: 308325	2017-07-18 16:44:56 +00:00
Geoff Berry	9962faed2b	[AArch64][Falkor] Avoid HW prefetcher tag collisions (step 2) Summary: Avoid HW prefetcher instruction tag collisions in loops by inserting MOVs to change the base address register of strided loads. Reviewers: t.p.northover, mcrosier Subscribers: aemerson, rengolin, javed.absar, kristof.beyls, hfinkel, llvm-commits Differential Revision: https://reviews.llvm.org/D35366 llvm-svn: 308324	2017-07-18 16:14:22 +00:00
Simon Pilgrim	964a1f1fb0	[X86][AVX] Regenerate shift test to show constant broadcast comment llvm-svn: 308323	2017-07-18 16:07:12 +00:00
Simon Pilgrim	483927aefb	[x86, CGP] increase memcmp() expansion up to 4 load pairs It should be a win to avoid going out to the system lib for all small memcmp() calls using scalar ops. For x86 32-bit, this means most everything up to 16 bytes. For 64-bit, that doubles because we can do 8-byte loads. Notes: Reduced from 4 to 2 loads for -Os behavior, which might not be optimal in all cases. It's effectively a question of how much do we trust the system implementation. Linux and macOS (and Windows I assume, but did not test) have optimized memcmp() code for x86, so it's probably not bad either way? PPC is using 8/4 for defaults on these. We do not expand at all for -Oz. There are still potential improvements to make for the CGP expansion IR and/or lowering such as avoiding select-of-constants (D34904) and not doing zexts to the max load type before doing a compare. We have special-case SSE/AVX codegen for (memcmp(x, y, 16/32) == 0) that will no longer be produced after this patch. I've shown the experimental justification for that change in PR33329: https://bugs.llvm.org/show_bug.cgi?id=33329#c12 TLDR: While the vector code is a likely winner, we can't guarantee that it's a winner in all cases on all CPUs, so I'm willing to sacrifice it for the greater good of expanding all small memcmp(). If we want to resurrect that codegen, it can be done by adjusting the CGP params or poking a hole to let those fall-through the CGP expansion. Committed on behalf of Sanjay Patel Differential Revision: https://reviews.llvm.org/D35067 llvm-svn: 308322	2017-07-18 15:55:30 +00:00
Sumanth Gundapaneni	d5aa0f3464	[Hexagon] Emit lookup tables in text section based on a flag The flag "-hexagon-emit-lut-text" (defaulted to false) is added to decide on where to keep the switch generated lookup table. Differential Revision: https://reviews.llvm.org/D34818 llvm-svn: 308316	2017-07-18 15:31:37 +00:00
Nicolai Haehnle	a253e4c028	AMDGPU: Fix crash when folding immediates into multiple uses Summary: When an immediate is folded by constant folding, we re-scan the entire use list for two reasons: 1. The constant folding may have created a new use of the same reg. 2. The constant folding may have removed an additional use in the list we're currently traversing (e.g., constant folding an S_ADD_I32 c, c). However, this could previously lead to a crash when an unrelated use was added twice into the FoldList. Since we re-scan the whole list anyway, we might as well just clear the FoldList again before we do so. Using a MIR test to show this because real code seems to trigger the issue only in connection with some really subtle control flow structures. Fixes GL45-CTS.shading_language_420pack.binding_images on gfx9. Reviewers: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D35416 llvm-svn: 308314	2017-07-18 14:54:41 +00:00
Simon Pilgrim	c2cbb525ec	[X86] Add optsize and minsize memcmp tests (D35067) llvm-svn: 308311	2017-07-18 14:26:07 +00:00
Sam Kolton	4685b70a77	[AMDGPU] resubmit r308179: CodeGen: check dst operand type to determine if omod is supported for VOP3 instructions llvm-svn: 308310	2017-07-18 14:23:26 +00:00
Simon Pilgrim	420e5eadc2	[X86] Added cmov target to memcmp test As discussed by @spatel on D35067: "I added the cmov attribute to the 32-bit codegen test because it removes some noise for that file. I think the intent for the SSE vs no-SSE runs is to show the potential difference for the 16 and 32 byte cases rather than the lack of cmov (which has been available for all CPUs since SSE1, so that's why it shows up automatically with -mattr=sse2)." llvm-svn: 308309	2017-07-18 14:19:34 +00:00
Daniel Sanders	40b66d646e	[globalisel][tablegen] Enable the import of rules involving fma. Summary: G_FMA was recently added to GlobalISel which enables the import of rules involving fma. Add the mapping to allow it. Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar Reviewed By: rovka Subscribers: kristof.beyls, javed.absar, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D35130 llvm-svn: 308308	2017-07-18 14:10:07 +00:00
Dmitry Preobrazhensky	30fc523984	[AMDGPU][MC] Corrected disassembler for proper decoding of v_mqsad_u32_u8 See Bug 33639: https://bugs.llvm.org//show_bug.cgi?id=33639 Reviewers: vpykhtin, artem.tamazov Differential Revision: https://reviews.llvm.org/D34892 llvm-svn: 308303	2017-07-18 13:12:48 +00:00
Simon Pilgrim	4793a11df9	[DAGCombine] Fix issue with out of bound constant rotation (PR33828) Take the modulo of rotations by a constant greater than or equal to the bit-width llvm-svn: 308302	2017-07-18 12:31:46 +00:00
Stefan Maksimovic	58f225b371	[mips] Alter register classes for MSA pseudo f16 instructions This change introduces additional machine instructions in functions dealing with the expansion of msa pseudo f16 instructions due to register classes being inappropriate when checked with machine verifier. Differential Revision: https://reviews.llvm.org/D34276 llvm-svn: 308301	2017-07-18 12:05:35 +00:00
Dorit Nuzman	ca4fd18ddc	PSCEV] Create AddRec for Phis in cases of possible integer overflow, using runtime checks Extend the SCEVPredicateRewriter to work a bit harder when it encounters an UnknownSCEV for a Phi node; Try to build an AddRecurrence also for Phi nodes whose update chain involves casts that can be ignored under the proper runtime overflow test. This is one step towards addressing PR30654. Differential revision: http://reviews.llvm.org/D30041 llvm-svn: 308299	2017-07-18 11:57:08 +00:00
Alexander Potapenko	9385aaa848	[sancov] Fix PR33732 Coverage hooks that take less-than-64-bit-integers as parameters need the zeroext parameter attribute (http://llvm.org/docs/LangRef.html#paramattrs) to make sure they are properly extended by the x86_64 ABI. llvm-svn: 308296	2017-07-18 11:47:56 +00:00
Simon Pilgrim	0636fbd737	[X86][AVX512] Add ISD::ROTL/ISD::ROTR constant folding tests llvm-svn: 308295	2017-07-18 11:18:38 +00:00
Simon Pilgrim	8d0fc91adc	[X86] Add test case for PR32282 llvm-svn: 308286	2017-07-18 10:09:40 +00:00
Diana Picus	da25d5b8b0	[ARM] GlobalISel: Support G_(S\|U)REM for s8 and s16 Widen to s32, and then do whatever Lowering/Custom/Libcall action the subtarget wants. llvm-svn: 308285	2017-07-18 10:07:01 +00:00
Florian Hahn	6c7f0256c6	[LoopInterchange] Split up interchange.ll test case (NFC). Summary: Currently most tests for the loop interchange pass are in test/Transforms/LoopInterchange/interchange.ll. This patch splits up the large test file in smaller pieces, which makes debugging test failures easier. Reviewers: karthikthecool, blitz.opensource, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, mcrosier, mkuper, mzolotukhin, mssimpso, llvm-commits Differential Revision: https://reviews.llvm.org/D35488 llvm-svn: 308284	2017-07-18 09:47:06 +00:00
Florian Hahn	3530094de6	[AArch64] Use 16 bytes as preferred function alignment on Cortex-A73. Summary: Using 16 byte alignment is beneficial on Cortex-A73, similar to Cortex-A72 (added in D34961). Reviewers: mcrosier, t.p.northover, aadg, silviu.baranga Reviewed By: t.p.northover Subscribers: aemerson, rengolin, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D35493 llvm-svn: 308283	2017-07-18 09:31:18 +00:00
Dmitry Preobrazhensky	095ec3da81	[AMDGPU][MC] Added missing VOP3P opcodes Added support of the following opcodes: v_pk_sub_u16 v_pk_mad_i16 v_pk_mad_u16 See Bug 33593: https://bugs.llvm.org//show_bug.cgi?id=33593 Reviewers: vpykhtin, artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D34890 llvm-svn: 308281	2017-07-18 09:24:10 +00:00
Jonas Paulsson	d667417e80	[SystemZ, AsmParser] Enable the mnemonic spell corrector. This enables the suggestions of other mnemonics when invalid ones are specified. Review: Ulrich Weigand llvm-svn: 308280	2017-07-18 09:17:00 +00:00
Chandler Carruth	3a9968184a	Revert part of r308100 since the cause (r308025) was also reverted. The commit r308100 updated WebAssembly tests for r308025. In one case it merely made the test more resilient but in another case it made a substantive update. Because r308025 was reverted in r308271, these changes to the test also need to be reverted. They should be folded into the recommit of r308025 when it is ready. llvm-svn: 308273	2017-07-18 08:20:50 +00:00
Chandler Carruth	0781d52cb3	[x86] Add a missing triple, without which the CPU won't parse. Notably, this is failing on our PPC build bots: http://lab.llvm.org:8011/builders/clang-ppc64le-linux/builds/8338/steps/ninja%20check%201/logs/FAIL%3A%20LLVM%3A%3Apr33772.ll llvm-svn: 308272	2017-07-18 08:16:32 +00:00
Chandler Carruth	a15e080b05	Revert r308025 due to uncovering a crash in SelectionDAG. This is filed with a minimal test case in http://llvm.org/PR33833. Original commit message: Improve Aliasing of operations to static alloca llvm-svn: 308271	2017-07-18 07:53:47 +00:00
Chandler Carruth	9a7442d088	Revert r308179 which causes tablegen to spam stderr on every build. Original commit log: [AMDGPU] CodeGen: check dst operand type to determine if omod is supported for VOP3 instructions llvm-svn: 308270	2017-07-18 07:40:47 +00:00
Craig Topper	f54a500101	[X86] Prevent an assertion failure if a gather intrinsic is passed a non-constant scale value. This isn't legal code, but we shouldn't crash on it. Now we just don't convert the gather intrinsic if the scale isn't constant and let it go through to isel where we'll report an isel failure. Fixes PR33772. llvm-svn: 308267	2017-07-18 06:49:23 +00:00
Max Kazantsev	2c627a97fd	[IRCE] Recognize loops with ne/eq latch conditions In some particular cases eq/ne conditions can be turned into equivalent slt/sgt conditions. This patch teaches parseLoopStructure to handle some of these cases. Differential Revision: https://reviews.llvm.org/D35010 llvm-svn: 308264	2017-07-18 04:53:48 +00:00
Eric Beckmann	d782ca3a3c	Revert "Adding temporary debugging info to llvm-mt to solve fedora failure." This reverts commit 223ef99f839f6b056272bcf4390841fcb26dda3c. llvm-svn: 308263	2017-07-18 03:38:04 +00:00
Eric Beckmann	5d77c20a7a	Change '?' to 'h' in llvm-mt test. Some shells seem to have trouble parsing non-alphanumeric symbols. llvm-svn: 308259	2017-07-18 02:18:19 +00:00
Eric Beckmann	21c7e9e346	Adding extra test info for llvm-mt. llvm-svn: 308252	2017-07-18 01:13:10 +00:00
Daniel Neilson	3c7f89f382	Add element-atomic mem intrinsic canary tests for Memory Sanitizer. Summary: Add canary tests to verify that MSAN currently does nothing with the element atomic memory intrinsics for memcpy, memmove, and memset. Placeholder tests that will fail once element atomic @llvm.mem[cpy\|move\|set] instrinsics have been added to the MemIntrinsic class hierarchy. These will act as a reminder to verify that MSAN handles these intrinsics properly once they have been added to that class hierarchy. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35510 llvm-svn: 308251	2017-07-18 01:06:54 +00:00
Daniel Neilson	ebf288a206	Add element-atomic mem intrinsic canary tests for Efficiency Sanitizer. Summary: Add canary tests to verify that ESAN currently does nothing with the element atomic memory intrinsics for memcpy, memmove, and memset. Placeholder tests that will fail once element atomic @llvm.mem[cpy\|move\|set] instrinsics have been added to the MemIntrinsic class hierarchy. These will act as a reminder to verify that ESAN handles these intrinsics properly once they have been added to that class hierarchy. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35508 llvm-svn: 308250	2017-07-18 01:06:53 +00:00
Daniel Neilson	92be70f510	Add element-atomic mem intrinsic canary tests for Dataflow Sanitizer. Summary: Add canary tests to verify that DFSAN currently does nothing with the element atomic memory intrinsics for memcpy, memmove, and memset. Placeholder tests that will fail once @llvm.mem[cpy\|move\|set] instrinsics have been added to the MemIntrinsic class hierarchy. These will act as a reminder to verify that DFSAN handles these intrinsics properly once they have been added to that class hierarchy. Note that there could be some trickiness with these element-atomic intrinsics for the dataflow sanitizer in racy multithreaded programs. The data flow sanitizer inserts additional lib calls to mirror the memory intrinsic's action, so it is possible (very likely, even) that the dfsan buffers will not be in sync with the original buffers. Furthermore, implementation of the dfsan buffer updates for the element atomic intrinsics will have to also use unordered atomic instructions. If we can assume that dfsan is never run on racy multithreaded programs, then the element atomic memory intrinsics can pretty much be treated the same as the regular memory intrinsics. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35507 llvm-svn: 308249	2017-07-18 01:06:52 +00:00
Daniel Neilson	5fc2ee7fab	Add element-atomic mem intrinsic canary tests for Address Sanitizer. Summary: Add canary tests to verify that ASAN currently does nothing with the element atomic memory intrinsics for memcpy, memmove, and memset. Placeholder tests that will fail once element atomic @llvm.mem[cpy\|move\|set] instrinsics have been added to the MemIntrinsic class hierarchy. These will act as a reminder to verify that ASAN handles these intrinsics properly once they have been added to that class hierarchy. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35505 llvm-svn: 308248	2017-07-18 01:06:52 +00:00
Daniel Neilson	a3c7f4cba3	Add element-atomic mem intrinsic canary tests for InstCombine. Summary: Add canary tests to verify that InstCombine currently does nothing with the element atomic memory intrinsics for memmove and memset. Placeholder tests that will fail once element atomic @llvm.mem[move\|set] instrinsics have been added to the MemIntrinsic class hierarchy. These will act as a reminder to verify that inst combine handles these intrinsics properly once they have been added to that class hierarchy. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35502 llvm-svn: 308247	2017-07-18 01:06:47 +00:00
Spyridoula Gravani	f6bd788dda	[DWARF] Modification of code for the verification of .debug_info section. Summary: This patch modifies the handleDebugInfo() function so that we verify the contents of each unit in the .debug_info section only if its header has been successfully verified. This change will allow for more/different verification checks depending on the type of the unit since from dwarf5, the .debug_info section may consist of different types of units. Subscribers: aprantl Differential Revision: https://reviews.llvm.org/D35521 llvm-svn: 308245	2017-07-18 01:00:26 +00:00
Reid Kleckner	52465615d4	Fix pdbdump-headers.test after TPI hash changes llvm-svn: 308244	2017-07-18 00:44:10 +00:00
Reid Kleckner	c50349d4c6	[PDB] Finish and simplify TPI hashing Summary: This removes the CVTypeVisitor updater and verifier classes. They were made dead by the minimal type dumping refactoring. Replace them with a single function that takes a type record and produces a hash. Call this from the minimal type dumper and compare the hash. I also noticed that the microsoft-pdb reference repository uses a basic CRC32 for records that aren't special. We already have an implementation of that CRC ready to use, because it's used in COFF for ICF. I'll make LLD call this hashing utility in a follow-up change. We might also consider using this same hash in type stream merging, so that we don't have to hash our records twice. Reviewers: inglorion, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D35515 llvm-svn: 308240	2017-07-18 00:33:45 +00:00
Reid Kleckner	67653ee086	[codeview] Fix YAML for LF_TYPESERVER2 by hoisting PDB_UniqueId Summary: We were treating the GUIDs in TypeServer2Record as strings, and the non-ASCII bytes in the GUID would not round-trip through YAML. We already had the PDB_UniqueId type portably represent a Windows GUID, but we need to hoist that up to the DebugInfo/CodeView library so that we can use it in the TypeServer2Record as well as in PDB parsing code. Reviewers: inglorion, amccarth Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D35495 llvm-svn: 308234	2017-07-17 23:59:44 +00:00
Matt Arsenault	e15855d9e3	AMDGPU: Annotate features from x work item/group IDs. This wasn't necessary before since they are always enabled for kernels, but this is necessary if they need to be forwarded to a callable function. llvm-svn: 308226	2017-07-17 22:35:50 +00:00
Eric Beckmann	8299850670	Create empty shell of llvm-mt. Summary: This is the first patch towards creating the llvm-mt tool for merging Windows manifests. This is a reimplementation of mt.exe. Reviewers: zturner, ruiu, rnk Subscribers: llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D35333 llvm-svn: 308224	2017-07-17 21:35:12 +00:00
Martin Storsjo	2f24e93481	[AArch64] Extend CallingConv::X86_64_Win64 to AArch64 as well Rename the enum value from X86_64_Win64 to plain Win64. The symbol exposed in the textual IR is changed from 'x86_64_win64cc' to 'win64cc', but the numeric value is kept, keeping support for old bitcode. Differential Revision: https://reviews.llvm.org/D34474 llvm-svn: 308208	2017-07-17 20:05:19 +00:00
Teresa Johnson	f9dc3deaa6	Revert "Restore with fix "[ThinLTO] Ensure we always select the same function copy to import"" This reverts commit r308114 (and follow on fixes to test). There is a linking failure in a ThinLTO bot: http://green.lab.llvm.org/green/job/clang-stage2-configure-Rthinlto_build/3663/ (and undefined reference). It seems like it must be a second order effect of the heuristic change I made, and may take some time to try to reproduce locally and track down. Therefore, reverting for now. llvm-svn: 308206	2017-07-17 19:25:38 +00:00
Ulrich Weigand	f2968d58cb	[SystemZ] Add support for IBM z14 processor (3/3) This adds support for the new 128-bit vector float instructions of z14. Note that these instructions actually only operate on the f128 type, since only each 128-bit vector register can hold only one 128-bit float value. However, this is still preferable to the legacy 128-bit float instructions, since those operate on pairs of floating-point registers (so we can hold at most 8 values in registers), while the new instructions use single vector registers (so we hold up to 32 value in registers). Adding support includes: - Enabling the instructions for the assembler/disassembler. - CodeGen for the instructions. This includes allocating the f128 type now to the VR128BitRegClass instead of FP128BitRegClass. - Scheduler description support for the instructions. Note that for a small number of operations, we have no new vector instructions (like integer <-> 128-bit float conversions), and so we use the legacy instruction and then reformat the operand (i.e. copy between a pair of floating-point registers and a vector register). llvm-svn: 308196	2017-07-17 17:44:20 +00:00
Ulrich Weigand	33435c4c9c	[SystemZ] Add support for IBM z14 processor (2/3) This adds support for the new 32-bit vector float instructions of z14. This includes: - Enabling the instructions for the assembler/disassembler. - CodeGen for the instructions, including new LLVM intrinsics. - Scheduler description support for the instructions. - Update to the vector cost function calculations. In general, CodeGen support for the new v4f32 instructions closely matches support for the existing v2f64 instructions. llvm-svn: 308195	2017-07-17 17:42:48 +00:00
Ulrich Weigand	2b3482fe85	[SystemZ] Add support for IBM z14 processor (1/3) This patch series adds support for the IBM z14 processor. This part includes: - Basic support for the new processor and its features. - Support for new instructions (except vector 32-bit float and 128-bit float). - CodeGen for new instructions, including new LLVM intrinsics. - Scheduler description for the new processor. - Detection of z14 as host processor. Support for the new 32-bit vector float and 128-bit vector float instructions is provided by separate patches. llvm-svn: 308194	2017-07-17 17:41:11 +00:00
Mandeep Singh Grang	ed64963f1e	[llvm] Remove redundant check-prefix=CHECK from tests. NFC. Reviewers: t.p.northover, oren_ben_simhon, niravd, mcrosier Reviewed By: oren_ben_simhon, mcrosier Subscribers: nhaehnle, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D35466 llvm-svn: 308193	2017-07-17 17:32:45 +00:00
Krzysztof Parzyszek	5eef92eb7f	[Hexagon] Remove custom lowering of loads of v4i16 The target-independent lowering works fine, except concatenating 32-bit words. Add a pattern to generate A2_combinew instead of 64-bit asl/or. llvm-svn: 308186	2017-07-17 15:45:45 +00:00
Simon Pilgrim	948eca371e	[X86] Add LEA scheduling tests llvm-svn: 308180	2017-07-17 14:37:17 +00:00
Sam Kolton	a2b9e2f755	[AMDGPU] CodeGen: check dst operand type to determine if omod is supported for VOP3 instructions Summary: Previously, CodeGen checked first src operand type to determine if omod is supported by instruction. This isn't correct for some instructions: e.g. V_CMP_EQ_F32 has floating-point src operands but desn't support omod. Changed .td files to check if dst operand instead of src operand. Reviewers: arsenm, vpykhtin Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D35350 llvm-svn: 308179	2017-07-17 14:23:38 +00:00
Simon Pilgrim	1cbe8c2ca5	[X86][AVX512] Add lowering of vXi32/vXi64 ISD::ROTL/ISD::ROTR Add support for lowering to ISD::ROTL/ISD::ROTR, including rotate by immediate Differential Revision: https://reviews.llvm.org/D35463 llvm-svn: 308177	2017-07-17 14:11:30 +00:00
Simon Pilgrim	105a3716bb	Fixed line endings. NFCI. llvm-svn: 308175	2017-07-17 13:58:20 +00:00
Simon Pilgrim	11199b2ee5	[X86][AVX] Fix typo in vector rotate tests Was preventing rotate matching llvm-svn: 308171	2017-07-17 10:35:51 +00:00
Simon Pilgrim	5aa70e7fe5	[X86][AVX512] Add constant splat vector rotate tests for D35463 llvm-svn: 308169	2017-07-17 10:09:48 +00:00
Simon Pilgrim	701e25edce	[X86][AVX512] Regenerate shift tests llvm-svn: 308168	2017-07-17 09:53:45 +00:00
Mandeep Singh Grang	a210f1d7bf	[COFF, ARM64] Add initial relocation types Reviewers: compnerd, ruiu, rnk Reviewed By: compnerd Subscribers: mstorsjo, aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34857 llvm-svn: 308154	2017-07-17 00:05:32 +00:00
Dylan McKay	5c8a50bddd	[AVR] Add/remove XFAILs to get the backend passing Generic CodeGen tests A few tests have since been fixed, and a few since now fail. llvm-svn: 308151	2017-07-16 23:33:50 +00:00
Andrew Zhogin	67a64041b9	[DAGCombiner] Recognise vector rotations with non-splat constants Fixes PR33691. Differential revision: https://reviews.llvm.org/D35381 llvm-svn: 308150	2017-07-16 23:11:45 +00:00
Dylan McKay	2c59215ae3	[AVR] Fix a typo in the tests llvm-svn: 308148	2017-07-16 22:31:07 +00:00
Konstantin Zhuravlyov	2ec725c9d8	AMDGPU: Fix amdgpu-flat-work-group-size/amdgpu-waves-per-eu check Differential Revision: https://reviews.llvm.org/D35433 llvm-svn: 308147	2017-07-16 19:38:47 +00:00
Simon Pilgrim	2899ec88fc	[X86][AVX512] Add 512-bit vector rotate tests llvm-svn: 308146	2017-07-16 19:26:49 +00:00
Davide Italiano	579064e2c1	[InstCombine] Don't violate dominance when replacing instructions. Differential Revision: https://reviews.llvm.org/D35376 llvm-svn: 308144	2017-07-16 18:56:30 +00:00
Amjad Aboud	4563c062b1	[X86] X86::CMOV to Branch heuristic based optimization. LLVM compiler recognizes opportunities to transform a branch into IR select instruction(s) - later it will be lowered into X86::CMOV instruction, assuming no other optimization eliminated the SelectInst. However, it is not always profitable to emit X86::CMOV instruction. For example, branch is preferable over an X86::CMOV instruction when: 1. Branch is well predicted 2. Condition operand is expensive, compared to True-value and the False-value operands In CodeGenPrepare pass there is a shallow optimization that tries to convert SelectInst into branch, but it is not enough. This commit, implements machine optimization pass that converts X86::CMOV instruction(s) into branch, based on a conservative heuristic. Differential Revision: https://reviews.llvm.org/D34769 llvm-svn: 308142	2017-07-16 17:39:56 +00:00
Simon Pilgrim	dad2aef037	[X86] Add F16C scheduling tests llvm-svn: 308138	2017-07-16 14:34:18 +00:00
Simon Pilgrim	6f26f3d07f	[X86] Add POPCNT scheduling tests llvm-svn: 308137	2017-07-16 14:22:39 +00:00
Simon Pilgrim	b884b208ee	[X86] Add BMI2 scheduling tests llvm-svn: 308136	2017-07-16 14:09:15 +00:00
Simon Pilgrim	dfb6eb279f	[X86] Add BMI1 scheduling tests llvm-svn: 308135	2017-07-16 13:59:44 +00:00
Simon Pilgrim	7194513268	[X86] Add LZCNT scheduling tests llvm-svn: 308133	2017-07-16 13:40:44 +00:00
Simon Pilgrim	73ef87978f	[X86][SSE4A] Add EXTRQ/INSERTQ values to BTVER2 scheduling model llvm-svn: 308132	2017-07-16 12:06:06 +00:00
Simon Pilgrim	7d43bcfd2d	[X86][AVX] Regenerate tests with constant broadcast comments llvm-svn: 308131	2017-07-16 11:43:16 +00:00
Simon Pilgrim	e47df64a18	[X86][AVX] Regenerate vector tzcnt tests with constant broadcast comments llvm-svn: 308130	2017-07-16 11:40:23 +00:00
Simon Pilgrim	17f20f48c2	[X86][AVX] Regenerate vector idiv tests with constant broadcast comments llvm-svn: 308129	2017-07-16 11:38:14 +00:00
Simon Pilgrim	77ce072f6b	[X86][AVX] Regenerate combine tests with constant broadcast comments llvm-svn: 308128	2017-07-16 11:36:11 +00:00
Hiroshi Inoue	7f46baff2c	fix typos in comments; NFC llvm-svn: 308127	2017-07-16 08:11:56 +00:00
Hiroshi Inoue	a9ee279e70	fix typos in comments; NFC llvm-svn: 308126	2017-07-16 07:48:48 +00:00
Teresa Johnson	9f2f94f64b	Fix bot failures from r308114 Finally figured out that some bots were failing from r308114 with the message: llvm-lto2: LTO::run failed: No available targets are compatible with this triple. after adding in some other checking that finally caused this to show up in the FileCheck output. Added "REQUIRES: x86-registered-target" which should fix it. llvm-svn: 308119	2017-07-16 00:28:22 +00:00
Teresa Johnson	664a3fb238	Attempt 2 to debug bot failures Modify checks from r308114 even more, to see if I can narrow down why some bots are still failing. llvm-svn: 308116	2017-07-16 00:01:16 +00:00
Teresa Johnson	8fd09a5163	Attempt to debug bot failures Simplifying checks from r308114, to see if I can narrow down why some bots are still failing. llvm-svn: 308115	2017-07-15 23:31:32 +00:00
Teresa Johnson	a7660b0127	Restore with fix "[ThinLTO] Ensure we always select the same function copy to import" This restores r308078/r308079 with a fix for bot non-determinisim (make sure we run llvm-lto in single threaded mode so the debug output doesn't get interleaved). llvm-svn: 308114	2017-07-15 22:58:06 +00:00
Craig Topper	d918d5b36b	[InstCombine] Improve the expansion in SimplifyUsingDistributiveLaws to handle cases where one side doesn't simplify, but the other side resolves to an identity value Summary: If one side simplifies to the identity value for inner opcode, we can replace the value with just the operation that can't be simplified. I've removed a couple now unneeded special cases in visitAnd and visitOr. There are probably other cases I missed. Reviewers: spatel, majnemer, hfinkel, dberlin Reviewed By: spatel Subscribers: grandinj, llvm-commits, spatel Differential Revision: https://reviews.llvm.org/D35451 llvm-svn: 308111	2017-07-15 21:49:49 +00:00
Simon Pilgrim	f9ea0959d9	[X86][AVX] Regenerate tests with constant broadcast comments llvm-svn: 308110	2017-07-15 21:17:35 +00:00
Simon Pilgrim	c2221ee767	[X86][AVX] Regenerate tests with constant broadcast comments llvm-svn: 308109	2017-07-15 20:28:09 +00:00
Reid Kleckner	af88a910fd	[CodeView] Dump BuildInfoSym and ProcSym type indices I need to print the type index in hex so that I can match it in FileCheck for a test I'm writing. llvm-svn: 308107	2017-07-15 18:10:39 +00:00
Sanjay Patel	3437ee2740	[InstCombine] improve (1 << x) & 1 --> zext(x == 0) folding 1. Add a one-use check to prevent increasing instruction count. 2. Generalize the pattern matching to include vector types. llvm-svn: 308105	2017-07-15 17:26:01 +00:00
Craig Topper	e29b112f49	[InstCombine] Add test cases for (X & (Y \| ~X)) -> (X & Y) where the not is an inverted compare. NFC Do the same for (X \| (Y & ~X)) -> (X \| Y) llvm-svn: 308104	2017-07-15 17:09:23 +00:00
Craig Topper	860e0ba5da	[InstCombine] Move 4 test cases from a test that didn't use FileCheck and merge them into a existing test file. NFC llvm-svn: 308103	2017-07-15 17:09:22 +00:00
Sanjay Patel	a59aac675c	[InstCombine] add tests for (1 << x) & 1 --> zext(x == 0) ; NFC This fold hit the trifecta: 1. It was untested. 2. It oversteps (multiuse is not checked, so increases instruction count). 3. It is incomplete (doesn't work for vectors). llvm-svn: 308102	2017-07-15 15:55:07 +00:00
Chandler Carruth	85c82841ba	[wasm] Update two tests for r308025 which causes scheduling changes due to the newly improved AA information. llvm-svn: 308100	2017-07-15 15:44:36 +00:00
Sanjay Patel	55b9f88ecc	[InstCombine] allow (0 - x) & 1 --> x & 1 for vectors llvm-svn: 308098	2017-07-15 15:29:47 +00:00
Sanjay Patel	27339133a7	[InstCombine] remove dead code/tests; NFCI These patterns and tests were added to InstSimplify with: https://reviews.llvm.org/rL303004 llvm-svn: 308096	2017-07-15 15:01:33 +00:00
Chandler Carruth	d78a38ed2e	Revert r308078 (and subsequent tweak in r308079) which introduces a test that appears to exhibit non-determinism and is flaking on the bots pretty consistently. r308078: [ThinLTO] Ensure we always select the same function copy to import r308079: Require asserts in new test that uses debug flag llvm-svn: 308095	2017-07-15 13:50:26 +00:00
Florian Hahn	ad993521ac	[LoopInterchange] Add some optimization remarks. Reviewers: anemet, karthikthecool, blitz.opensource Reviewed By: anemet Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D35122 llvm-svn: 308094	2017-07-15 13:13:19 +00:00
Chandler Carruth	f59a838720	[PM/LCG] Teach the LazyCallGraph to maintain reference edges from every function to every defined function known to LLVM as a library function. LLVM can introduce calls to these functions either by replacing other library calls or by recognizing patterns (such as memset_pattern or vector math patterns) and replacing those with calls. When these library functions are actually defined in the module, we need to have reference edges to them initially so that we visit them during the CGSCC walk in the right order and can effectively rebuild the call graph afterward. This was discovered when building code with Fortify enabled as that is a common case of both inline definitions of library calls and simplifications of code into calling them. This can in extreme cases of LTO-ing with libc introduce many more reference edges. I discussed a bunch of different options with folks but all of them are unsatisfying. They either make the graph operations substantially more complex even when there are no defined libfuncs, or they introduce some other complexity into the callgraph. So this patch goes with the simplest possible solution of actual synthetic reference edges. If this proves to be a memory problem, I'm happy to implement one of the clever techniques to save memory here. llvm-svn: 308088	2017-07-15 08:08:19 +00:00
Simon Atanasyan	f217c7b7e2	[mips] Handle the `long-calls` feature flags in the MIPS backend If the `long-calls` feature flags is enabled, disable use of the `jal` instruction. Instead of that call a function by by first loading its address into a register, and then using the contents of that register. Differential revision: https://reviews.llvm.org/D35168 llvm-svn: 308087	2017-07-15 07:14:25 +00:00
Matt Arsenault	b34635550a	AMDGPU: Return correct type during argument lowering The type needs to be casted back to the original argument type. Fixes an assert that for some reason is only run when using -debug. Includes an additional combine to avoid test regressions from having conversions mixed with multiple Assert[SZ]ext nodes. On subtargets where i16 is legal, this was producing an i32 register with an i16 AssertZExt, truncated to i16 with another i8 AssertZExt. t2: i32,ch = CopyFromReg t0, Register:i32 %vreg0 t3: i16 = truncate t2 t5: i16 = AssertZext t3, ValueType:ch:i8 t6: i8 = truncate t5 t7: i32 = zero_extend t6 llvm-svn: 308082	2017-07-15 05:52:59 +00:00
Yonghong Song	9276ef05c8	bpf: generate better lowering code for certain select/setcc instructions Currently, for code like below, === inner_map = bpf_map_lookup_elem(outer_map, &port_key); if (!inner_map) { inner_map = &fallback_map; } === the compiler generates (pseudo) code like the below: === I1: r1 = bpf_map_lookup_elem(outer_map, &port_key); I2: r2 = 0 I3: if (r1 == r2) I4: r6 = &fallback_map I5: ... === During kernel verification process, After I1, r1 holds a state map_ptr_or_null. If I3 condition is not taken (path [I1, I2, I3, I5]), supposedly r1 should become map_ptr. Unfortunately, kernel does not recognize this pattern and r1 remains map_ptr_or_null at insn I5. This will cause verificaiton failure later on. Kernel, however, is able to recognize pattern "if (r1 == 0)" properly and give a map_ptr state to r1 in the above case. LLVM here generates suboptimal code which causes kernel verification failure. This patch fixes the issue by changing BPF insn pattern matching and lowering to generate proper codes if the righthand parameter of the above condition is a constant. A test case is also added. Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 308080	2017-07-15 05:41:42 +00:00
Teresa Johnson	16798558ba	Require asserts in new test that uses debug flag This should fix bot failures from r308078. llvm-svn: 308079	2017-07-15 05:27:57 +00:00

... 2 3 4 5 6 ...

46451 Commits