llvm-project

Commit Graph

Author	SHA1	Message	Date
Pavel Labath	605636d872	[Support] Add WritableMemoryBuffer class Summary: The motivation here is LLDB, where we need to fixup relocations in mmapped files before their contents can be read correctly. The MemoryBuffer class does exactly what we need, except that it maps the file in read-only mode. WritableMemoryBuffer reuses the existing machinery for opening and mmapping a file. The only difference is in the argument to the mapped_file_region constructor -- we create a private copy-on-write mapping, so that we can make changes to the mapped data, but the changes aren't carried over to the underlying file. This patch is based on an initial version by Zachary Turner. Reviewers: mehdi_amini, rnk, rafael, dblaikie, zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40291 llvm-svn: 321071	2017-12-19 12:15:50 +00:00
Simon Pilgrim	f6d4ab6daf	[X86][SSE] Use (V)PHMINPOSUW for vXi8 SMAX/SMIN/UMAX/UMIN horizontal reductions (PR32841) Extension to D39729 which performed this for vXi16, with the same bit flipping to handle SMAX/SMIN/UMAX cases, vXi8 UMIN horizontal reductions can be performed. This makes use of the fact that by performing a pair-wise i8 SHUFFLE/UMIN before PHMINPOSUW, we both get the UMIN of each pair but also zero-extend the upper bits ready for v8i16. Differential Revision: https://reviews.llvm.org/D41294 llvm-svn: 321070	2017-12-19 12:02:40 +00:00
Francis Visoiu Mistrih	2130e6a080	Fix: [YAML] Always double quote UTF-8 characters llvm-svn: 321069	2017-12-19 11:59:28 +00:00
Francis Visoiu Mistrih	f34eea5aa1	[YAML] Always double quote UTF-8 characters llvm-svn: 321068	2017-12-19 11:51:05 +00:00
Simon Dardis	1ade566c45	[mips] Handle the emission of microMIPSr6 sll instruction when used as a nop. This instruction is encoded as zero, so we have handle that case when checking for unimplemented opcodes when producing the encoding for an instruction. llvm-svn: 321066	2017-12-19 11:16:22 +00:00
Jonas Devlieghere	efb06387b7	[dwarfdump] Lookup needs to be an unsigned long long parameter. Before this patch, dwarfdump's lookup parameter only accepts unsigned. Given that for many current platforms the load address already exceeds unsigned (e.g. arm64 w/ 0x100000000), dwarfdump needs an unsigned long long parameter. Patch by: Dr. Michael 'Mickey' Lauer <mickey@vanille-media.de> llvm-svn: 321064	2017-12-19 09:45:26 +00:00
Max Kazantsev	fd95ee0c9a	[JumpThreading] Restrict PRE across instructions that don't pass control to successors PRE in JumpThreading should not be able to hoist copy of non-speculable loads across instructions that don't always transfer execution to their successors, otherwise they may introduce an unsafe load which otherwise would not be executed. The same problem for GVN was fixed as rL316975. Differential Revision: https://reviews.llvm.org/D40347 llvm-svn: 321063	2017-12-19 09:10:21 +00:00
Igor Laevsky	ce6f2d0190	[FuzzMutate] Don't crash when mutator is unable to find operation Differential Revision: https://reviews.llvm.org/D41009 llvm-svn: 321062	2017-12-19 08:52:51 +00:00
Bjorn Steinbrink	2da4d9d86d	Treat sret arguments as being dereferenceable in getPointerDereferenceableBytes() Reviewers: rnk, hfinkel, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41355 llvm-svn: 321061	2017-12-19 08:46:46 +00:00
Craig Topper	13142b10d5	[X86] Don't extend v16i8 non-uniform shifts to v16i32 if we have BWI. Use v16i16 instead. BWI supports shifting by word amounts. Even if VLX isn't support we can still widen to v32i16 and extract the lower half. For SKX its preferrable to not use 512-bit vector if we can. llvm-svn: 321059	2017-12-19 06:59:10 +00:00
Craig Topper	6e3091c265	[X86] Use a specific list of MVTs in combineShiftRightArithmetic instead of iterating over every integer VT and checking their size. Previously, we were checking for MVTs with sizes betwen 8 and 64 which only includes i8, i16, i32, and i64 today. But I don't think we should assume that and should list the types that are legal for x86. I also don't think we need i64 since type legalization is guaranteed to split those up. llvm-svn: 321058	2017-12-19 06:29:00 +00:00
Craig Topper	eb13a418e1	[X86] Remove unnecessary check for integer VT from combineShiftRightArithmetic. I doubt there's any way to create a ashr for an FP type. llvm-svn: 321057	2017-12-19 06:28:58 +00:00
Craig Topper	da853a9c2f	[X86] Remove dead code for turning vector shifts by large amounts into a zero vector. Pretty sure these are handled by a target independent DAG combine that turns them into undef these days. llvm-svn: 321056	2017-12-19 05:21:50 +00:00
Craig Topper	ad3a554889	[X86] Use ZERO_EXTEND instead of ANY_EXTEND when extending the shift amount for a non-uniform shift. My reading of the SDM says that all bits of the shift amount are used. If the value of the element is larger than the number of bits the result the shift result is zero. So I think we need to zero_extend here to avoid garbage in the upper bits. In reality we lower any_extend as zero_extend so in most cases it would be hard to hit this. llvm-svn: 321055	2017-12-19 04:52:04 +00:00
Serguei Katkov	768d6dd087	Fix APFloat from string conversion for Inf The method IEEEFloat::convertFromStringSpecials() does not recognize the "+Inf" and "-Inf" strings but these strings are printed for the double Infinities by the IEEEFloat::toString(). This patch adds the "+Inf" and "-Inf" strings to the list of recognized patterns in IEEEFloat::convertFromStringSpecials(). Re-landing after fix. Reviewers: sberg, bogner, majnemer, timshen, rnk, skatkov, gottesmm, bkramer, scanon, anna Reviewed By: anna Subscribers: mkazantsev, FlameTop, llvm-commits, reames, apilipenko Differential Revision: https://reviews.llvm.org/D38030 llvm-svn: 321054	2017-12-19 04:27:39 +00:00
Quentin Colombet	63a328c30c	[TableGen][GlobalISel] Reset the internal map of RuleMatchers just before the emission Between the creation of the last InstructionMatcher and the first emission of the related Rule, we need to clear the internal map of IDs. We used to do that right after the creation of the main InstructionMatcher when building the rule and although that worked, this is fragile because if for some reason some later code decides to create more InstructionMatcher before the final call to emit, then the IDs would be completely messed up. Move that to the beginning of "emit" so that the IDs are guarantee to be consistent. NFC. llvm-svn: 321053	2017-12-19 02:57:23 +00:00
Reid Kleckner	73177e71bf	Fix Wasm as a follow up to r321035 and the other one This array is tightly coupled with the .def file. Someone should look into fixing that. llvm-svn: 321050	2017-12-19 01:08:53 +00:00
Justin Bogner	4314f3adc2	update_mir_test_checks: Accept IR as input as well as MIR We need to handle IR for tests that want to do lowering (or just -stop-after with IR as input). I've run this on one AArch64 test to demonstrate what it looks like. llvm-svn: 321048	2017-12-19 00:49:04 +00:00
Jake Ehrlich	e8437de727	[llvm-objcopy] Add option to add a progbits section from a file This change adds support for adding progbits sections with contents from a file Differential Revision: https://reviews.llvm.org/D41212 llvm-svn: 321047	2017-12-19 00:47:30 +00:00
Matthias Braun	e29c0b8862	TargetLoweringBase: Followup to r321035 I missed some prefixes and the fact that on AArch64 we use "bzero" instead of "__bzero" as on X86 when doing my refactoring in r321035. Improve tests for bzero. llvm-svn: 321046	2017-12-19 00:43:00 +00:00
Matthias Braun	92de8b2405	TargetLowering: Fix InitLibcallCallingConvs() overriding things set in InitLibcalls() I missed the fact that the later called InitLibcallCallingConvs() overrides some things set in InitLibcalls() when I did the refactoring in r321036. Fix by merging InitLibcallCallingConvs() into InitLibcalls() and doing the initialization earlier. llvm-svn: 321045	2017-12-19 00:20:33 +00:00
Matthias Braun	a942d62983	TargetLowering: Fix off-by-one error This problem was present for a while, but somehow asan didn't catch it before the refactoring in r321036. llvm-svn: 321043	2017-12-19 00:05:10 +00:00
Sam Clegg	b23a20179a	[llvm-readobj] Dump wasm init functions llvm-svn: 321042	2017-12-19 00:04:41 +00:00
Matthias Braun	0282091c9f	TargetLoweringBase: Remove unnecessary watchos exception; NFC WatchOS isn't report as iOS (as opposed to tvos) so the exception I added in my last commit wasn't necessary after all. llvm-svn: 321041	2017-12-18 23:33:28 +00:00
Justin Bogner	930a95c269	update_mir_test_checks: Add "mir" to some states and regex names For tests that do lowering we need to support IR as input, so here we clarify some names to avoid ambiguity in upcoming commits. llvm-svn: 321039	2017-12-18 23:31:55 +00:00
Craig Topper	f19121d647	[X86] Don't use NOPL when the assembler is passed an empty CPU string. This recommits the change from r321026. I have a fix for the lld test now. llvm-svn: 321038	2017-12-18 23:31:43 +00:00
Matthias Braun	ef95969e5b	LiveStacks: Rename LiveStack.{h\|cpp} to LiveStacks.{h\|cpp}; NFC Filenames should match the name of the class they contain. llvm-svn: 321037	2017-12-18 23:19:44 +00:00
Matthias Braun	a4852d2c19	X86/AArch64/ARM: Factor out common sincos_stret logic; NFCI Note: - X86ISelLowering: setLibcallName(SINCOS) was superfluous as InitLibcalls() already does it. - ARMISelLowering: Setting libcallnames for sincos/sincosf seemed superfluous as in the darwin case it wouldn't be used while for all other cases InitLibcalls already does it. llvm-svn: 321036	2017-12-18 23:19:42 +00:00
Matthias Braun	a92cecfbda	AArch64/X86: Factor out common bzero logic; NFC llvm-svn: 321035	2017-12-18 23:14:28 +00:00
Krzysztof Parzyszek	e704583f23	[Hexagon] Cache loads to select to avoid traversing mutating DAG llvm-svn: 321034	2017-12-18 23:13:27 +00:00
Craig Topper	46832126e1	Revert part of r321026 "[X86] Don't use NOPL when the assembler is passed an empty CPU string." while I investigate how to fix an lld test failure. Looks like lld also needs to pass a -mcpu in some of its tests llvm-svn: 321033	2017-12-18 22:20:10 +00:00
Evandro Menezes	687df6380e	[AArch64] Expand test coverage of vector element shuffling to Exynos Make sure that all test cases are run for Exynos as well. Otherwise, NFC. llvm-svn: 321032	2017-12-18 22:17:39 +00:00
Quentin Colombet	eba10cbc88	[TableGen][GlobalISel] Make the arguments of the Instruction and Operand Matchers consistent Move InsnVarID and OpIdx at the beginning of the list of arguments for all the constructors of the OperandMatcher subclasses. This matches what we do for the InstructionMatcher. NFC. llvm-svn: 321031	2017-12-18 22:12:13 +00:00
Bob Haarman	ea5ff9fa6b	Fix buffer overrun in WindowsResourceCOFFWriter::writeSymbolTable() Summary: We were using sprintf(..., "$R06X", <some uint32_t>) to create strings that are expected to be exactly length 8, but this results in longer strings if the uint32_t is greater than 0xffffff. This change modifies the behavior as follows: - Uses the loop counter instead of the data offset. This gives us sequential symbol names, avoiding collisions as much as possible. - Masks the value to 0xffffff to avoid generating names longer than 8 bytes. - Uses formatv instead of sprintf. Fixes PR35581. Reviewers: ruiu, zturner Reviewed By: ruiu Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D41270 llvm-svn: 321030	2017-12-18 22:10:14 +00:00
Reid Kleckner	8f3c351aa3	Add test for .req directive starting with 'p' Reduced test case from libjpeg_turbo. llvm-svn: 321029	2017-12-18 22:01:18 +00:00
Jessica Paquette	8565d3af84	[MachineOutliner][NFC] Gardening: use std::any_of instead of bool + loop River Riddle suggested to use std::any_of instead of the bool + loop thing on r320229. This commit does that. llvm-svn: 321028	2017-12-18 21:44:52 +00:00
Craig Topper	4802d4e23e	[X86] Don't use NOPL when the assembler is passed an empty CPU string. Update tests to force a CPU with NOPL Empty string should be equivalent to "generic" which doesn't allow NOPL. Force tests to use specificy 'pentiumpro' to guarantee NOPL. Fixes PR35686 llvm-svn: 321026	2017-12-18 21:37:27 +00:00
Quentin Colombet	34688b9e38	[TableGen][GlobalISel] Refactor optimizeRules related bit to allow code reuse In theory, reapplying optimizeRules on each group matchers should give us a second nesting level on the matching table. In practice, we need more work to make that happen because all the predicates are actually not directly available through the predicate matchers list. NFC. llvm-svn: 321025	2017-12-18 21:25:53 +00:00
Reid Kleckner	37517a2ddd	Revert "[AArch64][SVE] Asm" changes, they broke libjpeg_turbo This reverts changes r320992, r320986, r320973, and r320970. r320970 by itself breaks the test case, and the rest depend on it. Test case will land soon. llvm-svn: 321024	2017-12-18 20:58:25 +00:00
Ivan A. Kosarev	a80c79b5bf	[Analysis] Generate more precise TBAA tags when one access encloses the other There are cases when two tags with different base types denote accesses to the same direct or indirect member of a structure type. Currently, merging of such tags results in a tag that represents an access to an object that has the type of that member. This patch changes this so that if one of the accesses encloses the other, then the generic tag is the one of the enclosed access. Differential Revision: https://reviews.llvm.org/D39557 llvm-svn: 321019	2017-12-18 20:05:20 +00:00
Teresa Johnson	915897e21b	[PGO] Fix handling of cold entry count for instrumented PGO Summary: In r277849, getEntryCount was changed to return None when the entry count was 0, specifically for SamplePGO where it means no samples were recorded. However, for instrumentation PGO a 0 entry count should be returned directly, since it does mean that the function was completely cold. Otherwise we end up treating these functions conservatively in isFunctionEntryCold() and isColdBB(). Instead, for SamplePGO use -1 when there are no samples, and change getEntryCount to return None when the value is -1. Reviewers: danielcdh, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41307 llvm-svn: 321018	2017-12-18 20:02:43 +00:00
Quentin Colombet	ec76d9c47f	[TableGen][GlobalISel] Optimize MatchTable for faster instruction selection * Context * Prior to this patchw, the table generated for matching instruction was straight forward but highly inefficient. Basically, each pattern generates its own set of self contained checks and actions. E.g., TableGen generated: // First pattern CheckNumOperand 3 CheckOpcode G_ADD ... Build ADDrr // Second pattern CheckNumOperand 3 CheckOpcode G_ADD ... Build ADDri // Third pattern CheckNumOperand 3 CheckOpcode G_SUB ... Build SUBrr * Problem * Because of that generation, a lot of check were redundant between each pattern and were checked every single time until we reach the pattern that matches. E.g., Taking the previous table, let say we are matching a G_SUB, that means we were going to check all the rules for G_ADD before looking at the G_SUB rule. In particular we are going to do: check 3 operands; PASS check G_ADD; FAIL ; Next rule check 3 operands; PASS (but we already knew that!) check G_ADD; FAIL (well it is still not true) ; Next rule check 3 operands; PASS (really!!) check G_SUB; PASS (at last :P) * Proposed Solution * This patch introduces a concept of group of rules (GroupMatcher) that share some predicates and only get checked once for the whole group. This patch only creates groups with one nesting level. Conceptually there is nothing preventing us for having deeper nest level. However, the current implementation is not smart enough to share the recording (aka capturing) of values. That limits its ability to do more sharing. For the given example the current patch will generate: // First group CheckOpcode G_ADD // First pattern CheckNumOperand 3 ... Build ADDrr // Second pattern CheckNumOperand 3 ... Build ADDri // Second group CheckOpcode G_SUB // Third pattern CheckNumOperand 3 ... Build SUBrr But if we allowed several nesting level, it could create a sub group for the checknumoperand 3. (We would need to call optimizeRules on the rules within a group.) * Result * With only one level of nesting, the instruction selection pass is up to 4x faster. For instance, one instruction now takes 500 checks, instead of 24k! With more nesting we could get in the tens I believe. Differential Revision: https://reviews.llvm.org/D39034 rdar://problem/34670699 llvm-svn: 321017	2017-12-18 19:47:41 +00:00
Dimitry Andric	e4f5d01033	Fix more inconsistent line endings. NFC. llvm-svn: 321016	2017-12-18 19:46:56 +00:00
Craig Topper	48176a5fb6	[X86] Minor formatting fix to getHostCPUFeatures. NFC llvm-svn: 321015	2017-12-18 19:40:11 +00:00
Jessica Paquette	02c124d644	[MachineOutliner] Recommit r320229 LR was undefined entering outlined functions that contain calls. This made the machine verifier unhappy when expensive checks were enabled. This fixes that. llvm-svn: 321014	2017-12-18 19:33:21 +00:00
Benjamin Kramer	efc7c88ea8	[PPC] Also disable the pre-emit version of reg+reg to reg+imm transformation. This has the same issue as the early pass disabled in r321010. llvm-svn: 321013	2017-12-18 19:21:56 +00:00
Don Hinton	0fa52c7db1	[cmake] Update experimental target error message Summary: Update this error message indicate this test only ensures experimental targets were passed via LLVM_EXPERIMENTAL_TARGETS_TO_BUILD. Originally, this test validated all targets, but in r184923, it was moved after the LLVMBUILDTOOL test, which also validates all targets, making that part of the test redundant. Differential Revision: https://reviews.llvm.org/D41273 llvm-svn: 321012	2017-12-18 19:15:15 +00:00
Paul Robinson	a06f8dcca6	Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header." Adds missing support for DW_FORM_data16. Update of r320852/r320886, fixing the unittest again, this time use a raw char string for the test data. Differential Revision: https://reviews.llvm.org/D41090 llvm-svn: 321011	2017-12-18 19:08:35 +00:00
Benjamin Kramer	f4cc67acb6	[PPC] Disable reg+reg to reg+imm transformation. It creates invalid instructions. PR35688. llvm-svn: 321010	2017-12-18 18:56:57 +00:00
Dimitry Andric	e44dea9f6b	Fix inconsistent line endings in HexagonVectorLoopCarriedReuse.cpp. NFC. llvm-svn: 321009	2017-12-18 18:56:00 +00:00
Krzysztof Parzyszek	eba8c0c61b	[Hexagon] Higher versions of HVX imply presence of lower versions The code in Hexagon_MC::completeHVXFeatures wasn't setting all HVX- related features correctly. llvm-svn: 321008	2017-12-18 18:51:57 +00:00
Ivan A. Kosarev	422a380a3e	[IR] Support the new TBAA metadata format in IR verifier Differential Revision: https://reviews.llvm.org/D40438 llvm-svn: 321007	2017-12-18 18:46:44 +00:00
Dimitry Andric	ca5b0f3f12	Fix inconsistent line endings in ARCDisassembler.cpp. NFC. llvm-svn: 321006	2017-12-18 18:45:37 +00:00
Krzysztof Parzyszek	7259263790	i[Hexagon] ANY_EXTEND_VECTOR_INREG should be Custom, not Legal in r321004 llvm-svn: 321005	2017-12-18 18:41:52 +00:00
Krzysztof Parzyszek	6b589e593d	[Hexagon] Generate HVX code for vector sign-, zero- and any-extends Implement any-extend as zero-extend. llvm-svn: 321004	2017-12-18 18:32:27 +00:00
Simon Pilgrim	f947137ed0	[X86] Regenerate test to improve codegen testing for D41350 llvm-svn: 321003	2017-12-18 18:31:02 +00:00
Krzysztof Parzyszek	5439a70d97	[Hexagon] Prefer to widen HVX vectors instead of promoting llvm-svn: 321002	2017-12-18 18:21:01 +00:00
Matt Arsenault	d89d0b6494	Removed unused DominanceFrontier llvm-svn: 321001	2017-12-18 18:01:13 +00:00
Teresa Johnson	9ecaaff251	[ThinLTO] Make distributed indexes test more robust Modify test so that it passes in the reverse-iteration bot. We use DenseMap instead of std::map for the summaries to emit into distributed index files. The iteration order is not defined, but it is deterministic, which is good enough. llvm-svn: 321000	2017-12-18 18:00:32 +00:00
Xinliang David Li	19fb5b467b	[PGO] add MST min edge selection heuristic to ensure non-zero entry count Differential Revision: http://reviews.llvm.org/D41059 llvm-svn: 320998	2017-12-18 17:56:19 +00:00
Francis Visoiu Mistrih	b213b27ee3	[YAML] Add support for non-printable characters LLVM IR function names which disable mangling start with '\01' (https://www.llvm.org/docs/LangRef.html#identifiers). When an identifier like "\01@abc@" gets dumped to MIR, it is quoted, but only with single quotes. http://www.yaml.org/spec/1.2/spec.html#id2770814: "The allowed character range explicitly excludes the C0 control block allowed), the surrogate block #xD800-#xDFFF, #xFFFE, and #xFFFF." http://www.yaml.org/spec/1.2/spec.html#id2776092: "All non-printable characters must be escaped. [...] Note that escape sequences are only interpreted in double-quoted scalars." This patch adds support for printing escaped non-printable characters between double quotes if needed. Should also fix PR31743. Differential Revision: https://reviews.llvm.org/D41290 llvm-svn: 320996	2017-12-18 17:38:03 +00:00
Ivan A. Kosarev	04e1d01736	[IR] Add MDBuilder helpers for the new TBAA metadata format The new helpers are supposed to be used in clang to generate TBAA information in the new format proposed in this thread: http://lists.llvm.org/pipermail/llvm-dev/2017-November/118748.html Differential Revision: https://reviews.llvm.org/D39956 llvm-svn: 320993	2017-12-18 16:49:39 +00:00
Sander de Smalen	09f56a54d0	[AArch64][SVE] Asm: Improve diagnostics further when +sve is not specified Summary: Patch [4/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. This patch further improves diagnostic messages for when the SVE feature is not specified. Reviewers: rengolin, fhahn, olista01, echristo, efriedma Reviewed By: fhahn Subscribers: sdardis, aemerson, javed.absar, tschuett, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D40363 llvm-svn: 320992	2017-12-18 16:48:53 +00:00
Simon Dardis	fd8c65e868	Reland "[mips] Fix the target specific instruction verifier" Fix an off by one error in the bounds checking for 'dinsu' and update the ranges in the test comments so that they are accurate. This version has the correct commit message. Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D41183 llvm-svn: 320991	2017-12-18 15:56:40 +00:00
Sean Fertile	5fb624a3b8	[Memcpy Loop Lowering] Remove the fixed int8 lowering. Switch over to the lowering that uses target supplied operand types. Differential Revision: https://reviews.llvm.org/D41201 llvm-svn: 320989	2017-12-18 15:31:14 +00:00
Sander de Smalen	190979189a	[TableGen][AsmMatcherEmitter] Only choose specific diagnostic for enabled instruction Summary: When emitting a diagnostic for an invalid operand, a specific diagnostic should only be reported when the instruction being matched is actually enabled by the feature flags. Patch [3/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. This patch fixes bogus diagnostic messages for when the SVE feature is not specified. Reviewers: rengolin, craig.topper, olista01, sdardis, stoklund Reviewed By: olista01, sdardis Subscribers: fhahn, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D40362 llvm-svn: 320986	2017-12-18 14:34:24 +00:00
Max Kazantsev	1acab00229	[LVI] Support for ashr in LVI Enhance LVI to analyze the ‘ashr’ binary operation. This leverages the infrastructure in ConstantRange for the ashr operation. Patch by Surya Kumari Jangala! Differential Revision: https://reviews.llvm.org/D40886 llvm-svn: 320983	2017-12-18 14:23:30 +00:00
Diana Picus	8ee540c01a	[ARM GlobalISel] Fix G_(UN)MERGE_VALUES handling after r319524 r319524 has made more G_MERGE_VALUES/G_UNMERGE_VALUES pairs legal than are supported by the rest of the pipeline. Restrict that to only the cases that we can currently handle: packing 32-bit values into 64-bit ones, when we have hardware FP. llvm-svn: 320980	2017-12-18 13:22:28 +00:00
Benjamin Kramer	bc8fdaaf60	Constexprify LaneBitmask factory methods. This avoids global constructors when they're used in a global constant. llvm-svn: 320979	2017-12-18 13:20:26 +00:00
Max Kazantsev	d792171efb	[ConstantRange] Support for ashr in ConstantRange computation Extend the ConstantRange implementation to compute the range of possible values resulting from an arithmetic right shift operation. There will be a follow up patch to leverage this constant range infrastructure in LazyValueInfo. Patch by Surya Kumari Jangala! Differential Revision: https://reviews.llvm.org/D40881 llvm-svn: 320976	2017-12-18 13:01:32 +00:00
Simon Dardis	f70af977af	Revert "[mips] Fix the target specific instruction verifier" This reverts commit r320974. The commit message lacked the Differential Revison: line. llvm-svn: 320975	2017-12-18 12:30:34 +00:00
Simon Dardis	c3c0d4590b	[mips] Fix the target specific instruction verifier Fix an off by one error in the bounds checking for 'dinsu' and update the ranges in the test comments so that they are accurate. Reviewers: atanasyan https://reviews.llvm.org/D41183 llvm-svn: 320974	2017-12-18 12:24:17 +00:00
Sander de Smalen	fce0c1c45b	[AArch64][SVE] Asm: Add ZIP1/ZIP2 instructions (predicate/data vectors) Summary: Patch [2/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. Reviewers: rengolin, kristof.beyls, fhahn, mcrosier, evandro Reviewed By: fhahn Subscribers: aemerson, javed.absar, llvm-commits, tschuett Differential Revision: https://reviews.llvm.org/D40361 llvm-svn: 320973	2017-12-18 11:29:59 +00:00
Sander de Smalen	ce1e0975f4	[AArch64][SVE] Asm: Add SVE predicate register definitions and parsing support Summary: Patch [1/4] in a series to add parsing of predicates and properly parse SVE ZIP1/ZIP2 instructions. Reviewers: rengolin, kristof.beyls, fhahn, mcrosier, evandro, echristo, efriedma Reviewed By: fhahn Subscribers: aemerson, javed.absar, llvm-commits, tschuett Differential Revision: https://reviews.llvm.org/D40360 llvm-svn: 320970	2017-12-18 11:26:34 +00:00
Eugene Leviant	c95b49603e	[ThinLTO] Remove unused code This is a re-commit of r320464, after patch for gold plugin was landed. llvm-svn: 320968	2017-12-18 10:53:45 +00:00
Tim Northover	9097a07e4e	AArch64: work around how Cyclone handles "movi.2d vD, #0". For Cylone, the instruction "movi.2d vD, #0" is executed incorrectly in some rare circumstances. Work around the issue conservatively by avoiding the instruction entirely. This patch changes CodeGen so that problematic instructions are never generated, and the AsmParser so that an equivalent instruction is used (with a warning). llvm-svn: 320965	2017-12-18 10:36:00 +00:00
Igor Laevsky	7bd3fb15e1	[TargetLibraryInfo] Discard library functions with incorrectly sized integers Differential Revision: https://reviews.llvm.org/D41184 llvm-svn: 320964	2017-12-18 10:31:58 +00:00
Sam Parker	fd967f2f7a	[ARM] Adjust test checks Correct the CHECK-LABELS of a couple of dag combine tests. llvm-svn: 320963	2017-12-18 10:08:03 +00:00
Sam Parker	00804efd72	[DAGCombine] Move AND nodes to multiple load leaves Search from AND nodes to find whether they can be propagated back to loads, so that the AND and load can be combined into a narrow load. We search through OR, XOR and other AND nodes and all bar one of the leaves are required to be loads or constants. The exception node then needs to be masked off meaning that the 'and' isn't removed, but the loads(s) are narrowed still. Differential Revision: https://reviews.llvm.org/D41177 llvm-svn: 320962	2017-12-18 10:04:27 +00:00
Clement Courbet	6f42de3062	[NFC][CodeGen][ExpandMemCmp] Fix documentation. llvm-svn: 320960	2017-12-18 07:32:48 +00:00
Craig Topper	7034d401f8	[X86] Use mattr instead of mcpu in some of the cost model tests. Based on the names of the check lines, features seems more appropriate that cpu. Spotted while prototyping my patch to make 512-bit vectors illegal on SKX sometimes. llvm-svn: 320959	2017-12-18 07:21:58 +00:00
Hiroshi Inoue	c6faf15459	[SROA] Disable non-whole-alloca splits by default This patch introduce a switch to control splitting of non-whole-alloca slices with default off. The switch will be default on again after fixing an issue reported in PR35657. llvm-svn: 320958	2017-12-18 06:47:37 +00:00
Craig Topper	8e2837cc6e	[X86] Fix mistake that I made when splitting up the setOperationAction calls recently. The block I moved things that need BWI and 512-bit or VLX is incorrectly qualified with just hasBWI \|\| hasVLX. Here I've qualified it with hasBWI && (hasAVX512 \|\| hasVLX) where the hasAVX512 will be replaced with allowing 512-bit vectors in an upcoming patch. llvm-svn: 320957	2017-12-18 04:50:05 +00:00
Serguei Katkov	b0b67a8d38	[CGP] Fix the handling select inst in complex addressing mode When we put the value in select placeholder we must pass the value through simplification tracker due to the value might be already simplified and erased. This is a fix for PR35658. Reviewers: john.brawn, uabelho Reviewed By: john.brawn Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41251 llvm-svn: 320956	2017-12-18 04:25:07 +00:00
Sanjay Patel	9da049fa8a	[x86] add tests for finite libcall lowering (PR35672); NFC llvm-svn: 320955	2017-12-18 00:38:45 +00:00
Bjorn Steinbrink	3603de2fa2	Re-commit "Properly handle multi-element and dynamically sized allocas in getPointerDereferenceableBytes()"" llvm-clang-x86_64-expensive-checks-win is still broken, so the failure seems unrelated. llvm-svn: 320953	2017-12-17 21:20:16 +00:00
Craig Topper	255a76d6d1	[X86] Add test cases that show cases where buildvector of extract and inserts should be turned into fmsubadd. This is a follow up to the fmaddsub support added in r320950. Hopefully in the future we can fix lowering to handle this fmsubadd too. llvm-svn: 320951	2017-12-17 18:31:36 +00:00
Craig Topper	fd8d040820	[X86] Make the code that creates fmaddsub from build_vector of extracts and inserts functional and add tests. Summary: We had no tests for this and we couldn't do the optimization because of a bad use count check. We need to know how many non-undef pieces of the build vector were filled in and ensure our use count is equal to that. But on the shuffle combine version we need the use count to be 2. The missing coverage was noticed during the review of D40335. Reviewers: RKSimon, zvi, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41133 llvm-svn: 320950	2017-12-17 18:23:45 +00:00
Simon Pilgrim	406d04a916	[X86] Regenerate truncated rotation tests + add missing 32-bit checks llvm-svn: 320949	2017-12-17 18:20:42 +00:00
Sam Clegg	b07a016ed1	use uint32_t llvm-svn: 320947	2017-12-17 17:50:07 +00:00
Sam Clegg	c551522d25	[WebAssembly] Export some more info on wasm funtions Summary: These fields are useful for lld's gc-sections support Also remove an unused field. Subscribers: jfb, dschuff, jgravelle-google, aheejin, sunfish Differential Revision: https://reviews.llvm.org/D41320 llvm-svn: 320946	2017-12-17 17:50:07 +00:00
Bjorn Steinbrink	6f7bbf349f	Revert "Properly handle multi-element and dynamically sized allocas in getPointerDereferenceableBytes()" This reverts commit 217067d5179882de9deb60d2e866befea4c126e7. Fails on llvm-clang-x86_64-expensive-checks-win llvm-svn: 320945	2017-12-17 15:16:58 +00:00
Bjorn Steinbrink	e880f262e5	Revert "Treat sret arguments as being dereferenceable in getPointerDereferenceableBytes()" This reverts commit 8b7a7660a3904b2088bc594311bcea2c651def08. I didn't mean to commit this. llvm-svn: 320944	2017-12-17 15:16:51 +00:00
Bjorn Steinbrink	7afcb71a42	Treat sret arguments as being dereferenceable in getPointerDereferenceableBytes() llvm-svn: 320943	2017-12-17 15:11:52 +00:00
Simon Pilgrim	b1b30286bf	Remove superfluous break after a return. NFCI. llvm-svn: 320941	2017-12-17 11:01:33 +00:00
Craig Topper	5992535e1a	[X86DomainReassignment] Store legal domains in a std::bitset instead of using a SmallVector that really only ever has one element as a set. llvm-svn: 320940	2017-12-17 03:16:23 +00:00
Bjorn Steinbrink	c27f81b92b	Properly handle byval arguments in getPointerDereferenceableBytes() Summary: For byval arguments, the number of dereferenceable bytes is equal to the size of the pointee, not the pointer. Reviewers: hfinkel, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41305 llvm-svn: 320939	2017-12-17 02:37:42 +00:00
Bjorn Steinbrink	5d86532467	Properly handle multi-element and dynamically sized allocas in getPointerDereferenceableBytes() Reviewers: hfinkel, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41288 llvm-svn: 320938	2017-12-17 01:54:25 +00:00
Craig Topper	ee1e71e576	[X86] Use extract_vector_elt instead of X86ISD::VEXTRACT for isel of vXi1 extractions. llvm-svn: 320937	2017-12-17 01:35:48 +00:00
Craig Topper	c0c2d19e08	[X86] Canonicalize extract_vector_elt from vXi1 to always return MVT::i32. This allows us to remove some isel patterns that allowed MVT::i8 result type. llvm-svn: 320936	2017-12-17 01:35:47 +00:00
Craig Topper	c609dc8f55	[X86] Don't create X86ISD::VEXTRACT nodes directly. Use EXTRACT_VECTOR_ELT and allow that to be legaized to VEXTRACT. I think we can remove the VEXTRACT node completely and use a canonicalized EXTRACT_VECTOR_ELT instead. This is a first step. llvm-svn: 320935	2017-12-17 01:35:44 +00:00
Simon Pilgrim	5c0c93ed4c	Fix unused variable warning. llvm-svn: 320934	2017-12-16 23:37:51 +00:00
Simon Pilgrim	4c9e8215e9	[X86][AVX] lowerVectorShuffleAsBroadcast - aggressively peek through BITCASTs Assuming we can safely adjust the broadcast index for the new type to keep it suitably aligned, then peek through BITCASTs when looking for the broadcast source. Fixes PR32007 llvm-svn: 320933	2017-12-16 23:32:18 +00:00
Simon Pilgrim	88c10bc969	[X86][AVX] Use extract128BitVector helper. NFCI. llvm-svn: 320932	2017-12-16 23:09:57 +00:00
Simon Pilgrim	f3b6da00f5	[X86][AVX] Fix failed broadcast fold Strip excess BITCASTs from EXTRACT_SUBVECTOR input llvm-svn: 320930	2017-12-16 22:57:17 +00:00
Sean Fertile	68d7f9da76	[Memcpy Loop Lowering] Only calculate residual size/bytes copied when needed. If the loop operand type is int8 then there will be no residual loop for the unknown size expansion. Dont create the residual-size and bytes-copied values when they are not needed. llvm-svn: 320929	2017-12-16 22:41:39 +00:00
Craig Topper	849b717c86	[X86] Don't pass a zero input to the passthru operand of getVectorMaskingNode/getScalarMaskingNode when its going to emit an ISD::OR/ISD::AND. NFCI In those cases, the pass thru operand of the methods isn't used. The calls to the scalar version were passing a MVT::i1 zero, which is an illegal type at the stage this code runs. llvm-svn: 320928	2017-12-16 21:12:24 +00:00
Craig Topper	93253e189c	[X86] Have getVectorMaskingNode return an ISD::AND for X86ISD::VPSHUFBITQMB instead of creating a select with one input being 0. llvm-svn: 320927	2017-12-16 21:12:23 +00:00
Craig Topper	1260a4e826	[X86] When using vpopcntdq for ctpop of v8i16 vectors, only promote to v8i32. Previously we promoted to v8i64, but we don't need to go all the way to 512-bits. If we have VLX we can use the 256-bit instruction. And even if we don't have VLX we can widen v8i32 to v16i32 and drop the upper half. llvm-svn: 320926	2017-12-16 19:31:36 +00:00
Craig Topper	a42a2ba221	[X86] Combine some more scheduler model entries using regular expressions. We had a lot of separate 32 and 64 instructions that had the same scheduling data. This merges them into the same regular expression. This is pretty consistent with a lot of other instructions. llvm-svn: 320924	2017-12-16 18:35:31 +00:00
Craig Topper	17a311831c	[X86] Use instrs instead of instregex for gather/scatter instructions in the scheduler models. Combine into single InstrRW entries. The reduces the number of scheduler groups in subtarget info. llvm-svn: 320923	2017-12-16 18:35:29 +00:00
Simon Pilgrim	5f022d278b	[InstCombine] Regenerate FMUL/FMA combine tests with update_test_checks.py llvm-svn: 320922	2017-12-16 17:18:15 +00:00
Sanjay Patel	5a0cdac174	[InstCombine] canonicalize shifty abs(): ashr+add+xor --> cmp+neg+sel We want to do this for 2 reasons: 1. Value tracking does not recognize the ashr variant, so it would fail to match for cases like D39766. 2. DAGCombiner does better at producing optimal codegen when we have the cmp+sel pattern. More detail about what happens in the backend: 1. DAGCombiner has a generic transform for all targets to convert the scalar cmp+sel variant of abs into the shift variant. That is the opposite of this IR canonicalization. 2. DAGCombiner has a generic transform for all targets to convert the vector cmp+sel variant of abs into either an ABS node or the shift variant. That is again the opposite of this IR canonicalization. 3. DAGCombiner has a generic transform for all targets to convert the exact shift variants produced by #1 or #2 into an ISD::ABS node. Note: It would be an efficiency improvement if we had #1 go directly to an ABS node when that's legal/custom. 4. The pattern matching above is incomplete, so it is possible to escape the intended/optimal codegen in a variety of ways. a. For #2, the vector path is missing the case for setlt with a '1' constant. b. For #3, we are missing a match for commuted versions of the shift variants. 5. Therefore, this IR canonicalization can only help get us to the optimal codegen. The version of cmp+sel produced by this patch will be recognized in the DAG and converted to an ABS node when possible or the shift sequence when not. 6. In the following examples with this patch applied, we may get conditional moves rather than the shift produced by the generic DAGCombiner transforms. The conditional move is created using a target-specific decision for any given target. Whether it is optimal or not for a particular subtarget may be up for debate. define i32 @abs_shifty(i32 %x) { %signbit = ashr i32 %x, 31 %add = add i32 %signbit, %x %abs = xor i32 %signbit, %add ret i32 %abs } define i32 @abs_cmpsubsel(i32 %x) { %cmp = icmp slt i32 %x, zeroinitializer %sub = sub i32 zeroinitializer, %x %abs = select i1 %cmp, i32 %sub, i32 %x ret i32 %abs } define <4 x i32> @abs_shifty_vec(<4 x i32> %x) { %signbit = ashr <4 x i32> %x, <i32 31, i32 31, i32 31, i32 31> %add = add <4 x i32> %signbit, %x %abs = xor <4 x i32> %signbit, %add ret <4 x i32> %abs } define <4 x i32> @abs_cmpsubsel_vec(<4 x i32> %x) { %cmp = icmp slt <4 x i32> %x, zeroinitializer %sub = sub <4 x i32> zeroinitializer, %x %abs = select <4 x i1> %cmp, <4 x i32> %sub, <4 x i32> %x ret <4 x i32> %abs } > $ ./opt -instcombine shiftyabs.ll -S \| ./llc -o - -mtriple=x86_64 -mattr=avx > abs_shifty: > movl %edi, %eax > negl %eax > cmovll %edi, %eax > retq > > abs_cmpsubsel: > movl %edi, %eax > negl %eax > cmovll %edi, %eax > retq > > abs_shifty_vec: > vpabsd %xmm0, %xmm0 > retq > > abs_cmpsubsel_vec: > vpabsd %xmm0, %xmm0 > retq > > $ ./opt -instcombine shiftyabs.ll -S \| ./llc -o - -mtriple=aarch64 > abs_shifty: > cmp w0, #0 // =0 > cneg w0, w0, mi > ret > > abs_cmpsubsel: > cmp w0, #0 // =0 > cneg w0, w0, mi > ret > > abs_shifty_vec: > abs v0.4s, v0.4s > ret > > abs_cmpsubsel_vec: > abs v0.4s, v0.4s > ret > > $ ./opt -instcombine shiftyabs.ll -S \| ./llc -o - -mtriple=powerpc64le > abs_shifty: > srawi 4, 3, 31 > add 3, 3, 4 > xor 3, 3, 4 > blr > > abs_cmpsubsel: > srawi 4, 3, 31 > add 3, 3, 4 > xor 3, 3, 4 > blr > > abs_shifty_vec: > vspltisw 3, -16 > vspltisw 4, 15 > vsubuwm 3, 4, 3 > vsraw 3, 2, 3 > vadduwm 2, 2, 3 > xxlxor 34, 34, 35 > blr > > abs_cmpsubsel_vec: > vspltisw 3, -16 > vspltisw 4, 15 > vsubuwm 3, 4, 3 > vsraw 3, 2, 3 > vadduwm 2, 2, 3 > xxlxor 34, 34, 35 > blr > Differential Revision: https://reviews.llvm.org/D40984 llvm-svn: 320921	2017-12-16 16:41:17 +00:00
Craig Topper	d2a2a39c93	[X86] Remove GCCBuiltin from kand/kandn/kor/kxor/kxnor/knot intrinsics so clang can implement with native IR. llvm-svn: 320918	2017-12-16 08:25:30 +00:00
Craig Topper	1c7d07c601	[X86] Remove unneeded code for handling the old kunpck intrinsics. llvm-svn: 320917	2017-12-16 06:58:30 +00:00
Hal Finkel	92ea8acbcd	Move Transforms/LoopVectorize/consecutive-ptr-cg-bug.ll into the X86 subdirectory This test depends on X86's TTI; move into the X86 subdirectory. llvm-svn: 320914	2017-12-16 05:10:20 +00:00
Hal Finkel	5444f40965	[LV] Extend InstWidening with CM_Widen_Recursive Changes to the original scalar loop during LV code gen cause the return value of Legal->isConsecutivePtr() to be inconsistent with the return value during legal/cost phases (further analysis and information of the bug is in D39346). This patch is an alternative fix to PR34965 following the CM_Widen approach proposed by Ayal and Gil in D39346. It extends InstWidening enum with CM_Widen_Reverse to properly record the widening decision for consecutive reverse memory accesses and, consequently, get rid of the Legal->isConsetuviePtr() call in LV code gen. I think this is a simpler/cleaner solution to PR34965 than the one in D39346. Fixes PR34965. Patch by Diego Caballero, thanks! Differential Revision: https://reviews.llvm.org/D40742 llvm-svn: 320913	2017-12-16 02:55:24 +00:00
Galina Kistanova	5f8c84c5be	Fixed warning 'function declaration isn’t a prototype [-Werror=strict-prototypes]' llvm-svn: 320912	2017-12-16 02:54:17 +00:00
Hal Finkel	e86a8b79b5	[PowerPC, AsmParser] Enable the mnemonic spell corrector r307148 added an assembly mnemonic spelling correction support and enabled it on ARM. This enables that support on PowerPC as well. Patch by Dmitry Venikov, thanks! Differential Revision: https://reviews.llvm.org/D40552 llvm-svn: 320911	2017-12-16 02:42:18 +00:00
Craig Topper	c08960597c	[X86] Add 128 and 256-bit VPOPCNTDQ instructions. Adjust some tablegen classes LZCNT/POPCNT. I think when this instruction was first published it was only for a Knights CPU and thus VLX version was missing. llvm-svn: 320910	2017-12-16 02:40:28 +00:00
Vitaly Buka	12f9b8cf24	[LTO] Update tests for r320905 llvm-svn: 320909	2017-12-16 02:40:20 +00:00
Vitaly Buka	fd563a0352	Remove trailing whitespace llvm-svn: 320907	2017-12-16 02:12:35 +00:00
Sam Clegg	731a76646f	[WebAssembly] Return ArrayRef's rather than const std::vector& From working on lld I've learned this is generally the preferred way for several reasons (e.g. more concise, improves encapsulation). Differential Revision: https://reviews.llvm.org/D41265 llvm-svn: 320906	2017-12-16 02:10:16 +00:00
Vitaly Buka	a5376f393e	[LTO] Make processing of combined module more consistent Summary: 1. Use stream 0 only for combined module. Previously if combined module was not processes ThinLTO used the stream for own output. However small changes in input, could trigger combined module and shuffle outputs making life of llvm::LTO harder. 2. Always process combined module and write output to stream 0. Processing empty combined module is cheap and allows llvm::LTO users to avoid implementing processing which is already done in llvm::LTO. Subscribers: mehdi_amini, inglorion, eraman, hiraditya Differential Revision: https://reviews.llvm.org/D41267 llvm-svn: 320905	2017-12-16 02:10:00 +00:00
Teresa Johnson	160f4bb803	Add another missing -enable-import-metadata to test r320895 modified a test so that it needs -enable-import-metadata which is false by default for NDEBUG, found another place that needs this added. llvm-svn: 320903	2017-12-16 01:35:36 +00:00
Hal Finkel	2ff24731bb	[SimplifyLibCalls] Inline calls to cabs when it's safe to do so When unsafe algerbra is allowed calls to cabs(r) can be replaced by: sqrt(creal(r)creal(r) + cimag(r)cimag(r)) Patch by Paul Walker, thanks! Differential Revision: https://reviews.llvm.org/D40069 llvm-svn: 320901	2017-12-16 01:26:25 +00:00
Hal Finkel	7333aa9f16	[LV] NFC patch for moving VPRecipe class definitions from LoopVectorize.cpp to VPlan.h This is a small step forward to move VPlan stuff to where it should belong (i.e., VPlan.): 1. VPRecipe classes in LoopVectorize.cpp are moved to VPlan.h. 2. Many of VPRecipe::print() and execute() definitions are still left in LoopVectorize.cpp since they refer to things declared in LoopVectorize.cpp. To be moved to VPlan.cpp at a later time. 3. InterleaveGroup class is moved from anonymous namespace to llvm namespace. Referencing it in anonymous namespace from VPlan.h ended up in warning. Patch by Hideki Saito, thanks! Differential Revision: https://reviews.llvm.org/D41045 llvm-svn: 320900	2017-12-16 01:12:50 +00:00
Teresa Johnson	4358a40345	Add -enable-import-metadata to test r320895 modified a test so that it needs -enable-import-metadata which is false by default for NDEBUG. llvm-svn: 320899	2017-12-16 01:00:48 +00:00
Craig Topper	6b129fde5a	[X86] Add back the assert from r320830 that was reverted in r320850 Hopefully r320864 has fixed the offending case that failed the assert. llvm-svn: 320898	2017-12-16 00:33:16 +00:00
Teresa Johnson	69b2de8466	Fix NDEBUG build problem in r320895 Fix incorrect placement of #endif causing NDEBUG build failures. llvm-svn: 320897	2017-12-16 00:29:31 +00:00
Teresa Johnson	81bbf74265	[ThinLTO] Enable importing of aliases as copy of aliasee Summary: This implements a missing feature to allow importing of aliases, which was previously disabled because alias cannot be available_externally. We instead import an alias as a copy of its aliasee. Some additional work was required in the IndexBitcodeWriter for the distributed build case, to ensure that the aliasee has a value id in the distributed index file (i.e. even when it is not being imported directly). This is a performance win in codes that have many aliases, e.g. C++ applications that have many constructor and destructor aliases. Reviewers: pcc Subscribers: mehdi_amini, inglorion, eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D40747 llvm-svn: 320895	2017-12-16 00:18:12 +00:00
David Blaikie	2110924909	Fix WebAssembly backend for some LLVM API changes llvm-svn: 320893	2017-12-15 23:52:06 +00:00
Quentin Colombet	893e0f15e2	[TableGen][GlobalISel] Make the different Matcher comparable This opens refactoring opportunities in the match table now that we can check that two predicates are the same. NFC. llvm-svn: 320890	2017-12-15 23:24:39 +00:00
Quentin Colombet	a646ef08e8	[TableGen][GlobalISel] Fix unused variable warning in release mode Introduced in r320887. NFC. llvm-svn: 320889	2017-12-15 23:24:36 +00:00
Paul Robinson	6d0484f2b6	Revert "Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header."" This reverts commit 0afef672f63f0e4e91938656bc73424a8c058bfc. Still failing at runtime on bots. llvm-svn: 320888	2017-12-15 23:21:52 +00:00
Quentin Colombet	aad20be6ca	[TableGen][GlobalISel] Have the predicate directly know which data they are dealing with Prior to this patch, a predicate wouldn't make sense outside of its rule. Indeed, it was only during emitting a rule that a predicate would be made aware of the IDs of the data it is checking. Because of that, predicates could not be moved around or compared between each other. NFC. llvm-svn: 320887	2017-12-15 23:07:42 +00:00
Paul Robinson	5c8f7d7de4	Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header." Adds missing support for DW_FORM_data16. Update of r320852, fixing the unittest to use a hand-coded struct instead of std::array to guarantee data layout. Differential Revision: https://reviews.llvm.org/D41090 llvm-svn: 320886	2017-12-15 22:57:17 +00:00
Matthias Braun	042fed54fb	Fix unused variable in non-assert builds llvm-svn: 320885	2017-12-15 22:53:33 +00:00
Matthias Braun	f1caa2833f	MachineFunction: Return reference from getFunction(); NFC The Function can never be nullptr so we can return a reference. llvm-svn: 320884	2017-12-15 22:22:58 +00:00
Matthias Braun	4684033a2f	MachineFunction: Slight refactoring; NFC Slight cleanup/refactor in preparation for upcoming commit. llvm-svn: 320882	2017-12-15 22:22:46 +00:00
Matthias Braun	89488fffdd	MachineModuleInfo: Remove unused function; NFC Remove the unused setModule() function; it would be dangerous if someone actually used it as it wouldn't reset/recompute various other module related data. llvm-svn: 320881	2017-12-15 22:22:42 +00:00
Galina Kistanova	6532b3b9d2	Fixed the gcc 'enumeral and non-enumeral type in conditional expression [-Werror=extra]' warning introduced by r320750 llvm-svn: 320868	2017-12-15 22:15:29 +00:00
Krzysztof Parzyszek	058d3cec15	[Hexagon] Remove recursion in visitUsesOf, replace with use queue This is primarily to reduce stack usage, but ordering the use queue according to the position in the code (earlier instructions visited before later ones) reduces the number of unnecessary bottoms due to visiting instructions out of order, e.g. %reg1 = copy %reg0 %reg2 = copy %reg0 %reg3 = and %reg1, %reg2 Here, reg3 should be known to be same as reg0-2, but if reg3 is evaluated after reg1 is updated, but before reg2 is updated, the two inputs to the and will appear different, causing reg3 to become bottom. llvm-svn: 320866	2017-12-15 21:34:05 +00:00
Krzysztof Parzyszek	266d6f03a1	[Hexagon] Handle concat_vectors of all allowed HVX types llvm-svn: 320865	2017-12-15 21:23:12 +00:00
Craig Topper	6b8ac481f1	[X86] Use AND32ri8 instead of AND64ri8 in Asan code in EmitCallAsanReport for 32-bit mode. This seemed to work due to a quirk in the X86 MC encoder that didn't emit a REX byte that the AND64ri8 implies when in 32-bit mode. This made the encoding the same as AND32ri8. I tried to add an assert to catch the dropped REX prefix that caught this. llvm-svn: 320864	2017-12-15 21:18:06 +00:00
Craig Topper	422ed23298	[X86] In LowerVectorCTPOP use ISD::ZERO_EXTEND/ISD::TRUNCATE instead of the target specific nodes. The target independent nodes will get legalized to the target specific nodes by their own legalization process. Someday I'd like to stop using a target specific for zero extends and truncates of legal types so the less places we reference the target specific opcode the better. llvm-svn: 320863	2017-12-15 21:18:05 +00:00
Craig Topper	f08ab74ae3	[X86] Remove unnecessary TODO. When I wrote it I thought we were missing a potential optimization for KNL. But investigating further shows that for KNL we still do the optimal thing by widening to v4f32 and then using special isel patterns to widen again to zmm a register. llvm-svn: 320862	2017-12-15 20:57:18 +00:00
Vitaly Buka	cad70885a5	[LTO] Remove unused RegularLTOState::HasModule llvm-svn: 320859	2017-12-15 20:50:25 +00:00
Jun Bum Lim	44c58d35c1	Re-commit : [LICM] Allow sinking when foldable in loop This recommits r320823 reverted due to the test failure in sink-foldable.ll and an unused variable. Added "REQUIRES: aarch64-registered-target" in the test and removed unused variable. Original commit message: Continue trying to sink an instruction if its users in the loop is foldable. This will allow the instruction to be folded in the loop by decoupling it from the user outside of the loop. Reviewers: hfinkel, majnemer, davidxl, efriedma, danielcdh, bmakam, mcrosier Reviewed By: hfinkel Subscribers: javed.absar, bmakam, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D37076 llvm-svn: 320858	2017-12-15 20:33:24 +00:00
Paul Robinson	67ca67d1b2	Revert "[DWARFv5] Dump an MD5 checksum in the line-table header." Unit test fails on some bots. llvm-svn: 320857	2017-12-15 20:29:25 +00:00
Jake Ehrlich	777fb00a76	[llvm-objcopy] Reformat everything using clang-format -i Overtime some non-clang formatted code has creeped into llvm-objcopy. This patch fixes all of that. Differential Revision: https://reviews.llvm.org/D41262 llvm-svn: 320856	2017-12-15 20:17:55 +00:00
Krzysztof Parzyszek	29832a6c8b	[Hexagon] Fix operand-swapping PatFrag for atomic stores PatFrag now has the atomicity information stored as bit fields. They need to be copied to the new PatFrag. llvm-svn: 320855	2017-12-15 20:13:57 +00:00
Paul Robinson	72546fe87b	[DWARFv5] Dump an MD5 checksum in the line-table header. Adds missing support for DW_FORM_data16. Differential Revision: https://reviews.llvm.org/D41090 llvm-svn: 320852	2017-12-15 19:52:34 +00:00
Craig Topper	df2521a638	[X86] Remove assert in X86MCCodeEmitter.cpp that was added in r320830. It seems to be failing real code which is concerning, but we were silently getting away with it. I'll investigate further. llvm-svn: 320850	2017-12-15 19:38:14 +00:00
Craig Topper	3fb8386685	[SelectionDAG][X86] Fix insert_vector_elt lowering for v32i1/v64i1 with non-constant index Summary: Currently we don't handle v32i1/v64i1 insert_vector_elt correctly as we fail to look at the number of elements closely and assume it can only be v16i1 or v8i1. We also can't type legalize v64i1 insert_vector_elt correctly on KNL due to the type not being byte addressable as required by the legalizing through memory accesses path requires. For the first issue, the patch now tries to pick a 512-bit register with the correct number of elements and promotes to that. For the second issue, we now extend the vector to a byte addressable type, do the stores to memory, load the two halves, and then truncate the halves back to the original type. Technically since we changed the type, we may not need two loads, but actually checking that is more work and for the v64i1 case we do need them. Reviewers: RKSimon, delena, spatel, zvi Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40942 llvm-svn: 320849	2017-12-15 19:35:22 +00:00
Sean Fertile	42b13343fd	[Memcpy Loop Lowering] Insert loop BB inbetween the split BB. The original memcpy expansion inserted the loop basic block inbetween the 2 new basic blocks created by splitting the original block the memcpy call was in. This commit makes the new memcpy expansion do the same to keep the layout of the IR matching between the old and new implementations. Differential Review: https://reviews.llvm.org/D41197 llvm-svn: 320848	2017-12-15 19:29:12 +00:00
Craig Topper	23c348850f	[X86] Add 'Requires<[In64BitMode]>' to a bunch of instructions that only have memory and immediate operands. The asm parser wasn't preventing these from being accepted in 32-bit mode. Instructions that use a GR64 register are protected by the parser rejecting the register in 32-bit mode. llvm-svn: 320846	2017-12-15 19:01:51 +00:00
Craig Topper	914b1d524c	[X86] Change BNDLDX to use anymem instead of i64mem for itsmemory operand. This instruction doesn't access memory. It juse use a similar looking memory encoding. Don't require Intel syntax to put "qword ptr" in front of it. llvm-svn: 320845	2017-12-15 19:01:50 +00:00
Craig Topper	446f3e2084	[X86] Remove the 'Requires' In64BitMode/Not64BitMode from the LWP instructions. These aren't doing anything due to a top level "let Predicates =". I think the GR32/GR64 register class protects these anyway. llvm-svn: 320844	2017-12-15 19:01:49 +00:00
Craig Topper	365e8aa5d5	[X86] Remove the 'Requires<[In64BitMode]>' from SHSTK instructions. This has no effect due to a top level "let Predicates =" around the instructions. But its also not required because the GR64 usage in the instruction guarantees it can never match. llvm-svn: 320843	2017-12-15 19:01:48 +00:00
Sanjay Patel	600d24b49c	[TargetLibraryInfo] fix documentation comment; NFC llvm-svn: 320842	2017-12-15 18:54:29 +00:00
Sanjay Patel	76657f81ba	[CodeGen] fix documentation comments; NFC llvm-svn: 320840	2017-12-15 18:34:45 +00:00
Evandro Menezes	a9134e86f1	[AArch64] Fix typo in the ASIMD instruction optimization pass Fix typo in the representative instruction replacement. Also, fix formatting and reword some comments. llvm-svn: 320839	2017-12-15 18:26:54 +00:00
Sanjay Patel	c722e26549	fix typo in comment and remove inaccurate comment; NFC llvm-svn: 320838	2017-12-15 18:25:13 +00:00
Andrew V. Tischenko	22f0742dda	Fix for bug PR35549 - Repeated schedule comments. Differential Revision: https://reviews.llvm.org/D40960 llvm-svn: 320837	2017-12-15 18:13:05 +00:00
Jun Bum Lim	5efd4d8b5e	Revert "Re-commit : [LICM] Allow sinking when foldable in loop" This reverts commit r320833. llvm-svn: 320836	2017-12-15 18:12:49 +00:00
Sanjay Patel	d3ddf28e7f	[CodeGen] fix documentation comments; NFC llvm-svn: 320835	2017-12-15 18:09:33 +00:00
Jun Bum Lim	83ccad6684	Re-commit : [LICM] Allow sinking when foldable in loop This recommit r320823 after fixing a test failure. Original commit message: Continue trying to sink an instruction if its users in the loop is foldable. This will allow the instruction to be folded in the loop by decoupling it from the user outside of the loop. Reviewers: hfinkel, majnemer, davidxl, efriedma, danielcdh, bmakam, mcrosier Reviewed By: hfinkel Subscribers: javed.absar, bmakam, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D37076 llvm-svn: 320833	2017-12-15 17:58:59 +00:00
Michael Trent	a1703b1fc2	Updated llvm-objdump to display local relocations in Mach-O binaries Summary: llvm-objdump's Mach-O parser was updated in r306037 to display external relocations for MH_KEXT_BUNDLE file types. This change extends the Macho-O parser to display local relocations for MH_PRELOAD files. When used with the -macho option relocations will be displayed in a historical format. All tests are passing for llvm, clang, and lld. llvm-objdump builds without compiler warnings. rdar://35778019 Reviewers: enderby Reviewed By: enderby Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41199 llvm-svn: 320832	2017-12-15 17:57:40 +00:00
Craig Topper	a16395008c	[X86] Fix XSAVE64 and similar instructions to not be allowed by the assembler in 32-bit mode. There was a top level "let Predicates =" in the .td file that was overriding the Requires on each instruction. I've added an assert to the code emitter to catch more cases like this. I'm sure this isn't the only place where the right predicates aren't being applied. This assert already found that we don't block btq/btsq/btrq in 32-bit mode. llvm-svn: 320830	2017-12-15 17:22:58 +00:00
Jun Bum Lim	6136d87f5d	Revert "[LICM] Allow sinking when foldable in loop" This reverts commit r320823. llvm-svn: 320828	2017-12-15 16:35:09 +00:00
Francis Visoiu Mistrih	0b5bdceabf	[CodeGen] Print stack object references as %(fixed-)stack.0 in both MIR and debug output Work towards the unification of MIR and debug output by printing `%stack.0` instead of `<fi#0>`, and `%fixed-stack.0` instead of `<fi#-4>` (supposing there are 4 fixed stack objects). Only debug syntax is affected. Differential Revision: https://reviews.llvm.org/D41027 llvm-svn: 320827	2017-12-15 16:33:45 +00:00
Eugene Leviant	cb12249238	[ThinLTO] Disallow multiple prevailing defs https://reviews.llvm.org/D41291 llvm-svn: 320825	2017-12-15 16:27:33 +00:00
Craig Topper	ad9221d684	[X86] Widen (v2i32 (fp_to_uint v2f64)) to (v8i32 (fp_to_uint v8f64)) during legalization if we have AVX512F, but not VLX. NFC Previously we widened it using isel patterns. llvm-svn: 320824	2017-12-15 16:22:20 +00:00
Jun Bum Lim	22855c26a5	[LICM] Allow sinking when foldable in loop Summary: Continue trying to sink an instruction if its users in the loop is foldable. This will allow the instruction to be folded in the loop by decoupling it from the user outside of the loop. Reviewers: hfinkel, majnemer, davidxl, efriedma, danielcdh, bmakam, mcrosier Reviewed By: hfinkel Subscribers: javed.absar, bmakam, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D37076 llvm-svn: 320823	2017-12-15 16:09:54 +00:00
Sam Parker	18b0d1e5b9	[ARM] Some DAG combine tests Add some more and and shift load combine tests. llvm-svn: 320822	2017-12-15 15:30:39 +00:00
Francis Visoiu Mistrih	5de20e039e	[MIR] Add support for missing CFI directives The following CFI directives are suported by MC but not by MIR: * .cfi_rel_offset * .cfi_adjust_cfa_offset * .cfi_escape * .cfi_remember_state * .cfi_restore_state * .cfi_undefined * .cfi_register * .cfi_window_save Add support for printing, parsing and update tests. Differential Revision: https://reviews.llvm.org/D41230 llvm-svn: 320819	2017-12-15 15:17:18 +00:00
Simon Pilgrim	5009a1c738	[X86] Add RTM schedule tests llvm-svn: 320815	2017-12-15 14:37:28 +00:00
Haicheng Wu	a446151552	[InlineCost] Find repeated loads in the callee SROA analysis of InlineCost can figure out that some stores can be removed after inlining and then the repeated loads clobbered by these stores are also free. This patch finds these clobbered loads and adjust the inline cost accordingly. Differential Revision: https://reviews.llvm.org/D33946 llvm-svn: 320814	2017-12-15 14:34:41 +00:00
Simon Pilgrim	786431231f	[X86] Add MWAITX/MONITORX schedule tests llvm-svn: 320812	2017-12-15 14:22:15 +00:00
Nemanja Ivanovic	6ab32dea12	Fix the second build bot break introduced by r320791. llvm-svn: 320811	2017-12-15 14:17:45 +00:00
Simon Pilgrim	e662fa3752	[X86] Add XOP schedule tests llvm-svn: 320810	2017-12-15 14:02:35 +00:00
Nemanja Ivanovic	1794cdc481	Fix code causing fallthrough warnings in the PPC back end. llvm-svn: 320806	2017-12-15 11:47:48 +00:00
Simon Pilgrim	0c1e0dbb96	[X86] Add AVX512 VPOPCNTDQ schedule tests Demonstrates how to perform full coverage avx512 schedule tests llvm-svn: 320805	2017-12-15 11:32:31 +00:00
Alex Bradbury	0ad4c265d7	[RISCV] Change shift amount operand of RVC shift instructions to uimmlog2xlennonzero c.slli/c.srli/c.srai allow a 5-bit shift in RV32C and a 6-bit shift in RV64C. This patch adds uimmlog2xlennonzero to reflect this constraint as well as tests. Differential Revision: https://reviews.llvm.org/D41216 Patch by Shiva Chen. llvm-svn: 320799	2017-12-15 10:20:51 +00:00
Nemanja Ivanovic	74ecf59cc0	Fix the build bot break introduced by r320791. llvm-svn: 320798	2017-12-15 09:51:34 +00:00
Alex Bradbury	59136ffab1	[RISCV] Enable emission of alias instructions by default This patch switches the default for -riscv-no-aliases to false and updates all affected MC and CodeGen tests. As recommended in D41071, MC tests use the canonical instructions and the CodeGen tests use the aliases. Additionally, for the f and d instructions with rounding mode, the tests for the aliased versions are moved and tightened such that they can actually detect if alias emission is enabled. (see D40902 for context) Differential Revision: https://reviews.llvm.org/D41225 Patch by Mario Werner. llvm-svn: 320797	2017-12-15 09:47:01 +00:00
Fedor Sergeev	4b86d79048	[PM] port Rewrite Statepoints For GC to the new pass manager. Summary: The port is nearly straightforward. The only complication is related to the analyses handling, since one of the analyses used in this module pass is domtree, which is a function analysis. That requires asking for the results of each function and disallows a single interface for run-on-module pass action. Decided to copy-paste the main body of this pass. Most of its code is requesting analyses anyway, so not that much of a copy-paste. The rest of the code movement is to transform all the implementation helper functions like stripNonValidData into non-member statics. Extended all the related LLVM tests with new-pass-manager use. No failures. Reviewers: sanjoy, anna, reames Reviewed By: anna Subscribers: skatkov, llvm-commits Differential Revision: https://reviews.llvm.org/D41162 llvm-svn: 320796	2017-12-15 09:32:11 +00:00
Roger Ferrer Ibanez	9fcc4727ac	[ARM] Add tests for D34515 This is NFC and a preparatory step for D34515. Differential Revision: https://reviews.llvm.org/D41122 llvm-svn: 320795	2017-12-15 09:24:46 +00:00
Eugene Leviant	746f152dd6	[LLVMgold] Don't set undefined symbol as prevailing Differential revision: https://reviews.llvm.org/D41113 llvm-svn: 320794	2017-12-15 09:18:21 +00:00
Nemanja Ivanovic	6995e5dae7	[PowerPC] Convert r+r instructions to r+i (pre and post RA) This patch adds the necessary infrastructure to convert instructions that take two register operands to those that take a register and immediate if the necessary operand is produced by a load-immediate. Furthermore, it uses this infrastructure to perform such conversions twice - first at MachineSSA and then pre-emit. There are a number of reasons we may end up with opportunities for this transformation, including but not limited to: - X-Form instructions chosen since the exact offset isn't available at ISEL time - Atomic instructions with constant operands (we will add patterns for this in the future) - Tail duplication may duplicate code where one block contains this redundancy - When emitting compare-free code in PPCDAGToDAGISel, we don't handle constant comparands specially Furthermore, this patch moves the initialization of PPCMIPeepholePass so that it can be used for MIR tests. llvm-svn: 320791	2017-12-15 07:27:53 +00:00
Craig Topper	7cfacbf6ea	[X86] Fix a couple bugs in my recent changes to vXi1 insert_subvector lowering. A couple places didn't use the same SDValue variables to connect everything all the way through. I don't have a test case for a bug in insert into the lower bits of a non-zero, non-undef vector. Not sure the best way to create that. We don't create the case when lowering concat_vectors which is the main way to get insert_subvectors. llvm-svn: 320790	2017-12-15 07:16:41 +00:00
Serguei Katkov	67da7696a0	[SCEV] Fix the movement of insertion point in expander. PR35406. We cannot move the insertion point to header if SCEV contains div/rem operations due to they may go over check for zero denominator. Reviewers: sanjoy, mkazantsev, sebpop Reviewed By: sebpop Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41229 llvm-svn: 320789	2017-12-15 05:24:42 +00:00
Yaxun Liu	c41e2f6e7b	Recommit CodeGen: Fix assertion in machine inst sheduler due to llvm.dbg.value The regression on ppc64 was not due to this commit. llvm-svn: 320788	2017-12-15 03:56:57 +00:00
Nemanja Ivanovic	0d47d32caa	Disabling r312514 as it causes miscompiles that show up on bootstrap The compare elimination peephole introduced in https://reviews.llvm.org/rL312514 causes a miscompile in AMDGPUInstrInfo.cpp which in turn causes some AMDGPU test case failures in stage2 bootstrap testing. This miscompile didn't cause any test case failures until https://reviews.llvm.org/rL320614, so it appeared as if that patch caused these failures. Disabling this transformation for now to bring the build bots back to green and the author of the patch will investigate the miscompile. llvm-svn: 320786	2017-12-15 01:38:03 +00:00
Shoaib Meenai	c84211b9a8	[cmake] Fix clang-cl cross-compilation on macOS macOS paths usually start with /Users, which clang-cl interprets as a macro undefine, leading to pretty much everything failing to compile. CMake should be taught to put a -- in its compilation rules for clang-cl (and I've been meaning to submit that upstream for a while). In the meantime, however, and to support older CMake versions, we can just create a custom make rules override to fix the compilation rules. Differential Revision: https://reviews.llvm.org/D41219 llvm-svn: 320785	2017-12-15 01:05:48 +00:00
Craig Topper	1a1e6d6cf6	[X86] Add a TODO about v8i1 CONCAT_VECTORS. llvm-svn: 320784	2017-12-15 01:03:46 +00:00
Craig Topper	23951ec2cd	[SelectionDAG] Make getNode calls that take an ArrayRef of SDValue for operands call NewSDValueDbgMsg. This makes it work better with some build_vector and concat_vectors creations. Adjust the NewSDValueDbgMsg in getConstant to avoid duplicating the print when it calls getSplatBuildVector since getSplatBuildVector didn't trigger a print before. llvm-svn: 320783	2017-12-15 01:03:45 +00:00
Craig Topper	5ebf3ac9c2	[X86] Further rearrange the setOperationAction calls to separate the ones that require 512-bit registers OR VLX into separate sections. NFCI We have several instructions that were introduced in AVX512F that are only available in 512-bit form on KNL. We still make use of them for 128/256 by artificially widening and extracting during isel. This commit separates these operations from the true 512-bit operations. This way we can qualify the normal 512-bit operations with needing 512-bit register support. And these special operations will get qualified with needing 512-bit registers OR VLX. The 512-bit register qualification will be introduced in a future patch this just gets everything grouped to minimize deltas on that patch. llvm-svn: 320782	2017-12-15 01:03:43 +00:00
Craig Topper	07a28f777e	[X86] Group setOperationActions related to vXi1 masks together. NFCI Previously they were sort of interleaved in with XMM/YMM/ZMM action related code. Trying to separate things so its easier to split 512-bit vectors later. llvm-svn: 320781	2017-12-15 01:03:42 +00:00

... 2 3 4 5 6 ...

158335 Commits