Add support for abs() to ConstantRange. This will allow handling of the
SPF_ABS select flavor in LVI and will also come in handy as a
primitive for the srem implementation.
The implementation is slightly tricky, because a) abs of the signed minimum
is the signed minimum and b) sign-wrapped ranges may have an abs() that is
smaller than a full range, so we need to handle them explicitly.
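A minimal sketch of case (a) in terms of APInt (illustrative only, not the
actual ConstantRange::abs() implementation):

  #include "llvm/ADT/APInt.h"
  using namespace llvm;

  // abs() of a single value: the signed minimum is special because negating
  // it wraps back to itself, so abs(SINT_MIN) is still SINT_MIN and the
  // resulting range must continue to contain it.
  APInt absValue(const APInt &X) {
    if (X.isMinSignedValue())
      return X;                      // -SINT_MIN is not representable
    return X.isNegative() ? -X : X;  // otherwise flip the sign if negative
  }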
Differential Revision: https://reviews.llvm.org/D61084
llvm-svn: 359321
The PPC vector cost model values for insert/extract element reflect older
processors that lacked vector insert/extract and move-to/move-from VSR
instructions. Update getVectorInstrCost to give appropriate values for when
the newer instructions are present.
Differential Revision: https://reviews.llvm.org/D60160
llvm-svn: 359313
In addition, fix the two tests and convert them to be yaml2obj based. This
allows us to delete two executables.
X86/weak.test: 'v' was not tested
X86/init-fini.test: the symbol types of __bss_start, _edata and _end were wrong
GNU nm reports __init_array_start as 't', and __preinit_array_start as 'd'.
__init_array_start is 't' just because its section ".init_array" starts with ".init".
'd' makes more sense and allows us to drop the weird SHT_INIT_ARRAY rule.
So, change __init_array_start to 'd' instead.
llvm-svn: 359311
This encapsulates the section-specific code inside the
corresponding writeSectionContent methods, making the code a bit
more consistent.
llvm-svn: 359297
Create a matchBitOpReduction helper that checks for the pattern with any opcode.
First step towards reusing this code to recognize other scalar reduction patterns.
llvm-svn: 359296
The slow path (taken when there is at least one non-US-ASCII character)
will be slower, but that doesn't matter.
Differential Revision: https://reviews.llvm.org/D61178
llvm-svn: 359294
As detailed on PR40758, Bobcat/Jaguar can perform vector immediate shifts on the same pipes as vector ANDs with the same latency - so it doesn't make sense to replace a shl+lshr with a shift+and pair, as that requires an additional mask (with the extra constant pool, loading and register pressure costs).
Differential Revision: https://reviews.llvm.org/D61068
llvm-svn: 359293
A small step towards combining shuffles across vector sizes - this recognizes when a shuffle's operands are all extracted from the same larger source and tries to combine to a unary shuffle of that source instead. Fixes one of the test cases from PR34380.
Differential Revision: https://reviews.llvm.org/D60512
llvm-svn: 359292
This enables the pass to be used in the absence of
TargetTransformInfo. When the argument isn't passed, the factory
defaults to UninitializedAddressSpace and the flat address space is
obtained from the TargetTransformInfo as before this change. Existing
users won't have to change.
Patch by Kevin Petit.
Differential Revision: https://reviews.llvm.org/D60602
llvm-svn: 359290
The code was using the alignment of a pointer to the value, not the
alignment of the constant itself.
Maybe we got away with it so far because the pointer alignment is
fairly high, but we did end up under-aligning <16 x i8> vectors,
which was caught in the Chromium build after lld stopped over-aligning
the .rodata.cst16 section in r356428. (See crbug.com/953815)
Differential revision: https://reviews.llvm.org/D61124
llvm-svn: 359287
When constrainRegClass is called, if the constraining happens on a use, the COPY
needs to be inserted before the instruction that contains the MachineOperand;
but if we are constraining a definition, it actually needs to be added
after the instruction. In addition, the COPY needs to have its operands
flipped: in the use case we are copying from the old unconstrained register
to the new constrained register, while in the definition case we are copying
from the new constrained register that the instruction defines to the old
unconstrained register.
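Roughly, the insertion logic looks like this (a sketch; names such as
insertCopyFor/OldReg/ConstrainedReg are illustrative, not the ones used in the
patch):

  #include "llvm/CodeGen/MachineInstrBuilder.h"
  #include "llvm/CodeGen/TargetInstrInfo.h"
  #include "llvm/CodeGen/TargetOpcodes.h"
  using namespace llvm;

  // MI contains the MachineOperand MO being constrained from OldReg to
  // ConstrainedReg.
  void insertCopyFor(MachineInstr &MI, MachineOperand &MO,
                     const TargetInstrInfo &TII, Register OldReg,
                     Register ConstrainedReg) {
    MachineBasicBlock &MBB = *MI.getParent();
    const DebugLoc &DL = MI.getDebugLoc();
    if (MO.isUse()) {
      // Use: copy the old, unconstrained value into the constrained register
      // before the instruction that reads it.
      BuildMI(MBB, MI.getIterator(), DL, TII.get(TargetOpcode::COPY),
              ConstrainedReg)
          .addReg(OldReg);
    } else {
      // Def: the instruction defines the constrained register, so the copy
      // goes after it, with the operands flipped, moving the value back into
      // the old register.
      BuildMI(MBB, std::next(MI.getIterator()), DL,
              TII.get(TargetOpcode::COPY), OldReg)
          .addReg(ConstrainedReg);
    }
  }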
llvm-svn: 359282
Summary:
llvm-{objcopy,strip} (and many other LLVM binary utilities) accept
cl::opt style -long-option as well as many short options (e.g. -p -S
-x). People who use them as a replacement for GNU binutils often use the
grouped option syntax (POSIX Utility Conventions), e.g. -Sx => -S -x,
-Wd => -W -d, -sj.text => -s -j.text.
There is ambiguity if a long option starts with the character used by a
short option. Drop support for -long-option to resolve the ambiguity.
This divergence from other utilities is accepted (other utilities
continue supporting -long-option).
https://lists.llvm.org/pipermail/llvm-dev/2019-April/131786.html
Reviewers: alexshap, jakehehrlich, jhenderson, rupprecht, espindola
Reviewed By: jakehehrlich, jhenderson, rupprecht
Subscribers: grimar, emaste, arichardson, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D60439
llvm-svn: 359265
isValidCandidateForColdCC is much more expensive than
TTI.useColdCCForColdCall, which by default just returns false. Avoid
doing this work if we're not going to look at the answer anyway.
This change is NFC, but I see significant compile time improvements on
some code with pathologically many functions.
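In effect the check ordering becomes something like the following (hedged
sketch of the call site; the exact names and arguments in GlobalOpt may
differ):

  // Cheap, usually-false target hook first; the expensive candidate analysis
  // only runs when its answer can actually be used.
  if (TTI.useColdCCForColdCall(F) &&
      isValidCandidateForColdCC(F, GetBFI, AllCallsCold)) {
    // ... switch F and its call sites to the cold calling convention ...
  }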
llvm-svn: 359253
When failing materialization of a symbol X, remove X from the dependants list
of any of X's dependencies. This ensures that when X's dependencies are
emitted (or fail themselves) they do not try to access the no-longer-existing
MaterializationInfo for X.
llvm-svn: 359252
These builtins provide access to the new integer and
sub-integer variants of MMA (matrix multiply-accumulate) instructions
provided by CUDA-10.x on sm_75 (AKA Turing) GPUs.
Also added a feature for PTX 6.4. While Clang/LLVM does not generate
any PTX instructions that need it, we still need to pass it through to
ptxas in order to be able to compile code that uses the new 'mma'
instruction as inline assembly (e.g. used by NVIDIA's CUTLASS library
https://github.com/NVIDIA/cutlass/blob/master/cutlass/arch/mma.h#L101)
Differential Revision: https://reviews.llvm.org/D60279
llvm-svn: 359248
All of the new instructions are still handled mostly by tablegen. I've slightly
refactored the code to drive intrinsic/instruction generation from a master
list of supported variants, so all irregularities have to be implemented in one place only.
The test generation script wmma.py has been refactored in a similar way.
Differential Revision: https://reviews.llvm.org/D60015
llvm-svn: 359247
PTX 6.3 requires using ".aligned" in the MMA instruction names.
In order to generate the correct name, we now pass the current
PTX version to each instruction as an extra constant operand,
and the InstPrinter adjusts its output accordingly.
Differential Revision: https://reviews.llvm.org/D59393
llvm-svn: 359246
Generalized constructions of 'fragments' of MMA operations to provide
common primitives for construction of the ops. This will make it easier
to add new variants of the instructions that operate on integer types.
Use nested foreach loops, which make it possible to better control
the naming of the intrinsics.
This patch does not affect LLVM's output, so there are no test changes.
Differential Revision: https://reviews.llvm.org/D59389
llvm-svn: 359245
Adds a representation of the section header table to XCOFFObjectFile,
and implements enough to dump the section headers with llvm-objdump.
Differential Revision: https://reviews.llvm.org/D60784
llvm-svn: 359244
Summary:
This value is derived from the host triple, which on the machine
I'm currently using is `ppc64le-linux-redhat`. This change makes
LLVM compile.
Reviewers: nemanjai
Differential Revision: https://reviews.llvm.org/D57118
llvm-svn: 359242
I added a diagnostic along the lines of `-Wpessimizing-move` to detect `return x = y` suppressing copy elision, but I don't know if the diagnostic is really worth it. Anyway, here are the places where my diagnostic reported that copy elision would have been possible if not for the assignment.
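For illustration, the pattern the diagnostic flags versus the rewrite that
avoids it (example code, not part of the patch):

  #include <string>

  // `return x = y;` returns the result of the assignment, which is an lvalue
  // expression rather than just the name `x`, so the return value must be
  // copy-constructed: no copy elision and no implicit move.
  std::string pessimizing(std::string x, std::string y) {
    return x = y;
  }

  // Splitting the statement lets the return use the parameter `x` directly,
  // which is at least eligible for an implicit move into the return value.
  std::string better(std::string x, std::string y) {
    x = y;
    return x;
  }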
P1155R1 in the post-San-Diego WG21 (C++ committee) mailing discusses whether WG21 should fix this pitfall by just changing the core language to permit copy elision in cases like these.
(Kona update: The bulk of P1155 is proceeding to CWG review, but specifically *not* the parts that explored the notion of permitting copy-elision in these specific cases.)
Reviewed By: dblaikie
Author: Arthur O'Dwyer
Differential Revision: https://reviews.llvm.org/D54885
llvm-svn: 359236
This case was missing before, so we couldn't legalize it.
Add it to AArch64LegalizerInfo.cpp and update select-extract-vector-elt.mir.
llvm-svn: 359231
Let the ARC optimizer bail out if the number of pointers it keeps track of
becomes too large.
The ARC optimizer does a top-down and a bottom-up traversal of the whole
function to pair up retain and release instructions and remove them.
This can be expensive if the number of instructions in the function and the
number of pointer states it tracks are large, since it has to look at each
pointer state and determine whether the instruction being visited can
potentially use the pointer.
This patch adds a command line option that sets a limit on the number of
pointers it tracks.
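Such a cap is typically exposed via cl::opt, along these lines (sketch only;
the actual option name and default value may differ):

  #include "llvm/Support/CommandLine.h"
  using namespace llvm;

  // Hypothetical flag name and default; only the shape of the option matters.
  static cl::opt<unsigned> MaxPtrStates(
      "arc-opt-max-ptr-states",
      cl::desc("Bail out of ARC optimization if the number of pointer states "
               "being tracked exceeds this limit"),
      cl::Hidden, cl::init(4096));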
rdar://problem/49477063
Differential Revision: https://reviews.llvm.org/D61100
llvm-svn: 359226
This adds a legalization rule for G_ZEXT, G_ANYEXT, and G_SEXT which allows
extends whenever the types will fit in registers (or the source is an s1).
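Roughly, the rule has this shape inside the AArch64LegalizerInfo constructor
(simplified sketch; the real predicate is more precise about which
source/destination combinations are allowed):

  // Assumes `using namespace TargetOpcode;` as in AArch64LegalizerInfo.cpp.
  getActionDefinitionsBuilder({G_SEXT, G_ZEXT, G_ANYEXT})
      .legalIf([](const LegalityQuery &Query) {
        LLT DstTy = Query.Types[0];
        LLT SrcTy = Query.Types[1];
        // Legal if the source is s1 or the result still fits in a register.
        return SrcTy == LLT::scalar(1) || DstTy.getSizeInBits() <= 128;
      });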
Update tests. Add GISel checks throughout all of arm64-vabs.ll,
where we now select a good portion of the code. Add GISel checks to
arm64-subvector-extend.ll, which has a good number of vector extends in it.
Differential Revision: https://reviews.llvm.org/D60889
llvm-svn: 359222
We had special case handling here, but it uses a scalar any_extend for the
promotion and then bitcasts to the final type. This won't split up the input data
into multiple promoted elements like we need.
This patch falls back to doing the conversion through memory.
Fixes PR41594 which I believe was reflected in the bitcast-vector-bool.ll
changes. The changes to vector-half-conversions.ll are fixing a previously
unknown miscompile from this issue.
Differential Revision: https://reviews.llvm.org/D61114
llvm-svn: 359219
When evaluating a store through a bitcast, the evaluator tries to move the
bitcast from the pointer onto the stored value. If the cast is invalid, it
tries to "introspect" the type to get a valid cast by obtaining a pointer to
the initial element (if the type is nested, this may require walking several
initial elements).
In some situations it is possible to get a bitcast on a load (e.g. with
unions, where the bitcast may not be the same type as the store). However,
the equivalent logic to introspect the type, as is done for the store, is
missing. This patch adds this logic.
Note, when developing the patch I was unhappy with adding similar logic
directly to the load case as it could get out of step. Instead, I have
abstracted the "introspection" into a helper function, with the specifics
being handled by a passed-in lambda function.
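The shape of that helper is roughly the following (an illustrative sketch,
not the actual code in Evaluator.cpp):

  #include "llvm/ADT/STLExtras.h" // llvm::function_ref
  #include "llvm/IR/Constants.h"
  using namespace llvm;

  // Shared retry loop: TryCast attempts the cast at the current nesting
  // level; PeelFirstElement steps into the initial element of a nested
  // aggregate, or returns null when there is nothing left to walk into.
  static Constant *introspectUntilValid(
      Constant *C, function_ref<Constant *(Constant *)> TryCast,
      function_ref<Constant *(Constant *)> PeelFirstElement) {
    while (C) {
      if (Constant *Result = TryCast(C))
        return Result;
      C = PeelFirstElement(C);
    }
    return nullptr;
  }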
Differential Revision: https://reviews.llvm.org/D60793
llvm-svn: 359205
Add legalizer support for G_FNEARBYINT. It's the same as G_FCEIL etc.
Since the importer allows us to automatically select this after legalization,
also add tests for selection etc. Also update arm64-vfloatintrinsics.ll.
llvm-svn: 359204