llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Keith	d5c05ced82	[flang][NFC] Add accessors to equivalence and common blocks Add a way to get mutable equivalence sets to Scope so that they can have sizes and offsets assigned to them. Change CommonBlockDetails to have mutable symbols so that they can have sizes and offets assigned to them. This also allows the removal of some `const_cast`s. Add MutableSymbolRef and MutableSymbolVector as mutable analogs to SymbolRef and SymbolVector. Replace uses of equivalent types with those names. Differential Revision: https://reviews.llvm.org/D79346	2020-05-06 12:22:28 -07:00
Ulrich Weigand	947f78ac27	[SystemZ] Fix/optimize vec_load_len and related intrinsics When using vec_load/store_len_r with an immediate length operand of 16 or larger, LLVM will currently emit an VLRL/VSTRL instruction with that immediate. This creates a valid encoding (which should be supported by the assembler), but always traps at runtime. This patch fixes this by not creating VLRL/VSTRL in those cases. This would result in loading the length into a register and calling VLRLR/VSTRLR instead. However, these operations with a length of 15 or larger are in fact simply equivalent to a full vector load or store. And in fact the same holds true for vec_load/store_len as well. Therefore, add a DAGCombine rule to replace those operations with plain vector loads or stores if the length is known at compile time and equal or larger to 15.	2020-05-06 21:15:58 +02:00
Daan De Meyer	f21c704553	clang-format: Add ControlStatementsExceptForEachMacros option to SpaceBeforeParens Summary: systemd recently added a clang-format file. One issue I encountered in using clang-format on systemd is that systemd does not add a space before the parens of their foreach macros but clang-format always adds a space. This does not seem to be configurable in clang-format. This revision adds the ControlStatementsExceptForEachMacros option to SpaceBeforeParens which puts a space before all control statement parens except ForEach macros. This drastically reduces the amount of changes when running clang-format on systemd's source code. Reviewers: MyDeveloperDay, krasimir, mitchell-stellar Reviewed By: MyDeveloperDay Subscribers: cfe-commits Tags: #clang-format, #clang Differential Revision: https://reviews.llvm.org/D78869	2020-05-06 20:59:24 +02:00
LemonBoy	7fa5abd343	[SelectionDAG] Fix assertion failure with big shift amounts Calling getShiftAmountTy with LegalTypes set may return a type that's too narrow to hold the shift amount for integer type it's applied to. Fixes the regression introduced by D79096 Differential Revision: https://reviews.llvm.org/D79405	2020-05-06 11:58:37 -07:00
Uday Bondhugula	57d361bd2f	[MLIR][NFC] Rename op trait PolyhedralScope -> AffineScope Rename op trait PolyhedralScope -> AffineScope for consistency. Differential Revision: https://reviews.llvm.org/D79503	2020-05-07 00:19:56 +05:30
Tim Keith	237d0e3c04	[flang] Handle EQUIVALENCE and COMMON in size and offset computations Objects in common blocks have offsets relative to the start of the common block, independent of the enclosing scope, so they are processed first. Add alignment to CommonBlockDetails to record the required alignment of the common block. For equivalence sets, each object depends on the one that is forced to occur first in memory. The rest are recorded in the dependents_ map and have offsets assigned after the other symbols are done. Differential Revision: https://reviews.llvm.org/D79347	2020-05-06 11:45:28 -07:00
Alex Zinenko	26f93d9f37	[mlir] OperationFolder: fix crash in creation of single-result-ops with in-place folds When the folding is performed in place, the `::fold` function does not populate its `results` argument to indicate that. (In the folding hook for single-result operations, the result of the original operation is expected to be returned, but it is then ignored by the wrapper.) `OperationFolder::create` would erronously rely on the _operation_ having zero results instead of on the _folding_ producing zero new results to populate the list of results with those of the original operation. This would lead to a crash for single-result ops with in-place folds where the first result is accessed uncondtionally because the list of results was not properly populated. Use the list of values produced by the folding instead. Differential Revision: https://reviews.llvm.org/D79497	2020-05-06 20:40:32 +02:00
Sanjay Patel	1b678ee8a6	[x86] add test of shift+cast+concat for PR45794; NFC Depends on D79360 / rG2f1fe1864d25 for the transform.	2020-05-06 14:18:04 -04:00
Michael Liao	4ee5a04187	[amdgpu] Fix check of VCC. Summary: - Need to include checking on the new 16-bit subregs. Reviewers: rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79498	2020-05-06 14:16:37 -04:00
Simon Pilgrim	fe6f5ba0bf	[X86][AVX] Add PR45808 test case for badly promoted comparison mask arithmetic	2020-05-06 19:09:57 +01:00
zoecarver	1998e796e9	Revert "Mark values as trivially dead when their only use is a start or end lifetime intrinsic." This reverts commit `95aa28cc8f`.	2020-05-06 11:07:22 -07:00
zoecarver	95aa28cc8f	Mark values as trivially dead when their only use is a start or end lifetime intrinsic. Summary: If the only use of a value is a start or end lifetime intrinsic then mark the intrinsic as trivially dead. This should allow for that value to then be removed as well. Currently, this only works for allocas, globals, and arguments. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79355	2020-05-06 10:58:08 -07:00
Siva Chandra Reddy	a7e1149699	[libc] Fix how math results are compared with MPFR results. Summary: Math results are compared with MPFR results by checking if they are within a tolerance level of the MPFR result. The tolerance level is set using additional bits of precision of the fractional part of a floating point value. Hence, the actual value of the tolerance depends on not only the additional bits, but also on the exponent part of the floating point number. Previously, the exponent part was not considered in evaluating the tolerance value. While it was OK for small values less than 1 (hence sinf, cosf, sincosf tests were OK), it breaks for large values which functions like exp and friends produce. This change uses the exponent value also to evaluate the tolerance value. LLVM libc produced results can now be compared with MPFR produced results for large values also. Reviewers: abrachet Differential Revision: https://reviews.llvm.org/D79278	2020-05-06 10:47:23 -07:00
Sean Silva	e382b3770e	Fix ShapeBase.td Summary: - Add license header. - Remove TODO about extracting ShapeBase.td Differential Revision: https://reviews.llvm.org/D79506	2020-05-06 10:43:16 -07:00
Simon Pilgrim	8817334ce3	[X86] getShuffleScalarElt - add CONCAT_VECTORS/INSERT_VECTOR_ELT support. This helped fix some i686 vXi64 broadcast folds that were becoming v2Xi32 broadcasts because we didn't match the broadcast until after SimplifyDemandedBits worked out we only used the bottom 32-bits in PMUL(U)DQ and type legalization had split the original i64 load. A couple of regressions occurred which required some fixups - adding concat_vectors(broadcast_load,broadcast_load) splat support and recognising (unnecessary) unary shuffles of already broadcasted vectors. This came about as part of the work investigating vector load combining from shuffles for PR42550.	2020-05-06 18:13:33 +01:00
Simon Pilgrim	8c71c2291e	[X86] getShuffleScalarElt - consistently use SDValue. NFC. We never need to call this from anything but ISD::SHUFFLE_VECTOR or target shuffles so shouldn't need to address SDNode directly.	2020-05-06 18:13:33 +01:00
Sanjay Patel	2058c98715	[InstCombine] limit bitcast+insertelement transform to x86 MMX type This is unusual for the general case because we are replacing 1 instruction with 2. Splitting from a potential conflicting transform in D79171	2020-05-06 13:12:36 -04:00
Fangrui Song	57a1c1be53	[Sema] Allow function attribute patchable_function_entry on aarch64_be Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D79495	2020-05-06 10:10:25 -07:00
Christopher Tetreault	782231ac79	[SVE] Fix invalid uses of VectorType::getNumElements() in ValueTracking Summary: Any function in this module that make use of DemandedElts laregely does not work with scalable vectors. DemandedElts is used to define which elements of the vector to look at. At best, for scalable vectors, we can express the first N elements of the vector. However, in practice, most code that uses these functions expect to be able to talk about the entire vector. In principle, this module should be able to be extended to work with scalable vectors. However, before we can do that, we should ensure that it does not cause code with scalable vectors to miscompile. All functions that use a DemandedElts will bail out if the vector is scalable. Usages of getNumElements() are updated to go through FixedVectorType pointers. Reviewers: rengolin, efriedma, sdesmalen, c-rhodes, spatel Reviewed By: efriedma Subscribers: david-arm, tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79053	2020-05-06 10:06:06 -07:00
Kadir Cetinkaya	6d6d48add8	[clangd] Reland 'Handle PresumedLocations in IncludeCollector' Summary: This will enable extraction of correct line locations in preamble patch for includes. Reviewers: sammccall Reviewed By: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78740	2020-05-06 17:57:03 +02:00
Matt Arsenault	59bc99a08a	InstCombine: Fix return after else	2020-05-06 11:53:26 -04:00
Louis Dionne	89bb9f8d78	[libc++] Make sure the cin/wcin tests run on remote hosts When running on remote hosts, we need the whole `echo 123 \| %t.exe` command to run on the remote host. Thus, we need to escape the pipe to make sure the command is treated as `{ echo 123 \| %t.exe } > %t.out` instead of `{ echo 123 } \| %t.exe > %t.out`m where only `echo 123` is run on the remote host.	2020-05-06 11:33:13 -04:00
Louis Dionne	d98b9a4157	[libc++] NFC: Do not print the environment on remote hosts Running `export` when there is no environment variable to export will cause the environment on the remote host to be printed. We don't want that, so don't run any `export` command on the host when there's no env.	2020-05-06 11:33:13 -04:00
Michael Liao	6533c1da7f	Revert "[MIR] Fix a bug in MIR printer." This reverts commit `e38018b80d`.	2020-05-06 11:26:42 -04:00
Stanislav Mekhanoshin	54d6dfe996	[AMDGPU] Drop 16 bit subreg suffixes on print We do not want to break asm syntax. These suffixes are quite useful for debugging, so add an option to print them. Right now it is NFC. Differential Revision: https://reviews.llvm.org/D79435	2020-05-06 08:14:10 -07:00
Jay Foad	29067aac46	[AMDGPU] Don't implement GCNHazardRecognizer::PreEmitNoops(SUnit ) When called from the post-RA scheduler, hazards have already been handled by getHazardType returning NoopHazard, so PreEmitNoops always returns zero. Remove it. NFC. Historical note: PreEmitNoops was added to the hazard recognizer interface as an optional feature to support dispatch group formation on the POWER target: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20131202/197470.html So it seems right that we shouldn't need to implement it. We do still implement the other overload PreEmitNoops(MachineInstr ) because that is used by the PostRAHazardRecognizer pass. Differential Revision: https://reviews.llvm.org/D79476	2020-05-06 16:11:19 +01:00
Luís Marques	a3e6e624c7	[RISCV][NFC] Add more constant materialization tests This patch adds more constant materialization tests, focusing on cases where we could improve our materialization instruction sequences (particularly for RV64). Various of these cases will be improved upon in follow-up patches. Differential Revision: https://reviews.llvm.org/D79453	2020-05-06 16:06:16 +01:00
Melanie Blower	c355bec749	Add support for #pragma clang fp reassociate(on\|off) Reviewers: rjmccall, erichkeane, sepavloff Differential Revision: https://reviews.llvm.org/D78827	2020-05-06 08:05:44 -07:00
David Green	f5f83cf4df	[ARM] VMOVhr load -> vldr Much like the similar combine added recently for VMOVrh load, this adds a fold for VMOVhr load turning it into a vldr.f16 as opposed to a vldrh and vmov.f16. Differential Revision: https://reviews.llvm.org/D78714	2020-05-06 15:45:56 +01:00
Michael Liao	e38018b80d	[MIR] Fix a bug in MIR printer. - Need to skip the assignment of `ID`, which is used to index that two object arrays.	2020-05-06 10:33:45 -04:00
Ram Nalamothu	f7060f4f88	For PAL, make sure Scratch Buffer Descriptor do not clobber GIT pointer Since SRSRC has alignment requirements, first find non GIT pointer clobbered registers for SRSRC and then if those registers clobber preloaded Scratch Wave Offset register, copy the Scratch Wave Offset register to a free SGPR.	2020-05-06 10:31:15 -04:00
Sanjay Patel	2f1fe1864d	[DAGCombiner] sink target-supported FP<->int cast op after concat vectors Try to combine N short vector cast ops into 1 wide vector cast op: concat (cast X), (cast Y)... -> cast (concat X, Y...) This is part of solving PR45794: https://bugs.llvm.org/show_bug.cgi?id=45794 As noted in the code comment, this is uglier than I was hoping because the opcode determines whether we pass the source or destination type to isOperationLegalOrCustom(). Also IIUC, there's no way to validate what the other (dest or src) type is. Without the extra legality check on that, there's an ARM regression test in: test/CodeGen/ARM/isel-v8i32-crash.ll ...that will crash trying to lower an unsupported v8f32 to v8i16. Differential Revision: https://reviews.llvm.org/D79360	2020-05-06 10:25:58 -04:00
Simon Pilgrim	f5f7fd990e	[X86][SSE] combineX86ShuffleChain - remove unused shuffle(vzext_load(),undef) combine. This should always be caught by the various VZEXT_MOVL handling in combineTargetShuffle and SimplifyDemandedVectorEltsForTargetNode.	2020-05-06 15:20:29 +01:00
Matt Arsenault	074c371a48	AMDGPU: Insert kernarg code after allocas This produces more normal looking IR by keeping all the allocas clustered at the start of the block.	2020-05-06 10:19:56 -04:00
David Green	d05f8a38c5	[ARM] VMOVrh of VMOVhr A VMOVhr of a VMOVrh can be simply folded to the original HPR value. Differential Revision: https://reviews.llvm.org/D78710	2020-05-06 15:10:01 +01:00
Louis Dionne	c82f9eba4a	[libc++] Fix broken modules tests on single-threaded systems Since `c0cd106fcc`, we add __config_site macro defines to the compiler command line whether we are building with modules or not. This means that the modules tests are expected to fail on single-threaded systems whether we build with modules or not.	2020-05-06 09:59:50 -04:00
Sanjay Patel	e3eb297deb	[VectorCombine] add tests for possible scalarization; NFC	2020-05-06 09:58:27 -04:00
Erich Keane	8a1c999c9b	Implement _ExtInt ABI for all ABIs in Clang, enable type for ABIs This is the result of an audit of all of the ABIs in clang to implement and enable the type for those targets. Additionally, this finds an issue with integer-promotion passing for a few platforms when using _ExtInt of < int, so this also corrects that resulting in signext/zeroext being on a params of those types in some platforms. Differential Revisions: https://reviews.llvm.org/D79118	2020-05-06 06:52:18 -07:00
David Green	a349949f8a	[ARM] Extract from a VDUP If we get into the situation where we are extracting from a VDUP, the extracted value is just the origin, so long as the types match or we can bitcast between the two. Differential Revision: https://reviews.llvm.org/D78708	2020-05-06 14:51:25 +01:00
Adam Czachorowski	319787315d	[clangd] Do not offer "Add using" tweak in header files. Reviewers: sammccall Reviewed By: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79488	2020-05-06 15:50:54 +02:00
Renato Golin	5010b5b7e6	Check type for forward reference definition The types of forward references are checked that they match with other uses, but they do not check they match with the definition. func @forward_reference_type_check() -> (i8) { br ^bb2 ^bb1: return %1 : i8 ^bb2: %1 = "bar"() : () -> (f32) br ^bb1 } Would be parsed and the use site of '%1' would be silently changed to 'f32'. This commit adds a test for this case, and a check during parsing for the types to match. Patch by Matthew Parkinson <mattpark@microsoft.com> Closes D79317.	2020-05-06 14:34:18 +01:00
David Green	ed7db68c35	[ARM] Convert a bitcast VDUP to a VDUP The idea, under MVE, is to introduce more bitcasts around VDUP's in an attempt to get the type correct across basic block boundaries. In order to do that without other regressions we need a few fixups, of which this is the first. If the code is a bitcast of a VDUP, we can convert that straight into a VDUP of the new type, so long as they have the same size. Differential Revision: https://reviews.llvm.org/D78706	2020-05-06 14:14:21 +01:00
Alexandre Ganea	06591b6d19	[Debug][CodeView] Emit fully qualified names for globals Emit S_[L\|G][THREAD32\|DATA32] records with a fully qualified name (namespace + class scope). Differential Revision: https://reviews.llvm.org/D79447	2020-05-06 09:12:00 -04:00
Alexandre Ganea	c503d97d19	[Support] Silence warning: comparison of integers of different signs: 'const int' and 'const unsigned long'	2020-05-06 09:12:00 -04:00
Alexandre Ganea	db817d15d1	[InstrProf] Silence warnings when targeting x86 with VS2019 16.5.4 Differential Revision: https://reviews.llvm.org/D79337	2020-05-06 09:12:00 -04:00
Alexandre Ganea	3483cdc834	[Sema] Silence warnings when targeting x86 with VS2019 16.5.4 Differential Revision: https://reviews.llvm.org/D79337	2020-05-06 09:11:59 -04:00
Nicolas Vasilache	94438c86ad	[mlir] Add a MemRefCastOp canonicalization pattern. Summary: This revision adds a conservative canonicalization pattern for MemRefCastOp that are typically inserted during ViewOp and SubViewOp canonicalization. Ideally such canonicalizations would propagate the type to consumers but this is not a local behavior. As a consequence MemRefCastOp are introduced to keep type compatibility but need to be cleaned up later, in the case where more dynamic behavior than necessary is introduced. Differential Revision: https://reviews.llvm.org/D79438	2020-05-06 09:10:05 -04:00
Simon Pilgrim	8650b36935	[X86][SSE] Move VZEXT_MOVL removal into SimplifyDemandedVectorEltsForTargetNode This patch replaces the VZEXT_MOVL removal from combineShuffle with a more general version based in SimplifyDemandedVectorEltsForTargetNode. By using computeKnownBits we can always remove the VZEXT_MOVL if the upper elements of the source operand are known to be zero. This requires us to add the conversion ops to computeKnownBitsForTargetNode as well. Reviewed By: @craig.topper Differential Revision: https://reviews.llvm.org/D79335	2020-05-06 14:05:07 +01:00
Simon Pilgrim	1c4f118d89	[X86][SSE] getShuffleScalarElt - minor NFC cleanup. Use SelectionDAG::MaxRecursionDepth instead of (equal) hard coded constant. clang-format	2020-05-06 14:05:07 +01:00
David Spickett	055ea585c7	Reland "[CodeGen] Make logic of CCState::resultsCompatible clearer" This relands commit `d782d1f898`. With a typo fixed, which was causing the x86 test failure.	2020-05-06 13:40:49 +01:00

1 2 3 4 5 ...

353551 Commits All Branches Search

353551 Commits

All Branches