llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Parzyszek	a2dc19b81b	[Hexagon] Return scalar size in getMinVectorRegisterBitWidth() when no HVX This fixes https://llvm.org/PR47128.	2020-08-12 10:13:58 -05:00
Anna Welker	4fe5615eab	[ARM][MVE] Enable tail predication for loops containing MVE gather/scatters Widen the scope of memory operations that are allowed to be tail predicated to include gathers and scatters, such that loops that are auto-vectorized with the option -enable-arm-maskedgatscat (and actually end up containing an MVE gather or scatter) can be tail predicated. Differential Revision: https://reviews.llvm.org/D85138	2020-08-12 15:32:37 +01:00
Zurab Tsinadze	25bbe234e4	[analyzer] StdLibraryFunctionsChecker: Add support for new functions `toupper`, `tolower`, `toascii` functions were added to StdLibraryFunctionsChecker to fully cover CERT STR37-C rule: https://wiki.sei.cmu.edu/confluence/x/BNcxBQ Differential Revision: https://reviews.llvm.org/D85093	2020-08-12 16:20:00 +02:00
Alexey Bataev	ddbd21d288	[OPENMP]Do not add TGT_OMP_TARGET_PARAM flag to non-captured mapped arguments. If the arguments are mapped, but are actually not used in the target region, the compiler still adds attribute TGT_OMP_TARGET_PARAM for such arguments. This makes libomptarget add such parameters to the list of arguments passed to the kernel at runtime, which may lead to incorrect results or crashes during execution. Differential Revision: https://reviews.llvm.org/D85755	2020-08-12 10:06:52 -04:00
Matt Arsenault	e14474a39a	AMDGPU/GlobalISel: Select llvm.amdgcn.global.atomic.fadd Remove the intermediate transform in the DAG path. I believe this is the last non-deprecated intrinsic that needs handling.	2020-08-12 10:04:53 -04:00
Matt Arsenault	701228c411	AMDGPU: Handle intrinsics in performMemSDNodeCombine This avoids a possible regression in a future patch	2020-08-12 10:04:53 -04:00
Alexey Bataev	3651658bdd	Revert "[OPENMP]Fix PR37671: Privatize local(private) variables in untied tasks." This reverts commit `ec9563c54e` to investigate compiler crash revealed by the buildbots.	2020-08-12 09:50:32 -04:00
Xing GUO	e891b6a75d	[DWARFYAML] Make the address size of compilation units optional. This patch makes the 'AddrSize' field optional. If the address size is missing, yaml2obj will infer it from the object file. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85805	2020-08-12 21:47:32 +08:00
Xing GUO	386d5af04b	[MachOYAML] Simplify the section data emitting function. NFC. This patch helps simplify some code in the writeSectionData() function. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D85821	2020-08-12 21:46:43 +08:00
Alexey Bataev	ec9563c54e	[OPENMP]Fix PR37671: Privatize local(private) variables in untied tasks. Summary: In untied tasks, need to allocate the space for local variables, declared in task region, when the memory for task data is allocated. The function can be interrupted and we can exit from the function in untied task switch. Need to keep the state of the local variables in this case. Also, the compiler should not call cleanup when exiting in untied task switch until the real exit out of the declaration scope is met during execution. Reviewers: jdoerfert Subscribers: yaxunl, guansong, cfe-commits, sstefan1, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D84457	2020-08-12 09:37:24 -04:00
Erich Keane	aa4bc1cb79	Limit Max Vector alignment on COFF targets to 8192. COFF targets have a max object alignment of 8192, so trying to create one with a larger size results in an unreachable in WinCOFFObjectWriter. The reproducer I have uses thread local storage; however, other alignments are likely affected as well. This patch sets the MaxVectorAlign for COFF to 8192. Additionally, though there is no longer a way to reproduce that I could find, it correctly sets the MaxTLSAlign for COFF to that value as well, so that if anyone comes up with a situation where this is true, it will cause an error. Differential Revision: https://reviews.llvm.org/D85543	2020-08-12 06:35:35 -07:00
Sanjay Patel	cc892fd9f4	[VectorCombine] early exit if target has no vector registers Based on post-commit discussion in: D81766 Other vectorization passes (SLP and Loop) use this TTI API similarly.	2020-08-12 09:22:31 -04:00
Sanjay Patel	89a7f64afc	[VectorCombine] add test for x86 target with SSE disabled; NFC	2020-08-12 09:22:31 -04:00
David Green	e859868eb3	[ARM] Add additional predicated VFMA tests. NFC	2020-08-12 14:20:20 +01:00
Sanjay Patel	912c09e845	[InstCombine] eliminate a pointer cast around insertelement I'm not sure if this solves PR46839 completely, but reducing the casting should help: https://bugs.llvm.org/show_bug.cgi?id=46839 Differential Revision: https://reviews.llvm.org/D85647	2020-08-12 09:08:17 -04:00
Sanjay Patel	b97e402ca5	[VectorCombine] add test for Hexagon that would crash; NFC This test verifies the code change from: rGb0b95dab1ce2 (although that would not be true if PR47128 is fixed)	2020-08-12 08:38:20 -04:00
Kai Nacke	bca1b8ed99	[SystemZ/ZOS] Implement computeHostNumPhysicalCores On z/OS, the information is stored in the Common System Data Area (CSD). It is the number of CPs allocated to the current LPAR. Reviewers: aganea, hubert.reinterpertcast, MaskRay Reviewed By: hubert.reinterpertcast Differential Revision: https://reviews.llvm.org/D85531	2020-08-12 08:31:33 -04:00
Sam Parker	ea8448e361	[LoopUnroll] Adjust CostKind query When TTI was updated to use an explicit cost, TCK_CodeSize was used although the default implicit cost would have been the hand-wavey cost of size and latency. So, revert back to this behaviour. This is not expected to have (much) impact on targets since most (all?) of them return the same value for SizeAndLatency and CodeSize. When optimising for size, the logic has been changed to query CodeSize costs instead of SizeAndLatency. This patch also adds a testing option in the unroller so that OptSize thresholds can be specified. Differential Revision: https://reviews.llvm.org/D85723	2020-08-12 12:56:09 +01:00
Raphael Isemann	cff880b0c9	Revert "[lldb] Display autosuggestion part in gray if there is one possible suggestion" This reverts commit `246afe0cd1`. This broke the following tests on Linux it seems: lldb-api :: commands/expression/multiline-completion/TestMultilineCompletion.py lldb-api :: iohandler/completion/TestIOHandlerCompletion.py	2020-08-12 13:52:03 +02:00
David Green	fccf4c6115	[ARM] Commutative vmin/maxnma tests. NFC	2020-08-12 12:50:18 +01:00
Bogdan Serea	35bee3503f	[clang-tidy] prevent generated checks from triggering assertions on anonymous functions Skeleton checks generated by clang-tidy add_check.py cause assertions to fail when run over anonymous functions (lambda functions). This patch introduces an additional check to verify that the target function is not anonymous before calling getName(). The code snippet from the [[ https://clang.llvm.org/extra/clang-tidy/Contributing.html \| clang-tidy tutorial ]] is also updated. Reviewed By: alexfh, DavidTruby Differential Revision: https://reviews.llvm.org/D85218	2020-08-12 12:43:40 +01:00
Simon Pilgrim	9bd97d0363	[X86][SSE] Fold HOP(SHUFFLE(X),SHUFFLE(Y)) --> SHUFFLE(HOP(X,Y)) This is beginning to look like a canonicalization stage that could be performed as part of shuffle combining Another step towards PR41813	2020-08-12 12:16:36 +01:00
Shu Anzai	246afe0cd1	[lldb] Display autosuggestion part in gray if there is one possible suggestion I implemented autosuggestion if there is one possible suggestion. I set the keybinds for every character. When a character is typed, Editline::TypedCharacter is called. Then, autosuggestion part is displayed in gray, and you can actually input by typing C-k. Editline::Autosuggest is a function for finding completion, and it is like Editline::TabCommand now, but I will add more features to it. Testing does not work well in my environment, so I can't confirm that it goes well, sorry. I am dealing with it now. Reviewed By: teemperor, JDevlieghere, #lldb Differential Revision: https://reviews.llvm.org/D81001	2020-08-12 13:11:20 +02:00
Alex Zinenko	321aa19ec8	[mlir] Expose printing functions in C API Provide printing functions for most IR objects in C API (except Region that does not have a `print` function, and Module that is expected to be printed as Operation instead). The printing is based on a callback that is called with chunks of the string representation and forwarded user-defined data. Reviewed By: stellaraccident, Jing, mehdi_amini Differential Revision: https://reviews.llvm.org/D85748	2020-08-12 13:07:34 +02:00
Georgii Rymar	3b0a4e9584	[llvm-readobj] - Refine logic of the symbol table locating in printRelocationsHelper(). This removes the last `unwrapOrError` call from the `printRelocationsHelper`. There is a little additional complexity because of `SHT_RELR/SHT_ANDROID_RELR` sections. Such sections contain only relative relocations and they do not have a symbol table associated with them, hence we should not try to treat their `sh_link` field as a reference to a symbol table. Differential revision: https://reviews.llvm.org/D85430	2020-08-12 14:03:56 +03:00
Simon Pilgrim	a0c2c6aa42	[X86][AVX] Fold CONCAT(HOP(X,Y),HOP(Z,W)) -> HOP(CONCAT(X,Z),CONCAT(Y,W)) for float types Only do this for AVX2+ targets as we still get some regressions on AVX1 without PERMPD/PERMQ	2020-08-12 11:31:05 +01:00
Raphael Isemann	dd0fdf8030	[lldb] Add support for checking children in expect_expr expect_expr currently can't verify the children of the result SBValue. This patch adds the ability to check them. The idea is to have a CheckValue class where one can specify what attributes of a SBValue should be checked. Beside the properties we already check for (summary, type, etc.) this also has a list of children which is again just a list of CheckValue objects (which can also have children of their own). The main motivation is to make checking the children no longer based on error-prone substring checks that allow tests to pass just because for example the error message contains the expected substrings by accident. I also expect that we can just have a variant of `expect_expr` for LLDB's expression paths (aka 'frame var') feature. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D83792	2020-08-12 12:11:24 +02:00
Cullen Rhodes	511d5aaca3	[Transforms][SROA] Skip uses of allocas where the type is scalable When visiting load and store instructions in SROA skip scalable vectors. This is relevant in the implementation of the 'arm_sve_vector_bits' attribute that is used to define VLS types, where an alloca of a fixed-length vector could be bitcasted to scalable. See D85128 for more information. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D85725	2020-08-12 09:35:48 +00:00
Mehdi Amini	7b18716361	Add missing dependency on Doc generation for the OpenMP dialect This is fixing the bot building the MLIR website.	2020-08-12 09:12:15 +00:00
Alex Zinenko	af838584ec	[mlir] use intptr_t in C API Using intptr_t is a consensus for MLIR C API, but the change was missing from `75f239e975` (that was using unsigned initially) due to a misrebase. Reviewed By: stellaraccident, mehdi_amini Differential Revision: https://reviews.llvm.org/D85751	2020-08-12 11:11:25 +02:00
Florian Hahn	e441b7a7a0	[SCEV] Look through single value PHIs. Now that SCEVExpander can preserve LCSSA form, we do not have to worry about LCSSA form when trying to look through PHIs. SCEVExpander will take care of inserting LCSSA PHI nodes as required. This increases precision of the analysis in some cases. Reviewed By: mkazantsev, bmahjour Differential Revision: https://reviews.llvm.org/D71539	2020-08-12 10:03:42 +01:00
Igor Kudrin	9ceb192e14	[llvm-dwarfdump] Avoid crashing if an abbreviation offset is invalid. Note that DWARFUnit::getAbbreviations() returns nullptr if the abbreviations could not be read, but callers used the returned pointer without checking. Differential Revision: https://reviews.llvm.org/D85738	2020-08-12 16:01:53 +07:00
Sjoerd Meijer	6716e7868e	[ARM][MVE] tail-predication: overflow checks for backedge taken count. This picks up the work on the overflow checks for get.active.lane.mask, which ensure that it is safe to insert the VCTP intrinsic that enables tail-predication. For a 2d auto-correlation kernel and its inner loop j: M = Size - i; for (j = 0; j < M; j++) Sum += Input[j] * Input[j+i]; For this inner loop, the SCEV backedge taken count (BTC) expression is: {(-1 + (sext i16 %Size to i32)),+,-1}<nw><%for.body> and LoopUtil cannotBeMaxInLoop couldn't calculate a bound on this, thus "BTC cannot be max" could not be determined. So overflow behaviour had to be assumed in the loop tripcount expression that uses the BTC. As a result tail-predication had to be forced (with an option) for this case. This change solves that by using ScalarEvolution's helper getConstantMaxBackedgeTakenCount which is able to determine the range of BTC, thus can determine it is safe, so that we no longer need to force tail-predication as reflected in the changed test cases. Differential Revision: https://reviews.llvm.org/D85737	2020-08-12 09:32:26 +01:00
Eduardo Caldas	ac37afa650	[SyntaxTree] Unbox operators into tokens for nodes generated from `CXXOperatorCallExpr` For a user-defined `<`, `x < y` would yield the syntax tree: ``` BinaryOperatorExpression \|-IdExpression \| `-UnqualifiedId \| `-x \|-IdExpression \| `-UnqualifiedId \| `-< `-IdExpression `-UnqualifiedId `-y ``` But there is no syntactic difference at the call site between a user-defined and a built-in `<`. As such they should generate the same syntax tree, namely: ``` BinaryOperatorExpression \|-IdExpression \| `-UnqualifiedId \| `-x \|-< `-IdExpression `-UnqualifiedId `-y ``` Differential Revision: https://reviews.llvm.org/D85750	2020-08-12 08:01:18 +00:00
David Sherwood	88bbd30736	[SVE][CodeGen] Fix issues with EXTRACT_SUBVECTOR when using scalable FP vectors In this patch I have fixed two issues: 1. Our SVE tuple get/set intrinsics were using the wrong constant type for the index passed to EXTRACT_SUBVECTOR. I have fixed this by using the function SelectionDAG::getVectorIdxConstant to create the value. Also, I have updated the documentation for EXTRACT_SUBVECTOR describing what type the constant index should be and we now enforce this when creating the node. 2. The AArch64 backend was missing the appropriate patterns for extracting certain subvectors (nxv4f16 and nxv2f32) from legal SVE types. I have added them as part of this patch. The only way that I could find to test the new patterns was to use the SVE tuple get intrinsics, although I realise it looks a bit unusual. Tests added here: test/CodeGen/AArch64/sve-extract-subvector.ll Differential Revision: https://reviews.llvm.org/D85516	2020-08-12 08:35:46 +01:00
Kazushi (Jam) Marukawa	5d549219df	[VE] Change to promote i32 AND/OR/XOR operations VE has only 64-bit AND/OR/XOR instructions. We pretended that VE also has 32-bit instructions, but doing so increases the number of generated instructions. Therefore, we promote 32-bit operations and use only 64-bit instructions in the back end. We also stop pretending that VE has a 32-bit LEA instruction. Update regression tests as well. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D85726	2020-08-12 16:23:50 +09:00
Siva Chandra	a87576592c	[libc][obvious] Switch copysign[f\|l] and fabs[f\|l] to the new test matchers.	2020-08-12 00:20:58 -07:00
Haojian Wu	dc7b1e9db5	[AST] Fix the CXXFoldExpr source range when parentheses range is invalid. The CXXFoldExpr's range is invalid if the cxxfoldexpr is formed via the Concept's TypeContraints (because the parentheses are not written in the source code). We fall back to using the range from the pattern. Differential Revision: https://reviews.llvm.org/D85645	2020-08-12 09:20:23 +02:00
Kiran Chandramohan	e6c5e6efd0	[MLIR,OpenMP] Lowering of parallel operation: proc_bind clause 2/n This patch adds the translation of the proc_bind clause in a parallel operation. The values that can be specified for the proc_bind clause are specified in the OMP.td tablegen file in the llvm/Frontend/OpenMP directory. From this single source of truth enumeration for proc_bind is generated in llvm and mlir (used in specification of the parallel Operation in the OpenMP dialect). A function to return the enum value from the string representation is also generated. A new header file (DirectiveEmitter.h) containing definitions of classes directive, clause, clauseval etc is created so that it can be used in mlir as well. Reviewers: clementval, jdoerfert, DavidTruby Differential Revision: https://reviews.llvm.org/D84347	2020-08-12 08:03:13 +01:00
Craig Topper	6b3dc96e59	[X86][GlobalISel] Replace a misuse of SUBREG_TO_REG with INSERT_SUBREG. SUBREG_TO_REG is supposed to be used when we know the producing instruction already zeroed the bits we're extending. But that's not the case here. So INSERT_SUBREG with an IMPLICIT_DEF is the correct thing to use.	2020-08-11 23:51:02 -07:00
George Mitenkov	2ad7e1a301	[MLIR][SPIRVToLLVM] Conversion for global and addressof Initial conversion of `spv._address_of` and `spv.globalVariable`. In SPIR-V, the global returns a pointer, whereas in LLVM dialect the global holds an actual value. This difference is handled by `spv._address_of` and `llvm.mlir.addressof` ops that both return a pointer. Moreover, only current invocation is in conversion's scope. Reviewed By: antiagainst, mravishankar Differential Revision: https://reviews.llvm.org/D84626	2020-08-12 09:41:14 +03:00
Siva Chandra	01b99c6e1d	[libc][obvious] Switch nearest integer function tests to the new matchers.	2020-08-11 23:33:15 -07:00
Kyungwoo Lee	d73be5af0a	[NFC] Factor out hasForceAttributes This is a preparation for https://reviews.llvm.org/D85586. Differential Revision: https://reviews.llvm.org/D85793	2020-08-12 02:16:57 -04:00
Johannes Doerfert	3a033921ed	[Attributor][NFC] Reformat tests after D85099 Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D85700	2020-08-12 01:04:19 -05:00
Johannes Doerfert	97ce7fd89f	[UpdateTestChecks] Match unnamed values like "@[0-9]+" and "![0-9]+" With this patch we will match most uses of "temporary" named things in the IR via regular expressions, not their name at creation time. The new "values" we match are: - "unnamed" globals: `@[0-9]+` - debug metadata: `!dbg ![0-9]+` - loop metadata: `!loop ![0-9]+` - tbaa metadata: `!tbaa ![0-9]+` - range metadata: `!range ![0-9]+` - generic metadata: `metadata ![0-9]+` - attributes groups: `#[0-9]` We still don't match the declarations but that can be done later. This patch can introduce churn when existing check lines contain the old hardcoded versions of the above "values". We can add a flag to opt-out, or opt-in, if necessary. Reviewed By: arichardson, MaskRay Differential Revision: https://reviews.llvm.org/D85099	2020-08-12 01:04:16 -05:00
Petr Hosek	31e5f7120b	[CMake] Simplify CMake handling for zlib Rather than handling zlib manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of `abb0075` with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-11 20:22:11 -07:00
Jordan Rupprecht	1a67522d3e	[NFC] Inline variable only used in debug builds	2020-08-11 19:38:01 -07:00
Sanjay Patel	b0b95dab1c	[VectorCombine] add safety check for 0-width register Based on post-commit discussion in D81766, Hexagon sets this to "0". I'll see if I can come up with a test, but making the obvious code fix first to unblock that target.	2020-08-11 20:30:02 -04:00
Thomas Lively	2985c02f79	[WebAssembly][AsmParser] Name missing features in error message Rather than just saying that some feature is missing, report the exact features to make the error message more useful and actionable. Differential Revision: https://reviews.llvm.org/D85795	2020-08-11 17:26:14 -07:00
Dávid Bolvanský	b9af72bffe	[Diagnostics] Reworked -Wstring-concatenation	2020-08-12 02:18:01 +02:00
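
Commit 97ce7fd89f in the list above describes matching "temporary" named IR values by regular expression instead of by their name at creation time. The following is a minimal sketch of that idea in Python, using only the patterns quoted in the commit message. The dictionary layout and the function name `find_unnamed_values` are illustrative assumptions and are not taken from the actual UpdateTestChecks scripts.

```python
import re

# Patterns copied verbatim from the commit message of 97ce7fd89f.
# The labels, the dict layout, and the helper below are hypothetical;
# the real UpdateTestChecks implementation may be organized differently.
UNNAMED_VALUE_PATTERNS = {
    "unnamed global": r"@[0-9]+",
    "debug metadata": r"!dbg ![0-9]+",
    "loop metadata": r"!loop ![0-9]+",
    "tbaa metadata": r"!tbaa ![0-9]+",
    "range metadata": r"!range ![0-9]+",
    "generic metadata": r"metadata ![0-9]+",
    "attribute group": r"#[0-9]",
}


def find_unnamed_values(ir_line):
    """Return (kind, matched_text) pairs found in one line of IR text."""
    hits = []
    for kind, pattern in UNNAMED_VALUE_PATTERNS.items():
        for match in re.finditer(pattern, ir_line):
            hits.append((kind, match.group(0)))
    return hits


if __name__ == "__main__":
    # Example IR-like line, invented for illustration.
    for kind, text in find_unnamed_values("call void @0(), !dbg !12"):
        print(kind, text)
```

A check-line generator could replace each hit with a named capture so that renumbering of globals or metadata does not invalidate existing checks. As the commit message notes, this can cause churn where existing check lines hardcode the old numbers.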


363282 Commits, All Branches