llvm-project

Commit Graph

Author	SHA1	Message	Date
Yaxun (Sam) Liu	369e26ca9e	[AMDGPU] Add __builtin_amdgcn_workgroup_size_x/y/z The main purpose of introducing these builtins is to add a range metadata [1, 1025) on the work group size loaded from dispatch ptr, which cannot be done by source code. Differential Revision: https://reviews.llvm.org/D76772	2020-03-28 01:03:20 -04:00
Richard Smith	499b2a8d63	PR45294: Fix handling of assumed template names looked up in the lexical scope. There are a few contexts in which we assume a name is a template name; if such a context is one where we should perform an unqualified lookup, and lookup finds nothing, we would form a dependent template name even if the name is not dependent. This happens in particular for the lookup of a pseudo-destructor. In passing, rename ActOnDependentTemplateName to just ActOnTemplateName given that we apply it for non-dependent template names too.	2020-03-27 21:07:06 -07:00
Richard Smith	88c7ffaf94	Form invalid template-id annotations when parsing a construct that is required to be a template-id but names an undeclared identifier.	2020-03-27 20:27:42 -07:00
Richard Smith	0c42539df3	Improve error recovery from missing '>' in template argument list. Produce the conventional "to match this '<'" note, so that the user knows why we expected a '>', and properly handle '>>' in C++11 onwards.	2020-03-27 18:59:01 -07:00
Richard Smith	b3f6e3d6d6	Improve recovery from invalid template-ids. Instead of bailing out of parsing when we encounter an invalid template-name or template arguments in a template-id, produce an annotation token describing the invalid construct. This avoids duplicate errors and generally allows us to recover better. In principle we should be able to extend this to store some kinds of invalid template-id in the AST for tooling use, but that isn't handled as part of this change.	2020-03-27 17:11:04 -07:00
Sam McCall	d68c09ac87	[AST] Add a Dependence bitmask to use for calculations with multiple node types. Summary: This makes it easier/safer to add bits (error) to other node types without worrying about bit layout all the time. For now, just use to implement the ad-hoc conversion functions. Next: remove these functions and use this directly. Reviewers: hokein Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76939	2020-03-28 00:15:50 +01:00
Sam McCall	94938d7d41	[Syntax] Prevent (accidentally) copying TokenBuffer	2020-03-28 00:09:09 +01:00
Alexey Bataev	0fca766458	[OPENMP50]Fix PR45117: Orphaned task reduction should be allowed. Add support for orphaned task reductions.	2020-03-27 17:47:30 -04:00
Sam McCall	6b3bedec99	Add BitWidth trait to BitmaskEnum, and use for clang DependenceFlags. NFC Reviewers: hokein Subscribers: dexonsmith, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76942	2020-03-27 22:40:21 +01:00
Michael Liao	5be9b8cbe2	[cuda][hip] Add CUDA builtin surface/texture reference support. Summary: - Re-commit after fix Sema checks on partial template specialization. Reviewers: tra, rjmccall, yaxunl, a.sidorin Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76365	2020-03-27 17:18:49 -04:00
Alexey Bataev	49764dc30c	[OPENMP50]Add basic support for inscan reduction modifier. Added basic support (parsing/sema checks) for the inscan modifier in the reduction clauses.	2020-03-27 13:54:38 -04:00
Artem Belevich	fe8063e1a0	Revert "[cuda][hip] Add CUDA builtin surface/texture reference support." This reverts commit `6a9ad5f3f4`. The patch breaks CUDA compilation. Differential Revision: https://reviews.llvm.org/D76365	2020-03-27 10:01:38 -07:00
Alexey Bataev	ee27df5552	Revert "[OPENMP50]Add basic support for inscan reduction modifier." This reverts commit `36ed0ceec7` to fix a crash in scan_messages.cpp test.	2020-03-27 11:25:47 -04:00
Alexey Bataev	36ed0ceec7	[OPENMP50]Add basic support for inscan reduction modifier. Added basic support (parsing/sema checks) for the inscan modifier in the reduction clauses.	2020-03-27 10:38:25 -04:00
Yannic Bonenberger	848112cca4	Simplify implementation of Type::isXXXType(); NFC	2020-03-27 10:24:59 -04:00
Kirstóf Umann	bda3dd0d98	[analyzer][NFC] Change LangOptions to CheckerManager in the shouldRegister* functions Some checkers may not only depend on language options but also analyzer options. To make this possible this patch changes the parameter of the shouldRegister* function to CheckerManager to be able to query the analyzer options when deciding whether the checker should be registered. Differential Revision: https://reviews.llvm.org/D75271	2020-03-27 14:34:09 +01:00
Johannes Doerfert	befb4be3a8	[OpenMP] `omp begin/end declare variant` - part 2, sema ("+CG") This is the second part loosely extracted from D71179 and cleaned up. This patch provides semantic analysis support for `omp begin/end declare variant`, mostly as defined in OpenMP technical report 8 (TR8) [0]. The sema handling makes code generation obsolete as we generate "the right" calls that can just be handled as usual. This handling also applies to the existing, albeit problematic, `omp declare variant` support. As a consequence a lot of unneeded code generation and complexity is removed. A major purpose of this patch is to provide proper `math.h`/`cmath` support for OpenMP target offloading. See PR42061, PR42798, PR42799. The current code was developed with this feature in mind, see [1]. The logic is as follows: If we have seen a `#pragma omp begin declare variant match(<SELECTOR>)` but not the corresponding `end declare variant`, and we find a function definition we will: 1) Create a function declaration for the definition we were about to generate. 2) Create a function definition but with a mangled name (according to `<SELECTOR>`). 3) Annotate the declaration with the `OMPDeclareVariantAttr`, the same one used already for `omp declare variant`, using the mangled function definition as specialization for the context defined by `<SELECTOR>`. When a call is created we inspect it. If the target has an `OMPDeclareVariantAttr` attribute we try to specialize the call. To this end, all variants are checked, the best applicable one is picked and a new call to the specialization is created. The new call is used instead of the original one to the base function. To keep the AST printing and tooling possible we utilize the PseudoObjectExpr. The original call is the syntactic expression, the specialized call is the semantic expression. 
[0] https://www.openmp.org/wp-content/uploads/openmp-TR8.pdf [1] https://reviews.llvm.org/D61399#change-496lQkg0mhRN Reviewers: kiranchandramohan, ABataev, RaviNarayanaswamy, gtbercea, grokos, sdmitriev, JonChesterfield, hfinkel, fghanim, aaron.ballman Subscribers: bollu, guansong, openmp-commits, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75779	2020-03-27 02:30:58 -05:00
Johannes Doerfert	095cecbe0d	[OpenMP] `omp begin/end declare variant` - part 1, parsing This is the first part extracted from D71179 and cleaned up. This patch provides parsing support for `omp begin/end declare variant`, as defined in OpenMP technical report 8 (TR8) [0]. A major purpose of this patch is to provide proper math.h/cmath support for OpenMP target offloading. See PR42061, PR42798, PR42799. The current code was developed with this feature in mind, see [1]. [0] https://www.openmp.org/wp-content/uploads/openmp-TR8.pdf [1] https://reviews.llvm.org/D61399#change-496lQkg0mhRN Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D74941	2020-03-27 02:30:58 -05:00
Johannes Doerfert	56d1553dd2	[OpenMP][NFC] Outline common functionality (skipUntilPragmaOpenMPEnd) The same code was repeated multiple times, we put it in a function now.	2020-03-27 02:30:57 -05:00
Sid Manning	b0da094983	[Hexagon] Add support for Linux/Musl ABI (part 2) A continuation of https://reviews.llvm.org/D72701. This adds support needed in clang. Differential Revision: https://reviews.llvm.org/D75638	2020-03-26 17:19:46 -05:00
Alexey Bataev	2a43a1610d	[OPENMP50]Fix the checks for the nesting of scan directives. Fixed the check for the orphaned scan directives and improved checks for parallel for and parallel for simd directives.	2020-03-26 17:30:02 -04:00
Alexey Bataev	f9e71f4d9d	Revert "[OPENMP50]Add basic support for inscan reduction modifier." This reverts commit `8099e0fe82` to fix the problems with the Windows-based buildbots.	2020-03-26 15:57:19 -04:00
Alexey Bataev	8099e0fe82	[OPENMP50]Add basic support for inscan reduction modifier. Added basic support (parsing/sema checks) for the inscan modifier in the reduction clauses.	2020-03-26 14:51:09 -04:00
Michael Liao	6a9ad5f3f4	[cuda][hip] Add CUDA builtin surface/texture reference support. Summary: - Even though the bindless surface/texture interfaces are promoted, there is still code using surface/texture references. For example, [PR#26400](https://bugs.llvm.org/show_bug.cgi?id=26400) reports the compilation issue for code using `tex2D` with texture references. For better compatibility, this patch proposes the support of surface/texture references. - Due to the absent documentation and magic headers, it's believed that `nvcc` does use builtins for texture support. From the limited NVVM documentation[^nvvm] and NVPTX backend texture/surface related tests[^test], it's believed that surface/texture references are supported by replacing their reference types, which are annotated with `device_builtin_surface_type`/`device_builtin_texture_type`, with the corresponding handle-like object types, `cudaSurfaceObject_t` or `cudaTextureObject_t`, in the device-side compilation. On the host side, the global handle variables are registered and will be established and updated later when corresponding binding/unbinding APIs are called[^bind]. Surface/texture references are most like device global variables but represented in different types on the host and device sides. - In this patch, the following changes are proposed to support that behavior: + Refine `device_builtin_surface_type` and `device_builtin_texture_type` attributes to be applied on `Type` decl only to check whether a variable is of the surface/texture reference type. + Add hooks in code generation to replace those reference types with the corresponding object types as well as all accesses to them. In particular, `nvvm.texsurf.handle.internal` should be used to load object handles from global reference variables[^texsurf] as well as metadata annotations. + Generate host-side registration with proper template argument parsing. 
--- [^nvvm]: https://docs.nvidia.com/cuda/pdf/NVVM_IR_Specification.pdf [^test]: https://raw.githubusercontent.com/llvm/llvm-project/master/llvm/test/CodeGen/NVPTX/tex-read-cuda.ll [^bind]: See section 3.2.11.1.2 `Texture reference API` in [CUDA C Programming Guide](https://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf). [^texsurf]: According to NVVM IR, `nvvm.texsurf.handle` should be used. But, the current backend doesn't have that supported. We may revise that later. Reviewers: tra, rjmccall, yaxunl, a.sidorin Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76365	2020-03-26 14:44:52 -04:00
Kirstóf Umann	4dc8472942	[analyzer] Add the Preprocessor to CheckerManager	2020-03-26 17:29:52 +01:00
Kirstóf Umann	0766d1dca8	Make a windows buildbot happy	2020-03-26 17:04:23 +01:00
Haojian Wu	62dea6e9be	Revert "[AST] Build recovery expressions by default for C++." This reverts commit `0788acbccb`. This reverts commit c2d7a1f79cedfc9fcb518596aa839da4de0adb69: Revert "[clangd] Add test for FindTarget+RecoveryExpr (which already works). NFC" It causes a crash on invalid code: class X { decltype(unresolved()) foo; }; constexpr int s = sizeof(X);	2020-03-26 16:25:32 +01:00
Kristóf Umann	2aac0c47ae	Reland "[analyzer][NFC] Tie CheckerRegistry to CheckerManager, allow CheckerManager to be constructed for non-analysis purposes" Originally committed in rG57b8a407493c34c3680e7e1e4cb82e097f43744a, but it broke the modules bot. This is solved by putting the constructors of the CheckerManager class to the Frontend library. Differential Revision: https://reviews.llvm.org/D75360	2020-03-26 16:12:38 +01:00
Sam McCall	159a9f7e76	[AST] Print a<b<c>> without extra spaces in C++11 or later. Summary: It's not 1998 anymore. Reviewers: kadircet Subscribers: jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76801	2020-03-26 09:53:54 +01:00
Alexander Belyaev	df48e3920a	[Clang] Fix clang-tidy errors.	2020-03-25 20:13:43 +01:00
Mikhail Maltsev	bb4da94e5b	[ARM,CDE] Implement predicated Q-register CDE intrinsics Summary: This patch implements the following CDE intrinsics: T __arm_vcx1q_m(int coproc, T inactive, uint32_t imm, mve_pred_t p); T __arm_vcx2q_m(int coproc, T inactive, U n, uint32_t imm, mve_pred_t p); T __arm_vcx3q_m(int coproc, T inactive, U n, V m, uint32_t imm, mve_pred_t p); T __arm_vcx1qa_m(int coproc, T acc, uint32_t imm, mve_pred_t p); T __arm_vcx2qa_m(int coproc, T acc, U n, uint32_t imm, mve_pred_t p); T __arm_vcx3qa_m(int coproc, T acc, U n, V m, uint32_t imm, mve_pred_t p); The intrinsics are not part of the released ACLE spec, but internally at Arm we have reached consensus to add them to the next ACLE release. Reviewers: simon_tatham, MarkMurrayARM, ostannard, dmgreen Reviewed By: simon_tatham Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76610	2020-03-25 17:08:19 +00:00
zoecarver	b915aec6b5	Add method to TargetInfo to get CPU cache line size Summary: This patch adds a virtual method `getCPUCacheLineSize()` to `TargetInfo`. Currently, I've only implemented the method in `X86TargetInfo`. It's extremely important that each CPU's cache line size is correct (e.g., we can't just define it as `64` across the board), so it has been a little slow getting to this point. I'll work on the ARM CPUs next, but that will probably come later in a different patch. Tags: #clang Differential Revision: https://reviews.llvm.org/D74918	2020-03-25 09:50:38 -07:00
Michael Kruse	7520cf03ee	[clang] Reformat cindex. NFC. to reduce spurious changes in patches after clang-formatting them. In particular, these files contain long enums that clang-format reformats in their entirety if e.g. an element is added. Reviews having this problem include https://reviews.llvm.org/D76342 and https://reviews.llvm.org/D71447.	2020-03-25 11:11:48 -05:00
Erich Keane	b5a034e771	[SYCL] Implement __builtin_unique_stable_name. In order to support non-user-named kernels, SYCL needs some way in the integration headers to name the kernel object themselves. Initially, the design considered just RTTI naming of the lambdas, this results in a quite unstable situation in light of some device/host macros. Additionally, this ends up needing to use RTTI, which is a burden on the implementation and typically unsupported. Instead, we've introduced a builtin, __builtin_unique_stable_name, which takes a type or expression, and results in a constexpr constant character array that uniquely represents the type (or type of the expression) being passed to it. The implementation accomplishes that simply by using a slightly modified version of the Itanium Mangling. The one exception is when mangling lambdas, instead of appending the index of the lambda in the function, it appends the macro-expansion back-trace of the lambda itself in the form LINE->COL[~LINE->COL...]. Differential Revision: https://reviews.llvm.org/D76620	2020-03-25 07:01:50 -07:00
Haojian Wu	0788acbccb	[AST] Build recovery expressions by default for C++. Update the existing tests. Reviewers: sammccall Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76696	2020-03-25 09:00:48 +01:00
Yaxun (Sam) Liu	2ae25647d1	[CUDA][HIP] Add -Xarch_device and -Xarch_host options The argument after -Xarch_device will be added to the arguments for CUDA/HIP device compilation and will be removed for host compilation. The argument after -Xarch_host will be added to the arguments for CUDA/HIP host compilation and will be removed for device compilation. Differential Revision: https://reviews.llvm.org/D76520	2020-03-24 10:13:05 -04:00
Sam McCall	a2aa9970e1	[AST] Use TypeDependence bitfield to calculate dependence on Types. NFC Summary: This clears the way for adding an Error dependence bit to Type and having it mostly-automatically propagated. Reviewers: hokein Subscribers: jfb, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76424	2020-03-24 13:56:38 +01:00
Russell Gallop	8fa322dd39	Increase DIAG_SIZE_DRIVER as we're close to hitting it	2020-03-24 11:45:53 +00:00
Momchil Velikov	080d046c91	[ARM][CMSE] Implement CMSE attributes This patch adds CMSE attributes `cmse_nonsecure_call` and `cmse_nonsecure_entry`. As usual, specification is available here: https://developer.arm.com/docs/ecm0359818/latest Patch by Javed Absar, Bradley Smith, David Green, Momchil Velikov, possibly others. Differential Revision: https://reviews.llvm.org/D71129	2020-03-24 10:21:26 +00:00
Haojian Wu	733edf9750	[AST] Add RecoveryExpr to retain expressions on semantic errors Normally clang avoids creating expressions when it encounters semantic errors, even if the parser knows which expression to produce. This works well for the compiler. However, this is not ideal for source-level tools that have to deal with broken code, e.g. clangd is not able to provide navigation features even for names that compiler knows how to resolve. The new RecoveryExpr aims to capture the minimal set of information useful for the tools that need to deal with incorrect code: source range of the expression being dropped, subexpressions of the expression. We aim to make constructing RecoveryExprs as simple as possible to ensure writing code to avoid dropping expressions is easy. Producing RecoveryExprs can result in new code paths being taken in the frontend. In particular, clang can produce some new diagnostics now and we aim to suppress bogus ones based on Expr::containsErrors. We deliberately produce RecoveryExprs only in the parser for now to minimize the code affected by this patch. Producing RecoveryExprs in Sema potentially allows to preserve more information (e.g. type of an expression), but also results in more code being affected. E.g. SFINAE checks will have to take presence of RecoveryExprs into account. Initial implementation only works in C++ mode, as it relies on compiler postponing diagnostics on dependent expressions. C and ObjC often do not do this, so they require more work to make sure we do not produce too many bogus diagnostics on the new expressions. See documentation of RecoveryExpr for more details. original patch from Ilya This change is based on https://reviews.llvm.org/D61722 Reviewers: sammccall, rsmith Reviewed By: sammccall, rsmith Tags: #clang Differential Revision: https://reviews.llvm.org/D69330	2020-03-24 09:20:37 +01:00
Alexey Bataev	1236eb6c31	[OPENMP50]Add 'default' modifier in reduction clauses. Added full support for 'default' modifier in the reduction clauses.	2020-03-23 18:18:08 -04:00
Richard Smith	502915c619	PR45142: 'template ~X<T>' is ill-formed; reject it rather than crashing.	2020-03-23 15:07:06 -07:00
Simon Pilgrim	1a4421a5e8	[analyzer] ConstraintManager - use EXPENSIVE_CHECKS instead of (gcc specific) __OPTIMIZE__ guard This was noticed on D71817, which removed another use of __OPTIMIZE__ Differential Revision: https://reviews.llvm.org/D76622	2020-03-23 21:03:14 +00:00
Kirstóf Umann	7bf871c39f	[analyzer][NFC] Move the text output type to its own file, move code to PathDiagnosticConsumer creator functions TableGen and .def files (which are meant to be used with the preprocessor) come with obvious downsides. One of those issues is that generated switch-case branches have to be identical. This pushes corner cases either to an outer code block, or into the generated code. Inspect the removed code in AnalysisConsumer::DigestAnalyzerOptions. You can see how corner cases like a not existing output file, the analysis output type being set to PD_NONE, or whether to complement the output with additional diagnostics on stderr lay around the preprocessor generated code. This is a bit problematic, as to how to deal with such errors is not in the hands of the users of this interface (those implementing output types, like PlistDiagnostics etc). This patch changes this by moving these corner cases into the generated code, more specifically, into the called functions. In addition, I introduced a new output type for convenience purposes, PD_TEXT_MINIMAL, which always existed conceptually, but never in the actual Analyses.def file. This refactoring allowed me to move TextDiagnostics (renamed from ClangDiagPathDiagConsumer) to its own file, which it really deserved. Also, those that had the misfortune to gaze upon Analyses.def will probably enjoy the sight that a clang-format did on it. Differential Revision: https://reviews.llvm.org/D76509	2020-03-23 21:50:40 +01:00
Vitaly Buka	cfaa84e1a6	Fix "previously declared as a struct" warning	2020-03-23 12:59:16 -07:00
Johannes Doerfert	55eca2853e	[OpenMP][NFC] Minimize memory usage and copying of `OMPTraitInfo`s See rationale here: https://reviews.llvm.org/D71830#1922656 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D76173	2020-03-23 14:23:46 -05:00
Jonas Devlieghere	56abcfad70	Revert "[analyzer][NFC] Tie CheckerRegistry to CheckerManager, allow CheckerManager to be constructed for non-analysis purposes" Temporarily reverting this patch because it breaks the modules build.	2020-03-23 12:09:24 -07:00
Alexey Bataev	63828a35da	[OPENMP50]Basic support for exclusive clause. Added basic support (parsing/sema/serialization) for exclusive clause in scan directives.	2020-03-23 13:12:52 -04:00
Kristóf Umann	57b8a40749	[analyzer][NFC] Tie CheckerRegistry to CheckerManager, allow CheckerManager to be constructed for non-analysis purposes It's been a while since my CheckerRegistry related patches landed, allow me to refresh your memory: During compilation, TblGen turns clang/include/clang/StaticAnalyzer/Checkers/Checkers.td into (build directory)/tools/clang/include/clang/StaticAnalyzer/Checkers/Checkers.inc. This is a file that contains the full name of the checkers, their options, etc. The class that is responsible for parsing this file is CheckerRegistry. The job of this class is to establish what checkers are available for the analyzer (even from plugins and statically linked but non-tblgen generated files!), and calculate which ones should be turned on according to the analyzer's invocation. CheckerManager is the class that is responsible for the construction and storage of checkers. This process works by first creating a CheckerRegistry object, and passing itself to CheckerRegistry::initializeManager(CheckerManager&), which will call the checker registry functions (for example registerMallocChecker) on it. The big problem here is that these two classes lie in two different libraries, so their interaction is pretty awkward. This used to be far worse, but I refactored much of it, which made things better but nowhere near perfect. --- This patch changes how the above mentioned two classes interact. CheckerRegistry is mainly used by CheckerManager, and they are so intertwined, it makes a lot of sense to turn it into a field, instead of a one-time local variable. This has additional benefits: much of the information that CheckerRegistry conveniently holds is no longer thrown away right after the analyzer's initialization, and opens the possibility to pass CheckerManager in the shouldRegister* function rather than LangOptions (D75271). There are a few problems with this. 
CheckerManager isn't the only user, when we honor help flags like -analyzer-checker-help, we only have access to a CompilerInstance class, that is before the point of parsing the AST. CheckerManager makes little sense without ASTContext, so I made some changes and added new constructors to make it constructible for the use of help flags. Differential Revision: https://reviews.llvm.org/D75360	2020-03-23 17:09:49 +01:00
Yaxun (Sam) Liu	b670ab7b6b	recommit `1b978ddba0` [CUDA][HIP][OpenMP] Emit deferred diagnostics by a post-parsing AST traversal Differential Revision: https://reviews.llvm.org/D70172	2020-03-23 12:09:07 -04:00
Marcel Hlopko	a711a3a460	[Syntax] Build mapping from AST to syntax tree nodes Summary: Copy of https://reviews.llvm.org/D72446, submitting with Ilya's permission. Only used to assign roles to child nodes for now. This is more efficient than doing range-based queries. In the future, will be exposed in the public API of syntax trees. Reviewers: gribozavr2 Reviewed By: gribozavr2 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76355	2020-03-23 16:22:14 +01:00
John Brawn	fa0320dd8d	Add ParsedAttrInfo::handleDeclAttribute This makes it possible for plugin attributes to actually do something, and also removes a lot of boilerplate for simple attributes in SemaDeclAttr.cpp. Differential Revision: https://reviews.llvm.org/D31342	2020-03-23 13:23:11 +00:00
David Blaikie	2ec59a0a40	Buildbot debugging of `0d0b90105f` (lambda/function_ref lifetime issues) This is failing on several buildbots with some inexplicable (to me, right now) crashes. Let's see if this change is adequate to unblock the buildbots & further understanding can be gained later.	2020-03-22 22:43:44 -07:00
David Blaikie	0d0b90105f	Revert "[FIX] Do not copy an llvm::function_ref if it has to be reused" This fix doesn't seem to be right (function_ref can/should be passed by value) so I've reverted it to see if the buildbots decide to explain what's wrong. This reverts commit `857bf5da35`.	2020-03-22 18:43:39 -07:00
Yaxun (Sam) Liu	78957bab55	[NFC] Refactor handling of Xarch option Extract common code to a function. To prepare for adding an option for CUDA/HIP host and device only option. Differential Revision: https://reviews.llvm.org/D76455	2020-03-22 14:42:09 -04:00
Florian Hahn	684ee2057f	[clang/docs] Fix various sphinx warnings/errors in docs. There are a few places with unexpected indents that trip over sphinx and other syntax errors. Also, the C++ syntax highlighting does not work for class [[gsl::Owner(int)]] IntOwner { Use a regular code:: block instead. There are a few other warnings errors remaining, of the form 'Duplicate explicit target name: "cmdoption-clang--prefix"'. They seem to be caused by the following .. option:: -B<dir>, --prefix <arg>, --prefix=<arg> I am no Restructured Text expert, but it seems like sphinx 1.8.5 tries to generate the same target for the --prefix <arg> and --prefix=<arg>. This pops up in a lot of places and I am not sure how to best resolve it Reviewers: jfb, Bigcheese, dexonsmith, rjmccall Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D76534	2020-03-21 16:06:33 +00:00
Thomas Lively	de6cd3e836	[WebAssembly] Add SIMD integer abs builtins Summary: Since the conditional operator cannot be used with vector conditions in C, we need a builtin to be able to express this operation in C source. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, sunfish, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76538	2020-03-21 00:21:24 -07:00
Wyatt Childers	be10b7e43a	Use values cached in ConstantExprs for expression evaluation where present. No functionality change intended. Differential Revision: https://reviews.llvm.org/D76438	2020-03-20 18:14:58 -07:00
Richard Smith	dc4259d5a3	[c++20] Further extend the set of comparisons broken by C++20 that we accept as an extension. This attempts to accept the same cases as GCC, plus cases where a comparison is rewritten to an operator== with an integral but non-bool return type; this is sufficient to avoid most problems with various major open-source projects (such as ICU) and appears to fix all but one of the comparison-related C++20 build breaks in LLVM. This approach is being pursued for standardization.	2020-03-20 14:22:48 -07:00
Alexey Bataev	9b95929a26	[OPENMP50]Do not allow several scan directives in the same parent region. According to OpenMP 5.0, exactly one scan directive must appear in the loop body of an enclosing worksharing-loop, worksharing-loop SIMD, or simd construct on which a reduction clause with the inscan modifier is present.	2020-03-20 15:45:31 -04:00
Alexey Bataev	06dea73307	[OPENMP50]Initial support for inclusive clause. Added parsing/sema/serialization support for inclusive clause in scan directive.	2020-03-20 14:20:38 -04:00
Erich Keane	ffcc076a2b	[[Clang CallGraph]] CallGraph should still record calls to decls. Discovered by a downstream user, we found that the CallGraph ignores callees unless they are defined. This seems foolish, and prevents combining the report with other reports to create unified reports. Additionally, declarations contain information that is likely useful to consumers of the CallGraph. This patch implements this by splitting the includeInGraph function into two versions, the current one plus one that is for callees only. The only difference currently is that includeInGraph checks for a body, then calls includeCalleeInGraph. Differential Revision: https://reviews.llvm.org/D76435	2020-03-20 08:55:23 -07:00
Simon Tatham	1adfa4c991	[ARM,MVE] Add ACLE intrinsics for the vaddv/vaddlv family. Summary: I've implemented them as target-specific IR intrinsics rather than using `@llvm.experimental.vector.reduce.add`, on the grounds that the 'experimental' intrinsic doesn't currently have much code generation benefit, and my replacements encapsulate the sign- or zero-extension so that you don't expose the illegal MVE vector type (`<4 x i64>`) in IR. The machine instructions come in two versions: with and without an input accumulator. My new IR intrinsics, like the 'experimental' one, don't take an accumulator parameter: we represent that by just adding on the input value using an ordinary i32 or i64 add. So if you write the `vaddvaq` C-language intrinsic with an input accumulator of zero, it can be optimised to VADDV, and conversely, if you write something like `x += vaddvq(y)` then that can be combined into VADDVA. Most of this is achieved in isel lowering, by converting these IR intrinsics into the existing `ARMISD::VADDV` family of custom SDNode types. For the difficult case (64-bit accumulators), isel lowering already implements the optimization of folding an addition into a VADDLV to make a VADDLVA; so once we've made a VADDLV, our job is already done, except that I had to introduce a parallel set of ARMISD nodes for the //predicated// forms of VADDLV. For the simpler VADDV, we handle the predicated form by just leaving the IR intrinsic alone and matching it in an ordinary dag pattern. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76491	2020-03-20 15:42:33 +00:00
Simon Tatham	45a9945b9e	[ARM,MVE] Add ACLE intrinsics for the vminv/vmaxv family. Summary: I've implemented these as target-specific IR intrinsics, because they're not //quite// enough like @llvm.experimental.vector.reduce.min (which doesn't take the extra scalar parameter). Also this keeps the predicated and unpredicated versions looking similar, and the floating-point minnm/maxnm versions fold into the same schema. We had a couple of min/max reductions already implemented, from the initial pathfinding exercise in D67158. Those were done by having separate IR intrinsic names for the signed and unsigned integer versions; as part of this commit, I've changed them to use a flag parameter indicating signedness, which is how we ended up deciding that the rest of the MVE intrinsics family ought to work. So now hopefully the whole lot is consistent. In the new llc test, the output code from the `v8f16` test functions looks quite unpleasant, but most of it is PCS lowering (you can't pass a `half` directly in or out of a function). In other circumstances, where you do something else with your `half` in the same function, it doesn't look nearly as nasty. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76490	2020-03-20 15:42:33 +00:00
Gabor Marton	94061df6e5	[analyzer] StdLibraryFunctionsChecker: Add argument constraints Differential Revision: https://reviews.llvm.org/D73898	2020-03-20 16:33:14 +01:00
Mikhail Maltsev	6ae3eff8ba	[ARM,CDE] Implement CDE vreinterpret intrinsics Summary: This patch implements the following CDE intrinsics: int8x16_t __arm_vreinterpretq_s8_u8 (uint8x16_t in); uint16x8_t __arm_vreinterpretq_u16_u8 (uint8x16_t in); int16x8_t __arm_vreinterpretq_s16_u8 (uint8x16_t in); uint32x4_t __arm_vreinterpretq_u32_u8 (uint8x16_t in); int32x4_t __arm_vreinterpretq_s32_u8 (uint8x16_t in); uint64x2_t __arm_vreinterpretq_u64_u8 (uint8x16_t in); int64x2_t __arm_vreinterpretq_s64_u8 (uint8x16_t in); float16x8_t __arm_vreinterpretq_f16_u8 (uint8x16_t in); float32x4_t __arm_vreinterpretq_f32_u8 (uint8x16_t in); These intrinsics are header-only because they reuse the existing MVE vreinterpret clang built-ins. This set is slightly different from the published specification (see https://static.docs.arm.com/101028/0010/ACLE_2019Q4_release-0010.pdf): it includes int8x16_t __arm_vreinterpretq_s8_u8 (uint8x16_t in); which was unintentionally omitted from the spec, and does not include float64x2_t __arm_vreinterpretq_f64_u8 (uint8x16_t in); The float64x2_t type requires additional implementation effort, and we are not including it yet. Reviewers: simon_tatham, MarkMurrayARM, dmgreen, ostannard Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76300	2020-03-20 14:01:57 +00:00
Mikhail Maltsev	969034b860	[ARM,CDE] Implement CDE unpredicated Q-register intrinsics Summary: This patch implements the following intrinsics: uint8x16_t __arm_vcx1q_u8 (int coproc, uint32_t imm); T __arm_vcx1qa(int coproc, T acc, uint32_t imm); T __arm_vcx2q(int coproc, T n, uint32_t imm); uint8x16_t __arm_vcx2q_u8(int coproc, T n, uint32_t imm); T __arm_vcx2qa(int coproc, T acc, U n, uint32_t imm); T __arm_vcx3q(int coproc, T n, U m, uint32_t imm); uint8x16_t __arm_vcx3q_u8(int coproc, T n, U m, uint32_t imm); T __arm_vcx3qa(int coproc, T acc, U n, V m, uint32_t imm); Most of them are polymorphic. Furthermore, some intrinsics are polymorphic by 2 or 3 parameter types, such polymorphism is not supported by the existing MVE/CDE tablegen backends, also we don't really want to have a combinatorial explosion caused by 1000 different combinations of 3 vector types. Because of this some intrinsics are implemented as macros involving a cast of the polymorphic arguments to uint8x16_t. The IR intrinsics are even more restricted in terms of types: all MVE vectors are cast to v16i8. Reviewers: simon_tatham, MarkMurrayARM, dmgreen, ostannard Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76299	2020-03-20 14:01:56 +00:00
Mikhail Maltsev	d22e661712	[ARM,CDE] Implement CDE S and D-register intrinsics Summary: This patch implements the following ACLE intrinsics: uint32_t __arm_vcx1_u32(int coproc, uint32_t imm); uint32_t __arm_vcx1a_u32(int coproc, uint32_t acc, uint32_t imm); uint32_t __arm_vcx2_u32(int coproc, uint32_t n, uint32_t imm); uint32_t __arm_vcx2a_u32(int coproc, uint32_t acc, uint32_t n, uint32_t imm); uint32_t __arm_vcx3_u32(int coproc, uint32_t n, uint32_t m, uint32_t imm); uint32_t __arm_vcx3a_u32(int coproc, uint32_t acc, uint32_t n, uint32_t m, uint32_t imm); uint64_t __arm_vcx1d_u64(int coproc, uint32_t imm); uint64_t __arm_vcx1da_u64(int coproc, uint64_t acc, uint32_t imm); uint64_t __arm_vcx2d_u64(int coproc, uint64_t m, uint32_t imm); uint64_t __arm_vcx2da_u64(int coproc, uint64_t acc, uint64_t m, uint32_t imm); uint64_t __arm_vcx3d_u64(int coproc, uint64_t n, uint64_t m, uint32_t imm); uint64_t __arm_vcx3da_u64(int coproc, uint64_t acc, uint64_t n, uint64_t m, uint32_t imm); Since the semantics of CDE instructions is opaque to the compiler, the ACLE intrinsics require dedicated LLVM IR intrinsics. The 64-bit and 32-bit variants share the same IR intrinsic. Reviewers: simon_tatham, MarkMurrayARM, ostannard, dmgreen Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76298	2020-03-20 14:01:53 +00:00
Mikhail Maltsev	7a85e3585e	[ARM,CDE] Implement GPR CDE intrinsics Summary: This change implements ACLE CDE intrinsics that translate to instructions working with general-purpose registers. The specification is available at https://static.docs.arm.com/101028/0010/ACLE_2019Q4_release-0010.pdf Each ACLE intrinsic gets a corresponding LLVM IR intrinsic (because they have distinct function prototypes). Dual-register operands are represented as pairs of i32 values. Because of this the instruction selection for these intrinsics cannot be represented as TableGen patterns and requires custom C++ code. Reviewers: simon_tatham, MarkMurrayARM, dmgreen, ostannard Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76296	2020-03-20 14:01:51 +00:00
Alexey Bataev	fcba7c3534	[OPENMP50]Initial support for scan directive. Added basic parsing/sema/serialization support for scan directive.	2020-03-20 07:58:15 -04:00
Tyker	180581cfcf	[clang] Add support for consteval constructors Summary: Changes: - handle immediate invocations for constructors. - add tests. After this patch, I believe the implementation of consteval is nearly standard compliant, but IR-gen still needs to be taught not to emit consteval declarations. Reviewers: rsmith Reviewed By: rsmith Subscribers: wchilders Differential Revision: https://reviews.llvm.org/D74007	2020-03-20 11:33:54 +01:00
Shiva Chen	fc3752665f	[RISCV] Passing small data limitation value to RISCV backend Pass the small data limit to RISCVELFTargetObjectFile via a module flag, so that the backend can set the small data section threshold from that value. Data is placed in the small data section if it is smaller than the threshold. Differential Revision: https://reviews.llvm.org/D57497	2020-03-20 11:03:51 +08:00
Thomas Lively	a3f974f3c3	[WebAssembly] SIMD bitmask intrinsics and builtin functions Summary: These experimental new instructions are proposed in https://github.com/WebAssembly/simd/pull/201. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76397	2020-03-19 17:15:37 -07:00
Sam McCall	b4f02d89e5	[AST] Make Expr::setDependence protected and remove add/removeDependence. NFC Summary: The expected pattern is for subclasses to initialize through computeDependence, which needs only setDependence. The few places that still use addDependence can be simulated with get+set. Reviewers: hokein Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76392	2020-03-19 21:54:40 +01:00
Marcel Hlopko	88bf9b3d26	[Syntax] Build template declaration nodes Summary: Rollforward of https://reviews.llvm.org/rGdd12826808f9079e164b82e64b0697a077379241 after temporarily adding -fno-delayed-template-parsing to the TreeTest. Original summary: > Copy of https://reviews.llvm.org/D72334, submitting with Ilya's permission. > > Handles template declaration of all kinds. > > Also builds template declaration nodes for specializations and explicit > instantiations of classes. > > Some missing things will be addressed in the follow-up patches: > > * specializations of functions and variables, > * template parameters. Reviewers: gribozavr2 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76418	2020-03-19 17:43:07 +01:00
Haojian Wu	0dd0b1017c	[Parser] Avoid spurious 'missing template' error in presence of typos. Suppress those diagnostics if lhs of a member expression contains errors. Typo correction produces dependent expressions even in non-template code, that led to spurious diagnostics before. previous: /tmp/t.cpp:6:17: error: use 'template' keyword to treat 'f' as a dependent template name auto a = bilder.f<int>(); ^ template /tmp/t.cpp:6:10: error: use of undeclared identifier 'bilder'; did you mean 'builder'? auto a = bilder.f<int>(); ^~~~~~ builder vs now: /tmp/t.cpp:6:10: error: use of undeclared identifier 'bilder'; did you mean 'builder'? auto a = bilder.f<int>(); ^~~~~~ builder Original patch from Ilya. Reviewers: sammccall Reviewed By: sammccall Tags: #clang Differential Revision: https://reviews.llvm.org/D65592	2020-03-19 16:15:27 +01:00
Adam Balogh	6cff2e9f78	[Analyzer] Bugfix for CheckerRegistry `CheckerRegistry` registers a checker either if it is explicitly enabled or it is a dependency of an explicitly enabled checker and is not explicitly disabled. In both cases it is also important that the checker should be registered (`shouldRegister`//XXX//`()` returns true). Currently there is a bug here: if the dependent checker is not explicitly disabled it is registered regardless of whether it should be registered. This patch fixes this bug. Differential Revision: https://reviews.llvm.org/D75842	2020-03-19 16:06:42 +01:00
Jan Korous	5d67fb3ecc	[AST][NFCi] Make CXXBasePaths::Origin const	2020-03-19 07:54:05 -07:00
Djordje Todorovic	d9b9621009	Reland D73534: [DebugInfo] Enable the debug entry values feature by default The issue that was causing the build failures was fixed with the D76164.	2020-03-19 13:57:30 +01:00
Lucas Prates	d4ad386ee1	[ARM] Fixing range checks for Neon's vqdmulhq_lane and vqrdmulhq_lane intrinsics Summary: The range checks performed for the vqdmulhq_lane and vqrdmulhq_lane Neon intrinsics were incorrectly using their return type as the base type for the range check performed on their 'lane' argument. This patch updates those intrinsics to use the type of the proper reference argument to perform the range checks. Reviewers: jmolloy, t.p.northover, rsmith, olista01, dnsampaio Reviewed By: dnsampaio Subscribers: dnsampaio, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74766	2020-03-19 12:08:12 +00:00
Lucas Prates	f56550cf7f	[ARM] Enabling range checks on Neon intrinsics' lane arguments Summary: Range checks were not properly performed in the lane arguments of Neon intrinsics implemented based on splat operations. Calls to those intrinsics were translated to `__builtin_shufflevector` calls directly by the pre-processor through the arm_neon.h macros, missing the chance for the proper range checks. This patch enables the range check by introducing an auxiliary splat instruction in arm_neon.td, delaying the translation to shufflevector calls to CGBuiltin.cpp in clang after the checks were performed. Reviewers: jmolloy, t.p.northover, rsmith, olista01, ostannard Reviewed By: ostannard Subscribers: ostannard, dnsampaio, danielkiss, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74619	2020-03-19 12:07:23 +00:00
Lucas Prates	d42711625a	[ARM] Creating 'call_mangled' for Neon intrinsics definitions Summary: As multiple versions of the same Neon intrinsic can be created through the same TableGen definition with the same argument types, the existing `call` operator is not always able to properly perform overload resolutions. As these different intrinsic versions are differentiated later on by the NeonEmitter through name mangling, this patch introduces a new `call_mangled` operator to the TableGen definitions, which allows a call for an otherwise ambiguous intrinsic by matching its mangled name with the mangled variation of the caller. Reviewers: jmolloy, t.p.northover, rsmith, olista01, dnsampaio Reviewed By: dnsampaio Subscribers: dnsampaio, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74618	2020-03-19 12:05:55 +00:00
Lucas Prates	dade859b58	[ARM] Setting missing isLaneQ attribute on Neon Intrisics definitions Summary: Some of the `*_laneq` intrinsics defined in arm_neon.td were missing the setting of the `isLaneQ` attribute. This patch sets the attribute on the related definitions, as they will be required to properly perform range checks on their lane arguments. Reviewers: jmolloy, t.p.northover, rsmith, olista01, dnsampaio Reviewed By: dnsampaio Subscribers: dnsampaio, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74616	2020-03-19 12:04:14 +00:00
Lucas Prates	7bf23563f4	Revert "[ARM] Setting missing isLaneQ attribute on Neon Intrisics definitions" This reverts commit `62ab15ffa3`. Multiple commits were unintentionally squashed into this one. Reverting so each of them can be pushed properly.	2020-03-19 12:01:13 +00:00
Lucas Prates	62ab15ffa3	[ARM] Setting missing isLaneQ attribute on Neon Intrisics definitions Summary: Some of the `*_laneq` intrinsics defined in arm_neon.td were missing the setting of the `isLaneQ` attribute. This patch sets the attribute on the related definitions, as they will be required to properly perform range checks on their lane arguments. Reviewers: jmolloy, t.p.northover, rsmith, olista01, dnsampaio Reviewed By: dnsampaio Subscribers: dnsampaio, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74616	2020-03-19 11:52:41 +00:00
Sander de Smalen	981f0802b3	[SVE] Generate overloaded functions for ACLE intrinsics. The SVE ACLE allows using a short-form for the intrinsics, e.g. the following two declarations generate the same code: svuint32_t svld1(svbool_t, uint32_t const *); svuint32_t svld1_u32(svbool_t, uint32_t const *); using the attribute: __clang_arm_builtin_alias so that any call to svld1(svbool_t, uint32_t const *) will map to __builtin_sve_svld1_u32. Reviewers: SjoerdMeijer, miyuki, efriedma, simon_tatham, rengolin Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D75861	2020-03-19 09:36:23 +00:00
Haojian Wu	4b0f1e12c2	[AST] Add a flag indicating if any subexpression had errors The only subexpression that is considered an error now is TypoExpr, but we plan to add expressions with errors to improve editor tooling on broken code. We intend to use the same mechanism to guard against spurious diagnostics on those as well. See the follow-up revision for an actual usage of the flag. Original patch from Ilya. Reviewers: sammccall Reviewed By: sammccall Tags: #clang Differential Revision: https://reviews.llvm.org/D65591	2020-03-19 08:56:10 +01:00
Alexey Bataev	2f8894a5b8	[OPENMP50]Add support for extended device clause in target directives. Added parsing/sema/serialization support for extended device clause in executable target directives.	2020-03-18 15:02:37 -04:00
Adrian Prantl	1cc09dcefc	Add missing module map entry.	2020-03-18 10:50:45 -07:00
Simon Tatham	e13d153c1b	[ARM,MVE] Add intrinsics for the VQDMLAD family. Summary: This is another set of instructions too complicated to be sensibly expressed in IR by anything short of a target-specific intrinsic. Given input vectors a,b, the instruction generates intermediate values 2*(a[0]*b[0]+a[1]*b[1]), 2*(a[2]*b[2]+a[3]*b[3]), etc; takes the high half of each double-width value, and overwrites half the lanes in the output vector c, which you therefore have to provide the input value of. Optionally you can swap the elements of b so that the terms are things like a[0]*b[1]+a[1]*b[0]; optionally you can round to nearest when taking the high half; and optionally you can take the difference rather than sum of the two products. Finally, saturation is applied when converting back to a single-width vector lane. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76359	2020-03-18 17:11:22 +00:00
Nico Weber	881f5b5a7b	Revert "[Syntax] Build template declaration nodes" This reverts commit `dd12826808`. Breaks tests on Windows, see https://reviews.llvm.org/D76346#1929208	2020-03-18 12:57:55 -04:00
Marcel Hlopko	dd12826808	[Syntax] Build template declaration nodes Summary: Copy of https://reviews.llvm.org/D72334, submitting with Ilya's permission. Handles template declaration of all kinds. Also builds template declaration nodes for specializations and explicit instantiations of classes. Some missing things will be addressed in the follow-up patches: specializations of functions and variables, template parameters. Reviewers: gribozavr2 Reviewed By: gribozavr2 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76346	2020-03-18 16:16:59 +01:00
Michael Liao	4cf01ed75e	[hip] Revise `GlobalDecl` constructors. NFC. Summary: - https://reviews.llvm.org/D68578 revises the `GlobalDecl` constructors to ensure all GPU kernels have `ReferenceKenelKind` initialized properly with an explicit constructor and static one. But, there are lots of places using the implicit constructor triggering the assertion on non-GPU kernels. That's found in compilation of many tests and workloads. - Fixing all of them may change more code and, more importantly, all of them assumes the default kernel reference kind. This patch changes that constructor to tell `CUDAGlobalAttr` and construct `GlobalDecl` properly. Reviewers: yaxunl Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76344	2020-03-18 09:33:39 -04:00
Sander de Smalen	c5b81466c2	Reland D75470 [SVE] Auto-generate builtins and header for svld1. Reworked the patch to avoid sharing a header (SVETypeFlags.h) between include/clang/Basic and utils/TableGen/SveEmitter.cpp. Now the patch generates the enum/flags which is included in TargetBuiltins.h. Also renamed one of the SveEmitter options to be in line with MVE. Summary: This is a first patch in a series for the SveEmitter to generate the arm_sve.h header file and builtins. I've tried my best to strip down this patch as best as I could, but there are still a few changes that are not necessarily exercised by the load intrinsics in this patch, mostly around the SVEType class which has some common logic to represent types from a type and prototype string. I thought it didn't make much sense to remove that from this patch and split it up.	2020-03-18 11:16:28 +00:00
Simon Tatham	928776de92	[ARM,MVE] Add intrinsics for the VQDMLAH family. Summary: These are complicated integer multiply+add instructions with extra saturation, taking the high half of a double-width product, and optional rounding. There's no sensible way to represent that in standard IR, so I've converted the clang builtins directly to target-specific intrinsics. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76123	2020-03-18 10:55:04 +00:00
Simon Tatham	28c5d97bee	[ARM,MVE] Add intrinsics and isel for MVE integer VMLA. Summary: These instructions compute multiply+add in integers, with one of the operands being a splat of a scalar. (VMLA and VMLAS differ in whether the splat operand is a multiplier or the addend.) I've represented these in IR using existing standard IR operations for the unpredicated forms. The predicated forms are done with target- specific intrinsics, as usual. When operating on n-bit vector lanes, only the bottom n bits of the i32 scalar operand are used. So we have to tell that to isel lowering, to allow it to remove a pointless sign- or zero-extension instruction on that input register. That's done in `PerformIntrinsicCombine`, but first I had to enable `PerformIntrinsicCombine` for MVE targets (previously all the intrinsics it handled were for NEON), and make it a method of `ARMTargetLowering` so that it can get at `SimplifyDemandedBits`. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76122	2020-03-18 10:55:04 +00:00
Richard Smith	e7a811b319	PR45133: Don't crash if the active member of a union changes while it's in the process of being initialized.	2020-03-17 20:37:14 -07:00
Jon Chesterfield	c45eaeabb7	[Clang] Undef attribute for global variables Summary: [Clang] Attribute to allow defining undef global variables Initializing global variables is very cheap on hosted implementations. The C semantics of zero initializing globals work very well there. It is not necessarily cheap on freestanding implementations. Where there is no loader available, code must be emitted near the start point to write the appropriate values into memory. At present, external variables can be declared in C++ and definitions provided in assembly (or IR) to achieve this effect. This patch provides an attribute in order to remove this reason for writing assembly for performance sensitive freestanding implementations. A close analogue in tree is LDS memory for amdgcn, where the kernel is responsible for initializing the memory after it starts executing on the gpu. Uninitialized variables in LDS are observably cheaper than zero initialized. Patch is loosely based on the cuda __shared__ and opencl __local variable implementation which also produces undef global variables. Reviewers: kcc, rjmccall, rsmith, glider, vitalybuka, pcc, eugenis, vlad.tsyrklevich, jdoerfert, gregrodgers, jfb, aaron.ballman Reviewed By: rjmccall, aaron.ballman Subscribers: Anastasia, aaron.ballman, davidb, Quuxplusone, dexonsmith, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74361	2020-03-17 21:22:23 +00:00
Benjamin Kramer	acf6e4190f	Purge unused diagnostics. NFC.	2020-03-17 15:17:10 +01:00
Alexey Bataev	0f0564bb9a	[OPENMP50]Initial support for detach clause in task directive. Added parsing/sema/serialization support for detach clause.	2020-03-17 09:19:03 -04:00

26333 Commits