llvm-project

Commit Graph

Author	SHA1	Message	Date
Shilei Tian	2c6ffb4eb2	[NFC] clang-format -i clang/lib/CodeGen/CGStmtOpenMP.cpp	2021-08-22 22:57:05 -04:00
Simon Pilgrim	7f48bd3bed	CGBuiltin.cpp - pass SVETypeFlags by const reference. NFC. Don't pass the struct by value.	2021-08-22 12:13:17 +01:00
Wang, Pengfei	b088536ce9	[X86] AVX512FP16 instructions enabling 4/6 Enable FP16 unary operator instructions. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105267	2021-08-22 08:59:35 +08:00
Joseph Huber	ec66ed79f4	[OpenMP] Correctly add member expressions to OpenMP info Mapping expressions that have `this` as their base expression aren't considered a valid base variable and the rest of the runtime expects this. However, if we have an expression with no value declaration we can try to extract it manually to provide more helpful debuggin information. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D108483	2021-08-20 20:45:14 -04:00
Arthur Eubanks	644f88a25b	[NFC] addAttribute(FunctionIndex) => addFnAttribute()	2021-08-20 14:18:59 -07:00
Yonghong Song	5ca7131eb3	[DebugInfo] convert btf_tag attrs to DI annotations for record fields Generate btf_tag annotations for record fields. The annotations are represented as an DINodeArray in DebugInfo. Differential Revision: https://reviews.llvm.org/D106616	2021-08-20 12:52:51 -07:00
Jon Chesterfield	b1efeface7	Revert "[openmp][nfc] Refactor GridValues" Failed a nvptx codegen test This reverts commit `2a47a84b40`.	2021-08-20 18:17:27 +01:00
Craig Topper	5cf5df8014	[X86] Add missing __inline__ to functions in amxintrin.h	2021-08-20 09:35:02 -07:00
Thomas Lively	88962cea46	[WebAssembly] Restore builtins and intrinsics for pmin/pmax Partially reverts `85157c0079`, which had removed these builtins and intrinsics in favor of normal codegen patterns. It turns out that it is possible for the patterns to be split over multiple basic blocks, however, which means that DAG ISel is not able to select them to the pmin/pmax instructions. To make sure the SIMD intrinsics generate the correct instructions in these cases, reintroduce the clang builtins and corresponding LLVM intrinsics, but also keep the normal pattern matching as well. Differential Revision: https://reviews.llvm.org/D108387	2021-08-20 09:21:31 -07:00
Thomas Lively	64a9957bf7	[WebAssembly] Make shift values unsigned in wasm_simd128.h On some platforms, negative shift values mean to shift in the opposite direction, but this is not true with WebAssembly. To avoid confusion, make the shift values in the shift intrinsics unsigned. Differential Revision: https://reviews.llvm.org/D108415	2021-08-20 09:10:37 -07:00
Aaron Ballman	65bcdeaa15	Replace an unnecessary null check with an assert; NFC	2021-08-20 12:04:46 -04:00
Thomas Lively	2456e11614	[WebAssembly] Add SIMD intrinsics using unsigned integers For each SIMD intrinsic function that takes or returns a scalar signed integer value, ensure there is a corresponding intrinsic that returns or an unsigned value. This is a convenience for users who use -Wsign-conversion so they don't have to insert explicit casts, especially when the intrinsic arguments are integer literals that fit into the unsigned integer type but not the signed type. Differential Revision: https://reviews.llvm.org/D108412	2021-08-20 08:56:51 -07:00
Jon Chesterfield	2a47a84b40	[openmp][nfc] Refactor GridValues Remove redundant fields and replace pointer with virtual function Of fourteen fields, three are dead and four can be computed from the remainder. This leaves a couple of currently dead fields in place as they are expected to be used from the deviceRTL shortly. Two of the fields that can be computed are only used from codegen and require a log2() implementation so are inlined into codegen instead. This change leaves the new methods in the same location in the struct as the previous fields for convenience at review. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D108380	2021-08-20 16:41:26 +01:00
Corentin Jabot	bdeda959ab	Make wide multi-character character literals ill-formed This implements P2362, which has not yet been approved by the C++ committee, but because wide-multi character literals are implementation defined, clang might not have to wait for WG21. This change is also being applied in C mode as the behavior is implementation-defined in C as well and there's no benefit to having different rules between the languages. The other part of P2362, making non-representable character literals ill-formed, is already implemented by clang	2021-08-20 11:10:53 -04:00
Aaron Ballman	c7aacce304	Use DeclContext::getNonTransparentContext(); NFC	2021-08-20 11:08:58 -04:00
Aaron Ballman	48f73ee666	Fix assertion when generating diagnostic for inline namespaces When calculating the name to display for inline namespaces, we have custom logic to try to hide redundant inline namespaces from the diagnostic. Calculating these redundancies requires performing a lookup in the parent declaration context, but that lookup should not try to look through transparent declaration contexts, like linkage specifications. Instead, loop up the declaration context chain until we find a non-transparent context and use that instead. This fixes PR49954.	2021-08-20 09:50:24 -04:00
Alexander Potapenko	b0391dfc73	[clang][Codegen] Introduce the disable_sanitizer_instrumentation attribute The purpose of __attribute__((disable_sanitizer_instrumentation)) is to prevent all kinds of sanitizer instrumentation applied to a certain function, Objective-C method, or global variable. The no_sanitize(...) attribute drops instrumentation checks, but may still insert code preventing false positive reports. In some cases though (e.g. when building Linux kernel with -fsanitize=kernel-memory or -fsanitize=thread) the users may want to avoid any kind of instrumentation. Differential Revision: https://reviews.llvm.org/D108029	2021-08-20 14:01:06 +02:00
Simon Pilgrim	5b72fb866c	[AST] getDeclLocForCommentSearch - remove dead return. NFC. Don't use an else-block as the previous if-block always returns, and remove the (now more obvious) dead return {}.	2021-08-20 12:32:29 +01:00
Simon Pilgrim	62f4c90aaf	[Sema] Remove dead return immediately after another return. NFC.	2021-08-20 12:11:23 +01:00
Justas Janickas	9ae9262857	[OpenCL] Fix version reporting of C++ for OpenCL 2021 C++ for OpenCL version 2021 and later are expected to consist of a major version number only. Therefore, a different constructor for `VersionTuple` needs to be called when reporting language version. Differential Revision: https://reviews.llvm.org/D108379	2021-08-20 08:58:12 +01:00
Anshil Gandhi	7063ac1afa	[HIP] Allow target addr space in target builtins This patch allows target specific addr space in target builtins for HIP. It inserts implicit addr space cast for non-generic pointer to generic pointer in general, and inserts implicit addr space cast for generic to non-generic for target builtin arguments only. It is NFC for non-HIP languages. Differential Revision: https://reviews.llvm.org/D102405	2021-08-19 23:51:58 -06:00
Albion Fung	9d4faa8ac3	[PowerPC] Implement cmplxl builtins This patch implements the builtins for cmplxl by utilising __builtin_complex. This builtin is implemented to match XL functionality. Differential revision: https://reviews.llvm.org/D107138	2021-08-19 21:36:43 -05:00
Yonghong Song	cab12fc28c	[DebugInfo] convert btf_tag attrs to annotations for DIComposite types Clang patch D106614 added attribute btf_tag support. This patch generates btf_tag annotations for DIComposite types. Each btf_tag annotation is represented as a 2D array of meta strings. Each record may have more than one btf_tag annotations. Differential Revision: https://reviews.llvm.org/D106615	2021-08-19 18:01:29 -07:00
Thomas Lively	fd3bd63df2	[WebAssembly] Make bitmask instructions return unsigned ints Since they are bitmasks, it will be more common for them to be used and potentially extended to 64-bit integers as unsigned values rather than signed values. Differential Revision: https://reviews.llvm.org/D108401	2021-08-19 16:23:47 -07:00
Jennifer Yu	c274b19866	Add implicit map for a list item appears in a reduction clause. A new rule is added in 5.0: If a list item appears in a reduction, lastprivate or linear clause on a combined target construct then it is treated as if it also appears in a map clause with a map-type of tofrom. Currently map clauses for all capture variables are added implicitly. But missing for list item of expression for array elements or array sections. The change is to add implicit map clause for array of elements used in reduction clause. Skip adding map clause if the expression is not mappable. Noted: For linear and lastprivate, since only variable name is accepted, the map has been added though capture variables. To do so: During the mappable checking, if error, ignore diagnose and skip adding implicit map clause. The changes: 1> Add code to generate implicit map in ActOnOpenMPExecutableDirective, for omp 5.0 and up. 2> Add extra default parameter NoDiagnose in ActOnOpenMPMapClause: Use that to skip error as well as skip adding implicit map during the mappable checking. Note: there are only tow places need to be check for NoDiagnose. Rest of them either the check is for < omp 5.0 or the error already generated for reduction clause. Differential Revision: https://reviews.llvm.org/D108132	2021-08-19 12:53:47 -07:00
Simon Pilgrim	ae691648b4	Fix unknown parameter Wdocumentation warning. NFC.	2021-08-19 15:40:10 +01:00
Simon Pilgrim	fd37ead386	Fix unknown parameter Wdocumentation warning. NFC.	2021-08-19 15:40:10 +01:00
Simon Pilgrim	caa282a449	Fix unknown parameter Wdocumentation warning. NFC.	2021-08-19 15:40:09 +01:00
Jon Chesterfield	77579b99e9	[openmp][nfc] Replace OMPGridValues array with struct [nfc] Replaces enum indices into an array with a struct. Named the fields to match the enum, leaves memory layout and initialization unchanged. Motivation is to later safely remove dead fields and replace redundant ones with (compile time) computation. It should also be possible to factor some common fields into a base and introduce a gfx10 amdgpu instance with less duplication than the arrays of integers require. Reviewed By: ronlieb Differential Revision: https://reviews.llvm.org/D108339	2021-08-19 13:25:42 +01:00
Sam McCall	cab7c52acd	[CodeCompletion] Provide placeholders for known attribute arguments Completion now looks more like function/member completion: used alias(Aliasee) abi_tag(Tags...) Differential Revision: https://reviews.llvm.org/D108109	2021-08-19 14:03:41 +02:00
Sam McCall	a1ebae08f4	[CodeComplete] Only complete attributes that match the current LangOpts Differential Revision: https://reviews.llvm.org/D108111	2021-08-19 13:35:07 +02:00
Sven van Haastregt	7bda1a0711	[OpenCL] Fix as_type(vec3) invalid store creation With -fpreserve-vec3-type enabled, a cast was not created when converting from a vec3 type to a non-vec3 type, even though a conversion to vec4 was performed. This resulted in creation of invalid store instructions. Differential Revision: https://reviews.llvm.org/D107963	2021-08-19 11:57:09 +01:00
Bjorn Pettersson	36d5138619	[NewPM] Make some sanitizer passes parameterized in the PassRegistry Refactored implementation of AddressSanitizerPass and HWAddressSanitizerPass to use pass options similar to passes like MemorySanitizerPass. This makes sure that there is a single mapping from class name to pass name (needed by D108298), and options like -debug-only and -print-after makes a bit more sense when (despite that it is the unparameterized pass name that should be used in those options). A result of the above is that some pass names are removed in favor of the parameterized versions: - "khwasan" is now "hwasan<kernel;recover>" - "kasan" is now "asan<kernel>" - "kmsan" is now "msan<kernel>" Differential Revision: https://reviews.llvm.org/D105007	2021-08-19 12:43:37 +02:00
Martin Storsjö	cc3affd8b0	[clang] [MSVC] Implement __mulh and __umulh builtins for aarch64 The code is based on the same __mulh and __umulh intrinsics for x86. This should fix PR51128. Differential Revision: https://reviews.llvm.org/D106721	2021-08-19 11:29:55 +03:00
Anshil Gandhi	f5d5f17d3a	Revert "[HIP] Allow target addr space in target builtins" This reverts commit `a35008955f`.	2021-08-18 21:38:42 -06:00
Rong Xu	5fdaaf7fd8	[SampleFDO] Flow Sensitive Sample FDO (FSAFDO) profile loader This patch implements Flow Sensitive Sample FDO (FSAFDO) profile loader. We have two profile loaders for FS profile, one before RegAlloc and one before BlockPlacement. To enable it, when -fprofile-sample-use=<profile> is specified, add "-enable-fs-discriminator=true \ -disable-ra-fsprofile-loader=false \ -disable-layout-fsprofile-loader=false" to turn on the FS profile loaders. Differential Revision: https://reviews.llvm.org/D107878	2021-08-18 18:37:35 -07:00
Jon Chesterfield	dbd7bad9ad	[openmp] Annotate tmp variables with omp_thread_mem_alloc Fixes miscompile of calls into ocml. Bug 51445. The stack variable `double __tmp` is moved to dynamically allocated shared memory by CGOpenMPRuntimeGPU. This is usually fine, but when the variable is passed to a function that is explicitly annotated address_space(5) then allocating the variable off-stack leads to a miscompile in the back end, which cannot decide to move the variable back to the stack from shared. This could be fixed by removing the AS(5) annotation from the math library or by explicitly marking the variables as thread_mem_alloc. The cast to AS(5) is still a no-op once IR is reached. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D107971	2021-08-19 02:22:11 +01:00
owenca	643f2be7b6	[clang-format] Improve detection of parameter declarations in K&R C Clean up the detection of parameter declarations in K&R C function definitions. Also make it more precise by requiring the second token after the r_paren to be either a star or keyword/identifier. Differential Revision: https://reviews.llvm.org/D108094	2021-08-18 15:21:48 -07:00
Corentin Jabot	2715c4da50	Do not emit diagnostics for invalid unicode characters in preprocessing mode This amends `4e80636db7` with a fix for https://lab.llvm.org/buildbot/#/builders/139/builds/8943	2021-08-18 09:12:36 -04:00
Corentin Jabot	4e80636db7	Implement P1949 This adds the Unicode 13 data for XID_Start and XID_Continue. The definition of valid identifier is changed in all C++ modes as P1949 (https://wg21.link/p1949) was accepted by WG21 as a defect report.	2021-08-18 07:33:14 -04:00
Simon Pilgrim	f22e5869a0	[Sema] CheckObjCBridgeNSCast - fix dead code warning. NFCI. Target is only ever non-null when we find an existing type, so move its declaration inside that case, and remove the dead code where Target was always null.	2021-08-18 11:53:27 +01:00
Qiu Chaofan	1d06a39d6e	Fix a typo around 'builtin'	2021-08-18 17:42:12 +08:00
Justas Janickas	0d0628b2d2	[OpenCL] C++ for OpenCL version 2021 introduced to command line. Introduces language standard `lang_openclcpp2021` and allows `clc++2021` as a version flag for `-cl-std` in command line. Defines macros related to C++ for OpenCL version 2021. C++ for OpenCL version 2021 has been proposed in an RFC: https://lists.llvm.org/pipermail/cfe-dev/2021-August/068593.html Differential Revision: https://reviews.llvm.org/D108038	2021-08-18 10:08:59 +01:00
Arthur Eubanks	3f4d00bc3b	[NFC] More get/removeAttribute() cleanup	2021-08-17 21:05:41 -07:00
Arthur Eubanks	de0ae9e89e	[NFC] Cleanup more AttributeList::addAttribute()	2021-08-17 21:05:41 -07:00
Arthur Eubanks	ad727ab7d9	[NFC] Migrate some callers away from Function/AttributeLists methods that take an index These methods can be confusing.	2021-08-17 21:05:40 -07:00
Arthur Eubanks	46cf82532c	[NFC] Replace Function handling of attributes with less confusing calls To avoid magic constants and confusing indexes.	2021-08-17 21:05:40 -07:00
Wang, Pengfei	5aeca3b0a5	[CFE][X86] Enable complex _Float16 support Support complex _Float16 on X86 in C/C++ following the latest X86 psABI. (https://gitlab.com/x86-psABIs) Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105331	2021-08-18 11:16:14 +08:00
Brian Cain	59dfde7d94	[clang] enable sanitizers for hexagon	2021-08-17 19:59:24 -07:00
Wang, Pengfei	2379949aad	[X86] AVX512FP16 instructions enabling 3/6 Enable FP16 conversion instructions. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105265	2021-08-18 09:03:41 +08:00
Ben Barham	32208555af	[Modules] Do not remove failed modules after the control block phase Reading modules first reads each control block in the chain and then all AST blocks. The first phase is intended to find recoverable errors, eg. an out of date or missing module. If any error occurs during this phase, it is safe to remove all modules in the chain as no references to them will exist. While reading the AST blocks, however, various fields in ASTReader are updated with references to the module. Removing modules at this point can cause dangling pointers which can be accessed later. These would be otherwise harmless, eg. a binary search over `GlobalSLocEntryMap` may access a failed module that could error, but shouldn't crash. Do not remove modules in this phase, regardless of failures. Since this is the case, it also doesn't make sense to return OutOfDate during this phase, so remove the two cases where this happens. When they were originally added these checks would return a failure when the serialized and current path didn't match up. That was updated to an OutOfDate as it was found to be hit when using VFS and overriding the umbrella. Later on the path was changed to instead be the name as written in the module file, resolved using the serialized base directory. At this point the check is really only comparing the name of the umbrella and only works for frameworks since those don't include `Headers/` in the name (which means the resolved path will never exist) Given all that, it seems safe to ignore this case entirely for now. This makes the handling of an umbrella header/directory the same as regular headers, which also don't check for differences in the path caused by VFS. Resolves rdar://79329355 Differential Revision: https://reviews.llvm.org/D107690	2021-08-17 16:46:51 -07:00
Denys Petrov	9dabacd09f	[analyzer] Adjust JS code of analyzer's HTML report for IE support. Summary: Change and replace some functions which IE does not support. This patch is made as a continuation of D92928 revision. Also improve hot keys behavior. Differential Revision: https://reviews.llvm.org/D107366	2021-08-17 19:32:34 +03:00
Dylan Fleming	ef198cd99e	[SVE] Remove usage of getMaxVScale for AArch64, in favour of IR Attribute Removed AArch64 usage of the getMaxVScale interface, replacing it with the vscale_range(min, max) IR Attribute. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D106277	2021-08-17 14:42:47 +01:00
Ben Shi	b31199bab4	[AVR][clang] Improve search for avr-libc installation path Search avr-libc path according to avr-gcc installation at first, then other possible installed pathes. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D107682	2021-08-17 11:51:35 +08:00
Weverything	c411c1bd7f	Fix missing qualifier in template type diffing Handle SubstTemplateTypeParmType so qualifiers do not get dropped from the diagnostic message.	2021-08-16 18:34:18 -07:00
Nathan Chancellor	9ed4a94d64	[clang] Expose unreachable fallthrough annotation warning The Linux kernel has a macro called IS_ENABLED(), which evaluates to a constant 1 or 0 based on Kconfig selections, allowing C code to be unconditionally enabled or disabled at build time. For example: int foo(struct *a, int b) { switch (b) { case 1: if (a->flag \|\| !IS_ENABLED(CONFIG_64BIT)) return 1; __attribute__((fallthrough)); case 2: return 2; default: return 3; } } There is an unreachable warning about the fallthrough annotation in the first case because !IS_ENABLED(CONFIG_64BIT) can be evaluated to 1, which looks like return 1; __attribute__((fallthrough)); to clang. This type of warning is pointless for the Linux kernel because it does this trick all over the place due to the sheer number of configuration options that it has. Add -Wunreachable-code-fallthrough, enabled under -Wunreachable-code, so that projects that want to warn on unreachable code get this warning but projects that do not care about unreachable code can still use -Wimplicit-fallthrough without having to make changes to their code base. Fixes PR51094. Reviewed By: aaron.ballman, nickdesaulniers Differential Revision: https://reviews.llvm.org/D107933	2021-08-16 17:14:55 -07:00
Rong Xu	9b8425e42c	Reapply commit `b7425e956` The commit b7425e956: [NFC] fix typos is harmless but was reverted by accident. Reapply.	2021-08-16 12:18:40 -07:00
Kostya Kortchinsky	80ed75e7fb	Revert "[NFC] Fix typos" This reverts commit `b7425e956b`.	2021-08-16 11:13:05 -07:00
Rong Xu	b7425e956b	[NFC] Fix typos s/senstive/senstive/g	2021-08-16 10:15:30 -07:00
Kristóf Umann	2d3668c997	[analyzer] MallocChecker: Add a visitor to leave a note on functions that could have, but did not change ownership on leaked memory This is a rather common feedback we get from out leak checkers: bug reports are really short, and are contain barely any usable information on what the analyzer did to conclude that a leak actually happened. This happens because of our bug report minimizing effort. We construct bug reports by inspecting the ExplodedNodes that lead to the error from the bottom up (from the error node all the way to the root of the exploded graph), and mark entities that were the cause of a bug, or have interacted with it as interesting. In order to make the bug report a bit less verbose, whenever we find an entire function call (from CallEnter to CallExitEnd) that didn't talk about any interesting entity, we prune it (click here for more info on bug report generation). Even if the event to highlight is exactly this lack of interaction with interesting entities. D105553 generalized the visitor that creates notes for these cases. This patch adds a new kind of NoStateChangeVisitor that leaves notes in functions that took a piece of dynamically allocated memory that later leaked as parameter, and didn't change its ownership status. Differential Revision: https://reviews.llvm.org/D105553	2021-08-16 16:19:00 +02:00
Kristóf Umann	c019142a89	[analyzer][NFC] Split the main logic of NoStoreFuncVisitor to an abstract NoStateChangeVisitor class Preceding discussion on cfe-dev: https://lists.llvm.org/pipermail/cfe-dev/2021-June/068450.html NoStoreFuncVisitor is a rather unique visitor. As VisitNode is invoked on most other visitors, they are looking for the point where something changed -- change on a value, some checker-specific GDM trait, a new constraint. NoStoreFuncVisitor, however, looks specifically for functions that didn't write to a MemRegion of interesting. Quoting from its comments: /// Put a diagnostic on return statement of all inlined functions /// for which the region of interest \p RegionOfInterest was passed into, /// but not written inside, and it has caused an undefined read or a null /// pointer dereference outside. It so happens that there are a number of other similar properties that are worth checking. For instance, if some memory leaks, it might be interesting why a function didn't take ownership of said memory: void sink(int P) {} // no notes void f() { sink(new int(5)); // note: Memory is allocated // Well hold on, sink() was supposed to deal with // that, this must be a false positive... } // warning: Potential memory leak [cplusplus.NewDeleteLeaks] In here, the entity of interest isn't a MemRegion, but a symbol. The property that changed here isn't a change of value, but rather liveness and GDM traits managed by MalloChecker. This patch moves some of the logic of NoStoreFuncVisitor to a new abstract class, NoStateChangeFuncVisitor. This is mostly calculating and caching the stack frames in which the entity of interest wasn't changed. Descendants of this interface have to define 3 things: What constitutes as a change to an entity (this is done by overriding wasModifiedBeforeCallExit) * What the diagnostic message should be (this is done by overriding maybeEmitNoteFor.) What constitutes as the entity of interest being passed into the function (this is also done by overriding maybeEmitNoteFor.*) Differential Revision: https://reviews.llvm.org/D105553	2021-08-16 15:03:22 +02:00
Sylvestre Ledru	b8d451da86	Add support of the future Debian (Debian 12 - Bookworm) https://wiki.debian.org/DebianBookworm ETA: 2023	2021-08-16 09:11:31 +02:00
Craig Topper	705b1191aa	[X86] Add parentheses around casts in X86 intrinsic headers. Fixes PR51324.	2021-08-14 18:14:44 -07:00
Wang, Pengfei	f1de9d6dae	[X86] AVX512FP16 instructions enabling 2/6 Enable FP16 binary operator instructions. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105264	2021-08-15 08:56:33 +08:00
Craig Topper	d2cb189184	[X86] Use a do {} while (0) in the _MM_EXTRACT_FLOAT implementation. Previously we just used {}, but that doesn't work in situations like this. if (1) _MM_EXTRACT_FLOAT(d, x, n); else ... The semicolon would terminate the if.	2021-08-14 16:41:55 -07:00
Craig Topper	73c4c32767	[X86] Use __builtin_bit_cast _mm_extract_ps instead of type punning through a union. NFC	2021-08-14 16:35:55 -07:00
Owen	f6928cf455	[clang-format] Distinguish K&R C function definition and attribute This is a follow-up to https://reviews.llvm.org/D107950 which missed user-defined types in K&R C. Differential Revision: https://reviews.llvm.org/D107961	2021-08-14 05:00:40 -07:00
Arthur Eubanks	8e9ffa1dc6	[NFC] Cleanup callers of AttributeList::hasAttributes() AttributeList::hasAttributes() is confusing, use clearer methods like hasFnAttrs().	2021-08-13 12:16:52 -07:00
Arthur Eubanks	80ea2bb574	[NFC] Rename AttributeList::getParam/Ret/FnAttributes() -> get*Attributes() This is more consistent with similar methods.	2021-08-13 11:16:52 -07:00
Arthur Eubanks	92ce6db9ee	[NFC] Rename AttributeList::hasFnAttribute() -> hasFnAttr() This is more consistent with similar methods.	2021-08-13 11:09:18 -07:00
Craig Topper	606735c045	[Clang] Add an explicit makeArrayRef to appease gcc 5.4.	2021-08-13 09:42:28 -07:00
Craig Topper	4190d99dfc	[X86] Add parentheses around casts in some of the X86 intrinsic headers. This covers the SSE and AVX/AVX2 headers. AVX512 has a lot more macros due to rounding mode. Fixes part of PR51324. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D107843	2021-08-13 09:36:16 -07:00
David Spickett	98eb348eb3	Revert "[clang-format] Distinguish K&R C function definition and attribute" This reverts commit `de763c4037`. Causing test failures on the Arm/AArch64 quick bots: https://lab.llvm.org/buildbot/#/builders/188/builds/2202	2021-08-13 16:25:32 +01:00
Justas Janickas	cfdfb75c1f	[OpenCL] Clang diagnostics allow reporting C++ for OpenCL version. Some Clang diagnostics could only report OpenCL C version. Because C++ for OpenCL can be used as an alternative to OpenCL C, the text for diagnostics should reflect that. Desrciptions modified for these diagnostics: `err_opencl_unknown_type_specifier` `warn_option_invalid_ocl_version` `err_attribute_requires_opencl_version` `warn_opencl_attr_deprecated_ignored` `ext_opencl_ext_vector_type_rgba_selector` Differential Revision: https://reviews.llvm.org/D107648	2021-08-13 13:55:22 +01:00
Owen	de763c4037	[clang-format] Distinguish K&R C function definition and attribute This is a follow-up to https://reviews.llvm.org/D107950 which missed user-defined types in K&R C. Differential Revision: https://reviews.llvm.org/D107961	2021-08-13 05:28:19 -07:00
Pushpinder Singh	60e07a9568	[AMDGPU][OpenMP] Use llvm-link to link ocml libraries This fixes the 'unused linker option: -lm' warning when compiling program with -c. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D107952	2021-08-13 13:36:57 +05:30
Sarah Purohit	ee620b1743	[clang][Arm] Fix the default floating point ABI for 'armv7-pc-win32-macho' It is incorrect to select the hardware floating point ABI on Mach-O platforms using the Windows triple if the ABI is "apcs-gnu". rdar://81810554 Differential Revision: https://reviews.llvm.org/D107939	2021-08-12 21:46:30 -07:00
Michael Kruse	b1de32d6dd	[OMPIRBuilder] Clarify CanonicalLoopInfo. NFC. Add in-source documentation on how CanonicalLoopInfo is intended to be used. In particular, clarify what parts of a CanonicalLoopInfo is considered part of the loop, that those parts must be side-effect free, and that InsertPoints to instructions outside those parts can be expected to be preserved after method calls implementing loop-associated directives. CanonicalLoopInfo are now invalidated after it does not describe canonical loop anymore and asserts when trying to use it afterwards. In addition, rename `createXYZWorkshareLoop` to `applyXYZWorkshareLoop` and remove the update location to avoid that the impression that they insert something from scratch at that location where in reality its InsertPoint is ignored. createStaticWorkshareLoop does not return a CanonicalLoopInfo anymore. First, it was not a canonical loop in the clarified sense (containing side-effects in form of calls to the OpenMP runtime). Second, it is ambiguous which of the two possible canonical loops it should actually return. It will not be needed before a feature expected to be introduced in OpenMP 6.0 Also see discussion in D105706. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D107540	2021-08-12 21:02:19 -05:00
Nico Weber	1899112037	[gn build] manually port `18f9e25ce1` (AttrDocTable) Also clang ClangAttrEmitter for -gen-clang-attr-doc-table to be like all other tablegen: Produce a .inc file with the generated bits and put the static parts into a regular .cpp file that includes the .inc file.	2021-08-12 21:30:59 -04:00
Yonghong Song	1b194ef1ab	[Clang] add btf_tag attribute A new attribute btf_tag is added. The syntax looks like __attribute__((btf_tag(<string>))) Users may tag a particular structure/member/function/func_parameter/variable declaration with an arbitrary string and the intention is that this string is passed to dwarf so it is available for post-compilation analysis. The string will be also passed to .BTF section if the target is BPF. For each permitted declaration, multiple btf_tag's are allowed. For detailed use cases, please see https://lists.llvm.org/pipermail/llvm-dev/2021-June/151009.html In case that there exist redeclarations, the btf_tag attributes will be accumulated along with different declarations, and the last declaration will contain all attributes. Differential Revision: https://reviews.llvm.org/D106614	2021-08-12 16:34:22 -07:00
Duncan P. N. Exon Smith	b714f73def	Frontend: Add -f{,no-}implicit-modules-uses-lock and -Rmodule-lock Add -cc1 flags `-fmodules-uses-lock` and `-fno-modules-uses-lock` to allow the lock manager to be turned off when building implicit modules. Add `-Rmodule-lock` so that we can see when it's being used. Differential Revision: https://reviews.llvm.org/D95583	2021-08-12 15:58:19 -07:00
Sam McCall	2ff7ca98a9	[clangd] Avoid "expected one compiler job" by picking the first eligible job. This happens in createInvocationWithCommandLine but only clangd currently passes ShouldRecoverOnErorrs (sic). One cause of this (with correct command) is several -arch arguments for mac multi-arch support. Fixes https://github.com/clangd/clangd/issues/827 Differential Revision: https://reviews.llvm.org/D107632	2021-08-13 00:36:30 +02:00
Duncan P. N. Exon Smith	c130300f8b	Frontend: Refactor compileModuleAndReadAST, NFC This renames `compileModuleAndReadAST`, adding a `BehindLock` suffix, and refactors it to significantly reduce nesting. - Split out helpers `compileModuleAndReadASTImpl` and `readASTAfterCompileModule` which have straight-line code that doesn't worry about locks. - Use `break` in the interesting cases of `switch` statements to reduce nesting. - Use early `return`s to reduce nesting. Detangling the compile-and-read logic from the check-for-locks logic should be a net win for readability, although I also have a side motivation of making the locks optional in a follow-up. No functionality change here. Differential Revision: https://reviews.llvm.org/D95581	2021-08-12 15:16:08 -07:00
Sam McCall	ece4e92085	[CodeComplete] Basic code completion for attribute names. Only the bare name is completed, with no args. For args to be useful we need arg names. These are in the tablegen but not currently emitted in usable form, so left this as future work. C++11, C2x, GNU, declspec, MS syntax is supported, with the appropriate spellings of attributes suggested. `#pragma clang attribute` is supported but not terribly useful as we only reach completion if parens are balanced (i.e. the line is not truncated) There's no filtering of which attributes might make sense in this grammatical context (e.g. attached to a function). In code-completion context this is hard to do, and will only work in few cases :-( There's also no filtering by langopts: this is because currently the only way of checking is to try to produce diagnostics, which requires a valid ParsedAttr which is hard to get. This should be fairly simple to fix but requires some tablegen changes to expose the logic without the side-effect. Differential Revision: https://reviews.llvm.org/D107696	2021-08-12 23:49:10 +02:00
Sam McCall	3b99acbff2	[Attributes]: refactor to expose ParsedAttrInfo::acceptsLangOpts. NFC We will use this function to filter code completion of attributes. Differential Revision: https://reviews.llvm.org/D107836	2021-08-12 23:47:01 +02:00
Lei Huang	8930af45c3	[PowerPC] Implement XL compatibility builtin __addex Add builtin and intrinsic for `__addex`. This patch is part of a series of patches to provide builtins for compatibility with the XL compiler. Reviewed By: stefanp, nemanjai, NeHuang Differential Revision: https://reviews.llvm.org/D107002	2021-08-12 16:38:21 -05:00
Sam McCall	18f9e25ce1	[AST][clangd] Expose documentation of Attrs on hover. This adds a method to Attr to get at the documentation programmatically. Differential Revision: https://reviews.llvm.org/D107703	2021-08-12 21:16:37 +02:00
Craig Topper	f66ba5fcef	[Sema] Fix -Wparentheses warning from gcc. NFC	2021-08-12 10:05:27 -07:00
Jon Chesterfield	6a8e5120ab	Revert "[openmp] Annotate tmp variables with omp_thread_mem_alloc" This reverts commit `b6113548c9`.	2021-08-12 17:44:36 +01:00
Jon Chesterfield	b6113548c9	[openmp] Annotate tmp variables with omp_thread_mem_alloc Fixes miscompile of calls into ocml. Bug 51445. The stack variable `double __tmp` is moved to dynamically allocated shared memory by CGOpenMPRuntimeGPU. This is usually fine, but when the variable is passed to a function that is explicitly annotated address_space(5) then allocating the variable off-stack leads to a miscompile in the back end, which cannot decide to move the variable back to the stack from shared. This could be fixed by removing the AS(5) annotation from the math library or by explicitly marking the variables as thread_mem_alloc. The cast to AS(5) is still a no-op once IR is reached. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D107971	2021-08-12 17:30:22 +01:00
Hongtao Yu	ccb5b9bbfb	[CSSPGO] Allow the use of debug-info-for-profiling and pseudo-probe-for-profiling together Previoulsy debug-info-for-profiling and pseudo-probe-for-profiling are mutual exclusive because they compete the dwarf discrimnator for callsites on the IR. This changes allows to use the two switches together. The side effect is that callsite discriminators will be taken by pseudo probe, while discriminators for other instructions are still available for AutoFDO use. This is less than ideal, however, it still allows us a chance to smoothly transition from AutoFDO to CSSPGO, by collecting both profiles from a CSSPGO binary. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D107876	2021-08-12 08:52:49 -07:00
Martin Storsjö	5ed9e5c2c0	[clang] [MinGW] Consider the per-target libc++ include directory too The existing logic for per-target libc++ include directories only seem to exist for the Gnu and Fuchsia drivers, added in `ea12d779bc` / D89013. This is less generic than the corresponding case in the Gnu driver, but matches the existing level of genericity in the MinGW driver (and others too). Differential Revision: https://reviews.llvm.org/D107893	2021-08-12 13:27:09 +03:00
Krasimir Georgiev	45934922fa	[clang-format] improve distinction of K&R function definitions vs attributes After `9da70ab3d4` we saw a few regressions around trailing attribute definitions and in typedefs (examples in the added test cases). There's some tension distinguishing K&R definitions from attributes at the parser level, where we have to decide if we need to put the type of the K&R definition on a new unwrapped line before we have access to the rest of the line, so we're scanning backwards and looking for a pattern like f(a, b). But this type of pattern could also be an attribute macro, or the whole declaration could be a typedef itself. I updated the code to check for a typedef at the beginning of the line and to not consider raw identifiers as possible first K&R declaration (but treated as an attribute macro instead). This is not 100% correct heuristic, but I think it should be reasonably good in practice, where we'll: * likely be in some very C-ish code when using K&R style (e.g., stuff that uses `struct name a;` instead of `name a;` * likely be in some very C++-ish code when using attributes * unlikely mix up the two in the same declaration. Ideally, we should only decide to add the unwrapped line before the K&R declaration after we've scanned the rest of the line an noticed the variable declarations and the semicolon, but the way the parser is organized I don't see a good way to do this in the current parser, which only has good context for the previously visited tokens. I also tried not emitting an unwrapped line there and trying to resolve the situation later in the token annotator and the continuation indenter, and that approach seems promising, but I couldn't make it to work without messing up a bunch of other cases in unit tests. Reviewed By: MyDeveloperDay Differential Revision: https://reviews.llvm.org/D107950	2021-08-12 10:29:06 +02:00
Freddy Ye	6c1468854d	[X86] Reverse _set_ph and _setr_ph 's set order. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D107946	2021-08-12 16:27:04 +08:00
Krasimir Georgiev	0fc27ef196	[clang-format] handle trailing comments in function definition detection A follow-up to `f6bc614546` where we handle the case where the semicolon is followed by a trailing comment. Reviewed By: MyDeveloperDay Differential Revision: https://reviews.llvm.org/D107907	2021-08-12 08:55:54 +02:00
Stefan Pintilie	a614a28772	[PowerPC] Do not define __PRIVILEGED__ We do not want to define __PRIVILEGED__. There is no use case for the definition and gcc does not define it. This patch removes that definition. Reviewed By: lei, NeHuang Differential Revision: https://reviews.llvm.org/D107461	2021-08-11 14:10:22 -05:00
Reid Kleckner	718c632582	Simplify dllexport class member code, NFC We can hoist the check for the dllexport attribute to before the check if this is a static data member or method.	2021-08-11 11:42:20 -07:00
Arnold Schwaighofer	9eb99d2e73	CodeGen: No need to check for isExternC if HasStrictReturn is already false NFC intended. Differential Revision: https://reviews.llvm.org/D107841	2021-08-11 07:42:48 -07:00
Joseph Huber	01d59c0de8	[OpenMP]Fix PR50336: Remove temporary files in the offload bundler tool Temporary files created by the offloading device toolchain are not removed after compilation when using a two-step compilation. The offload-bundler uses a different filename for the device binary than the `.o` file present in the Job's input list. This is not listed as a temporary file so it is never removed. This patch explicitly adds the device binary as a temporary file to consume it. This fixes PR50336. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D107668	2021-08-11 08:50:47 -04:00
Balázs Kéri	9f517fd11e	[clang][analyzer] Improve bug report in alpha.security.ReturnPtrRange Add some notes and track of bad return value. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D107051	2021-08-11 13:04:55 +02:00
Petr Hosek	389dc94d4b	[InstrProfiling] Generate runtime hook for Fuchsia When none of the translation units in the binary have been instrumented we shouldn't need to link the profile runtime. However, because we pass -u__llvm_profile_runtime on Linux and Fuchsia, the runtime would still be pulled in and incur some overhead. On Fuchsia which uses runtime counter relocation, it also means that we cannot reference the bias variable unconditionally. This change modifies the InstrProfiling pass to pull in the profile runtime only when needed by declaring the __llvm_profile_runtime symbol in the translation unit only when needed. For now we restrict this only for Fuchsia, but this can be later expanded to other platforms. This approach was already used prior to `9a041a7522`, but we changed it to always generate the __llvm_profile_runtime due to a TAPI limitation, but that limitation may no longer apply, and it certainly doesn't apply on platforms like Fuchsia. Differential Revision: https://reviews.llvm.org/D98061	2021-08-10 23:21:15 -07:00
Christopher Di Bella	c874dd5362	[llvm][clang][NFC] updates inline licence info Some files still contained the old University of Illinois Open Source Licence header. This patch replaces that with the Apache 2 with LLVM Exception licence. Differential Revision: https://reviews.llvm.org/D107528	2021-08-11 02:48:53 +00:00
Vince Bridgers	d39ebdae67	[analyzer] Cleanup a FIXME in SValBuilder.cpp This change follows up on a FIXME submitted with D105974. This change simply let's the reference case fall through to return a concrete 'true' instead of a nonloc pointer of appropriate length set to NULL. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D107720	2021-08-10 16:12:52 -05:00
Sam McCall	0cecb42e4e	[Sema] Include full range of the switch condition in -Wswitch diagnostic	2021-08-10 22:50:40 +02:00
Jake Egan	3b39fa3e28	[AIX] Define __HOS_AIX__ macro only for AIX target %%% This patch defines the macro __HOS_AIX__ when the target is AIX and without any dependency on the host. The macro indicates that the host is AIX. Defining the macro will help minimize porting pain for existing code compiled with xlc/xlC. xlC never shipped cross-compiling support, so the difference is not observable anyway. %%% This is a follow up to the discussion in https://reviews.llvm.org/D107242. Reviewed By: cebowleratibm, joerg Differential Revision: https://reviews.llvm.org/D107825	2021-08-10 13:03:17 -04:00
Alex Orlov	638dcea010	[clang] Implement P0692R1 from C++20 (access checking on specializations and instantiations) This patch implements paper P0692R1 from the C++20 standard. Disable usual access checking rules to template argument names in a declaration of partial specializations, explicit instantiation or explicit specialization (C++20 13.7.5/10, 13.9.1/6). Fixes: https://llvm.org/PR37424 This patch also implements option A from this paper P0692R1 from the C++20 standard. This patch follows the @rsmith suggestion from D78404. Reviewed By: krisb Differential Revision: https://reviews.llvm.org/D92024	2021-08-10 19:20:50 +04:00
Brian Cain	888876ba27	[clang] [hexagon] Add resource include dir	2021-08-10 08:37:58 -05:00
Sam McCall	13a86c2bb4	[Sema] Preserve invalid CXXCtorInitializers using RecoveryExpr in initializer Before this patch, CXXCtorInitializers that don't typecheck get discarded in most cases. In particular: - typos that can't be corrected don't turn into RecoveryExpr. The full expr disappears instead, and without an init expr we discard the node. - initializers that fail initialization (e.g. constructor overload resolution) are discarded too. This patch addresses both these issues (a bit clunkily and repetitively, for member/base/delegating initializers) It does not preserve any AST nodes when the member/base can't be resolved or other problems of that nature. That breaks invariants of CXXCtorInitializer itself, and we don't have a "weak" RecoveryCtorInitializer like we do for Expr. I believe the changes to diagnostics in existing tests are improvements. (We're able to do some analysis on the non-broken parts of the initializer) Differential Revision: https://reviews.llvm.org/D101641	2021-08-10 15:16:52 +02:00
Sam McCall	bd63977ca9	[Parser] Fix attr infloop on "int x [[c" Similar to `ad2d6bbb14` Differential Revision: https://reviews.llvm.org/D107693	2021-08-10 15:03:08 +02:00
Sven van Haastregt	5de6b1acb5	[OpenCL] Make pipes and workgroup optional for -fdeclare-opencl-builtins Align guards of these builtins with opencl-c.h.	2021-08-10 13:01:47 +01:00
Wang, Pengfei	6f7f5b54c8	[X86] AVX512FP16 instructions enabling 1/6 1. Enable FP16 type support and basic declarations used by following patches. 2. Enable new instructions VMOVW and VMOVSH. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105263	2021-08-10 12:46:01 +08:00
Ettore Tiotto	41e3ac398c	[AIX]: Fix option processing for -b Code added by D106688 has a problem. It passes the option -bxyz to the system linker as -b xyz xyz (duplication of the string 'xyz' is incorrect). This patch fixes that oversight. Reviewed by: hubert.reinterpretcast, jsji Differential Revision: https://reviews.llvm.org/D107786	2021-08-09 19:52:31 -04:00
Anshil Gandhi	a35008955f	[HIP] Allow target addr space in target builtins This patch allows target specific addr space in target builtins for HIP. It inserts implicit addr space cast for non-generic pointer to generic pointer in general, and inserts implicit addr space cast for generic to non-generic for target builtin arguments only. It is NFC for non-HIP languages. Differential Revision: https://reviews.llvm.org/D102405	2021-08-09 16:38:04 -06:00
Michael Liao	6ec36d18ec	[cuda] Mark builtin texture/surface reference variable as 'externally_initialized'. - They need to be preserved even if there's no reference within the device code as the host code may need to initialize them based on the application logic. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D107718	2021-08-09 13:27:40 -04:00
Craig Topper	618543bb12	[clang][NFC] Fix a -Wparentheses warning.	2021-08-07 08:56:31 -07:00
Roger Ferrer Ibanez	bfb77364d0	[OpenMP] Fix accidental reuse of VLA size We were using an OpaqueValueExpr allocated on the stack to store the size of a VLA. Because the VLASizeMap in CodegenFunction uses the address of the expression to avoid recomputing VLAs, we were accidentally reusing an earlier llvm::Value. This led to invalid LLVM IR. This is a temporary solution until VLASizeMap can be pushed and popped based on the context. Differential Revision: https://reviews.llvm.org/D107666	2021-08-07 05:55:27 +00:00
Matt Jacobson	71e71067f3	[AVR][clang] Add '$SYSROOT/avr' to possible avr-libc locations Reviewed by: benshi001 Differential Revision: https://reviews.llvm.org/D107672	2021-08-07 10:24:14 +08:00
Dave Airlie	1854db74c5	opencl-c.h: add 3.0 optional extension support for a few more bits These 3 are fairly simple, pipes, workgroups and subgroups. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D105858	2021-08-07 09:25:00 +10:00
Joseph Huber	41a6b50c25	[OpenMP]Fix PR51349: Remove AlwaysInline for if regions. After D94315 we add the `NoInline` attribute to the outlined function to handle data environments in the OpenMP if clause. This conflicted with the `AlwaysInline` attribute added to the outlined function. for better performance in D106799. The data environments should ideally not require NoInline, but for now this fixes PR51349. Reviewed By: mikerice Differential Revision: https://reviews.llvm.org/D107649	2021-08-06 17:53:04 -04:00
Nick Desaulniers	d238b60285	[Clang][DiagnosticSemaKinds] combine diagnostic texts The diagnostic texts for warning on attributes that don't appear on the initial declaration is generally useful. We'd like to re-use it in D106030, but first let's combine two that already are very similar so we may re-use it a third time in that commit. Also, fix a few places that were using notePreviousDefinition to point to declarations, to instead use diag::note_previous_declaration. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D107613	2021-08-06 13:58:21 -07:00
Sam McCall	c8f148274f	Reapply "Support Attr in DynTypedNode and ASTMatchers." This reverts commit `3241680f11`. Fixed mangled post-test formatting :-(	2021-08-06 22:30:32 +02:00
Sam McCall	3241680f11	Revert "Support Attr in DynTypedNode and ASTMatchers." This reverts commit `a4bdcdadc6`. Fails bots: https://lab.llvm.org/buildbot/#/builders/109/builds/20231/steps/6/logs/stdio	2021-08-06 22:27:54 +02:00
Sam McCall	a4bdcdadc6	Support Attr in DynTypedNode and ASTMatchers. Differential Revision: https://reviews.llvm.org/D89743	2021-08-06 22:06:04 +02:00
Thomas Johnson	f8a4495149	[ARC] Add codegen for llvm.ctlz intrinsic for the ARC backend Differential Revision: https://reviews.llvm.org/D107611	2021-08-06 12:18:06 -07:00
Zahira Ammarguellat	4389a413e2	Revert "[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on" This reverts commit `48ad446a0f`.	2021-08-06 12:01:47 -07:00
Artem Belevich	6a9cf21f5a	[CUDA, MemCpyOpt] Add a flag to force-enable memcpyopt and use it for CUDA. Attempt to enable MemCpyOpt unconditionally in D104801 uncovered the fact that there are users that do not expect LLVM to materialize `memset` intrinsic. While other passes can do that, too, MemCpyOpt triggers it more frequently and breaks sanitizers and some downstream users. For now introduce a flag to force-enable the flag and opt-in only CUDA compilation with NVPTX back-end. Differential Revision: https://reviews.llvm.org/D106401	2021-08-06 11:13:52 -07:00
Sean Fertile	dddd524bb8	Revert "[PowerPC][AIX] Limit attribute aligned to 4096." This reverts commit `5181be344a`. Break libcxx type_traits header which uses aligned storage with alignments greater than 4096. Reverting untill we can fix the header.	2021-08-06 13:54:50 -04:00
Jake Egan	41bcfe8174	[AIX] Define _ARCH_PPC64 macro for 32-bit %%% The macro _ARCH_PPC64 is already defined for 64-bit, but this patch defines it for 32-bit on AIX to follow xlc. See: https://www.ibm.com/docs/en/xl-c-and-cpp-aix/13.1.0?topic=features-macros-related-architecture-settings Note: This change creates a discrepancy between GCC, which defines _ARCH_PPC64 only for 64-bit mode. Tested with SPEC. %%% Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107244	2021-08-06 10:42:44 -04:00
Jake Egan	869d07ee88	[AIX] Define __HOS_AIX__ macro %%% This patch defines __HOS_AIX__ macro for AIX in case of a cross compiler implementation. %%% Tested with SPEC. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107242	2021-08-06 10:40:13 -04:00
Corentin Jabot	131b4620ee	Implement P1937 consteval in unevaluated contexts In an unevaluated contexts, consteval functions should not be immediately evaluated.	2021-08-06 10:29:28 -04:00
Corentin Jabot	3c8e94bc20	Disallow narrowing conversions to bool in noexcept specififers Completes the support for P1401R5.	2021-08-06 10:26:39 -04:00
Jake Egan	3189dd205a	[AIX] Define __THW_PPC__ macro %%% This patch defines the macro __THW_PPC__ for AIX. %%% Tested with SPEC. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107243	2021-08-06 09:52:26 -04:00
Jake Egan	420e1d4cf4	[AIX] Define __THW_BIG_ENDIAN__ macro %%% This patch defines the macro __THW_BIG_ENDIAN__ for AIX. %%% Tested with SPEC. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D107241	2021-08-06 09:46:59 -04:00
Justas Janickas	a5a2f05dcc	[C++4OpenCL] Introduces __remove_address_space utility This change provides a way to conveniently declare types that have address space qualifiers removed. Since OpenCL adds address spaces implicitly even when they are not specified in source, it is useful to allow deriving address space unqualified types. Fixes llvm.org/PR45326 Differential Revision: https://reviews.llvm.org/D106785	2021-08-06 10:40:22 +01:00
Serge Pavlov	4c4093e6e3	Introduce intrinsic llvm.isnan This is recommit of the patch `16ff91ebcc`, reverted in `0c28a7c990` because it had an error in call of getFastMathFlags (base type should be FPMathOperator but not Instruction). The original commit message is duplicated below: Clang has builtin function '__builtin_isnan', which implements C library function 'isnan'. This function now is implemented entirely in clang codegen, which expands the function into set of IR operations. There are three mechanisms by which the expansion can be made. * The most common mechanism is using an unordered comparison made by instruction 'fcmp uno'. This simple solution is target-independent and works well in most cases. It however is not suitable if floating point exceptions are tracked. Corresponding IEEE 754 operation and C function must never raise FP exception, even if the argument is a signaling NaN. Compare instructions usually does not have such property, they raise 'invalid' exception in such case. So this mechanism is unsuitable when exception behavior is strict. In particular it could result in unexpected trapping if argument is SNaN. * Another solution was implemented in https://reviews.llvm.org/D95948. It is used in the cases when raising FP exceptions by 'isnan' is not allowed. This solution implements 'isnan' using integer operations. It solves the problem of exceptions, but offers one solution for all targets, however some can do the check in more efficient way. * Solution implemented by https://reviews.llvm.org/D96568 introduced a hook 'clang::TargetCodeGenInfo::testFPKind', which injects target specific code into IR. Now only SystemZ implements this hook and it generates a call to target specific intrinsic function. Although these mechanisms allow to implement 'isnan' with enough efficiency, expanding 'isnan' in clang has drawbacks: * The operation 'isnan' is hidden behind generic integer operations or target-specific intrinsics. It complicates analysis and can prevent some optimizations. * IR can be created by tools other than clang, in this case treatment of 'isnan' has to be duplicated in that tool. Another issue with the current implementation of 'isnan' comes from the use of options '-ffast-math' or '-fno-honor-nans'. If such option is specified, 'fcmp uno' may be optimized to 'false'. It is valid optimization in general, but it results in 'isnan' always returning 'false'. For example, in some libc++ implementations the following code returns 'false': std::isnan(std::numeric_limits<float>::quiet_NaN()) The options '-ffast-math' and '-fno-honor-nans' imply that FP operation operands are never NaNs. This assumption however should not be applied to the functions that check FP number properties, including 'isnan'. If such function returns expected result instead of actually making checks, it becomes useless in many cases. The option '-ffast-math' is often used for performance critical code, as it can speed up execution by the expense of manual treatment of corner cases. If 'isnan' returns assumed result, a user cannot use it in the manual treatment of NaNs and has to invent replacements, like making the check using integer operations. There is a discussion in https://reviews.llvm.org/D18513#387418, which also expresses the opinion, that limitations imposed by '-ffast-math' should be applied only to 'math' functions but not to 'tests'. To overcome these drawbacks, this change introduces a new IR intrinsic function 'llvm.isnan', which realizes the check as specified by IEEE-754 and C standards in target-agnostic way. During IR transformations it does not undergo undesirable optimizations. It reaches instruction selection, where is lowered in target-dependent way. The lowering can vary depending on options like '-ffast-math' or '-ffp-model' so the resulting code satisfies requested semantics. Differential Revision: https://reviews.llvm.org/D104854	2021-08-06 14:32:27 +07:00
Matt Jacobson	dae7adda94	[AVR][clang] Pass '-fno-use-init-array' to cc1 as default On AVR, '.ctors' is used, not '.init_array'. Make this the default unless specifically overridden by driver argument. This matches gcc, and it matches the behavior in (e.g.) the NetBSD driver (for certain OS variants). Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D107610	2021-08-06 10:14:23 +08:00
Steven Wan	a91916500d	[AIX] "aligned" attribute should not decrease type alignment returned by __alignof__ `__alignof__(x)` always returns `ABIAlign` if the "x" is marked `__attribute__((aligned()))`. However, the "aligned" attribute should only increase the alignment of a struct, or struct member, unless it's used together with the "packed" attribute, or used as a part of a typedef, in which case, the "aligned" attribute can both increase and decrease alignment. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D107598	2021-08-05 18:18:58 -04:00
Fangrui Song	c38efb4899	[clang] Implement -falign-loops=N (N is a power of 2) for non-LTO GCC supports multiple forms of -falign-loops=. -falign-loops= is currently ignored in Clang. This patch implements the simplest but the most useful form where N is a power of 2. The underlying implementation uses a `llvm::TargetOptions` option for now. Bitcode generation ignores this option. Differential Revision: https://reviews.llvm.org/D106701	2021-08-05 12:17:50 -07:00
Jon Chesterfield	509854b69c	[clang] Replace asm with __asm__ in cuda header Asm is a gnu extension for C, so at present -fopenmp -std=c99 and similar fail to compile on nvptx, bug 51344 Changing to `__asm__` or `__asm` works for openmp, all three appear to work for cuda. Suggesting `__asm__` here as `__asm` is used by MSVC with different syntax, so this should make for better error diagnostics if the header is passed to a compiler other than clang. Reviewed By: tra, emankov Differential Revision: https://reviews.llvm.org/D107492	2021-08-05 18:46:57 +01:00
Jennifer Yu	6b0f35931a	Fix signal during the call to checkOpenMPLoop. The root problem is a null pointer is accessed during the call to checkOpenMPLoop, because loop up bound expr is an error expression due to error diagnostic was emit early. To fix this, in setLCDeclAndLB, setUB and setStep instead return false, return true when LB, UB or Step contains Error, so that the checking is stopped in checkOpenMPLoop. Differential Revision: https://reviews.llvm.org/D107385	2021-08-05 08:59:35 -07:00
Sean Fertile	f888e442bc	[PowerPC][AIX] attribute aligned cannot decrease align of a vector var. On AIX an aligned attribute cannot decrease the alignment of a variable when placed on a variable declaration of vector type. Differential Revision: https://reviews.llvm.org/D107522	2021-08-05 11:15:12 -04:00
Sean Fertile	5181be344a	[PowerPC][AIX] Limit attribute aligned to 4096. Limit the maximum alignment for attribute aligned to 4096 to match the limit of the .align pseudo op in the system assembler. Differential Revision: https://reviews.llvm.org/D107497	2021-08-05 09:51:16 -04:00
Aaron Ballman	530ea28fef	Correct a lot of diagnostic wordings for the driver Clang diagnostics should not start with a capital letter or use trailing punctuation (https://clang.llvm.org/docs/InternalsManual.html#the-format-string), but quite a few driver diagnostics were not following this advice. This corrects the grammar and punctuation to improve consistency, but does not change the circumstances under which the diagnostics are produced.	2021-08-05 07:04:55 -04:00
Sven van Haastregt	92b00ffe0f	[OpenCL] Reduce duplicate defs by using multiclasses; NFC Builtin definitions with pointer arguments were duplicated to provide overloads differing in the pointer argument's address space. Reduce this duplication by capturing the definitions in multiclasses. This still results in the same number of builtins in the generated tables, but the description is more concise now. Differential Revision: https://reviews.llvm.org/D107151	2021-08-05 11:06:33 +01:00
Anshil Gandhi	39dac1f7f6	[clang] Add clang builtins support for gfx90a Implement target builtins for gfx90a including fadd64, fadd32, add2h, max and min on various global, flat and ds address spaces for which intrinsics are implemented. Differential Revision: https://reviews.llvm.org/D106909	2021-08-05 02:08:06 -06:00
Pavel Asyutchenko	7df405e079	Apply -fmacro-prefix-map to __builtin_FILE() This matches the behavior of GCC. Patch does not change remapping logic itself, so adding one simple smoke test should be enough. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D107393	2021-08-04 16:42:14 -07:00
Dave Airlie	14cb67862a	[OpenCL] allow generic address and non-generic defs for CL3.0 This allows both sets of definitions to exist on CL 3.0 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D107318	2021-08-05 07:32:45 +10:00
Bradley Smith	e57e1e4e00	[clang][AArch64][SVE] Avoid going through memory for fixed/scalable predicate casts For fixed SVE types, predicates are represented using vectors of i8, where as for scalable types they are represented using vectors of i1. We can avoid going through memory for casts between these by bitcasting the i1 scalable vectors to/from a scalable i8 vector of matching size, which can then use the existing vector insert/extract logic. Differential Revision: https://reviews.llvm.org/D106860	2021-08-04 16:10:37 +00:00
Pushpinder Singh	f3eb5f900d	[AMDGPU][OpenMP] Wrap amdgcn declare variant inside ifdef This fixes the issue https://bugs.llvm.org/show_bug.cgi?id=51337 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D107468	2021-08-04 15:24:46 +00:00
Sean Fertile	b8f612e780	[PowerPC][AIX] Packed zero-width bitfields do not affect alignment. Zero-width bitfields on AIX pad out to the natral alignment boundary but do not change the containing records alignment. Differential Revision: https://reviews.llvm.org/D106900	2021-08-04 11:03:25 -04:00

1 2 3 4 5 ...

63217 Commits