llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Tatham	1d78294250	[Sema][BFloat] Forbid arithmetic on vectors of bfloat. Vectors of bfloat are a storage format only; you're supposed to explicitly convert them to a wider type to do arithmetic on them. But currently, if you write something like bfloat16x4_t test(bfloat16x4_t a, bfloat16x4_t b) { return a + b; } then the clang frontend accepts it without error, and (ARM or AArch64) isel fails to generate code for it. Added a rule in Sema that forbids the attempt from even being made, and tests that check it. In particular, we also outlaw arithmetic between vectors of bfloat and any other vector type. Patch by Luke Cheeseman. Reviewed By: LukeGeeson Differential Revision: https://reviews.llvm.org/D85009	2020-08-07 11:25:19 +01:00
Haojian Wu	67174765db	[AST][RecoveryExpr] Fix the missing type when rebuilding RecoveryExpr in TreeTransform. Differential Revision: https://reviews.llvm.org/D85423	2020-08-07 11:15:19 +02:00
Thomas Preud'homme	6dc863cd85	[clang, test, Darwin] Fix tests expecting Darwin target Clang tests Driver/apple-arm64-arch.c and Driver/darwin-warning-options.c test Darwin driver functionality but only require the host system to be Darwin. This leads the tests to fail when building a cross-compiler on Darwin and to be marked unsupported when cross-compiling to Darwin from another system. This commit changes the requirements for those tests to require the target to be Darwin. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D85367	2020-08-07 09:21:31 +01:00
Christian Kühnel	f3cc4df51d	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit `1adc494bce`. This patch broke the Windows compilation on buildbot and pre-merge testing: http://lab.llvm.org:8011/builders/mlir-windows/builds/5945 https://buildkite.com/llvm-project/llvm-master-build/builds/780	2020-08-07 09:36:49 +02:00
biplmish	cce1b0e891	[PowerPC] Implement Vector Extract Low/High Order Builtins in LLVM/Clang This patch implements the function prototypes vec_extractl and vec_extracth in altivec.h to utilize the vector extract double element instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D84622	2020-08-07 01:02:29 -05:00
Richard Smith	ed5a18fc03	PR30738: Implement two-phase name lookup for fold-expressions.	2020-08-06 16:56:39 -07:00
Richard Smith	d6492d8744	Add -Wtautological-value-range-compare warning. This warning diagnoses cases where an expression is compared to a constant, and the comparison is tautological due to the form of the expression (but not merely due to its type). This applies in cases such as comparisons of bit-fields and the result of bit-masks. The new warning is added to the Clang diagnostic group -Wtautological-constant-in-range-compare but not to the formerly-equivalent GCC-compatibility diagnostic group -Wtype-limits, which retains its old meaning of diagnosing only tautological comparisons to extremal values of a type (eg, int > INT_MAX). Reviewed By: rtrieu Differential Revision: https://reviews.llvm.org/D85256	2020-08-06 13:28:50 -07:00
Matt Arsenault	30eeb742f1	clang: Use byref for aggregate kernel arguments Add address space to indirect abi info and use it for kernels. Previously, indirect arguments assumed assumed a stack passed object in the alloca address space using byval. A stack pointer is unsuitable for kernel arguments, which are passed in a separate, constant buffer with a different address space. Start using the new byref for aggregate kernel arguments. Previously these were emitted as raw struct arguments, and turned into loads in the backend. These will lower identically, although with byref you now have the option of applying an explicit alignment. In the future, a reasonable implementation would use byref for all kernel arguments (this would be a practical problem at the moment due to losing things like noalias on pointer arguments). This is mostly to avoid fighting the optimizer's treatment of aggregate load/store. SROA and instcombine both turn aggregate loads and stores into a long sequence of element loads and stores, rather than the optimizable memcpy I would expect in this situation. Now an explicit memcpy will be introduced up-front which is better understood and helps eliminate the alloca in more situations. This skips using byref in the case where HIP kernel pointer arguments in structs are promoted to global pointers. At minimum an additional patch is needed to allow coercion with indirect arguments. This also skips using it for OpenCL due to the current workaround used to support kernels calling kernels. Distinct function bodies would need to be generated up front instead of emitting an illegal call.	2020-08-06 15:52:26 -04:00
Aaron En Ye Shi	96c2d5e99e	[HIP] Ignore invalid ar linker options Instead of accepting the same arguments as regular linker, the static linker will only accept input files. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D85442	2020-08-06 17:39:41 +00:00
Alexey Bataev	0af7835eae	[OPENMP]Redesign of OMPExecutableDirective/OMPDeclarativeDirective representation. Summary: Introduced OMPChildren class to handle all associated clauses, statement and child expressions/statements. It allows to represent some directives more correctly (like flush, depobj etc. with pseudo clauses, ordered depend directives, which are standalone, and target data directives). Also, it will make easier to avoid using of CapturedStmt in directives, if required (atomic, tile etc. directives). Also, it simplifies serialization/deserialization of the executable/declarative directives. Reduces number of allocation operations for mapper declarations. Reviewers: jdoerfert Subscribers: yaxunl, guansong, jfb, cfe-commits, sstefan1, aaron.ballman, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D83261	2020-08-06 12:25:19 -04:00
Alex Richardson	e150d2cab8	[update_cc_test_checks.py] Add test for D84511 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D85115	2020-08-06 09:15:56 +01:00
Jan Korous	820e8d8656	[Analyzer][WebKit] UncountedLambdaCaptureChecker Differential Revision: https://reviews.llvm.org/D82837	2020-08-05 15:23:55 -08:00
Petr Hosek	1adc494bce	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of `abb0075` with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-05 16:07:11 -07:00
Richard Smith	076b120beb	CFG: Destroy temporaries in (a,b) expression in the correct order.	2020-08-05 14:52:53 -07:00
Thomas Lively	f496950001	[WebAssembly] Fix types in wasm_simd128.h and add tests `47f7174ffa` changed the types used in the Wasm SIMD builtin functions, but not all of their uses in wasm_simd128.h were updated. This commit fixes wasm_simd128.h and adds tests to make sure similar problems do not pass uncaught in the future. Differential Revision: https://reviews.llvm.org/D85347	2020-08-05 14:00:01 -07:00
Stanislav Mekhanoshin	ea7d0e2996	[AMDGPU] gfx1031 target Differential Revision: https://reviews.llvm.org/D85337	2020-08-05 12:36:26 -07:00
Arthur Eubanks	8dc43852e4	[Driver] Accept -fno-lto in clang-cl Some compiler-rt tests check for the presence of the compiler accepting -fno-lto to add that flag. Otherwise some tests don't link due to -flto mismatch between compiling and linking. $ cmake ... -DLLVM_ENABLE_LTO=Thin ... $ ninja projects/compiler-rt/lib/sanitizer_common/tests/Sanitizer-x86_64-Test.exe previously failed, now links. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D85252	2020-08-05 12:24:10 -07:00
Erich Keane	2143a90b34	Fix _ExtInt(1) to be a i1 in memory. The _ExtInt(1) in getTypeForMem was hitting the bool logic for expanding to an 8 bit value. The result was an assert, or store i1 %0, i8* %2, align 1 since the parameter IS an i1. This patch changes the 'forMem' test to exclude ext-int from the bool test.	2020-08-05 10:54:51 -07:00
Joel E. Denny	002d61db2b	[OpenMP] Fix `present` for exit from `omp target data` Without this patch, the following example fails but shouldn't according to OpenMP TR8: ``` #pragma omp target enter data map(alloc:i) #pragma omp target data map(present, alloc: i) { #pragma omp target exit data map(delete:i) } // fails presence check here ``` OpenMP TR8 sec. 2.22.7.1 "map Clause", p. 321, L23-26 states: > If the map clause appears on a target, target data, target enter > data or target exit data construct with a present map-type-modifier > then on entry to the region if the corresponding list item does not > appear in the device data environment an error occurs and the > program terminates. There is no corresponding statement about the exit from a region. Thus, the `present` modifier should: 1. Check for presence upon entry into any region, including a `target exit data` region. This behavior is already implemented correctly. 2. Should not check for presence upon exit from any region, including a `target` or `target data` region. Without this patch, this behavior is not implemented correctly, breaking the above example. In the case of `target data`, this patch fixes the latter behavior by removing the `present` modifier from the map types Clang generates for the runtime call at the end of the region. In the case of `target`, we have not found a valid OpenMP program for which such a fix would matter. It appears that, if a program can guarantee that data is present at the beginning of a `target` region so that there's no error there, that data is also guaranteed to be present at the end. This patch adds a comment to the runtime to document this case. Reviewed By: grokos, RaviNarayanaswamy, ABataev Differential Revision: https://reviews.llvm.org/D84422	2020-08-05 10:03:31 -04:00
Bruno Ricci	19701458d4	[clang][nearly-NFC] Remove some superfluous uses of NamedDecl::getNameAsString `OS << ND->getDeclName();` is equivalent to `OS << ND->getNameAsString();` without the extra temporary string. This is not quite a NFC since two uses of `getNameAsString` in a diagnostic are replaced, which results in the named entity being quoted with additional "'"s (ie: 'var' instead of var).	2020-08-05 13:54:37 +01:00
Bruno Ricci	98b4b45705	[clang][NFC] Add a test showcasing an unnamed template parameter in a diagnostic	2020-08-05 13:54:36 +01:00
Yaxun (Sam) Liu	45f2a56856	[CUDA][HIP] Support accessing static device variable in host code for -fno-gpu-rdc nvcc supports accessing file-scope static device variables in host code by host APIs like cudaMemcpyToSymbol etc. CUDA/HIP let users access device variables in host code by shadow variables. In host compilation, clang emits a shadow variable for each device variable, and calls __RegisterVariable to register it in init function. The address of the shadow variable and the device side mangled name of the device variable is passed to __RegisterVariable. Runtime looks up the symbol by name in the device binary to find the address of the device variable. The problem with static device variables is that they have internal linkage, therefore their name may be changed by the linker if there are multiple symbols with the same name. Also they end up as local symbols in the elf file, whereas the runtime only looks up the global symbols. Another reason for making the static device variables external linkage is that they may be initialized externally by host code and their final value may be accessed by host code after kernel execution, therefore they actually have external linkage. Giving them internal linkage will cause incorrect optimizations on them. To support accessing static device var in host code for -fno-gpu-rdc mode, change the intnernal linkage to external linkage. The name does not need change since there is only one TU for -fno-gpu-rdc mode. Also the externalization is done only if the device static var is referenced by host code. Differential Revision: https://reviews.llvm.org/D80858	2020-08-05 07:57:38 -04:00
Hans Wennborg	3ab01550b6	Revert "[CMake] Simplify CMake handling for zlib" This quietly disabled use of zlib on Windows even when building with -DLLVM_ENABLE_ZLIB=FORCE_ON. > Rather than handling zlib handling manually, use find_package from CMake > to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, > HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is > set to YES, which requires the distributor to explicitly select whether > zlib is enabled or not. This simplifies the CMake handling and usage in > the rest of the tooling. > > This is a reland of `abb0075` with all followup changes and fixes that > should address issues that were reported in PR44780. > > Differential Revision: https://reviews.llvm.org/D79219 This reverts commit `10b1b4a231` and follow-ups `64d99cc6ab` and `f9fec0447e`.	2020-08-05 12:31:44 +02:00
Yonghong Song	00602ee7ef	BPF: simplify IR generation for __builtin_btf_type_id() This patch simplified IR generation for __builtin_btf_type_id(). For __builtin_btf_type_id(obj, flag), previously IR builtin looks like if (obj is a lvalue) llvm.bpf.btf.type.id(obj.ptr, 1, flag) !type else llvm.bpf.btf.type.id(obj, 0, flag) !type The purpose of the 2nd argument is to differentiate __builtin_btf_type_id(obj, flag) where obj is a lvalue vs. __builtin_btf_type_id(obj.ptr, flag) Note that obj or obj.ptr is never used by the backend and the `obj` argument is only used to derive the type. This code sequence is subject to potential llvm CSE when - obj is the same .e.g., nullptr - flag is the same - metadata type is different, e.g., typedef of struct "s" and strust "s". In the above, we don't want CSE since their metadata is different. This patch change IR builtin to llvm.bpf.btf.type.id(seq_num, flag) !type and seq_num is always increasing. This will prevent potential llvm CSE. Also report an error if the type name is empty for remote relocation since remote relocation needs non-empty type name to do relocation against vmlinux. Differential Revision: https://reviews.llvm.org/D85174	2020-08-04 16:29:42 -07:00
Dan Gohman	47f7174ffa	[WebAssembly] Use "signed char" instead of "char" in SIMD intrinsics. This allows people to use `int8_t` instead of `char`, -funsigned-char, and generally decouples SIMD from the specialness of `char`. And it makes intrinsics like `__builtin_wasm_add_saturate_s_i8x16` and `__builtin_wasm_add_saturate_u_i8x16` use signed and unsigned element types, respectively. Differential Revision: https://reviews.llvm.org/D85074	2020-08-04 12:48:40 -07:00
Thorsten Schuett	e18c6ef6b4	[clang] improve diagnostics for misaligned and large atomics "Listing the alignment and access size (== expected alignment) in the warning seems like a good idea." solves PR 46947 struct Foo { struct Bar { void * a; void * b; }; Bar bar; }; struct ThirtyTwo { struct Large { void * a; void * b; void * c; void * d; }; Large bar; }; void braz(Foo foo, ThirtyTwo braz) { Foo::Bar bar; __atomic_load(&foo->bar, &bar, __ATOMIC_RELAXED); ThirtyTwo::Large foobar; __atomic_load(&braz->bar, &foobar, __ATOMIC_RELAXED); } repro.cpp:21:3: warning: misaligned atomic operation may incur significant performance penalty; the expected (16 bytes) exceeds the actual alignment (8 bytes) [-Watomic-alignment] __atomic_load(&foo->bar, &bar, __ATOMIC_RELAXED); ^ repro.cpp:24:3: warning: misaligned atomic operation may incur significant performance penalty; the expected (32 bytes) exceeds the actual alignment (8 bytes) [-Watomic-alignment] __atomic_load(&braz->bar, &foobar, __ATOMIC_RELAXED); ^ repro.cpp:24:3: warning: large atomic operation may incur significant performance penalty; the access size (32 bytes) exceeds the max lock-free size (16 bytes) [-Watomic-alignment] 3 warnings generated. Differential Revision: https://reviews.llvm.org/D85102	2020-08-04 11:10:29 -07:00
Yonghong Song	6d67506964	[clang][BPF] support type exist/size and enum exist/value relocations This patch added the following additional compile-once run-everywhere (CO-RE) relocations: - existence/size of typedef, struct/union or enum type - enum value and enum value existence These additional relocations will make CO-RE bpf programs more adaptive for potential kernel internal data structure changes. For existence/size relocations, the following two code patterns are supported: 1. uint32_t __builtin_preserve_type_info((<type> )0, flag); 2. <type> var; uint32_t __builtin_preserve_field_info(var, flag); flag = 0 for existence relocation and flag = 1 for size relocation. For enum value existence and enum value relocations, the following code pattern is supported: uint64_t __builtin_preserve_enum_value((<enum_type> )<enum_value>, flag); flag = 0 means existence relocation and flag = 1 for enum value. relocation. In the above <enum_type> can be an enum type or a typedef to enum type. The <enum_value> needs to be an enumerator value from the same enum type. The return type is uint64_t to permit potential 64bit enumerator values. Differential Revision: https://reviews.llvm.org/D83242	2020-08-04 08:39:53 -07:00
Erich Keane	0a8ac91a08	Permit nowthrow and nonnull with multiversioning. Some shipped versions of stdlib.h use nonnull and nothrow with function multiversioning. Support these, as they are generally harmless.	2020-08-04 07:40:27 -07:00
Erich Keane	961da69d7e	Improve diagnostics for disallowed attributes used with multiversioning Since we permit using SOME attributes (at the moment, just 1) with multiversioning, we should improve the message as it still implies that no attributes should be combined with multiversioning.	2020-08-04 07:40:27 -07:00
Haojian Wu	1c0a0dfa02	[Concepts] Include the found concept decl when dumping the ConceptSpecializationExpr Differential Revision: https://reviews.llvm.org/D85124	2020-08-04 15:58:12 +02:00
Endre Fülöp	141cb8a1ee	[analyzer] Model iterator random incrementation symmetrically Summary: In case a pointer iterator is incremented in a binary plus expression (operator+), where the iterator is on the RHS, IteratorModeling should now detect, and track the resulting value. Reviewers: Szelethus, baloghadamsoftware Reviewed By: baloghadamsoftware Subscribers: rnkovacs, whisperity, xazax.hun, baloghadamsoftware, szepet, a.sidorin, mikhail.ramalho, Szelethus, donat.nagy, dkrupp, Charusso, steakhal, martong, ASDenysPetrov, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83190	2020-08-04 11:04:12 +02:00
Nathan Ridge	4ede396849	[clang] Include trailing-requires-clause in FunctionDecl's source range Fixes https://github.com/clangd/clangd/issues/476 Differential Revision: https://reviews.llvm.org/D85108	2020-08-04 02:52:01 -04:00
Kazushi (Jam) Marukawa	045e79e77c	[VE] Extend integer arguments and return values smaller than 64 bits In order to follow NEC Aurora SX VE ABI correctly, change to sign/zero extend integer arguments and return values smaller than 64 bits in clang. Also update regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D85071	2020-08-04 08:07:05 +09:00
Thomas Lively	cb32792210	[WebAssembly] Implement prototype v128.load{32,64}_zero instructions Specified in https://github.com/WebAssembly/simd/pull/237, these instructions load the first vector lane from memory and zero the other lanes. Since these instructions are not officially part of the SIMD proposal, they are only available on an opt-in basis via LLVM intrinsics and clang builtin functions. If these instructions are merged to the proposal, this implementation will change so that the instructions will be generated from normal IR. At that point the intrinsics and builtin functions would be removed. This PR also changes the opcodes for the experimental f32x4.qfm{a,s} instructions because their opcodes conflicted with those of the v128.load{32,64}_zero instructions. The new opcodes were chosen to match those used in V8. Differential Revision: https://reviews.llvm.org/D84820	2020-08-03 13:54:00 -07:00
Akira Hatanaka	41b1e97b12	[CodeGen][ObjC] Mark calls to objc_unsafeClaimAutoreleasedReturnValue as notail on x86-64 This is needed because the epilogue code inserted before tail calls on x86-64 breaks the handshake between the caller and callee. Calls to objc_retainAutoreleasedReturnValue used to have the same problem, which was fixed in https://reviews.llvm.org/D59656. rdar://problem/66029552 Differential Revision: https://reviews.llvm.org/D84540	2020-08-03 13:25:25 -07:00
Daniel Sanders	7209f83112	Allow .dSYM's to be directly placed in an alternate directory Once available in the relevant toolchains this will allow us to implement LLVM_EXTERNALIZE_DEBUGINFO_OUTPUT_DIR after D84127 by directly placing the dSYM in the desired location instead of emitting next to the output file and moving it. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D84572	2020-08-03 13:18:52 -07:00
Simon Tatham	ed0e4c70c9	[clang][ARM] Add name-mangling test for direct __fp16 arguments. `clang/test/CodeGenCXX/fp16-mangle.cpp` tests pointers to __fp16, but if you give the `-fallow-half-arguments-and-returns` option, then clang can also leave an __fp16 unmodified as a function argument or return type. This regression test checks the name-mangling of that. Reviewed By: miyuki Differential Revision: https://reviews.llvm.org/D85010	2020-08-03 13:30:50 +01:00
Saiyedul Islam	160ff83765	[OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 Provides AMDGCN and NVPTX specific specialization of getGPUWarpSize, getGPUThreadID, and getGPUNumThreads methods. Adds tests for AMDGCN codegen for these methods in generic and simd modes. Also changes the precondition in InitTempAlloca to be slightly more permissive. Useful for AMDGCN OpenMP codegen where allocas are created with a cast to an address space. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D84260	2020-08-03 05:38:39 +00:00
Florian Hahn	00a0282ff8	[Clang] Remove run-lines which use opt to run -ipconstprop. ipconstprop is going to get removed and checking opt with specific passes makes the tests more fragile. The tests retain the important checks that !callback metadata is created correctly.	2020-08-02 21:47:32 +01:00
Nikita Popov	25af353b0e	[NewPM][LVI] Abandon LVI after CVP As mentioned on D70376, LVI can currently cause performance issues when running under NewPM. The problem is that, unlike the legacy pass manager, NewPM will not immediately discard the LVI analysis if the following pass does not need it. This is a problem, because LVI has a high memory requirement, and mass invalidation of LVI values is very inefficient. LVI should only be alive during passes that actively interact with it. This patch addresses the issue by explicitly abandoning LVI after CVP, which gets us back to the LegacyPM behavior. Differential Revision: https://reviews.llvm.org/D84959	2020-08-01 23:47:46 +02:00
Richard Smith	234f51a65a	Don't crash if we deserialize a pack expansion type whose pattern contains no packs. Fixes a regression from `740a164dec`.	2020-07-31 17:19:44 -07:00
Hsiangkai Wang	721d93fc5a	Support experimental v extension v0.9. Differential revision: https://reviews.llvm.org/D81213	2020-08-01 07:42:06 +08:00
Jaydeep Chauhan	38d3e75332	[clang] Use the location of the void parameters when complaining that only a single void parameter should be present. Fixes PR46417. Differential Revision: https://reviews.llvm.org/D84678 Reviewed By: aaron.ballman	2020-07-31 20:36:58 +01:00
Amy Kwan	c4e5743232	[PowerPC] Implement low-order Vector Modulus Builtins, and add Vector Multiply/Divide/Modulus Builtins Tests Power10 introduces new instructions for vector multiply, divide and modulus. These instructions can be exploited by the builtin functions: vec_mul, vec_div, and vec_mod, respectively. This patch aims adds the function prototype, vec_mod, as vec_mul and vec_div been previously implemented in altivec.h. This patch also adds the following front end tests: vec_mul for v2i64 vec_div for v4i32 and v2i64 vec_mod for v4i32 and v2i64 Differential Revision: https://reviews.llvm.org/D82576	2020-07-31 10:58:07 -05:00
Erich Keane	e704aa4f25	DR2303: Prefer 'nearer' base classes during template deduction. DR2303 fixes the case where the derived-base match for template deduction is ambiguous if a base-of-base ALSO matches. The canonical example (as shown in the test) is just like the MSVC implementation of std::tuple. This fixes a fairly sizable issue, where if a user inherits from std::tuple on Windows (with the MS STL), they cannot use that type to call a function that takes std::tuple. Differential Revision: https://reviews.llvm.org/D84048	2020-07-31 05:39:55 -07:00
Fangrui Song	c06e22fe07	[Driver] Exclude options::LinkerInput for GCC linking options::LinkerInput options may get duplicated after 6a75496836ea14bcfd2f4b59d35a1cad4ac58cee..	2020-07-31 00:04:09 -07:00
Fangrui Song	fcf8ada18f	[Driver] Render -e for Gnu.cpp	2020-07-30 23:49:32 -07:00
Arthur Eubanks	c03d3aca7d	[test] Fix thinlto-distributed-newpm.ll Broken by https://reviews.llvm.org/D84981.	2020-07-30 20:09:34 -07:00
Eli Friedman	8dfb5d767e	[clang codegen][AArch64] Use llvm.aarch64.neon.fcvtzs/u where it's necessary fptosi/fptoui have similar, but not identical, semantics. In particular, the behavior on overflow is different. Fixes https://bugs.llvm.org/show_bug.cgi?id=46844 for 64-bit. (The corresponding patch for 32-bit is more involved because the equivalent intrinsics don't exist, as far as I can tell.) Differential Revision: https://reviews.llvm.org/D84703	2020-07-30 15:41:54 -07:00
Richard Smith	1e7f026c3b	PR46908: Emit undef destroying_delete_t as an aggregate RValue. We previously used a non-aggregate RValue to represent the passed value, which violated the assumptions of call arg lowering in some cases, in particular on 32-bit Windows, where we'd end up producing an FCA store with TBAA metadata, that the IR verifier would reject.	2020-07-30 14:50:01 -07:00

1 2 3 4 5 ...

41082 Commits