llvm-project

Commit Graph

Author	SHA1	Message	Date
@t-msn	0808d956c4	[OpenMP] libomp: Fix handling of barrier pattern environment variables It is better to set all barrier patterns to use "dist" when at least one environment variable specifies "dist". Otherwise if only one environment is set to "dist" and others left blank inadvertently, it would result in mixing dist barrier with default hyper barrier pattern. Differential Revision: https://reviews.llvm.org/D112597	2021-11-08 15:01:26 +03:00
Andrzej Warzynski	ddd11b9a4b	[flang][CodeGen] Transform `fir.call` to `llvm.call` This patch extends the `FIRToLLVMLowering` pass in Flang by adding a hook to transform `fir.call` to `llvm.call`. This is part of the upstreaming effort from the `fir-dev` branch in [1]. [1] https://github.com/flang-compiler/f18-llvm-project Patch originally written by: Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: V Donaldson <vdonaldson@nvidia.com> Differential Revision: https://reviews.llvm.org/D113278	2021-11-08 11:43:54 +00:00
Simon Moll	c2b91eef27	[VE] default to integrated asm in AsmInfo VE integrated asm has been the default in Clang. Also use the default setting for integrated asm in the backend. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D113384	2021-11-08 11:58:29 +01:00
Simon Pilgrim	1f60302a37	[AArch64] Precommit i256 test from D111530	2021-11-08 10:47:57 +00:00
Tobias Gysi	9fbcad3298	[mlir][linalg] Improve the padding packing loop computation. The revision updates the packing loop search in hoist padding. Instead of considering all loops in the backward slice, we now compute a separate backward slice containing the index computations only. This modification ensures we do not add packing loops that are not used to index the packed buffer due to spurious dependencies. One instance where such spurious dependencies can appear is the extract slice operation introduced between the tile loops of a double tiling. Depends On D112412 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D112713	2021-11-08 10:20:33 +00:00
David Green	a982940eb5	[AArch64] Combine fptoi.sat(fmul) to fixed point cvtf We already have patterns for fptosi and fptoui plus fmul to fixed point convert, this adds equivalent patterns for fptosi.sat and fptoui.sat, which should apply equally well for the legal saturating variants. Differential Revision: https://reviews.llvm.org/D113199	2021-11-08 10:07:34 +00:00
Jean Perier	4375430689	[flang] Set the addendum when establishing pointer section in descriptor If the source has an addendum, the descriptor that is being established to describe a section over the source needs to copy the addendum so that derived type information is correctly set in the descriptor being established. This allows namelist IO with derived type to work correctly. Differential Revision: https://reviews.llvm.org/D113258	2021-11-08 11:05:31 +01:00
David Sherwood	c42bb30b9e	[LoopVectorize] Permit fixed-width epilogue loops for scalable vector bodies At the moment in LoopVectorizationCostModel::selectEpilogueVectorizationFactor we bail out if the main vector loop uses a scalable VF. This patch adds support for generating epilogue vector loops using a fixed-width VF when the main vector loop uses a scalable VF. I've changed LoopVectorizationCostModel::selectEpilogueVectorizationFactor so that we convert the scalable VF into a fixed-width VF and do profitability checks on that instead. In addition, since the scalable and fixed-width VFs live in different VPlans that means I had to change the calls to LVP.hasPlanWithVFs so that we only pass in the fixed-width VF. New tests added here: Transforms/LoopVectorize/AArch64/sve-epilog-vect.ll Differential Revision: https://reviews.llvm.org/D109432	2021-11-08 09:41:13 +00:00
Qiu Chaofan	9b5e2b5261	[PowerPC] Implement basic macro fusion in Power10 Including basic fusion types around arithmetic and logical instructions. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D111693	2021-11-08 17:23:56 +08:00
Andrew Wei	bf3784b882	[AArch64] Canonicalize X(Y+1) or X(1-Y) to madd/msub Performing the rearrangement for add/sub and mul instructions to match the madd/msub pattern Reviewed By: dmgreen, sdesmalen, david-arm Differential Revision: https://reviews.llvm.org/D111862	2021-11-08 16:49:31 +08:00
Konstantin Varlamov	12b55821a5	[libc++][NFC] Inline most of `__vector_base` into `vector`. `__vector_base` exists for historical reasons and cannot be eliminated entirely without breaking the ABI. Member variables are left untouched -- this patch only does changes that clearly cannot affect the ABI. Differential Revision: https://reviews.llvm.org/D112976	2021-11-08 00:45:48 -08:00
Konstantin Varlamov	d7ab283996	Revert "[libc++] Always define a key function for std::bad_function_call in the dylib" This reverts commit `bc74231756`. It was committed accidentally.	2021-11-08 00:44:47 -08:00
Valentin Clement	29abf2a4a4	[fir] Add test for FIR types conversion Add a separate file to test FIR types conversion to LLVM types. Conversion comes from `flang/lib/Optimizer/CodeGen/TypeConverter.h` This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan, awarzynski Differential Revision: https://reviews.llvm.org/D113283	2021-11-08 09:41:39 +01:00
Louis Dionne	bc74231756	[libc++] Always define a key function for std::bad_function_call in the dylib However, whether applications rely on the std::bad_function_call vtable being in the dylib is still controlled by the ABI macro, since changing that would be an ABI break. Differential Revision: https://reviews.llvm.org/D92397	2021-11-08 00:31:00 -08:00
skc7	a0633f5ccb	[AMDGPU] Test Commit. NFC Reviewed By: hsmhsm Differential Revision: https://reviews.llvm.org/D113379	2021-11-08 07:09:09 +00:00
Esme-Yi	9b6f264d2b	[XCOFF][llvm-readobj] improve the relocation output. Summary: 1. implemented the unexpanded relocations output. 2. modified the expanded output format to align. Reviewed By: shchenz, jhenderson Differential Revision: https://reviews.llvm.org/D111700	2021-11-08 03:15:52 +00:00
Ben Shi	e32cf690df	[RISCV] Optimize (add (mul r, c0), c1) Optimize (add (mul x, c0), c1) -> (add (mul (add x, c1/c0+1), c0), c1%c0-c0), if c1/c0+1 and c1%c0-c0 are simm12, while c1 is not. Optimize (add (mul x, c0), c1) -> (add (mul (add x, c1/c0-1), c0), c1%c0+c0), if c1/c0-1 and c1%c0+c0 are simm12, while c1 is not. Reviewed By: craig.topper, asb Differential Revision: https://reviews.llvm.org/D111141	2021-11-08 02:58:25 +00:00
Chen Zheng	7c6f5950f0	[PowerPC] comment for different input register classes; nfc Add comments to explain why XXPERMDIs and XXPERMDI have different input register classes, vsfrc for XXPERMDIs and vsrc for XXPERMDI. This addresses the comments in abandoned patch D113178, we keep using `f0` instead of using `vs0` for XXPERMDIs on purpose.	2021-11-08 02:21:30 +00:00
Zi Xuan Wu	4fb282fec5	[CSKY] Add CSKY 16-bit instruction format and encoding CSKY is a ARCH which supports mixture of 16-bit and 32-bit instructions natively, and there is not an indivual predictor or feature to enable/disable 16-bit instruction. So I think it's better to add 16-bit instruction early, and naturally to use 16-bit and 32-bit instructions. Differential Revision: https://reviews.llvm.org/D112919	2021-11-08 10:02:15 +08:00
Chen Zheng	50acbbe3cd	[AsmPrinter][ORE] use correct opcode name Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D113173	2021-11-08 01:51:24 +00:00
Chen Zheng	c7d27f90e7	[ORE][AsmPrinter] add testcase for D113173; NFC	2021-11-08 01:47:22 +00:00
Kazu Hirata	0d182d9d1e	[Transforms] Use make_early_inc_range (NFC)	2021-11-07 17:03:15 -08:00
Simon Pilgrim	55e4cd8485	[X86][AVX2] Recognise 256-bit truncation shuffles and mask 256-bit source For v8i16 shuffle patterns that are lowered with AND+PACKUS, check to see if the sources are from a 256-bit vector and perform the masking using BLENDW at the 256-bit level. With the test changes we can see more examples of duplicate XMM/YMM zero vectors (PR26018) :(	2021-11-07 21:24:55 +00:00
Valentin Clement	54c563474a	[fir] Add fir.extract_value and fir.insert_value conversion This patch add the conversion pattern for fir.extract_value and fir.insert_value. fir.extract_value is lowered to llvm.extractvalue anf fir.insert_value is lowered to llvm.insertvalue. This patch also adds the type conversion for the BoxType and RecordType needed to have some comprehensive tests. This patch is part of the upstreaming effort from fir-dev branch. This patch was landed and reverted once. TypeBuilderFunc getModel<Fortran::ISO::CFI_index_t>() was clashing with getModel<long long> on windows since they both are 64 bits signed interger. On linux CFI_index_t is long. Change CFI_index_t to getModel<long>. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D112961 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-07 21:59:01 +01:00
Nikita Popov	2060895c9c	[ConstantRange] Add exact union/intersect (NFC) For some optimizations on comparisons it's necessary that the union/intersect is exact and not a superset. Add methods that return Optional<ConstantRange> only if the result is exact. For the sake of simplicity this is implemented by comparing the subset and superset approximations for now, but it should be possible to do this more directly, as unionWith() and intersectWith() already distinguish the cases where the result is imprecise for the preferred range type functionality.	2021-11-07 21:46:06 +01:00
Nikita Popov	cf71a5ea8f	[ConstantRange] Support zero size in isSizeLargerThan() From an API perspective, it does not make a lot of sense that 0 is not a valid argument to this function. Add the exact check needed to support it.	2021-11-07 21:22:45 +01:00
Jonas Devlieghere	d09a21a0b3	[lldb] Remove failures case from TestTaggedPointerCmd Somehow every pointer looks like it's tagged on GreenDragon. Removing the check to unblock the bot until we can get to the bottom of this.	2021-11-07 10:40:43 -08:00
David Green	17acd6d940	[AArch64] Rewrite and update fcvt-fixed.ll. NFC This rewrites the fcvt-fixed.ll test case to be separate functions, not one large function with volatile global stores. It also adds fp16 and fptoi.sat testing at the same time.	2021-11-07 18:11:49 +00:00
Nikita Popov	a8c318b50e	[BasicAA] Use index size instead of pointer size When accumulating the GEP offset in BasicAA, we should use the pointer index size rather than the pointer size. Differential Revision: https://reviews.llvm.org/D112370	2021-11-07 18:56:11 +01:00
Kazu Hirata	aee86f9b6c	[AMDGPU] Remove unused declaration selectSMRD (NFC) The function body proper was removed on Feb 20, 2019 in commit `79b5c3842b`.	2021-11-07 09:53:18 -08:00
Kazu Hirata	41ef3187e0	[ARM, X86] Use MachineBasicBlock::{predecessors,successors} (NFC)	2021-11-07 09:53:16 -08:00
Kazu Hirata	eb1c7c1339	[AST, Analysis] Use llvm::reverse (NFC)	2021-11-07 09:53:14 -08:00
Manoj Gupta	db27867dfc	[compiler-rt] Produce the right arch suffix for arm baremetal D98452 introduced a mismatch between clang expectations for builtin name for baremetal targets on arm. Fix it by adding a case for baremetal. This now matches the output of "clang -target armv7m-none-eabi -print-libgcc-file-name \ -rtlib=compiler-rt" Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D113357	2021-11-07 08:51:35 -08:00
Benjamin Kramer	2e20ff8c1a	[AVR] Remove a global initializer. NFCI.	2021-11-07 16:30:18 +01:00
Mark de Wever	69603ae90f	[libc++][doc] Don't mention Prague twice.	2021-11-07 16:21:05 +01:00
Nikolas Klauser	9a140a1586	[libc++] Make test_allocator constexpr-friendly for constexpr string/vector Make test_allocator etc. constexpr-friendly so they can be used to test constexpr string and possibly constexpr vector Reviewed By: Quuxplusone, #libc, ldionne Differential Revision: https://reviews.llvm.org/D110994	2021-11-07 16:15:28 +01:00
Simon Pilgrim	f057756a1a	[SLP] Fix Wdocumentation warning - remove \returns from void function. NFC.	2021-11-07 15:08:39 +00:00
Simon Pilgrim	d391e4fe84	[X86] Update RET/LRET instruction to use the same naming convention as IRET (PR36876). NFC Be more consistent in the naming convention for the various RET instructions to specify in terms of bitwidth. Helps prevent future scheduler model mismatches like those that were only addressed in D44687. Differential Revision: https://reviews.llvm.org/D113302	2021-11-07 15:06:54 +00:00
Benjamin Kramer	9b8b16457c	Put implementation details into anonymous namespaces. NFCI.	2021-11-07 15:18:30 +01:00
Benjamin Kramer	8adb6d6de2	[clang] Use llvm::reverse. NFCI.	2021-11-07 14:24:33 +01:00
Simon Pilgrim	b5ef56f0bc	[X86][AVX] Add missing X86ISD::VBROADCAST(v4f32 -> v8f32) isel pattern for AVX1 targets D109434 addressed the v2f64 -> v4f64 case, an internal test has found an equivalent crash for the v4f32 -> v8f32 case.	2021-11-07 12:59:35 +00:00
Simon Pilgrim	f7880a78ce	[X86] Add AVX512 test coverage to vselect-zero.ll Noticed on D113212	2021-11-07 12:44:01 +00:00
Simon Pilgrim	0ff1edeeec	[DAG] SimplifyVBinOp - replace FoldConstantVectorArithmetic with FoldConstantArithmetic Currently FoldConstantArithmetic only handles binops, so replacing other uses of FoldConstantVectorArithmetic (in particular for SETCC nodes), still require more work.	2021-11-07 12:11:46 +00:00
Mats Larsen	ad523cc398	[NFC][Docs] Add missing Doxygen group comments for LLVM-C The LLVM-C API is relatively small so we've previously added doxygen tags so it's easier to navigate the LLVM-C web docs. Over the years, more headers were added without proper doxygen tags, effectively hiding them from the main LLVM-C doxygen page. This patch adds comments to headers which did not have them. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D112474	2021-11-07 12:23:17 +01:00
Fangrui Song	70986ea3d6	[sanitizer][aarch64] Add cast to drop reliance on the type of uc_mcontext.__reserved https://sourceware.org/bugzilla/show_bug.cgi?id=22742 uc_mcontext.__reserved probably should not be considered user visible API but unfortunate it is: it is the only way to access cpu states of some Linux asm/sigcontext.h extensions. That said, the declaration may be long double __reserved[256]; (used by musl) instead of unsigned char __reserved[4096] __attribute__((__aligned__(16))); (glibc) to avoid dependency on a GNU variable attribute.	2021-11-06 23:26:05 -07:00
Fangrui Song	815b9f53d8	[hwasan] Replace _Unwind_Word with uintptr_t GCC introduced `__attribute__((mode(unwind_word)))` to work around Cell Broadband Engine SPU (which was removed from GCC in 2019-09), which is irrelevant to hwasan. _Unwind_GetGR/_Unwind_GetCFA from llvm-project/libunwind don't use unwind_word. Using _Unwind_Word can lead to build failures if libunwind's unwind.h is preferred over unwind.h in the Clang resource directory (e.g. built with GCC).	2021-11-06 22:34:50 -07:00
Kazu Hirata	22e21da47d	[WebAssembly] Remove unused declaration SelectExternRefAddr (NFC)	2021-11-06 19:31:22 -07:00
Kazu Hirata	e4bab21848	[AMDGPU] Use MachineBasicBlock::{predecessors,successors} (NFC)	2021-11-06 19:31:20 -07:00
Kazu Hirata	843d1eda18	[llvm] Use llvm::reverse (NFC)	2021-11-06 19:31:18 -07:00
Yonghong Song	bbab17c6c9	[Clang][Attr] fix a btf_type_attr CGDebugInfo codegen bug Nathan Chancellor reported a crash due to commit `3466e00716` (Reland "[Attr] support btf_type_tag attribute"). The following test can reproduce the crash: $ cat efi.i typedef unsigned long efi_query_variable_info_t(int); typedef struct { struct { efi_query_variable_info_t __attribute__((regparm(0))) * query_variable_info; }; } efi_runtime_services_t; efi_runtime_services_t efi_0; $ clang -m32 -O2 -g -c -o /dev/null efi.i The reason is that FunctionTypeLoc.getParam(Idx) may return a nullptr which should be checked before dereferencing the result pointer. This patch fixed this issue.	2021-11-06 18:19:00 -07:00

... 2 3 4 5 6 ...

404106 Commits All Branches Search

404106 Commits

All Branches