llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	0d182d9d1e	[Transforms] Use make_early_inc_range (NFC)	2021-11-07 17:03:15 -08:00
Simon Pilgrim	55e4cd8485	[X86][AVX2] Recognise 256-bit truncation shuffles and mask 256-bit source For v8i16 shuffle patterns that are lowered with AND+PACKUS, check to see if the sources are from a 256-bit vector and perform the masking using BLENDW at the 256-bit level. With the test changes we can see more examples of duplicate XMM/YMM zero vectors (PR26018) :(	2021-11-07 21:24:55 +00:00
Valentin Clement	54c563474a	[fir] Add fir.extract_value and fir.insert_value conversion This patch adds the conversion pattern for fir.extract_value and fir.insert_value. fir.extract_value is lowered to llvm.extractvalue and fir.insert_value is lowered to llvm.insertvalue. This patch also adds the type conversion for the BoxType and RecordType needed to have some comprehensive tests. This patch is part of the upstreaming effort from fir-dev branch. This patch was landed and reverted once. TypeBuilderFunc getModel<Fortran::ISO::CFI_index_t>() was clashing with getModel<long long> on Windows since both are 64-bit signed integers. On Linux CFI_index_t is long. Change CFI_index_t to getModel<long>. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D112961 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-07 21:59:01 +01:00
Nikita Popov	2060895c9c	[ConstantRange] Add exact union/intersect (NFC) For some optimizations on comparisons it's necessary that the union/intersect is exact and not a superset. Add methods that return Optional<ConstantRange> only if the result is exact. For the sake of simplicity this is implemented by comparing the subset and superset approximations for now, but it should be possible to do this more directly, as unionWith() and intersectWith() already distinguish the cases where the result is imprecise for the preferred range type functionality.	2021-11-07 21:46:06 +01:00
Nikita Popov	cf71a5ea8f	[ConstantRange] Support zero size in isSizeLargerThan() From an API perspective, it does not make a lot of sense that 0 is not a valid argument to this function. Add the exact check needed to support it.	2021-11-07 21:22:45 +01:00
Jonas Devlieghere	d09a21a0b3	[lldb] Remove failures case from TestTaggedPointerCmd Somehow every pointer looks like it's tagged on GreenDragon. Removing the check to unblock the bot until we can get to the bottom of this.	2021-11-07 10:40:43 -08:00
David Green	17acd6d940	[AArch64] Rewrite and update fcvt-fixed.ll. NFC This rewrites the fcvt-fixed.ll test case to be separate functions, not one large function with volatile global stores. It also adds fp16 and fptoi.sat testing at the same time.	2021-11-07 18:11:49 +00:00
Nikita Popov	a8c318b50e	[BasicAA] Use index size instead of pointer size When accumulating the GEP offset in BasicAA, we should use the pointer index size rather than the pointer size. Differential Revision: https://reviews.llvm.org/D112370	2021-11-07 18:56:11 +01:00
Kazu Hirata	aee86f9b6c	[AMDGPU] Remove unused declaration selectSMRD (NFC) The function body proper was removed on Feb 20, 2019 in commit `79b5c3842b`.	2021-11-07 09:53:18 -08:00
Kazu Hirata	41ef3187e0	[ARM, X86] Use MachineBasicBlock::{predecessors,successors} (NFC)	2021-11-07 09:53:16 -08:00
Kazu Hirata	eb1c7c1339	[AST, Analysis] Use llvm::reverse (NFC)	2021-11-07 09:53:14 -08:00
Manoj Gupta	db27867dfc	[compiler-rt] Produce the right arch suffix for arm baremetal D98452 introduced a mismatch between clang expectations for builtin name for baremetal targets on arm. Fix it by adding a case for baremetal. This now matches the output of "clang -target armv7m-none-eabi -print-libgcc-file-name \ -rtlib=compiler-rt" Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D113357	2021-11-07 08:51:35 -08:00
Benjamin Kramer	2e20ff8c1a	[AVR] Remove a global initializer. NFCI.	2021-11-07 16:30:18 +01:00
Mark de Wever	69603ae90f	[libc++][doc] Don't mention Prague twice.	2021-11-07 16:21:05 +01:00
Nikolas Klauser	9a140a1586	[libc++] Make test_allocator constexpr-friendly for constexpr string/vector Make test_allocator etc. constexpr-friendly so they can be used to test constexpr string and possibly constexpr vector Reviewed By: Quuxplusone, #libc, ldionne Differential Revision: https://reviews.llvm.org/D110994	2021-11-07 16:15:28 +01:00
Simon Pilgrim	f057756a1a	[SLP] Fix Wdocumentation warning - remove \returns from void function. NFC.	2021-11-07 15:08:39 +00:00
Simon Pilgrim	d391e4fe84	[X86] Update RET/LRET instruction to use the same naming convention as IRET (PR36876). NFC Be more consistent in the naming convention for the various RET instructions to specify in terms of bitwidth. Helps prevent future scheduler model mismatches like those that were only addressed in D44687. Differential Revision: https://reviews.llvm.org/D113302	2021-11-07 15:06:54 +00:00
Benjamin Kramer	9b8b16457c	Put implementation details into anonymous namespaces. NFCI.	2021-11-07 15:18:30 +01:00
Benjamin Kramer	8adb6d6de2	[clang] Use llvm::reverse. NFCI.	2021-11-07 14:24:33 +01:00
Simon Pilgrim	b5ef56f0bc	[X86][AVX] Add missing X86ISD::VBROADCAST(v4f32 -> v8f32) isel pattern for AVX1 targets D109434 addressed the v2f64 -> v4f64 case, an internal test has found an equivalent crash for the v4f32 -> v8f32 case.	2021-11-07 12:59:35 +00:00
Simon Pilgrim	f7880a78ce	[X86] Add AVX512 test coverage to vselect-zero.ll Noticed on D113212	2021-11-07 12:44:01 +00:00
Simon Pilgrim	0ff1edeeec	[DAG] SimplifyVBinOp - replace FoldConstantVectorArithmetic with FoldConstantArithmetic Currently FoldConstantArithmetic only handles binops, so replacing other uses of FoldConstantVectorArithmetic (in particular for SETCC nodes) still requires more work.	2021-11-07 12:11:46 +00:00
Mats Larsen	ad523cc398	[NFC][Docs] Add missing Doxygen group comments for LLVM-C The LLVM-C API is relatively small so we've previously added doxygen tags so it's easier to navigate the LLVM-C web docs. Over the years, more headers were added without proper doxygen tags, effectively hiding them from the main LLVM-C doxygen page. This patch adds comments to headers which did not have them. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D112474	2021-11-07 12:23:17 +01:00
Fangrui Song	70986ea3d6	[sanitizer][aarch64] Add cast to drop reliance on the type of uc_mcontext.__reserved https://sourceware.org/bugzilla/show_bug.cgi?id=22742 uc_mcontext.__reserved probably should not be considered user visible API but unfortunately it is: it is the only way to access cpu states of some Linux asm/sigcontext.h extensions. That said, the declaration may be long double __reserved[256]; (used by musl) instead of unsigned char __reserved[4096] __attribute__((__aligned__(16))); (glibc) to avoid dependency on a GNU variable attribute.	2021-11-06 23:26:05 -07:00
Fangrui Song	815b9f53d8	[hwasan] Replace _Unwind_Word with uintptr_t GCC introduced `__attribute__((mode(unwind_word)))` to work around Cell Broadband Engine SPU (which was removed from GCC in 2019-09), which is irrelevant to hwasan. _Unwind_GetGR/_Unwind_GetCFA from llvm-project/libunwind don't use unwind_word. Using _Unwind_Word can lead to build failures if libunwind's unwind.h is preferred over unwind.h in the Clang resource directory (e.g. built with GCC).	2021-11-06 22:34:50 -07:00
Kazu Hirata	22e21da47d	[WebAssembly] Remove unused declaration SelectExternRefAddr (NFC)	2021-11-06 19:31:22 -07:00
Kazu Hirata	e4bab21848	[AMDGPU] Use MachineBasicBlock::{predecessors,successors} (NFC)	2021-11-06 19:31:20 -07:00
Kazu Hirata	843d1eda18	[llvm] Use llvm::reverse (NFC)	2021-11-06 19:31:18 -07:00
Yonghong Song	bbab17c6c9	[Clang][Attr] fix a btf_type_attr CGDebugInfo codegen bug Nathan Chancellor reported a crash due to commit `3466e00716` (Reland "[Attr] support btf_type_tag attribute"). The following test can reproduce the crash: $ cat efi.i typedef unsigned long efi_query_variable_info_t(int); typedef struct { struct { efi_query_variable_info_t __attribute__((regparm(0))) * query_variable_info; }; } efi_runtime_services_t; efi_runtime_services_t efi_0; $ clang -m32 -O2 -g -c -o /dev/null efi.i The reason is that FunctionTypeLoc.getParam(Idx) may return a nullptr which should be checked before dereferencing the result pointer. This patch fixed this issue.	2021-11-06 18:19:00 -07:00
Fangrui Song	d9e2c8f54d	[yaml2obj][COFF] Make some PEHeader fields optional This makes it easy to write tests where the irrelevant fields are not needed.	2021-11-06 16:39:59 -07:00
Luke Benes	2249ecee8d	[IR][ShuffleVector] Fix Wdangling-else warning in InstructionsTest Fix a dangling else that gcc-11 warned about. The EXPECT_EQ macro expands to an if-else, so the whole construction contains a hidden dangling else. Differential Revision: https://reviews.llvm.org/D113346	2021-11-07 00:07:01 +03:00
Nikita Popov	9f0194be45	[ConstantRange] Add getEquivalentICmp() variant with offset (NFCI) Add a variant of getEquivalentICmp() that produces an optional offset. This allows us to create an equivalent icmp for all ranges. Use this in the with.overflow folding code, which was doing this adjustment separately -- this clarifies that the fold will indeed always apply.	2021-11-06 21:59:45 +01:00
Kazu Hirata	cefc01fa65	[X86] Simplify a call to MachineBasicBlock::erase (NFC)	2021-11-06 13:08:25 -07:00
Kazu Hirata	815e8b5a20	[Hexagon] Remove an extraneous variable (NFC)	2021-11-06 13:08:23 -07:00
Kazu Hirata	14d656b3d8	[Target] Use llvm::reverse (NFC)	2021-11-06 13:08:21 -07:00
Nikita Popov	e3cec17b2d	[InstSimplify] Remove incorrect icmp of gep fold (PR52429) As described in https://bugs.llvm.org/show_bug.cgi?id=52429 this fold is incorrect, because inbounds only guarantees that the pointers don't wrap in the unsigned space: It is possible that the sign boundary is crossed by an object. I'm dropping the fold entirely rather than adjusting it, because computePointerICmp() fully subsumes it (just with correct predicate handling). Differential Revision: https://reviews.llvm.org/D113343	2021-11-06 21:03:21 +01:00
Fangrui Song	859a6d973f	[llvm-objdump] Remove untested diagnostic "missing data dir for TLS table"	2021-11-06 11:18:29 -07:00
Nikita Popov	f8627877a9	[SCEV] Make eraseValueFromMap() private (NFC) The public API for this functionality is forgetValue(). There was only one call from LoopVectorize, which was directly next to a forgetValue() call and as such redundant.	2021-11-06 17:14:02 +01:00
Roman Lebedev	23566f18c6	[NFC][X86][Costmodel] Add tests for i32/i64 replication shuffles While this isn't what we eventually need (i8 or i1), approaching from this end is more straight-forward.	2021-11-06 17:14:56 +03:00
Anton Afanasyev	1c2ad70fd5	[Test][SLPVectorizer] Precommit test for PR52275	2021-11-06 17:11:02 +03:00
Shraiysh Vaishay	19a7e4729d	[MLIR][OpenMP] Added omp.sections and omp.section Added omp.sections and omp.section operation according to the section 2.8.1 of OpenMP Standard 5.0. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110844	2021-11-06 19:27:35 +05:30
Roman Lebedev	a30ec4778a	[TTI][CostModel] `getUserCost()`: recognize replication shuffles and query their cost This finally creates proper test coverage for replication shuffles, that are used by LV for conditional loads, and will allow to add proper costmodel at least for AVX512. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113324	2021-11-06 16:45:15 +03:00
Roman Lebedev	f8efc5c0ac	[NFC][TTI] Add/extract `getReplicationShuffleCost()` method, deduplicate its implementations Hiding it in `getInterleavedMemoryOpCost()` is problematic for a number of reasons, including testability and reuse, let's do better. In a followup `getUserCost()` will be taught to use it to estimate the mask costs, which will allow for better cost model tests for it. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113313	2021-11-06 16:45:15 +03:00
Sanjay Patel	39c4c7d391	[DAGCombiner] remove vselect fold that was accidentally added This diff snuck into the unrelated: `025a2f73a3` It's a suggested follow-up for D113212, but I need to add test coverage first.	2021-11-06 09:34:30 -04:00
Sanjay Patel	83c2fb9f66	[InstCombine] match usub.sat from umax intrinsic umax(X, Op1) - Op1 --> usub.sat(X, Op1) https://alive2.llvm.org/ce/z/HpcGiJ This happens in 2 or more steps with an icmp-select idiom instead of an intrinsic. This is another step towards canonicalization of the min/max intrinsics. See: D98152	2021-11-06 08:32:52 -04:00
Sanjay Patel	025a2f73a3	[InstCombine] add tests for umax with sub; NFC	2021-11-06 08:32:52 -04:00
hyeongyu kim	63fff0f5bf	Fix lit test failures in CodeGenCoroutines	2021-11-06 19:58:34 +09:00
hyeongyukim	aacfbb953e	[Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and turn it off by default Turning on `enable_noundef_analysis` flag allows better codegen by removing freeze instructions. I modified clang by renaming `enable_noundef_analysis` flag to `disable-noundef-analysis` and turning it off by default. Test updates are made as a separate patch: D108453 Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D105169 [Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and turn it off by default (2) This patch updates test files after D105169. Autogenerated test codes are changed by `utils/update_cc_test_checks.py,` and non-autogenerated test codes are changed as follows: (1) I wrote a python script that (partially) updates the tests using regex: {F18594904} The script is not perfect, but I believe it gives hints about which patterns are updated to have `noundef` attached. (2) The remaining tests are updated manually. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D108453 Resolve lit failures in clang after 8ca4b3e's land Fix lit test failures in clang-ppc* and clang-x64-windows-msvc Fix missing failures in clang-ppc64be* and retry fixing clang-x64-windows-msvc Fix internal_clone(aarch64) inline assembly	2021-11-06 19:19:22 +09:00
Jason Rice	b5aef90d46	[Clang] Fix instantiation of OpaqueValueExprs (Bug #45964) The structured bindings decomposition of a non-dependent array in a dependent context (a template) was, upon instantiation, creating nested OpaqueValueExprs that would trigger assertions in CodeGen. Additionally the OpaqueValueExpr's contained SourceExpr is being emitted in CodeGen, but there was no code for its transform in template instantiation. This would trigger other assertions such as when emitting a DeclRefExpr that refers to a VarDecl that is not marked as ODR-used. This is all based on cursory deduction, but with the way the code flows from SemaTemplateInstantiate back to SemaInit, it is apparent that the nesting of OpaqueValueExpr is unintentional. This commit fixes https://bugs.llvm.org/show_bug.cgi?id=45964 and possible other issues involving OpaqueValueExprs in template instantiations might be resolved. Reviewed By: aaron.ballman, rjmccall Differential Revision: https://reviews.llvm.org/D108482	2021-11-06 10:06:38 +02:00
Vitaly Buka	39ead64e3f	[sanitizer] Intercept lstat on Linux It's available from GLIBC 2.33. Fixes use-of-uninitialized-value llvm/lib/Support/Unix/Path.inc:467:29 in llvm::sys::fs::remove(llvm::Twine const&, bool)	2021-11-06 00:52:54 -07:00
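
The fold described in commit 83c2fb9f66 above, umax(X, Op1) - Op1 --> usub.sat(X, Op1), can be checked by brute force on a small integer width. The following is a minimal standalone sketch, not code from the LLVM tree: the helper names `usub_sat`, `umax_minus`, and `identity_holds_for_all_u8` are hypothetical, and 8-bit operands are an assumption chosen only so the exhaustive loop stays small.

```cpp
#include <algorithm>
#include <cstdint>

// Saturating unsigned subtraction: clamps at zero instead of wrapping.
// (Hypothetical helper modelling the semantics of usub.sat.)
constexpr std::uint8_t usub_sat(std::uint8_t x, std::uint8_t y) {
  return x > y ? static_cast<std::uint8_t>(x - y) : std::uint8_t{0};
}

// The source pattern from the commit message: umax(X, Op1) - Op1.
constexpr std::uint8_t umax_minus(std::uint8_t x, std::uint8_t y) {
  return static_cast<std::uint8_t>(std::max(x, y) - y);
}

// Exhaustively compare both forms over every pair of 8-bit inputs.
bool identity_holds_for_all_u8() {
  for (unsigned x = 0; x < 256; ++x)
    for (unsigned y = 0; y < 256; ++y)
      if (usub_sat(static_cast<std::uint8_t>(x), static_cast<std::uint8_t>(y)) !=
          umax_minus(static_cast<std::uint8_t>(x), static_cast<std::uint8_t>(y)))
        return false;
  return true;
}
```

When X is at most Op1, the maximum is Op1 itself and the difference is zero; otherwise the maximum is X and the difference is X - Op1, which is exactly the saturating result.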

404085 Commits