llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	98db27711d	[LV] Do not check widening decision for instrs outside of loop. No widening decisions will be computed for instructions outside the loop. Do not try to get a widening decision. The load/store will be just a scalar load, so treating it as normal should be fine I think. Fixes PR46950. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D85087	2020-08-03 10:09:24 +01:00
Nicolas Vasilache	35b65be041	[mlir][Vector] Add transformation + pattern to split vector.transfer_read into full and partial copies. This revision adds a transformation and a pattern that rewrites a "maybe masked" `vector.transfer_read %view[...], %pad` into a pattern resembling: ``` %1:3 = scf.if (%inBounds) { scf.yield %view : memref<A...>, index, index } else { %2 = vector.transfer_read %view[...], %pad : memref<A...>, vector<...> %3 = vector.type_cast %extra_alloc : memref<...> to memref<vector<...>> store %2, %3[] : memref<vector<...>> %4 = memref_cast %extra_alloc: memref<B...> to memref<A...> scf.yield %4 : memref<A...>, index, index } %res= vector.transfer_read %1#0[%1#1, %1#2] {masked = [false ... false]} ``` where `extra_alloc` is a top of the function alloca'ed buffer of one vector. This rewrite makes it possible to realize the "always full tile" abstraction where vector.transfer_read operations are guaranteed to read from a padded full buffer. The extra work only occurs on the boundary tiles. Differential Revision: https://reviews.llvm.org/D84631	2020-08-03 04:53:43 -04:00
Raphael Isemann	8aeb212887	[debugserver] Fix that is_dot_app is producing unused warnings Some build configurations don't use this static function.	2020-08-03 10:24:21 +02:00
Frederik Gossen	11492be9d7	[MLIR][Shape] Lower `shape.broadcast` to `scf` Differential Revision: https://reviews.llvm.org/D85027	2020-08-03 08:20:14 +00:00
Xing GUO	ef005f204b	[MachOYAML] Remove redundant variable initialization. NFC. The value of `is64Bit` is initialized in the constructor body.	2020-08-03 16:17:28 +08:00
Shinji Okumura	434cf2ded3	[Attributor] Check nonnull attribute violation in AAUndefinedBehavior This patch makes it possible to handle nonnull attribute violation at callsites in AAUndefinedBehavior. If a null pointer is passed to the callee at a callsite and the corresponding argument of the callee has the nonnull attribute, the behavior of the callee is undefined. In this patch, only violations of argument nonnull attributes are handled. But violations of returned nonnull attributes can be handled and I will implement that in a follow-up patch. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D84733	2020-08-03 17:12:50 +09:00
Igor Kudrin	414b9bec6d	[DebugInfo] Make DIEDelta::SizeOf() more explicit. NFCI. The patch restricts DIEDelta::SizeOf() to accept only DWARF forms that are actually used in the LLVM codebase. This should make the use of the class more explicit and help to avoid issues similar to those fixed in D83958 and D84094. Differential Revision: https://reviews.llvm.org/D84095	2020-08-03 15:04:15 +07:00
Igor Kudrin	f98e03a35d	[DebugInfo] Fix misleading using of DWARF forms with DIELabel. NFCI. DIELabel can emit only 32- or 64-bit values, while it was created in some places with DW_FORM_udata, which implies emitting uleb128. Nevertheless, these places also expected to emit U32 or U64, but just used a misleading DWARF form. The patch updates those places to use more appropriate DWARF forms and restricts DIELabel::SizeOf() to accept only forms that are actually used in the LLVM codebase. Differential Revision: https://reviews.llvm.org/D84094	2020-08-03 15:04:08 +07:00
Igor Kudrin	8feff8d14f	[DebugInfo] Fix a comment and a variable name. NFC. DebugLocListIndex keeps the index of an entry list, not the offset. Differential Revision: https://reviews.llvm.org/D84093	2020-08-03 15:04:00 +07:00
Igor Kudrin	4e10a18972	[DebugInfo] Make DIELocList::SizeOf() more explicit. NFCI. DIELocList is used with a limited number of DWARF forms, see the only place where it is instantiated, DwarfCompileUnit::addLocationList(). The patch marks the unexpected execution path in DIELocList::SizeOf() as unreachable, to reduce ambiguity. Differential Revision: https://reviews.llvm.org/D84092	2020-08-03 15:03:37 +07:00
Daniel Kiss	9c3f6fb688	[libunwind] Make the test depend on the libunwind explicitly. Before this patch the `ninja check-unwind` won't rebuild the unwind library. Reviewed By: jroelofs Differential Revision: https://reviews.llvm.org/D85004	2020-08-03 09:46:23 +02:00
Djordje Todorovic	4fdc4d892b	[NFC] [MIR] Document the reg state flags This patch adds documentation for the RegState enumeration. Differential Revision: https://reviews.llvm.org/D84634	2020-08-03 09:03:24 +02:00
George Mitenkov	91f6a5f785	[MLIR][SPIRV] Control attributes support for loop and selection This patch handles loopControl and selectionControl in parsing and printing. In order to reuse the functionality, and avoid handling cases when `{` of the region is parsed as a dictionary attribute, `control` keyword was introduced. `None` is a default control attribute. This functionality can be later extended to `spv.func`. Also, loopControl and selectionControl can now be (de)serialized. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84175	2020-08-03 09:31:37 +03:00
Fangrui Song	c41a18cf61	[CMake] Default ENABLE_X86_RELAX_RELOCATIONS to ON This makes clang default to -Wa,-mrelax-relocations=yes, which enables R_386_GOT32X (GNU as enables it regardless of -mrelax-relocations=) and R_X86_64_[REX_]GOTPCRELX in MC. The produced object files require GNU ld>=2.26 to link. binutils 2.26 is considered a very old release today.	2020-08-02 23:06:31 -07:00
LLVM GN Syncbot	5a4cd55e5d	[gn build] Port `160ff83765`	2020-08-03 05:55:14 +00:00
Saiyedul Islam	160ff83765	[OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 3 Provides AMDGCN and NVPTX specific specialization of getGPUWarpSize, getGPUThreadID, and getGPUNumThreads methods. Adds tests for AMDGCN codegen for these methods in generic and simd modes. Also changes the precondition in InitTempAlloca to be slightly more permissive. Useful for AMDGCN OpenMP codegen where allocas are created with a cast to an address space. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D84260	2020-08-03 05:38:39 +00:00
Fangrui Song	40da58a04b	[MC] Default MCAsmBackend::mayNeedRelaxation() to false	2020-08-02 22:13:59 -07:00
compinder	594dec2884	[FLANG] Fix issues in SELECT TYPE construct when intrinsic type specification is specified in TYPE GUARD statement. Fix of PR46789 and PR46830. Differential Revision: https://reviews.llvm.org/D84290	2020-08-03 09:24:42 +05:30
QingShan Zhang	62e4644616	[NFC][PowerPC] Add a multiclass for fsetcc to define them in a uniform way This is a refactor patch to prepare for adding the support for strict-fsetcc in PowerPC backend. We want to move their definition into a uniform way so that, we could add the strict node easier. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D81712	2020-08-03 03:28:03 +00:00
StephenFan	a96921afa7	[RISCV] eliminate the repetition declare of SDLoc DL Differential revision: https://reviews.llvm.org/D85002	2020-08-03 10:24:30 +08:00
Fangrui Song	b497665d98	Reland D64327 [MC][ELF] Allow STT_SECTION referencing SHF_MERGE on REL targets This drops a GNU gold workaround and reverts the revert commit rL366708. Before binutils 2.34, gold -O2 and above did not correctly handle R_386_GOTOFF to SHF_MERGE|SHF_STRINGS sections: https://sourceware.org/bugzilla/show_bug.cgi?id=16794 From the original review: ... it reduced the size of a big ARM-32 debug image by 33%. It contained ~68M of relocations symbols out of total ~71M symbols (96% of symbols table was generated for relocations with symbol). -Wl,-O2 (and -Wl,-O3) is so rare that we should just lower the optimization level for LLVM_LINKER_IS_GOLD rather than pessimizing all users.	2020-08-02 18:05:17 -07:00
Florian Hahn	4ffa6a27ac	[Bindings] Remove ipc_propagation. IPConstantPropagation has been removed, also remove the bindings.	2020-08-02 22:36:53 +01:00
Florian Hahn	599955eb56	Recommit "[IPConstProp] Remove and move tests to SCCP." This reverts commit `59d6e814ce`. The cause for the revert (3 clang tests running opt -ipconstprop) was fixed by removing those lines.	2020-08-02 22:23:54 +01:00
Vitaly Buka	08cf49658c	[StackSafety, NFC] Don't insert empty objects into the map Result should be the same but it makes generateParamAccessSummary 5x faster.	2020-08-02 13:58:56 -07:00
Florian Hahn	00a0282ff8	[Clang] Remove run-lines which use opt to run -ipconstprop. ipconstprop is going to get removed and checking opt with specific passes makes the tests more fragile. The tests retain the important checks that !callback metadata is created correctly.	2020-08-02 21:47:32 +01:00
Jan Kratochvil	e6c2c9a7d1	[lldb] [test] Fix DW_TAG_GNU_call_site-DW_AT_low_pc.s relocation I have made the DW_FORM_ref4 relative. One could also use relocated DW_FORM_ref_addr instead. Tested with: echo 'void f(){}'|clang -o 1.o -c -Wall -g -x c -;./bin/clang -o 1 1.o ../llvm-monorepo/lldb/test/Shell/SymbolFile/DWARF/DW_TAG_GNU_call_site-DW_AT_low_pc.s;./bin/lldb --no-lldbinit ./1 -o r -o 'p p' -o exit	2020-08-02 22:41:02 +02:00
Craig Topper	64516ec7c1	[X86] Use parity flag from byte test/cmp instruction for __builtin_parity when input fits in 8 bits. If the upper bits of the __builtin_parity idiom are known to be 0 we were previously emitting an xor with 0 to get the parity flag. But we can use cmp/test instead which may expose opportunities for load folding or combining an AND.	2020-08-02 10:45:04 -07:00
Craig Topper	a258338d62	[X86] Add test cases for missed opportunity to use a byte test instruction instead of an xor with 0 in parity patterns. If the input to the ctpop fits in 8 bits, we can use the parity flag from a TEST instruction, but we're currently XORing with 0.	2020-08-02 10:45:04 -07:00
Simon Pilgrim	e7a8ee00e6	[AMDGPU] Regenerate tests to fix whitespace indentations Noticed while updating D77804	2020-08-02 18:11:18 +01:00
Mehdi Amini	4091413c00	Remove debug flags from test (NFC)	2020-08-02 16:59:20 +00:00
Simon Pilgrim	e202236721	[IR] Add IRBuilderBase::CreateVectorSplat(ElementCount EC) variant As discussed on D81500, this adds a more general ElementCount variant of the build helper and converts the (non-scalable) unsigned NumElts variant to use it internally.	2020-08-02 16:55:38 +01:00
Sanjay Patel	4abc69c6f5	[InstSimplify] fold max (max X, Y), X --> max X, Y https://alive2.llvm.org/ce/z/VGgG3M	2020-08-02 11:50:58 -04:00
Sanjay Patel	e37987563a	[InstSimplify] add tests for max(max x,y), x) and variants; NFC	2020-08-02 11:50:47 -04:00
Matt Arsenault	212570abcf	GlobalISel: Implement bitcast action for G_EXTRACT_VECTOR_ELEMENT For AMDGPU, vectors with elements < 32 bits should be indexed in 32-bit elements and the desired bits extracted from there. For elements > 64-bits, these should be reduced to 64/32 elements to enable the normal dynamic indexing paths. In the dynamic index cases, this produces shorter code most of the time. This does immediately regress the constant index cases, but this should be fixed once we have the most basic of shift combines. The element size > 64 case is pretty much ported from the existing DAG implementation for extract element promote. The increasing element size case is new.	2020-08-02 10:42:07 -04:00
Simon Pilgrim	00d0f354f2	X86InstrInfo.cpp - fix include ordering. NFCI.	2020-08-02 15:34:18 +01:00
Simon Pilgrim	7dd4f03595	Merge null and isa<> tests into isa_and_nonnull<>. NFCI.	2020-08-02 15:34:18 +01:00
Simon Pilgrim	b8ffbf0e02	[DAG] TargetLowering::expandMUL_LOHI - pass SDLoc as const& Try to be more consistent with the SDLoc param in the TargetLowering methods. This also exposes an issue where we were passing a SDNode as a SDLoc, relying on the implicit SDLoc(SDNode) constructor.	2020-08-02 15:31:36 +01:00
Simon Pilgrim	d14a22da5e	[DAG] TargetLowering::LowerAsmOutputForConstraint - pass SDLoc as const& Try to be more consistent with the SDLoc param in the TargetLowering methods.	2020-08-02 15:12:02 +01:00
Simon Pilgrim	90dab1aece	Remove unused param tag to fix Wdocumentation warning. NFC.	2020-08-02 15:12:01 +01:00
Shinji Okumura	376b64926b	Revert "[Attributor] AAPotentialValues Interface" The commit causes a build failure.	2020-08-02 22:49:52 +09:00
Nikita Popov	a0addbb4ec	[InstSimplify] Reduce code duplication in icmp of binop folds (NFC) For folds where we check for the binop on both the LHS and RHS, extract a function that expects it on the LHS and call it with swapped order.	2020-08-02 15:47:18 +02:00
Xing GUO	8d1b9505f2	[DWARFYAML][debug_aranges] Make the 'Descriptors' field optional.	2020-08-02 21:39:44 +08:00
Simon Pilgrim	20fbbbc583	[X86] Use const APInt& in for-range loop to avoid unnecessary copies. NFCI. Fixes clang-tidy warning.	2020-08-02 14:32:23 +01:00
Simon Pilgrim	d7e2616741	[X86] Pass SDLoc by const reference. NFCI.	2020-08-02 14:32:22 +01:00
Simon Pilgrim	3f276840b6	[X86] Use const APInt& in for-range loop to avoid unnecessary copies. NFCI. Fixes clang-tidy warning.	2020-08-02 14:32:22 +01:00
Simon Pilgrim	2700311cce	[X86] combineX86ShuffleChain - pull out repeated RootVT.getSizeInBits() calls. NFCI.	2020-08-02 14:32:22 +01:00
Shinji Okumura	d3f01b6681	[Attributor] AAPotentialValues Interface This is a split patch of D80991. This patch introduces AAPotentialValues and its interface only. For more detail of AAPotentialValues abstract attribute, see the original patch. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D83283	2020-08-02 19:12:17 +09:00
Florian Hahn	ffb4735200	[SCEV] Precommit tests with signed counting down loop. From PR46939.	2020-08-02 10:26:26 +01:00
Michał Górny	21c165de2a	[CMake] Pass bugreport URL to standalone clang build BUG_REPORT_URL is currently used both in LLVM and in Clang but declared only in the latter. This means that it's missing in standalone clang builds and the driver ends up outputting: PLEASE submit a bug report to and include [...] (note the missing URL) To fix this, include LLVM_PACKAGE_BUGREPORT in LLVMConfig.cmake (similarly to how we pass PACKAGE_VERSION) and use it to fill BUG_REPORT_URL when building clang standalone. Differential Revision: https://reviews.llvm.org/D84987	2020-08-02 08:32:05 +02:00
Craig Topper	56166a3a52	[X86] Improve parity idiom recognition to handle (and (truncate (ctpop X)), 1). Fixes part of PR46954	2020-08-01 22:59:43 -07:00
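
The fold described in commit 4abc69c6f5, max (max X, Y), X --> max X, Y, can be sanity-checked by brute force over a small integer range. The sketch below is only an illustration in Python, not LLVM's InstSimplify code; the names `int_max` and `check_max_fold` are hypothetical, and it models a signed max on plain integers rather than LLVM's intrinsics.

```python
from itertools import product


def int_max(a, b):
    """Signed integer max, standing in for the max operation (illustrative only)."""
    return a if a >= b else b


def check_max_fold(bits=4):
    """Exhaustively check max(max(x, y), x) == max(x, y) for all `bits`-wide signed values.

    Also checks the commuted variants, in the spirit of the companion test commit.
    """
    lo, hi = -(1 << (bits - 1)), 1 << (bits - 1)
    for x, y in product(range(lo, hi), repeat=2):
        folded = int_max(x, y)
        assert int_max(int_max(x, y), x) == folded
        assert int_max(int_max(y, x), x) == folded
        assert int_max(x, int_max(x, y)) == folded
        assert int_max(x, int_max(y, x)) == folded
    return True


if __name__ == "__main__":
    print(check_max_fold(4))
```

The identity holds because the inner max already dominates X, so taking the max with X again cannot change the result.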

362198 Commits

All Branches