llvm-project

Commit Graph

Author	SHA1	Message	Date
Nicolas Vasilache	d67c4cc2eb	[mlir][Linalg] Reimplement and extend getStridesAndOffset Summary: This diff reimplements getStridesAndOffset in a significantly simpler way by operating on the AffineExpr and calling into simplifyAffineExpr instead of rolling its own saturating arithmetic. As a consequence it becomes quite simple to extend the behavior of getStridesAndOffset to encompass more cases by manipulating the AffineExpr more directly. The divisions are still filtered out and continue to yield fully dynamic strides. Simplifying the divisions is left for a later time if compelling use cases arise. Relevant tests are added. Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72098	2020-01-06 09:41:38 -05:00
Mitchell Balan	d45aafa2fb	[clang-format] fix conflict between FormatStyle::BWACS_MultiLine and BeforeCatch/BeforeElse Summary: Found a bug introduced with BraceWrappingFlags AfterControlStatement MultiLine. This feature conflicts with the existing BeforeCatch and BeforeElse flags. For example, our team uses BeforeElse. if (foo \|\| bar) { doSomething(); } else { doSomethingElse(); } If we enable MultiLine (which we'd really love to do) we expect it to work like this: if (foo \|\| bar) { doSomething(); } else { doSomethingElse(); } What we actually get is: if (foo \|\| bar) { doSomething(); } else { doSomethingElse(); } Reviewers: MyDeveloperDay, Bouska, mitchell-stellar Patch by: pastey Subscribers: Bouska, cfe-commits Tags: clang Differential Revision: https://reviews.llvm.org/D71939	2020-01-06 09:21:41 -05:00
Simon Pilgrim	de735247c8	[X86] Add extra PR43971 test case mentioned in D70267	2020-01-06 13:44:55 +00:00
Simon Pilgrim	5d986a68a5	[CostModel][X86] Add missing scalar i64->f32 uitofp costs	2020-01-06 13:17:02 +00:00
Simon Pilgrim	6fa6000e3e	[DAG] DAGCombiner::XformToShuffleWithZero - use APInt::extractBits helper. NFCI.	2020-01-06 13:17:02 +00:00
James Henderson	89b1184325	[test][DebugInfo][NFC] Rename method for clarity The checkGetOrParseLineTableEmitsError function could end up generating both recoverable and unrecoverable errors, but it is only intended for handling the latter. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D72156	2020-01-06 11:30:26 +00:00
James Henderson	d68904f957	[NFC] Fix trivial typos in comments Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D72143 Patch by Kazuaki Ishizaki.	2020-01-06 10:50:26 +00:00
Simon Pilgrim	7180d9568d	Fix MSVC "not all control paths return a value" warning. NFCI.	2020-01-06 10:20:20 +00:00
Sjoerd Meijer	0efc9e5a8c	[ARM][MVE] More MVETailPredication debug messages. NFC. I've added a few more debug messages to MVETailPredication because I wanted to trace better which instructions are added/removed. And while I was at it, I factored out one function which I thought was clearer, and have added some comments to describe better the flow between MVETailPredication and ARMLowOverheadLoops. Differential Revision: https://reviews.llvm.org/D71549	2020-01-06 09:56:02 +00:00
Shengchen Kan	5173bfcbc4	Add interface emitPrefix for MCCodeEmitter Differential Revision: https://reviews.llvm.org/D72047	2020-01-06 17:51:14 +08:00
Ehud Katz	f3f7dc3d29	[APFloat] Fix compilation warnings	2020-01-06 11:30:40 +02:00
Kern Handa	aab72f89b1	[mlir] Update mlir/CMakeLists.txt to install *.def files This is needed to consume mlir after it has been installed of the source tree. Without this, consuming mlir results a build error. Differential Revision: https://reviews.llvm.org/D72232	2020-01-06 10:02:25 +01:00
Neil Henning	103a58c8f2	Add ExternalAAWrapperPass to createLegacyPMAAResults. Our out-of-tree custom aliasing solution for the HPC# Burst compiler here at Unity makes use of the `ExternalAAwrapperPass` infrastructure to insert our custom aliasing resolution into the core of LLVM. This is great for all cases except for function inlining, where because `createLegacyPMAAResults` does not make use of `ExternalAAWrapperPass`, when we have a definite no-alias result within a function it won't be propagated to the calling function during inlining. This commit just rectifies this oversight by adding the missing dependency. Differential Revision: https://reviews.llvm.org/D71348	2020-01-06 08:50:18 +00:00
Ehud Katz	c5fb73c5d1	[APFloat] Add recoverable string parsing errors to APFloat Implementing the APFloat part in PR4745. Differential Revision: https://reviews.llvm.org/D69770	2020-01-06 10:09:01 +02:00
Anton Afanasyev	a792953330	[Metadata] Add TBAA struct metadata to `AAMDNode` Summary: Make `AAMDNodes`' `getAAMetadata()` and `setAAMetadata()` to take `!tbaa.struct` into account as well as `!tbaa`. This impacts llvm.org/pr42022. This is a temprorary fix needed to keep `!tbaa.struct` tag by SROA pass. New field `TBAAStruct` should be deleted when `!tbaa` tag replaces `!tbaa.struct`. Merging two `!tbaa.struct`'s to one is conservatively considered to be `nullptr` (giving `MayAlias`) -- this could be enhanced, but relying on the said future replacement. Reviewers: RKSimon, spatel, vporpo Subscribers: hiraditya, kosarev, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70924	2020-01-06 11:05:15 +03:00
Kristina Brooks	ce67db4185	[Clang] Force rtlib=platform in test to avoid fails with CLANG_DEFAULT_RTLIB Driver test `cross-linux.c` fails when CLANG_DEFAULT_RTLIB is "compiler-rt" as the it expects a GCC-style `"crtbegin.o"` after `"crti.o"` but instead receives something akin to this in the frontend invocation: ``` "crt1.o" "crti.o" "/o/b/llvm/bin/../lib/clang/10.0.0/lib/linux/clang_rt.crtbegin-x86_64.o" ``` This patch adds an override to `cross-linux.c` tests so the expected result is produced regardless of the compile-time default rtlib, as having tests fail due to that is fairly confusing. After applying the patch, the test passes regardless of the CLANG_DEFAULT_RTLIB setting. Differential Revision: https://reviews.llvm.org/D72236	2020-01-06 07:21:15 +00:00
Craig Topper	19ace449a3	[TargetLowering] Use SETCC input type to call getBooleanContents instead of the setcc result type. This isn't a functonal change since we also check the bit width is the same and the input type is integer. This guarantees the input and output type are the same. But passing the input type makes the code more readable.	2020-01-05 23:15:49 -08:00
MaheshRavishankar	8aae6455c0	[mlir][spirv] Update SPIR-V documentation with information about lowering to SPIR-V dialect. Add information about - SPIRVTypeConverter - SPIRVOpLowering - Utility functions used in lowering to SPIR-V dialect.	2020-01-05 23:05:37 -08:00
Fangrui Song	2e46695003	[MC] Reorder members of MCFragment's subclasses to decrease padding On a 64-bit platform: sizeof(MCBoundaryAlignFragment): 64 -> 56 sizeof(MCOrgFragment): 72 -> 64 sizeof(MCFillFragment): 80 -> 72 sizeof(MCLEBFragment): 88 -> 80	2020-01-05 20:22:16 -08:00
Fangrui Song	806a2b1f3d	[MC] Reorder MCFragment members to decrease padding sizeof(MCFragment) does not change, but some if its subclasses do, e.g. on a 64-bit platform, sizeof(MCEncodedFragment) decreases from 64 to 56, sizeof(MCDataFragment) decreases from 224 to 216.	2020-01-05 19:09:40 -08:00
QingShan Zhang	b9780f4f80	[DAGCombine] Don't check the legality of type when combine the SIGN_EXTEND_INREG This is the DAG node for SIGN_EXTEND_INREG : t21: v4i32 = sign_extend_inreg t18, ValueType:ch:v4i16 It has two operands. The first one is the value it want to extend, and the second one is the type to specify how to extend the value. For this example, it means that, it is signed extend the t18(v4i32) from v4i16 to v4i32. That is the semantics of c code: vector int foo(vector int m) { return m << 16 >> 16; } And it could be any vector type that hardware support the operation, though the type 'v4i16' is NOT legal for the target. When we are trying to combine the srl + sra, what we did now is calling the TLI.isOperationLegal(), which will also check the legality of the type. That doesn't make sense. Differential Revision: https://reviews.llvm.org/D70230	2020-01-06 03:00:58 +00:00
Fangrui Song	2c053109fa	[MC] Delete MCFragment::isDummy. NFC isa<...>, dyn_cast<...> and cast<...> are used by other fragments. Don't make MCDummyFragment special.	2020-01-05 18:49:47 -08:00
Craig Topper	95840866b7	[X86] Improve v2i64->v2f32 and v4i64->v4f32 uint_to_fp on avx and avx2 targets. Summary: Based on Simon's D52965, but improved to handle strict fp and improve some of the shuffling. Rather than use v2i1/v4i1 and let type legalization continue, just generate all the code with legal types and use an explicit shuffle. I also added an explicit setcc to the v4i64 code to match the semantics of vselect which doesn't just use the sign bit. I'm also using a v4i64->v4i32 truncate instead of the shuffle in Simon's original code. With the setcc this will become a pack. Future work can look into using X86ISD::BLENDV and a different shuffle that only moves the sign bit. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71956	2020-01-05 17:44:08 -08:00
Liu, Chen3	ca3bf289a7	[NFC] Modify the format: Drop the else since we alerady returned in the if.	2020-01-06 09:35:19 +08:00
Brian Gesiak	83a9321f60	[Coroutines] Remove corresponding phi values when apply simplifyTerminatorLeadingToRet Summary: In addMustTailToCoroResumes, we set musttail on those resume instructions that are followed by a ret instruction. This is done by simplifyTerminatorLeadingToRet which replace a sequence of branches leading to a ret with a clone of the ret. However it forgets to remove corresponding PHI values that come from basic block of replaced branch, and may cause jumpthreading pass hangs (https://bugs.llvm.org/show_bug.cgi?id=43720) This patch fix this issue Test Plan: cppcoro library with O3+flto check-llvm Reviewers: modocache, GorNishanov, lewissbaker Reviewed By: modocache Subscribers: mehdi_amini, EricWF, hiraditya, dexonsmith, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71826 Patch by junparser (JunMa)!	2020-01-05 18:26:30 -05:00
Stephen Kelly	445f4d2310	Clang-format previous commit	2020-01-05 22:58:32 +00:00
Fangrui Song	5511861e6d	[MC][ARM] Delete MCSection::HasData and move SHF_ARM_PURECODE logic to ARMELFObjectWriter::addTargetSectionFlags This simplifies the generic interface and also makes SHF_ARM_PURECODE more robust (fixes a TODO). Inspecting MCDataFragment contents covers more cases than MCObjectStreamer::EmitBytes.	2020-01-05 14:20:34 -08:00
Stephen Kelly	35efef5351	Add missing test	2020-01-05 21:55:52 +00:00
Kristina Brooks	b18cb9c471	[Gnu toolchain] Look at standard GCC paths for libstdcxx by default Linux' current addLibCxxIncludePaths and addLibStdCxxIncludePaths are actually almost non-Linux-specific at all, and can be reused almost as such for all gcc toolchains. Only keep Android/Freescale/Cray hacks in Linux's version. Patch by sthibaul (Samuel Thibault) Differential Revision: https://reviews.llvm.org/D69758	2020-01-05 21:43:18 +00:00
Fangrui Song	586acd8490	[MC] Delete MCSection::{rbegin,rend}	2020-01-05 12:51:15 -08:00
Stephen Kelly	ad0a45833b	Allow using traverse() with bindings	2020-01-05 20:48:56 +00:00
Stephen Kelly	4711512384	Fix oversight in AST traversal helper	2020-01-05 20:27:37 +00:00
Fangrui Song	124b918bd3	[MC] Merge MCSymbol::getSectionPtr into getSection and simplify	2020-01-05 12:03:40 -08:00
Fangrui Song	c764304adc	[MC] Drop an unused rule about absolute temporary symbols	2020-01-05 11:39:52 -08:00
Simon Pilgrim	e3bd011890	[X86][SSE] Combine combineLogicBlendIntoConditionalNegate for VSELECT nodes (PR43660) Attempt to use combineLogicBlendIntoConditionalNegate for (select M, (sub 0, X), X) -> (sub (xor X, M), M) We limit this to cases that can't easily replace the VSELECT with a shuffle (non-constant masks) or where a BLENDV is likely to occur (which tends to result in slower codegen).	2020-01-05 18:50:44 +00:00
Simon Pilgrim	6a6e6f04ec	[X86] Move combineLogicBlendIntoConditionalNegate before combineSelect. NFCI. Updates function order in preparation of future fix for PR43660	2020-01-05 17:17:41 +00:00
Simon Pilgrim	3db84f142a	[X86] Merge (identical) LowerGC_TRANSITION_START and LowerGC_TRANSITION_END (NFC) Silences a copy+paste analyzer warning - all they are doing are inserting NOOPs in exactly the same way.	2020-01-05 15:24:57 +00:00
David Green	fb8c9a339a	[ARM] Use isFMAFasterThanFMulAndFAdd for scalars as well as MVE vectors This adds extra scalar handling to isFMAFasterThanFMulAndFAdd, allowing the target independent code to handle more folds in more situations (for example if the fast math flags are present, but the global AllowFPOpFusion option isnt). It also splits apart the HasSlowFPVMLx into HasSlowFPVFMx, to allow VFMA and VMLA to be controlled separately if needed. Differential Revision: https://reviews.llvm.org/D72139	2020-01-05 11:24:04 +00:00
David Green	c15a56f61a	[ARM] Fill in FP16 FMA patterns This adds fp16 variants of all the fma patterns in the ARM backend. Differential Revision: https://reviews.llvm.org/D72138	2020-01-05 11:24:04 +00:00
David Green	5a25399221	[ARM] Add and update FMA tests. NFC	2020-01-05 11:24:04 +00:00
David Green	170de3de2e	[ParserTest] Move raw string literal out of macro Some combinations of gcc and ccache do not deal well with raw strings in macros. Moving the string out to attempt to fix the bots.	2020-01-05 11:24:04 +00:00
Craig Topper	4e37d60f2a	[LegalizeVectorOps][X86] Enable expansion of vector fp_to_uint in LegalizeVectorOps to avoid scalarization. The code here isn't great in all caess. Particularly v4f64->v4i32 on 64-bit AVX targets. But there is some improvement in some configurations. There's definitely some issues with computeNumSignBits with X86ISD::STRICT_FCMP. As well as not being able to propagate sign bits through merge_values nodes that get created during custom legalization.	2020-01-04 19:18:54 -08:00
Craig Topper	16a67d252c	[TargetLowering] In expandFP_TO_UINT, add proper extend or truncate for the condition to feed the DstVT select. Previously, for vectors we created a vselect with a condition that didn't match what the target wanted according to getSetCCResultType. To make up for this, X86 had a special DAG combine to detect if the condition was all sign bits and then insert its own truncate or extend. By adding the extend/truncate here explicitly we can avoid that.	2020-01-04 18:15:20 -08:00
Craig Topper	285d5e6b8b	[LegalizeVectorOps] Split most of ExpandStrictFPOp into a separate UnrollStrictFPOp method. Call that method from ExpandUINT_TO_FLOAT. ExpandStrictFPOp calls ExpandUINT_TO_FLOAT. Previously, ExpandUINT_TO_FLOAT returned SDValue() if it wasn't able to handle and needed to unroll. Then ExpandStrictFPOp would detect his SDValue() and do the unroll. After this change, ExpandUINT_TO_FLOAT will directly call UnrollStrictFPOp and return the unrolled result.	2020-01-04 17:03:50 -08:00
Fangrui Song	085898d469	[ELF] Drop const qualifier to fix -Wrange-loop-analysis. NFC ``` lld/ELF/Relocations.cpp:1622:56: warning: loop variable 'ts' of type 'const std::pair<ThunkSection , uint32_t>' (aka 'const pair<lld:🧝:ThunkSection , unsigned int>') creates a copy from type 'const std::pair<ThunkSection , uint32_t>' [-Wrange-loop-analysis] for (const std::pair<ThunkSection , uint32_t> ts : isd->thunkSections) ``` Drop const qualifier to fix -Wrange-loop-analysis. We can make -Wrange-loop-analysis warnings (DiagnoseForRangeConstVariableCopies) on `const A` more permissive on more types (e.g. POD -> trivially copyable), unfortunately it will not make std::pair good, because `constexpr pair& operator=(const pair& p);` is unfortunately user-defined. Reviewed By: Mordante Differential Revision: https://reviews.llvm.org/D72211	2020-01-04 12:24:39 -08:00
Matt Arsenault	d12f2a2998	GlobalISel: Scalarize all division operations This only handled G_SDIV, but they all are trivially scalarizable. Also define placeholder AMDGPU division legalizer rules.	2020-01-04 13:47:10 -05:00
Florian Hahn	b8a3c34eee	Revert "[SCEV] Move ScalarEvolutionExpander.cpp to Transforms/Utils (NFC)." This reverts commit `51ef53f3bd`, as it breaks some bots.	2020-01-04 18:44:38 +00:00
Florian Hahn	51ef53f3bd	[SCEV] Move ScalarEvolutionExpander.cpp to Transforms/Utils (NFC). SCEVExpander modifies the underlying function so it is more suitable in Transforms/Utils, rather than Analysis. This allows using other transform utils in SCEVExpander. Reviewers: sanjoy.google, efriedma, reames Reviewed By: sanjoy.google Differential Revision: https://reviews.llvm.org/D71537	2020-01-04 18:29:35 +00:00
Florian Hahn	99f74a64a2	[SCEV] Remove unused ScalarEvolutionExpander.h includes (NFC).	2020-01-04 18:29:35 +00:00
Matt Arsenault	1f950ced50	GlobalISel: Define G_READCYCLECOUNTER	2020-01-04 13:10:19 -05:00

... 3 4 5 6 7 ...

338803 Commits All Branches Search

338803 Commits

All Branches