llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	21bd3767c8	X86MacroFusion.cpp - ensure X86MacroFusion.h module header is included first. NFC.	2020-04-20 14:25:15 +01:00
Simon Pilgrim	4ba7ae85da	X86Subtarget.h - remove unused includes. NFC. Replace with forward declarations.	2020-04-20 12:10:45 +01:00
Simon Pilgrim	2cfcbc52c3	X86Subtarget.cpp - sort includes. NFC Ensure X86Subtarget.h module header is at the top, and sort the remaining includes.	2020-04-20 11:54:04 +01:00
Simon Pilgrim	179dced13b	X86MCTargetDesc.h - remove unnecessary MCStreamer.h include. NFC. We don't need all of MCStreamer.h, just FormattedStream.h. The rest can be replaced with forward declarations. X86WinAllocaExpander.cpp had an implicit dependency on MapVector.h which I've added locally.	2020-04-20 11:39:38 +01:00
Simon Pilgrim	44cf9b85ad	X86MCAsmInfo.h - remove unnecessary MCAsmInfo.h include. NFC. We only use the COFF/Darwin/ELF classes directly.	2020-04-20 11:39:38 +01:00
Simon Pilgrim	da3bf811be	X86InstrFoldTables.h - remove unnecessary include. NFC. We don't need the limits defines, just the sized integer types so use cstdint system header directly.	2020-04-20 11:39:38 +01:00
Sam Parker	e3056ae9a0	[NFC][TTI] Explicit use of VectorType The API for shuffles and reductions uses generic Type parameters, instead of VectorType, and so assertions and casts are used a lot. This patch makes those types explicit, which means that the clients can't be lazy, but results in less ambiguity, and that can only be a good thing. Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=45562 Differential Revision: https://reviews.llvm.org/D78357	2020-04-20 09:16:52 +01:00
Xiang1 Zhang	0980038a5e	Handle CET for -exception-model sjlj Summary: In SjLj exception mode, the old landingpad BB will create a new landingpad BB and use indirect branch jump to the old landingpad BB in lowering. So we should add 2 endbr for this exception model. Reviewers: hjl.tools, craig.topper, annita.zhang, LuoYuanke, pengfei, efriedma Reviewed By: LuoYuanke Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77124	2020-04-20 11:13:40 +08:00
Shengchen Kan	b78c3c89c2	[X86][MC][NFC] Reduce the parameters of functions in X86MCCodeEmitter(Part III) Summary: When we encode an instruction, we need to know the number of bytes being emitted to determine the fixups in `X86MCCodeEmitter::emitImmediate`. There are only two callers for `emitImmediate`: `emitMemModRMByte` and `encodeInstruction`. Before this patch, we kept track of the current byte being emitted by passing a reference parameter `CurByte` across all the `emit` funtions, which is ugly and unnecessary. For example, we don't have any fixups when emitting prefixes, so we don't need to track this value. In this patch, we use `StartByte` to record the initial status of the streamer, and use `OS.tell()` to get the current status of the streamer when we need to know the number of bytes being emitted. On one hand, this eliminates the parameter `CurByte` for most `emit` functions, on the other hand, this make things clear: Only pass the parameter when we really need it. Reviewers: craig.topper, pengfei, MaskRay Reviewed By: craig.topper, MaskRay Subscribers: hiraditya, llvm-commits, annita.zhang Tags: #llvm Differential Revision: https://reviews.llvm.org/D78419	2020-04-20 10:03:41 +08:00
Craig Topper	d7e2d937bc	[X86] Add X86ISD nodes for PDEP and PEXT. This will allow use to add DAG combines for these instructions.	2020-04-19 16:14:13 -07:00
Simon Pilgrim	a938c7b9ed	X86CallLowering.h - remove unnecessary ArrayRef.h include. NFC.	2020-04-19 21:25:10 +01:00
Simon Pilgrim	8859c7f6eb	X86MachineFunctionInfo.h - remove unused include. NFC.	2020-04-19 16:58:59 +01:00
Simon Pilgrim	c27fdc84df	X86InstrInfo.h - remove unused forward declarations. NFC.	2020-04-19 16:58:59 +01:00
Simon Pilgrim	a156646443	X86DisassemblerDecoder.h - remove unused forward declaration. NFC.	2020-04-19 16:58:58 +01:00
Sanjay Patel	720015e537	[x86] avoid build warning for enum mismatch; NFC gcc may warn here because X86ISD::NodeType is specified as "unsigned", but ISD::NodeType is a naked C enum (although passed as an "unsigned" throughout SDAG).	2020-04-19 10:22:11 -04:00
Simon Pilgrim	60765e911d	X86MCTargetDesc.h - remove unnecessary includes and forward declarations. NFC.	2020-04-19 14:29:35 +01:00
Simon Pilgrim	18bf42a86c	X86.h - remove unused forward declarations. NFC.	2020-04-19 14:28:52 +01:00
Simon Pilgrim	84aab8b772	X86SelectionDAGInfo.h - remove unnecessary includes and forward declarations. NFC.	2020-04-19 14:20:53 +01:00
Simon Pilgrim	44d91cac76	X86TargetTransformInfo.h - remove unnecessary includes. NFC.	2020-04-19 14:03:43 +01:00
Simon Pilgrim	e71dd7c011	[X86][SSE] getFauxShuffle - don't combine shuffles with small truncated scalars (PR45604) getFauxShuffle attempts to combine INSERT_VECTOR_ELT(TRUNCATE/EXTEND(EXTRACT_VECTOR_ELT(x))) patterns into a target shuffle chain. PR45604 identified an issue where the scalar was truncated to a size smaller than the destination vector element and then zero extended back, which requires the upper bits to be zero'd which we don't currently do. To avoid the bug I've added an early out in these truncation cases, a future commit should allow us to handle this by inserting the necessary SM_SentinelZero padding.	2020-04-19 13:35:22 +01:00
Sanjay Patel	cceb630a07	[x86] use vector instructions to lower more FP->int->FP casts This is an enhancement to D77895 to avoid another round-trip from XMM->GPR->XMM. This time we handle the case of starting/ending with an f64 and casting to signed i32 as the intermediate value. It's a bit more involved than I initially assumed because we need to use target-specific opcodes to represent the non-standard cast ops. Differential Revision: https://reviews.llvm.org/D78362	2020-04-19 08:33:17 -04:00
Simon Pilgrim	9559557014	X86InstrFMA3Info.h - remove unnecessary includes. NFC. There were a number of cpp files explicitly relying on X86InstrFMA3Info.h to include the X86.h header - so I've had to add it locally.	2020-04-19 12:17:56 +01:00
Simon Pilgrim	d49646e6de	X86AsmPrinter.h - cleanup includes and forward declarations. NFC. Reduce X86Subtarget.h/MCCodeEmitter.h/TargetMachine.h includes to forward declarations Add explicit X86Subtarget.h/TargetMachine.h includes to X86AsmPrinter.cpp/X86MCInstLower.cpp Remove unused MCSymbol forward declaration	2020-04-19 11:38:50 +01:00
Shengchen Kan	0d3149f431	[MC][X86] Disable branch align in non-text section Summary: The instruction in non-text section can not be executed, so they will not affect performance. In addition, their encoding values are treated as data, so we should not touch them. Reviewers: MaskRay, reames, LuoYuanke, jyknight Reviewed By: MaskRay Subscribers: annita.zhang, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77971	2020-04-18 14:41:25 +08:00
Andrew Litteken	8d5024f7fe	fix to outline cfi instruction when can be grouped in a tail call [MachineOutliner] fix test for excluding CFI and add test to include CFI in outlining New test to check that we only outline CFI instruction if all CFI Instructions in the function would be captured by the outlining adding x86 tests analagous to AARCH64 cfi tests Revision: https://reviews.llvm.org/D77852	2020-04-17 22:26:34 -07:00
Christopher Tetreault	dd24fb388b	Clean up usages of asserting vector getters in Type Summary: Remove usages of asserting vector getters in Type in preparation for the VectorType refactor. The existence of these functions complicates the refactor while adding little value. Reviewers: craig.topper, sdesmalen, efriedma, RKSimon Reviewed By: efriedma Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77264	2020-04-17 10:49:16 -07:00
Benjamin Kramer	166467e822	[VectorUtils] Create shufflevector masks as int vectors instead of Constants No functionality change intended.	2020-04-17 15:28:00 +02:00
Sanjay Patel	818126ae97	[x86] rename variables for types for readability; NFC This gets harder to follow if we allow changing types/sizes between source, dest, and intermediate value.	2020-04-17 08:41:18 -04:00
Shengchen Kan	c82faea9fb	Recommit [X86][MC][NFC] Reduce the parameters of functions in X86MCCodeEmitter(Part II) Previous patch didn't handle the early return in `emitREXPrefix` correctly, which causes REX prefix was not emitted for instruction without operands. This patch includes the fix for that.	2020-04-17 19:42:35 +08:00
Shengchen Kan	c5fa0a4d4b	Temporaily revert [X86][MC][NFC] Reduce the parameters of functions in X86MCCodeEmitter(Part II) It causes some encoding fails. Plan to recommit it after fixing that. This reverts commit `3017580c79`.	2020-04-17 14:11:33 +08:00
Shengchen Kan	3017580c79	[X86][MC][NFC] Reduce the parameters of functions in X86MCCodeEmitter(Part II) Summary: We determine the REX prefix used by instruction in `determineREXPrefix`, and this value is used in `emitMemModRMByte' and used as the return value of `emitOpcodePrefix`. Before this patch, REX was passed as reference to `emitPrefixImpl`, it is strange and not necessary, e.g, we have to write ``` bool Rex = false; emitPrefixImpl(CurOp, CurByte, Rex, MI, STI, OS); ``` in `emitPrefix` even if `Rex` will not be used. So we let HasREX be the return value of `emitPrefixImpl`. The HasREX is passed from `emitREXPrefix` to `emitOpcodePrefix` and then to `emitPrefixImpl`. This makes sense since REX is a kind of opcode prefix and of course is a prefix. Reviewers: craig.topper, pengfei Reviewed By: craig.topper Subscribers: annita.zhang, craig.topper, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78276	2020-04-17 13:32:19 +08:00
Shengchen Kan	71303b753c	[X86] Add interface X86II::isPseudo Avoid duplicate code in X86MCCodeEmitter, NFCI.	2020-04-16 12:40:17 +08:00
Shengchen Kan	7aaaea5acd	[X86][MC][NFC] Code cleanup in X86MCCodeEmitter Make some function static, move the definitions of functions to a better place and use C++ style cast, etc.	2020-04-16 11:30:49 +08:00
Shengchen Kan	6c66bb393e	[X86][MC][NFC] Refine code in X86MCCodeEmitter As we mentioned in D78180, merge some if clauses and use CamelCase for variables, etc.	2020-04-16 10:43:42 +08:00
Shengchen Kan	322ac2e917	[X86][MC][NFC] Reduce the parameters of functions in X86MCCodeEmitter(Part I) Summary: The function in X86MCCodeEmitter has too many parameters to make it look messy, and some parameters are unnecessary. This is the first patch to reduce their parameters. The follwing operations are cheap ``` unsigned Opcode = MI.getOpcode(); const MCInstrDesc &Desc = MCII.get(Opcode); uint64_t TSFlags = Desc.TSFlags; ``` So if we pass a `MCInst`, we don't need to pass `MCInstrDesc`; if we pass a `MCInstrDesc`, we don't need to pass `TSFlags`. Reviewers: craig.topper, MaskRay, pengfei Reviewed By: craig.topper Subscribers: annita.zhang, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78180	2020-04-16 09:53:45 +08:00
Christopher Tetreault	85247c1e89	[SVE] Remove calls to getBitWidth from x86 Reviewers: efriedma, RKSimon, sdesmalen Reviewed By: RKSimon Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77901	2020-04-15 15:48:48 -07:00
Craig Topper	8dfb9627b7	[X86] Make v32i16/v64i8 legal types without avx512bw. Use custom splitting instead. This moves v32i16/v64i8 to a model consistent with how we treat integer types with avx1. This does change the ABI for types vXi16/vXi8 vectors larger than 512 bits to pass in multiple zmms instead of multiple ymms. We'd already hacked some code to make v64i8/v32i16 pass in zmm. Cost model is still a bit of a mess. In some place I tried to match existing behavior. But really we need to account for splitting and concating costs. Cost model for shuffles is especially pessimistic. Differential Revision: https://reviews.llvm.org/D76212	2020-04-15 12:17:18 -07:00
Craig Topper	a916e81927	[X86] Various improvements to our vector splitting helpers for lowering. NFC -Consistently name the functions as split* -Add a helper for doing the two extractSubvector calls and determining the size of the split -Use getSplitDestVTs to get the result type for the split node. -Move the binary and unary helper to one place in the file near the extractSubvector functions. Left the VSETCC one near LowerVSETCC since that's its only caller. -Remove the 256/512 wrappers that just had asserts. I don't think they provided a lot of value and now with the routines called split* the call sites are more obvious what they do. -Make the unary routine support different source and dest types to support D76212. -Add some weaker asserts into the helpers to make up for losing the very specific asserts from the 256/512 wrappers. Differential Revision: https://reviews.llvm.org/D78176	2020-04-15 10:57:53 -07:00
Benjamin Kramer	316b49d373	Pass shufflevector indices as int instead of unsigned. No functionality change intended.	2020-04-15 15:52:49 +02:00
Benjamin Kramer	6f64daca8f	Upgrade calls to CreateShuffleVector to use the preferred form of passing an array of ints No functionality change intended.	2020-04-15 12:51:38 +02:00
Simon Pilgrim	426f37584e	[TTI][X86] Add X86TTIImpl::getScalarizationOverhead implementation. This is a currently just a wrapper to the base type, I'll be adding ISD::BUILD_VECTOR costs in a future patch.	2020-04-14 12:58:19 +01:00
Georgii Rymar	1647ff6e27	[ADT/STLExtras.h] - Add llvm::is_sorted wrapper and update callers. It can be used to avoid passing the begin and end of a range. This makes the code shorter and it is consistent with another wrappers we already have. Differential revision: https://reviews.llvm.org/D78016	2020-04-14 14:11:02 +03:00
Craig Topper	2f60fbce6c	[X86] Use a more realisitic cost for truncate v16i64->v16i8 with avx512f. Still not great and we could probably codegen this better, but 11 was clearly ridiculous.	2020-04-13 21:09:43 -07:00
Craig Topper	535a566a01	[X86] Split AVX512 getCastInstrCost into tables that require useAVX512Regs() and those that just operate on 256 or smaller vectors. Use useAVX512Regs() to skip lookups instead of using type legalization action.	2020-04-13 21:09:42 -07:00
Craig Topper	071c64d68d	[X86] Add a more accurate truncate cost for v8i64->v8i8	2020-04-13 21:09:41 -07:00
Craig Topper	113f37a1f9	[CallSite removal][TargetLowering] Replace ImmutableCallSite with CallBase Differential Revision: https://reviews.llvm.org/D77995	2020-04-13 13:50:15 -07:00
Craig Topper	6dbf1a1229	[X86] Move X86ShuffleDecode.cpp/h into MCTargetDesc and remove X86Utils library. NFC The shuffle decoding is used by X86ISelLowering and MCTargetDesc/X86InstComments. The latter used to be in a separate InstPrinter library. The Utils library existed to allow InstPrinter and CodeGen to share the shuffle decoding. Since X86InstComments now lives in the MCTargetDesc, which CodeGen already depends on, we can sink the shuffle decoding there as well. Differential Revision: https://reviews.llvm.org/D77980	2020-04-13 10:14:08 -07:00
Jay Foad	bc78baec4c	[X86] Improve combineVectorShiftImm Summary: Fold (shift (shift X, C2), C1) -> (shift X, (C1 + C2)) for logical as well as arithmetic shifts. This is needed to prevent regressions from an upcoming funnel shift expansion change. While we're here, fold (VSRAI -1, C) -> -1 too. Reviewers: RKSimon, craig.topper Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77300	2020-04-13 15:54:55 +01:00
Simon Pilgrim	401cbe373b	[X86][AVX] Attempt to scale masked shuffles to match the root type Improve the chances of folding the writemask into the combined shuffle by scaling a wider shuffle mask to match the root's original type. This creates a few minor issues with variable shuffles, preventing combines of shuffles because of the more limited support binary shuffle types. In most cases we're probably better off combining the shuffles and losing the writemask fold, but this isn't always going to be true.	2020-04-13 14:57:25 +01:00
Simon Pilgrim	fdd9ff9700	[X86][AVX] Create splitVectorIntBinary helper. Removes duplicate code from split256IntArith/split512IntArith.	2020-04-13 13:09:38 +01:00

1 2 3 4 5 ...

20353 Commits