llvm-project

Commit Graph

Author	SHA1	Message	Date
Martin Storsjö	8dc8203932	[LLD] [COFF] Fix test to properly test all aspects of `c3b1d730d6`. NFC. Previously, the test could pass with one part of `c3b1d730d6` removed.	2020-07-26 23:10:20 +03:00
Jez Ng	9282d04e04	[lld-macho] Support lookup of dylibs in frameworks Needed for testing Objective-C programs (since e.g. Core Foundation is a framework) Reviewed By: #lld-macho, compnerd Differential Revision: https://reviews.llvm.org/D83925	2020-07-26 12:46:46 -07:00
Craig Topper	df12524e6b	[X86] Turn X86DAGToDAGISel::tryVPTERNLOG into a fully custom instruction selector that can handle bitcasts between logic ops Previously we just matched the logic ops and replaced with an X86ISD::VPTERNLOG node that we would send through the normal pattern match. But that approach couldn't handle a bitcast between the logic ops. Extending that approach would require us to peek through the bitcasts and emit new bitcasts to match the types. Those new bitcasts would then have to be properly topologically sorted. This patch instead switches to directly emitting the MachineSDNode and skips the normal tablegen pattern matching. We do have to handle load folding and broadcast load folding ourselves now. Which also means commuting the immediate control. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D83630	2020-07-26 12:19:08 -07:00
Tim Keith	7454acdf3b	[flang] Fix implicit declarations in statement functions If a symbol (that is not a dummy argument) is implicitly declared inside a statement function, don't create it in the statement function's scope. Instead, treat statement functions like blocks when finding the inclusive scope and create the symbol there. Add a new flag, StmtFunction, to symbols that represent statement functions. Differential Revision: https://reviews.llvm.org/D84588	2020-07-26 12:13:39 -07:00
Hannes Käufler	3bbf3e026d	Replace comment by private method; NFC.	2020-07-26 13:59:45 -04:00
Craig Topper	1a75d88b3e	[X86] Move getGatherOverhead/getScatterOverhead into X86TargetTransformInfo. These cost methods don't make much sense in X86Subtarget. Make them methods in X86's TTI and move the feature checks from the X86Subtarget constructor into these methods. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D84594	2020-07-26 10:38:42 -07:00
Juneyoung Lee	ab4e1be7ab	[InstCombine] Add a test for folding freeze into phi; NFC	2020-07-27 02:24:00 +09:00
Bruno Ricci	b0512eed1e	[clang][NFC] Add a test for __attribute__((flag_enum)) with an unnamed enumeration.	2020-07-26 17:24:43 +01:00
Bruno Ricci	a2f83d5a07	[clang][NFC] Add tests for the use of NamedDecl::getDeclName in the unused/unneeded diagnostics.	2020-07-26 17:20:56 +01:00
Bruno Ricci	ca9bfc20f4	[clang][NFC] Remove spurious +x flag on SemaConcept.cpp	2020-07-26 17:10:59 +01:00
Simon Pilgrim	17eafe0841	[X86][SSE] lowerV2I64Shuffle - use undef elements in PSHUFD mask widening If we lower a v2i64 shuffle to PSHUFD, we currently clamp undef elements to 0, (elements 0,1 of the v4i32) which can result in the shuffle referencing more elements of the source vector than expected, affecting later shuffle combines and KnownBits/SimplifyDemanded calls. By ensuring we widen the undef mask element we allow getV4X86ShuffleImm8 to use inline elements as the default, which are more likely to fold.	2020-07-26 16:04:22 +01:00
Vincent Zhao	d135744c34	[MLIR][Affine] Add test for non-hyperrectangular loop tiling This diff provides a concrete test case for the error that will be raised when the iteration space is non hyper-rectangular. The corresponding emission method for this error message has been changed as well. Differential Revision: https://reviews.llvm.org/D84531	2020-07-26 20:17:23 +05:30
Matt Arsenault	d35e2c101d	AMDGPU/GlobalISel: Fix not constraining ds_append/consume operands	2020-07-26 10:17:36 -04:00
Matt Arsenault	f6176f8a5f	GlobalISel: Handle G_PTR_ADD in narrowScalar	2020-07-26 10:08:17 -04:00
Matt Arsenault	3e8bb7a000	GlobalISel: Handle fewerElementsVector for G_PTR_ADD	2020-07-26 10:08:09 -04:00
Matt Arsenault	7c09c173a2	AMDGPU/GlobalISel: Reorder G_CONSTANT legality rules The legal cases should be the first rules.	2020-07-26 10:05:05 -04:00
Matt Arsenault	bcf5184a68	AMDGPU/GlobalISel: Make sure <2 x s1> phis are scalarized	2020-07-26 10:04:47 -04:00
Matt Arsenault	6f961a1e7e	AMDGPU/GlobalISel: Legalize GDS atomics I noticed these don't use the _gfx9, non-m0 reading variants but not sure if that's a bug or not. It's the same in the DAG.	2020-07-26 10:03:34 -04:00
Matt Arsenault	5819159995	AMDGPU/GlobalISel: Pack constant G_BUILD_VECTOR_TRUNCs when selecting	2020-07-26 09:55:34 -04:00
Sanjay Patel	0481e1ae3c	[InstSimplify] fold integer min/max intrinsics with limit constant	2020-07-26 09:41:54 -04:00
Matt Arsenault	61ced4b87a	GlobalISel: Handle 'n' inline asm constraint	2020-07-26 09:30:41 -04:00
Matt Arsenault	4033aa1467	AMDGPU/GlobalISel: Sign extend integer constants This matches the DAG behavior and fixes immediate folding	2020-07-26 09:30:14 -04:00
Matt Arsenault	4f6502ab33	AMDGPU/GlobalISel: Replace selection tests for G_CONSTANT/G_FCONSTANT Split into separate tests and make more consistent with the others.	2020-07-26 09:30:09 -04:00
Xing GUO	b1731da871	[DWARFYAML] Rename getUsedSectionNames() to getNonEmptySectionNames(). This patch renames getUsedSectionNames() to getNonEmptySectionNames. NFC.	2020-07-26 21:10:38 +08:00
Sanjay Patel	c6cf71107a	[InstSimplify] add tests for min/max intrinsics; NFC	2020-07-26 09:04:37 -04:00
Sanjay Patel	b89ae102e6	[InstSimplify] fold fcmp using isKnownNeverInfinity + isKnownNeverNaN Follow-up to D84035 / rG7393d7574c09. This sidesteps a question of FMF/poison on fcmp raised in PR46077: http://bugs.llvm.org/PR46077 https://alive2.llvm.org/ce/z/TCsyzD define i1 @src(float %x) { %0: %x42 = fadd nnan ninf float %x, 42.000000 %r = fcmp ueq float %x42, inf ret i1 %r } => define i1 @tgt(float %x) { %0: ret i1 0 } Transformation seems to be correct! https://alive2.llvm.org/ce/z/FQaH7a define i1 @src(i8 %x) { %0: %cast = uitofp i8 %x to float %r = fcmp one float inf, %cast ret i1 %r } => define i1 @tgt(i8 %x) { %0: ret i1 1 } Transformation seems to be correct!	2020-07-26 09:04:37 -04:00
Sanjay Patel	912e9e5262	[InstSimplify] add tests for fcmp with infinity constant; NFC	2020-07-26 09:04:36 -04:00
Juneyoung Lee	920e267974	[JumpThreading] Add a test for D84598; NFC	2020-07-26 22:00:01 +09:00
Juneyoung Lee	32088f4f7f	[ConstantFolding] Fold freeze if it is never undef or poison This is a simple patch that adds constant folding for freeze instruction. IIUC, it isn't needed to update ConstantFold.cpp because there is no freeze constexpr. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84597	2020-07-26 21:54:44 +09:00
Juneyoung Lee	9f074214b7	[ValueTracking] Instruction::isBinaryOp should be used for constexprs This is a simple patch that makes canCreateUndefOrPoison use Instruction::isBinaryOp because BinaryOperator inherits Instruction. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84596	2020-07-26 21:48:51 +09:00
Juneyoung Lee	1b802fe34d	NFC; add a test for freeze's constprop	2020-07-26 21:03:23 +09:00
Juneyoung Lee	02dadab1b4	NFC; add an example that subtracts pointers to two global vars	2020-07-26 20:47:33 +09:00
Roman Lebedev	b1210c059d	[NFC][XRay] Account: migrate to DenseMap + SmallVector, -16% faster on large (3.8G) input DenseMap is a single allocation underneath, so this is has pretty expected performance impact on large-ish (3.8G) xray log processing time.	2020-07-26 14:08:07 +03:00
Roman Lebedev	ed5a6b9305	[NFC][XRay] Account: decouple getStats() interface from underlying data structure It doesn't really need to know where Timings are stored, it just needs to be able to sort them, so MutableArrayRef is enough. That uncovers an interesting quirk that it relied on implicit double->int conversion for calculating percentiles.	2020-07-26 14:08:06 +03:00
Alex Richardson	18df607dbe	[lit] Don't include tests skipped due to sharding in reports When running multiple shards, don't include skipped tests in the xunit output since merging the files will result in duplicates. In our CHERI Jenkins CI, I configured the libc++ tests to run using sharding (since we are testing using a single-CPU QEMU). We then merge the generated XUnit xml files to produce a final result, but if the individual XMLs report tests excluded due to sharding each test is included N times in the final result. This also makes it difficult to find the tests that were skipped due to missing REQUIRES: etc. Reviewed By: yln Differential Revision: https://reviews.llvm.org/D84235	2020-07-26 11:39:22 +01:00
Alex Richardson	98b61112c3	[asan] Mark the strstr test as UNSUPPORTED on FreeBSD Like Android, FreeBSDs libc calls memchr which causes this test to fail. Reviewed By: emaste Differential Revision: https://reviews.llvm.org/D84541	2020-07-26 11:37:47 +01:00
Amara Emerson	9b19400004	[AArch64][GlobalISel] Make <8 x s16> and <16 x s8> legal types for G_SHUFFLE_VECTOR and G_IMPLICIT_DEF. Trivial change, we're still missing support for rev matching for these types in the combiner.	2020-07-26 00:48:09 -07:00
Craig Topper	1a1448e656	[X86] Merge X86MCInstLowering's maxLongNopLength into emitNop and remove check for FeatureNOPL. The switch in emitNop uses 64-bit registers for nops exceeding 2 bytes. This isn't valid outside 64-bit mode. We could fix this easily enough, but there are no users that ask for more than 2 bytes outside 64-bit mode. Inlining the method to make the coupling between the two methods more explicit.	2020-07-25 22:11:47 -07:00
Craig Topper	14c59b4577	[X86] Remove getProcFamily() method from X86Subtarget. NFC This isn't used and we've decided in the past that a CPU enum for tuning is not a good idea.	2020-07-25 22:11:45 -07:00
Jacques Pienaar	595d214f47	[mlir][shape] Further operand and result type generalization Previous changes generalized some of the operands and results. Complete a larger group of those to simplify progressive lowering. Also update some of the declarative asm form due to generalization. Tried to keep it mostly mechanical.	2020-07-25 21:41:31 -07:00
Changpeng Fang	9162b70e51	DADCombiner: Don't simplify the token factor if the node's number of operands already exceeds TokenFactorInlineLimit Summary: In parallelizeChainedStores, a TokenFactor was created with the size greater than 3000. We found that DAGCombiner::visitTokenFactor will consume a huge amount of time on such nodes. Since the number of operands already exceeds TokenFactorInlineLimit, we propose to give up simplification with the consideration of compile time. Reviewers: @spatel, @arsenm Differential Revision: https://reviews.llvm.org/D84204	2020-07-25 21:20:59 -07:00
Craig Topper	1df8804ce5	[X86] Replace a use of ProcIntelSLM with FeatureFast7ByteNOP.	2020-07-25 20:46:48 -07:00
Eric Christopher	4b14ef33e8	Temporarily Revert "Unify the return value of GetByteSize to an llvm::Optional<uint64_t> (NFC-ish)" as it's causing numerous (176) test failures on linux. This reverts commit `1d9b860fb6`.	2020-07-25 18:42:04 -07:00
Eric Christopher	18975762c1	Fold StatepointBB into checks as it's only used from an NDEBUG or ASSERT context fixing an unused variable warning.	2020-07-25 18:36:53 -07:00
Nemanja Ivanovic	cdead4f89c	[PowerPC][NFC] Fix an assert that cannot trip from `7d076e19e3` I mixed up the precedence of operators in the assert and thought I had it right since there was no compiler warning. This just adds the parentheses in the expression as needed.	2020-07-25 20:28:52 -04:00
Philip Reames	55dae9c20c	[Statepoints] Style cleanup after `3da1a963` [NFC] Just fixing a few minor stylistic issues.	2020-07-25 16:40:39 -07:00
Craig Topper	c5b2371436	[X86] Add masked versions of the VPTERNLOG test cases added for D83630. NFC We don't handle these yet and D83630 won't improve that, but at least we'll have the tests.	2020-07-25 16:37:17 -07:00
Roman Lebedev	96d74530c0	[Reduce] Argument reduction: do deal with function declarations We can happily turn function definitions into declarations, thus obscuring their argument from being elided by this pass. I don't believe there is a good reason to just ignore declarations. likely even proper llvm intrinsics ones, at worst the input becomes uninteresting. The other question here is that all these transforms are all-or-nothing. In some cases, should we be treating each use separately? The main blocker here seemed to be that llvm::CloneFunctionInto() does `&OldFunc->front()`, which inserts a nullptr into a densemap, which is not happy about it and asserts.	2020-07-26 01:31:56 +03:00
Roman Lebedev	9932d74740	[Reduce] Argument reduction: do properly handle invoke insts (PR46819) replaceFunctionCalls() is very non-exhaustive, it only handles CallInst's. Which means, by the time we drop old function, there may still be uses of it lurking around. Let's instead whack-a-mole them by all by replacing with undef. I'm not sure this is the best handling, especially for calls, but IMO poorly reduced input is much better than crashing reduction tool. A (previously-crashing!) test added. Fixes https://bugs.llvm.org/show_bug.cgi?id=46819	2020-07-26 01:29:00 +03:00
Roman Lebedev	af1dd0b1ad	[Reduce] Basic block reduction: do properly handle invoke insts (PR46818) Terminator may have returned value, so we need to replace uses, and in general handle invoke as a branch inst. I'm not sure this is the best handling, but IMO poorly reduced input is much better than crashing reduction tool. A (previously-crashing!) test added. Fixes https://bugs.llvm.org/show_bug.cgi?id=46818	2020-07-26 01:28:59 +03:00

1 2 3 4 5 ...

361466 Commits All Branches Search

361466 Commits

All Branches