llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Davis	712db51edd	[llvm-mca] Cleanup the header syntax line. Fix a comment. NFC. This patch removes a few dashes from the header comment to make room for the syntax line. llvm-svn: 334986	2018-06-18 21:38:38 +00:00
Wouter van Oortmerssen	48dac3109e	[WebAssembly] Modified tablegen defs to have 2 parallel instuction sets. Summary: One for register based, much like the existing definitions, and one for stack based (suffix _S). This allows us to use registers in most of LLVM (which works better), and stack based in MC (which results in a simpler and more readable assembler / disassembler). Tried to keep this change as small as possible while passing tests, follow-up commit will: - Add reg->stack conversion in MI. - Fix asm/disasm in MC to be stack based. - Fix emitter to be stack based. tests passing: llvm-lit -v `find test -name WebAssembly` test/CodeGen/WebAssembly test/MC/WebAssembly test/MC/Disassembler/WebAssembly test/DebugInfo/WebAssembly test/CodeGen/MIR/WebAssembly test/tools/llvm-objdump/WebAssembly Reviewers: dschuff, sbc100, jgravelle-google, sunfish Subscribers: aheejin, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D48183 llvm-svn: 334985	2018-06-18 21:22:44 +00:00
Michael Berg	932ba20af8	refactor of visitFADD for AllowNewConst cases Summary: Refactoring for all constant cases which require AllowNewConst and some staging for future fmf usage. Reviewers: spatel, hfinkel, wristow Reviewed By: spatel Subscribers: nhaehnle Differential Revision: https://reviews.llvm.org/D48289 llvm-svn: 334984	2018-06-18 21:12:21 +00:00
Sander de Smalen	067eee1c13	[AArch64][SVE] Asm: Fix predicate pattern diagnostics. This patch uses the DiagnosticPredicate for SVE predicate patterns to improve their diagnostics, now giving a 'invalid operand' diagnostic if the type is not an immediate or one of the expected pattern labels. Reviewers: samparker, SjoerdMeijer, javed.absar, fhahn Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D48220 llvm-svn: 334983	2018-06-18 21:03:02 +00:00
Sander de Smalen	7ac9e193ec	[AArch64][SVE] Asm: Support for saturating INC/DEC (32bit scalar) instructions. The variants added by this patch are: - SQINC signed increment, e.g. sqinc x0, w0, all, mul #4 - SQDEC signed decrement, e.g. sqdec x0, w0, all, mul #4 - UQINC unsigned increment, e.g. uqinc w0, all, mul #4 - UQDEC unsigned decrement, e.g. uqdec w0, all, mul #4 This patch includes asmparser changes to parse a GPR64 as a GPR32 in order to satisfy the constraint check: x0 == GPR64(w0) in: sqinc x0, w0, all, mul #4 ^___^ (must match) Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47716 llvm-svn: 334980	2018-06-18 20:50:33 +00:00
Wouter van Oortmerssen	78c62966c2	[WebAssembly] Cleaned up register accessors in WebAssemblyMachineFunctionInfo.h Tested: llvm-lit -v `find test -name WebAssembly` (This is a commit access "test commit" :) llvm-svn: 334979	2018-06-18 20:45:49 +00:00
Sanjay Patel	3e52deb144	[x86] regenerate checks and adjust tests 2 of these tests were clearly not doing what the comments said they were doing. The last test was added at rL177933 with no assertions (presumably it used to crash). But either we don't have that problem anymore, or this test is folded sooner, so we don't hit the bug that was fixed by disabling late FP constant creation. Looking at this as part of reviewing D48289. llvm-svn: 334977	2018-06-18 20:05:16 +00:00
Simon Pilgrim	a5638431dc	[docs] Fix indentation of llvm-exegesis command line arguments llvm-svn: 334976	2018-06-18 20:05:02 +00:00
Craig Topper	17bd84c12c	[X86] Encode the EVEX2VEX exception list information in .td files instead of the emitter source. Rather than having an exclusion list in tablegen sources, add a flag to the X86 instruction records that can be used to suppress checking for convertibility. llvm-svn: 334971	2018-06-18 18:47:07 +00:00
Michael Berg	cafe947445	[NFC] make MIFlag accessor functions consistant with usage model llvm-svn: 334970	2018-06-18 18:37:48 +00:00
Florian Hahn	3385caaafd	[VPlan] Add VPInstruction to VPRecipe transformation. This patch introduces a VPInstructionToVPRecipe transformation, which allows us to generate code for a VPInstruction based VPlan re-using the existing infrastructure. Reviewers: dcaballe, hsaito, mssimpso, hfinkel, rengolin, mkuper, javed.absar, sguggill Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D46827 llvm-svn: 334969	2018-06-18 18:28:49 +00:00
Lang Hames	68c9b8d6a1	[ORC] Add an initial implementation of a replacement CompileOnDemandLayer. CompileOnDemandLayer2 is a replacement for CompileOnDemandLayer built on the ORC Core APIs. Functions in added modules are extracted and compiled lazily. CompileOnDemandLayer2 supports multithreaded JIT'd code, and compilation on multiple threads. llvm-svn: 334967	2018-06-18 18:01:43 +00:00
Lang Hames	2e96114caf	[ORC] Keep weak flag on VSO symbol tables during materialization, but treat materializing weak symbols as strong. This removes some elaborate flag tweaking and plays nicer with RuntimeDyld, which relies of weak/common flags to determine whether it should emit a given weak definition. (Switching to strong up-front makes it appear as if there is already an overriding definition, which would require an extra back-channel to override). llvm-svn: 334966	2018-06-18 18:01:41 +00:00
Krzysztof Parzyszek	546017322f	Shrink interval after moving copy in removePartialRedundancy llvm-svn: 334963	2018-06-18 17:16:39 +00:00
Andrea Di Biagio	a88281d8ae	[llvm-mca] Use an ordered map to collect hardware statistics. NFC. Histogram entries are now ordered by key. This should improves their readability when statistics are printed. llvm-svn: 334961	2018-06-18 17:04:56 +00:00
Nirav Dave	b35f9e1459	Fix typoed cast to avoid assertion in MCFragment::dump. llvm-svn: 334959	2018-06-18 16:26:11 +00:00
Simon Pilgrim	5b962b2fc3	[SLPVectorizer] Tidyup isShuffle helper Ensure we keep track of the input vectors in all cases instead of just for SK_Select. Ideally we'd reuse the shuffle mask pattern matching in TargetTransformInfo::getInstructionThroughput here to easily add support for all TargetTransformInfo::ShuffleKind without mass code duplication, I've added a TODO for now but D48236 should help us here. Differential Revision: https://reviews.llvm.org/D48023 llvm-svn: 334958	2018-06-18 16:25:01 +00:00
Craig Topper	88c142b42b	[TableGen] Make TiedAsmOperandTable in the AsmMatcher 'static' since its at file scope. llvm-svn: 334957	2018-06-18 16:17:46 +00:00
Craig Topper	b41a137669	[TableGen] Remove unused member variable. I think this became unused after r324196. llvm-svn: 334956	2018-06-18 16:17:45 +00:00
Florian Hahn	63cbcf98a5	[VPlanRecipeBase] Add eraseFromParent(). Reviewers: dcaballe, hsaito, mkuper, hfinkel Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D48081 llvm-svn: 334951	2018-06-18 15:18:48 +00:00
Sander de Smalen	13684d8400	[AArch64][SVE] Asm: Support for saturating INC/DEC (64bit scalar) instructions. Summary: The variants added by this patch are: - SQINC (signed increment) - UQINC (unsigned increment) - SQDEC (signed decrement) - UQDEC (unsigned decrement) For example: uqincw x0, all, mul #4 Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Differential Revision: https://reviews.llvm.org/D47715 llvm-svn: 334948	2018-06-18 14:47:52 +00:00
Simon Pilgrim	9173c97ce4	[X86][BtVer2] Flag AVX2+ scheduler classes as unsupported Jaguar only supports up to AVX1 Differential Revision: https://reviews.llvm.org/D48274 llvm-svn: 334947	2018-06-18 14:31:14 +00:00
Andrea Di Biagio	487da729a2	[llvm-mca] Add tests for XOP and AVX512 instructions that implicitly clear the upper portion of a super-register. When the destination register of a XOP instruction is an XMM register, bits [255:128] of the corresponding YMM register are cleared. When the destination register of a EVEX encoded instruction is an XMM/YMM register, the upper bits of the corresponding ZMM are cleared. On processors that feature AVX512, a write to an XMM registers always clears the upper portion of the corresponding ZMM register if the instruction is VEX or EVEX encoded. These new tests show some interesting cases which aren't correctly analyzed by llvm-mca. The lack of knowledge related to the implicit update on the super-registers is addressed by D48225. llvm-svn: 334945	2018-06-18 14:00:30 +00:00
Florian Hahn	3bcff3662c	[VPlan] Fix sanitizer problem with insertBefore. llvm-svn: 334943	2018-06-18 13:51:28 +00:00
Sander de Smalen	118099a62c	[TableGen][AsmMatcherEmitter] Allow tied operands of different classes in aliases. Allow a tied operand of a different operand class in InstAliases, so that the operand can be printed (and added to the MC instruction) as the appropriate register. For example, 'GPR64as32', which would be printed/parsed as a 32bit register and should match a tied 64bit register operand, where the former is a sub-register of the latter. This patch also generalizes the constraint checking to an overrideable method in MCTargetAsmParser, so that target asmparsers can specify whether a given operand satisfies the tied register constraint. Reviewers: olista01, rengolin, fhahn, SjoerdMeijer, samparker, dsanders, craig.topper Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47714 llvm-svn: 334942	2018-06-18 13:39:29 +00:00
Paul Robinson	7555c589af	Update copyright year to 2018. llvm-svn: 334936	2018-06-18 12:22:17 +00:00
Simon Pilgrim	99a5832016	[SLPVectorizer] Avoid calling const VL.size() repeatedly in for-loop. NFCI. llvm-svn: 334934	2018-06-18 11:35:36 +00:00
Florian Hahn	7591e4e94a	[VPlanRecipeBase] Add insertBefore helper. Reviewers: dcaballe, mkuper, hfinkel, hsaito, mssimpso Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D48080 llvm-svn: 334933	2018-06-18 11:34:17 +00:00
Clement Courbet	e752fd65e8	[llvm-exegesis] Optionally ignore instructions without a sched class. Summary: See PR37602. Reviewers: RKSimon Subscribers: llvm-commits, tschuett Differential Revision: https://reviews.llvm.org/D48267 llvm-svn: 334932	2018-06-18 11:27:47 +00:00
Sander de Smalen	d521c4353e	[AArch64][SVE] Asm: Support for vector element compares. This patch adds instructions for comparing elements from two vectors, e.g. cmpgt p0.s, p0/z, z0.s, z1.s and also adds support for comparing to a 64-bit wide element vector, e.g. cmpgt p0.s, p0/z, z0.s, z1.d The patch also contains aliases for certain comparisons, e.g.: cmple p0.s, p0/z, z0.s, z1.s => cmpge p0.s, p0/z, z1.s, z0.s cmplo p0.s, p0/z, z0.s, z1.s => cmphi p0.s, p0/z, z1.s, z0.s cmpls p0.s, p0/z, z0.s, z1.s => cmphs p0.s, p0/z, z1.s, z0.s cmplt p0.s, p0/z, z0.s, z1.s => cmpgt p0.s, p0/z, z1.s, z0.s llvm-svn: 334931	2018-06-18 10:59:19 +00:00
Clement Courbet	0d9da88d18	[X86] Fix NOOP sched overrides on BDW/HSW/SKL. Summary: Noop certainly does not use resources. Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits, gchatelet Differential Revision: https://reviews.llvm.org/D48028 llvm-svn: 334927	2018-06-18 06:48:22 +00:00
Craig Topper	f0ab7bd196	[X86] Create X86InstrFMA3Group objects fully in a static table instead of on the heap. NFCI Previously we heap allocated the X86InstrFMA3Group objects which were created by passing them small register/memory opcode arrays that existed as individual static tables. Rather than a bunch of small static arrays we now have one large static table of X86InstrFMA3Group objects. Rather than storing a pointer to the opcode arrays in the X86InstrFMA3Group object, we now store have a register and memory array as part of the object. If a group doesn't have memory or register opcodes, the array entries will be 0. This greatly simplifies the destruction of the X86InstrFMA3Info object. We no longer need to delete the X86InstrFMA3Group objects as we destruct the DenseMap. And we don't need to keep track of which ones we already deleted. This reduces the llc binary size on my local machine by ~50k. I can only assume that's really due to the fact that we had something like 512 small static arrays that we passed to the init functions either one at a time or in pairs. So there were between 256 and 512 distinct calls to the init functions in the initOnceImpl method. llvm-svn: 334925	2018-06-18 06:32:22 +00:00
Craig Topper	16fdde5e63	[X86] Add '.s' aliases to the assembler for the various redundant move encodings to match gas and our EVEX instructions. We already have these aliases for EVEX enocded instructions, but not for the GPR, MMX, SSE, and VEX versions. Also remove the vpextrw.s EVEX alias. That's not something gas implements. llvm-svn: 334922	2018-06-18 05:00:50 +00:00
Craig Topper	916d0cf649	[X86] Move the 'vmovq.s' and similar assembly strings for EVEX vector moves with reversed operands to InstAliases. The .s assembly strings allow the reversed forms to be targeted from assembly which matches gas behavior. But when printing the instructions we should print them without the .s to match other tooling like objdump. By using InstAliases we can use the normal string in the instruction and just hide it from the assembly parser. Ideally we'd add the .s versions to the legacy SSE and VEX versions as well for full compatibility with gas. Not sure how we got to state where only EVEX was supported. llvm-svn: 334920	2018-06-18 01:28:05 +00:00
Craig Topper	2be74395cf	[TableGen] Prevent double flattening of InstAlias asm strings in the asm matcher emitter. Unlike CodeGenInstruction, CodeGenInstAlias was flatting asm strings in its constructor. For instructions it was the users responsibility to flatten the string. AsmMatcherEmitter didn't know this and treated them the same. This caused double flattening of InstAliases. This is mostly harmless unless the desired assembly string contains curly braces. The second flattening wouldn't know to ignore these and would remove the curly braces. And for variant 1 it would remove the contents of them as well. To mitigate this, this patch makes removes the flattening from the CodeGenIntAlias constructor and modifies AsmWriterEmitter to account for the flattening not having been done. llvm-svn: 334919	2018-06-18 01:28:01 +00:00
Lang Hames	0705ee8dc2	[ORC] Remove redundant condition llvm-svn: 334918	2018-06-17 23:54:58 +00:00
Lang Hames	a5247cc5c7	[ORC] Only notify queries that they are resolved/ready when the query state changes. This guards against redundant notifications. llvm-svn: 334916	2018-06-17 18:59:01 +00:00
Craig Topper	9fe45d846e	[X86] Add all the FMA instructions direclty to the load folding table instead of proxying through X86InstrFMA3Info. These increases the size of the static tables, but is closer to what we would get if used the autogenerated table directly. This reduces the remaining large deltas between what's in the manual table and what's in the autogenerated table. llvm-svn: 334915	2018-06-17 18:00:16 +00:00
Lang Hames	cd018a4467	[ORC] Suppress an unused variable warning for a debug-mode only use. llvm-svn: 334911	2018-06-17 17:18:12 +00:00
Lang Hames	df5776b1dc	[ORC] Erase empty dependence sets when adding new symbol dependencies. llvm-svn: 334910	2018-06-17 16:59:53 +00:00
Lang Hames	11adecfb2c	[ORC] In MaterializationResponsibility, only maintain the Materializing flag on symbols in debug mode. The MaterializationResponsibility class hijacks the Materializing flag to track symbols that have not yet been resolved in order to guard against redundant resolution. Since this is an API contract check and only enforced in debug mode there is no reason to maintain the flag state in release mode. llvm-svn: 334909	2018-06-17 16:59:52 +00:00
Craig Topper	b0e986f88e	[X86] Pass the parent SDNode to X86DAGToDAGISel::selectScalarSSELoad to simplify the hasSingleUseFromRoot handling. Some of the calls to hasSingleUseFromRoot were passing the load itself. If the load's chain result has a user this would count against that. By getting the true parent of the match and ensuring any intermediate between the match and the load have a single use we can avoid this case. isLegalToFold will take care of checking users of the load's data output. This fixed at least fma-scalar-memfold.ll to succed without the peephole pass. llvm-svn: 334908	2018-06-17 16:29:46 +00:00
Simon Pilgrim	e930f569f7	[llvm-mca][X86] Add some avx512f/avx512vl resource test placeholders There are a lot of instructions to add under these ISAs (and the other AVX512 variants) but this should demonstrate how to test for the EVEX instructions with different maskings llvm-svn: 334907	2018-06-17 16:25:48 +00:00
Sander de Smalen	279b7e74e7	[AArch64][SVE] Asm: Support for bitwise operations on predicate vectors. This patch adds support for instructions performing bitwise operations on predicate vectors, including AND, BIC, EOR, NAND, NOR, ORN, ORR, and their status flag setting variants ANDS, BICS, EORS, NANDS, ORNS, ORRS. This patch also adds several aliases: orr p0.b, p1/z, p1.b, p1.b => mov p0.b, p1.b orrs p0.b, p1/z, p1.b, p1.b => movs p0.b, p1.b and p0.b, p1/z, p2.b, p2.b => mov p0.b, p1/z, p2.b ands p0.b, p1/z, p2.b, p2.b => movs p0.b, p1/z, p2.b eor p0.b, p1/z, p2.b, p1.b => not p0.b, p1/z, p2.b eors p0.b, p1/z, p2.b, p1.b => nots p0.b, p1/z, p2.b llvm-svn: 334906	2018-06-17 10:48:21 +00:00
Sander de Smalen	2c25b4cd36	[AArch64][SVE] Asm: Support for SEL (vector/predicate) instructions. Support for SVE's predicated select instructions to select elements from either vector, both in a data-vector and a predicate-vector variant. llvm-svn: 334905	2018-06-17 10:11:04 +00:00
Jonas Hahnfeld	c7410ed47a	[NVPTX] Ignore target-cpu and -features for inlining We don't want to prevent inlining because of target-cpu and -features attributes that were added to newer versions of LLVM/Clang: There are no incompatible functions in PTX, ptxas will throw errors in such cases. Differential Revision: https://reviews.llvm.org/D47691 llvm-svn: 334904	2018-06-17 09:55:20 +00:00
Heejin Ahn	9786946731	[WebAssembly] Simple comment fix. NFC. llvm-svn: 334899	2018-06-17 00:37:56 +00:00
Craig Topper	29f22d7baa	[X86] More additions to the load folding tables based on the autogenerated tables. Including more additions for NotMemoryFoldable to remove some entries from the autogenerated table. llvm-svn: 334898	2018-06-16 23:25:50 +00:00
Craig Topper	c435632862	[X86] Hide POP16/32/64rmr and PUSH16/32/64rmr instructions from the assembly parser. These all have a short form encoding that the assembler already prefers. Though that preference seems to only be based on order in the .td fie. Hiding the long form saves space in the table and prevents us from breaking the implicit order based priority. llvm-svn: 334897	2018-06-16 23:25:48 +00:00
Craig Topper	74412c7d59	[X86] Fix an inconsistency between AVX512 and AVX/SSE version on a couple instructions. VMOVPQIto64Zmr is not a 64-bit mode only instruction. But I don't know how to test this because VMOVPQIto64mr should always have priority over it in 32-bit mode since its only advantage is XMM16-XMM31 which aren't usable in 32-bit mode. VMOVPQIto64Zrr is a 64-bit mode only instruction, but we don't need to explicitly mark it as such because it uses a GR64 register which won't parse in 32-bit mode. llvm-svn: 334896	2018-06-16 23:25:47 +00:00
Michael Zolotukhin	158a7c3323	CorrelatedValuePropagation: Preserve DT. Summary: We only modify CFG in a couple of places, and we can preserve DT there with a little effort. Reviewers: davide, vsk Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D48059 llvm-svn: 334895	2018-06-16 18:57:31 +00:00
Florian Hahn	6fbad90407	[Dominators] Change getNode parameter type to const NodeT * (NFC). DominatorTreeBase::getNode does not modify its parameter and this change allows callers that only have access to const pointers to use it without casting. Reviewers: kuhar, dblaikie, chandlerc Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D48231 llvm-svn: 334892	2018-06-16 14:47:05 +00:00
Benjamin Kramer	1193bbf6b7	Fix namespaces. No functionality change. llvm-svn: 334890	2018-06-16 13:37:52 +00:00
Florian Hahn	0939fea8b4	Revert r334887, as GCC 4.8 does not have is_trivially_copy_constructible & co llvm-svn: 334889	2018-06-16 13:00:33 +00:00
Florian Hahn	9d47ce784d	[SmallSet] Avoid using is_trivially_XXX<>::value which is C++17 llvm-svn: 334888	2018-06-16 12:50:32 +00:00
Florian Hahn	18714d6a7f	[SmallSet] Add SmallSetIterator. This patch adds a simple const_iterator implementation for SmallSet by delegating to either a SmallVector::const_iterator or std::set::const_iterator, depending on which storage is used by the SmallSet. Reviewers: dblaikie, craig.topper Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D47942 llvm-svn: 334887	2018-06-16 12:36:19 +00:00
Stanislav Mekhanoshin	3b11794dbf	[AMDGPU] setcc (select cc, CT, CF), CF, eq \| ne -> xor cc, -1 \| cc This is the common case in the BE when we serialize condition and then rematerialize it. Use either original or inverted condition. Differential Revision: https://reviews.llvm.org/D48246 llvm-svn: 334882	2018-06-16 03:46:59 +00:00
Nirav Dave	d4ff2f8a74	Avoid needing to walk out legalization tables. NFCI. Relanding after fixing expensive check from modifying tables. To avoid redundant work, during DAG legalization we keep tables mapping pre-legalized SDValues to post-legalized SDValues and a SDValue-to-SDValue map to enable fast node replacements. However, as the keys are nodes which may be reused it is possible that an entry in a table refers to a now deleted node N (that should have been renamed by the value replacement map) while a new node N' exists. If N' is then replaced that entry would be wrong. Previously we avoided this by when potentially violating this property, walking every table and updating all node pointers. This is very expensive but hopefully rare occurance. This patch assigns each instance of a SDValue used in legalization a unique id and uses these ids in the legalization tables. This avoids any such aliasing issue, avoiding the full table search and allowing more aggressive incremental table pruning. In some cases this is a 1000x speedup to compilation. Reviewers: jyknight, echristo, bogner, tra Reviewed By: bogner Subscribers: dberris, grandinj, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47959 llvm-svn: 334880	2018-06-16 02:51:29 +00:00
Justin Lebar	3f5490af21	Revert "[SCEV] Use LLVM_MARK_AS_BITMASK_ENUM in SCEV." -- breaks MSVC builds. This reverts D48237. llvm-svn: 334878	2018-06-16 00:14:10 +00:00
Justin Lebar	018c3790f9	Revert "[SCEV] Simplify some flags expressions." -- dependent revision breaks MSVC builds. This reverts D48238. llvm-svn: 334877	2018-06-16 00:13:57 +00:00
Michael Berg	8e570c3390	Utilize new SDNode flag functionality to expand current support for fma Summary: This patch originated from D47388 and is a proper subset of the originating changes, containing only the fmf optimization guard extensions. Reviewers: spatel, hfinkel, wristow, arsenm, javed.absar, rampitec, nhaehnle, nemanjai Reviewed By: rampitec, nhaehnle Subscribers: tpr, nemanjai, wdng Differential Revision: https://reviews.llvm.org/D47918 llvm-svn: 334876	2018-06-16 00:03:06 +00:00
Justin Lebar	af30bb1c90	[SCEV] Simplify some flags expressions. Summary: Sending for presubmit review out of an abundance of caution; it would be bad to mess this up. Reviewers: sanjoy Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D48238 llvm-svn: 334875	2018-06-15 23:52:11 +00:00
Justin Lebar	6cb702d00d	[SCEV] Use LLVM_MARK_AS_BITMASK_ENUM in SCEV. Summary: Obviates the need for mask/clear/setFlags helpers. There are some expressions here which can be simplified, but to keep this easy to review, I have not simplified them in this patch. No functional change. Reviewers: sanjoy Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D48237 llvm-svn: 334874	2018-06-15 23:51:57 +00:00
Daniel Sanders	8ead1290e6	[globalisel][tablegen] Add support for C++ predicates on PatFrags and use it to support BFC on ARM. So far, we've only handled special cases of PatFrag like ImmLeaf. This patch adds support for the remaining cases using similar mechanisms. Like most C++ code from SelectionDAG, GISel and DAGISel expect to operate on different types and representations and as such the code is not compatible between the two. It's therefore necessary to add an alternative implementation in the GISelPredicateCode field. The target test for this feature could easily be done with IntImmLeaf and this would save on a little boilerplate. The reason I've chosen to implement this using PatFrag.GISelPredicateCode and not IntImmLeaf is because I was unable to find a rule that was blocked solely by lack of support for PatFrag predicates. I found that the ones I investigated as being likely candidates for the test were further blocked by other things. llvm-svn: 334871	2018-06-15 23:13:43 +00:00
Francis Visoiu Mistrih	dc705a6a89	Revert r334729 "[DAG] Avoid needing to walk out legalization tables. NFCI." This reverts commit r334729. llvm-svn: 334869	2018-06-15 23:05:41 +00:00
Francis Visoiu Mistrih	1c9df30eca	Revert r334731 "Avoid unused variable in non-assert builds." This reverts commit r334731. It breaks EXPENSIVE_CHECKS bots. llvm-svn: 334868	2018-06-15 23:05:40 +00:00
Craig Topper	d00e375310	[X86] Add more instructions to the hasUndefRegUpdate list. Not sure any of these matter today because I don't think we ever produce them with IMPLICIT_DEF as an input. But by listing them we don't be suprised in the future. llvm-svn: 334867	2018-06-15 22:25:04 +00:00
Benjamin Kramer	7f68a309ac	[BPI] Remove unnecessary std::list vector is sufficient here. No functionality change intended. llvm-svn: 334865	2018-06-15 21:06:43 +00:00
Cameron McInally	7caac670b2	[FPEnv] Expand constrained FP POWI Modify ExpandStrictFPOp(...) to handle nodes that have scalar operands. Also, add a Strict FMA test and do some other light cleanup in the Strict FP code. Differential Revision: https://reviews.llvm.org/D48149 llvm-svn: 334863	2018-06-15 20:57:55 +00:00
Michael Berg	02d1c6c0cf	Utilize new SDNode flag functionality to expand current support for fdiv Summary: This patch originated from D46562 and is a proper subset, with some issues addressed. Reviewers: spatel, hfinkel, wristow, arsenm Reviewed By: spatel Subscribers: wdng, nhaehnle Differential Revision: https://reviews.llvm.org/D47954 llvm-svn: 334862	2018-06-15 20:44:55 +00:00
Matt Morehouse	0ea9a90b3d	[SanitizerCoverage] Add associated metadata to pc-tables. Summary: Using associated metadata rather than llvm.used allows linkers to perform dead stripping with -fsanitize-coverage=pc-table. Unfortunately in my local tests, LLD was the only linker that made use of this metadata. Partially addresses https://bugs.llvm.org/show_bug.cgi?id=34636 and fixes https://github.com/google/sanitizers/issues/971. Reviewers: eugenis Reviewed By: eugenis Subscribers: Dor1s, hiraditya, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D48203 llvm-svn: 334858	2018-06-15 20:12:58 +00:00
Geoff Berry	82b2e7431d	Update my information in the CREDITS file. llvm-svn: 334857	2018-06-15 20:02:11 +00:00
Sean Fertile	cac28aeb3f	[PowerPC] Add support for high and higha symbol modifiers on tls modifers. Enables using the high and high-adjusted symbol modifiers on thread local storage modifers in powerpc assembly. Needed to be able to support 64 bit thread-pointer and dynamic-thread-pointer access sequences. Differential Revision: https://reviews.llvm.org/D47754 llvm-svn: 334856	2018-06-15 19:47:16 +00:00
Sean Fertile	80b8f82f17	[PPC64] Support "symbol@high" and "symbol@higha" symbol modifers. Add support for the "@high" and "@higha" symbol modifiers in powerpc64 assembly. The modifiers represent accessing the segment consiting of bits 16-31 of a 64-bit address/offset. Differential Revision: https://reviews.llvm.org/D47729 llvm-svn: 334855	2018-06-15 19:47:11 +00:00
Diego Caballero	72aed5e5dc	Move redundant-vf2-cost.ll test to X86 directory redundant-vf2-cost.ll is X86 specific. Moved from test/Transforms/LoopVectorize/redundant-vf2-cost.ll to test/Transforms/LoopVectorize/X86/redundant-vf2-cost.ll llvm-svn: 334854	2018-06-15 18:46:03 +00:00
Simon Pilgrim	f5ecd8d50d	[llvm-mca][x86] Add Generic cpu resource tests Added a Generic x86 cpu set of resource tests to allow us to check all ISAs. We currently use SandyBridge as our generic CPU model, but it's better if we actually duplicate these tests for if/when we change the model, it also means we don't end up polluting the SandyBridge folder with tests for ISAs it doesn't support. llvm-svn: 334853	2018-06-15 18:35:25 +00:00
Tomasz Krupa	bcaab53d47	[X86] Lowering sqrt intrinsics to native IR Summary: Complementary patch to lowering sqrt intrinsics in Clang. Reviewers: craig.topper, spatel, RKSimon, DavidKreitzer, uriel.k Reviewed By: craig.topper Subscribers: tkrupa, mike.dvoretsky, llvm-commits Differential Revision: https://reviews.llvm.org/D41599 llvm-svn: 334849	2018-06-15 18:05:24 +00:00
Craig Topper	1657b7b8d2	[X86] Prevent folding stack reloads into instructions in hasUndefRegUpdate. An earlier commit prevented folds from the peephole pass by checking for IMPLICIT_DEF. But later in the pipeline IMPLICIT_DEF just becomes and Undef flag on the input register so we need to check for that case too. llvm-svn: 334848	2018-06-15 17:56:17 +00:00
Krzysztof Parzyszek	1a70426ac1	Remove <undef> from rematerialized full register When coalescing a small register into a subregister of a larger register, if the larger register is rematerialized, the function updateRegDefUses can add an <undef> flag to the rematerialized definition (since it's treating it as only definining the coalesced subregister). While with that assumption doing so is not incorrect, make sure to remove the flag later on after the call to updateRegDefUses. llvm-svn: 334845	2018-06-15 16:58:22 +00:00
Joseph Tremoulet	6f406d4f02	[InstCombine] Avoid iteration/mutation conflict Summary: When iterating users of a multiply in processUMulZExtIdiom, the call to setOperand in the truncation case may replace the use being visited; make sure the iterator has been advanced before doing that replacement. Reviewers: majnemer, davide Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48192 llvm-svn: 334844	2018-06-15 16:52:40 +00:00
Sander de Smalen	a6edca72ba	[AArch64][SVE] Asm: Support for CPY SIMD/FP and GPR instructions. Predicated splat/copy of SIMD/FP register or general purpose register to SVE vector, along with MOV-aliases. llvm-svn: 334842	2018-06-15 16:39:46 +00:00
Jordan Rose	7e535bc4bf	Avoid copying PrettyStackTrace messages an extra time on Apple OSs We were unnecessarily going from SmallString to std::string just to get a null-terminated C string. So just...don't do that. Crash slightly faster! llvm-svn: 334841	2018-06-15 16:35:31 +00:00
Diego Caballero	68795245cf	[LV] Prevent LV to run cost model twice for VF=2 This is a minor fix for LV cost model, where the cost for VF=2 was computed twice when the vectorization of the loop was forced without specifying a VF. Reviewers: xusx595, hsaito, fhahn, mkuper Reviewed By: hsaito, xusx595 Differential Revision: https://reviews.llvm.org/D48048 llvm-svn: 334840	2018-06-15 16:21:35 +00:00
Sander de Smalen	18ac8f9f25	[AArch64][SVE] Asm: Support for INC/DEC (scalar) instructions. Increment/decrement scalar register by (scaled) element count given by predicate pattern, e.g. 'incw x0, all, mul #4'. Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47713 llvm-svn: 334838	2018-06-15 15:47:44 +00:00
Matt Arsenault	63bc0e3cb9	AMDGPU: Add combine for short vector extract_vector_elts Try to access pieces 4 bytes at a time. This helps various hasOneUse extract_vector_elt combines, such as load width reductions. Avoids test regressions in a future commit. llvm-svn: 334836	2018-06-15 15:31:36 +00:00
Matt Arsenault	02dc7e19e2	AMDGPU: Make v4i16/v4f16 legal Some image loads return these, and it's awkward working around them not being legal. llvm-svn: 334835	2018-06-15 15:15:46 +00:00
Paul Semel	fa5597b24d	[llvm-readobj] Add -string-dump (-p) option This option prints the section content as a string. Differential Revision: https://reviews.llvm.org/D47989 llvm-svn: 334834	2018-06-15 14:15:02 +00:00
Roman Lebedev	9ddf128f79	[MCA] Add -summary-view option Summary: While that is indeed a quite interesting summary stat, there are cases where it does not really add anything other than consuming extra lines. Declutters the output of D48190. Reviewers: RKSimon, andreadb, courbet, craig.topper Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48209 llvm-svn: 334833	2018-06-15 14:01:43 +00:00
Roman Lebedev	7c423001e4	[MCA][x86][NFC] Add tests for -register-file-stats, -scheduler-stats Summary: There does not seem to be any other tests for this. Split off from D47676. Reviewers: RKSimon, craig.topper, courbet, andreadb Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48190 llvm-svn: 334832	2018-06-15 14:01:35 +00:00
Sander de Smalen	5eb51d7495	[AArch64][SVE] Asm: Support for FADD, FMUL and FMAX immediate instructions. Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: javed.absar Differential Revision: https://reviews.llvm.org/D47712 llvm-svn: 334831	2018-06-15 13:57:51 +00:00
Bjorn Pettersson	428caf988b	Re-apply "[DebugInfo] Check size of variable in ConvertDebugDeclareToDebugValue" This is r334704 (which was reverted in r334732) with a fix for types like x86_fp80. We need to use getTypeAllocSizeInBits and not getTypeStoreSizeInBits to avoid dropping debug info for such types. Original commit msg: > Summary: > Do not convert a DbgDeclare to DbgValue if the store > instruction only refer to a fragment of the variable > described by the DbgDeclare. > > Problem was seen when for example having an alloca for an > array or struct, and there were stores to individual elements. > In the past we inserted a DbgValue intrinsics for each store, > just as if the store wrote the whole variable. > > When handling store instructions we insert a DbgValue that > indicates that the variable is "undefined", as we do not know > which part of the variable that is updated by the store. > > When ConvertDebugDeclareToDebugValue is used with a load/phi > instruction we assert that the referenced value is large enough > to cover the whole variable. Afaict this should be true for all > scenarios where those methods are used on trunk. If the assert > blows in the future I guess we could simply skip to insert a > dbg.value instruction. > > In the future I think we should examine which part of the variable > that is accessed, and add a DbgValue instrinsic with an appropriate > DW_OP_LLVM_fragment expression. > > Reviewers: dblaikie, aprantl, rnk > > Reviewed By: aprantl > > Subscribers: JDevlieghere, llvm-commits > > Tags: #debug-info > > Differential Revision: https://reviews.llvm.org/D48024 llvm-svn: 334830	2018-06-15 13:48:55 +00:00
Simon Dardis	98b9849d34	[mips] Add licensing information of the microMIPS tablegen files. (NFC) llvm-svn: 334827	2018-06-15 13:29:35 +00:00
Sander de Smalen	3cbf171479	[AArch64][SVE] Asm: Add parsing/printing support for exact FP immediates. Some instructions require of a limited set of FP immediates as operands, for example '#0.5 or #1.0' for SVE's FADD instruction. This patch adds support for parsing and printing such FP immediates as exact values (e.g. #0.499999 is not accepted for #0.5). Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47711 llvm-svn: 334826	2018-06-15 13:11:49 +00:00
Roman Lebedev	1ef9b2a102	[NFC] chmod +x utils/update_analyze_test_checks.py Looks like a simple oversight. llvm-svn: 334825	2018-06-15 12:41:50 +00:00
Matt Arsenault	df2f4ef29d	DAG: Fix creating concat_vectors with illegal type Test passes as is, but fails with future patch to make v4i16/v4f16 legal. llvm-svn: 334823	2018-06-15 12:09:15 +00:00
Simon Pilgrim	180497ea11	[SLP][X86] Add AVX2 run to POW2 SDIV Tests Non-uniform pow2 tests are only make sense on targets with fast (low cost) non-uniform shifts llvm-svn: 334821	2018-06-15 10:29:37 +00:00
Simon Pilgrim	ca6215f8c8	[SLP][X86] Regenerate POW2 SDIV Tests Added non-uniform pow2 test as well llvm-svn: 334819	2018-06-15 10:07:03 +00:00
Roman Lebedev	84c11aed10	[InstCombine] Recommit: Fold (x << y) >> y -> x & (-1 >> y) Summary: We already do it for splat constants, but not just values. Also, undef cases are mostly non-functional. The original commit was reverted because it broke tests for amdgpu backend, which i didn't check. Now, the backed was updated to recognize these new patterns, so we are good. https://bugs.llvm.org/show_bug.cgi?id=37603 https://rise4fun.com/Alive/cplX Reviewers: spatel, craig.topper, mareko, bogner, rampitec, nhaehnle, arsenm Reviewed By: spatel, rampitec, nhaehnle Subscribers: wdng, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D47980 llvm-svn: 334818	2018-06-15 09:56:52 +00:00
Roman Lebedev	dec562c849	[AMDGPU] Recognize x & ~(-1 << y) pattern. Summary: The same pattern as D48010, but this one is IR-canonical as of D47428. Reviewers: nhaehnle, bogner, tstellar, arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #amdgpu Differential Revision: https://reviews.llvm.org/D48012 llvm-svn: 334817	2018-06-15 09:56:45 +00:00
Roman Lebedev	9c17dad8f2	[AMDGPU] Recognize x & ((1 << y) - 1) pattern. Summary: As a followup for D48007. Since we already handle `x << (bitwidth - y) >> (bitwidth - y)` pattern, which does not have ub for both the edge cases (`y == 0`, `y == bitwidth`), i think also handling a pattern that is ub for `y == bitwidth` should be fine. Reviewers: nhaehnle, bogner, tstellar, arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #amdgpu Differential Revision: https://reviews.llvm.org/D48010 llvm-svn: 334816	2018-06-15 09:56:39 +00:00
Roman Lebedev	aa8587d1fc	[AMDGPU] Recognize x & (-1 >> (32 - y)) pattern. Summary: D47980 will canonicalize the `x << (32 - y) >> (32 - y)`, which is the pattern the AMDGPU expects to `x & (-1 >> (32 - y))`, which is not recognized by AMDGPU. Thus, it needs to be recognized, too. Reviewers: nhaehnle, bogner, tstellar, arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #amdgpu Differential Revision: https://reviews.llvm.org/D48007 llvm-svn: 334815	2018-06-15 09:56:31 +00:00
Peter Smith	1503fc0fd0	[MC] Move bundling and MCSubtargetInfo to MCEncodedFragment [NFC] Instruction bundling is only supported on descendants of the MCEncodedFragment type. By moving the bundling functionality and MCSubtargetInfo to this class it makes it easier to set and extract the MCSubtargetInfo when it is necessary. This is a refactoring change that will make it easier to pass the MCSubtargetInfo through to writeNops when nop padding is required. Differential Revision: https://reviews.llvm.org/D45959 llvm-svn: 334814	2018-06-15 09:48:18 +00:00
Clement Courbet	205276bf37	[llvm-exegesis][NFC] Remove dead variable. llvm-svn: 334813	2018-06-15 09:46:57 +00:00
Clement Courbet	f64007fe82	[llvm-exegesis][NFC] Add more comments. llvm-svn: 334811	2018-06-15 09:27:12 +00:00
QingShan Zhang	0651eb1b31	add myself to the CREDITS.TXT llvm-svn: 334808	2018-06-15 08:34:41 +00:00
Mikhail Dvoretckii	0531ec654a	NFC: Regenerating x86-sse41.ll test for InstCombine Test regenerated to reduce noise in further patches. llvm-svn: 334806	2018-06-15 07:59:29 +00:00
Clement Courbet	4273e1e828	[llvm-exegesis] Print the whole snippet in analysis. Summary: On hover, the whole asm snippet is displayed, including operands. This requires the actual assembly output instead of just the MCInsts: This is because some pseudo-instructions get lowered to actual target instructions during codegen (e.g. ABS_Fp32 -> SSE or X87). Reviewers: gchatelet Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D48164 llvm-svn: 334805	2018-06-15 07:30:45 +00:00
Craig Topper	c8a763ed84	Revert r334802 "[X86] Prevent folding stack reloads with instructions that have an undefined register update." There's a typo causing the build to fail. llvm-svn: 334803	2018-06-15 06:15:26 +00:00
Craig Topper	5ec210cc27	[X86] Prevent folding stack reloads with instructions that have an undefined register update. We want to keep the load unfolded so we can use the same register for both sources to avoid a false dependency. llvm-svn: 334802	2018-06-15 06:11:36 +00:00
Craig Topper	3c4cc01226	[X86] Add more instructions to the memory folding tables using the autogenerated table as a guide. I think this covers most of the unmasked vector instructions. We're still missing a lot of the masked instructions. There are some test changes here because of the new folding support. I don't think these particular cases should be folded because it creates an undef register dependency. I think the changes introduced in r334175 are not handling stack folding. They're only blocking the peephole pass. llvm-svn: 334800	2018-06-15 05:49:19 +00:00
Hiroshi Inoue	c36a1f1cb7	[NFC] fix trivial typos in documents llvm-svn: 334799	2018-06-15 05:10:09 +00:00
Craig Topper	3b060daba5	[X86] Fix some checks to use X86 instead of X32. These tests were recently updated so it looks like gone wrong. llvm-svn: 334786	2018-06-15 04:42:55 +00:00
Craig Topper	f43807dd89	[X86] Add 'Z' to the internal names of various EVEX instructions for overall consistency. llvm-svn: 334785	2018-06-15 04:42:54 +00:00
Andrew Kaylor	36bb0ad078	Add debug info for OProfile profiling support Patch by Gaetano Priori Differential Revision: https://reviews.llvm.org/D47925 llvm-svn: 334782	2018-06-15 00:07:28 +00:00
Shoaib Meenai	d65dba56a6	[cmake] Change ON/OFF to YES/NO. NFC compnerd pointed out that the latter reads better over here. llvm-svn: 334781	2018-06-14 23:40:04 +00:00
Shoaib Meenai	fce4616189	[cmake] Add linker detection for Apple platforms LLVM currently assumes that Apple platforms will always use ld64. In the future, LLD Mach-O might also be supported, so add the beginnings of linker detection support. ld64 is currently the only detected linker, since `ld64.lld -v` doesn't yield any useful version output, but we can add that detection later, and in the meantime it's still useful to have the ld64 identification. Switch clang's order file check to use this new detection rather than just checking for the presence of an ld64 executable. Differential Revision: https://reviews.llvm.org/D48201 llvm-svn: 334780	2018-06-14 23:26:33 +00:00
Eli Friedman	3f1ce093ea	Make uitofp and sitofp defined on overflow. IEEE 754 defines the expected result on overflow. As far as I know, hardware implementations (of f16), and compiler-rt (__floatuntisf) correctly return +-Inf on overflow. And I can't think of any useful transform that would take advantage of overflow being undefined here. Differential Revision: https://reviews.llvm.org/D47807 llvm-svn: 334777	2018-06-14 22:58:48 +00:00
Lang Hames	5d6c509944	[ORC] Strip weak flags from a symbol once it is selected for materialization. Once a symbol has been selected for materialization it can no longer be overridden. Stripping the weak flag guarantees this (override attempts will then be treated as duplicate definitions and result in a DuplicateDefinition error). llvm-svn: 334771	2018-06-14 21:16:29 +00:00
Matt Davis	248acf6b57	[llvm-mca] Clean up the header comment. NFC. This change removes a few dashes to make room for the header syntax string. llvm-svn: 334770	2018-06-14 20:58:54 +00:00
Michael Berg	0c20447a02	easing the constraint for isNegatibleForFree and GetNegatedExpression Summary: Here we relax the old constraint which utilized unsafe with the TargetOption flag HonorSignDependentRoundingFPMathOption, with the assertion that unsafe is no longer needed or never was required for correctness on FDIV/FMUL. Reviewers: spatel, hfinkel, wristow, arsenm, javed.absar Reviewed By: spatel Subscribers: efriedma, wdng, tpr Differential Revision: https://reviews.llvm.org/D48057 llvm-svn: 334769	2018-06-14 20:54:13 +00:00
Florian Hahn	6b1db82acf	Revert r334764, as it breaks some bots llvm-svn: 334767	2018-06-14 20:32:58 +00:00
Florian Hahn	1b465767d6	[TableGen] Make TreePatternNode::getChild return a reference (NFC) The return value of TreePatternNode::getChild is never null. This patch also updates various places that use return values of getChild to also use references. Those changes were suggested post-commit for D47463. llvm-svn: 334764	2018-06-14 20:23:48 +00:00
George Burgess IV	aa283d80fe	[MSSA] Print more optimization information In particular, when asked to print a MemoryAccess, we'll now print where defs are optimized to, and we'll print optimized access types. This patch also introduces an operator<< to make printing AliasResults easier. Patch by Juneyoung Lee! Differential Revision: https://reviews.llvm.org/D47860 llvm-svn: 334760	2018-06-14 19:55:53 +00:00
Sanjay Patel	f85ca6abee	[x86] be more selective about converting 'and' to shuffle (PR37749) isVectorClearMaskLegal() is the TLI hook used by the generic DAGCombiner::XformToShuffleWithZero(). We've grown to accomodate/expect this transform to shuffle (disabling it more generally results in many regressions). So I'm narrowly excluding the 256-bit types that clearly are not worthwhile for AVX1. I think in most cases we are able to recover by converting the shuffle back into 'and' ops, but the cases in: https://bugs.llvm.org/show_bug.cgi?id=37749 ...show that there are cracks. llvm-svn: 334759	2018-06-14 19:55:02 +00:00
Craig Topper	bfa94d5086	[X86] Fix stale comment in folding tables. llvm-svn: 334758	2018-06-14 19:28:31 +00:00
Tom Stellard	a92847359a	AMDGPU/GlobalISel: Implement select() for @llvm.amdgcn.cvt.pkrtz Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45907 llvm-svn: 334757	2018-06-14 19:26:37 +00:00
Justin Bogner	3b83edb037	Re-apply "[VirtRegRewriter] Avoid clobbering registers when expanding copy bundles" This is r334750 (which was reverted in r334754) with a fix for an uninitialized variable that was caught by msan. Original commit message: > If a copy bundle happens to involve overlapping registers, we can end > up with emitting the copies in an order that ends up clobbering some > of the subregisters. Since instructions in the copy bundle > semantically happen at the same time, this is incorrect and we need to > make sure we order the copies such that this doesn't happen. llvm-svn: 334756	2018-06-14 19:24:03 +00:00
Justin Bogner	36c7f40f20	Revert "[VirtRegRewriter] Avoid clobbering registers when expanding copy bundles" There's an msan failure: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/19549 This reverts r334750. llvm-svn: 334754	2018-06-14 19:10:57 +00:00
Michael Berg	4663ceb63f	updating isNegatibleForFree and GetNegatedExpression with fmf for fadd Summary: A FMF constraint is added to FADD with unsafe still available as the fallback Reviewers: spatel, wristow, arsenm, hfinkel Reviewed By: spatel Subscribers: wdng Differential Revision: https://reviews.llvm.org/D48180 llvm-svn: 334753	2018-06-14 18:48:31 +00:00
Sam Clegg	277f898a4d	[WebAssembly] Ignore explicit section names for functions WebAssembly doesn't support more than one function per section and we rely on function sections being unique. This change ignores the section provided by the function to avoid two functions being in the same section. Without this change the object writer produces the following error for this test: LLVM ERROR: section already has a defining function: baz Differential Revision: https://reviews.llvm.org/D48178 llvm-svn: 334752	2018-06-14 18:48:19 +00:00
Justin Bogner	866d9f02be	[VirtRegRewriter] Avoid clobbering registers when expanding copy bundles If a copy bundle happens to involve overlapping registers, we can end up with emitting the copies in an order that ends up clobbering some of the subregisters. Since instructions in the copy bundle semantically happen at the same time, this is incorrect and we need to make sure we order the copies such that this doesn't happen. Differential Revision: https://reviews.llvm.org/D48154 llvm-svn: 334750	2018-06-14 18:32:55 +00:00
Bruno Cardoso Lopes	7e8508822f	[CMAKE] Honor CMAKE_OSX_SYSROOT to compute include dir for libxml2 On MacOS, if CMAKE_OSX_SYSROOT is used and the user has command line tools installed, we currently get the include path for libxml2 as /usr/include/libxml2, instead of ${CMAKE_OSX_SYSROOT}/usr/include/libxml2. Make it consistent on MacOS by prefixing ${CMAKE_OSX_SYSROOT} when possible. rdar://problem/41103601 llvm-svn: 334746	2018-06-14 18:19:54 +00:00
Sanjay Patel	d49219db84	[x86] add tests for AVX1 FP logic op abuse (PR37749); NFC Also, add a RUN for AVX2 to make sure that's good. llvm-svn: 334744	2018-06-14 18:08:06 +00:00
Andrea Di Biagio	4cafb297d5	[llvm-mca] Add tests for instructions that implicitly clear the upper portion of a super-register. On x86-64, a write to register EAX implicitly clears the upper half or RAX. 128-bit AVX instructions clear the upper 128-bit of the YMM register that aliases the XMM definition register. llvm-mca doesn't know about register writes that implicitly clear the upper portion of an aliasing super-register. This issue will be fixed in a future patch. llvm-svn: 334742	2018-06-14 17:48:42 +00:00
Tomasz Krupa	d8d66a6b28	[X86] Lowering Mask Scalar intrinsics to native IR (LLVM part) Summary: Complementary patch to lowering add, sub, mul and div mask scalar intrinsics in Clang. Reviewers: craig.topper, sroland, spatel, RKSimon Reviewed by: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47978 llvm-svn: 334740	2018-06-14 17:32:58 +00:00
Justin Lebar	bdb0a58c91	[SCEV] Fix a variable name, NFC. llvm-svn: 334738	2018-06-14 17:14:01 +00:00
Justin Lebar	fe455464eb	[SCEV] Simplify zext/trunc idiom that appears when handling bitmasks. Summary: Specifically, we transform zext(2^K * (trunc X to iN)) to iM -> 2^K * (zext(trunc X to i{N-K}) to iM)<nuw> This is helpful because pulling the 2^K out of the zext allows further optimizations. Reviewers: sanjoy Subscribers: hiraditya, llvm-commits, timshen Differential Revision: https://reviews.llvm.org/D48158 llvm-svn: 334737	2018-06-14 17:13:48 +00:00
Justin Lebar	b326904dba	[SCEV] Simplify trunc-of-add/mul to add/mul-of-trunc under more circumstances. Summary: Previously we would do this simplification only if it did not introduce any new truncs (excepting new truncs which replace other cast ops). This change weakens this condition: If the number of truncs stays the same, but we're able to transform trunc(X + Y) to X + trunc(Y), that's still simpler, and it may open up additional transformations. While we're here, also clean up some duplicated code. Reviewers: sanjoy Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D48160 llvm-svn: 334736	2018-06-14 17:13:35 +00:00
Justin Lebar	62a0747926	[SCEV] Fix indentation and combine two if statements in getMulExpr, NFC. llvm-svn: 334735	2018-06-14 17:13:22 +00:00
Sam Clegg	c0dba0af01	Revert "[MC] Factor MCObjectStreamer::addFragmentAtoms out of MachO streamer." This reverts rL331412. We didn't up using fragment atoms in the wasm object writer after all. Differential Revision: https://reviews.llvm.org/D48173 llvm-svn: 334734	2018-06-14 17:11:19 +00:00
Tony Tye	e2f3e10913	[AMDGPU] Document the AMDGPU LLVM attributes Differential Revision: https://reviews.llvm.org/D48101 llvm-svn: 334733	2018-06-14 16:40:10 +00:00
Bjorn Pettersson	972fd1c9e7	Revert rL334704: "[DebugInfo] Check size of variable in ConvertDebugDeclareToDebugValue" This reverts commit r334704. Buildbots detected an assertion in "test tsan in debug compiler-rt build". llvm-svn: 334732	2018-06-14 16:08:22 +00:00
Nirav Dave	41e69a8e8b	Avoid unused variable in non-assert builds. llvm-svn: 334731	2018-06-14 15:55:15 +00:00
Andrea Di Biagio	4729d1ff27	[llvm-mca] Add another test for partial register stalls. This test checks that a physical register is correctly allocated for the partial write to register BX. The ADD instruction has to wait for the write to RBX (and BX) before being executed. llvm-svn: 334730	2018-06-14 15:54:34 +00:00
Nirav Dave	a1ee983a95	[DAG] Avoid needing to walk out legalization tables. NFCI. To avoid redundant work, during DAG legalization we keep tables mapping pre-legalized SDValues to post-legalized SDValues and a SDValue-to-SDValue map to enable fast node replacements. However, as the keys are nodes which may be reused it is possible that an entry in a table refers to a now deleted node N (that should have been renamed by the value replacement map) while a new node N' exists. If N' is then replaced that entry would be wrong. Previously we avoided this by when potentially violating this property, walking every table and updating all node pointers. This is very expensive but hopefully rare occurance. This patch assigns each instance of a SDValue used in legalization a unique id and uses these ids in the legalization tables. This avoids any such aliasing issue, avoiding the full table search and allowing more aggressive incremental table pruning. In some cases this is a 1000x speedup to compilation. Reviewers: jyknight, echristo, bogner, tra Reviewed By: bogner Subscribers: dberris, grandinj, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47959 llvm-svn: 334729	2018-06-14 15:46:23 +00:00
Craig Topper	3ffeb41f6b	[X86] Add more vector instructions to the memory folding table using the autogenerated table as a guide. The test cahnge is because we now fold stack reload into RNDSCALE and RNDSCALE can be turned into ROUND by EVEX->VEX. llvm-svn: 334728	2018-06-14 15:40:31 +00:00
Craig Topper	82fa048371	[X86] Remove '128' from the internal name of some scalar FP instructions to be consistent with other scalar instructions. llvm-svn: 334727	2018-06-14 15:40:30 +00:00
Craig Topper	b0742bf30d	[X86] Disable load unfolding for a bunch of instruction where unfolding would increase the size of the load. Found by an audit of the manual table vs the autogenerated table. llvm-svn: 334726	2018-06-14 15:40:29 +00:00
Craig Topper	9f829f76e8	[X86] Remove NotMemoryFoldable from some AVX/AVX512 scalar instructions. Some of these instructions are already in the manual folding table so we should have them in the auto table too. llvm-svn: 334725	2018-06-14 15:40:27 +00:00
Lang Hames	b7788ebb4a	[ORC] Filter out self-dependencies in VSO::addDependencies. llvm-svn: 334724	2018-06-14 15:32:59 +00:00
Lang Hames	bd49fb83aa	[ORC] Assert that the query argument to VSO::lookup must be non-null. llvm-svn: 334723	2018-06-14 15:32:59 +00:00
Lang Hames	784fecfe71	[ORC] Add a WaitUntilReady argument to blockingLookup. If WaitUntilReady is set to true then blockingLookup will return once all requested symbols are ready. If WaitUntilReady is set to false then blockingLookup will return as soon as all requested symbols have been resolved. In the latter case, if any error occurs in finalizing the symbols it will be reported to the ExecutionSession, rather than returned by blockingLookup. llvm-svn: 334722	2018-06-14 15:32:58 +00:00
Lang Hames	03395d2e58	[ORC] Strip the Materializing flag off finalized symbols in VSOs. Finalized symbols are no longer in the materializing state. llvm-svn: 334721	2018-06-14 15:32:56 +00:00
Simon Dardis	b4a43d6610	[docs] Update CompilerWriterInfo.rst for MIPS Update the URL of where the documentation can be found. llvm-svn: 334720	2018-06-14 15:16:37 +00:00
Simon Pilgrim	dee9c67f24	[EarlyCSE] Fix MSVC build. NFCI. MSVC doesn't let you assign different lambdas through a ternary operator. llvm-svn: 334715	2018-06-14 14:22:03 +00:00
Simon Pilgrim	607a1e2196	[CostModel][AArch64] Add cost tests for ALTERNATE/SELECT style shuffle masks Precursor to fixing a regression with SLP vectorizer for supporting SELECT shuffles (vs the current ALTERNATE) llvm-svn: 334714	2018-06-14 14:20:20 +00:00
Sam Clegg	8c32e913b5	[MC] Move MCAssembler::dump into the correct cpp file. NFC Differential Revision: https://reviews.llvm.org/D46556 llvm-svn: 334713	2018-06-14 14:04:23 +00:00
Paul Robinson	cc7344aae3	[DWARFv5] Tolerate files not all having an MD5 checksum. In some cases, for example when compiling a preprocessed file, the front-end is not able to provide an MD5 checksum for all files. When that happens, omit the MD5 checksums from the final DWARF, because DWARF doesn't have a way to indicate that some but not all files have a checksum. When assembling a .s file, and some but not all .file directives provide an MD5 checksum, issue a warning and don't emit MD5 into the DWARF. Fixes PR37623. Differential Revision: https://reviews.llvm.org/D48135 llvm-svn: 334710	2018-06-14 13:38:20 +00:00
Simon Dardis	6ad680ab6a	[mips] Correct predicates for MSA pseudo instructions llvm-svn: 334708	2018-06-14 13:03:53 +00:00
Max Kazantsev	ff6d1c9188	[EarlyCSE] Propagate conditions of AND and OR instructions This patches teaches EarlyCSE to figure out that if `and i1 %x, %y` is true then both `%x` and `%y` are true in the taken branch, and if `or i1 %x, %y` is false then both `%x` and `%y` are false in non-taken branch. Fix for PR37635. Differential Revision: https://reviews.llvm.org/D47574 Reviewed By: reames llvm-svn: 334707	2018-06-14 13:02:13 +00:00
Florian Hahn	0a2e0b6b0e	[TableGen] Move some shared_ptrs to avoid unnecessary copies (NFC). Those changes were suggested post-commit for D47463. llvm-svn: 334706	2018-06-14 11:56:19 +00:00
Bjorn Pettersson	e406b29c22	[DebugInfo] Check size of variable in ConvertDebugDeclareToDebugValue Summary: Do not convert a DbgDeclare to DbgValue if the store instruction only refer to a fragment of the variable described by the DbgDeclare. Problem was seen when for example having an alloca for an array or struct, and there were stores to individual elements. In the past we inserted a DbgValue intrinsics for each store, just as if the store wrote the whole variable. When handling store instructions we insert a DbgValue that indicates that the variable is "undefined", as we do not know which part of the variable that is updated by the store. When ConvertDebugDeclareToDebugValue is used with a load/phi instruction we assert that the referenced value is large enough to cover the whole variable. Afaict this should be true for all scenarios where those methods are used on trunk. If the assert blows in the future I guess we could simply skip to insert a dbg.value instruction. In the future I think we should examine which part of the variable that is accessed, and add a DbgValue instrinsic with an appropriate DW_OP_LLVM_fragment expression. Reviewers: dblaikie, aprantl, rnk Reviewed By: aprantl Subscribers: JDevlieghere, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D48024 llvm-svn: 334704	2018-06-14 11:23:42 +00:00
Simon Pilgrim	b234ff136e	[SLPVectorizer] Remove RawInstructionsData/getMainOpcode and merge into getSameOpcode This is part of the work to cleanup use of 'alternate' ops so we can use the more general SK_Select shuffle type. Only getSameOpcode calls getMainOpcode and much of the logic is repeated in both functions. This will require some reworking of D28907 but that patch has hit trouble and is unlikely to be completed anytime soon. Differential Revision: https://reviews.llvm.org/D48120 llvm-svn: 334701	2018-06-14 10:25:19 +00:00
Simon Pilgrim	c0d53aba7b	[CostModel] Cleanup isSingleSourceVectorMask to match other shuffle matchers. NFCI. llvm-svn: 334699	2018-06-14 09:48:19 +00:00
Simon Pilgrim	32702cc86a	[CostModel] Recognise REVERSE shuffle mask if the elements come from the second src llvm-svn: 334698	2018-06-14 09:35:00 +00:00
Clement Courbet	49fad1cbf2	[llvm-exegesis] Use BenchmarkResult::Instructions instead of OpcodeName Summary: Get rid of OpcodeName. To remove the opcode name from an old file: ``` cat old_file \| sed '/opcode_name.*/d' ``` Reviewers: gchatelet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D48121 llvm-svn: 334691	2018-06-14 06:57:52 +00:00
Hiroshi Inoue	f209649dfc	[NFC] fix trivial typos in comments llvm-svn: 334687	2018-06-14 05:41:49 +00:00
Craig Topper	b2552e1e08	[x86] fix mappings of cvttp2si/cvttp2ui x86 intrinsics to x86-specific nodes and isel patterns (PR37551) Summary: The tests in: https://bugs.llvm.org/show_bug.cgi?id=37751 ...show miscompiles because we wrongly mapped and folded x86-specific intrinsics into generic DAG nodes. This patch corrects the mappings in X86IntrinsicsInfo.h and adds isel matching corresponding to the new patterns. The complete tests for the failure cases should be in avx-cvttp2si.ll and sse-cvttp2si.ll and avx512-cvttp2i.ll Reviewers: RKSimon, gbedwell, spatel Reviewed By: spatel Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D47993 llvm-svn: 334685	2018-06-14 03:16:58 +00:00
Matt Davis	488ac4cb39	[llvm-mca] Introduce the ExecuteStage (was originally the Scheduler class). Summary: This patch transforms the Scheduler class into the ExecuteStage. Most of the logic remains. Reviewers: andreadb, RKSimon, courbet Reviewed By: andreadb Subscribers: mgorny, javed.absar, tschuett, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47246 llvm-svn: 334679	2018-06-14 01:20:18 +00:00
Tom Stellard	46bbbc33c0	AMDGPU/GlobalISel: Implement select() for 32-bit G_FADD and G_FMUL Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46171 llvm-svn: 334665	2018-06-13 22:30:47 +00:00
Zachary Turner	9b8b0794b8	Revert "Enable ThreadPool to queue tasks that return values." This is failing to compile when LLVM_ENABLE_THREADS is false, and the fix is not immediately obvious, so reverting while I look into it. llvm-svn: 334658	2018-06-13 21:24:19 +00:00
Francis Visoiu Mistrih	03185797d7	Reland: [Timers] Use the pass argument name for JSON keys in time-passes When using clang --save-stats -mllvm -time-passes, both timers and stats end up in the same json file. We could end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.Virtual Register Map.wall": 2.9015541076660156e-04, "time.pass.Virtual Register Map.user": 2.0500000000000379e-04, "time.pass.Virtual Register Map.sys": 8.5000000000001741e-05, } This patch makes use of the pass argument name (if available) in the JSON key to end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.virtregmap.wall": 2.9015541076660156e-04, "time.pass.virtregmap.user": 2.0500000000000379e-04, "time.pass.virtregmap.sys": 8.5000000000001741e-05, } This also helps avoiding to write another JSON printer to handle all the cases that we could have in our pass names. Fixed test instead of adding a new one originally from r334649. Differential Revision: https://reviews.llvm.org/D48109 llvm-svn: 334657	2018-06-13 21:03:56 +00:00
Florian Hahn	4dd569c7cc	[TableGen] Make getOnlyTree return a const ref (NFC) This avoids some unnecessary copies of shared_ptrs. Those changes were suggested post-commit for D47463. llvm-svn: 334656	2018-06-13 20:59:53 +00:00
George Karpenkov	9218a37a65	Update comments of CheckedArithmetic API based on Philip Reames feedback. llvm-svn: 334655	2018-06-13 20:48:53 +00:00
Reid Kleckner	12395b7795	[WinASan] Don't instrument globals in sections containing '$' Such globals are very likely to be part of a sorted section array, such the .CRT sections used for dynamic initialization. The uses its own sorted sections called ATL$__a, ATL$__m, and ATL$__z. Instead of special casing them, just look for the dollar sign, which is what invokes linker section sorting for COFF. Avoids issues with ASan and the ATL uncovered after we started instrumenting comdat globals on COFF. llvm-svn: 334653	2018-06-13 20:47:21 +00:00
Francis Visoiu Mistrih	0c3a7761f3	Revert r334649 "[Timers] Use the pass argument name for JSON keys in time-passes" This reverts commit r334649. This breaks a test. llvm-svn: 334651	2018-06-13 20:44:02 +00:00
Francis Visoiu Mistrih	fbd450b052	[Timers] Use the pass argument name for JSON keys in time-passes When using clang --save-stats -mllvm -time-passes, both timers and stats end up in the same json file. We could end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.Virtual Register Map.wall": 2.9015541076660156e-04, "time.pass.Virtual Register Map.user": 2.0500000000000379e-04, "time.pass.Virtual Register Map.sys": 8.5000000000001741e-05, } This patch makes use of the pass argument name (if available) in the JSON key to end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.virtregmap.wall": 2.9015541076660156e-04, "time.pass.virtregmap.user": 2.0500000000000379e-04, "time.pass.virtregmap.sys": 8.5000000000001741e-05, } This also helps avoiding to write another JSON printer to handle all the cases that we could have in our pass names. Differential Revision: https://reviews.llvm.org/D48109 llvm-svn: 334649	2018-06-13 20:09:59 +00:00
Craig Topper	f7f663e0a9	[X86] Move RCPSSr_Int, RSQRTSSr_Int, SQRTSDr_Int, SQRTSSr_Int to the correct load folding table. They were in the operand 1 folding table, but their foldable operand is operand 2. llvm-svn: 334648	2018-06-13 20:03:42 +00:00
Zachary Turner	18fc6dc054	Add missing #include. llvm-svn: 334644	2018-06-13 19:37:41 +00:00
Zachary Turner	1b76a128a8	Enable ThreadPool to support tasks that return values. Previously ThreadPool could only queue async "jobs", i.e. work that was done for its side effects and not for its result. It's useful occasionally to queue async work that returns a value. From an API perspective, this is very intuitive. The previous API just returned a shared_future<void>, so all we need to do is make it return a shared_future<T>, where T is the type of value that the operation returns. Making this work required a little magic, but ultimately it's not too bad. Instead of keeping a shared queue<packaged_task<void()>> we just keep a shared queue<unique_ptr<TaskBase>>, where TaskBase is a class with a pure virtual execute() method, then have a templated derived class that stores a packaged_task<T()>. Everything else works out pretty cleanly. Differential Revision: https://reviews.llvm.org/D48115 llvm-svn: 334643	2018-06-13 19:29:16 +00:00
Stanislav Mekhanoshin	7bec57300c	[AMDGPU] Corrected computeKnownBits for V_PERM_B32 Differential Revision: https://reviews.llvm.org/D48133 llvm-svn: 334640	2018-06-13 18:52:54 +00:00
George Karpenkov	788087f5f8	Add checkMulAdd helper function to CheckedArithmetic Multiplication followed by addition (https://en.wikipedia.org/wiki/Multiply–accumulate_operation) is a sufficiently common use-case to warrant a separate helper. Differential Revision: https://reviews.llvm.org/D48138 llvm-svn: 334635	2018-06-13 18:32:02 +00:00
George Karpenkov	3bbaeaf673	Change checked arithmetic functions API to return Optional Returning optional is much safer. The previous API had potential to cause use of undefined variables, if the value passed by pointer was accidentally read afterwards. Differential Revision: https://reviews.llvm.org/D48137 llvm-svn: 334634	2018-06-13 18:31:43 +00:00
Andrea Di Biagio	0ffb2271a1	[llvm-mca] Fixed a bug in the logic that checks if a memory operation is ready to execute. Fixes PR37790. In some (very rare) cases, the LSUnit (Load/Store unit) was wrongly marking a load (or store) as "ready to execute" effectively bypassing older memory barrier instructions. To reproduce this bug, the memory barrier must be the first instruction in the input assembly sequence, and it doesn't have to perform any register writes. llvm-svn: 334633	2018-06-13 18:30:14 +00:00
Jordan Rose	d71614a438	[CMake] Handle 'libtool' being at a path with spaces in it. This can happen on macOS if the user's Xcode is at a path with spaces in it. llvm-svn: 334632	2018-06-13 18:21:47 +00:00
Peter Collingbourne	881ba10465	LTO: Keep file handles open for memory mapped files. On Windows we've observed that if you open a file, write to it, map it into memory and close the file handle, the contents of the memory mapping can sometimes be incorrect. That was what we did when adding an entry to the ThinLTO cache using the TempFile and MemoryBuffer classes, and it was causing intermittent build failures on Chromium's ThinLTO bots on Windows. More details are in the associated Chromium bug (crbug.com/786127). We can prevent this from happening by keeping a handle to the file open while the mapping is active. So this patch changes the mapped_file_region class to duplicate the file handle when mapping the file and close it upon unmapping it. One gotcha is that the file handle that we keep open must not have been created with FILE_FLAG_DELETE_ON_CLOSE, as otherwise the operating system will prevent other processes from opening the file. We can achieve this by avoiding the use of FILE_FLAG_DELETE_ON_CLOSE altogether. Instead, we use SetFileInformationByHandle with FileDispositionInfo to manage the delete-on-close bit. This lets us remove the hack that we used to use to clear the delete-on-close bit on a file opened with FILE_FLAG_DELETE_ON_CLOSE. A downside of using SetFileInformationByHandle/FileDispositionInfo as opposed to FILE_FLAG_DELETE_ON_CLOSE is that it prevents us from using CreateFile to open the file while the flag is set, even within the same process. This doesn't seem to matter for almost every client of TempFile, except for LockFileManager, which calls sys::fs::create_link to create a hard link from the lock file, and in the process of doing so tries to open the file. To prevent this change from breaking LockFileManager I changed it to stop using TempFile by effectively reverting r318550. Differential Revision: https://reviews.llvm.org/D48051 llvm-svn: 334630	2018-06-13 18:03:14 +00:00
Craig Topper	e399f55826	[X86] Add one more intrinsic and test cases to avx512-cvttp2i.ll. spatel noticed it was missing in D47993. llvm-svn: 334629	2018-06-13 17:55:13 +00:00
Saleem Abdulrasool	4d1c854884	IR: fix documentation markup Use `\brief` instead of `\Brief`. NFC. llvm-svn: 334627	2018-06-13 17:51:27 +00:00
Yaxun Liu	fb17bf60dd	[AMDGPU] Change enqueue kernel handle type Currently the handle type is a global pointer which holds 8 bytes. We need a larger type which hold 16 bytes, therefore change it to [i64 x 2]. Differential Revision: https://reviews.llvm.org/D48094 llvm-svn: 334625	2018-06-13 17:31:51 +00:00
Simon Pilgrim	9fd634db22	[CostModel][X86] Test showing failure to recognise REVERSE shuffle mask if the elements come from the second src llvm-svn: 334623	2018-06-13 17:12:11 +00:00
Dmitry Preobrazhensky	32c6b5cb70	[AMDGPU][MC] Enabled parsing of relocations on VALU instructions See bug 37566: https://bugs.llvm.org/show_bug.cgi?id=37566 Reviewers: artem.tamazov, arsenm, nhaehnle Differential Revision: https://reviews.llvm.org/D47884 llvm-svn: 334622	2018-06-13 17:02:03 +00:00
Simon Pilgrim	54a138a0c5	[CostModel] Recognise BROADCAST shuffle mask if the elements come from the second src llvm-svn: 334620	2018-06-13 16:52:02 +00:00
Andrea Di Biagio	d5690628db	Revert: [llvm-mca] Flush the output stream before we start the analysis of a new code region. NFC Not sure why, but it breaks buildbot clang-cmake-armv8-full. It causes a failure in TEST 'Xray-armhf-linux :: TestCases/Posix/profiling-single-threaded.cc'. llvm-svn: 334617	2018-06-13 16:33:52 +00:00
Simon Pilgrim	5af0b99ea4	[CostModel][X86] Test showing failure to recognise BROADCAST shuffle mask if the elements come from the second src llvm-svn: 334616	2018-06-13 16:33:42 +00:00
Andrea Di Biagio	f6ee0c9071	[llvm-mca] Flush the output stream before we start the analysis of a new code region. NFC llvm-svn: 334610	2018-06-13 15:43:56 +00:00
Dmitry Preobrazhensky	ffbee7acdc	[AMDGPU][MC][GFX8][GFX9] Allow LDS direct reads for BUFFER_LOAD_DWORDX2/X3/X4 See bug 37653: https://bugs.llvm.org/show_bug.cgi?id=37653 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D47885 llvm-svn: 334609	2018-06-13 15:32:46 +00:00
Sanjay Patel	7d4929611c	[DAGCombiner] remove hasOneUse() check from fadd constants transform We're constant folding here, so we shouldn't check uses. This matches the IR optimizer behavior. The x86 test shows the expected win. The AArch64 test shows something else. This only seems to happen if the "generic" AArch64 CPU model is used by MachineCombiner, so I'll file a bug report to follow-up. llvm-svn: 334608	2018-06-13 15:22:48 +00:00
Tom Stellard	264c171f36	AMDGPU: Move isSDNodeSourceOfDivergence() implementation to SITargetLowering Summary: The code that handles ISD:Register and ISD::CopyFromReg assumes the target is amdgcn, so this is broken on r600. We don't need this analysis on r600 anyway so we can safely move it to SITargetLowering. Reviewers: alex-t, arsenm, nhaehnle Reviewed By: arsenm Subscribers: msearles, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46298 llvm-svn: 334607	2018-06-13 15:06:37 +00:00
Sanjay Patel	9f3f18d6f6	[x86] add test for fadd with more than one use; NFC The equivalent AArch64 test added at rL334556 isn't showing the expected output from the DAGCombiner code change that would fix this example. That's a machine combiner bug from what I see. llvm-svn: 334605	2018-06-13 15:01:07 +00:00
Cameron McInally	f37bd01ddc	[FPEnv] Expand constrained FP operations Add a helper function to expand constrained FP operations as needed. Note that the Strict POWI operation is not handled in this patch since the format is slightly different from the others. Differential Revision: https://reviews.llvm.org/D47491 llvm-svn: 334603	2018-06-13 14:32:12 +00:00

... 2 3 4 5 6 ...

165609 Commits