llvm-project

Commit Graph

Author	SHA1	Message	Date
Teresa Johnson	3bd4b5a925	[CGP] Build the DominatorTree lazily Summary: In r355512 CGP was changed to build the DominatorTree only once per function traversal, to avoid repeatedly building it each time it was accessed. This solved one compile time issue but introduced another. In the second case, we now were building the DT unnecessarily many times when we performed many function traversals (i.e. more than once per function when running CGP because of changes made each time). Change to saving the DT in the CodeGenPrepare object, and building it lazily when needed. It is reset whenever we need to rebuild it. The case that exposed the issue there are 617 functions, and we walk them (i.e. execute the "while (MadeChange)" loop in runOnFunction) a total of 12083 times (so previously we were building the DT 12083 times). With this patch we only build the DT 844 times (average of 1.37 times per function). We dropped the total time to compile this file from 538.11s without this patch to 339.63s with it. There is still an issue as CGP is taking much longer than all other passes even with this patch, and before a recent compiler release cut at r355392 the total time to this compile was only 97 sec with a huge reduction in CGP time. I suspect that one of the other recent changes to CGP led to iterating each function many more times on average, but I need to do some more investigation. Reviewers: spatel Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59696 llvm-svn: 356937	2019-03-25 18:38:48 +00:00
Sanjay Patel	f49e33e252	[x86] add another vector zext test; NFC Goes with the proposal in D59777 llvm-svn: 356930	2019-03-25 17:53:56 +00:00
Mikhail R. Gadelha	db695c834f	Moved everything SMT-related to LLVM and updated the cmake scripts. Differential Revision: https://reviews.llvm.org/D54978 llvm-svn: 356929	2019-03-25 17:47:45 +00:00
Matt Arsenault	b27e4974d0	MISched: Don't schedule regions with 0 instructions I think this is correct, but may not necessarily be the correct fix for the assertion I'm really trying to solve. If a scheduling region was found that only has dbg_value instructions, the RegPressure tracker would end up in an inconsistent state because it would skip over any debug instructions and point to an instruction outside of the scheduling region. It may still be possible for this to happen if there are some real schedulable instructions between dbg_values, but I haven't managed to break this. The testcase is extremely sensitive and I'm not sure how to make it more resistent to future scheduler changes that would avoid stressing this situation. llvm-svn: 356926	2019-03-25 17:15:44 +00:00
Tom Stellard	2224181dad	merge-request.sh: Update 8.0 metabug for 8.0.1 llvm-svn: 356924	2019-03-25 17:01:29 +00:00
Matt Arsenault	fa28455116	AMDGPU: Preserve LiveIntervals in WQM This seems to already be done, but wasn't marked. llvm-svn: 356922	2019-03-25 16:47:42 +00:00
James Henderson	1f44814952	[llvm-objcopy]Preserve data in segments not covered by sections llvm-objcopy previously knew nothing about data in segments that wasn't covered by section headers, meaning that it wrote zeroes instead of what was there. As it is possible for this data to be useful to the loader, this patch causes llvm-objcopy to start preserving this data. Data in sections that are explicitly removed continues to be written as zeroes. This fixes https://bugs.llvm.org/show_bug.cgi?id=41005. Reviewed by: jakehehrlich, rupprecht Differential Revision: https://reviews.llvm.org/D59483 llvm-svn: 356919	2019-03-25 16:36:26 +00:00
Simon Pilgrim	77749567a1	[SLPVectorizer] Update file missed in rL356913 Differential Revision: https://reviews.llvm.org/D59738 llvm-svn: 356915	2019-03-25 16:14:21 +00:00
Sanjay Patel	76c1ef3d07	[x86] add tests for vector zext; NFC The AVX1 lowering is poor. llvm-svn: 356914	2019-03-25 15:54:34 +00:00
Simon Pilgrim	ff3abef395	[SLPVectorizer] reorderInputsAccordingToOpcode - remove non-Instruction canonicalization Remove attempts to commute non-Instructions to the LHS - the codegen changes appear to rely on chance more than anything else and also have a tendency to fight existing instcombine canonicalization which moves constants to the RHS of commutable binary ops. This is prep work towards: (a) reusing reorderInputsAccordingToOpcode for alt-shuffles and removing the similar reorderAltShuffleOperands (b) improving reordering to optimized cases with commutable and non-commutable instructions to still find splat/consecutive ops. Differential Revision: https://reviews.llvm.org/D59738 llvm-svn: 356913	2019-03-25 15:53:55 +00:00
Serge Guelton	dcb128f73b	Revert 356905 Commited from wrong directory... llvm-svn: 356907	2019-03-25 15:18:55 +00:00
Serge Guelton	2dea7c4113	Python 2/3 compat: queue vs Queue Differential Revision: https://reviews.llvm.org/D59590 llvm-svn: 356905	2019-03-25 15:14:15 +00:00
Pavel Labath	c5f7905e93	MinidumpYAML.cpp: Fix some code standard violations missed during review functions should begin with lower case letters. NFC. llvm-svn: 356901	2019-03-25 14:45:31 +00:00
Jonas Paulsson	0e75e21eb3	[RegAlloc] Simplify MIR test Remove the IR part from test/CodeGen/X86/regalloc-copy-hints.mir (added by r355854). To make the test remain functional, the parts of the MBB names referring to BB names have been removed, as well as all machine memory operands. llvm-svn: 356899	2019-03-25 14:28:32 +00:00
Brock Wyma	bc92c8c863	[DebugInfo] IntelJitEventListener follow up for "add SectionedAddress ..." Following r354972 the Intel JIT Listener would not report line table information because the section indices did not match. There was a similar issue with the PerfJitEventListener. This change performs the section index lookup when building the object address used to query the line table information. Differential Revision: https://reviews.llvm.org/D59490 llvm-svn: 356895	2019-03-25 13:50:26 +00:00
Alexander Kornienko	e9087fe75c	[clang-tidy] Separate the check-facing interface Summary: Move ClangTidyCheck to a separate header/.cpp Switch checks to #include "ClangTidyCheck.h" Mention ClangTidyCheck.h in the docs Reviewers: hokein, gribozavr, aaron.ballman Reviewed By: hokein Subscribers: mgorny, javed.absar, xazax.hun, arphaman, jdoerfert, llvm-commits, cfe-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D59714 llvm-svn: 356890	2019-03-25 12:36:30 +00:00
George Rimar	e6963be317	[llvm-objcopy] - Refactor the code. NFC. The idea of the patch is about to move out the code to a new helper static functions (to reduce the size of 'handleArgs' and to isolate the parts of it's logic). Differential revision: https://reviews.llvm.org/D59762 llvm-svn: 356889	2019-03-25 12:34:25 +00:00
Petar Avramovic	a034a64f84	[MIPS GlobalISel] Select copy for arguments from FPRBRegBank Move selectCopy into MipsInstructionSelector class. Select copy for arguments from FPRBRegBank for MIPS32. Differential Revision: https://reviews.llvm.org/D59644 llvm-svn: 356886	2019-03-25 11:38:06 +00:00
Nico Weber	3d9fa09aa2	gn build: Clean up README.rst a bit - Make introduction a bit shorter - Add a `git clone` step to Quick start - Put command to run first in each of the Quick start steps - Use ``code`` instead of `label` throughout; this is .rst not .md Differential Revision: https://reviews.llvm.org/D59600 llvm-svn: 356885	2019-03-25 11:33:19 +00:00
Nico Weber	d59857c334	gn build: Let get.py keep zip file in memory instead of using a temp file The zip is small, and it's a bit less code this way. No intended behavior change. Differential Revision: https://reviews.llvm.org/D59677 llvm-svn: 356884	2019-03-25 11:32:27 +00:00
Petar Avramovic	3dfa368d5d	[MIPS GlobalISel] Add floating point register bank Add floating point register bank for MIPS32. Implement getRegBankFromRegClass for float register classes. Differential Revision: https://reviews.llvm.org/D59643 llvm-svn: 356883	2019-03-25 11:30:46 +00:00
Petar Avramovic	5a457e08f6	[MIPS GlobalISel] Lower float and double arguments in registers Lower float and double arguments in registers for MIPS32. When float/double argument is passed through gpr registers select appropriate move instruction. Differential Revision: https://reviews.llvm.org/D59642 llvm-svn: 356882	2019-03-25 11:23:41 +00:00
Xing GUO	ea16be1ca7	[llvm-readobj] Separate `Symbol Version` dumpers into `LLVM style` and `GNU style` Summary: Currently, llvm-readobj can dump symbol version sections only in LLVM style. In this patch, I would like to separate these dumpers into GNU style and LLVM style for future implementation. Reviewers: grimar, jhenderson, mattd, rupprecht Reviewed By: jhenderson, rupprecht Subscribers: ormris, dyung, RKSimon, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59186 llvm-svn: 356881	2019-03-25 11:02:49 +00:00
Hans Wennborg	5519cb2d94	Fix the build with GCC 4.8 after r356783 llvm-svn: 356875	2019-03-25 09:27:42 +00:00
Sjoerd Meijer	65584d3811	[TTI] Move getIntrinsicCost to allow functions to be overridden. NFC Moving this to base class TargetTransformInfoImplCRTPBase allows static_cast to a subtarget so that calls to e.g. getMemcpyCost actually go the overridden functions. Differential revision: https://reviews.llvm.org/D59706 llvm-svn: 356873	2019-03-25 08:54:47 +00:00
Diana Picus	254b11a0fd	[ARM GlobalISel] 64-bit memops should be aligned We currently use only VLDR/VSTR for all 64-bit loads/stores, so the memory operands must be word-aligned. Mark aligned operations as legal and narrow non-aligned ones to 32 bits. While we're here, also mark non-power-of-2 loads/stores as unsupported. llvm-svn: 356872	2019-03-25 08:54:29 +00:00
Craig Topper	a17287f084	[X86] Update some of the getMachineNode calls from X86ISelDAGToDAG to also include a VT for a EFLAGS result. This makes the nodes consistent with how they would be emitted from the isel table. llvm-svn: 356870	2019-03-25 07:22:18 +00:00
Craig Topper	1cc01c3228	[X86] When selecting (x << C1) op C2 as (x op (C2>>C1)) << C1, use the operation VT for the target constant. Normally when the nodes we use here(AND32ri8 for example) are selected their immediates are just converted from ConstantSDNode to TargetConstantSDNode without changing VT from the original operation VT. So we should still be emitting them with the operation VT. Theoretically this could expose more accurate opportunities for CSE. llvm-svn: 356869	2019-03-25 06:53:45 +00:00
Craig Topper	3810e35d3f	[X86] Remove GetLo8XForm and use GetLo32XForm instead. NFCI We were using this to create an AND32ri8 node from a 64-bit and, but that node normally still uses a 32-bit immediate. So we should just truncate the existing immediate to i32. We already verified it has the same value in bits 31:7. llvm-svn: 356868	2019-03-25 06:53:44 +00:00
Craig Topper	5b43446831	[X86] Remove a couple unused SDNodeXForms. NFC llvm-svn: 356867	2019-03-25 06:53:43 +00:00
Craig Topper	7c2554dd92	Revert r356688 "[X86] Don't avoid folding multiple use sign extended 8-bit immediate into instructions under optsize." Looking back over how the one use optimization works, I don't think this is the right way to fix this. llvm-svn: 356866	2019-03-25 01:25:32 +00:00
Simon Pilgrim	87d4ab8b92	[X86][SSE41] Start shuffle combining from ZERO_EXTEND_VECTOR_INREG (PR40685) Enable SSE41 ZERO_EXTEND_VECTOR_INREG shuffle combines - for the PMOVZX(PSHUFD(V)) -> UNPCKH(V,0) pattern we reduce the shuffles (port5-bottleneck on Intel) at the expense of creating a zero (pxor v,v) and an extra register move - which is a good trade off as these are pretty cheap and in most cases it doesn't increase register pressure. This also exposed a missed opportunity to use combine to ZERO_EXTEND_VECTOR_INREG with folded loads - even if we're in the float domain. llvm-svn: 356864	2019-03-24 19:06:35 +00:00
Heejin Ahn	803c7782d5	[WebAssembly] Rename a variable in CFGSort (NFC) Class `RegionInfo` was `SortUnitInfo` before, so the variables were named `SUI`. Now the class name is `RegionInfo`, so this renames `SUI` to `RI`, matching the class name. llvm-svn: 356861	2019-03-24 17:34:40 +00:00
Craig Topper	07e3071854	[LegalizeDAG] Expand i16 bswap directly to a rotate by 8 instead of relying on DAG combine. An i16 bswap can be implemented with an i16 rotate by 8. We previously emitted a shift and OR sequence that DAG combine should be able to turn back into rotate. But we might as well go there directly. If rotate isn't legal, LegalizeDAG should further legalize it to either the opposite rotate, or the shift and OR pattern. I don't know of any way to get the existing DAG combine reliance to fail. So I don't know any way to add new tests for this that wouldn't have worked previously. llvm-svn: 356860	2019-03-24 17:02:14 +00:00
Simon Pilgrim	4465a765ee	[X86] Remove icmp undef from reduced tests Pre-commit for D59363 (Add icmp UNDEF handling to SelectionDAG::FoldSetCC) Approved by @spatel (Sanjay Patel) llvm-svn: 356859	2019-03-24 17:02:08 +00:00
Simon Pilgrim	a71c0ed471	[X86][AVX] Start shuffle combining from ZERO_EXTEND_VECTOR_INREG (PR40685) Just enable this for AVX for now as SSE41 introduces extra register moves for the PMOVZX(PSHUFD(V)) -> UNPCKH(V,0) pattern (but otherwise helps reduce port5 usage on Intel targets). Only AVX support is required for PR40685 as the issue is due to 8i8->8i32 zext shuffle leftovers. llvm-svn: 356858	2019-03-24 16:30:35 +00:00
Teresa Johnson	4dc851964c	[CGP] Make several static functions member functions (NFC) This is extracted from D59696 as suggested in the review. It is preparation for making the DominatorTree a member variable. llvm-svn: 356857	2019-03-24 15:18:50 +00:00
George Rimar	272571718c	Recommit r356738 "[llvm-objcopy] - Implement replaceSectionReferences for GroupSection class." Fix: r356853 + set AddressAlign to 4 in Inputs/compress-debug-sections.yaml for the new group section introduced. Original commit message: Currently, llvm-objcopy incorrectly handles compression and decompression of the sections from COMDAT groups, because we do not implement the replaceSectionReferences for this type of the sections. The patch does that. Differential revision: https://reviews.llvm.org/D59638 llvm-svn: 356856	2019-03-24 14:41:45 +00:00
Sanjay Patel	7d676dfd86	[x86] improve the default expansion of uaddsat/usubsat This is yet another step towards solving PR14613: https://bugs.llvm.org/show_bug.cgi?id=14613 uaddsat X, Y --> (X >u (X + Y)) ? -1 : X + Y usubsat X, Y --> (X >u Y) ? X - Y : 0 We can't count on a sane vector ISA, so override the default (umin/umax) expansion of unsigned add/sub saturate in cases where we do not have umin/umax. Differential Revision: https://reviews.llvm.org/D59006 llvm-svn: 356855	2019-03-24 13:55:54 +00:00
Simon Pilgrim	5cd4eb96f6	[SLPVectorizer] shouldReorderOperands - just check for reordering. NFCI. Remove the I.getOperand() calls from inside shouldReorderOperands - reorderInputsAccordingToOpcode should handle the creation of the operand lists and shouldReorderOperands should just check to see whether the i'th element should be commuted. llvm-svn: 356854	2019-03-24 13:36:32 +00:00
George Rimar	0a5d4b8472	[llvm-objcopy] - Report SHT_GROUP sections with invalid alignment. This patch fixes the reason of ubsan failure (UB detected) happened after landing the D59638 (I had to revert it). http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-ubsan/builds/11760/steps/check-llvm%20ubsan/logs/stdio) Problem is the following. Our implementation of GroupSection assumes that its address is 4 bytes aligned when writes it: template <class ELFT> void ELFSectionWriter<ELFT>::visit(const GroupSection &Sec) { ELF::Elf32_Word Buf = reinterpret_cast<ELF::Elf32_Word >(Out.getBufferStart() + Sec.Offset); ... But the test case for D59638 did not set AddressAlign in YAML. So address was not 4 bytes aligned since Sec.Offset was odd. That triggered the issue. This patch teaches llvm-objcopy to report an error for such sections (which should not met in reality), what is better than having UB. Differential revision: https://reviews.llvm.org/D59695 llvm-svn: 356853	2019-03-24 13:31:08 +00:00
Nikita Popov	977934f00f	[ConstantRange] Add getFull() + getEmpty() named constructors; NFC This adds ConstantRange::getFull(BitWidth) and ConstantRange::getEmpty(BitWidth) named constructors as more readable alternatives to the current ConstantRange(BitWidth, /* full */ false) and similar. Additionally private getFull() and getEmpty() member functions are added which return a full/empty range with the same bit width -- these are commonly needed inside ConstantRange.cpp. The IsFullSet argument in the ConstantRange(BitWidth, IsFullSet) constructor is now mandatory for the few usages that still make use of it. Differential Revision: https://reviews.llvm.org/D59716 llvm-svn: 356852	2019-03-24 09:34:40 +00:00
Nico Weber	d48c8fdd30	gn build: Merge r356820 llvm-svn: 356846	2019-03-23 23:22:45 +00:00
Nico Weber	3e016d48c4	gn build: Add build files for modularize and pp-trace Differential Revision: https://reviews.llvm.org/D59701 llvm-svn: 356845	2019-03-23 23:16:41 +00:00
Simon Pilgrim	1466e5c383	Fix unused variable warning on non-asserts builds. NFCI. llvm-svn: 356841	2019-03-23 16:56:23 +00:00
Simon Pilgrim	64feec7977	Remove unused function argument. NFCI. llvm-svn: 356840	2019-03-23 16:20:34 +00:00
Fangrui Song	40483e1831	[DWARF] Delete a stray break and a stray comment. NFC llvm-svn: 356838	2019-03-23 16:15:40 +00:00
Simon Pilgrim	9eb0de8573	[X86][SLP] Show example of failure to uniformly commute splats for 'alt' shuffles. If either the main/alt opcodes isn't commutable we may end up with the splats not correctly commuted to the same side. llvm-svn: 356837	2019-03-23 16:14:04 +00:00
Sanjay Patel	2e92846d36	[x86] reduce code duplication; NFC llvm-svn: 356836	2019-03-23 15:00:52 +00:00
Simon Pilgrim	c7ba9555cf	[SLPVectorizer] reorderInputsAccordingToOpcode - use InstructionState directly. NFCI. llvm-svn: 356832	2019-03-23 13:44:06 +00:00

1 2 3 4 5 ...

176657 Commits