llvm-project

Commit Graph

Author	SHA1	Message	Date
Rui Ueyama	a16fe65b72	Rewrite FileOutputBuffer as two separate classes. This patch is to rewrite FileOutputBuffer as two separate classes; one for file-backed output buffer and the other for memory-backed output buffer. I think the new code is easier to follow because two different implementations are now actually separated as different classes. Unlike the previous implementation, the class that does not replace the final output file using rename(2) does not create a temporary file at all. Instead, it allocates memory using mmap(2) and use it. I think this is an improvement because it is now guaranteed that the temporary memory region doesn't trigger any I/O and there's now zero chance to leave a temporary file behind. Also, it shouldn't impose new restrictions because were using mmap IO too. Differential Revision: https://reviews.llvm.org/D39449 llvm-svn: 317127	2017-11-01 21:38:14 +00:00
Eugene Zelenko	0ad18f888e	[dsymutil, llvm-objcopy] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 317123	2017-11-01 21:16:06 +00:00
Craig Topper	4e56ba271e	[X86] Add custom code to EVEX to VEX pass to turn unmasked 128-bit VPALIGND/Q into VPALIGNR if the extended registers aren't being used. This will enable us to prefer VALIGND/Q during shuffle lowering in order to get the extended register encoding space when BWI isn't available. But if we end up not using the extended registers we can switch VPALIGNR for the shorter VEX encoding. Differential Revision: https://reviews.llvm.org/D39401 llvm-svn: 317122	2017-11-01 21:00:59 +00:00
Adrian Prantl	98c6549e4a	loop-rotate: avoid duplicating dbg.value intrinsics in the entry block. This fixes the second half of PR35113. This reapplies r317106 without modifications. llvm-svn: 317121	2017-11-01 20:53:22 +00:00
Adrian Prantl	d60f34c20a	loop-rotate: eliminate duplicate debug intrinsics after splicing. Fixes part of PR35113. This reapplies r317105 with an additional check for isa<Instruction> as found by the bots. llvm-svn: 317120	2017-11-01 20:43:30 +00:00
Dehao Chen	c6c051f2ea	Include GUIDs from the same module when computing GUIDs that needs to be imported. Summary: In the compile phase of SamplePGO+ThinLTO, ICP is not invoked. Instead, indirect call targets will be included as function metadata for ThinIndex to buidl the call graph. This should not only include functions defined in other modules, but also functions defined in the same module, otherwise ThinIndex may find the callee dead and eliminate it, while ICP in backend will revive the symbol, which leads to undefined symbol. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: sanjoy, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D39480 llvm-svn: 317118	2017-11-01 20:26:47 +00:00
Daniel Sanders	9cbe7c7f93	[globalisel][tablegen] Add support for multi-insn emission The importer will now accept nested instructions in the result pattern such as (ADDWrr $a, (SUBWrr $b, $c)). This is only valid when the nested instruction def's a single vreg and the parent instruction consumes a single vreg where a nested instruction is specified. The importer will automatically create a vreg to connect the two using the type information from the pattern. This vreg will be constrained to the register classes given in the instruction definitions. REG_SEQUENCE is explicitly rejected because of this. The definition doesn't constrain to a register class and it therefore needs special handling. llvm-svn: 317117	2017-11-01 19:57:57 +00:00
Philip Reames	7b861f08cd	Revert 317016 and 317048 The former appears to have introduced a miscompile in a stage2 clang build. Revert so I can investigate offline. llvm-svn: 317116	2017-11-01 19:49:20 +00:00
Konstantin Zhuravlyov	435151ad75	AMDGPU: Fix set but not used warnings related to AMDGPUAS Differential Revision: https://reviews.llvm.org/D39499 llvm-svn: 317114	2017-11-01 19:12:38 +00:00
Craig Topper	ca1aa83cbe	[X86] Prevent fast isel from folding loads into the instructions listed in hasPartialRegUpdate. This patch moves the check for opt size and hasPartialRegUpdate into the lower level implementation of foldMemoryOperandImpl to catch the entry point that fast isel uses. We're still folding undef register instructions in AVX that we should also probably disable, but that's a problem for another patch. Unfortunately, this requires reordering a bunch of functions which is why the diff is so large. I can do the function reordering separately if we want. Differential Revision: https://reviews.llvm.org/D39402 llvm-svn: 317112	2017-11-01 18:10:06 +00:00
Graham Yiu	671526148c	Adds code to PPC ISEL lowering to recognize half-word inserts from vector_shuffles, and use P9 shift and vector insert instructions instead of vperm. Differential Revision: https://reviews.llvm.org/D34160 llvm-svn: 317111	2017-11-01 18:06:56 +00:00
Adrian Prantl	c8516346e4	Revert r317105 to investigate bot breakage. llvm-svn: 317110	2017-11-01 18:06:38 +00:00
Adrian Prantl	40a0ea5f29	Revert r317106 to facilitate reverting r317105. llvm-svn: 317109	2017-11-01 18:06:35 +00:00
Peter Collingbourne	9fb6e1a037	LTO: Apply global DCE to ThinLTO modules at LTO opt level 0. This is necessary because DCE is applied to full LTO modules. Without this change, a reference from a dead ThinLTO global to a dead full LTO global will result in an undefined reference at link time. This problem is only observable when --gc-sections is disabled, or when targeting COFF, as the COFF port of lld requires all symbols to have a definition even if all references are dead (this is consistent with link.exe). This change also adds an EliminateAvailableExternally pass at -O0. This is necessary to handle the situation on Windows where a non-prevailing copy of a linkonce_odr function has an SEH filter function; any such filters must be DCE'd because they will contain a call to the llvm.localrecover intrinsic, passing as an argument the address of the function that the filter belongs to, and llvm.localrecover requires this function to be defined locally. Fixes PR35142. Differential Revision: https://reviews.llvm.org/D39484 llvm-svn: 317108	2017-11-01 17:58:39 +00:00
Craig Topper	56db9d6bec	[X86] Regnerate test to attempt to fix build bot failure. llvm-svn: 317107	2017-11-01 17:44:12 +00:00
Adrian Prantl	9259f21604	loop-rotate: avoid duplicating dbg.value intrinsics in the entry block. This fixes the second half of PR35113. llvm-svn: 317106	2017-11-01 17:28:50 +00:00
Adrian Prantl	b627acd0ce	loop-rotate: eliminate duplicate debug intrinsics after splicing. Fixes part of PR35113. llvm-svn: 317105	2017-11-01 17:28:47 +00:00
Jonas Devlieghere	369a7ecc56	[dsymutil][NFC} Rename thread related command line options This makes the command line options consistent with llvm-cov and llvm-profdata, which both use `-num-threads` and `-j`. This also addresses the conflict reported after landing D39355. Differential revision: https://reviews.llvm.org/D39496 llvm-svn: 317104	2017-11-01 17:15:29 +00:00
Craig Topper	5ae677e102	[X86] Add 64-bit int to float/double conversion with AVX to X86FastISel::X86SelectSIToFP Summary: [X86] Teach fast isel to handle i64 sitofp with AVX. For some reason we only handled i32 sitofp with AVX. But with SSE only we support i64 so we should do the same with AVX. Also add i686 command lines for the 32-bit tests. 64-bit tests are in a separate file to avoid a fast-isel abort failure in 32-bit mode. Reviewers: RKSimon, zvi Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39450 llvm-svn: 317102	2017-11-01 16:23:06 +00:00
Andrew V. Tischenko	3d971e39f8	Update VCVTx, VMOVNTPx and VROUNDYPx instructions scheduling on btver2. Differential Revision: https://reviews.llvm.org/D39059 llvm-svn: 317101	2017-11-01 16:10:20 +00:00
Petar Jovanovic	f2faee92aa	Correct dwarf unwind information in function epilogue for X86 This patch aims to provide correct dwarf unwind information in function epilogue for X86. It consists of two parts. The first part inserts CFI instructions that set appropriate cfa offset and cfa register in emitEpilogue() in X86FrameLowering. This part is X86 specific. The second part is platform independent and ensures that: - CFI instructions do not affect code generation - Unwind information remains correct when a function is modified by different passes. This is done in a late pass by analyzing information about cfa offset and cfa register in BBs and inserting additional CFI directives where necessary. Changed CFI instructions so that they: - are duplicable - are not counted as instructions when tail duplicating or tail merging - can be compared as equal Added CFIInstrInserter pass: - analyzes each basic block to determine cfa offset and register valid at its entry and exit - verifies that outgoing cfa offset and register of predecessor blocks match incoming values of their successors - inserts additional CFI directives at basic block beginning to correct the rule for calculating CFA Having CFI instructions in function epilogue can cause incorrect CFA calculation rule for some basic blocks. This can happen if, due to basic block reordering, or the existence of multiple epilogue blocks, some of the blocks have wrong cfa offset and register values set by the epilogue block above them. CFIInstrInserter is currently run only on X86, but can be used by any target that implements support for adding CFI instructions in epilogue. Patch by Violeta Vukobrat. Differential Revision: https://reviews.llvm.org/D35844 llvm-svn: 317100	2017-11-01 16:04:11 +00:00
Simon Pilgrim	778810eb42	[X86][SSE] Begun generalizing truncateVectorWithPACKSS to work with PACKSS/PACKUS functions Renamed to truncateVectorWithPACK llvm-svn: 317098	2017-11-01 15:31:51 +00:00
Simon Pilgrim	2b09f39b4d	Regenerate PACKUS/TRUNCS test (PR31773) llvm-svn: 317096	2017-11-01 15:27:23 +00:00
Geoff Berry	eed6531ea2	[BranchProbabilityInfo] Handle irreducible loops. Summary: Compute the strongly connected components of the CFG and fall back to use these for blocks that are in loops that are not detected by LoopInfo when computing loop back-edge and exit branch probabilities. Reviewers: dexonsmith, davidxl Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D39385 llvm-svn: 317094	2017-11-01 15:16:50 +00:00
Roger Ferrer Ibanez	9dfbc10522	Revert r313618 "[ARM] Use ADDCARRY / SUBCARRY" That change causes PR35103, so reverting until I figure it out. llvm-svn: 317092	2017-11-01 14:06:57 +00:00
NAKAMURA Takumi	1657f2ad99	Fix warnings discovered by rL317076. [-Wunused-private-field] llvm-svn: 317091	2017-11-01 13:47:55 +00:00
NAKAMURA Takumi	f7d7a59b9e	Suppress a warning discovered by rL317076. [-Wunused-private-field] llvm-svn: 317090	2017-11-01 13:47:51 +00:00
Max Kazantsev	6f5229d7da	Revert rL311205 "[IRCE] Fix buggy behavior in Clamp" This patch reverts rL311205 that was initially a wrong fix. The real problem was in intersection of signed and unsigned ranges (see rL316552), and the patch being reverted masked the problem instead of fixing it. By now, the test against which rL311205 was made works OK even without this code. This revert patch also contains a test case that demonstrates incorrect behavior caused by rL311205: it is caused by incorrect choise of signed max instead of unsigned. llvm-svn: 317088	2017-11-01 13:21:56 +00:00
Simon Pilgrim	687982c181	[SelectionDAG] computeKnownBits - use ashrInPlace on known bits of ISD::SRA input. NFCI. llvm-svn: 317087	2017-11-01 13:16:48 +00:00
Simon Pilgrim	f657ba0cb6	[X86][SSE] Truncate with PACKSS any input with sufficient sign-bits So far we've only been using PACKSS truncations with 'all-bits or zero-bits' patterns (vector comparison results etc.). When really we can safely use it for any case as long as the number of sign bits reach down to the last 16-bits (or 8-bits if we're truncating to bytes). The next steps after this is add the equivalent support for PACKUS and to support packing to sub-128 bit vectors for truncating stores etc. Differential Revision: https://reviews.llvm.org/D39476 llvm-svn: 317086	2017-11-01 11:47:44 +00:00
Florian Hahn	b93c06331e	[CodeExtractor] Fix iterator invalidation in findOrCreateBlockForHoisting. Summary: By replacing branches to CommonExitBlock, we remove the node from CommonExitBlock's predecessors, invalidating the iterator. The problem is exposed when the common exit block has multiple predecessors and needs to sink lifetime info. The modification in the test case trigger the issue. Reviewers: davidxl, davide, wmi Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39112 llvm-svn: 317084	2017-11-01 09:48:12 +00:00
Serguei Katkov	f2c2851efe	Fix APFloat mod sign fmod specification requires the sign of the remainder is the same as numerator in case remainder is zero. Reviewers: gottesmm, scanon, arsenm, davide, craig.topper Reviewed By: scanon Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D39225 llvm-svn: 317081	2017-11-01 07:56:55 +00:00
Craig Topper	688f0ca6a7	[X86] Add more type qualifiers to INSERT_SUBREG operations in rotate patterns so they don't get created with a v64i8 type. Not sure why tablegen didn't error on this. Fixes PR35158. llvm-svn: 317079	2017-11-01 07:11:32 +00:00
NAKAMURA Takumi	53fc7e1763	Reformat. llvm-svn: 317078	2017-11-01 05:14:35 +00:00
NAKAMURA Takumi	7c1ef4a88c	Revert rL317019, "[ADT] Split optional to only include copy mechanics and dtor for non-trivial types." Seems g++-4.8 (eg. Ubuntu 14.04) doesn't like this. llvm-svn: 317077	2017-11-01 05:14:31 +00:00
Craig Topper	c51aac675d	[DAGCombiner] Fix typos in comments. NFC llvm-svn: 317072	2017-11-01 03:30:52 +00:00
Mitch Phillips	41450d583b	Add test dependency on llvm-cfi-verify to fix up the build breakages on sanitizers. llvm-svn: 317060	2017-11-01 00:49:45 +00:00
Craig Topper	a827f84dcc	[X86] Add AVX512 support to X86FastISel::fastMaterializeFloatZero. llvm-svn: 317059	2017-11-01 00:47:45 +00:00
Daniel Sanders	198447a447	[globalisel][tablegen] Stop hard-coding the emitted instruction ID to 0. NFC The next commit will add support for multi-instruction emission so we need to start allocating instruction ID's instead of hard-coding them to 0. llvm-svn: 317057	2017-11-01 00:29:47 +00:00
Jake Ehrlich	cde781015f	Add system-linux to allow tests run with llvm-lit to restrict themselves to linux I need a test that only runs in a reasonable amount of time on systems that have sparse files. The broadest class of systems that support sparse files are linux systems. So restricting my test to linux systems should suffice. This change adds the system-linux feature to llvm-lit so that it can be required. Differential Revision: https://reviews.llvm.org/D39482 llvm-svn: 317055	2017-11-01 00:18:51 +00:00
Benjamin Kramer	f9ab3ddb8f	[AMDGPU] Clean up symbols in the global namespace. llvm-svn: 317051	2017-10-31 23:21:30 +00:00
Mitch Phillips	7db6f7a344	Parse DWARF information to reduce false positives. Summary: Help differentiate code and data by parsing DWARF information. This will reduce false positive rates where data is placed in executable sections and is mistakenly parsed as code, resulting in an inflation in the number of indirect CF instructions (and hence an inflation of the number of unprotected). Also prints the DWARF line data around the region of each indirect CF instruction. Reviewers: pcc Subscribers: probinson, llvm-commits, vlad.tsyrklevich, mgorny, aprantl, kcc Differential Revision: https://reviews.llvm.org/D38654 llvm-svn: 317050	2017-10-31 23:20:05 +00:00
Daniel Sanders	7438b26317	Re-commit: [globalisel][tablegen] Keep track of the insertion point while adding BuildMIAction's. NFC Multi-instruction emission needs to ensure the the instructions are generated a depth-first fashion. For example: (ADDWrr (SUBWrr a, b), c) needs to emit the SUBWrr before the ADDWrr. However, our walk over TreePatternNode's is highly context sensitive which makes it difficult to append BuildMIActions in the order we want. To fix this, we now keep track of the insertion point as we add actions. This will allow multi-insn emission to insert BuildMI's in the correct place. The previous commit failed on the Ubuntu bots using GCC 4.8. These bots lack the const_iterator forms of insert() and emplace() that were added in C++11. As a result I've switched the const_iterators to iterators. llvm-svn: 317049	2017-10-31 23:03:18 +00:00
Philip Reames	357cd3289e	[SimplifyIndVar] Inline makIVComparisonInvariant to eleminate code duplication [NFC] This formulation might be slightly slower since I eagerly compute the cheap replacements. If anyone sees this having a compile time impact, let me know and I'll use lazy population instead. llvm-svn: 317048	2017-10-31 22:56:16 +00:00
Peter Collingbourne	aedb4bf37f	Object: Move some code from ELF.h into ELF.cpp. Differential Revision: https://reviews.llvm.org/D39271 llvm-svn: 317046	2017-10-31 22:49:23 +00:00
Peter Collingbourne	4240c5861b	Inline compareAddr function into its only caller. NFCI. llvm-svn: 317045	2017-10-31 22:49:09 +00:00
Daniel Sanders	f69d1b018c	Revert r317040: [globalisel][tablegen] Keep track of the insertion point while adding BuildMIAction's. NFC The same bots fail but I believe I know what the issue is now. These bots are missing the const_iterator versions of insert/emplace/etc. that were introduced in C++11. llvm-svn: 317042	2017-10-31 21:54:52 +00:00
Reid Kleckner	bc6f52da82	[codeview] Merge file checksum entries for DIFiles with the same absolute path Change the map key from DIFile* to the absolute path string. Computing the absolute path isn't expensive because we already have a map that caches the full path keyed on DIFile*. llvm-svn: 317041	2017-10-31 21:52:15 +00:00
Daniel Sanders	374f71ac90	Re-commit: [globalisel][tablegen] Keep track of the insertion point while adding BuildMIAction's. NFC Multi-instruction emission needs to ensure the the instructions are generated a depth-first fashion. For example: (ADDWrr (SUBWrr a, b), c) needs to emit the SUBWrr before the ADDWrr. However, our walk over TreePatternNode's is highly context sensitive which makes it difficult to append BuildMIActions in the order we want. To fix this, we now keep track of the insertion point as we add actions. This will allow multi-insn emission to insert BuildMI's in the correct place. The previous commit failed on the Ubuntu bots using GCC 4.8. These bots didn't like a call to emplace(). I've replaced it with insert() to see if it's a quirk of the C++11 support. llvm-svn: 317040	2017-10-31 21:34:53 +00:00
Marek Olsak	5914ece6aa	AMDGPU: Select s_buffer_load_dword with a non-constant SGPR offset Summary: Apps that benefit: - alien isolation - bioshock infinite - civilization: beyond earth - company of heroes 2 - dirt showdown - dota 2 - F1 2015 - grid autosport - hitman - legend of grimrock - serious sam 3: bfe - shadow warrior - talos principle - total war: warhammer - UE4 demos: effects cave, elemental, sun temple Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D38914 llvm-svn: 317038	2017-10-31 21:06:42 +00:00

1 2 3 4 5 ...

156136 Commits