llvm-project

Commit Graph

Author	SHA1	Message	Date
Petr Hosek	b1bb3e542e	Revert "[llvm][llvm-objcopy] Added support for outputting to binary in llvm-objcopy" This breaks the build on clang-s390x-linux. This reverts commit r310018. llvm-svn: 310026	2017-08-04 05:33:44 +00:00
Craig Topper	c2d3c631ff	[InstCombine] Move the call to foldSelectICmpAnd into foldSelectInstWithICmp. NFCI llvm-svn: 310025	2017-08-04 05:12:37 +00:00
Craig Topper	a86ca08d26	[InstCombine] Remove unnecessary casts. NFC We're calling an overload of getOpcode that already returns Instruction::CastOps. llvm-svn: 310024	2017-08-04 05:12:35 +00:00
Max Kazantsev	ba1e70e060	Fix SCEVExitLimitForget tests to make Sanitizer happy llvm-svn: 310023	2017-08-04 05:06:44 +00:00
Daniel Jasper	3f47a5c054	Prevent unused warning in non-assert builds (introduced in r310014). llvm-svn: 310022	2017-08-04 05:05:29 +00:00
Victor Leschuk	56b03d0dd6	Un-revert r310014: false revert, it wasn't the cause of build break llvm-svn: 310021	2017-08-04 04:51:15 +00:00
Victor Leschuk	21713ebfb1	Revert r310014 as it breaks build lld-x86_64-darwin13 llvm-svn: 310020	2017-08-04 04:43:54 +00:00
Petr Hosek	10fbc733d0	Reland "[llvm][llvm-objcopy] Added support for outputting to binary in llvm-objcopy" This change adds the "-O binary" flag which directs llvm-objcopy to output the object file to the same format as GNU objcopy does when given the flag "-O binary". This was done by splitting the Object class into two subclasses ObjectELF and ObjectBianry which each output a different format but relay on the same code to read in the Object in Object. Patch by Jake Ehrlich Differential Revision: https://reviews.llvm.org/D34480 llvm-svn: 310018	2017-08-04 03:17:37 +00:00
Reid Kleckner	02aeadcf3d	[Support] Update comments about stdout, raw_fd_ostream, and outs() The full story is in the comments: // Do not attempt to close stdout or stderr. We used to try to maintain the // property that tools that support writing file to stdout should not also // write informational output to stdout, but in practice we were never able to // maintain this invariant. Many features have been added to LLVM and clang // (-fdump-record-layouts, optimization remarks, etc) that print to stdout, so // users must simply be aware that mixed output and remarks is a possibility. NFC, I am just updating comments to reflect reality. llvm-svn: 310016	2017-08-04 01:39:23 +00:00
Adrian Prantl	fd8c8e9fe6	Teach GlobalSRA to update the debug info for split-up globals. This is similar to what we are doing in "regular" SROA and creates DW_OP_LLVM_fragment operations to describe the resulting variables. rdar://problem/33654891 llvm-svn: 310014	2017-08-04 01:19:54 +00:00
Connor Abbott	00755362b9	[AMDGPU] Add missing hazard for DPP-after-EXEC-write Summary: Following the docs, we need at least 5 wait states between an EXEC write and an instruction that uses DPP. Reviewers: tstellar, arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D34849 llvm-svn: 310013	2017-08-04 01:09:43 +00:00
Vedant Kumar	846b985a92	[llvm-cov] Ignore unclosed line segments when setting line counts This patch makes a slight change to the way llvm-cov determines line execution counts. If there are multiple line segments on a line, the line count is the max count among the regions which start and end on the line. This avoids an issue posed by deferred regions which start on the same line as a terminated region, e.g: if (false) return; //< The line count should be 0, even though a new region //< starts at the semi-colon. foo(); Another change is that counts from line segments which don't correspond to region entries are considered. This enables the first change, and corrects an outstanding issue (see the showLineExecutionCounts.cpp test change). This is related to D35925. Testing: check-profile, llvm-cov lit tests Differential Revision: https://reviews.llvm.org/D36014 llvm-svn: 310012	2017-08-04 00:36:24 +00:00
Vedant Kumar	8a6223887c	[llvm-cov] NFC: make_unique-ify two allocations llvm-svn: 310011	2017-08-04 00:36:24 +00:00
George Karpenkov	5bd0503680	Disable libFuzzer tests on Windows Differential Revision: https://reviews.llvm.org/D36297 llvm-svn: 310009	2017-08-04 00:26:12 +00:00
Vitaly Buka	da611dc630	Revert "[TableGen] AsmMatcher: fix OpIdx computation when HasOptionalOperands is true" Breaks check-llvm under ubsan. This reverts commit r309949. llvm-svn: 310008	2017-08-04 00:25:24 +00:00
Matt Arsenault	5c921a9291	AMDGPU: Remove pointless asserts llvm-svn: 310007	2017-08-04 00:00:13 +00:00
Teresa Johnson	8482e56920	Use profile summary to disable peeling for huge working sets Summary: Detect when the working set size of a profiled application is huge, by comparing the number of counts required to reach the hot percentile in the profile summary to a large threshold. When the working set size is determined to be huge, disable peeling to avoid bloating the working set further. Note that the selected threshold (15K) is significantly larger than the largest working set value in SPEC cpu2006 (which is gcc at around 11K). Reviewers: davidxl Subscribers: mehdi_amini, mzolotukhin, eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D36288 llvm-svn: 310005	2017-08-03 23:42:58 +00:00
Matt Arsenault	a176cc5b93	AMDGPU: Don't use report_fatal_error for unsupported call types llvm-svn: 310004	2017-08-03 23:32:41 +00:00
Matt Arsenault	a202538bfa	AMDGPU: Remove error on calls for amdgcn Repurpose the -amdgpu-function-calls flag. Rather than require it to emit a call, only use it to run the always inline path or not. llvm-svn: 310003	2017-08-03 23:24:05 +00:00
Matt Arsenault	817c253e60	AMDGPU: Fix implicitarg.ptr handling special inputs llvm-svn: 310002	2017-08-03 23:12:44 +00:00
Martell Malone	346a5fdc9b	Support: WOA64 and WOA Signals Reviewers: rnk Differential Revision: https://reviews.llvm.org/D21813 llvm-svn: 310001	2017-08-03 23:12:33 +00:00
Zachary Turner	5869936844	[llvm-pdbutil] Add an option to only dump specific module indices. Often something interesting (like a symbol) is in a particular module, and you don't want to dump symbols from all other 300 modules to see the one you want. This adds a -modi option so that we only dump the specified module. llvm-svn: 310000	2017-08-03 23:11:52 +00:00
Matt Arsenault	8623e8d864	AMDGPU: Pass special input registers to functions llvm-svn: 309998	2017-08-03 23:00:29 +00:00
Eric Christopher	52854dcd34	Fix typo. llvm-svn: 309997	2017-08-03 22:41:12 +00:00
Matt Arsenault	7016f13450	AMDGPU: Add analysis pass for function argument info This will allow only adding necessary inputs to callee functions that need special inputs forwarded from the kernel. llvm-svn: 309996	2017-08-03 22:30:46 +00:00
Easwaran Raman	974d4eea93	[Inliner] Increase threshold for hot callsites without PGO. Summary: This increases the inlining threshold for hot callsites. Hotness is defined in terms of block frequency of the callsite relative to the caller's entry block's frequency. Since this requires BFI in the inliner, this only affects the new PM pipeline. This is enabled by default at -O3. This improves the performance of some internal benchmarks. Notably, an internal benchmark for Gipfeli compression (https://github.com/google/gipfeli) improves by ~7%. Povray in SPEC2006 improves by ~2.5%. I am running more experiments and will update the thread if other benchmarks show improvement/regression. In terms of text size, LLVM test-suite shows an 1.22% text size increase. Diving into the results, 13 of the benchmarks in the test-suite increases by > 10%. Most of these are small, but Adobe-C++/loop_unroll (17.6% increases) and tramp3d(20.7% size increase) have >250K text size. On a large application, the text size increases by 2% Reviewers: chandlerc, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36199 llvm-svn: 309994	2017-08-03 22:23:33 +00:00
Eugene Zelenko	79220eaeec	[Mips] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 309993	2017-08-03 22:12:30 +00:00
Matt Arsenault	a52391f2db	DAG: Provide access to Pass instance from SelectionDAG This allows accessing an analysis pass during lowering. llvm-svn: 309991	2017-08-03 21:54:00 +00:00
Quentin Colombet	250e050a50	[GlobalISel] Make GlobalISel a non-optional library. With this change, the GlobalISel library gets always built. In particular, this is not possible to opt GlobalISel out of the build using the LLVM_BUILD_GLOBAL_ISEL variable any more. llvm-svn: 309990	2017-08-03 21:52:25 +00:00
Davide Italiano	5974c31d91	[NewGVN] Fix the case where we have a phi-of-ops which goes away. Patch by Daniel Berlin, fixes PR33196 (and probably something else). llvm-svn: 309988	2017-08-03 21:17:49 +00:00
Reid Kleckner	175af4bcc7	[PDB] Fix section contributions Summary: PDB section contributions are supposed to use output section indices and offsets, not input section indices and offsets. This allows the debugger to look up the index of the module that it should look up in the modules stream for symbol information. With this change, windbg can now find line tables, but it still cannot print local variables. Fixes PR34048 Reviewers: zturner Subscribers: hiraditya, ruiu, llvm-commits Differential Revision: https://reviews.llvm.org/D36285 llvm-svn: 309987	2017-08-03 21:15:09 +00:00
Hiroshi Yamauchi	144ee2b4d7	[LVI] Constant-propagate a zero extension of the switch condition value through case edges Summary: (This is a second attempt as https://reviews.llvm.org/D34822 was reverted.) LazyValueInfo currently computes the constant value of the switch condition through case edges, which allows the constant value to be propagated through the case edges. But we have seen a case where a zero-extended value of the switch condition is used past case edges for which the constant propagation doesn't occur. This patch adds a small logic to handle such a case in getEdgeValueLocal(). This is motivated by the Python 2.7 eval loop in PyEval_EvalFrameEx() where the lack of the constant propagation causes longer live ranges and more spill code than necessary. With this patch, we see that the code size of PyEval_EvalFrameEx() decreases by ~5.4% and a performance test improves by ~4.6%. Reviewers: sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36247 llvm-svn: 309986	2017-08-03 21:11:30 +00:00
Taewook Oh	424c88c381	Move unit test to the proper location Summary: Move test/CodeGen/AArch64/reg-bank-128bit.mir to test/CodeGen/AArch64/GlobalISel/reg-bank-128bit.mir so that the test is executed only when global-isel is enabled. lit.local.cfg under test/CodeGen/AArch64/GlobalISel checks if 'global-isel' is in the available_features while the same file under test/CodeGen/AArch64 doesn't. Reviewers: qcolombet, davide Reviewed By: davide Subscribers: davide, aemerson, javed.absar, igorb, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D36282 llvm-svn: 309985	2017-08-03 21:07:12 +00:00
Zachary Turner	33eee1911a	[llvm-pdbutil] Allow diff to force module equivalencies. Sometimes the normal module equivalence detection algorithm doesn't quite work. For example, you might build the same program with MSVC and clang-cl, outputting to different object files, exes, and PDBs, then compare them. If the object files have different names though, then they won't be treated as equivalent. This way we can force specific module indices to be treated as equivalent. llvm-svn: 309983	2017-08-03 20:30:09 +00:00
George Karpenkov	f020c98912	[libFuzzer] Un-reverting change in tests after fixing the failure on Linux. Differential Revision: https://reviews.llvm.org/D36242 llvm-svn: 309982	2017-08-03 20:28:16 +00:00
Connor Abbott	82267a553d	test commit llvm-svn: 309981	2017-08-03 20:22:30 +00:00
Nico Weber	0a920e76b8	Fix llvm-for-windows-on-linux build after LLVM r272701. The file is called "intrin.h". When building targeting Windows on a Linux system, with the SDK mounted in a case-insensitive file system, "Intrin.h" will miss clang's intrin.h header (because that's not in a case-insensitive file system) but then find intrin.h in the Microsoft SDK. clang can't handle the SDK's intrin.h. https://reviews.llvm.org/D36281 llvm-svn: 309980	2017-08-03 20:10:47 +00:00
Greg Bedwell	fd2c379568	Fix check-lit compatibility with multi-config CMake generators Multi-configuration CMake generators such as those for Visual Studio or Xcode do not specify a build config at configure time, but let the user choose at build time. In these cases binaries go into build/${Configuration}/bin rather than build/bin. Prior to this commit, check-lit would fail when using multi-configuration generators as it did not know how to resolve ${Configuration} in order to find tools such as FileCheck. This commit teaches it to resolve llvm_tools_dir within lit using the value specified with --param build_mode. Differential Revision: https://reviews.llvm.org/D36263 llvm-svn: 309967	2017-08-03 17:55:54 +00:00
Teresa Johnson	9a18a6f08b	Disable loop peeling during full unrolling pass. Summary: Peeling should not occur during the full unrolling invocation early in the pipeline, but rather later with partial and runtime loop unrolling. The later loop unrolling invocation will also eventually utilize profile summary and branch frequency information, which we would like to use to control peeling. And for ThinLTO we want to delay peeling until the backend (post thin link) phase, just as we do for most types of unrolling. Ensure peeling doesn't occur during the full unrolling invocation by adding a parameter to the shared implementation function, similar to the way partial and runtime loop unrolling are disabled. Performance results for ThinLTO suggest this has a neutral to positive effect on some internal benchmarks. Reviewers: chandlerc, davidxl Subscribers: mzolotukhin, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D36258 llvm-svn: 309966	2017-08-03 17:52:38 +00:00
Dehao Chen	f58df39529	Do not want to use BFI to get profile count for sample pgo Summary: For SamplePGO, we already record the callsite count in the call instruction itself. So we do not want to use BFI to get profile count as it is less accurate. Reviewers: tejohnson, davidxl, eraman Reviewed By: eraman Subscribers: sanjoy, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D36025 llvm-svn: 309964	2017-08-03 17:11:41 +00:00
Simon Pilgrim	99d0ab38bc	[X86] Adding a test for vector shuffle extractions. When both the vector inputs of the shuffle vector is comprising of same vector or shuffle mask is accessing elements from only one operand vector (like in PR33758 test already present). Committed on behalf of @jbhateja (Jatin Bhateja) Differential Revision: https://reviews.llvm.org/D36271 llvm-svn: 309963	2017-08-03 17:04:59 +00:00
Tim Northover	869fa74d4b	Revert "[AArch64] Simplify AES*Tied pseudo expansion (NFC)." This reverts commit r309821. My suggestion was wrong because it left the MachineOperands tied which confused the verifier. Since there's no easy way to untie operands, the original BuildMI solution is probably best. llvm-svn: 309962	2017-08-03 16:59:36 +00:00
Simon Pilgrim	0dfd4fa8d3	[X86][AVX512] Tidied up v64i8 vector shuffle tests with triple llvm-svn: 309961	2017-08-03 16:56:52 +00:00
Changpeng Fang	ef4dbb46da	AMDGPU/SI: Don't fix a PHI under uniform branch in SIFixSGPRCopies only when sources and destination are all sgprs Summary: If a PHI has at lease one VGPR operand, we have to fix the PHI in SIFixSGPRCopies. Reviewer: Matt Differential Revision: http://reviews.llvm.org/D34727 llvm-svn: 309959	2017-08-03 16:37:02 +00:00
Benjamin Kramer	a73b410805	Fix use after free in unit test. llvm-svn: 309952	2017-08-03 15:59:37 +00:00
Nirav Dave	3fc1c2365c	[DAG] Allow merging of stores of vector loads Remove restriction disallowing merging of stores vector loads into larger store of larger vector load. Reviewers: RKSimon, efriedma, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36158 llvm-svn: 309951	2017-08-03 15:51:20 +00:00
Nico Weber	bfde70b097	Revert r309923, it caused PR34045. llvm-svn: 309950	2017-08-03 15:41:26 +00:00
Nirav Dave	a0003984e5	[TableGen] AsmMatcher: fix OpIdx computation when HasOptionalOperands is true Consider the following instruction: "inst.eq $dst, $src" where ".eq" is an optional flag operand. The $src and $dst operands are registers. If we parse the instruction "inst r0, r1", the flag is not present and it will be marked in the "OptionalOperandsMask" variable. After the matching is complete we call the "convertToMCInst" method. The current implementation works only if the optional operands are at the end of the array. The "Operands" array looks like [token:"inst", reg:r0, reg:r1]. The first operand that must be added to the MCInst is the destination, the r0 register. The "OpIdx" (in the Operands array) for this register is 2. However, since the flag is not present in the Operands, the actual index for r0 should be 1. The flag is not present since we rely on the default value. This patch removes the "NumDefaults" variable and replaces it with an array (DefaultsOffset). This array contains an index for each operand (excluding the mnemonic). At each index, the array contains the number of optional operands that should be subtracted. For the previous example, this array looks like this: [0, 1, 1]. When we need to access the r0 register, we compute its index as 2 - DefaultsOffset[1] = 1. Patch by Alexandru Guduleasa! Reviewers: SamWot, nhaustov, niravd Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35998 llvm-svn: 309949	2017-08-03 15:40:21 +00:00
Sanjay Patel	7cf745cc94	[NewGVN] fix typos; NFC llvm-svn: 309946	2017-08-03 15:18:27 +00:00
Sanjay Patel	c0fc1c0c8b	[BDCE] add tests to show invalid/incomplete transforms llvm-svn: 309945	2017-08-03 15:07:37 +00:00

1 2 3 4 5 ...

152599 Commits