llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Haarman	4d2711fbb5	[codeview] respect signedness of APSInts when printing to YAML Summary: This fixes a bug where we always treat APSInts in Codeview as signed when writing them to YAML. One symptom of this problem is that llvm-pdbdump raw would show Enumerator Values that differ between the original PDB and a PDB that has been round-tripped through YAML. Reviewers: zturner Reviewed By: zturner Subscribers: llvm-commits, fhahn Differential Revision: https://reviews.llvm.org/D34013 llvm-svn: 305965	2017-06-21 22:31:52 +00:00
Stanislav Mekhanoshin	a8b26936d0	[AMDGPU] Combine add and adde, sub and sube If one of the arguments of adde/sube is zero we can fold another add/sub into it. Differential Revision: https://reviews.llvm.org/D34374 llvm-svn: 305964	2017-06-21 22:30:01 +00:00
Sam Clegg	705f798bff	Mark dump() methods as const. NFC Add const qualifier to any dump() method where adding one was trivial. Differential Revision: https://reviews.llvm.org/D34481 llvm-svn: 305963	2017-06-21 22:19:17 +00:00
Stanislav Mekhanoshin	e3eb42cef6	[AMDGPU] simplify add x, *ext (setcc) => addc|subb x, 0, setcc This simplification avoids generating v_cndmask_b32 to serialize condition code between compare and use. Differential Revision: https://reviews.llvm.org/D34300 llvm-svn: 305962	2017-06-21 22:05:06 +00:00
NAKAMURA Takumi	1b587358be	TableGen.cmake: Use DEPFILE for Ninja Generator with CMake>=3.7. CMake emits build targets as relative paths (from build.ninja) but Ninja doesn't identify absolute path (in *.d) as relative path (in build.ninja). So, let file names, in the command line, relative from ${CMAKE_BINARY_DIR}, where build.ninja is. Note that tblgen is executed on ${CMAKE_BINARY_DIR} as working directory. Differential Revision: https://reviews.llvm.org/D33707 llvm-svn: 305961	2017-06-21 22:04:07 +00:00
Dehao Chen	014db29b89	Enable vectorizer-maximize-bandwidth by default. Summary: vectorizer-maximize-bandwidth is generally useful in terms of performance. I've tested the impact of changing this to default on speccpu benchmarks on sandybridge machines. The result shows non-negative impact: spec/2006/fp/C++/444.namd 26.84 -0.31% spec/2006/fp/C++/447.dealII 46.19 +0.89% spec/2006/fp/C++/450.soplex 42.92 -0.44% spec/2006/fp/C++/453.povray 38.57 -2.25% spec/2006/fp/C/433.milc 24.54 -0.76% spec/2006/fp/C/470.lbm 41.08 +0.26% spec/2006/fp/C/482.sphinx3 47.58 -0.99% spec/2006/int/C++/471.omnetpp 22.06 +1.87% spec/2006/int/C++/473.astar 22.65 -0.12% spec/2006/int/C++/483.xalancbmk 33.69 +4.97% spec/2006/int/C/400.perlbench 33.43 +1.70% spec/2006/int/C/401.bzip2 23.02 -0.19% spec/2006/int/C/403.gcc 32.57 -0.43% spec/2006/int/C/429.mcf 40.35 +0.27% spec/2006/int/C/445.gobmk 26.96 +0.06% spec/2006/int/C/456.hmmer 24.4 +0.19% spec/2006/int/C/458.sjeng 27.91 -0.08% spec/2006/int/C/462.libquantum 57.47 -0.20% spec/2006/int/C/464.h264ref 46.52 +1.35% geometric mean +0.29% The regression on 453.povray seems real, but is due to secondary effects as all hot functions are bit-identical with and without the flag. I started this patch to consult upstream opinions on this. It will be greatly appreciated if the community can help test the performance impact of this change on other architectures so that we can decide if this should be target-dependent. Reviewers: hfinkel, mkuper, davidxl, chandlerc Reviewed By: chandlerc Subscribers: rengolin, sanjoy, javed.absar, bjope, dorit, magabari, RKSimon, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D33341 llvm-svn: 305960	2017-06-21 22:01:32 +00:00
Krzysztof Parzyszek	5b933fee3c	[Hexagon] Use MachineInstrBuilder instead of changing instruction in place llvm-svn: 305953	2017-06-21 21:03:34 +00:00
Sam Clegg	9fa8af6f82	Rename WinCOFFStreamer.cpp -> MCWinCOFFStreamer.cpp For consistency with other MC*Streamer.cpp files and the header file. Differential Revision: https://reviews.llvm.org/D34466 llvm-svn: 305952	2017-06-21 20:58:17 +00:00
Nirav Dave	6919b9e9f0	Add Aarch64 ldst-opt test. llvm-svn: 305951	2017-06-21 20:50:07 +00:00
Davide Italiano	cae62546ac	[Target/Mips] Add test associated with r305949. llvm-svn: 305950	2017-06-21 20:42:34 +00:00
Davide Italiano	75ed943def	[Target] Implement the ".rdata" MIPS assembly directive. Patch by John Baldwin < jhb at freebsd dot org >! Differential Revision: https://reviews.llvm.org/D34452 llvm-svn: 305949	2017-06-21 20:40:27 +00:00
Davide Italiano	9b8e3d308f	[Solaris] emit .init_array instead of .ctors on Solaris (Sparc/x86) Patch by Fedor Sergeev. Differential Revision: https://reviews.llvm.org/D33868 llvm-svn: 305948	2017-06-21 20:36:32 +00:00
Craig Topper	34caf5396f	[Reassociate] Use early returns in a couple places to reduce indentation and improve readability. NFC llvm-svn: 305946	2017-06-21 19:39:35 +00:00
Craig Topper	99a2e89920	[Reassociate] Const correct a helper function. NFC llvm-svn: 305945	2017-06-21 19:39:33 +00:00
Wolfgang Pieb	258927e3da	[DWARF] Support for DW_FORM_strx3 and complete support for DW_FORM_strx{1,2,4} (consumer). Reviewer: aprantl Differential Revision: https://reviews.llvm.org/D34418 llvm-svn: 305944	2017-06-21 19:37:44 +00:00
Krzysztof Parzyszek	fd048cc0ec	[Hexagon] Handle more types of immediate operands in expand-condsets llvm-svn: 305943	2017-06-21 19:21:30 +00:00
Craig Topper	a074c101e5	[InstCombine] Cleanup using commutable matchers. Make a couple helper methods standalone static functions. Put 'if' around variable declaration instead of after. NFC llvm-svn: 305941	2017-06-21 18:57:00 +00:00
whitequark	ed54b4a798	Add a "probe-stack" attribute This attribute is used to ensure the guard page is triggered on stack overflow. Stack frames larger than the guard page size will generate a call to __probestack to touch each page so the guard page won't be skipped. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D34386 llvm-svn: 305939	2017-06-21 18:46:50 +00:00
Michael Kruse	47f856095a	[BasicAA] Use MayAlias instead of PartialAlias for fallback. Using various methods, BasicAA tries to determine whether two GetElementPtr memory locations alias when their base pointers are known to be equal. When none of its heuristics are applicable, it falls back to PartialAlias to, according to a comment, protect against TBAA making a wrong decision in case of unions and malloc. PartialAlias is not correct, because a PartialAlias result implies that some, but not all, bytes overlap, which is not necessarily the case here. AAResults returns the first analysis result that is not MayAlias. BasicAA is always the first alias analysis. When it returns PartialAlias, no other analysis is queried to give a more exact result (which was the intention of returning PartialAlias instead of MayAlias). For instance, ScopedAA could return a more accurate result. The PartialAlias hack was introduced in r131781 (and re-applied in r132632 after some reverts) to fix llvm.org/PR9971 where TBAA returns a wrong NoAlias result due to a union. A test case for the malloc case mentioned in the comment was not provided and I don't think it is affected since it returns an omnipotent char anyway. Since r303851 (https://reviews.llvm.org/D33328) clang does not emit specific TBAA for unions anymore (but "omnipotent char" instead). Hence, the PartialAlias workaround is not required anymore. This patch passes the test-suite and check-llvm/check-clang of a self-hosted build on x64. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D34318 llvm-svn: 305938	2017-06-21 18:25:37 +00:00
Peter Collingbourne	afaeed5322	Object: Have the irsymtab builder take a string table builder. NFCI. This will be needed in order to share the irsymtab string table with the bitcode string table. Differential Revision: https://reviews.llvm.org/D33971 llvm-svn: 305937	2017-06-21 18:23:19 +00:00
Sanjay Patel	2a6f9f8adf	[CGP, memcmp] replace CreateZextOrTrunc with CreateZext because it can never trunc llvm-svn: 305936	2017-06-21 18:20:52 +00:00
Sanjay Patel	a10f5b626d	[CGP] fix variables to be unsigned in memcmp expansion llvm-svn: 305935	2017-06-21 18:06:13 +00:00
Dehao Chen	50f2aa19e8	Do not inline recursive direct calls in sample loader pass. Summary: r305009 disables recursive inlining for indirect calls in sample loader pass. The same logic applies to direct recursive calls. Reviewers: iteratee, davidxl Reviewed By: iteratee Subscribers: sanjoy, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D34456 llvm-svn: 305934	2017-06-21 17:57:43 +00:00
Reid Kleckner	d0e6e24a53	[PDB] Add symbols to the PDB Summary: The main complexity in adding symbol records is that we need to "relocate" all the type indices. Type indices do not have anything like relocations, an opaque data structure describing where to find existing type indices for fixups. The linker just has to "know" where the type references are in the symbol records. I added an overload of `discoverTypeIndices` that works on symbol records, and it seems to be able to link the standard library. Reviewers: zturner, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D34432 llvm-svn: 305933	2017-06-21 17:25:56 +00:00
Lei Huang	84dbbfdeb9	[PowerPC] define target hook isReallyTriviallyReMaterializable() Define target hook isReallyTriviallyReMaterializable() to explicitly specify PowerPC instructions that are trivially rematerializable. This will allow the MachineLICM pass to accurately identify PPC instructions that should always be hoisted. Differential Revision: https://reviews.llvm.org/D34255 llvm-svn: 305932	2017-06-21 17:17:56 +00:00
Sanjay Patel	deed579140	[x86] set the datalayout to match the RUN line triple; NFC I don't think there's any visible difference from having the wrong layout for the 32-bit case at this point, but that could change in the future. llvm-svn: 305931	2017-06-21 17:06:24 +00:00
Craig Topper	5b173f2bb3	[InstCombine] Add range metadata to cttz/ctlz/ctpop intrinsic calls based on known bits Summary: I noticed that passing known bits across these intrinsics isn't great at capturing the information we really know. Turning known bits of the input into known bits of a count output isn't able to convey a lot of what we really know. This patch adds range metadata to these intrinsics based on the known bits. Currently the patch punts if we already have range metadata present. Reviewers: spatel, RKSimon, davide, majnemer Reviewed By: RKSimon Subscribers: sanjoy, hfinkel, llvm-commits Differential Revision: https://reviews.llvm.org/D32582 llvm-svn: 305927	2017-06-21 16:32:35 +00:00
Craig Topper	ae86cc725d	[InstCombine] Don't let folding (select (icmp eq (and X, C1), 0), Y, (or Y, C2)) create more instructions than it removes Summary: Previously this folding had no checks to see if it was going to result in fewer instructions. This was pointed out during the review of D34184. This patch adds code to count how many instructions it's going to create vs. how many it's going to remove so we can make a proper decision. Reviewers: spatel, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34437 llvm-svn: 305926	2017-06-21 16:07:13 +00:00
Craig Topper	cbac691c4b	[Reassociate] Support xor reassociating for splat vectors Summary: This patch adds support for xors of splat vectors. Reviewers: mcrosier Reviewed By: mcrosier Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34354 llvm-svn: 305925	2017-06-21 16:07:09 +00:00
Dmitry Preobrazhensky	851a3d9f05	[AMDGPU][MC][GFX9] Corrected VOP3P relevant code to fix disassembler failures See Bug 33509: https://bugs.llvm.org//show_bug.cgi?id=33509 Reviewers: Sam Kolton, Artem Tamazov, Valery Pykhtin Differential Revision: https://reviews.llvm.org/D34360 llvm-svn: 305923	2017-06-21 16:00:54 +00:00
Nirav Dave	c1b6aa77bb	[DAG] Move BaseIndexOffset into separate Library. NFC. Move BaseIndexOffset analysis out of DAGCombiner for use in other files. llvm-svn: 305921	2017-06-21 15:40:43 +00:00
David Blaikie	8f9621ae04	ClangFormat some changes from r305226 Post commit review feedback from Justin Bogner llvm-svn: 305919	2017-06-21 15:20:46 +00:00
Christof Douma	1ee68828b2	[AARCH64][LSE] Preliminary support for ARMv8.1 LSE Atomics. Added test file for ARMv8.1 LSE Atomics that I forgot to include in commit r305893. Patch by Ananth Jasty. Differential Revision: https://reviews.llvm.org/D33586 Change-Id: Ic1ad8ed87c1b584c4c791b459a686c866a3c3087 llvm-svn: 305918	2017-06-21 15:18:39 +00:00
Nirav Dave	9a69d444a3	[DAG] Remove Node construction from BaseIndexOffset match. NFCI. Move GlobalAddress Offset decomposition from initial match into comparison check, removing the possibility of constructing a new offset global address when examining addresses. llvm-svn: 305917	2017-06-21 15:07:30 +00:00
Simon Pilgrim	550cb7e82c	[X86][SSE] Dropped -mcpu from 256-bit vector shuffle tests Use triple and attribute only for consistency llvm-svn: 305916	2017-06-21 14:51:23 +00:00
Dmitry Preobrazhensky	dc4ac823ec	[AMDGPU][MC] Corrected V_QSAD instructions to check that dest register is different than any of the src See Bug 33279: https://bugs.llvm.org//show_bug.cgi?id=33279 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D34003 llvm-svn: 305915	2017-06-21 14:41:34 +00:00
Sanjay Patel	cec6a500a8	[x86] fix formatting; NFC llvm-svn: 305914	2017-06-21 14:27:11 +00:00
Simon Pilgrim	9d0c2b7bad	[X86][SSE] Dropped -mcpu from 128-bit vector shuffle tests Use triple and attribute only for consistency llvm-svn: 305913	2017-06-21 14:23:02 +00:00
Simon Pilgrim	5309b7d5c9	[X86][SSE] Regenerate merge store tests llvm-svn: 305910	2017-06-21 13:46:42 +00:00
Simon Pilgrim	e74e08fe61	[X86][SSE] Dropped -mcpu from vector blend shuffle tests and regenerate Use triple and attribute only for consistency llvm-svn: 305909	2017-06-21 13:45:33 +00:00
Simon Pilgrim	98aab7c6fc	[X86][SSE] Dropped -mcpu from vector shuffle tests Use triple and attribute only for consistency llvm-svn: 305908	2017-06-21 13:26:52 +00:00
Simon Pilgrim	6d5d6b542b	[X86][SSE] Dropped -mcpu from vector zero extend tests Use triple and attribute only for consistency llvm-svn: 305907	2017-06-21 13:17:14 +00:00
Simon Pilgrim	c388ec32e0	[X86][SSE] Dropped -mcpu from variable shuffle tests Use triple and attribute only for consistency llvm-svn: 305906	2017-06-21 13:15:41 +00:00
Simon Pilgrim	73814a2594	[X86][AVX] Add AVX1 shuffle truncation tests llvm-svn: 305905	2017-06-21 12:58:56 +00:00
Simon Pilgrim	db6c3fa872	[X86][SSE] Add SSE2/SSE42 shuffle truncation tests llvm-svn: 305904	2017-06-21 12:58:19 +00:00
Zvi Rackover	845ca8fba9	[X86] Rerun the update_llc_test_checks tool on test. NFC. llvm-svn: 305897	2017-06-21 11:21:43 +00:00
Pavel Labath	2c1e8b7a7e	Fix build after r305892 Make sure to #include <cerrno> in Support/Errno.h llvm-svn: 305895	2017-06-21 11:10:02 +00:00
Christof Douma	c1c28051d2	[AARCH64][LSE] Preliminary support for ARMv8.1 LSE Atomics. Implemented support to AArch64 codegen for ARMv8.1 Large System Extensions atomic instructions. Where supported, these instructions can provide atomic operations with higher performance. Currently supported operations include: fetch_add, fetch_or, fetch_xor, fetch_smin, fetch_min/max (signed and unsigned), swap, and compare_exchange. This implementation implies sequential-consistency ordering, more relaxed ordering is under development. Subtarget->hasLSE is currently supported for Cavium ThunderX2T99. Patch by Ananth Jasty. Differential Revision: https://reviews.llvm.org/D33586 Change-Id: I82f6d3d64255622791ceb0715b7ab9f4dc4d4b2c llvm-svn: 305893	2017-06-21 10:58:31 +00:00
Pavel Labath	1f6aea2eb3	[Support] Add RetryAfterSignal helper function Summary: This function retries an operation if it was interrupted by a signal (failed with EINTR). It's inspired by the TEMP_FAILURE_RETRY macro in glibc, but I've turned that into a template function. I've also added a fail-value argument, to enable the function to be used with e.g. fopen(3), which is documented to fail for any reason that open(2) can fail (which includes EINTR). The main user of this function will be lldb, but there were also a couple of uses within llvm that I could simplify using this function. Reviewers: zturner, silvas, joerg Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D33895 llvm-svn: 305892	2017-06-21 10:55:34 +00:00
Florian Hahn	8552e591a1	[AArch64] Add early exit to promoteLoadFromStore. There should be at most a single kill flag for the promoted operand between the store/load pair. Discussed in https://reviews.llvm.org/D34402. llvm-svn: 305889	2017-06-21 09:51:52 +00:00
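
The RetryAfterSignal commit (1f6aea2eb3) describes a template function that retries an operation when it fails with EINTR, with a fail-value argument so it can wrap calls whose failure value is not -1. The following is a minimal sketch of that idea and is not taken from the LLVM sources: the name retryAfterSignal, its parameter order, and the helpers flakyRead and alwaysFails are assumptions made for illustration only.

```cpp
#include <cerrno>

// Hypothetical sketch (not LLVM's implementation): call Fn(As...) again
// as long as it reports failure (returns FailValue) with errno == EINTR.
template <typename FailT, typename Fun, typename... Args>
auto retryAfterSignal(const FailT &FailValue, const Fun &Fn,
                      const Args &...As) -> decltype(Fn(As...)) {
  decltype(Fn(As...)) Result;
  do {
    errno = 0; // clear stale errno so an old EINTR cannot trigger a retry
    Result = Fn(As...);
  } while (Result == FailValue && errno == EINTR);
  return Result;
}

// Hypothetical stand-in for an interruptible call: it fails with EINTR
// on the first two attempts and returns Value on the third.
static int CallCount = 0;
int flakyRead(int Value) {
  if (++CallCount < 3) {
    errno = EINTR;
    return -1;
  }
  return Value;
}

// Hypothetical stand-in for a call that fails for a reason other than
// EINTR; the wrapper must not retry it.
int alwaysFails(int) {
  errno = EBADF;
  return -1;
}
```

With these helpers, retryAfterSignal(-1, flakyRead, 42) calls flakyRead three times before returning its value, while retryAfterSignal(-1, alwaysFails, 7) returns -1 after a single call. Passing the fail value explicitly is what lets the same wrapper serve functions such as fopen(3), whose failure value is a null pointer.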
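
The codeview commit (4d2711fbb5) fixes printing that always treated APSInts as signed. The sketch below shows why signedness has to be respected when a fixed-width value is rendered as text: the same bit pattern yields different decimal strings. SmallSInt and toDecimal are hypothetical names invented here; this is not llvm::APSInt or the YAML printing code.

```cpp
#include <cstdint>
#include <string>

// Hypothetical stand-in for an integer that carries its own signedness
// (not llvm::APSInt). Width must be in the range 1..64.
struct SmallSInt {
  uint64_t Bits;
  unsigned Width;
  bool IsUnsigned;
};

// Render the value as decimal text, honoring the signedness flag.
std::string toDecimal(const SmallSInt &V) {
  const uint64_t Mask = V.Width == 64 ? ~0ULL : ((1ULL << V.Width) - 1);
  const uint64_t Raw = V.Bits & Mask;
  if (V.IsUnsigned)
    return std::to_string(Raw);

  const uint64_t SignBit = 1ULL << (V.Width - 1);
  if (Raw & SignBit) {
    // Two's complement magnitude within Width bits: 2^Width - Raw.
    const uint64_t Magnitude = (~Raw & Mask) + 1;
    return "-" + std::to_string(Magnitude);
  }
  return std::to_string(Raw);
}
```

For the 8-bit pattern 0xFF, the unsigned reading gives "255" and the signed reading gives "-1". A writer that always picks the signed reading would change an unsigned enumerator value when the data is round-tripped through text, which matches the symptom the commit message reports.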
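
The InstCombine commit (5b173f2bb3) attaches range metadata to cttz/ctlz/ctpop calls based on known bits. As a rough illustration for the ctpop case only, the sketch below derives a half-open range for a 32-bit population count from masks of known-zero and known-one bits. KnownBits32 and ctpopRange are hypothetical names; the real patch works on LLVM's own known-bits representation and may differ in detail.

```cpp
#include <bitset>
#include <cstdint>
#include <utility>

// Hypothetical 32-bit known-bits record: a set bit in Zero means that bit
// of the value is known to be 0, a set bit in One means it is known to be 1.
struct KnownBits32 {
  uint32_t Zero;
  uint32_t One;
};

// Return the half-open range [Lo, Hi) that the population count of any
// value consistent with K must fall into.
std::pair<unsigned, unsigned> ctpopRange(const KnownBits32 &K) {
  // Every known-one bit is certainly counted.
  const unsigned Min = static_cast<unsigned>(std::bitset<32>(K.One).count());
  // Every bit that is not known-zero might be counted.
  const unsigned Max =
      32 - static_cast<unsigned>(std::bitset<32>(K.Zero).count());
  return {Min, Max + 1};
}
```

If the top 24 bits are known zero and bit 0 is known one, the count lies in [1, 9). Expressing this through known bits of the result alone could only say that the count fits in four bits, which is the loss of precision the commit message points at.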
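
The BasicAA commit (47f856095a) relies on the statement that AAResults returns the first analysis result that is not MayAlias. The sketch below models only that chaining rule, to show why a PartialAlias fallback from the first analysis hides a more precise answer from a later one. The AliasResult enum, the Analysis alias, and queryChain are simplified stand-ins written for this example, not LLVM's classes.

```cpp
#include <functional>
#include <vector>

// Simplified stand-in for the alias query results named in the commit.
enum class AliasResult { NoAlias, MayAlias, PartialAlias, MustAlias };

// Hypothetical analysis: a callable that answers one fixed query.
using Analysis = std::function<AliasResult()>;

// Ask each analysis in order and return the first answer that is not
// MayAlias; if none is more precise, the overall answer is MayAlias.
AliasResult queryChain(const std::vector<Analysis> &Chain) {
  for (const Analysis &A : Chain) {
    const AliasResult R = A();
    if (R != AliasResult::MayAlias)
      return R;
  }
  return AliasResult::MayAlias;
}
```

If the first analysis falls back to PartialAlias, a second analysis that would answer NoAlias is never consulted. If the first falls back to MayAlias instead, the second analysis gets to answer, which is the behavior change the commit describes.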


150495 Commits