llvm-project

Commit Graph

Author	SHA1	Message	Date
Sam Kolton	3c4933fcc6	[AMDGPU] SDWA: add support for GFX9 in peephole pass Summary: Added support based on merged SDWA pseudo instructions. Now the peephole allows one scalar operand, omod and clamp modifiers. Added several subtarget features for GFX9 SDWA. This diff also contains changes from D34026. Depends D34026 Reviewers: vpykhtin, rampitec, arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D34241 llvm-svn: 305986	2017-06-22 06:26:41 +00:00
Craig Topper	71e2c1611e	[InstCombine] Add test cases to demonstrate that and->xnor and or->xnor folding can create more instructions than it removed when there are multiple uses. NFC llvm-svn: 305985	2017-06-22 05:20:39 +00:00
Hiroshi Inoue	1d5693c915	[PowerPC] fix potential verification errors This patch fixes trivial mishandling of 32-bit/64-bit instructions that may cause verification errors with -verify-machineinstrs. llvm-svn: 305984	2017-06-22 04:33:44 +00:00
Igor Kudrin	393563a0ce	[ELF] Add an apostrophe after a file name when reporting discarded sections. Differential Revision: https://reviews.llvm.org/D34442 llvm-svn: 305983	2017-06-22 04:07:58 +00:00
Reid Kleckner	b7d716c06f	[llvm-readobj] Dump the COFF image load config This includes the safe SEH tables and the control flow guard function table. LLD will emit the guard table soon, and I need a tool that dumps them for testing. llvm-svn: 305979	2017-06-22 01:10:29 +00:00
Reid Kleckner	ef5817579b	[wasm] Fix WebAssembly asm backend after r305968 llvm-svn: 305978	2017-06-22 01:07:05 +00:00
Marshall Clow	f74609b15f	Add some catch(...) blocks to the tests so that if they fail, we get a good error message. No functional change. llvm-svn: 305977	2017-06-22 00:49:03 +00:00
Rafael Espindola	f9df429068	Also test thumb. llvm-svn: 305976	2017-06-22 00:44:05 +00:00
Davide Italiano	7a6c5c12ad	Revert "[Target] Implement the ".rdata" MIPS assembly directive." This reverts commit r305949 and r305950 as they didn't have the correct commit message. llvm-svn: 305973	2017-06-22 00:11:41 +00:00
Alex Shlyapnikov	f3cc7cc3d8	[Sanitizers] 32 bit allocator respects allocator_may_return_null flag Summary: Make SizeClassAllocator32 return nullptr when it encounters OOM, which allows the entire sanitizer's allocator to follow allocator_may_return_null=1 policy, even for small allocations (LargeMmapAllocator is already fixed by D34243). Will add a test for OOM in primary allocator later, when SizeClassAllocator64 can gracefully handle OOM too. Reviewers: eugenis Subscribers: kubamracek, llvm-commits Differential Revision: https://reviews.llvm.org/D34433 llvm-svn: 305972	2017-06-22 00:02:37 +00:00
Sam Clegg	fe6414b043	[WebAssembly] Cleanup WasmObjectWriter.cpp. NFC - Use auto where appropriate - Use early return to reduce nesting - Remove stray comment line - Use C++ foreach over explicit iterator Differential Revision: https://reviews.llvm.org/D34477 llvm-svn: 305971	2017-06-21 23:46:41 +00:00
Stanislav Mekhanoshin	3ed38c601a	[AMDGPU] Add FP_CLASS to the add/setcc combine This is one of the nodes which also compile as v_cmp_*. Differential Revision: https://reviews.llvm.org/D34485 llvm-svn: 305970	2017-06-21 23:46:22 +00:00
Eugene Zelenko	72208a8226	[ProfileData, Support] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 305969	2017-06-21 23:19:47 +00:00
Rafael Espindola	88d9e37ec8	Use a MutableArrayRef. NFC. llvm-svn: 305968	2017-06-21 23:06:53 +00:00
Rafael Espindola	6da25f4fc4	Fix build. llvm-svn: 305967	2017-06-21 23:02:57 +00:00
Bob Haarman	4d2711fbb5	[codeview] respect signedness of APSInts when printing to YAML Summary: This fixes a bug where we always treat APSInts in Codeview as signed when writing them to YAML. One symptom of this problem is that llvm-pdbdump raw would show Enumerator Values that differ between the original PDB and a PDB that has been round-tripped through YAML. Reviewers: zturner Reviewed By: zturner Subscribers: llvm-commits, fhahn Differential Revision: https://reviews.llvm.org/D34013 llvm-svn: 305965	2017-06-21 22:31:52 +00:00
Stanislav Mekhanoshin	a8b26936d0	[AMDGPU] Combine add and adde, sub and sube If one of the arguments of adde/sube is zero we can fold another add/sub into it. Differential Revision: https://reviews.llvm.org/D34374 llvm-svn: 305964	2017-06-21 22:30:01 +00:00
Sam Clegg	705f798bff	Mark dump() methods as const. NFC Add const qualifier to any dump() method where adding one was trivial. Differential Revision: https://reviews.llvm.org/D34481 llvm-svn: 305963	2017-06-21 22:19:17 +00:00
Stanislav Mekhanoshin	e3eb42cef6	[AMDGPU] simplify add x, *ext (setcc) => addc|subb x, 0, setcc This simplification allows to avoid generating v_cndmask_b32 to serialize condition code between compare and use. Differential Revision: https://reviews.llvm.org/D34300 llvm-svn: 305962	2017-06-21 22:05:06 +00:00
NAKAMURA Takumi	1b587358be	TableGen.cmake: Use DEPFILE for Ninja Generator with CMake>=3.7. CMake emits build targets as relative paths (from build.ninja) but Ninja doesn't identify absolute path (in *.d) as relative path (in build.ninja). So, let file names, in the command line, relative from ${CMAKE_BINARY_DIR}, where build.ninja is. Note that tblgen is executed on ${CMAKE_BINARY_DIR} as working directory. Differential Revision: https://reviews.llvm.org/D33707 llvm-svn: 305961	2017-06-21 22:04:07 +00:00
Dehao Chen	014db29b89	Enable vectorizer-maximize-bandwidth by default. Summary: vectorizer-maximize-bandwidth is generally useful in terms of performance. I've tested the impact of changing this to default on speccpu benchmarks on sandybridge machines. The result shows non-negative impact: spec/2006/fp/C++/444.namd 26.84 -0.31% spec/2006/fp/C++/447.dealII 46.19 +0.89% spec/2006/fp/C++/450.soplex 42.92 -0.44% spec/2006/fp/C++/453.povray 38.57 -2.25% spec/2006/fp/C/433.milc 24.54 -0.76% spec/2006/fp/C/470.lbm 41.08 +0.26% spec/2006/fp/C/482.sphinx3 47.58 -0.99% spec/2006/int/C++/471.omnetpp 22.06 +1.87% spec/2006/int/C++/473.astar 22.65 -0.12% spec/2006/int/C++/483.xalancbmk 33.69 +4.97% spec/2006/int/C/400.perlbench 33.43 +1.70% spec/2006/int/C/401.bzip2 23.02 -0.19% spec/2006/int/C/403.gcc 32.57 -0.43% spec/2006/int/C/429.mcf 40.35 +0.27% spec/2006/int/C/445.gobmk 26.96 +0.06% spec/2006/int/C/456.hmmer 24.4 +0.19% spec/2006/int/C/458.sjeng 27.91 -0.08% spec/2006/int/C/462.libquantum 57.47 -0.20% spec/2006/int/C/464.h264ref 46.52 +1.35% geometric mean +0.29% The regression on 453.povray seems real, but is due to secondary effects as all hot functions are bit-identical with and without the flag. I started this patch to consult upstream opinions on this. It will be greatly appreciated if the community can help test the performance impact of this change on other architectures so that we can decide if this should be target-dependent. Reviewers: hfinkel, mkuper, davidxl, chandlerc Reviewed By: chandlerc Subscribers: rengolin, sanjoy, javed.absar, bjope, dorit, magabari, RKSimon, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D33341 llvm-svn: 305960	2017-06-21 22:01:32 +00:00
Arnold Schwaighofer	7b871611b9	SwiftCC: Perform physical layout when computing coercion types We need to take type alignment padding into account when computing physical layouts. The layout must be compatible with the input layout, offsets are defined in terms of offsets within a packed struct which are computed in terms of the alloc size of a type. Using the store size we would insert padding for the following type for example: struct { int3 v; long long l; } __attribute((packed)) On x86-64 int3 is padded to int4 alignment. The swiftcc type would be <{ <3 x float>, [4 x i8], i64 }> which is not compatible with <{ <3 x float>, i64 }>. The latter has i64 at offset 16 and the former at offset 20. rdar://32618125 llvm-svn: 305956	2017-06-21 21:43:40 +00:00
Eric Fiselier	0509238077	Attempt to avoid static init ordering issues with globalMemCounter llvm-svn: 305955	2017-06-21 21:42:50 +00:00
Peter Collingbourne	bac3570d53	ELF: Don't dereference Repl in MarkLive. NFCI. This is unnecessary because --gc-sections runs before ICF. Differential Revision: https://reviews.llvm.org/D34465 llvm-svn: 305954	2017-06-21 21:29:51 +00:00
Krzysztof Parzyszek	5b933fee3c	[Hexagon] Use MachineInstrBuilder instead of changing instruction in place llvm-svn: 305953	2017-06-21 21:03:34 +00:00
Sam Clegg	9fa8af6f82	Rename WinCOFFStreamer.cpp -> MCWinCOFFStreamer.cpp For consistency with other MC*Streamer.cpp files and the header file. Differential Revision: https://reviews.llvm.org/D34466 llvm-svn: 305952	2017-06-21 20:58:17 +00:00
Nirav Dave	6919b9e9f0	Add Aarch64 ldst-opt test. llvm-svn: 305951	2017-06-21 20:50:07 +00:00
Davide Italiano	cae62546ac	[Target/Mips] Add test associated with r305949. llvm-svn: 305950	2017-06-21 20:42:34 +00:00
Davide Italiano	75ed943def	[Target] Implement the ".rdata" MIPS assembly directive. Patch by John Baldwin < jhb at freebsd dot org >! Differential Revision: https://reviews.llvm.org/D34452 llvm-svn: 305949	2017-06-21 20:40:27 +00:00
Davide Italiano	9b8e3d308f	[Solaris] emit .init_array instead of .ctors on Solaris (Sparc/x86) Patch by Fedor Sergeev. Differential Revision: https://reviews.llvm.org/D33868 llvm-svn: 305948	2017-06-21 20:36:32 +00:00
George Burgess IV	798feb4147	[test] Make absolute line numbers relative; NFC Done to remove noise from https://reviews.llvm.org/D32332 (and to make this test more resilient to changes in general). llvm-svn: 305947	2017-06-21 19:59:05 +00:00
Craig Topper	34caf5396f	[Reassociate] Use early returns in a couple places to reduce indentation and improve readability. NFC llvm-svn: 305946	2017-06-21 19:39:35 +00:00
Craig Topper	99a2e89920	[Reassociate] Const correct a helper function. NFC llvm-svn: 305945	2017-06-21 19:39:33 +00:00
Wolfgang Pieb	258927e3da	[DWARF] Support for DW_FORM_strx3 and complete support for DW_FORM_strx{1,2,4} (consumer). Reviewer: aprantl Differential Revision: https://reviews.llvm.org/D34418 llvm-svn: 305944	2017-06-21 19:37:44 +00:00
Krzysztof Parzyszek	fd048cc0ec	[Hexagon] Handle more types of immediate operands in expand-condsets llvm-svn: 305943	2017-06-21 19:21:30 +00:00
Justin Bogner	dd862f9106	[sanitizer-coverage] Stop marking this test as unsupported on Darwin The bug that was causing this to fail was fixed in r305429. llvm-svn: 305942	2017-06-21 19:04:59 +00:00
Craig Topper	a074c101e5	[InstCombine] Cleanup using commutable matchers. Make a couple helper methods standalone static functions. Put 'if' around variable declaration instead of after. NFC llvm-svn: 305941	2017-06-21 18:57:00 +00:00
Argyrios Kyrtzidis	d750e1c491	[preprocessor] Fix assertion hit when 'SingleFileParseMode' option is enabled and #if with an undefined identifier and without #else 'HandleEndifDirective' asserts that 'WasSkipping' is false, so switch to using 'FoundNonSkip' as the hint for 'SingleFileParseMode' to keep going with parsing. llvm-svn: 305940	2017-06-21 18:52:44 +00:00
whitequark	ed54b4a798	Add a "probe-stack" attribute This attribute is used to ensure the guard page is triggered on stack overflow. Stack frames larger than the guard page size will generate a call to __probestack to touch each page so the guard page won't be skipped. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D34386 llvm-svn: 305939	2017-06-21 18:46:50 +00:00
Michael Kruse	47f856095a	[BasicAA] Use MayAlias instead of PartialAlias for fallback. Using various methods, BasicAA tries to determine whether two GetElementPtr memory locations alias when its base pointers are known to be equal. When none of its heuristics are applicable, it falls back to PartialAlias to, according to a comment, protect TBAA making a wrong decision in case of unions and malloc. PartialAlias is not correct, because a PartialAlias result implies that some, but not all, bytes overlap which is not necessarily the case here. AAResults returns the first analysis result that is not MayAlias. BasicAA is always the first alias analysis. When it returns PartialAlias, no other analysis is queried to give a more exact result (which was the intention of returning PartialAlias instead of MayAlias). For instance, ScopedAA could return a more accurate result. The PartialAlias hack was introduced in r131781 (and re-applied in r132632 after some reverts) to fix llvm.org/PR9971 where TBAA returns a wrong NoAlias result due to a union. A test case for the malloc case mentioned in the comment was not provided and I don't think it is affected since it returns an omnipotent char anyway. Since r303851 (https://reviews.llvm.org/D33328) clang does not emit specific TBAA for unions anymore (but "omnipotent char" instead). Hence, the PartialAlias workaround is not required anymore. This patch passes the test-suite and check-llvm/check-clang of a self-hosted build on x64. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D34318 llvm-svn: 305938	2017-06-21 18:25:37 +00:00
Peter Collingbourne	afaeed5322	Object: Have the irsymtab builder take a string table builder. NFCI. This will be needed in order to share the irsymtab string table with the bitcode string table. Differential Revision: https://reviews.llvm.org/D33971 llvm-svn: 305937	2017-06-21 18:23:19 +00:00
Sanjay Patel	2a6f9f8adf	[CGP, memcmp] replace CreateZextOrTrunc with CreateZext because it can never trunc llvm-svn: 305936	2017-06-21 18:20:52 +00:00
Sanjay Patel	a10f5b626d	[CGP] fix variables to be unsigned in memcmp expansion llvm-svn: 305935	2017-06-21 18:06:13 +00:00
Dehao Chen	50f2aa19e8	Do not inline recursive direct calls in sample loader pass. Summary: r305009 disables recursive inlining for indirect calls in sample loader pass. The same logic applies to direct recursive calls. Reviewers: iteratee, davidxl Reviewed By: iteratee Subscribers: sanjoy, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D34456 llvm-svn: 305934	2017-06-21 17:57:43 +00:00
Reid Kleckner	d0e6e24a53	[PDB] Add symbols to the PDB Summary: The main complexity in adding symbol records is that we need to "relocate" all the type indices. Type indices do not have anything like relocations, an opaque data structure describing where to find existing type indices for fixups. The linker just has to "know" where the type references are in the symbol records. I added an overload of `discoverTypeIndices` that works on symbol records, and it seems to be able to link the standard library. Reviewers: zturner, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D34432 llvm-svn: 305933	2017-06-21 17:25:56 +00:00
Lei Huang	84dbbfdeb9	[PowerPC] define target hook isReallyTriviallyReMaterializable() Define target hook isReallyTriviallyReMaterializable() to explicitly specify PowerPC instructions that are trivially rematerializable. This will allow the MachineLICM pass to accurately identify PPC instructions that should always be hoisted. Differential Revision: https://reviews.llvm.org/D34255 llvm-svn: 305932	2017-06-21 17:17:56 +00:00
Sanjay Patel	deed579140	[x86] set the datalayout to match the RUN line triple; NFC I don't think there's any visible difference from having the wrong layout for the 32-bit case at this point, but that could change in the future. llvm-svn: 305931	2017-06-21 17:06:24 +00:00
Rui Ueyama	0f8a345fb4	Use -NOT prefix instead of adding `not` to FileCheck. If we want to make sure that a particular string is not in an output, the regular way of doing it is to add `-NOT` prefix instead of checking if FileCheck resulted in an error. Differential Revision: https://reviews.llvm.org/D34435 llvm-svn: 305930	2017-06-21 16:50:38 +00:00
Rui Ueyama	28ea8c7ad7	[COFF] Set MajorLinkerVersion to 14 instead of 0. This works around a strange interaction with Authenticode signatures, in which a signed PE executable with {Major,Minor}LinkerVersion = 0.0 fails to validate on Windows 7 (but is OK on Windows 10). Setting the linker version to 14.0 (which is what VS2015 outputs) makes it work again. Patch by Simon Tatham <simon.tatham@arm.com>. llvm-svn: 305929	2017-06-21 16:42:08 +00:00
Erich Keane	4bd39300ef	Correct VectorCall x86 (32 bit) behavior for SSE Register Assignment In running some internal vectorcall tests in 32 bit mode, we discovered that the behavior I'd previously implemented for x64 (and applied to x32) regarding the assignment of SSE registers was incorrect. See spec here: https://msdn.microsoft.com/en-us/library/dn375768.aspx My previous implementation applied register argument position from the x64 version to both. This isn't correct for x86, so this removes and refactors that section. Additionally, it corrects the integer/int-pointer assignments. Unlike x64, x86 permits integers to be assigned independent of position. Finally, the code for 32 bit was cleaned up a little to clarify the intent, as well as given a descriptive comment. Differential Revision: https://reviews.llvm.org/D34455 llvm-svn: 305928	2017-06-21 16:37:22 +00:00
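
Note on r305938 ([BasicAA] Use MayAlias instead of PartialAlias for fallback): the commit message says that AAResults returns the first analysis result that is not MayAlias, so a PartialAlias fallback from the first analysis stops any later analysis from being asked. The following is a minimal Python sketch of that chaining rule only. It is not LLVM code; `query_chain`, `basic_aa_old`, `basic_aa_new`, and `scoped_aa` are hypothetical stand-ins invented for illustration, and the stand-in analyses return fixed answers.

```python
from enum import Enum


class AliasResult(Enum):
    NO_ALIAS = "NoAlias"
    MAY_ALIAS = "MayAlias"
    PARTIAL_ALIAS = "PartialAlias"
    MUST_ALIAS = "MustAlias"


def query_chain(analyses, a, b):
    """Return the first result that is not MayAlias, else MayAlias.

    Hypothetical model of the rule described in the commit message.
    """
    for analysis in analyses:
        result = analysis(a, b)
        if result is not AliasResult.MAY_ALIAS:
            return result
    return AliasResult.MAY_ALIAS


def basic_aa_old(a, b):
    # Stand-in for the old fallback: answer PartialAlias when no heuristic applies.
    return AliasResult.PARTIAL_ALIAS


def basic_aa_new(a, b):
    # Stand-in for the new fallback: answer MayAlias when no heuristic applies.
    return AliasResult.MAY_ALIAS


def scoped_aa(a, b):
    # Stand-in for a later analysis that happens to know a more exact answer.
    return AliasResult.NO_ALIAS


old_result = query_chain([basic_aa_old, scoped_aa], "p", "q")
new_result = query_chain([basic_aa_new, scoped_aa], "p", "q")
print(old_result.value, new_result.value)
```

With the old fallback the chain stops at the first stand-in; with the new fallback the second stand-in is consulted and its answer is returned.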
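
Note on r305956 (SwiftCC: Perform physical layout when computing coercion types): the commit message gives i64 offsets of 16 and 20 for the two packed struct types it compares. The arithmetic can be checked with a short Python sketch, assuming that fields of a packed struct are placed back to back using each field's alloc size and that the three-element vector has an alloc size of 16 bytes (12 bytes of data padded to 16). `packed_offsets` and the size constants are hypothetical names used only for this illustration.

```python
def packed_offsets(alloc_sizes):
    """Offsets of fields laid out back to back, each taking its alloc size."""
    offsets = []
    offset = 0
    for size in alloc_sizes:
        offsets.append(offset)
        offset += size
    return offsets


# Assumed alloc sizes in bytes (illustrative, taken from the commit's x86-64 example).
VEC3_ALLOC = 16  # three 4-byte elements, padded to 16
PAD4_ALLOC = 4   # [4 x i8]
I64_ALLOC = 8    # i64

# <{ <3 x float>, [4 x i8], i64 }>: explicit padding on top of the alloc size
with_padding = packed_offsets([VEC3_ALLOC, PAD4_ALLOC, I64_ALLOC])

# <{ <3 x float>, i64 }>: the alloc size already covers the padding
without_padding = packed_offsets([VEC3_ALLOC, I64_ALLOC])

print(with_padding[-1], without_padding[-1])
```

The last offsets come out as 20 and 16, matching the values quoted in the commit message.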


265242 Commits, All Branches