llvm-project

Commit Graph

Author	SHA1	Message	Date
David Stuttard	70e8bc1bf3	[AMDGPU] Add intrinsics for tbuffer load and store Intrinsic already existed for llvm.SI.tbuffer.store Needed tbuffer.load and also re-implementing the intrinsic as llvm.amdgcn.tbuffer.* Added CodeGen tests for the 2 new variants added. Left the original llvm.SI.tbuffer.store implementation to avoid issues with existing code Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye, tpr Differential Revision: https://reviews.llvm.org/D30687 llvm-svn: 306031	2017-06-22 16:29:22 +00:00
Krzysztof Parzyszek	9bdb460f64	[Hexagon] Fix typo in a testcase llvm-svn: 306030	2017-06-22 16:25:46 +00:00
Craig Topper	dffbbcb3fd	[InstCombine] Teach foldSelectICmpAndOr to recognize (select (icmp slt (trunc (X)), 0), Y, (or Y, C2)) Summary: InstCombine likes to turn (icmp eq (and X, C1), 0) into (icmp slt (trunc (X)), 0) sometimes. This breaks foldSelectICmpAndOr's ability to recognize (select (icmp eq (and X, C1), 0), Y, (or Y, C2))->(or (shl (and X, C1), C3), y). This patch tries to recover this. I had to flip around some of the early out checks so that I could create a new And instruction during the compare processing without it possibly never getting used. Reviewers: spatel, majnemer, davide Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34184 llvm-svn: 306029	2017-06-22 16:23:30 +00:00
Teresa Johnson	a690e3cea2	[ThinLTO] Remove unnecessary include of Linker.h (NFC) The ModuleLinker is no longer used by ThinLTO, so this is not needed. Patch by Benoit Belley <Benoit.Belley@autodesk.com> llvm-svn: 306028	2017-06-22 16:18:48 +00:00
Craig Topper	0de5e6a729	[InstCombine] Add one use checks to or/and->xnor folding If the components of the and/or had multiple uses, this transform created an additional instruction. This patch makes sure we remove one of the components. Differential Revision: https://reviews.llvm.org/D34498 llvm-svn: 306027	2017-06-22 16:12:02 +00:00
Krzysztof Parzyszek	f63ad39e7d	[Hexagon] Handle a global operand to A2_addi when creating duplexes llvm-svn: 306012	2017-06-22 15:53:31 +00:00
Sanjay Patel	d1e811979c	[InstCombine] reverse bitcast + bitwise-logic canonicalization (PR33138) There are 2 parts to this patch made simultaneously to avoid a regression. We're reversing the canonicalization that moves bitwise vector ops before bitcasts. We're moving bitwise vector ops after bitcasts instead. That's the 1st and 3rd hunks of the patch. The motivation is that there's only one fold that currently depends on the existing canonicalization (see next), but there are many folds that would automatically benefit from the new canonicalization. PR33138 ( https://bugs.llvm.org/show_bug.cgi?id=33138 ) shows why/how we have these patterns in IR. There's an or(and,andn) pattern that requires an adjustment in order to continue matching to 'select' because the bitcast changes position. This match is unfortunately complicated because it requires 4 logic ops with optional bitcast and sext ops. Test diffs: 1. The bitcast.ll and bitcast-bigendian.ll changes show the most basic difference - bitcast comes before logic. 2. There are also tests with no diffs in bitcast.ll that verify that we're still doing folds that were enabled by the previous canonicalization. 3. icmp-xor-signbit.ll shows the payoff. We don't need to adjust existing icmp patterns to look through bitcasts. 4. logical-select.ll contains several tests for the or(and,andn) --> select fold to verify that we are still handling those cases. The lone diff shows the movement of the bitcast from the new canonicalization rule. Differential Revision: https://reviews.llvm.org/D33517 llvm-svn: 306011	2017-06-22 15:46:54 +00:00
whitequark	cebe8241ca	[X86] Add support for "probe-stack" attribute This commit adds prologue code emission for stack probe function calls. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D34387 llvm-svn: 306010	2017-06-22 15:42:53 +00:00
Florian Hahn	5991b5be74	[ARM] Create relocations for beq.w branches to ARM function syms. Summary: The ARM ELF ABI requires the linker to do interworking for wide conditional branches from Thumb code to ARM code. That was pointed out by @peter.smith in the comments for D33436. Reviewers: rafael, peter.smith, echristo Reviewed By: peter.smith Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits, peter.smith Differential Revision: https://reviews.llvm.org/D34447 llvm-svn: 306009	2017-06-22 15:32:41 +00:00
Sanjay Patel	e800df8eac	[InstCombine] add peekThroughBitcast() helper; NFC This is an NFC portion of D33517. We have similar helpers in the backend. llvm-svn: 306008	2017-06-22 15:28:01 +00:00
Petar Jovanovic	636851b845	[mips] Allow $AT to be used as a register name This patch allows $AT to be used as a register name in assembly files. Currently only $at is recognized as a valid register name. Patch by Stanislav Ocovaj. Differential Revision: https://reviews.llvm.org/D34348 llvm-svn: 306007	2017-06-22 15:24:16 +00:00
Nirav Dave	f2c349ccec	[DAG] Add Target Store Merge pass ordering function Allow targets to specify if they should merge stores before or after legalization. llvm-svn: 306006	2017-06-22 15:07:49 +00:00
Pavel Labath	efd57a8aec	Revert "[Support] Add RetryAfterSignal helper function" and subsequent fix The fix in r306003 uncovered a pretty fundamental problem that libc++ implementation of std::result_of does not handle the prototype of open(2) correctly (presumably because it contains ...). This makes the whole function unusable in its current form, so I am also reverting the original commit (r305892), which introduced the function, at least until I figure out a way to solve the libc++ issue. llvm-svn: 306005	2017-06-22 14:18:55 +00:00
Krzysztof Parzyszek	69ffba4595	[Hexagon] Recognize potential offset overflow for store-imm to stack Reserve an extra scavenging stack slot if the offset field in store-immediate instructions may overflow. llvm-svn: 306004	2017-06-22 14:11:23 +00:00
Pavel Labath	fafedb11ce	[Support] Fix return type deduction in RetryAfterSignal The default value of the ResultT template argument (which was there only to avoid spelling out the long std::result_of template multiple times) was being overridden by function call template argument deduction. This manifested itself as a compiler error when calling the function as FILE *X = RetryAfterSignal(nullptr, fopen, ...) because the function would try to assign the result of fopen to nullptr_t, but a more insidious side effect was that RetryAfterSignal(-1, read, ...) would return "int" instead of "ssize_t", losing precision along the way. I fix this by having the function take the argument in a way that prevents argument deduction from kicking in and add a test that makes sure the return type is correct. llvm-svn: 306003	2017-06-22 13:55:54 +00:00
Kamil Rytarowski	25374a6849	[Solaris] replace Solaris.h hack with a set of better hacks Summary: Got rid of unwieldy -include Solaris.h portability solution, replacing it with interposed header and moving endian defines into Host.h. Fixes PR28370. Reviewers: joerg, alekseyshl, mgorny Reviewed By: joerg Subscribers: llvm-commits, mgorny, ro, krytarowski Patch by Fedor Sergeev. Differential Revision: https://reviews.llvm.org/D3413 llvm-svn: 306002	2017-06-22 13:18:46 +00:00
Pavel Labath	674421e4de	[Testing/Support] Remove the const_cast in TakeExpected Summary: The const_cast in the "const" version of TakeExpected was quite dangerous, as the function does indeed modify the apparently const argument. I assume the reason the const overload was added was to make the function bind to xvalues(temporaries). That can be also achieved with rvalue references, so I use that instead. Using the ASSERT macros on const Expected objects will now become illegal, but I believe that is correct, as it is not actually possible to inspect the error stored in an Expected object without modifying it. Reviewers: zturner, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34405 llvm-svn: 306001	2017-06-22 13:11:50 +00:00
Sagar Thakur	15126308c8	Revert [mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld Reverting due to build bot failures llvm-svn: 306000	2017-06-22 12:48:04 +00:00
Sam Kolton	ca5a30ed74	[AMDGPU] SDWA: remove support for VOP2 instructions that have only 64-bit encoding Summary: Although these instructions are listed in VOP2, they are treated as VOP3 in specs. They should not support SDWA. There are no real instructions for them, but there are pseudo instructions. Reviewers: arsenm, vpykhtin, cfang Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D34403 llvm-svn: 305999	2017-06-22 12:42:14 +00:00
Kristof Beyls	9665249fd8	Don't conditionalize Neon instructions, even in IT blocks. This has been deprecated since ARMARM v7-AR, release C.b, published back in 2012. This also removes test/CodeGen/Thumb2/ifcvt-neon.ll that originally was introduced to check that conditionalization of Neon instructions did happen when generating Thumb2. However, the test had evolved and was no longer testing that. Rather than trying to adapt that test, this commit introduces test/CodeGen/Thumb2/ifcvt-neon-deprecated.mir, since we can now use the MIR framework to write nicer/more maintainable tests. llvm-svn: 305998	2017-06-22 12:11:38 +00:00
Sagar Thakur	f8858d0979	[mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld After the N64 static relocation model support was added to llvm it is required to add its support in RuntimeDyld also because lldb uses ExecutionEngine for evaluating expressions. Reviewed by sdardis Differential: D31649 llvm-svn: 305997	2017-06-22 11:49:19 +00:00
Simon Dardis	1c73fcc131	[mips] Implement the ".rdata" MIPS assembly directive. Rather than creating a separate ".rdata" section distinct from the customary ".rodata" in ELF, ".rdata" switches to the ".rodata" section. This patch relands r305949 and r305950 with the correct commit message and addresses nit raised during review. Patch By: John Baldwin! Differential Revision: https://reviews.llvm.org/D34452 llvm-svn: 305995	2017-06-22 10:41:51 +00:00
Ekaterina Vaartis	2d5ab6934e	Test commit llvm-svn: 305994	2017-06-22 10:38:49 +00:00
John Brawn	ed78aaf093	[ARM] Add .w aliases of MOV with shifted operand These appear to have been simply missing. Differential Revision: https://reviews.llvm.org/D34461 llvm-svn: 305993	2017-06-22 10:30:53 +00:00
John Brawn	192f74a84d	[ARM] Clean up choice of narrow instructions in ARMAsmParser, NFC This patch makes a couple of changes to how we decide whether to use the narrow or wide encoding of thumb2 instructions: * Common out the detection of the .w qualifier * Check for the CPSR operand in a consistent way Differential Revision: https://reviews.llvm.org/D34460 llvm-svn: 305992	2017-06-22 10:29:31 +00:00
Diana Picus	b512e91515	Revert "Enable vectorizer-maximize-bandwidth by default." This reverts commit r305960 because it broke self-hosting on AArch64. llvm-svn: 305990	2017-06-22 10:00:28 +00:00
Igor Breger	1c29be7e4f	[GlobalISel][X86] Support vector type G_INSERT legalization/selection. Summary: Support vector type G_INSERT legalization/selection. Split from https://reviews.llvm.org/D33665 Reviewers: qcolombet, t.p.northover, zvi, guyblank Reviewed By: guyblank Subscribers: guyblank, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33956 llvm-svn: 305989	2017-06-22 09:43:35 +00:00
Florian Hahn	b489e56ae2	[ARM] Add macro fusion for AES instructions. Summary: This patch adds a macro fusion using CodeGen/MacroFusion.cpp to pair AES instructions back to back and adds FeatureFuseAES to enable the feature. Reviewers: evandro, javed.absar, rengolin, t.p.northover Reviewed By: javed.absar Subscribers: aemerson, mgorny, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34142 llvm-svn: 305988	2017-06-22 09:39:36 +00:00
Elena Demikhovsky	2dac0b4d58	AVX-512: Lowering Masked Gather intrinsic - fixed a bug Masked gather for vector length 2 is lowered incorrectly for element type i32. The type <2 x i32> was automatically extended to <2 x i64> and we generated VPGATHERQQ instead of VPGATHERQD. The type <2 x float> is extended to <4 x float>, so there is no bug for this type, but the sequence may be more optimal. In this patch I'm fixing the <2 x i32> bug and optimizing the <2 x float> sequence for GATHERs only. The same fix should be done for Scatters as well. Differential revision: https://reviews.llvm.org/D34343 llvm-svn: 305987	2017-06-22 06:47:41 +00:00
Sam Kolton	3c4933fcc6	[AMDGPU] SDWA: add support for GFX9 in peephole pass Summary: Added support based on merged SDWA pseudo instructions. The peephole pass now allows one scalar operand, omod and clamp modifiers. Added several subtarget features for GFX9 SDWA. This diff also contains changes from D34026. Depends D34026 Reviewers: vpykhtin, rampitec, arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D34241 llvm-svn: 305986	2017-06-22 06:26:41 +00:00
Craig Topper	71e2c1611e	[InstCombine] Add test cases to demonstrate that and->xnor and or->xnor folding can create more instructions than it removed when there are multiple uses. NFC llvm-svn: 305985	2017-06-22 05:20:39 +00:00
Hiroshi Inoue	1d5693c915	[PowerPC] fix potential verification errors This patch fixes trivial mishandling of 32-bit/64-bit instructions that may cause verification errors with -verify-machineinstrs. llvm-svn: 305984	2017-06-22 04:33:44 +00:00
Reid Kleckner	b7d716c06f	[llvm-readobj] Dump the COFF image load config This includes the safe SEH tables and the control flow guard function table. LLD will emit the guard table soon, and I need a tool that dumps them for testing. llvm-svn: 305979	2017-06-22 01:10:29 +00:00
Reid Kleckner	ef5817579b	[wasm] Fix WebAssembly asm backend after r305968 llvm-svn: 305978	2017-06-22 01:07:05 +00:00
Rafael Espindola	f9df429068	Also test thumb. llvm-svn: 305976	2017-06-22 00:44:05 +00:00
Davide Italiano	7a6c5c12ad	Revert "[Target] Implement the ".rdata" MIPS assembly directive." This reverts commit r305949 and r305950 as they didn't have the correct commit message. llvm-svn: 305973	2017-06-22 00:11:41 +00:00
Sam Clegg	fe6414b043	[WebAssembly] Cleanup WasmObjectWriter.cpp. NFC - Use auto where appropriate - Use early return to reduce nesting - Remove stray comment line - Use C++ foreach over explicit iterator Differential Revision: https://reviews.llvm.org/D34477 llvm-svn: 305971	2017-06-21 23:46:41 +00:00
Stanislav Mekhanoshin	3ed38c601a	[AMDGPU] Add FP_CLASS to the add/setcc combine This is one of the nodes which also compile as v_cmp_*. Differential Revision: https://reviews.llvm.org/D34485 llvm-svn: 305970	2017-06-21 23:46:22 +00:00
Eugene Zelenko	72208a8226	[ProfileData, Support] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 305969	2017-06-21 23:19:47 +00:00
Rafael Espindola	88d9e37ec8	Use a MutableArrayRef. NFC. llvm-svn: 305968	2017-06-21 23:06:53 +00:00
Rafael Espindola	6da25f4fc4	Fix build. llvm-svn: 305967	2017-06-21 23:02:57 +00:00
Bob Haarman	4d2711fbb5	[codeview] respect signedness of APSInts when printing to YAML Summary: This fixes a bug where we always treat APSInts in Codeview as signed when writing them to YAML. One symptom of this problem is that llvm-pdbdump raw would show Enumerator Values that differ between the original PDB and a PDB that has been round-tripped through YAML. Reviewers: zturner Reviewed By: zturner Subscribers: llvm-commits, fhahn Differential Revision: https://reviews.llvm.org/D34013 llvm-svn: 305965	2017-06-21 22:31:52 +00:00
Stanislav Mekhanoshin	a8b26936d0	[AMDGPU] Combine add and adde, sub and sube If one of the arguments of adde/sube is zero we can fold another add/sub into it. Differential Revision: https://reviews.llvm.org/D34374 llvm-svn: 305964	2017-06-21 22:30:01 +00:00
Sam Clegg	705f798bff	Mark dump() methods as const. NFC Add const qualifier to any dump() method where adding one was trivial. Differential Revision: https://reviews.llvm.org/D34481 llvm-svn: 305963	2017-06-21 22:19:17 +00:00
Stanislav Mekhanoshin	e3eb42cef6	[AMDGPU] simplify add x, *ext (setcc) => addc|subb x, 0, setcc This simplification avoids generating v_cndmask_b32 to serialize condition code between compare and use. Differential Revision: https://reviews.llvm.org/D34300 llvm-svn: 305962	2017-06-21 22:05:06 +00:00
NAKAMURA Takumi	1b587358be	TableGen.cmake: Use DEPFILE for Ninja Generator with CMake>=3.7. CMake emits build targets as relative paths (from build.ninja) but Ninja doesn't identify absolute path (in *.d) as relative path (in build.ninja). So, let file names, in the command line, relative from ${CMAKE_BINARY_DIR}, where build.ninja is. Note that tblgen is executed on ${CMAKE_BINARY_DIR} as working directory. Differential Revision: https://reviews.llvm.org/D33707 llvm-svn: 305961	2017-06-21 22:04:07 +00:00
Dehao Chen	014db29b89	Enable vectorizer-maximize-bandwidth by default. Summary: vectorizer-maximize-bandwidth is generally useful in terms of performance. I've tested the impact of changing this to default on speccpu benchmarks on sandybridge machines. The result shows non-negative impact: spec/2006/fp/C++/444.namd 26.84 -0.31% spec/2006/fp/C++/447.dealII 46.19 +0.89% spec/2006/fp/C++/450.soplex 42.92 -0.44% spec/2006/fp/C++/453.povray 38.57 -2.25% spec/2006/fp/C/433.milc 24.54 -0.76% spec/2006/fp/C/470.lbm 41.08 +0.26% spec/2006/fp/C/482.sphinx3 47.58 -0.99% spec/2006/int/C++/471.omnetpp 22.06 +1.87% spec/2006/int/C++/473.astar 22.65 -0.12% spec/2006/int/C++/483.xalancbmk 33.69 +4.97% spec/2006/int/C/400.perlbench 33.43 +1.70% spec/2006/int/C/401.bzip2 23.02 -0.19% spec/2006/int/C/403.gcc 32.57 -0.43% spec/2006/int/C/429.mcf 40.35 +0.27% spec/2006/int/C/445.gobmk 26.96 +0.06% spec/2006/int/C/456.hmmer 24.4 +0.19% spec/2006/int/C/458.sjeng 27.91 -0.08% spec/2006/int/C/462.libquantum 57.47 -0.20% spec/2006/int/C/464.h264ref 46.52 +1.35% geometric mean +0.29% The regression on 453.povray seems real, but is due to secondary effects as all hot functions are bit-identical with and without the flag. I started this patch to consult upstream opinions on this. It will be greatly appreciated if the community can help test the performance impact of this change on other architectures so that we can decide if this should be target-dependent. Reviewers: hfinkel, mkuper, davidxl, chandlerc Reviewed By: chandlerc Subscribers: rengolin, sanjoy, javed.absar, bjope, dorit, magabari, RKSimon, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D33341 llvm-svn: 305960	2017-06-21 22:01:32 +00:00
Krzysztof Parzyszek	5b933fee3c	[Hexagon] Use MachineInstrBuilder instead of changing instruction in place llvm-svn: 305953	2017-06-21 21:03:34 +00:00
Sam Clegg	9fa8af6f82	Rename WinCOFFStreamer.cpp -> MCWinCOFFStreamer.cpp For consistency with other MC*Streamer.cpp files and the header file. Differential Revision: https://reviews.llvm.org/D34466 llvm-svn: 305952	2017-06-21 20:58:17 +00:00
Nirav Dave	6919b9e9f0	Add Aarch64 ldst-opt test. llvm-svn: 305951	2017-06-21 20:50:07 +00:00
Davide Italiano	cae62546ac	[Target/Mips] Add test associated with r305949. llvm-svn: 305950	2017-06-21 20:42:34 +00:00
Davide Italiano	75ed943def	[Target] Implement the ".rdata" MIPS assembly directive. Patch by John Baldwin < jhb at freebsd dot org >! Differential Revision: https://reviews.llvm.org/D34452 llvm-svn: 305949	2017-06-21 20:40:27 +00:00
Davide Italiano	9b8e3d308f	[Solaris] emit .init_array instead of .ctors on Solaris (Sparc/x86) Patch by Fedor Sergeev. Differential Revision: https://reviews.llvm.org/D33868 llvm-svn: 305948	2017-06-21 20:36:32 +00:00
Craig Topper	34caf5396f	[Reassociate] Use early returns in a couple places to reduce indentation and improve readability. NFC llvm-svn: 305946	2017-06-21 19:39:35 +00:00
Craig Topper	99a2e89920	[Reassociate] Const correct a helper function. NFC llvm-svn: 305945	2017-06-21 19:39:33 +00:00
Wolfgang Pieb	258927e3da	[DWARF] Support for DW_FORM_strx3 and complete support for DW_FORM_strx{1,2,4} (consumer). Reviewer: aprantl Differential Revision: https://reviews.llvm.org/D34418 llvm-svn: 305944	2017-06-21 19:37:44 +00:00
Krzysztof Parzyszek	fd048cc0ec	[Hexagon] Handle more types of immediate operands in expand-condsets llvm-svn: 305943	2017-06-21 19:21:30 +00:00
Craig Topper	a074c101e5	[InstCombine] Cleanup using commutable matchers. Make a couple helper methods standalone static functions. Put 'if' around variable declaration instead of after. NFC llvm-svn: 305941	2017-06-21 18:57:00 +00:00
whitequark	ed54b4a798	Add a "probe-stack" attribute This attribute is used to ensure the guard page is triggered on stack overflow. Stack frames larger than the guard page size will generate a call to __probestack to touch each page so the guard page won't be skipped. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D34386 llvm-svn: 305939	2017-06-21 18:46:50 +00:00
Michael Kruse	47f856095a	[BasicAA] Use MayAlias instead of PartialAlias for fallback. Using various methods, BasicAA tries to determine whether two GetElementPtr memory locations alias when its base pointers are known to be equal. When none of its heuristics are applicable, it falls back to PartialAlias to, according to a comment, protect TBAA making a wrong decision in case of unions and malloc. PartialAlias is not correct, because a PartialAlias result implies that some, but not all, bytes overlap which is not necessarily the case here. AAResults returns the first analysis result that is not MayAlias. BasicAA is always the first alias analysis. When it returns PartialAlias, no other analysis is queried to give a more exact result (which was the intention of returning PartialAlias instead of MayAlias). For instance, ScopedAA could return a more accurate result. The PartialAlias hack was introduced in r131781 (and re-applied in r132632 after some reverts) to fix llvm.org/PR9971 where TBAA returns a wrong NoAlias result due to a union. A test case for the malloc case mentioned in the comment was not provided and I don't think it is affected since it returns an omnipotent char anyway. Since r303851 (https://reviews.llvm.org/D33328) clang does not emit specific TBAA for unions anymore (but "omnipotent char" instead). Hence, the PartialAlias workaround is not required anymore. This patch passes the test-suite and check-llvm/check-clang of a self-hosted build on x64. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D34318 llvm-svn: 305938	2017-06-21 18:25:37 +00:00
Peter Collingbourne	afaeed5322	Object: Have the irsymtab builder take a string table builder. NFCI. This will be needed in order to share the irsymtab string table with the bitcode string table. Differential Revision: https://reviews.llvm.org/D33971 llvm-svn: 305937	2017-06-21 18:23:19 +00:00
Sanjay Patel	2a6f9f8adf	[CGP, memcmp] replace CreateZextOrTrunc with CreateZext because it can never trunc llvm-svn: 305936	2017-06-21 18:20:52 +00:00
Sanjay Patel	a10f5b626d	[CGP] fix variables to be unsigned in memcmp expansion llvm-svn: 305935	2017-06-21 18:06:13 +00:00
Dehao Chen	50f2aa19e8	Do not inline recursive direct calls in sample loader pass. Summary: r305009 disables recursive inlining for indirect calls in sample loader pass. The same logic applies to direct recursive calls. Reviewers: iteratee, davidxl Reviewed By: iteratee Subscribers: sanjoy, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D34456 llvm-svn: 305934	2017-06-21 17:57:43 +00:00
Reid Kleckner	d0e6e24a53	[PDB] Add symbols to the PDB Summary: The main complexity in adding symbol records is that we need to "relocate" all the type indices. Type indices do not have anything like relocations, an opaque data structure describing where to find existing type indices for fixups. The linker just has to "know" where the type references are in the symbol records. I added an overload of `discoverTypeIndices` that works on symbol records, and it seems to be able to link the standard library. Reviewers: zturner, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D34432 llvm-svn: 305933	2017-06-21 17:25:56 +00:00
Lei Huang	84dbbfdeb9	[PowerPC] define target hook isReallyTriviallyReMaterializable() Define target hook isReallyTriviallyReMaterializable() to explicitly specify PowerPC instructions that are trivially rematerializable. This will allow the MachineLICM pass to accurately identify PPC instructions that should always be hoisted. Differential Revision: https://reviews.llvm.org/D34255 llvm-svn: 305932	2017-06-21 17:17:56 +00:00
Sanjay Patel	deed579140	[x86] set the datalayout to match the RUN line triple; NFC I don't think there's any visible difference from having the wrong layout for the 32-bit case at this point, but that could change in the future. llvm-svn: 305931	2017-06-21 17:06:24 +00:00
Craig Topper	5b173f2bb3	[InstCombine] Add range metadata to cttz/ctlz/ctpop intrinsic calls based on known bits Summary: I noticed that passing known bits across these intrinsics isn't great at capturing the information we really know. Turning known bits of the input into known bits of a count output isn't able to convey a lot of what we really know. This patch adds range metadata to these intrinsics based on the known bits. Currently the patch punts if we already have range metadata present. Reviewers: spatel, RKSimon, davide, majnemer Reviewed By: RKSimon Subscribers: sanjoy, hfinkel, llvm-commits Differential Revision: https://reviews.llvm.org/D32582 llvm-svn: 305927	2017-06-21 16:32:35 +00:00
Craig Topper	ae86cc725d	[InstCombine] Don't let folding (select (icmp eq (and X, C1), 0), Y, (or Y, C2)) create more instructions than it removes Summary: Previously this folding had no checks to see if it was going to result in fewer instructions. This was pointed out during the review of D34184. This patch adds code to count how many instructions it's going to create vs how many it's going to remove so we can make a proper decision. Reviewers: spatel, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34437 llvm-svn: 305926	2017-06-21 16:07:13 +00:00
Craig Topper	cbac691c4b	[Reassociate] Support xor reassociating for splat vectors Summary: This patch adds support for xors of splat vectors. Reviewers: mcrosier Reviewed By: mcrosier Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34354 llvm-svn: 305925	2017-06-21 16:07:09 +00:00
Dmitry Preobrazhensky	851a3d9f05	[AMDGPU][MC][GFX9] Corrected VOP3P relevant code to fix disassembler failures See Bug 33509: https://bugs.llvm.org//show_bug.cgi?id=33509 Reviewers: Sam Kolton, Artem Tamazov, Valery Pykhtin Differential Revision: https://reviews.llvm.org/D34360 llvm-svn: 305923	2017-06-21 16:00:54 +00:00
Nirav Dave	c1b6aa77bb	[DAG] Move BaseIndexOffset into separate Library. NFC. Move BaseIndexOffset analysis out of DAGCombiner for use in other files. llvm-svn: 305921	2017-06-21 15:40:43 +00:00
David Blaikie	8f9621ae04	ClangFormat some changes from r305226 Post commit review feedback from Justin Bogner llvm-svn: 305919	2017-06-21 15:20:46 +00:00
Christof Douma	1ee68828b2	[AARCH64][LSE] Preliminary support for ARMv8.1 LSE Atomics. Added test file for ARMv8.1 LSE Atomics that I forgot to include in commit r305893. Patch by Ananth Jasty. Differential Revision: https://reviews.llvm.org/D33586 Change-Id: Ic1ad8ed87c1b584c4c791b459a686c866a3c3087 llvm-svn: 305918	2017-06-21 15:18:39 +00:00
Nirav Dave	9a69d444a3	[DAG] Remove Node construction from BaseIndexOffset match. NFCI. Move GlobalAddress Offset decomposition from initial match into comparison check, removing the possibility of constructing a new offset global address when examining addresses. llvm-svn: 305917	2017-06-21 15:07:30 +00:00
Simon Pilgrim	550cb7e82c	[X86][SSE] Dropped -mcpu from 256-bit vector shuffle tests Use triple and attribute only for consistency llvm-svn: 305916	2017-06-21 14:51:23 +00:00
Dmitry Preobrazhensky	dc4ac823ec	[AMDGPU][MC] Corrected V_QSAD instructions to check that dest register is different than any of the src See Bug 33279: https://bugs.llvm.org//show_bug.cgi?id=33279 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D34003 llvm-svn: 305915	2017-06-21 14:41:34 +00:00
Sanjay Patel	cec6a500a8	[x86] fix formatting; NFC llvm-svn: 305914	2017-06-21 14:27:11 +00:00
Simon Pilgrim	9d0c2b7bad	[X86][SSE] Dropped -mcpu from 128-bit vector shuffle tests Use triple and attribute only for consistency llvm-svn: 305913	2017-06-21 14:23:02 +00:00
Simon Pilgrim	5309b7d5c9	[X86][SSE] Regenerate merge store tests llvm-svn: 305910	2017-06-21 13:46:42 +00:00
Simon Pilgrim	e74e08fe61	[X86][SSE] Dropped -mcpu from vector blend shuffle tests and regenerate Use triple and attribute only for consistency llvm-svn: 305909	2017-06-21 13:45:33 +00:00
Simon Pilgrim	98aab7c6fc	[X86][SSE] Dropped -mcpu from vector shuffle tests Use triple and attribute only for consistency llvm-svn: 305908	2017-06-21 13:26:52 +00:00
Simon Pilgrim	6d5d6b542b	[X86][SSE] Dropped -mcpu from vector zero extend tests Use triple and attribute only for consistency llvm-svn: 305907	2017-06-21 13:17:14 +00:00
Simon Pilgrim	c388ec32e0	[X86][SSE] Dropped -mcpu from variable shuffle tests Use triple and attribute only for consistency llvm-svn: 305906	2017-06-21 13:15:41 +00:00
Simon Pilgrim	73814a2594	[X86][AVX] Add AVX1 shuffle truncation tests llvm-svn: 305905	2017-06-21 12:58:56 +00:00
Simon Pilgrim	db6c3fa872	[X86][SSE] Add SSE2/SSE42 shuffle truncation tests llvm-svn: 305904	2017-06-21 12:58:19 +00:00
Zvi Rackover	845ca8fba9	[X86] Rerun the update_llc_test_checks tool on test. NFC. llvm-svn: 305897	2017-06-21 11:21:43 +00:00
Pavel Labath	2c1e8b7a7e	Fix build after r305892 Make sure to #include <cerrno> in Support/Errno.h llvm-svn: 305895	2017-06-21 11:10:02 +00:00
Christof Douma	c1c28051d2	[AARCH64][LSE] Preliminary support for ARMv8.1 LSE Atomics. Implemented support to AArch64 codegen for ARMv8.1 Large System Extensions atomic instructions. Where supported, these instructions can provide atomic operations with higher performance. Currently supported operations include: fetch_add, fetch_or, fetch_xor, fetch_smin, fetch_min/max (signed and unsigned), swap, and compare_exchange. This implementation implies sequential-consistency ordering, more relaxed ordering is under development. Subtarget->hasLSE is currently supported for Cavium ThunderX2T99. Patch by Ananth Jasty. Differential Revision: https://reviews.llvm.org/D33586 Change-Id: I82f6d3d64255622791ceb0715b7ab9f4dc4d4b2c llvm-svn: 305893	2017-06-21 10:58:31 +00:00
Pavel Labath	1f6aea2eb3	[Support] Add RetryAfterSignal helper function Summary: This function retries an operation if it was interrupted by a signal (failed with EINTR). It's inspired by the TEMP_FAILURE_RETRY macro in glibc, but I've turned that into a template function. I've also added a fail-value argument, to enable the function to be used with e.g. fopen(3), which is documented to fail for any reason that open(2) can fail (which includes EINTR). The main user of this function will be lldb, but there were also a couple of uses within llvm that I could simplify using this function. Reviewers: zturner, silvas, joerg Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D33895 llvm-svn: 305892	2017-06-21 10:55:34 +00:00
Florian Hahn	8552e591a1	[AArch64] Add early exit to promoteLoadFromStore. There should be at most a single kill flag for the promoted operand between the store/load pair. Discussed in https://reviews.llvm.org/D34402. llvm-svn: 305889	2017-06-21 09:51:52 +00:00
Strahinja Petrovic	d280ea4f76	[MIPS] Fix for selecting of DINS/INS instruction This patch adds one more condition in selection DINS/INS instruction, which fixes MultiSource/Applications/JM/ldecod/ for mips32r2 (and mips64r2 n32 abi). Differential Revision: https://reviews.llvm.org/D33725 llvm-svn: 305888	2017-06-21 09:25:51 +00:00
Javed Absar	e3a0cc2ca0	Use range-loop in machine-scheduler. NFCI. Converts to range-loop usage in machine scheduler. This makes the code neater and easier to read, and also keeps pace of the machine scheduler implementation with C++11 features. Reviewed by: Matthias Braun Differential Revision: https://reviews.llvm.org/D34320 llvm-svn: 305887	2017-06-21 09:10:10 +00:00
Sam Kolton	549c89d2c9	[AMDGPU] SDWA: merge VI and GFX9 pseudo instructions Summary: Previously there were two separate pseudo instruction for SDWA on VI and on GFX9. Created one pseudo instruction that is union of both of them. Added verifier to check that operands conform either VI or GFX9. Reviewers: dp, arsenm, vpykhtin Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, artem.tamazov Differential Revision: https://reviews.llvm.org/D34026 llvm-svn: 305886	2017-06-21 08:53:38 +00:00
Florian Hahn	80e485179e	[AArch64] Preserve register flags when promoting a load from store. Summary: This patch updates promoteLoadFromStore to use the store MachineOperand as the source operand of the new instruction instead of creating a new register MachineOperand. This way, the existing register flags are preserved. This fixes PR33468 (https://bugs.llvm.org/show_bug.cgi?id=33468). Reviewers: MatzeB, t.p.northover, junbuml Reviewed By: MatzeB Subscribers: aemerson, rengolin, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34402 llvm-svn: 305885	2017-06-21 08:47:23 +00:00
Guy Blank	52d73fce85	[DAGCombiner] Add another combine from build vector to shuffle Add support for combining a build vector to a shuffle. When the build vector is of extracted elements from 2 vectors (vec1, vec2) where vec2 is 2 times smaller than vec1. llvm-svn: 305883	2017-06-21 07:38:41 +00:00
Max Kazantsev	eac01d4c62	[SCEV] Make MulOpsInlineThreshold lower to avoid excessive compilation time MulOpsInlineThreshold option of SCEV is defaulted to 1000, which is inadequately high. When constructing SCEVs of expressions like: x1 = a * a x2 = x1 * x1 x3 = x2 * x2 ... We actually have huge SCEVs with max allowed amount of operands inlined. Such expressions are easy to get from unrolling of loops looking like x = a for (i = 0; i < n; i++) x = x * x Or more tricky cases where big powers are involved. If some non-linear analysis tries to work with a SCEV that has 1000 operands, it may lead to excessively long compilation. The attached test does not pass within 1 minute with default threshold. This patch decreases its default value to 32, which looks much more reasonable if we use analyses with complexity O(N^2) or O(N^3) working with SCEV. Differential Revision: https://reviews.llvm.org/D34397 llvm-svn: 305882	2017-06-21 07:28:13 +00:00
Rafael Espindola	2c8e3ed00f	Simplify test. llvm-svn: 305881	2017-06-21 06:42:56 +00:00
Dean Michael Berris	28ecff5cf1	[XRay] Reduce synthetic references emitted by XRay Summary: When we're building with XRay instrumentation, we use a trick that preserves references from the function to a function sled index. This index table lives in a separate section, and without this trick the linker is free to garbage-collect this section and all the segments it refers to. Until we're able to tell the linkers to preserve these sections, we use this reference trick to keep around both the index and the entries in the instrumentation map. Before this change we emitted both a synthetic reference to the label in the instrumentation map, and to the entry in the function map index. This change removes the first synthetic reference and only emits one synthetic reference to the index -- the index entry has the references to the labels in the instrumentation map, so the linker will still preserve those if the function itself is preserved. This reduces the amount of synthetic references we emit from 16 bytes to just 8 bytes in x86_64, and similarly to other platforms. Reviewers: dblaikie Subscribers: javed.absar, kpw, pelikan, llvm-commits Differential Revision: https://reviews.llvm.org/D34340 llvm-svn: 305880	2017-06-21 06:39:42 +00:00
Serguei Katkov	0b0dc57dd8	[ImplicitNullChecks] Uphold an invariant in areMemoryOpsAliased Right now areMemoryOpsAliased has an assertion justified as: MMO1 should have a value due it comes from operation we'd like to use as implicit null check. assert(MMO1->getValue() && "MMO1 should have a Value!"); However, it is possible for that invariant to not be upheld in the following situation (conceptually): Null check %RAX NotNullSucc: %RAX = LEA %RSP, 16 // I0 %RDX = MOV64rm %RAX // I1 With the current code, we will have an early exit from ImplicitNullChecks::isSuitableMemoryOp on I0 with SR_Unsuitable. However, I1 will look plausible (since it loads from %RAX) and will go ahead and call areMemoryOpsAliased(I1, I0). This will cause us to fail the assert mentioned above since I1 does not load from an IR level value and thus is allowed to have a non-Value base address. The fix is to bail out earlier whenever we see an unsuitable instruction overwrite PointerReg. This would guarantee that when we call areMemoryOpsAliased, we're guaranteed to be looking at an instruction that loads from or stores to an IR level value. Original Patch Author: sanjoy Reviewers: sanjoy, mkazantsev, reames Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34385 llvm-svn: 305879	2017-06-21 06:38:23 +00:00
Davide Italiano	0ec715be1f	[NewGVN] Fix a bug that made the store verifier less effective. We weren't actually checking for duplicated stores, as the condition was always actually false. This was found by Coverity, and I have no clue how to trigger this in real-world code (although I tried for a bit). llvm-svn: 305867	2017-06-20 22:57:40 +00:00
Kevin Enderby	1ce3858488	Updated llvm-objdump with Mach-O files and the -objc-meta-data option so that it symbolically prints the superclass when it has dyld bind info for it. rdar://7638823 llvm-svn: 305866	2017-06-20 22:55:11 +00:00
Rafael Espindola	3ac4c09daf	clang-format a region. It will make a followup patch easier to read. llvm-svn: 305865	2017-06-20 22:53:29 +00:00
Lang Hames	cd22753689	Add a cantFail overload for Expected-reference (Expected<T&>) types. llvm-svn: 305863	2017-06-20 22:18:02 +00:00
Reid Kleckner	91ef9de643	[codeview] YAMLize all section offsets and indices in symbol records We forgot to serialize these because llvm-readobj didn't dump them. They are typically all zeros in an object file. The linker fills them in with relocations before adding them to the PDB. Now we can properly round trip these symbols through pdb2yaml -> yaml2pdb. I made these fields optional with a zero default so that we can elide them from our test cases. llvm-svn: 305857	2017-06-20 21:19:22 +00:00
Adrian Prantl	4d121e2182	Revert "Add previously accidentally uncommitted testcase for r305599." This reverts commit r305852. The testcase already exists but I moved it to the X86 directory using a different machine and got confused... llvm-svn: 305856	2017-06-20 21:14:29 +00:00
Rafael Espindola	5f4a10bf23	Make this test a bit more strict. NFC. llvm-svn: 305855	2017-06-20 21:11:58 +00:00
Adrian Prantl	25422dcccb	Fix a crash in DwarfDebug::validThroughout. The instruction it falls over on is an IMPLICIT_DEF that also happens to be the only instruction in its lexical scope. That LexicalScope has never been created because its range is empty. This patch skips over all meta-instructions instead of just DBG_VALUEs. Thanks to David Blaikie for providing a testcase! llvm-svn: 305853	2017-06-20 21:08:52 +00:00
Adrian Prantl	36bc095a2e	Add previously accidentally uncommitted testcase for r305599. llvm-svn: 305852	2017-06-20 21:08:19 +00:00
Kevin Enderby	30cf2e87ba	Change llvm-objdump with Mach-O files and the -info-plist option with the -no-leading-headers option so that it does not print the leading header. rdar://27378808 llvm-svn: 305849	2017-06-20 21:00:25 +00:00
Anna Thomas	f765cad13e	[Statepoint] Add helper functions for GCRelocate and GCResult These functions isGCRelocate and isGCResult are similar to isStatepoint(const Value*). llvm-svn: 305847	2017-06-20 20:54:57 +00:00
Saleem Abdulrasool	8199dadab8	Support: chunk writing on Linux This is a workaround for large file writes. It has been witnessed that write(2) fails with EINVAL (22) due to a large value (>2G). Thanks to James Knight for the help with coming up with a sane test case. llvm-svn: 305846	2017-06-20 20:51:51 +00:00
Matt Arsenault	67cd347e93	AMDGPU: Allow vectorization of packed types llvm-svn: 305844	2017-06-20 20:38:06 +00:00
Reid Kleckner	665e1c9240	[codeview] Fully initialize DataSym when mapping from YAML In the object file, the section index and relative offset are typically zero, so make these YAML fields optional with a default. It looks like there may be more partially initialized symbol records, but this should fix the msan bot. llvm-svn: 305842	2017-06-20 20:34:37 +00:00
Stanislav Mekhanoshin	a9d846c6ef	[AMDGPU] Fix illegal shrink of V_SUBB_U32 and V_ADDC_U32 If there is an immediate operand we shall not shrink V_SUBB_U32 and V_ADDC_U32, it does not fit e32 encoding. Differential Revision: https://reviews.llvm.org/D34291 llvm-svn: 305840	2017-06-20 20:33:44 +00:00
Michael Gottesman	7265da8106	[cmake] Add support for using the standalone leaks sanitizer with LLVM. This commit causes LLVM_USE_SANITIZER to now accept the "Leaks" option. This will cause cmake to pass in -fsanitize=leak in all of the appropriate places. I am making this change so that I can setup a linux bot that only detects leaks. llvm-svn: 305839	2017-06-20 20:28:07 +00:00
Matt Arsenault	9698f1c862	AMDGPU: Start adding global_* instructions llvm-svn: 305838	2017-06-20 19:54:14 +00:00
Aditya Nandakumar	855a9e3e06	[GISel]: NFC. Add comment to G_FMA opcode as requested in rL305824 llvm-svn: 305837	2017-06-20 19:52:29 +00:00
Aditya Nandakumar	c6a419123a	[GISel]: Add G_FMA opcode for fused multiply adds https://reviews.llvm.org/D34372 Reviewed by dsanders llvm-svn: 305824	2017-06-20 19:25:23 +00:00
Matt Arsenault	ff3f912e74	AMDGPU: Do operand folding in program order Before it was possible to partially fold use instructions before the defs. After the xor is folded into a copy, the same mov can end up in the fold list twice, so on the second attempt it will fail expecting to see a register to fold. llvm-svn: 305821	2017-06-20 18:56:32 +00:00
Zachary Turner	297b6eb20d	[PDB] Don't write uninitialized bytes to a PDB file. There were certain fields that we didn't know how to write, as well as various padding bytes that we would ignore. This leads to garbage data in the PDB. While not strictly necessary, we should initialize these bytes to something meaningful, as it makes for easier binary comparison between PDBs. llvm-svn: 305819	2017-06-20 18:50:55 +00:00
Zachary Turner	ed130b6ac0	Remove diff pedantic mode. llvm-svn: 305818	2017-06-20 18:50:30 +00:00
Matthias Braun	7a482e2302	RegisterScavenging: Followup to r305625 This does some improvements/cleanup to the recently introduced scavengeRegisterBackwards() functionality: - Rewrite findSurvivorBackwards algorithm to use the existing LiveRegUnit::accumulateBackward() code. This also avoids the Available and Candidates bitset and just need 1 LiveRegUnit instance (= 1 bitset). - Pick registers in allocation order instead of register number order. llvm-svn: 305817	2017-06-20 18:43:14 +00:00
Matt Arsenault	76858f5a1d	AMDGPU: Preserve undef when folding register operands If the source was a copy of an undef register, this would produce a read of an undefined register which is a verifier error. llvm-svn: 305816	2017-06-20 18:41:31 +00:00
Stanislav Mekhanoshin	465a1ff193	[AMDGPU] Eliminate SGPR to VGPR copy when possible SGPRs are generally cheaper, so try to use them over VGPRs. Differential Revision: https://reviews.llvm.org/D34130 llvm-svn: 305815	2017-06-20 18:32:42 +00:00
Matt Arsenault	7f67b35901	AMDGPU: Fix crash with undef vreg input operand llvm-svn: 305814	2017-06-20 18:28:02 +00:00
Hiroshi Inoue	e7a35539c5	[PowerPC] fix trivial typos in comment, NFC llvm-svn: 305813	2017-06-20 17:53:33 +00:00
Simon Pilgrim	68204b83a7	[CostModel][X86] Add scalar arithmetic cost tests llvm-svn: 305810	2017-06-20 17:10:27 +00:00
Simon Pilgrim	36c17935e4	[CostModel][X86] Declare costs variables based on type The alphabetical progression isn't that useful llvm-svn: 305808	2017-06-20 17:04:46 +00:00
Craig Topper	2a053a9f9d	[TableGen] Take a parameter by reference instead of pointer so we don't have to add & on both callers. NFC llvm-svn: 305807	2017-06-20 16:34:37 +00:00
Craig Topper	e8a8e6a6b1	[TableGen] Use range based for loop. NFC llvm-svn: 305806	2017-06-20 16:34:35 +00:00
Yuka Takahashi	ba5d4af490	[GSoC] Flag value completion for clang This is patch for GSoC project, bash-completion for clang. To use this on bash, please run `source clang/utils/bash-autocomplete.sh`. bash-autocomplete.sh is code for bash-completion. In this patch, Options.td was mainly changed in order to add value class in Options.inc. llvm-svn: 305805	2017-06-20 16:31:31 +00:00
Sanjay Patel	0656629b87	[x86] enable CGP memcmp() expansion for 2/4/8 byte sizes There are a couple of potential improvements as seen in the IR and asm: 1. We're unnecessarily extending to a larger type to compare values. 2. The codegen for (select cond, 1, -1) could avoid a cmov. (or we could change the order of the compares, so we have a select with 0 operand) llvm-svn: 305802	2017-06-20 15:58:30 +00:00
Simon Pilgrim	4822b5b649	[X86][SSE] Relax 0/-1 vector element insertion to work for any vector with >=16bit elements Shuffle lowering/combining now does a good job for 256/512-bit vectors - we don't need to prevent this llvm-svn: 305801	2017-06-20 15:19:02 +00:00
Tim Northover	208ddc5bdc	DAG: correctly legalize UMULO. We were incorrectly sign extending into the high word (as you would for SMULO) when legalizing UMULO in terms of a wider full multiplication. Patch by James Duley. llvm-svn: 305800	2017-06-20 15:01:38 +00:00
Vassil Vassilev	9149e30f36	D33466: Make file non-executable. llvm-svn: 305795	2017-06-20 14:20:48 +00:00
Sanjay Patel	4ccbd58d70	[InstCombine] fix code/test comments for r305792; NFC These diffs were in the last version of the patch in D33342, but I accidentally committed the previous rev. llvm-svn: 305793	2017-06-20 12:45:46 +00:00
Sanjay Patel	adca825dc1	[InstCombine] try to canonicalize xor-of-icmps to and-of-icmps We have a large portfolio of folds for and-of-icmps and or-of-icmps in InstSimplify and InstCombine, but hardly anything for xor-of-icmps. Rather than trying to rethink and translate all of those folds, we can use the truth table definition of xor: X ^ Y --> (X | Y) & !(X & Y) ...to see if we can convert the xor to and/or and then use the existing folds. http://rise4fun.com/Alive/J9v Differential Revision: https://reviews.llvm.org/D33342 llvm-svn: 305792	2017-06-20 12:40:55 +00:00
Daniel Sanders	a6e2cebf98	[globalisel][tablegen] Add support for COPY_TO_REGCLASS. Summary: As part of this * Emitted instructions now have named MachineInstr variables associated with them. This isn't particularly important yet but it's a small step towards multiple-insn emission. * constrainSelectedInstRegOperands() is no longer hardcoded. It's now added as the ConstrainOperandsToDefinitionAction() action. COPY_TO_REGCLASS uses an alternate constraint mechanism ConstrainOperandToRegClassAction() which supports arbitrary constraints such as that defined by COPY_TO_REGCLASS. Reviewers: ab, qcolombet, t.p.northover, rovka, kristof.beyls, aditya_nandakumar Reviewed By: ab Subscribers: javed.absar, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D33590 llvm-svn: 305791	2017-06-20 12:36:34 +00:00
Simon Pilgrim	916d569b8e	Fix Wdocumentation warning llvm-svn: 305790	2017-06-20 12:28:33 +00:00
Simon Pilgrim	b233c0a5d2	[X86][SSE] Dropped old INSERT_VECTOR_ELT lowering TODO Target shuffle combining now supports the matching of INSERT_VECTOR_ELT/PINSRW/PINSRB for merging multiple insertions into shuffles/bitmasks. llvm-svn: 305788	2017-06-20 10:33:34 +00:00
Simon Pilgrim	b4a77fe83a	Fixed test name. NFCI. llvm-svn: 305787	2017-06-20 10:24:06 +00:00
Igor Breger	0dee0f458e	[GlobalISel][X86] fix compilation error ( -Werror=unused-function ) llvm-svn: 305786	2017-06-20 09:40:57 +00:00
Haojian Wu	6bd5cc6239	[SelectionDAG] Fix a use-after-free issue introduced in r305775. vector.back() will be invalidated when memory reallocation happens. llvm-svn: 305785	2017-06-20 09:29:43 +00:00
Igor Breger	1dcd5e8dc8	[GlobalISel][X86] Get correct RegClass for given RegBank. Summary: In some cases RegClass depends on target feature. High (16-31) vector registers exist only if AVX512f is available. Split from https://reviews.llvm.org/D33665 Reviewers: qcolombet, t.p.northover, zvi, guyblank Reviewed By: t.p.northover, guyblank Subscribers: guyblank, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33952 Conflicts: test/CodeGen/X86/GlobalISel/select-memop-scalar.mir llvm-svn: 305784	2017-06-20 09:15:10 +00:00
Igor Breger	14535f0fc2	[GlobalISel] combine not symmetric merge/unmerge nodes. Summary: In some cases legalization ends up with not symmetric merge/unmerge nodes. Transform it to merge/unmerge nodes. Reviewers: t.p.northover, qcolombet, zvi Reviewed By: t.p.northover Subscribers: rovka, kristof.beyls, guyblank, llvm-commits Differential Revision: https://reviews.llvm.org/D33626 llvm-svn: 305783	2017-06-20 08:54:17 +00:00
Max Kazantsev	0bcf6ec85c	[SCEV][NFC] Fix a misleading description of AddOpsInlineThreshold The description of this option was copy-pasted from another one and does not correspond to reality. Differential Revision: https://reviews.llvm.org/D34390 llvm-svn: 305782	2017-06-20 08:37:31 +00:00
Igor Breger	22ab175658	[GlobalISel][X86] add legalizer mir tests. NFC llvm-svn: 305781	2017-06-20 08:30:48 +00:00
NAKAMURA Takumi	9a90b68707	WasmObjectWriter.cpp: Tweak a comment line. [-Wdocumentation] llvm-svn: 305777	2017-06-20 07:21:19 +00:00
Alexandros Lamprineas	2b2b420563	[ARM] Support constant pools in data when generating execute-only code. Resubmission of r305387, which was reverted at r305390. The Address Sanitizer caught a stack-use-after-scope of a Twine variable. This is now fixed by passing the Twine directly as a function parameter. The ARM backend asserts against constant pool lowering when it generates execute-only code in order to prevent the generation of constant pools in the text section. It appears that target independent optimizations might generate DAG nodes that represent constant pools. By lowering such nodes as global addresses we don't violate the semantics of execute-only code and also it is guaranteed that execute-only behaves correctly with the position-independent addressing modes that support execute-only code. Differential Revision: https://reviews.llvm.org/D33773 llvm-svn: 305776	2017-06-20 07:20:52 +00:00
Max Kazantsev	b5c3362873	[SelectionDAG] Get rid of recursion in CalcNodeSethiUllmanNumber The recursive implementation of CalcNodeSethiUllmanNumber may overflow stack on extremely long pred chains. This patch replaces it with an equivalent iterative implementation. Differential Revision: https://reviews.llvm.org/D33769 llvm-svn: 305775	2017-06-20 07:07:09 +00:00
Sam Clegg	1fb8daa69a	Fix unused function build error in lld The lld-x86_64-darwin13 is failing with: error: unused function 'operator<<' Wrap the declaration in ifndef NDEBUG, which matches what is done in MipsELFObjectWriter.cpp. Differential Revision: https://reviews.llvm.org/D34384 llvm-svn: 305771	2017-06-20 05:05:10 +00:00
Sam Clegg	7f055dee27	[WebAssembly] Fix build failures introduced in r305769 This fixes two build failures that only occur in certain configurations: - error: unused function 'operator<<' - error: control reaches end of non-void function Differential Revision: https://reviews.llvm.org/D34382 llvm-svn: 305770	2017-06-20 04:47:58 +00:00
Sam Clegg	b7787fd076	[WebAssembly] Add support for weak symbols in the binary format This also introduces the updated format for the "linking" section which can represent extra symbol information. See: https://github.com/WebAssembly/tool-conventions/pull/10 Differential Revision: https://reviews.llvm.org/D34019 llvm-svn: 305769	2017-06-20 04:04:59 +00:00
Nirav Dave	47a78a2502	[DAG] Simplify BaseIndexOffset. NFCI. Remove tail calls and cleanup codeflow. llvm-svn: 305768	2017-06-20 02:48:39 +00:00
Vedant Kumar	b1d331a36e	[Coverage] PR33517: Check for failure to load func records With PR33517, it became apparent that symbol table creation can fail when presented with malformed inputs. This patch makes that sort of error detectable, so llvm-cov etc. can fail more gracefully. Specifically, we now check that function records loaded from corrupted coverage mapping data are rejected, e.g when the recorded function name is garbage. Testing: check-{llvm,clang,profile}, some unit test updates. llvm-svn: 305767	2017-06-20 02:05:35 +00:00
Vedant Kumar	b5794ca90c	[ProfileData] PR33517: Check for failure of symtab creation With PR33517, it became apparent that symbol table creation can fail when presented with malformed inputs. This patch makes that sort of error detectable, so llvm-cov etc. can fail more gracefully. Specifically, we now check that function names within the symbol table aren't empty. Testing: check-{llvm,clang,profile}, some unit test updates. llvm-svn: 305765	2017-06-20 01:38:56 +00:00
Pengxuan Zheng	4a99e37edc	[test-release.sh] Enable Polly by default Reviewers: grosser, hans, zinob, bollu Reviewed By: grosser, hans Subscribers: tstellar, llvm-commits Differential Revision: https://reviews.llvm.org/D34306 llvm-svn: 305763	2017-06-20 01:04:25 +00:00
Kevin Enderby	a8c4c016f8	The change to llvm-nm in r305733 added fields to the struct NMSymbol that are not set on the main path. This diff does a memset to 0 the structs so this change is to hopefully fix the sanitizer-x86_64-linux-fast bot. llvm-svn: 305762	2017-06-20 00:41:04 +00:00
Matt Arsenault	c595185f8f	AMDGPU: Fix scratch wave offset relative FI expansion The offset may not be an inline immediate, so this needs to be materialized into a register. The post-RA run of SIShrinkInstructions is able to fold it later if it can. llvm-svn: 305761	2017-06-19 23:47:21 +00:00
Eugene Zelenko	f292a2feca	[ExecutionEngine] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 305760	2017-06-19 23:37:52 +00:00
Stanislav Mekhanoshin	50c2f251f5	[AMDGPU] Add infer address spaces pass before SROA It adds it for the target after inlining but before SROA where we can get most out of it. Differential Revision: https://reviews.llvm.org/D34366 llvm-svn: 305759	2017-06-19 23:17:36 +00:00
Eugene Zelenko	8361b0a9bb	[Target] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 305757	2017-06-19 22:43:19 +00:00
Sanjoy Das	7ba830d61c	Fix machine instruction in test case The AND64rm instruction used in the test case was incorrect. Since the first input register to AND64rm is tied to output register, they must be the same. Thanks to Jesper Antonsson for pointing this out! llvm-svn: 305756	2017-06-19 22:35:48 +00:00
Eugene Zelenko	de6cce2236	[IR] Fix some Clang-tidy modernize-use-using warnings; other minor fixes (NFC). llvm-svn: 305755	2017-06-19 22:05:08 +00:00
Zachary Turner	be548aceef	Mark LLVMTestingSupport as not installed in LLVMBuild. This is causing downstream issues with llvm-config. llvm-svn: 305754	2017-06-19 22:01:50 +00:00
Zachary Turner	a56e4ee346	Try to fix uninitialized read in unit test. llvm-svn: 305753	2017-06-19 21:59:09 +00:00
Geoff Berry	5e46600e3a	[AArch64][Falkor] Fix MOVZ sched predicate to not assert on non-imm operands (e.g. blockaddress). llvm-svn: 305752	2017-06-19 21:57:44 +00:00
Geoff Berry	e9972cabbd	[AArch64][Kryo] Add missing write latency for LDAXP, LDXP second destination. Fixes PR33491 and PR33512. llvm-svn: 305751	2017-06-19 21:57:42 +00:00
Geoff Berry	3cc4b9f780	[AArch64][Falkor] Refine load/store increment latencies. Also fix LDXP & LDAXP write latency to avoid similar assert as PR33491 and PR33512. llvm-svn: 305750	2017-06-19 21:56:21 +00:00
Matt Arsenault	f5d61d7943	Fix typos llvm-svn: 305749	2017-06-19 21:54:25 +00:00
Matt Arsenault	e0e68a757e	AMDGPU: Cleanup CreateLiveInRegister llvm-svn: 305748	2017-06-19 21:52:45 +00:00
Kevin Enderby	0d5ec11702	Fix a FIXME in llvm-objdump for the -exports-trie option that was not adding in the base address. Without this, Mach-O files, like 64-bit executables, don’t have the correct addresses printed for their exports, as the default is to link at address 0x100000000, not zero. llvm-svn: 305744	2017-06-19 21:23:07 +00:00
Peter Collingbourne	460eb5de98	Revert r305598, "utils: Add a git-r utility for mapping svn revisions to git revisions in the monorepo." $ git revert `git r 305598` We need to decide whether we want development tools to be written in Go first. llvm-svn: 305741	2017-06-19 20:43:09 +00:00
Xin Tong	bb8dbcf915	[BDCE] Add comments. NFC llvm-svn: 305739	2017-06-19 20:10:41 +00:00
Ana Pazos	f731bde064	[PATCH] [PGO] Fixed cast operation in MemIntrinsicVisitor::instrumentOneMemIntrinsic. Reviewers: xur, efriedma, davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34293 llvm-svn: 305737	2017-06-19 20:04:33 +00:00
Nico Weber	4c5c02a448	Revert r305382, it caused PR33513. llvm-svn: 305735	2017-06-19 19:48:59 +00:00
Sanjay Patel	a351a61cf2	[CGP, PowerPC] try to constant fold before creating loads for memcmp expansion This is the last step needed to avoid regressions for x86 before we flip the switch to allow expansion of the smallest set of memcmp() via CGP. The DAG version checks for constant strings, so we need to do that here too. FWIW, the 2 constant test is not handled by LibCallSimplifier::optimizeMemCmp() because that code is limited to 8-bit constant arrays. LibCallSimplifier will also fail to optimize some 1 constant tests because its alignment requirements are too strict (shouldn't require alignment for a constant operand). Differential Revision: https://reviews.llvm.org/D34071 llvm-svn: 305734	2017-06-19 19:48:35 +00:00
Kevin Enderby	df0d6dabb2	Change llvm-nm for Mach-O files to use dyld info in some cases when printing symbols. In order to reduce swift binary sizes, Apple is now stripping swift symbols from the nlist symbol table. llvm-nm currently only looks at the nlist symbol table and misses symbols that are present in dyld info. This makes it hard to know the set of symbols for a binary using just llvm-nm, unless you know to run llvm-objdump -exports-trie, which outputs the exported symbols in the dyld info from the export trie, but in a different format. Also moving forward the time may come when a fully linked Mach-O file that uses dyld will no longer have an nlist symbol table to avoid duplicating the symbol information. This change adds three flags to llvm-nm, -add-dyldinfo, -no-dyldinfo, and -dyldinfo-only. The first, -add-dyldinfo, has the same effect as when the new bit in the Mach-O header, MH_NLIST_OUTOFSYNC_WITH_DYLDINFO, appears in a binary, in that it looks through the dyld info from the export trie and adds symbols to be printed that are not already in its internal SymbolList variable. The -no-dyldinfo option turns this behavior off. The -dyldinfo-only option only looks at the dyld information and recreates the symbol table from the dyld info from the export trie and binding information, as if the Mach-O file had no nlist symbol table. Also fixed a few bugs with Mach-O N_INDR symbols not correctly printing the indirect name, or in the same format as the old nm-classic program. rdar://32021551 llvm-svn: 305733	2017-06-19 19:38:22 +00:00
David Blaikie	6ab0eb4764	Remove convenient but probably not worthwhile macro for lambda workaround Cleanup from r305405 llvm-svn: 305731	2017-06-19 19:01:08 +00:00
Eric Beckmann	ddcfbf7d0a	Have writeCOFFWriter return Expected<unique_ptr>. Summary: Have writeCOFFWriter return Expected<unique_ptr> instead of requiring being passed an uninitialized unique_ptr. Reviewers: zturner, ruiu Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D34307 llvm-svn: 305730	2017-06-19 18:49:05 +00:00
Taewook Oh	9083547ae3	Improve profile-guided heuristics to use estimated trip count. Summary: Existing heuristic uses the ratio between the function entry frequency and the loop invocation frequency to find cold loops. However, even if the loop executes frequently, if it has a small trip count per each invocation, vectorization is not beneficial. On the other hand, even if the loop invocation frequency is much smaller than the function invocation frequency, if the trip count is high it is still beneficial to vectorize the loop. This patch uses estimated trip count computed from the profile metadata as a primary metric to determine coldness of the loop. If the estimated trip count cannot be computed, it falls back to the original heuristics. Reviewers: Ayal, mssimpso, mkuper, danielcdh, wmi, tejohnson Reviewed By: tejohnson Subscribers: tejohnson, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D32451 llvm-svn: 305729	2017-06-19 18:48:58 +00:00
Bjorn Pettersson	475fcd9cd8	[InstCombine] Make sure AddReachableCodeToWorklist sets MadeIRChange Summary: Some optimizations in AddReachableCodeToWorklist did not update the MadeIRChange state. This could happen both when removing trivially dead instructions (DCE) and at constant folds. It is essential that changes to the IR are reported correctly, since for example InstCombinePass::run() will indicate that all analyses are preserved otherwise. And the CGPassManager determines if the CallGraph is up-to-date based on status from InstructionCombiningPass::runOnFunction(). The new test case early_dce_clobbers_callgraph.ll is a reproducer for some asserts that started to trigger after changes in the inliner in r305245. With this patch the test case passes again. Reviewers: sanjoy, craig.topper, dblaikie Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34346 llvm-svn: 305725	2017-06-19 18:00:27 +00:00
Hans Wennborg	ca69fc1cb7	Revert r304824 "Fix PR23384 (part 3 of 3)" This seems to be interacting badly with ASan somehow, causing false reports of heap-buffer overflows: PR33514. > Summary: > The patch makes instruction count the highest priority for > LSR solution for X86 (previously registers had highest priority). > > Reviewers: qcolombet > > Differential Revision: http://reviews.llvm.org/D30562 > > From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 305720	2017-06-19 17:57:15 +00:00
Jakub Kuderski	77d0bb4720	[Dominators] Clean up typedefs in GenericDomTreeConstruction. NFC. Summary: This patch cleans up GenericDomTreeConstruction by replacing typedefs with usings and replaces `typename GraphT::NodeRef` with `NodePtr` to make the file more readable. Reviewers: sanjoy, dberlin, chandlerc Reviewed By: dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34254 llvm-svn: 305715	2017-06-19 17:24:56 +00:00
Reid Kleckner	44cdb10964	[PDB] Start emitting source file and line information Summary: This is a first step towards getting line info to show up in VS and windbg. So far, only llvm-pdbutil can parse the PDBs that we produce. cvdump doesn't like something about our file checksum tables. I'll have to dig into that next. This patch adds a new DebugSubsectionRecordBuilder which takes bytes directly from some other producer, such as a linker, and sticks it into the PDB. Line tables only need to be relocated. No data needs to be rewritten. File checksums and string tables, on the other hand, need to be re-done. Reviewers: zturner, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D34257 llvm-svn: 305713	2017-06-19 17:21:45 +00:00
Jakub Kuderski	f6dbefe1b1	[Dominators] Clean up GenericDomTree.h. NFC. Summary: This patch cleans up GenericDomTree.h by: - removing unnecessary <NodeT> in DomTreeNodeBase - removing unnecessary std::move on bools - changing type of DFSNumIn/DFSNumOut from int to unsigned (since the members were used as unsigned anyway) The changes don't affect behavior -- everything works as before. Reviewers: sanjoy, dberlin, chandlerc Reviewed By: dberlin Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D34229 llvm-svn: 305710	2017-06-19 16:59:20 +00:00
Reid Kleckner	18d90e17ad	[CodeView] Fix dumping of public symbol record flags I noticed nonsensical type information while dumping PDBs produced by MSVC. llvm-svn: 305708	2017-06-19 16:54:51 +00:00
Davide Italiano	daa9c0e403	[NewGVN] Simplify findConditionEquivalence(). NFCI. llvm-svn: 305707	2017-06-19 16:46:15 +00:00
Dinar Temirbulatov	e2c6991c07	Remove brackets, NFC. llvm-svn: 305706	2017-06-19 16:44:07 +00:00
Craig Topper	a7529b68cc	[InstCombine] Cleanup some duplicated one use checks Summary: These 4 patterns have the same one use check repeated twice for each. Once without a cast and one with. But the cast has no effect on what method is called. For the OR case I believe it is always profitable regardless of the number of uses since we'll never increase the instruction count. For the AND case I believe it is profitable if the pair of xors has one use such that we'll get rid of it completely. Or if the C value is something freely invertible, in which case the not doesn't cost anything. Reviewers: spatel, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34308 llvm-svn: 305705	2017-06-19 16:23:49 +00:00
Craig Topper	ef85498e05	[Reassociate] Support some reassociation of vector xors Summary: Currently we don't try to do anything with vector xors. This patch adds support for removing duplicate pairs from a chain of vector xors as it's pretty easy to support. We still don't try to combine the xors with and/ors, but I might try that in a future patch. Reviewers: mcrosier, davide, resistor Reviewed By: mcrosier Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34338 llvm-svn: 305704	2017-06-19 16:23:46 +00:00
Craig Topper	4350734d36	[Reassociate] Make one of the helper methods static because it doesn't use any class variables. NFC llvm-svn: 305703	2017-06-19 16:23:43 +00:00
Artem Tamazov	314eafb73d	[AMDGPU][mc][tests][NFC] Bulk ISA tests: Massive update. Add Gfx9 dasm tests. A new Gfx9 dasm test added with approx 29000 cases. Existing tests extended by (approx.): * Gfx7 asm: 5000 test cases * Gfx8 asm: 5000 test cases * Gfx9 asm: 14400 test cases * Gfx8 dasm: 5200 test cases llvm-svn: 305702	2017-06-19 15:55:02 +00:00
Nirav Dave	8dcd008d18	Allow truncated and extend memory operations in Store Merge. NFCI. As all store merges checks are based on the memory operation performed, allow use of truncated stores and extended loads as valid input candidates for merging. Relanding after fixing selection between truncated and normal store. llvm-svn: 305701	2017-06-19 15:32:28 +00:00
Anna Thomas	7949f4529a	[JumpThreading][LVI] Invalidate LVI information after blocks are merged Summary: After a single predecessor is merged into a basic block, we need to invalidate the LVI information for the new merged block, when LVI is not provably true for all of instructions in the new block. The test cases added show the correct LVI information using the LVI printer pass. Reviewers: reames, dberlin, davide, sanjoy Reviewed by: dberlin, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34108 llvm-svn: 305699	2017-06-19 15:23:33 +00:00
Xin Tong	b412831d11	[TRE] Improve code motion in TRE, use AA to tell whether a load can be moved before a call that writes to memory. Summary: use AA to tell whether a load can be moved before a call that writes to memory. Reviewers: dberlin, davide, sanjoy, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: https://reviews.llvm.org/D34115 llvm-svn: 305698	2017-06-19 15:21:18 +00:00
Nirav Dave	4e363e36fb	Add test for store merge with noimplicitfloat llvm-svn: 305697	2017-06-19 15:18:20 +00:00
Florian Hahn	fd44ca6c76	[AArch64] Fix order of checks in shouldScheduleAdjacent. We need to check the opcode of FirstMI before accessing the operands. This caused a buildbot failure during bootstrapping on AArch64. llvm-svn: 305694	2017-06-19 13:45:41 +00:00
Simon Pilgrim	48bed53918	Use range for loops. NFCI. llvm-svn: 305693	2017-06-19 13:24:12 +00:00
Tom Stellard	ff63ee0db5	AMDGPU/GlobalISel: Mark G_BITCAST s32 <--> <2 x s16> legal Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D34129 llvm-svn: 305692	2017-06-19 13:15:45 +00:00
Igor Breger	bd2dedaa38	[GlobalISel][X86] Fold FI/G_GEP into LDR/STR instruction addressing mode. Summary: Implement some of the simplest addressing modes.It should help to test ABI. Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33888 llvm-svn: 305691	2017-06-19 13:12:57 +00:00
Florian Hahn	5f746c8e27	Recommit rL305677: [CodeGen] Add generic MacroFusion pass Use llvm::make_unique to avoid ambiguity with MSVC. This patch adds a generic MacroFusion pass, that is used on X86 and AArch64, which both define target-specific shouldScheduleAdjacent functions. This generic pass should make it easier for other targets to implement macro fusion and I intend to add macro fusion for ARM shortly. Differential Revision: https://reviews.llvm.org/D34144 llvm-svn: 305690	2017-06-19 12:53:31 +00:00
Diana Picus	78aaf7db04	[ARM] GlobalISel: Support G_ICMP for s8 and s16 Widen to s32 (like all other binary ops). llvm-svn: 305683	2017-06-19 11:47:28 +00:00
Florian Hahn	e16d3106f3	Revert r305677 [CodeGen] Add generic MacroFusion pass. This causes Windows buildbot failures due to an ambiguous call. llvm-svn: 305681	2017-06-19 11:26:15 +00:00
Florian Hahn	ee1b096f8a	[CodeGen] Add generic MacroFusion pass. Summary: This patch adds a generic MacroFusion pass, that is used on X86 and AArch64, which both define target-specific shouldScheduleAdjacent functions. This generic pass should make it easier for other targets to implement macro fusion and I intend to add macro fusion for ARM shortly. Reviewers: craig.topper, evandro, t.p.northover, atrick, MatzeB Reviewed By: MatzeB Subscribers: atrick, aemerson, mgorny, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34144 llvm-svn: 305677	2017-06-19 10:51:38 +00:00
Diana Picus	621894ac76	[ARM] GlobalISel: Support G_ICMP for i32 and pointers Add support throughout the pipeline: - mark as legal for s32 and pointers - map to GPRs - lower to a sequence of instructions, which moves 0 or 1 into the result register based on the flags set by a CMPrr We have copied from FastISel a helper function which maps CmpInst predicates into ARMCC codes. Ideally, we should be able to move it somewhere that both FastISel and GlobalISel can use. llvm-svn: 305672	2017-06-19 09:40:51 +00:00
Guy Blank	f4a09e55a6	[X86] Simplify vector-shuffle-v48 test. NFC. llvm-svn: 305670	2017-06-19 08:58:13 +00:00
Max Kazantsev	35b2a18eb9	[SCEV] Teach SCEVExpander to expand BinPow Current implementation of SCEVExpander demonstrates a very naive behavior when it deals with power calculation. For example, a SCEV for x^8 looks like (x * x * x * x * x * x * x * x) If we try to expand it, it generates a very straightforward sequence of muls, like: x2 = mul x, x x3 = mul x2, x x4 = mul x3, x ... x8 = mul x7, x This is an inefficient way of doing that. A better way is to generate a sequence of binary power calculations. In this case the expanded calculation will look like: x2 = mul x, x x4 = mul x2, x2 x8 = mul x4, x4 In some cases the code size reduction for such SCEVs is dramatic. If we had a loop: x = a; for (int i = 0; i < 3; i++) x = x * x; and this loop has been fully unrolled, we have something like: x = a; x2 = x * x; x4 = x2 * x2; x8 = x4 * x4; The SCEV for x8 is the same as in the example above, and if we for some reason want to expand it, we will naively generate 7 multiplications instead of 3. The BinPow expansion algorithm here keeps code size reasonable. This patch teaches SCEV Expander to generate a sequence of BinPow multiplications if we have repeating arguments in SCEVMulExpressions. Differential Revision: https://reviews.llvm.org/D34025 llvm-svn: 305663	2017-06-19 06:24:53 +00:00
David Blaikie	f91b030a95	[Doc] Fix getelementptr description about arguments Section "Arguments" of `getelementptr` [1] says the first argument is a type, the second argument is a pointer or a vector of pointers, and is the base address to start from. Update `getelementptr` FAQ [2] accordingly, based on discussion with David on the mailing list [3]. [1] http://llvm.org/docs/LangRef.html#getelementptr-instruction [2] http://llvm.org/docs/GetElementPtr.html [3] http://lists.llvm.org/pipermail/llvm-dev/2017-June/114294.html Patch by Wei-Ren Chen! Differential Revision: https://reviews.llvm.org/D34325 llvm-svn: 305662	2017-06-19 05:34:21 +00:00
Daniel Berlin	36b08b2088	NewGVN: Fix PR 33461, caused by slightly overzealous verification. llvm-svn: 305657	2017-06-19 00:24:00 +00:00
Sanjay Patel	ac5232201e	[x86] specify triples and auto-generate complete checks; NFC llvm-svn: 305656	2017-06-18 21:48:44 +00:00
Sanjay Patel	5a79bc61d0	[x86] specify triples and auto-generate complete checks; NFC llvm-svn: 305655	2017-06-18 21:42:19 +00:00
Sanjay Patel	0d081e0e4e	[x86] specify triple and auto-generate checks; NFC llvm-svn: 305654	2017-06-18 21:30:57 +00:00
Zachary Turner	26dbc5420d	Delete TypeDatabase. Merge the functionality into the random access type collection. This class was only being used in 2 places, so getting rid of it simplifies the code. llvm-svn: 305653	2017-06-18 20:52:45 +00:00
Craig Topper	c85be52fd8	[APFloat] Move the integerPartWidth constant into APFloatBase. Remove integerPart typedef at file scope and just use the one in APFloatBase everywhere. NFC llvm-svn: 305652	2017-06-18 18:15:41 +00:00
Craig Topper	d96177cf72	[Reassociate] Use APInt::isNullValue() instead of comparing with 0. NFC This should compile to slightly better code. llvm-svn: 305651	2017-06-18 18:15:38 +00:00
Kamil Rytarowski	a841233a76	Implement AllocateRWX and ReleaseRWX for NetBSD Summary: NetBSD ships with PaX MPROTECT disallowing RWX mappings. There is a solution to bypass this restriction with double mapping RX (code) and RW (data) using mremap(2) MAP_REMAPDUP. The initial mapping must be mmap(2)ed with protection: PROT_MPROTECT(PROT_EXEC). This functionality to bypass PaX MPROTECT appeared in NetBSD-7.99.72. This patch fixes 20 failing tests: - LLVM :: DebugInfo/debuglineinfo-macho.test - LLVM :: DebugInfo/debuglineinfo.test - LLVM :: ExecutionEngine/RuntimeDyld/Mips/ELF_Mips64r2N64_PIC_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/Mips/ELF_N32_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/Mips/ELF_N64R6_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/Mips/ELF_O32R6_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/Mips/ELF_O32_PIC_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/COFF_i386.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/COFF_x86_64.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/ELF-relaxed.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/ELF_STT_FILE.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/ELF_x64-64_PC8_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/ELF_x64-64_PIC_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/ELF_x86-64_PIC-small-relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/ELF_x86-64_debug_frame.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/ELF_x86_64_StubBuf.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/MachO_empty_ehframe.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/MachO_i386_DynNoPIC_relocations.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/MachO_i386_eh_frame.s - LLVM :: ExecutionEngine/RuntimeDyld/X86/MachO_x86-64_PIC_relocations.s Sponsored by <The NetBSD Foundation> Reviewers: joerg, lhames Reviewed By: joerg Subscribers: sdardis, llvm-commits, arichardson Differential Revision: https://reviews.llvm.org/D33874 llvm-svn: 305650	2017-06-18 16:52:32 +00:00
Sanjay Patel	44e3d4c812	[x86] adjust test constants to maintain coverage; NFC Increment (add 1) could be transformed to sub -1, and we'd lose coverage for these patterns. llvm-svn: 305646	2017-06-18 14:45:23 +00:00
Sanjay Patel	020bf47c6a	[x86] adjust test constants to maintain coverage; NFC Increment (add 1) could be transformed to sub -1, and we'd lose coverage for these patterns. llvm-svn: 305645	2017-06-18 14:23:47 +00:00
Sanjay Patel	246068b646	[x86] adjust test constants to maintain coverage; NFC Increment (add 1) could be transformed to sub -1, and we'd lose coverage for these patterns. llvm-svn: 305644	2017-06-18 14:01:32 +00:00
Ismail Donmez	c024ac2f22	Revert r305642 llvm-svn: 305643	2017-06-18 10:15:57 +00:00
Ismail Donmez	4f98bc6f80	Test to correct triple for SUSE on ARMv7 llvm-svn: 305642	2017-06-18 10:00:59 +00:00
Xin Tong	9d2a5b1cf7	Add argmemonly attribute to strlen and wcslen, i.e. they only read memory (string) passed to them. Summary: This allows strlen to be moved out of the loop in case its argument is not modified in the loop in LICM. Reviewers: hfinkel, davide, sanjoy, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34323 llvm-svn: 305641	2017-06-18 03:10:26 +00:00
Galina Kistanova	90e4c3f357	Fixed the warning introduced by r305625 to make ubuntu-gcc7.1-werror bot green. llvm-svn: 305640	2017-06-17 21:05:28 +00:00
Sanjoy Das	b70ddd8901	[SROA] Add support for non-integral pointers Summary: C.f. http://llvm.org/docs/LangRef.html#non-integral-pointer-type Reviewers: chandlerc, loladiro Reviewed By: loladiro Subscribers: reames, loladiro, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D32203 llvm-svn: 305639	2017-06-17 20:28:13 +00:00
Xin Tong	025780ba6e	[TRE] Add assertion for folding trivial return block llvm-svn: 305637	2017-06-17 16:55:12 +00:00
Xin Tong	d5b4d0b53a	[TRE] Update comments. NFC llvm-svn: 305636	2017-06-17 16:18:36 +00:00
NAKAMURA Takumi	3149aee3c2	[CMake] Get rid of generating obj.*-tblgen if CMake >= 3.9 for Ninja generator. CMake-3.9 doesn't let compilation units depend on their dependent libraries. llvm-svn: 305635	2017-06-17 13:45:55 +00:00
NAKAMURA Takumi	fc7f3b7514	[CMake] Introduce LLVM_TARGET_TRIPLE_ENV as an option to override LLVM_DEFAULT_TARGET_TRIPLE at runtime. No behavior is changed if LLVM_TARGET_TRIPLE_ENV is blank or undefined. If LLVM_TARGET_TRIPLE_ENV is "TEST_TARGET_TRIPLE" and $TEST_TARGET_TRIPLE is not blank, llvm::sys::getDefaultTargetTriple() returns $TEST_TARGET_TRIPLE. Lit resets config.target_triple and config.environment[LLVM_TARGET_TRIPLE_ENV] to change the default target. Without changing LLVM_DEFAULT_TARGET_TRIPLE nor rebuilding, lit can be run; TEST_TARGET_TRIPLE=i686-pc-win32 bin/llvm-lit -sv path/to/test/ TEST_TARGET_TRIPLE=i686-pc-win32 ninja check-clang-tools Differential Revision: https://reviews.llvm.org/D33662 llvm-svn: 305632	2017-06-17 03:19:08 +00:00
Eric Christopher	c70d07b7ea	Rework logic and comment out the default relocation models for PPC. llvm-svn: 305630	2017-06-17 02:25:56 +00:00
Eric Christopher	5ec30ef4e4	Turn a large if block into a smaller early return for clarity. llvm-svn: 305629	2017-06-17 02:25:55 +00:00
Eric Christopher	ded727c5a8	Remove the old and unused PPC32 and PPC64TargetMachine classes. llvm-svn: 305628	2017-06-17 02:25:53 +00:00
Eric Christopher	3b78693b9a	Remove unused forward declaration. llvm-svn: 305627	2017-06-17 02:25:51 +00:00
Eric Christopher	a5b50cd645	Tidy up some calls to getRegister for readability. llvm-svn: 305626	2017-06-17 02:25:49 +00:00
Matthias Braun	537d039104	RegScavenging: Add scavengeRegisterBackwards() Re-apply r276044/r279124/r305516. Fixed a problem where we would refuse to place spills as the very first instruction of a basic block and thus artificially increase pressure (test in test/CodeGen/PowerPC/scavenging.mir:spill_at_begin) This is a variant of scavengeRegister() that works for enterBasicBlockEnd()/backward(). The benefit of the backward mode is that it is not affected by incomplete kill flags. This patch also changes PrologEpilogInserter::doScavengeFrameVirtualRegs() to use the register scavenger in backwards mode. Differential Revision: http://reviews.llvm.org/D21885 llvm-svn: 305625	2017-06-17 02:08:18 +00:00
Tim Shen	d123c194e0	[PPC] Remove isBarrier from CFENCE8's definition. Summary: This is my misunderstanding on isBarrier. It's not for memory barriers, but for other control flow purposes. lwsync doesn't have it either. This fixes a simple crash with -verify-machineinstrs like below: define void @Foo() { entry: %tmp = load atomic i64, i64* undef acquire, align 8 unreachable } I deliberately don't want to check in the test, since there is little chance to regress on such a mistake. Such a test adds noise to the code base. I plan to check in first, since it fixes a crash, and the fix is obvious. Reviewers: kbarton, echristo Subscribers: sanjoy, nemanjai, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D34314 llvm-svn: 305624	2017-06-17 01:25:34 +00:00
Davide Italiano	9382c5560b	[SelectionDAG] Update Loop info after splitting critical edges. The analysis is expected to be preserved by SelectionDAG. llvm-svn: 305621	2017-06-17 00:56:27 +00:00
Davide Italiano	64f94fe02a	[InstCombine] Make FPMathOperator working with ConstantExpression(s). Fixes PR33453. Differential Revision: https://reviews.llvm.org/D34303 llvm-svn: 305618	2017-06-17 00:07:22 +00:00
Zachary Turner	b0fdd214b7	Don't crash if a type record can't be found. This was a regression introduced in a previous patch. Adding back the code that handles this case. llvm-svn: 305617	2017-06-17 00:02:24 +00:00
Sam Clegg	9d24fb7ff3	[WebAssembly] Use __stack_pointer global when writing wasm binary This ensures that symbolic relocations are generated for stack pointer manipulations. These relocations are of type R_WEBASSEMBLY_GLOBAL_INDEX_LEB. This change also adds support for reading relocations of this type in WasmObjectFile.cpp. Since it's a globally imported symbol, this does mean that the get_global/set_global instructions won't be valid until the objects are linked and the global used is no longer an imported global. Differential Revision: https://reviews.llvm.org/D34172 llvm-svn: 305616	2017-06-16 23:59:10 +00:00
Zachary Turner	ad859bd472	[CodeView] Fix random access of type names. Suppose we had a type index offsets array with a boundary at type index N. Then you request the name of the type with index N+1, and that name requires the name of index N-1 (think a parameter list, for example). We didn't handle this, and we would print something like (<unknown UDT>, <unknown UDT>). The fix for this is not entirely trivial, and speaks to a larger problem. I think we need to kill TypeDatabase, or at the very least kill TypeDatabaseVisitor. We need a thing that doesn't do any caching whatsoever, just given a type index it can compute the type name "the slow way". The reason for the bug is that we don't have anything like that. Everything goes through the type database, and if we've visited a record, then we're "done". It doesn't know how to do the expensive thing of re-visiting dependent records if they've not yet been visited. What I've done here is more or less copied the code (albeit greatly simplified) from TypeDatabaseVisitor, but wrapped it in an interface that just returns a std::string. The logic of caching the name is now in LazyRandomTypeCollection. Eventually I'd like to move the record database here as well and the visited record bitfield here as well, at which point we can actually just delete TypeDatabase. I don't see any reason for it if a "sequential" collection is just a special case of a random access collection with an empty partial offsets array. Differential Revision: https://reviews.llvm.org/D34297 llvm-svn: 305612	2017-06-16 23:42:44 +00:00
Zachary Turner	59224cba2e	Remove some dead code / includes. I'm trying to get rid of the TypeDatabase class, so the first step is to minimize its footprint. llvm-svn: 305611	2017-06-16 23:42:15 +00:00
Sam Clegg	20c7d432a4	obj2yaml: Improve error reporting Previously only the error codes were reported which meant that useful information about malformed inputs was not shown. Differential Revision: https://reviews.llvm.org/D34008 llvm-svn: 305609	2017-06-16 23:29:54 +00:00
Yonghong Song	a63178f756	bpf: fix a strict-aliasing issue Davide Italiano reported the following issue if llvm is compiled with gcc -Wstrict-aliasing -Werror: ..... lib/Target/BPF/CMakeFiles/LLVMBPFCodeGen.dir/BPFISelDAGToDAG.cpp.o ../lib/Target/BPF/BPFISelDAGToDAG.cpp: In member function ‘virtual void {anonymous}::BPFDAGToDAGISel::PreprocessISelDAG()’: ../lib/Target/BPF/BPFISelDAGToDAG.cpp:264:26: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] val = (uint16_t )new_val; ..... The error is caused by my previous commit (revision 305560). This patch fixed the issue by introducing an union to avoid type casting. Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 305608	2017-06-16 23:28:04 +00:00
Craig Topper	61e684adcc	[ConstantRange] Implement getSignedMin/Max in a less complicated and faster way Summary: As far as I can tell we should be able to implement these almost the same way we do unsigned, but using signed comparisons and checks for min signed value instead of min unsigned value. Reviewers: pete, davide, sanjoy Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33815 llvm-svn: 305607	2017-06-16 23:26:23 +00:00
Craig Topper	288b3c9e69	[SelectionDAG] Use APInt::isSubsetOf. NFC llvm-svn: 305606	2017-06-16 23:19:14 +00:00
Craig Topper	ea5b8bc9ef	[SelectionDAG] Use APInt::isNullValue/isOneValue. NFC llvm-svn: 305605	2017-06-16 23:19:12 +00:00
Craig Topper	b681907c50	[TargetLowering] Use ConstantSDNode::isOne and getSExtValue instead of getting the underlying APInt first. NFC llvm-svn: 305604	2017-06-16 23:19:10 +00:00
Wei Mi	c7ba876323	Revert rL305578. There is still some buildbot failure to be fixed. llvm-svn: 305603	2017-06-16 23:14:35 +00:00
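
The BinPow idea described in the r305663 message above (3 multiplications for x^8 instead of 7) can be illustrated with a small sketch. This is not the SCEVExpander code; it is a plain square-and-multiply illustration, and the function names (`naive_mul_sequence`, `binpow_mul_sequence`, `evaluate`) are hypothetical. Each multiplication is written as a tuple `(result, lhs, rhs)` of exponents, so `(4, 2, 2)` stands for `x4 = mul x2, x2`.

```python
def naive_mul_sequence(exponent):
    """Chain of muls x2 = x*x, x3 = x2*x, ...: exponent - 1 multiplications."""
    if exponent < 1:
        raise ValueError("exponent must be >= 1")
    return [(k + 1, k, 1) for k in range(1, exponent)]


def binpow_mul_sequence(exponent):
    """Square-and-multiply: repeatedly square x, and multiply together the
    squares that correspond to set bits of the exponent."""
    if exponent < 1:
        raise ValueError("exponent must be >= 1")
    muls = []
    power = 1      # exponent of the current repeated square (1, 2, 4, ...)
    acc = 0        # exponent accumulated so far from the set bits (0 = none)
    remaining = exponent
    while remaining:
        if remaining & 1:
            if acc == 0:
                acc = power                      # first set bit: reuse the square
            else:
                muls.append((acc + power, acc, power))
                acc += power
        remaining >>= 1
        if remaining:                            # another square is still needed
            muls.append((2 * power, power, power))
            power *= 2
    return muls


def evaluate(muls, x, exponent):
    """Execute a mul sequence on a concrete x and return x**exponent."""
    values = {1: x}
    for result, lhs, rhs in muls:
        values[result] = values[lhs] * values[rhs]
    return values[exponent]


if __name__ == "__main__":
    for result, lhs, rhs in binpow_mul_sequence(8):
        print(f"x{result} = mul x{lhs}, x{rhs}")
```

For an exponent of 8 the sketch emits the same three squarings listed in the commit message, while the naive chain needs seven multiplications.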

150736 Commits