llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	391be09ef3	AMDGPU: Fix adding redundant m0 uses BuildMI already adds these since they are defined correctly now. llvm-svn: 250961	2015-10-21 22:37:51 +00:00
Matt Arsenault	e8c0891e42	AMDGPU: Fix verifier error in SIFoldOperands There may be other use operands that also need their kill flags cleared. This happens in a few tests when SIFoldOperands is moved after PeepholeOptimizer. PeepholeOptimizer rewrites cases that look like: %vreg0 = ... %vreg1 = COPY %vreg0 use %vreg1<kill> %vreg2 = COPY %vreg0 use %vreg2<kill> to use the earlier source to %vreg0 = ... use %vreg0 use %vreg0 Currently SIFoldOperands sees the copied registers, so there is only one use. So far I haven't managed to come up with a test that currently has multiple uses of a foldable VGPR -> VGPR copy. llvm-svn: 250960	2015-10-21 22:37:50 +00:00
Matt Arsenault	b6fd98c7d9	AMDGPU: Split DiagnosticInfoUnsupported into its own file llvm-svn: 250959	2015-10-21 22:37:46 +00:00
Davide Italiano	b59ea9018e	[JIT] Towards a working small memory model. This commit introduces an option, --preallocate, so that we can get memory upfront and use it in small memory model tests (in order to get reliable results). Differential Revision: http://reviews.llvm.org/D13630 llvm-svn: 250956	2015-10-21 22:12:03 +00:00
Matt Arsenault	6005fcbe12	AMDGPU: Simplify VOP3 operand legalization. This was checking for a variety of situations that should never happen. This saves a tiny bit of compile time. We should not be selecting instructions with invalid operands in the first place. Most of the time for registers copys are inserted to the correct operand register class. For VOP3, since all operand types are supported and literal constants never are, we just need to verify the constant bus requirements (all immediates should be legal inline ones). The only possibly tricky case to maybe worry about is if when legalizing operands in moveToVALU with s_add_i32 and similar instructions. If the original s_add_i32 had a literal constant and we need to replace it with v_add_i32_e64 we would have an unsupported literal operand. However, I don't think we should worry about that because SIFoldOperands should handle folding literal constant operands into the SALU instructions based on the uses. At SIFoldOperands time, the legality and profitability of operand types is a bit different. llvm-svn: 250951	2015-10-21 21:51:02 +00:00
Michael Liao	446c714a76	[InstCombine] Revise the test case to match full sequene llvm-svn: 250950	2015-10-21 21:50:58 +00:00
Matt Arsenault	e223cebd10	AMDGPU: Fix not checking implicit operands in verifyInstruction When verifying constant bus restrictions, this wasn't catching uses in implicit operands. llvm-svn: 250948	2015-10-21 21:15:01 +00:00
Matt Arsenault	45cbfa5949	Use numeric_limits instead of LLONG_MAX This is a build fix for configurations where LLONG_MAX is not defined in system headers. llvm-svn: 250946	2015-10-21 21:10:12 +00:00
Matt Arsenault	29f9663f97	LegalizeDAG: Implement promote for build_vector This will be used in future commits for AMDGPU to promote operations on i64 vectors into operations on 32-bit vector components. This will be used / tested in future AMDGPU commits. llvm-svn: 250945	2015-10-21 21:10:10 +00:00
Vedant Kumar	029c35186e	[Verifier] Minor comment update, NFC llvm-svn: 250943	2015-10-21 20:33:31 +00:00
Keno Fischer	ddad187ce7	[RuntimeDyld] Ignore ST_FILE symbols when constructing GlobalSymbolTable Summary: ELF's STT_File symbols may overlap with regular globals in other files, so we should ignore them here in order to avoid having bogus entries in the symbol table that confuse us when resolving relocations. Reviewers: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13888 llvm-svn: 250942	2015-10-21 20:22:04 +00:00
Lang Hames	ffd4dc2111	[Orc] Clean up a comment. llvm-svn: 250940	2015-10-21 20:13:41 +00:00
Joerg Sonnenberger	7212809abc	Drop assert that a call with struct return goes to a function with sret attribute. Clang incorrectly misses it on __muldc3 and friends and the type system doesn't include it properly either. llvm-svn: 250938	2015-10-21 20:05:01 +00:00
Reid Kleckner	cfaeb42f9c	[WinEH] Add test for llvm.va.start in catchpad It already works, but we should have a test for it. This used to be PR23094 in the old model. llvm-svn: 250936	2015-10-21 19:54:40 +00:00
Teresa Johnson	c8a8a5e2ae	Silence Visual C++ warning in function summary parsing code (NFC) llvm-svn: 250929	2015-10-21 19:25:14 +00:00
Sanjay Patel	efab8b0d08	[x86] move recursive add match for LEA to helper function; NFCI llvm-svn: 250926	2015-10-21 18:56:06 +00:00
Yaron Keren	fa0adfd3c6	Revert r250923 as config.h is not an installed header. llvm-svn: 250924	2015-10-21 18:36:52 +00:00
Yaron Keren	1a3c0f77df	Include llvm/Config/config.h in FileSystem.h as it depends upon HAVE_SYS_STAT_H which is defined (or not) in config.h. llvm-svn: 250923	2015-10-21 18:28:35 +00:00
David Majnemer	dc3b67b4ca	[SimplifyCFG] Don't use-after-free an SSA value SimplifyTerminatorOnSelect didn't consider the possibility that the condition might be related to one of PHI nodes. This fixes PR25267. llvm-svn: 250922	2015-10-21 18:22:24 +00:00
Peter Zotov	45d29670ae	[OCaml] Expose Llvm.{set_,}unnamed_addr. Patch by Jacques-Pascal Deplaix <jp.deplaix@gmail.com> llvm-svn: 250912	2015-10-21 17:43:02 +00:00
Craig Topper	896c267544	[X86] Add AMD mwaitx, monitorx, and clzero instructions to the assembly parser and disassembler. llvm-svn: 250911	2015-10-21 17:26:45 +00:00
Sanjay Patel	557001d1c7	[x86] add test case that shows holes in LEA isel llvm-svn: 250910	2015-10-21 17:24:00 +00:00
Kevin Enderby	da9dd05011	Backing out commit r250906 as it broke lld. llvm-svn: 250908	2015-10-21 17:13:20 +00:00
Kevin Enderby	e3bf4fd546	This removes the eating of the error in Archive::Child::getSize() when the characters in the size field in the archive header for the member is not a number. To do this we have all of the needed methods return ErrorOr to push them up until we get out of lib. Then the tools and can handle the error in whatever way is appropriate for that tool. So the solution is to plumb all the ErrorOr stuff through everything that touches archives. This include its iterators as one can create an Archive object but the first or any other Child object may fail to be created due to a bad size field in its header. Thanks to Lang Hames on the changes making child_iterator contain an ErrorOr<Child> instead of a Child and the needed changes to ErrorOr.h to add operator overloading for * and -> . We don’t want to use llvm_unreachable() as it calls abort() and is produces a “crash” and using report_fatal_error() to move the error checking will cause the program to stop, neither of which are really correct in library code. There are still some uses of these that should be cleaned up in this library code for other than the size field. Also corrected the code where the size gets us to the “at the end of the archive” which is OK but past the end of the archive will return object_error::parse_failed now. The test cases use archives with text files so one can see the non-digit character, in this case a ‘%’, in the size field. llvm-svn: 250906	2015-10-21 16:59:24 +00:00
Craig Topper	8ea2390c35	[Option] Use an ArrayRef to store the Option Infos in OptTable. NFC llvm-svn: 250901	2015-10-21 16:30:42 +00:00
Vedant Kumar	aaead3309a	[llvm-cov] Adjust column widths for function and file reports Previously, we only expanded function and filename column widths when rendering file reports. This commit makes the change for function reports as well. llvm-svn: 250900	2015-10-21 16:03:32 +00:00
Daniel Sanders	d6cf3e05ef	[mips][mips16] Re-work the inline assembly stubs to work with IAS. NFC. Summary: Previously, we were inserting an InlineAsm statement for each line of the inline assembly. This works for GAS but it triggers prologue/epilogue emission when IAS is in use. This caused: .set noreorder .cpload $25 to be emitted as: .set push .set reorder .set noreorder .set pop .set push .set reorder .cpload $25 .set pop which led to assembler errors and caused the test to fail. The whitespace-after-comma changes included in this patch are necessary to match the output when IAS is in use. Reviewers: vkalintiris Subscribers: rkotler, llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D13653 llvm-svn: 250895	2015-10-21 12:44:14 +00:00
Chandler Carruth	2be10754a9	[AA] Enhance the new AliasAnalysis infrastructure with an optional "external" AA wrapper pass. This is a generic hook that can be used to thread custom code into the primary AAResultsWrapperPass for the legacy pass manager in order to allow it to merge external AA results into the AA results it is building. It does this by threading in a raw callback and so it is very powerful and should serve almost any use case I have come up with for extending the set of alias analyses used. The only thing not well supported here is using a different order of alias analyses. That form of extension is supportable with the new pass manager, and I can make the callback structure here more elaborate to support it in the legacy pass manager if this is a critical use case that people are already depending on, but the only use cases I have heard of thus far should be reasonably satisfied by this simpler extension mechanism. It is hard to test this using normal facilities (the built-in AAs don't use this for obvious reasons) so I've written a fairly extensive set of custom passes in the alias analysis unit test that should be an excellent test case because it models the out-of-tree users: it adds a totally custom AA to the system. This should also serve as a reasonably good example and guide for out-of-tree users to follow in order to rig up their existing alias analyses. No support in opt for commandline control is provided here however. I'm really unhappy with the kind of contortions that would be required to support that. It would fully re-introduce the analysis group self-recursion kind of patterns. =/ I've heard from out-of-tree users that this will unblock their use cases with extending AAs on top of the new infrastructure and let us retain the new analysis-group-free-world. Differential Revision: http://reviews.llvm.org/D13418 llvm-svn: 250894	2015-10-21 12:15:19 +00:00
Elena Demikhovsky	3ad76a1acd	Masked Load/Store optimization for scalar code When we have to convert the masked.load, masked.store to scalar code, we generate a chain of conditional basic blocks. I added optimization for constant mask vector. Differential Revision: http://reviews.llvm.org/D13855 llvm-svn: 250893	2015-10-21 11:50:54 +00:00
Daniel Sanders	0f596814e9	[mips][msa] Remove copy_u.d and move copy_u.w to MSA64. Summary: The forwards compatibility strategy employed by MIPS is to consider registers to be infinitely sign-extended. Then on ISA's with a wider register, the result of existing instructions are sign-extended to register width and zero-extended counterparts are added. copy_u.w on MSA32 and copy_u.w on MSA64 violate this strategy and we have therefore corrected the MSA specs to fix this. We still keep track of sign/zero-extension during legalization but we now match copy_s.[wd] where required. No change required to clang since __builtin_msa_copy_u_[wd] will map to copy_s.[wd] where appropriate for the target. Reviewers: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13472 llvm-svn: 250887	2015-10-21 09:58:54 +00:00
Jonas Paulsson	17ad04535f	Let MachineVerifier be aware of mem-to-mem instructions. A mem-to-mem instruction (that both loads and stores), which store to an FI, cannot pass the verifier since it thinks it is loading from the FI. For the mem-to-mem instruction, do a looser check in visitMachineOperand() and only check liveness at the reg-slot while analyzing a frame index operand. Needed to make CodeGen/SystemZ/xor-01.ll pass with -verify-machineinstrs, which now runs with this flag. Reviewed by Evan Cheng and Quentin Colombet. llvm-svn: 250885	2015-10-21 07:39:47 +00:00
Mehdi Amini	4215236621	Do not use `dyn_cast<X>` after `isa<X>` (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 250883	2015-10-21 06:11:01 +00:00
Mehdi Amini	d263856646	Revert "Add missing #include, found by modules build." This reverts commit r250239. It seems unwanted changes got committed here, and part of the patch does not seem correct. For instance RoundUpToAlignment() is called without its returned value actually used. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 250882	2015-10-21 06:10:55 +00:00
Krzysztof Parzyszek	fdb7b693a7	Tail duplication can mix incompatible registers in phi nodes Do not tail duplicate blocks where the successor has a phi node, and the corresponding value in that phi node uses a subregister. http://reviews.llvm.org/D13922 llvm-svn: 250877	2015-10-21 02:40:06 +00:00
JF Bastien	1a59c6b2c9	WebAssembly: support imports C/C++ code can declare an extern function, which will show up as an import in WebAssembly's output. It's expected that the linker will resolve these, and mark unresolved imports as call_import (I have a patch which does this in wasmate). llvm-svn: 250875	2015-10-21 02:23:09 +00:00
Dehao Chen	100424124b	Tolerate negative offset when matching sample profile. In some cases (as illustrated in the unittest), lineno can be less than the heade_lineno because the function body are included from some other files. In this case, offset will be negative. This patch makes clang still able to match the profile to IR in this situation. http://reviews.llvm.org/D13914 llvm-svn: 250873	2015-10-21 01:22:27 +00:00
Krzysztof Parzyszek	ced9941cd4	[Hexagon] Bit-based instruction simplification Analyze bit patterns of operands and values of instructions to perform various simplifications, dead/redundant code elimination, etc. llvm-svn: 250868	2015-10-20 22:57:13 +00:00
Krzysztof Parzyszek	26b2c9080f	[Hexagon] Fix isNVStorable flag in .td files An upper half and a double word cannot be used as value sources in a new-value store. llvm-svn: 250867	2015-10-20 22:40:57 +00:00
Igor Laevsky	68688df94c	[MemorySanitizer] NFC. Do not use GET_INTRINSIC_MODREF_BEHAVIOR table. It is now possible to infer intrinsic modref behaviour purely from intrinsic attributes. This change will allow to completely remove GET_INTRINSIC_MODREF_BEHAVIOR table. Differential Revision: http://reviews.llvm.org/D13907 llvm-svn: 250860	2015-10-20 21:33:30 +00:00
Simon Pilgrim	bb17881731	[X86][SSE] Add 256-bit vector bit rotation tests. llvm-svn: 250853	2015-10-20 20:27:23 +00:00
Duncan P. N. Exon Smith	306d16e038	bugpoint: Remove implicit ilist iterator conversions, NFC This is the last of the implicit ilist iterator conversions in LLVM. Still up for debate whether we let these bitrot back: http://lists.llvm.org/pipermail/llvm-dev/2015-October/091617.html llvm-svn: 250852	2015-10-20 19:36:39 +00:00
Krzysztof Parzyszek	79512b88b0	[Hexagon] Capture aggregate variables by reference, not value llvm-svn: 250851	2015-10-20 19:33:46 +00:00
Krzysztof Parzyszek	e4cff4058c	[Hexagon] Do not fall-through if there is no CFG edge llvm-svn: 250850	2015-10-20 19:30:21 +00:00
Krzysztof Parzyszek	bfe8e92fd1	[Hexagon] Use symbolic name for subregister instead of hardcoded number llvm-svn: 250849	2015-10-20 19:26:36 +00:00
Krzysztof Parzyszek	0257905f27	[Hexagon] Change Based->Base in getBasedWithImmOffset llvm-svn: 250848	2015-10-20 19:21:05 +00:00
Krzysztof Parzyszek	05da79d5ac	[Hexagon] Remove the remnants of isConstExtProfitable llvm-svn: 250845	2015-10-20 19:04:53 +00:00
Duncan P. N. Exon Smith	c8925b1871	unittests: Remove implicit ilist iterator conversions, NFC llvm-svn: 250843	2015-10-20 18:30:20 +00:00
Duncan P. N. Exon Smith	b7fcadf410	llvm-diff: Remove implicit ilist iterator conversions, NFC llvm-svn: 250842	2015-10-20 18:17:05 +00:00
Chris Bieneman	e82e7ce9d7	[CMake] All the checks for if LLVM_VERSION_* variables are set need to be if(DEFINED ...) This is because if you set one of the variables to 0, if(NOT ...) is true, which isn't what you actually want. Should have thought that through better the first time. llvm-svn: 250841	2015-10-20 18:16:37 +00:00
Chris Bieneman	70997c3fb8	[CMake] Refactor subdirectory inclusion code to take a project name. Summary: This refactoring makes some of the code used to control including subdirectories parameterized so it can be re-used elsewhere. Specifically I want to re-use this code in clang to be able to turn off specific tool subdirectories. Reviewers: chapuni, filcab, bogner, Bigcheese Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D13783 llvm-svn: 250835	2015-10-20 16:42:58 +00:00
Artyom Skrobov	c736863a85	Two switch blocks in VectorLegalizer::LegalizeOp already have a default: llvm_unreachable("This action is not supported yet!"); -- so I'm adding one to the third switch block, too. This is a follow-up fix for http://reviews.llvm.org/D13862 llvm-svn: 250830	2015-10-20 15:06:37 +00:00
Jonas Paulsson	4b29f6f7f7	[SystemZ] Use LivePhysRegs helper class in SystemZShortenInst.cpp. Don't use home brewed liveness tracking code for phys regs, since this class does the job. Reviewed by Ulrich Weigand. llvm-svn: 250829	2015-10-20 15:05:58 +00:00
Jonas Paulsson	5558536a4e	[SystemZ] Comment fix in test/CodeGen/SystemZ/fp-cmp-05.ll llvm-svn: 250828	2015-10-20 15:05:54 +00:00
Artyom Skrobov	7fd67e25aa	Adding support for TargetLoweringBase::LibCall Summary: TargetLoweringBase::Expand is defined as "Try to expand this to other ops, otherwise use a libcall." For ISD::UDIV and ISD::SDIV, the choice between the two possibilities was defined in a rather convoluted way: - if DIVREM is legal, expand to DIVREM - if DIVREM has a custom lowering, expand to DIVREM - if DIVREM libcall is defined and a remainder from the same division is computed elsewhere, expand to a DIVREM libcall - else, expand to a DIV libcall This had the undesirable effect that if both DIV and DIVREM are implemented as libcalls, then ISD::UDIV and ISD::SDIV are expanded to the heavier DIVREM libcall, even when the remainder isn't used. The new code adds a new LegalizeAction, TargetLoweringBase::LibCall, so that backends can directly control whether they prefer an expansion or a conversion to a libcall. This makes the generic lowering code even more generic, allowing its reuse in a wider range of target-specific configurations. The useful effect is that ARM backend will now generate a call to __aeabi_{i,u}div rather than __aeabi_{i,u}divmod in cases where it doesn't need the remainder. There's no functional change outside the ARM backend. Reviewers: t.p.northover, rengolin Subscribers: t.p.northover, llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D13862 llvm-svn: 250826	2015-10-20 13:14:52 +00:00
Artyom Skrobov	b844fa7fc0	Combining DIV+REM->DIVREM doesn't belong in LegalizeDAG; move it over into DAGCombiner. Summary: In addition to moving the code over, this patch amends the DIV,REM -> DIVREM combining to run on all affected nodes at once: if the nodes are converted to DIVREM one at a time, then the resulting DIVREM may get legalized by the backend into something target-specific that we won't be able to recognize and correlate with the remaining nodes. The motivation is to "prepare terrain" for D13862: when we set DIV and REM to be legalized to libcalls, instead of the DIVREM, we otherwise lose the ability to combine them together. To prevent this, we need to take the DIV,REM -> DIVREM combining out of the lowering stage. Reviewers: RKSimon, eli.friedman, rengolin Subscribers: john.brawn, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13733 llvm-svn: 250825	2015-10-20 13:06:02 +00:00
Igor Breger	21296d230a	AVX512: Implemented encoding and intrinsics for VPBROADCASTB/W/D/Q instructions. Differential Revision: http://reviews.llvm.org/D13884 llvm-svn: 250819	2015-10-20 11:56:42 +00:00
Andrea Di Biagio	9a85b7abe0	[x86] Fix AVX maskload/store intrinsic prototypes. The mask value type for maskload/maskstore GCC builtins is never a vector of packed floats/doubles. This patch fixes the following issues: 1. The mask argument for builtin_ia32_maskloadpd and builtin_ia32_maskstorepd should be of type llvm_v2i64_ty and not llvm_v2f64_ty. 2. The mask argument for builtin_ia32_maskloadpd256 and builtin_ia32_maskstorepd256 should be of type llvm_v4i64_ty and not llvm_v4f64_ty. 3. The mask argument for builtin_ia32_maskloadps and builtin_ia32_maskstoreps should be of type llvm_v4i32_ty and not llvm_v4f32_ty. 4. The mask argument for builtin_ia32_maskloadps256 and builtin_ia32_maskstoreps256 should be of type llvm_v8i32_ty and not llvm_v8f32_ty. Differential Revision: http://reviews.llvm.org/D13776 llvm-svn: 250817	2015-10-20 11:20:13 +00:00
Keno Fischer	a010cfa592	Fix missing INITIALIZE_PASS_DEPENDENCY for AddressSanitizer Summary: In r231241, TargetLibraryInfoWrapperPass was added to `getAnalysisUsage` for `AddressSanitizer`, but the corresponding `INITIALIZE_PASS_DEPENDENCY` was not added. Reviewers: dvyukov, chandlerc, kcc Subscribers: kcc, llvm-commits Differential Revision: http://reviews.llvm.org/D13629 llvm-svn: 250813	2015-10-20 10:13:55 +00:00
Manuel Klimek	100f40c0fd	Make class final to pacify -Wnon-virtual-dtor. llvm-svn: 250805	2015-10-20 08:21:01 +00:00
Matt Arsenault	3add6439d0	AMDGPU: Add MachineInstr overloads for instruction format tests llvm-svn: 250797	2015-10-20 04:35:43 +00:00
Lang Hames	c005656052	[Orc] Make CompileOnDemandLayer::findSymbol call BaseLayer::findSymbol if no symbol definition is found in the logical dylibs. llvm-svn: 250796	2015-10-20 04:35:02 +00:00
Matt Arsenault	8f18917a90	AMDGPU: Stop reserving v[254:255] This wasn't doing anything useful. They weren't explicitly used anywhere, and the RegScavenger ignores reserved registers. This for some reason caused a random scheduling change in the test. Getting the check lines to pass is too frustrating, and there's probably not too much value in checking the vector case's operands N times. llvm-svn: 250794	2015-10-20 03:59:58 +00:00
JF Bastien	c8f89e86d5	WebAssembly: fix call/return syntax. They are now typeless, unlike other operations. llvm-svn: 250793	2015-10-20 01:26:54 +00:00
Duncan P. N. Exon Smith	c4829deae8	MSP430: Remove implicit ilist iterator conversions, NFC llvm-svn: 250792	2015-10-20 01:18:39 +00:00
Duncan P. N. Exon Smith	ac331fbdb6	AsmParser: Remove implicit ilist iterator conversions, NFC llvm-svn: 250791	2015-10-20 01:12:49 +00:00
Duncan P. N. Exon Smith	a2c90e4743	SystemZ: Remove implicit ilist iterator conversion, NFC llvm-svn: 250790	2015-10-20 01:12:46 +00:00
Duncan P. N. Exon Smith	0ce253d3a9	XCore: Remove implicit ilist iterator conversions, NFC llvm-svn: 250788	2015-10-20 01:07:42 +00:00
Duncan P. N. Exon Smith	ac65b4c422	PowerPC: Remove implicit ilist iterator conversions, NFC llvm-svn: 250787	2015-10-20 01:07:37 +00:00
Sanjoy Das	3020b1bc8c	[RS4GC] Remove a redundant linear search, NFCI Since LiveVariables is uniqued (we just created it from a `DenseSet`), `FindIndex(LiveVariables, LiveVariables[i])` is always `i`. llvm-svn: 250786	2015-10-20 01:06:31 +00:00
Sanjoy Das	b1942f14cd	[RS4GC] Clean up `find_index`; NFC - Bring it up to the LLVM Coding Style - Sink it inside `CreateGCRelocates`, which is its only user llvm-svn: 250785	2015-10-20 01:06:28 +00:00
Sanjoy Das	7ad67640e9	[RS4GC] Re-purpose `normalizeForInvokeSafepoint`; NFC. `normalizeForInvokeSafepoint` in RewriteStatepointsForGC.cpp, as it is written today, deals with `gc.relocate` and `gc.result` uses of a statepoint equally well. This change documents this fact and adds a test case. There is no functional change here -- only documentation of existing functionality. llvm-svn: 250784	2015-10-20 01:06:24 +00:00
Sanjoy Das	ff3dba736a	[RS4GC] Minor cleanup to `normalizeForInvokeSafepoint`; NFC llvm-svn: 250783	2015-10-20 01:06:17 +00:00
Duncan P. N. Exon Smith	c3f7988472	Sparc: Remove implicit ilist iterator conversions, NFC llvm-svn: 250781	2015-10-20 00:59:43 +00:00
Duncan P. N. Exon Smith	61149b86c3	NVPTX: Remove implicit ilist iterator conversions, NFC llvm-svn: 250779	2015-10-20 00:54:09 +00:00
Duncan P. N. Exon Smith	a72c6e25ec	Hexagon: Remove implicit ilist iterator conversions, NFC There are two things out of the ordinary in this commit. First, I made a loop obviously "infinite" in HexagonInstrInfo.cpp. After checking if an instruction was at the beginning of a basic block (in which case, `break`), the loop decremented and checked the iterator for `nullptr` as the loop condition. This has never been possible (the prev pointers are always been circular, so even with the weird ilist/iplist implementation, this isn't been possible), so I removed the condition. Second, in HexagonAsmPrinter.cpp there was another case of comparing a `MachineBasicBlock::instr_iterator` against `MachineBasicBlock::end()` (which returns `MachineBasicBlock::iterator`). While not incorrect, it's fragile. I switched this to `::instr_end()`. All that said, no functionality change intended here. llvm-svn: 250778	2015-10-20 00:46:39 +00:00
JF Bastien	3b0177c542	WebAssembly: fix syntax for br_if. llvm-svn: 250777	2015-10-20 00:37:42 +00:00
Duncan P. N. Exon Smith	a25ad0685a	AsmPrinter: Remove implicit ilist iterator conversion, NFC llvm-svn: 250776	2015-10-20 00:36:08 +00:00
Duncan P. N. Exon Smith	7869148c47	Mips: Remove implicit ilist iterator conversions, NFC llvm-svn: 250769	2015-10-20 00:15:20 +00:00
Duncan P. N. Exon Smith	19d951874a	CppBackend: Remove implicit ilist iterator conversions, NFC Mostly just converted to range-based for loops. May have converted a couple of extra loops as a drive-by (not sure). llvm-svn: 250766	2015-10-20 00:06:41 +00:00
Duncan P. N. Exon Smith	d95fa4ccf1	BPF: Remove implicit ilist iterator conversion, NFC llvm-svn: 250765	2015-10-20 00:02:50 +00:00
Duncan P. N. Exon Smith	9f9559e807	ARM: Remove implicit ilist iterator conversions, NFC llvm-svn: 250759	2015-10-19 23:25:57 +00:00
Lang Hames	6372a0bd51	[Orc] Fix MSVC bugs introduced in r250749. llvm-svn: 250758	2015-10-19 23:23:17 +00:00
Duncan P. N. Exon Smith	1e59a66c69	ObjCARC: Remove implicit ilist iterator conversions, NFC llvm-svn: 250756	2015-10-19 23:20:14 +00:00
Cong Hou	7745dbc5c4	Enhance loop rotation with existence of profile data in MachineBlockPlacement pass. Currently, in MachineBlockPlacement pass the loop is rotated to let the best exit to be the last BB in the loop chain, to maximize the fall-through from the loop to outside. With profile data, we can determine the cost in terms of missed fall through opportunities when rotating a loop chain and select the best rotation. Basically, there are three kinds of cost to consider for each rotation: 1. The possibly missed fall through edge (if it exists) from BB out of the loop to the loop header. 2. The possibly missed fall through edges (if they exist) from the loop exits to BB out of the loop. 3. The missed fall through edge (if it exists) from the last BB to the first BB in the loop chain. Therefore, the cost for a given rotation is the sum of costs listed above. We select the best rotation with the smallest cost. This is only for PGO mode when we have more precise edge frequencies. Differential revision: http://reviews.llvm.org/D10717 llvm-svn: 250754	2015-10-19 23:16:40 +00:00
Lang Hames	16a4cfb80d	[Orc] Use '= default' for move constructor/assignment as per dblaikie's review. Thanks Dave! llvm-svn: 250749	2015-10-19 22:49:18 +00:00
Duncan P. N. Exon Smith	9934b26b0f	Linker: Remove implicit ilist iterator conversion, NFC llvm-svn: 250748	2015-10-19 22:23:36 +00:00
David Blaikie	437aafdab2	Fix -Wdeprecated regarding ORC copying ValueMaterializers As usual, this is a polymorphic hierarchy without polymorphic ownership, so simply make the dtor protected non-virtual, protected default copy ctor/assign, and make derived classes final. The derived classes will pick up correct default public copy ops (and dtor) implicitly. (wish I could add -Wdeprecated to the build, but last time I tried it triggered on some system headers I still need to look into/figure out) llvm-svn: 250747	2015-10-19 22:15:55 +00:00
Michael Liao	c65d386b81	[InstCombine] Optimize icmp of inc/dec at RHS Allow LLVM to optimize the sequence like the following: %inc = add nsw i32 %i, 1 %cmp = icmp slt %n, %inc into: %cmp = icmp sle i32 %n, %i The case is not handled previously due to the complexity of compuation of %n. Hence, LLVM cannot swap operands of icmp accordingly. llvm-svn: 250746	2015-10-19 22:08:14 +00:00
Duncan P. N. Exon Smith	6b92a14a28	Vectorize: Remove implicit ilist iterator conversions, NFC Besides the usual, I finally added an overload to `BasicBlock::splitBasicBlock()` that accepts an `Instruction*` instead of `BasicBlock::iterator`. Someone can go back and remove this overload later (after updating the callers I'm going to skip going forward), but the most common call seems to be `BB->splitBasicBlock(BB->getTerminator(), ...)` and I'm not sure it's better to add `->getIterator()` to every one than have the overload. It's pretty hard to get the usage wrong. llvm-svn: 250745	2015-10-19 22:06:09 +00:00
Sanjay Patel	69a50a1e17	[CGP] transform select instructions into branches and sink expensive operands This was originally checked in at r250527, but reverted at r250570 because of PR25222. There were at least 2 problems: 1. The cost check was checking for an instruction with an exact cost of TCC_Expensive; that should have been >=. 2. The cause of the clang stage 1 failures was illegally sinking 'call' instructions; we can't sink instructions that may have side effects / are not safe to execute speculatively. Fixed those conditions in sinkSelectOperand() and added test cases. Original commit message: This is a follow-up to the discussion in D12882. Ideally, we would like SimplifyCFG to be able to form select instructions even when the operands are expensive (as defined by the TTI cost model) because that may expose further optimizations. However, we would then like a later pass like CodeGenPrepare to undo that transformation if the target would likely benefit from not speculatively executing an expensive op (this patch). Once we have this safety mechanism in place, we can adjust SimplifyCFG to restore its select-formation behavior that changed with r248439. Differential Revision: http://reviews.llvm.org/D13297 llvm-svn: 250743	2015-10-19 21:59:12 +00:00
Duncan P. N. Exon Smith	d77de6495e	X86: Remove implicit ilist iterator conversions, NFC llvm-svn: 250741	2015-10-19 21:48:29 +00:00
Lang Hames	f1381cb8d0	[RuntimeDyld][COFF] Fix some endianness issues, re-enable the regression test. llvm-svn: 250733	2015-10-19 20:37:52 +00:00
Owen Anderson	faf5187ee0	Restore the original behavior of SelectionDAG::getTargetIndex(). It looks like an extra negation snuck in as apart of restoring it. llvm-svn: 250726	2015-10-19 19:27:40 +00:00
Krzysztof Parzyszek	055c5fd74e	[Hexagon] Remove unnecessary argument sign extends llvm-svn: 250724	2015-10-19 19:10:48 +00:00
Teresa Johnson	3da931f87a	Pass FunctionInfoIndex by reference to WriteFunctionSummaryToFile (NFC) Implemented suggestion by dblakie in review for r250704. llvm-svn: 250723	2015-10-19 19:06:06 +00:00
Lang Hames	2b9e24c02c	[Orc] Add explicit move constructor and assignment operator to make MSVC happy. llvm-svn: 250722	2015-10-19 18:59:22 +00:00
Benjamin Kramer	755e502952	Add missing override noticed by Clang's -Winconsistent-missing-override. llvm-svn: 250720	2015-10-19 18:41:23 +00:00
Jun Bum Lim	d3548303ec	[AArch64]Merge halfword loads into a 32-bit load Convert two halfword loads into a single 32-bit word load with bitfield extract instructions. For example : ldrh w0, [x2] ldrh w1, [x2, #2] becomes ldr w0, [x2] ubfx w1, w0, #16, #16 and w0, w0, #ffff llvm-svn: 250719	2015-10-19 18:34:53 +00:00
Krzysztof Parzyszek	23920ec95d	[Hexagon] Fix debug information for local objects - Isolate the check for the existence of a stack frame into hasFP. - Implement getFrameIndexReference for DWARF address computation. - Use getFrameIndexReference for offset computation in eliminateFrameIndex. - Preserve debug information for dynamically allocated stack objects. - Prefer FP to access local objects at -O0. - Add experimental code to skip allocframe when not strictly necessary (disabled by default). llvm-svn: 250718	2015-10-19 18:30:27 +00:00
Benjamin Kramer	2002aadaad	Put back SelectionDAG::getTargetIndex. While technically this is untested dead code, it has out-of-tree users. This reverts a part of r250434. llvm-svn: 250717	2015-10-19 18:26:16 +00:00
Lang Hames	80fee77e75	[Orc] Lambda needs to capture 'this'. llvm-svn: 250716	2015-10-19 17:53:43 +00:00
Lang Hames	4c17f51b73	[Orc] Remove extraneous semicolon that found its way into r250712. llvm-svn: 250715	2015-10-19 17:49:37 +00:00
Krzysztof Parzyszek	db8677067c	[Hexagon] Delay emission of CFI instructions Emit the CFI instructions after all code transformation have been done. This will avoid any interference between CFI instructions and packetization. llvm-svn: 250714	2015-10-19 17:46:01 +00:00
Matthias Braun	e734195ce3	Revert "RegisterPressure: allocatable physreg uses are always kills" This reverts commit r250596. Reverted for now as the commit triggers assert in the AMDGPU target pending investigation. llvm-svn: 250713	2015-10-19 17:44:22 +00:00
Lang Hames	98c2ac13d3	[Orc] Add support for emitting indirect stubs directly into the JIT target's memory, rather than representing the stubs in IR. Update the CompileOnDemand layer to use this functionality. Directly emitting stubs is much cheaper than building them in IR and codegen'ing them (see below). It also plays well with remote JITing - stubs can be emitted directly in the target process, rather than having to send them over the wire. The downsides are: (1) Care must be taken when resolving symbols, as stub symbols are held in a separate symbol table. This is only a problem for layer writers and other people using this API directly. The CompileOnDemand layer hides this detail. (2) Aliases of function stubs can't be symbolic any more (since there's no symbol definition in IR), but must be converted into a constant pointer expression. This means that modules containing aliases of stubs cannot be cached. In practice this is unlikely to be a problem: There's no benefit to caching such a module anyway. On balance I think the extra performance is more than worth the trade-offs: In a simple stress test with 10000 dummy functions requiring stubs and a single executed "hello world" main function, directly emitting stubs reduced user time for JITing / executing by over 90% (1.5s for IR stubs vs 0.1s for direct emission). llvm-svn: 250712	2015-10-19 17:43:51 +00:00
Teresa Johnson	8af8d4ed93	Convert gold-plugin unnecessary unique_ptr into local (NFC) llvm-svn: 250704	2015-10-19 15:23:03 +00:00
Teresa Johnson	a80b50820b	Fix required library for r250699 to BitWriter instead of BitReader. This should fix the mingw3 bot failure. llvm-svn: 250703	2015-10-19 15:21:46 +00:00
Teresa Johnson	bd715d4702	Fix windows bot failures from r250699 by removing "/" from expected path in test output. llvm-svn: 250701	2015-10-19 15:19:02 +00:00
Teresa Johnson	91a88bba0e	llvm-lto support for generating combined function indexes Summary: This patch adds support to llvm-lto that mirrors the support added by r249270 to the gold plugin. This enables better testing of combined index generation for ThinLTO. Added a new test, and this support will be used in the test in D13515. Reviewers: joker.eph Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13847 llvm-svn: 250699	2015-10-19 14:30:44 +00:00
Benjamin Kramer	335332329b	Remove CRLF newlines. NFC. llvm-svn: 250698	2015-10-19 13:05:25 +00:00
Asiri Rathnayake	1040a53be3	Fix mapping of @llvm.arm.ssat/usat intrinsics to ssat/usat instructions The mapping of these two intrinsics in ARMInstrInfo.td had a small omission which lead to their operands not being validated/transformed before being lowered into usat and ssat instructions. This can cause incorrect instructions to be emitted. I've also added tests for the remaining two saturating arithmatic intrinsics @llvm.arm.qadd and @llvm.arm.qsub as they are missing codegen tests. llvm-svn: 250697	2015-10-19 11:44:24 +00:00
James Molloy	17379c4ea1	[GlobalsAA] Fix a really horrible iterator invalidation bug We were keeping a reference to an object in a DenseMap then mutating it. At the end of the function we were attempting to clone that reference into other keys in the DenseMap, but DenseMap may well decide to resize its hashtable which would invalidate the reference! It took an extremely complex testcase to catch this - many thanks to Zhendong Su for catching it in PR25225. This fixes PR25225. llvm-svn: 250692	2015-10-19 08:54:59 +00:00
Elena Demikhovsky	20662e39f1	Removed parameter "Consecutive" from isLegalMaskedLoad() / isLegalMaskedStore(). Originally I planned to use the same interface for masked gather/scatter and set isConsecutive to "false" in this case. Now I'm implementing masked gather/scatter and see that the interface is inconvenient. I want to add interfaces isLegalMaskedGather() / isLegalMaskedScatter() instead of using the "Consecutive" parameter in the existing interfaces. Differential Revision: http://reviews.llvm.org/D13850 llvm-svn: 250686	2015-10-19 07:43:38 +00:00
Zlatko Buljan	5292083584	[mips][microMIPS] Implement ADDQ.PH, ADDQ_S.W, ADDQH.PH, ADDQH.W, ADDSC, ADDU.PH, ADDU_S.QB, ADDWC and ADDUH.QB instructions Differential Revision: http://reviews.llvm.org/D13130 llvm-svn: 250685	2015-10-19 07:16:26 +00:00
Zlatko Buljan	d0a7d6e4ee	[mips][microMIPS] Implement ABSQ.QB, ABSQ_S.PH, ABSQ_S.W, ABSQ_S.QB, INSV, MADD, MADDU, MSUB, MSUBU, MULT and MULTU instructions Differential Revision: http://reviews.llvm.org/D13721 llvm-svn: 250683	2015-10-19 06:34:44 +00:00
Xinliang David Li	aa0592cc70	[PGO] Eliminate prof data register calls on FreeBSD platform This is a follow up patch of r250199 after verifying the start/stop section symbols work as spected on FreeBSD. llvm-svn: 250679	2015-10-19 04:17:10 +00:00
Jakub Staszak	f12821a43c	Preserve CFG in MergedLoadStoreMotion. This fixes PR24426. llvm-svn: 250660	2015-10-18 19:34:10 +00:00
Rafael Espindola	da3b3c78e7	Add hashing and DenseMapInfo for ArrayRef Sometimes it is more natural to use a ArrayRef<uint8_t> than a StringRef to represent a range of bytes that is not, semantically, a string. This will be used in lld in a sec. llvm-svn: 250658	2015-10-18 14:04:56 +00:00
Simon Pilgrim	4708060e94	[X86][SSE] Add vector bit rotation tests. llvm-svn: 250656	2015-10-18 12:54:37 +00:00
Simon Pilgrim	04d52d26f6	Use SDValue bool check. NFCI. llvm-svn: 250653	2015-10-18 12:33:54 +00:00
Simon Pilgrim	c2c154e078	Move one-use variable inside test. NFC. llvm-svn: 250651	2015-10-18 11:47:23 +00:00
Asaf Badouh	696e8e0bb7	[X86][AVX512DQ] add scalar fpclass Differential Revision: http://reviews.llvm.org/D13769 llvm-svn: 250650	2015-10-18 11:04:38 +00:00
Igor Breger	cbb9550537	AVX512: Lowering i8/i16 vector CTLZ using the dword LZCNT vector instruction Differential Revision: http://reviews.llvm.org/D13632 llvm-svn: 250649	2015-10-18 09:56:39 +00:00
Craig Topper	92cfdd70f8	[Sparc] Use MCPhysReg instead of unsigned to size static arrays of registers. Should reduce the table size. llvm-svn: 250644	2015-10-18 05:29:05 +00:00
Craig Topper	1d37443718	Use array_lengthof. NFC llvm-svn: 250643	2015-10-18 05:15:38 +00:00
Craig Topper	2626094fa1	Make a bunch of static arrays const. llvm-svn: 250642	2015-10-18 05:15:34 +00:00
Lang Hames	a32d71be4c	[RuntimeDyld] Add support for absolute symbols. llvm-svn: 250639	2015-10-18 01:41:37 +00:00
Xinliang David Li	dab183ed40	Minor Instr PGO code restructuring 1. Key constant values (version, magic) and data structures related to raw and indexed profile format are moved into one centralized file: InstrProf.h. 2. Utility function such as MD5Hash computation is also moved to the common header to allow sharing with other components in the future. 3. A header data structure is introduced for Indexed format so that the reader and writer can always be in sync. 4. Added some comments to document different places where multiple definition of the data structure must be kept in sync (reader/writer, runtime, lowering etc). No functional change is intended. Differential Revision: http://reviews.llvm.org/D13758 llvm-svn: 250638	2015-10-18 01:02:29 +00:00
Sanjoy Das	d295f2c7ca	[SCEV] Fix whitespace issues and remove extra braces; NFC llvm-svn: 250636	2015-10-18 00:29:27 +00:00
Sanjoy Das	f07d2a7143	[SCEV] Use std::all_of and std::any_of; NFC llvm-svn: 250635	2015-10-18 00:29:23 +00:00
Sanjoy Das	6391459069	[SCEV] Use auto where it helps remove line breaks; NFC llvm-svn: 250634	2015-10-18 00:29:20 +00:00
Sanjoy Das	d9f6d33a7f	[SCEV] Use range for loops; NFC llvm-svn: 250633	2015-10-18 00:29:16 +00:00
Craig Topper	ec15ea12e7	Use std::find instead of manual loop. llvm-svn: 250624	2015-10-17 21:32:28 +00:00
Craig Topper	a833451173	Use std::is_sorted to replace a custom version. Also replace a comparison predicate struct with a lambda. llvm-svn: 250623	2015-10-17 21:32:26 +00:00
Simon Pilgrim	86c5e85e84	[X86][XOP] Add VPROT instruction opcodes Added X86ISD opcodes for VPROT vector rotate by variable and by immediate. llvm-svn: 250620	2015-10-17 19:04:24 +00:00
Craig Topper	a2d0635098	Remove unnecessary 'const' pointed out by David Blaikie. llvm-svn: 250619	2015-10-17 18:22:46 +00:00
Simon Pilgrim	4773763bb0	[X86][XOP] Add VPROT rotate by immediate intrinsics tests llvm-svn: 250618	2015-10-17 18:21:53 +00:00
Simon Pilgrim	24057b9566	[DAG] Ensure vector constant folding uses correct scalar undef types Minor fix to D13665 found during post-commit review. llvm-svn: 250616	2015-10-17 16:49:43 +00:00
Craig Topper	9ff9bf4959	Replace a custom table sort check with std::is_sorted. Change a function to take ArrayRef instead of pointer and length. NFC llvm-svn: 250615	2015-10-17 16:37:13 +00:00
Craig Topper	c177d9edb3	Use std::begin/end and std::is_sorted to simplify some code. NFC llvm-svn: 250614	2015-10-17 16:37:11 +00:00
Craig Topper	84c571c962	Use binary search in isCPUStringValid since the array is sorted. llvm-svn: 250613	2015-10-17 16:37:09 +00:00
Simon Pilgrim	a18ae9bd70	[CostModel] Fixed AVX integer shift costs Targets with AVX but without AVX2 were incorrectly reporting costs of 256-bit integer shifts. llvm-svn: 250611	2015-10-17 13:23:38 +00:00
Simon Pilgrim	5b65f28fe7	[X86][FastISel] Teach how to select SSE4A nontemporal stores. Add FastISel support for SSE4A scalar float / double non-temporal stores Follow up to D13698 Differential Revision: http://reviews.llvm.org/D13773 llvm-svn: 250610	2015-10-17 13:04:42 +00:00
Simon Pilgrim	216b1bf5ed	[InstCombine] SSE4A constant folding and conversion to shuffles. This patch improves support for combining the SSE4A EXTRQ(I) and INSERTQ(I) intrinsics: 1 - Converts INSERTQ/EXTRQ calls to INSERTQI/EXTRQI if the 'bit index' and 'length' operands are constant 2 - Converts INSERTQI/EXTRQI calls to shufflevector if the bit index/length are both byte aligned (we can already lower shuffles to INSERTQI/EXTRQI if its useful) 3 - Constant folding support 4 - Add zeroinitializer handling Differential Revision: http://reviews.llvm.org/D13348 llvm-svn: 250609	2015-10-17 11:40:05 +00:00
Davide Italiano	d178f312dd	[JIT/Examples] Fix Fibonacci so that it runs again. The old JIT is (long) gone. llvm-svn: 250604	2015-10-17 06:36:46 +00:00
Kostya Serebryany	fed509e73d	[libFuzzer] add -shuffle flag llvm-svn: 250603	2015-10-17 04:38:26 +00:00
Colin LeMahieu	68d155be8e	[Hexagon] Reverting test file change. llvm-svn: 250601	2015-10-17 01:58:51 +00:00
Colin LeMahieu	7c9587136d	[Hexagon] Adding skeleton of HVX extension instructions. llvm-svn: 250600	2015-10-17 01:33:04 +00:00
Matthias Braun	65e6d4a3f8	RegisterPressure: Unify the sparse sets in LiveRegsSet; NFC Also do some cleanups comment improvements. llvm-svn: 250598	2015-10-17 01:03:44 +00:00
Matthias Braun	cdd2792aa6	RegisterPressure: allocatable physreg uses are always kills This property was already used in the code path when no liveness intervals are present. Unfortunately the code path that uses liveness intervals tried to query a cached live interval for an allocatable physreg, those are usually not computed so a conservative default was used. This doesn't affect any of the lit testcases. This is a foreclosure to upcoming changes which should be NFC but without this patch this tidbit wouldn't be NFC. llvm-svn: 250596	2015-10-17 00:46:57 +00:00
Matthias Braun	5105e05e8f	RegisterPressure: Remove 0 entries from PressureChange This should not change behaviour because as far as I can see all code reading the pressure changes has no effect if the PressureInc is 0. Removing these entries however does avoid unnecessary computation, and results in a more stable debug output. I want the stable debug output to check that some upcoming changes are indeed NFC and identical even at the debug output level. llvm-svn: 250595	2015-10-17 00:35:59 +00:00
JF Bastien	3428ed4f53	WebAssembly: don't omit dead vregs from locals Summary: This is a temporary hack until we get around to remapping the vreg numbers to local numbers. Dead vregs cause bad numbering and make consumers sad. We could also just look at debug info an use named locals instead, but vregs have to work properly anyways so there! Reviewers: binji, sunfish Subscribers: jfb, llvm-commits, dschuff Differential Revision: http://reviews.llvm.org/D13839 llvm-svn: 250594	2015-10-17 00:25:38 +00:00
JF Bastien	4f43e80ece	WebAssembly: fix the syntax for comparisons Summary: It has also slightly changed. Reviewers: binji Subscribers: jfb, dschuff, llvm-commits, sunfish Differential Revision: http://reviews.llvm.org/D13837 llvm-svn: 250591	2015-10-17 00:12:29 +00:00
Matthias Braun	96e411b90c	RegisterPressure: Hide non-const iterators of PressureDiff It is too easy to accidentally violate the ordering requirements when modifying the PressureDiff entries through iterators. llvm-svn: 250590	2015-10-17 00:08:48 +00:00
Matthias Braun	7f47272cda	StreamWriter: List basic types instead of derived ones in HexNumber This avoids problems with different (u)intXX definition on different platforms. Specifically this fixes a case on OS/X which had uint64_t defined as unsigned long long. llvm-svn: 250589	2015-10-17 00:08:45 +00:00
Joseph Tremoulet	55b51e9dcc	[WinEH] Fix eh.exceptionpointer intrinsic lowering Summary: Some shared code for handling eh.exceptionpointer and eh.exceptioncode needs to not share the part that truncates to 32 bits, which is intended just for exception codes. Reviewers: rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13747 llvm-svn: 250588	2015-10-17 00:08:08 +00:00
Reid Kleckner	ce10cec56a	Disable a test relying on symbol demangling on non-Windows platforms llvm-svn: 250587	2015-10-16 23:56:14 +00:00
Reid Kleckner	793d96386d	Speculative fix for GCC build llvm-svn: 250585	2015-10-16 23:53:12 +00:00
Reid Kleckner	28e490342b	[WinEH] Fix stack alignment in funclets and ParentFrameOffset calculation Our previous value of "16 + 8 + MaxCallFrameSize" for ParentFrameOffset is incorrect when CSRs are involved. We were supposed to have a test case to catch this, but it wasn't very rigorous. The main effect here is that calling _CxxThrowException inside a catchpad doesn't immediately crash on MOVAPS when you have an odd number of CSRs. llvm-svn: 250583	2015-10-16 23:43:27 +00:00
Reid Kleckner	02b74368ce	[llvm-symbolizer] Use the export table if no symbols are present This lets us make guesses about symbols in third party DLLs without debug info, like MSVCR120.dll or kernel32.dll. dbghelp does the same thing. llvm-svn: 250582	2015-10-16 23:43:22 +00:00
Matthias Braun	fdee8ec2bd	RegisterPressure: Use range based for, cleanup llvm-svn: 250579	2015-10-16 23:25:09 +00:00
Davide Italiano	4f05f32bb7	[llvm-readobj] Teach ELFDumper about symbol versioning. Differential Revision: http://reviews.llvm.org/D13824 llvm-svn: 250575	2015-10-16 23:19:01 +00:00
Xinliang David Li	91c7321ea9	Instroduce a template file to define InstrPGO core data structures. Changing PGO data format layout can be a pain. Many different places need to be touched and kept in sync. Failing to do so usually results in errors very time consuming to debug. This file is intended to be the master file that defines the layout of the core runtime data structures. Currently only two structure is covered: Per function ProfData structure and the function record structure used in coverage mapping. No client code has been made yet, so this commit is NFC. llvm-svn: 250574	2015-10-16 23:17:34 +00:00
Chris Bieneman	11a7160237	[CMake] Cleaning up and generalizing the LLVMInstallSymlink script so that it can be used for libraries too. In order to resolve PR25059, we're going to need to be able to generate symlinks to libraries manually, so I need this code to be reusable. llvm-svn: 250573	2015-10-16 23:17:13 +00:00
Kostya Serebryany	d6edce97fb	[libFuzzer] print a stack trace on timeout llvm-svn: 250571	2015-10-16 23:04:31 +00:00
Benjamin Kramer	b43d33bf0f	Revert "This is a follow-up to the discussion in D12882." Breaks clang selfhost, see PR25222. This reverts commits r250527 and r250528. llvm-svn: 250570	2015-10-16 23:00:29 +00:00
Kostya Serebryany	a9da9b48ef	[libFuzzer] reduce the size of artifacts printed on the screen llvm-svn: 250565	2015-10-16 22:47:20 +00:00
Kostya Serebryany	b91c62b1f3	[libFuzzer] When -test_single_input crashes the test it is not necessary to write crash-file because input is already known to the user. Patch by Mike Aizatsky llvm-svn: 250564	2015-10-16 22:41:47 +00:00
Sanjay Patel	bbd524496c	[x86] promote 'add nsw' to a wider type to allow more combines The motivation for this patch starts with PR20134: https://llvm.org/bugs/show_bug.cgi?id=20134 void foo(int *a, int i) { a[i] = a[i+1] + a[i+2]; } It seems better to produce this (14 bytes): movslq %esi, %rsi movl 0x4(%rdi,%rsi,4), %eax addl 0x8(%rdi,%rsi,4), %eax movl %eax, (%rdi,%rsi,4) Rather than this (22 bytes): leal 0x1(%rsi), %eax cltq leal 0x2(%rsi), %ecx movslq %ecx, %rcx movl (%rdi,%rcx,4), %ecx addl (%rdi,%rax,4), %ecx movslq %esi, %rax movl %ecx, (%rdi,%rax,4) The most basic problem (the first test case in the patch combines constants) should also be fixed in InstCombine, but it gets more complicated after that because we need to consider architecture and micro-architecture. For example, AArch64 may not see any benefit from the more general transform because the ISA solves the sexting in hardware. Some x86 chips may not want to replace 2 ADD insts with 1 LEA, and there's an attribute for that: FeatureSlowLEA. But I suspect that doesn't go far enough or maybe it's not getting used when it should; I'm also not sure if FeatureSlowLEA should also mean "slow complex addressing mode". I see no perf differences on test-suite with this change running on AMD Jaguar, but I see small code size improvements when building clang and the LLVM tools with the patched compiler. A more general solution to the sext(add nsw(x, C)) problem that works for multiple targets is available in CodeGenPrepare, but it may take quite a bit more work to get that to fire on all of the test cases that this patch takes care of. Differential Revision: http://reviews.llvm.org/D13757 llvm-svn: 250560	2015-10-16 22:14:12 +00:00
Jim Grosbach	0fdd572763	MC: Don't crash after issuing a diagnostic. Crashing is bad, m'kay? Fixing a 4 year old bug of my own creation. Adding the testcase now which I should have added then which would have long since caught this. The problem is that printMessage() will display the diagnostic but not set HadError to true, resulting in the assembler continuing on its way and trying to create relocations for things that may not allow them or otherwise get itself into trouble. Using the Error() helper function here rather than calling printMessage() directly resolves this. rdar://23133240 llvm-svn: 250557	2015-10-16 22:07:59 +00:00
Joseph Tremoulet	d11a998e81	[WinEH] Fix CatchRetSuccessorColorMap accounting Summary: We now use the block for the catchpad itself, rather than its normal successor, as the funclet entry. Putting the normal successor in the map leads downstream funclet membership computations to erroneous results. Reviewers: majnemer, rnk Subscribers: rnk, llvm-commits Differential Revision: http://reviews.llvm.org/D13798 llvm-svn: 250552	2015-10-16 21:22:54 +00:00
Andrew Kaylor	09b39acc03	Fix assertion failure with fp128 to unsigned i64 conversion Patch by Mitch Bodart Differential Revision: http://reviews.llvm.org/D13780 llvm-svn: 250550	2015-10-16 20:39:20 +00:00
Krzysztof Parzyszek	a7c5f0409c	[Hexagon] Split double registers llvm-svn: 250549	2015-10-16 20:38:54 +00:00
David Majnemer	e696583dba	[WinEH] Remove dead code/includes from WinEHPrepare No functionality change is intended. llvm-svn: 250545	2015-10-16 19:59:52 +00:00
Krzysztof Parzyszek	aec39c68ae	[Hexagon] Delete lib/Target/Hexagon/HexagonRemoveSZExtArgs.cpp llvm-svn: 250543	2015-10-16 19:51:53 +00:00
Krzysztof Parzyszek	5b7dd0cdf9	[Hexagon] Merge adjacent stores llvm-svn: 250542	2015-10-16 19:43:56 +00:00
Diego Novillo	b93483dbce	Sample profiles - Re-arrange binary format to emit head samples only on top functions. The number of samples collected at the head of a function only make sense for top-level functions (i.e., those actually called as opposed to being inlined inside another). Head samples essentially count the time spent inside the function's prologue. This clearly doesn't make sense for inlined functions, so we were always emitting 0 in those. llvm-svn: 250539	2015-10-16 18:54:35 +00:00
JF Bastien	6126d2b883	WebAssembly: fix load/store syntax Summary: The syntax has changed a bit recently. Reviewers: binji Subscribers: llvm-commits, jfb, sunfish, dschuff Differential Revision: http://reviews.llvm.org/D13821 llvm-svn: 250535	2015-10-16 18:24:42 +00:00
Joseph Tremoulet	53e9cbd95a	[WinEH] Fix endpad coloring/numbering Summary: When a cleanup's cleanupendpad or cleanupret targets a catchendpad, stop trying to propagate the cleanup's parent's color to the catchendpad, since what's needed is the cleanup's grandparent's color and the catchendpad will get that color from the catchpad linkage already. We already had this exclusion for invokes, but were missing it for cleanupendpad/cleanupret. Also add a missing line that tags cleanupendpads' states in the EHPadStateMap, without with lowering invokes that target cleanupendpads which unwind to other handlers (and so don't have the -1 state) will fail. This fixes the reduced IR repro in PR25163. Reviewers: majnemer, andrew.w.kaylor, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13797 llvm-svn: 250534	2015-10-16 18:08:16 +00:00
Yaron Keren	a6760f98cf	Fix typo, NFC. llvm-svn: 250529	2015-10-16 17:50:47 +00:00
Sanjay Patel	bb5d550c3d	move test case to x86 directory because it specifies an x86 target llvm-svn: 250528	2015-10-16 17:18:07 +00:00
Sanjay Patel	374dd8d88e	This is a follow-up to the discussion in D12882. Ideally, we would like SimplifyCFG to be able to form select instructions even when the operands are expensive (as defined by the TTI cost model) because that may expose further optimizations. However, we would then like a later pass like CodeGenPrepare to undo that transformation if the target would likely benefit from not speculatively executing an expensive op (this patch). Once we have this safety mechanism in place, we can adjust SimplifyCFG to restore its select-formation behavior that changed with r248439. Differential Revision: http://reviews.llvm.org/D13297 llvm-svn: 250527	2015-10-16 16:54:30 +00:00
JF Bastien	53bd975033	WebAssembly: relooper analysis pass Summary: Make the relooper an analysis pass, to convert CFG to AST. Reviewers: sunfish Subscribers: jfb, dschuff Differential Revision: http://reviews.llvm.org/D12744 llvm-svn: 250524	2015-10-16 16:35:49 +00:00
Charlie Turner	434d4599d4	[AArch64] Implement vector splitting on UADDV. Summary: Fixes PR25056. Reviewers: mcrosier, junbuml, jmolloy Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13466 llvm-svn: 250520	2015-10-16 15:38:25 +00:00
Diego Novillo	17b6f53133	Sample Profiling - Remove useless asserts. NFC. llvm-svn: 250513	2015-10-16 13:54:52 +00:00
Zlatko Buljan	4c4f21b971	Commited two test files which are forgotten during commit of patch for http://reviews.llvm.org/D13376 llvm-svn: 250512	2015-10-16 13:03:10 +00:00
Hrvoje Varga	3c88fbd367	[mips][microMIPS] Implement LB, LBE, LBU and LBUE instructions Differential Revision: http://reviews.llvm.org/D11633 llvm-svn: 250511	2015-10-16 12:24:58 +00:00
Pawel Bylica	2fa025cdcf	Fix path::home_directory() unit test. It turns out that constructing std::string from null pointer is not the very best idea. llvm-svn: 250506	2015-10-16 10:11:07 +00:00
NAKAMURA Takumi	cc275e428d	SupportTests::HomeDirectory: Don't try tests when $HOME is undefined. Lit sanitizes env vars. $HOME is not exported in Lit tests. llvm-svn: 250505	2015-10-16 09:40:01 +00:00
NAKAMURA Takumi	6d5d5bdfaf	Reformat. llvm-svn: 250504	2015-10-16 09:38:49 +00:00
Pawel Bylica	7187e4bba9	Use Windows Vista API to get the user's home directory Summary: This patch replaces usage of deprecated SHGetFolderPathW with SHGetKnownFolderPath. The usage of SHGetKnownFolderPath is wrapped to allow queries for other "known" folders in the near future. Reviewers: aaron.ballman, gbedwell Subscribers: chapuni, llvm-commits Differential Revision: http://reviews.llvm.org/D13753 llvm-svn: 250501	2015-10-16 09:08:59 +00:00
Craig Topper	09b6598572	[X86] Add fxsr feature flag for fxsave/fxrestore instructions. llvm-svn: 250497	2015-10-16 06:03:09 +00:00
Dylan McKay	b1d469c657	Initial migration of AVR backend This patch adds the underlying infrastructure for an AVR backend to be included into LLVM. It is the first of a series of patches aimed at moving the out-of-tree AVR backend into the tree. It consists of adding a new`Triple` target 'avr'. llvm-svn: 250492	2015-10-16 03:10:30 +00:00
Sanjoy Das	58fae7cf6b	[RS4GC] Dont' propagate call attrs related to patchable statepoints The `"statepoint-id"` and `"statepoint-num-patch-bytes"` attributes are used solely to determine properties of the `gc.statepoint` being created. Once the `gc.statepoint` is in place, these should be removed. llvm-svn: 250491	2015-10-16 02:41:23 +00:00
Sanjoy Das	810a59d037	[RS4GC] Bring legalizeCallAttributes up to LLVM coding style; NFC llvm-svn: 250490	2015-10-16 02:41:11 +00:00
Sanjoy Das	25ec1a3e60	[RS4GC] Use "deopt" operand bundles Summary: This is a step towards using operand bundles to carry deopt state till RewriteStatepointsForGC. The change adds a flag to RewriteStatepointsForGC that teaches it to pick up deopt state from a `"deopt"` operand bundle attached to the `call` or `invoke` it is wrapping. The command line flag added, `-rs4gc-use-deopt-bundles`, will only exist for a short while. Once we are able to pipe deopt bundle state through the full optimization pipeline without problems, we will "constant fold" `-rs4gc-use-deopt-bundles` to `true`. Reviewers: swaroop.sridhar, reames Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D13372 llvm-svn: 250489	2015-10-16 02:41:00 +00:00
Sanjoy Das	7360f30852	[IndVars] Rename getExtend; NFC Rename `IndVarSimplify::getExtend` to `IndVarSimplify::createExtendInst` to make it obvious that it creates `llvm::Instruction` s. llvm-svn: 250484	2015-10-16 01:00:50 +00:00
Sanjoy Das	37e87c2023	[IndVars] Have `cloneArithmeticIVUser` guess better Summary: `cloneArithmeticIVUser` currently trips over expression like `add %iv, -1` when `%iv` is being zero extended -- it tries to construct the widened use as `add %iv.zext, zext(-1)` and (correctly) fails to prove equivalence to `zext(add %iv, -1)` (here the SCEV for `%iv` is `{1,+,1}`). This change teaches `IndVars` to try sign extending the non-IV operand if that makes the newly constructed IV use equivalent to the widened narrow IV use. Reviewers: atrick, hfinkel, reames Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13717 llvm-svn: 250483	2015-10-16 01:00:47 +00:00
Sanjoy Das	472840a3d3	[IndVars] Extract out a few local variables; NFC llvm-svn: 250482	2015-10-16 01:00:44 +00:00
Sanjoy Das	1fd184e5a2	[IndVars] Split `WidenIV::cloneIVUser`; NFC Summary: This NFC splitting is intended to make a later diff easier to follow. It just tail duplicates `cloneIVUser` into `cloneArithmeticIVUser` and `cloneBitwiseIVUser`. Reviewers: atrick, hfinkel, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13716 llvm-svn: 250481	2015-10-16 01:00:39 +00:00

... 2 3 4 5 6 ...

122963 Commits