llvm-project

Commit Graph

Author	SHA1	Message	Date
Robert Khasanov	74acbb7767	[SKX] Enabling mask instructions: encoding, lowering KMOVB, KMOVW, KMOVD, KMOVQ, KNOTB, KNOTW, KNOTD, KNOTQ Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com> llvm-svn: 213757	2014-07-23 14:49:42 +00:00
Tim Northover	14ff2df05c	ARM: spot SBFX-compatbile code expressed with sign_extend_inreg We were assuming all SBFX-like operations would have the shl/asr form, but often when the field being extracted is an i8 or i16, we end up with a SIGN_EXTEND_INREG acting on a shift instead. Simple enough to check for though. llvm-svn: 213754	2014-07-23 13:59:12 +00:00
Tim Northover	7ad2a0e0c2	ARM: add patterns for [su]xta[bh] from just a shift. Although the final shifter operand is a rotate, this actually only matters for the half-word extends when the amount == 24. Otherwise folding a shift in is just as good. llvm-svn: 213753	2014-07-23 13:59:07 +00:00
James Molloy	bc9fed82cc	Enable partial libcall inlining for all targets by default. This pass attempts to speculatively use a sqrt instruction if one exists on the target, falling back to a libcall if the target instruction returned NaN. This was enabled for MIPS and System-Z, but is well guarded and is good for most targets - GCC does this for (that I've checked) X86, ARM and AArch64. llvm-svn: 213752	2014-07-23 13:33:00 +00:00
Tilmann Scheller	2727279117	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STRB instructions. The ARM ARM prohibits STRB instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STRB instructions with unpredictable behavior. llvm-svn: 213750	2014-07-23 13:03:47 +00:00
Daniel Sanders	a4d18fc606	Added release notes for MIPS. llvm-svn: 213749	2014-07-23 12:59:26 +00:00
Tim Northover	35910d7fa8	AArch64: remove "arm64_be" support in favour of "aarch64_be". There really is no arm64_be: it was a useful fiction to test big-endian support while both backends existed in parallel, but now the only platform that uses the name (iOS) doesn't have a big-endian variant, let alone one called "arm64_be". llvm-svn: 213748	2014-07-23 12:58:11 +00:00
Tilmann Scheller	3352a58ddc	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STR instructions. The ARM ARM prohibits STR instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STR instructions with unpredictable behavior. llvm-svn: 213745	2014-07-23 12:38:17 +00:00
Tim Northover	e19bed7d33	AArch64: remove arm64 triple enumerator. Having both Triple::arm64 and Triple::aarch64 is extremely confusing, and invites bugs where only one is checked. In reality, the only legitimate difference between the two (arm64 usually means iOS) is also present in the OS part of the triple and that's what should be checked. We still parse the "arm64" triple, just canonicalise it to Triple::aarch64, so there aren't any LLVM-side test changes. llvm-svn: 213743	2014-07-23 12:32:47 +00:00
Andrea Di Biagio	842355e900	Revert r211771. It was: "[X86] Improve the selection of SSE3/AVX addsub instructions". This chang fully reverts r211771. That revision added a canonicalization rule which has the potential to causes a combine-cycle in the target-independent canonicalizing DAG combine. The plan is to move the logic that forms target specific addsub nodes as part of the lowering of shuffles. llvm-svn: 213736	2014-07-23 11:20:24 +00:00
Chandler Carruth	30253f3247	[x86] Clean up a test case to use check labels and spell out the exact instruction sequences with CHECK-NEXT for these test cases. This notably exposes how absolutely horrible the generated code is for several of these test cases, and will make any future updates to the test as our vector instruction selection gets better. llvm-svn: 213732	2014-07-23 09:11:48 +00:00
Tilmann Scheller	bd0c457663	[ARM] Add regression test for the earlyclobber constraint of ARM STRB. The constraint was added in r213369. llvm-svn: 213730	2014-07-23 08:39:50 +00:00
Tilmann Scheller	c28f0d587d	[ARM] Add earlyclobber constraint to pre/post-indexed ARM STRH instructions. The post-indexed instructions were missing the constraint, causing unpredictable STRH instructions to be emitted. The earlyclobber constraint on the pre-indexed STR instructions is not strictly necessary, as the instruction selection for pre-indexed STR instructions goes through an additional layer of pseudo instructions which have the constraint defined, however it doesn't hurt to specify the constraint directly on the pre-indexed instructions as well, since at some point someone might create instances of them programmatically and then the constraint is definitely needed. llvm-svn: 213729	2014-07-23 08:12:51 +00:00
Chandler Carruth	9a0051cd59	[SDAG] Make the DAGCombine worklist not grow endlessly due to duplicate insertions. The old behavior could cause arbitrarily bad memory usage in the DAG combiner if there was heavy traffic of adding nodes already on the worklist to it. This commit switches the DAG combine worklist to work the same way as the instcombine worklist where we null-out removed entries and only add new entries to the worklist. My measurements of codegen time shows slight improvement. The memory utilization is unsurprisingly dominated by other factors (the IR and DAG itself I suspect). This change results in subtle, frustrating churn in the particular order in which DAG combines are applied which causes a number of minor regressions where we fail to match a pattern previously matched by accident. AFAICT, all of these should be using AddToWorklist to directly or should be written in a less brittle way. None of the changes seem drastically bad, and a few of the changes seem distinctly better. A major change required to make this work is to significantly harden the way in which the DAG combiner handle nodes which become dead (zero-uses). Previously, we relied on the ability to "priority-bump" them on the combine worklist to achieve recursive deletion of these nodes and ensure that the frontier of remaining live nodes all were added to the worklist. Instead, I've introduced a routine to just implement that precise logic with no indirection. It is a significantly simpler operation than that of the combiner worklist proper. I suspect this will also fix some other problems with the combiner. I think the x86 changes are really minor and uninteresting, but the avx512 change at least is hiding a "regression" (despite the test case being just noise, not testing some performance invariant) that might be looked into. Not sure if any of the others impact specific "important" code paths, but they didn't look terribly interesting to me, or the changes were really minor. The consensus in review is to fix any regressions that show up after the fact here. Thanks to the other reviewers for checking the output on other architectures. There is a specific regression on ARM that Tim already has a fix prepped to commit. Differential Revision: http://reviews.llvm.org/D4616 llvm-svn: 213727	2014-07-23 07:08:53 +00:00
Nick Lewycky	aba900c252	We may visit a call that uses an alloca multiple times in callUsesLocalStack, sometimes with IsNocapture true and sometimes with IsNocapture false. We accidentally skipped work we needed to do in the IsNocapture=false case if we were called with IsNocapture=true the first time. Fixes PR20405! llvm-svn: 213726	2014-07-23 06:24:49 +00:00
NAKAMURA Takumi	0a6af4391b	Rework to let RuntimeDyld/X86/MachO_x86-64_PIC_relocations.s pass on win32. FIXME: "llvm-rtdyld -verify -check" is still sensitive to path separator. Fix searching StubMap to be tolerant of both '/' and '\\' on Win32. llvm-svn: 213723	2014-07-23 04:32:21 +00:00
NAKAMURA Takumi	24c0b46e18	Suppress a test on win32 for now, llvm/test/ExecutionEngine/RuntimeDyld/X86/MachO_x86-64_PIC_relocations.s. FIXME: Fix searching StubMap with '/' and '\\' on Win32. llvm-svn: 213721	2014-07-23 04:05:58 +00:00
NAKAMURA Takumi	7d79387981	RuntimeDyld/X86/MachO_x86-64_PIC_relocations.s: Use %/T here, or sed(1) would be confused with dos path. llvm-svn: 213720	2014-07-23 04:05:46 +00:00
NAKAMURA Takumi	16d99f93a8	Trailing whitespace. llvm-svn: 213711	2014-07-23 00:42:52 +00:00
NAKAMURA Takumi	ea4a8dae83	RuntimeDyldMachOAArch64.h: Fix a warning. [-Wunused-variable] llvm-svn: 213710	2014-07-23 00:17:44 +00:00
Lang Hames	17e6b9e5ab	[MCJIT] Make stub_addr functionality in RuntimeDyldChecker work in release mode. There's no reason to restrict this particular piece of RuntimeDyldChecker functionality to +Asserts builds. This should fix failures in MachO_x86-64_PIC_relocations.s on release bots. llvm-svn: 213708	2014-07-22 23:50:51 +00:00
Lang Hames	c90a85ff99	[MCJIT] Teach RuntimeDyldChecker to handle underscores at the start of symbols. RuntimeDyldChecker had been testing isalpha(Expr[0]) to recognise symbol tokens, and throwing unrecognized token errors when it hit symbols with leading underscores. This fixes that. llvm-svn: 213706	2014-07-22 23:17:21 +00:00
Juergen Ributzka	7325ac39b6	XFAIL the test on MIPS Not sure how to debug this one without a MIPS machine. Any takers? llvm-svn: 213705	2014-07-22 23:15:01 +00:00
Juergen Ributzka	2581fa505f	[FastIsel][AArch64] Add support for the FastLowerCall and FastLowerIntrinsicCall target-hooks. This commit modifies the existing call lowering functions to be used as the FastLowerCall and FastLowerIntrinsicCall target-hooks instead. This enables patchpoint intrinsic lowering for AArch64. This fixes <rdar://problem/17733076> llvm-svn: 213704	2014-07-22 23:14:58 +00:00
Juergen Ributzka	88f6faf1f4	[AArch64] Use CHECK-LABEL in ARM64 ABI unit tests. llvm-svn: 213703	2014-07-22 23:14:54 +00:00
Lang Hames	cce313b082	[MCJIT] Improve stub_addr file-not-found diagnostic to help track down a buildbot failure. llvm-svn: 213701	2014-07-22 23:07:52 +00:00
Lang Hames	f7acddde5b	[MCJIT] Refactor and add stub inspection to the RuntimeDyldChecker framework. This patch introduces a 'stub_addr' builtin that can be used to find the address of the stub for a given (<file>, <section>, <symbol>) tuple. This address can be used both to verify the contents of stubs (by loading from the returned address) and to verify references to stubs (by comparing against the returned address). Example (1) - Verifying stub contents: Load 8 bytes (assuming a 64-bit target) from the stub for 'x' in the __text section of f.o, and compare that value against the addres of 'x'. # rtdyld-check: *{8}(stub_addr(f.o, __text, x) = x Example (2) - Verifying references to stubs: Decode the immediate of the instruction at label 'l', and verify that it's equal to the offset from the next instruction's PC to the stub for 'y' in the __text section of f.o (i.e. it's the correct PC-rel difference). # rtdyld-check: decode_operand(l, 4) = stub_addr(f.o, __text, y) - next_pc(l) l: movq y@GOTPCREL(%rip), %rax Since stub inspection requires cooperation with RuntimeDyldImpl this patch pimpl-ifies RuntimeDyldChecker. Its implementation is moved in to a new class, RuntimeDyldCheckerImpl, that has access to the definition of RuntimeDyldImpl. llvm-svn: 213698	2014-07-22 22:47:39 +00:00
Juergen Ributzka	0e957cf714	Appease the buildbots. llvm-svn: 213694	2014-07-22 22:02:19 +00:00
Juergen Ributzka	f560928889	[RuntimeDyld][MachO][AArch64] Add a helper function for encoding addends in instructions. Factor out the addend encoding into a helper function and simplify the processRelocationRef. Also add a few simple rtdyld tests. More tests to come once GOTs can be tested too. Related to <rdar://problem/17768539> llvm-svn: 213689	2014-07-22 21:42:55 +00:00
Juergen Ributzka	b13b52efe0	[RuntimeDyld][MachO][AArch64] Implement the decodeAddend method. This adds the required functionality to decode the immediate encoded in an instruction that is referenced in a relocation entry. llvm-svn: 213688	2014-07-22 21:42:51 +00:00
Juergen Ributzka	dd19d33057	[RuntimeDyld][MachO][AArch64] Add assertion to check for duplicate addend definition. In MachO for AArch64 it is possible to have an explicit addend defined by the ARM64_RELOC_ADDEND relocation or having an addend encoded within the instruction. Only one of them are allowed per relocation. llvm-svn: 213687	2014-07-22 21:42:49 +00:00
Juergen Ributzka	175b78b02e	[RuntimeDyld] Change the return type of decodeAddend to match the storage type. llvm-svn: 213686	2014-07-22 21:42:46 +00:00
Suyog Sarda	3a8c2c1e6c	This patch implements optimization as mentioned in PR19753: Optimize comparisons with "ashr/lshr exact" of a constanst. It handles the errors which were seen in PR19958 where wrong code was being emitted due to earlier patch. Added code for lshr as well as non-exact right shifts. It implements : (icmp eq/ne (ashr/lshr const2, A), const1)" -> (icmp eq/ne A, Log2(const2/const1)) -> (icmp eq/ne A, Log2(const2) - Log2(const1)) Differential Revision: http://reviews.llvm.org/D4068 llvm-svn: 213678	2014-07-22 19:19:36 +00:00
Suyog Sarda	b60ec909ca	Added InstCombine transform for pattern "(A & B) ^ (A ^ B) -> (A \| B)" Patch idea by Ankit Jain ! Differential Revision: http://reviews.llvm.org/D4618 llvm-svn: 213677	2014-07-22 18:30:54 +00:00
Suyog Sarda	d64faf6cae	Added InstCombine Transform for patterns: "((~A & B) \| A) -> (A \| B)" and "((A & B) \| ~A) -> (~A \| B)" Original Patch credit to Ankit Jain !! Differential Revision: http://reviews.llvm.org/D4591 llvm-svn: 213676	2014-07-22 18:09:41 +00:00
Dan Liew	54891de4cb	Revert "Treat warnings in Sphinx as errors. The reasons for doing this are..." This reverts commit r213661. Reverting at the request of Sean Silva. llvm-svn: 213675	2014-07-22 18:09:17 +00:00
Dan Liew	3982813700	Add LLVM_TOOLS_BINARY_DIR variable to LLVMConfig.cmake so clients of LLVM using CMake can easily find the tools directory. LLVM_BUILD_TOOLS_BINARY_DIR was removed because it is now superfluous. llvm-svn: 213674	2014-07-22 17:48:51 +00:00
Alexey Samsonov	bad4d0c38a	[ASan] Fix comments about __sanitizer_cov function llvm-svn: 213673	2014-07-22 17:46:09 +00:00
Hal Finkel	ccc7090671	Make use of the align parameter attribute for all pointer arguments We previously supported the align attribute on all (pointer) parameters, but we only used it for byval parameters. However, it is completely consistent at the IR level to treat 'align n' on all pointer parameters as an alignment assumption on the pointer, and now we wll. Specifically, this causes computeKnownBits to use the align attribute on all pointer parameters, not just byval parameters. I've also added an explicit parameter attribute test for this to test/Bitcode/attributes.ll. And I've updated the LangRef to document the align parameter attribute (as it turns out, it was not documented at all previously, although the byval documentation mentioned that it could be used). There are (at least) two benefits to doing this: - It allows enhancing alignment based on the pointer alignment after inlining callees. - It allows simplification of pointer arithmetic. llvm-svn: 213670	2014-07-22 16:58:55 +00:00
Tim Northover	0942e39061	X86: drop relocations on __eh_frame sections globally. Without this, we produce non-extern relocations when targeting older OS X versions that ld64 can't cope with in the particular context of __eh_frame sections (who'd want generic relocation-processing anyway?). This means that an updated linker (ld64 from Xcode 3.2.6 or later) may be needed when targeting such platforms with a modern version of LLVM, but this is probably the case anyway and a reasonable requirement. PR20212, rdar://problem/17544795 llvm-svn: 213665	2014-07-22 15:47:09 +00:00
Dan Liew	066f50a251	Export LLVM_ENABLE_RTTI and LLVM_ENABLE_EH in LLVMConfig.cmake so clients of LLVM know if RTTI and/or EH were enabled in the build of LLVM they are trying to link against. llvm-svn: 213664	2014-07-22 15:41:33 +00:00
Dan Liew	a5bdc846aa	Added LLVM_ENABLE_RTTI and LLVM_ENABLE_EH options that allow RTTI and EH to globally be controlled. Individual targets (e.g. ExceptionDemo) can still override this by using LLVM_REQUIRE_RTTI and LLVM_REQUIRE_EH if they need to be compiled with RTTI or exception handling respectively. llvm-svn: 213663	2014-07-22 15:41:18 +00:00
Suyog Sarda	521237cad6	This patch implements transform for pattern "(A \| B) ^ (~A) -> (A \| ~B)". Patch Credit to Ankit Jain !! Differential Revision: http://reviews.llvm.org/D4588 llvm-svn: 213662	2014-07-22 15:37:39 +00:00
Dan Liew	0d38c3a726	Treat warnings in Sphinx as errors. The reasons for doing this are... - When CMake builds the documentation with sphinx-build it treats warnings as errors. We should be consistent with what we do in CMake. - Having warnings treated as errors will hopefully encourage developers to write documentation correctly. llvm-svn: 213661	2014-07-22 15:07:35 +00:00
Dan Liew	9a1829d3f1	Fix Sphinx warning. llvm-svn: 213660	2014-07-22 14:59:38 +00:00
Peter Zotov	daacd61443	[OCaml] Don't truncate constants over 32 bits in Llvm.const_int. llvm-svn: 213655	2014-07-22 13:55:20 +00:00
Sasa Stankovic	319f0ff3b7	[mips] Fix two patterns that select i32's (for MIPS32r6) / i64's (for MIPS64r6) from setne comparison with an i32. The patterns that are fixed: * (select (i32 (setne i32, immZExt16)), i32, i32) (for MIPS32r6) * (select (i32 (setne i32, immZExt16)), i64, i64) (for MIPS64r6) llvm-svn: 213653	2014-07-22 13:36:02 +00:00
Elena Demikhovsky	f164859efc	AVX-512: Fixed intrinsic of VSQRTPS/PD instructions. I set number and types of parameters according to GCC intrinsics. llvm-svn: 213640	2014-07-22 11:07:31 +00:00
Sanjay Patel	fc3d8f0a78	fixed typo in comment llvm-svn: 213614	2014-07-22 04:57:06 +00:00
Chandler Carruth	41b20e7783	[SDAG] Refactor the code for inserting a newly allocated SDNode into the DAG into a helper function. This adds a trip through the (very minimal) verification logic in a bunch of places that were missing it, but shouldn't have any other impact outside of refactoring. I'm hoping to use this to do more clever things when DAG nodes are inserted into the graph. llvm-svn: 213612	2014-07-22 04:07:55 +00:00
Chandler Carruth	2fc9a2b8eb	[SDAG] Remove a giant pile of asserts that may have helped track down a bug in 2010 when they were added but are adding no value today. In fact, they are utter lies. NodeAllocator is used to allocate almost all of these node types. I don't know what we were trying to assert here, and the docs don't give any answer. Until we once again stumble upon a bug needing help, let's clear the path for improvements. llvm-svn: 213610	2014-07-22 04:03:22 +00:00
Bill Wendling	47bdc929e3	Add openmp to the list of tagged things. llvm-svn: 213608	2014-07-22 03:17:30 +00:00
Richard Smith	a0cc1654ce	Revert of r213521. This change introduced a non-hermetic test (depending on a file not in the test/ area). Backing out now so that this test isn't part of the 3.5 branch. Original commit message: "TableGen: Allow AddedComplexity values to be negative [...]" llvm-svn: 213596	2014-07-22 02:32:12 +00:00
Mark Heffernan	9d20e42765	Rename metadata llvm.loop.vectorize.unroll to llvm.loop.vectorize.interleave. llvm-svn: 213588	2014-07-21 23:11:03 +00:00
Eli Bendersky	24d8aacd94	Add some tests for NVPTX lowering of cmpxchg llvm-svn: 213586	2014-07-21 22:54:44 +00:00
Hal Finkel	d32803b669	Match semantics of PointerMayBeCapturedBefore to its name by default As it turns out, the capture tracker named CaptureBefore used by AA, and now available via the PointerMayBeCapturedBefore function, would have been more-aptly named CapturedBeforeOrAt, because it considers captures at the instruction provided. This is not always what one wants, and it is difficult to get the strictly-before behavior given only the current interface. This adds an additional parameter which controls whether or not you want to include captures at the provided instruction. The default is not to include the instruction provided, so that 'Before' matches its name. No functionality change intended. llvm-svn: 213582	2014-07-21 21:30:22 +00:00
David Blaikie	26f2268cc5	Revert "Recommit r212203: Don't try to construct debug LexicalScopes hierarchy for functions that do not have top level debug information." This reverts commit r212649 while I investigate/reduce/etc PR20367. llvm-svn: 213581	2014-07-21 20:45:59 +00:00
Tom Stellard	9c4c3c58cb	test-release.sh: Add support for dot releases llvm-svn: 213580	2014-07-21 20:20:08 +00:00
Saleem Abdulrasool	913666f9bc	R600: silence GCC warning GCC believes it may be possible to not return a value from the switch: lib/Target/R600/SIRegisterInfo.cpp:187:1: warning: control reaches end of non-void function [-Wreturn-type] Add an unreachable label to indicate that this is not possible and still permit switch coverage checking. llvm-svn: 213572	2014-07-21 17:52:00 +00:00
Tom Stellard	bda32c9e47	R600/SI: Refactor VOP3 instruction definitions llvm-svn: 213571	2014-07-21 17:44:29 +00:00
Tom Stellard	e5a1cdab47	R600/SI: Separate encoding and operand definitions into their own classes llvm-svn: 213570	2014-07-21 17:44:28 +00:00
Logan Chien	63bee2a2bb	Replace the result usages while legalizing cmpxchg. We should update the usages to all of the results; otherwise, we might get assertion failure or SEGV during the type legalization of ATOMIC_CMP_SWAP_WITH_SUCCESS with two or more illegal types. For example, in the following sequence, both i8 and i1 might be illegal in some target, e.g. armv5, mipsel, mips64el, %0 = cmpxchg i8* %ptr, i8 %desire, i8 %new monotonic monotonic %1 = extractvalue { i8, i1 } %0, 1 Since both i8 and i1 should be legalized, the corresponding ATOMIC_CMP_SWAP_WITH_SUCCESS dag will be checked/replaced/updated twice. If we don't update the usage to ALL of the results in the first round, the DAG for extractvalue might be processed earlier. The GetPromotedInteger() will result in assertion failure, because its operand (i.e. the success bit of cmpxchg) is not promoted beforehand. llvm-svn: 213569	2014-07-21 17:33:44 +00:00
Tom Stellard	f757b5ddc2	R600/SI: Initailize encoding fields of unused VOP3 modifiers to 0 llvm-svn: 213564	2014-07-21 17:12:40 +00:00
Tom Stellard	ca000c6c7b	R600/SI: Initialize unused VOP3 sources to 0 instead of SIOperand.ZERO llvm-svn: 213563	2014-07-21 17:12:37 +00:00
Duncan P. N. Exon Smith	6c99015fe2	Revert "[C++11] Add predecessors(BasicBlock ) / successors(BasicBlock ) iterator ranges." This reverts commit r213474 (and r213475), which causes a miscompile on a stage2 LTO build. I'll reply on the list in a moment. llvm-svn: 213562	2014-07-21 17:06:51 +00:00
Tom Stellard	1aaad6970c	R600/SI: Add instruction shrinking pass This pass converts 64-bit instructions to 32-bit when possible. llvm-svn: 213561	2014-07-21 16:55:33 +00:00
Dan Liew	a762a137b7	Fix Sphinx warnings. llvm-svn: 213559	2014-07-21 16:39:00 +00:00
Tom Stellard	63797d4a23	R600/SI: VOPC instructions explicitly define VCC Therefore we don't need to add it to the implict defs list. llvm-svn: 213558	2014-07-21 16:27:24 +00:00
David Blaikie	4b9ae52ac1	Correct the ownership passing semantics of object::createBinary and make them explicit in the type system. createBinary documented that it destroyed the parameter in error cases, though by observation it does not. By passing the unique_ptr by value rather than lvalue reference, callers are now explicit about passing ownership and the function implements the documented contract. Remove the explicit documentation, since now the behavior cannot be anything other than what was documented, so it's redundant. Also drops a unique_ptr::release in llvm-nm that was always run on a null unique_ptr anyway. llvm-svn: 213557	2014-07-21 16:26:24 +00:00
David Blaikie	dc01ca1896	Remove unnecessary use of unique_ptr::release() used to construct another unique_ptr. llvm-svn: 213556	2014-07-21 16:23:21 +00:00
David Blaikie	370a67a56c	Remove unused variable. llvm-svn: 213554	2014-07-21 16:13:24 +00:00
Tom Stellard	e812f2fdd8	R600/SI: Clean up some of the unused REGISTER_{LOAD,STORE} code There are a few more cleanups to do, but I ran into some problems with ext loads and trunc stores, when I tried to change some of the vector loads and stores from custom to legal, so I wasn't able to get rid of everything. llvm-svn: 213552	2014-07-21 15:45:06 +00:00
Tom Stellard	b02094e115	R600/SI: Use scratch memory for large private arrays llvm-svn: 213551	2014-07-21 15:45:01 +00:00
Tom Stellard	42639a57de	R600/SI: Specify wavefront size for SI and CI llvm-svn: 213550	2014-07-21 15:44:58 +00:00
Tom Stellard	8e44d948b6	R600/SI: Remove vaddr operand from BUFFER_LOAD_*_OFFSET instructions This operand is never used. llvm-svn: 213549	2014-07-21 15:44:55 +00:00
Daniel Sanders	e22244b733	[mips] Do not emit '.module fp=...' unless we really need to. We now emit this value when we need to contradict the default value. This restores support for binutils 2.24. When a suitable binutils has been released we can resume unconditionally emitting .module directives. This is preferable to omitting the .module directives since the .module directives protect against, for example, accidentally assembling FP32 code with -mfp64 and producing an unusuable object. llvm-svn: 213548	2014-07-21 15:25:24 +00:00
Robert Khasanov	bfa0131365	[SKX] Enabling SKX target and AVX512BW, AVX512DQ, AVX512VL features. Enabling HasAVX512{DQ,BW,VL} predicates. Adding VK2, VK4, VK32, VK64 masked register classes. Adding new types (v64i8, v32i16) to VR512. Extending calling conventions for new types (v64i8, v32i16) Patch by Zinovy Nis <zinovy.y.nis@intel.com> Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com> llvm-svn: 213545	2014-07-21 14:54:21 +00:00
Tom Stellard	32411403b2	docs: Update relaease documents to include the patch number in the RELEASE tags This will make it easier to update the release scripts to support bug-fix releases. llvm-svn: 213544	2014-07-21 14:28:31 +00:00
Dan Liew	12902a0ed8	Export LLVM_ENABLE_ASSERTIONS in LLVMConfig.cmake so clients know if the version of LLVM they are trying to use was built with or without assertions. llvm-svn: 213532	2014-07-21 14:17:15 +00:00
Tom Stellard	067c81567b	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves use the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530	2014-07-21 14:01:14 +00:00
Tom Stellard	b2114caf62	R600/SI: Add isCFDepth0 Predicate to SALU addc pattern llvm-svn: 213529	2014-07-21 14:01:12 +00:00
Tom Stellard	54a3b65bb9	R600/SI: Use VALU for i1 XOR llvm-svn: 213528	2014-07-21 14:01:10 +00:00
Tom Stellard	01825afad7	R600/SI: Use a custom encoding method for simm16 in SOPP branch instructions This allows us to explicitly define the type of fixup that is needed, so we can distinguish this from future fixup types. llvm-svn: 213527	2014-07-21 14:01:08 +00:00
Tom Stellard	e08fe68bdd	R600/SI: Rename SOPP operands to match the encoding fields llvm-svn: 213526	2014-07-21 14:01:05 +00:00
Daniel Sanders	68c3747efb	[mips] Add MipsOptionRecord abstraction and use it to implement .reginfo/.MIPS.options This abstraction allows us to support the various records that can be placed in the .MIPS.options section in the future. We currently use it to record register usage information (the ODK_REGINFO record in our ELF64 spec). Each .MIPS.options record should subclass MipsOptionRecord and provide an implementation of EmitMipsOptionRecord. Patch by Matheus Almeida and Toma Tabacu llvm-svn: 213522	2014-07-21 13:30:55 +00:00
Tom Stellard	edf1570d4e	TableGen: Allow AddedComplexity values to be negative This is useful for cases when stand-alone patterns are preferred to the patterns included in the instruction definitions. Instead of requiring that stand-alone patterns set a larger AddedComplexity value, which can be confusing to new developers, the allows us to reduce the complexity of the included patterns to achieve the same result. llvm-svn: 213521	2014-07-21 13:28:54 +00:00
Hal Finkel	b035621720	Move the CapturesBefore tracker from AA into CaptureTracking There were two generally-useful CaptureTracker classes defined in LLVM: the simple tracker defined in CaptureTracking (and made available via the PointerMayBeCaptured utility function), and the CapturesBefore tracker available only inside of AA. This change moves the CapturesBefore tracker into CaptureTracking, generalizes it slightly (by adding a ReturnCaptures parameter), and makes it generally available via a PointerMayBeCapturedBefore utility function. This logic will be needed, for example, to perform noalias function parameter attribute inference. No functionality change intended. llvm-svn: 213519	2014-07-21 13:15:48 +00:00
Aaron Ballman	659b96670c	This declaration has no definition, which is causing MSVC to emit several "no suitable definition provided for explicit template instantiation request" C4661 warnings. llvm-svn: 213517	2014-07-21 13:08:08 +00:00
Aaron Ballman	6c078a5960	Fixing an MSVC conversion warning about implicitly converting the shift results to 64-bits. No functional change intended. llvm-svn: 213515	2014-07-21 12:31:43 +00:00
Hal Finkel	c782aa5a9b	Move isIdentifiedFunctionLocal from BasicAA to AA The ability to identify function locals will exist outside of BasicAA (for example, logic for inferring noalias function arguments will need this), so make this concept generally accessible without code duplication. No functionality change. llvm-svn: 213514	2014-07-21 12:27:23 +00:00
Daniel Sanders	decb7a2b0b	[mips] Try to fix the test/ExecutionEngine tests on a MIPS host. Fix a dangerous default case that caused MipsCodeEmitter to discard pseudo instructions it didn't recognize. It will now call llvm_unreachable() for unrecognized pseudo's and explicitly handles PseudoReturn, PseudoReturn64, PseudoIndirectBranch, PseudoIndirectBranch64, CFI_INSTRUCTION, IMPLICIT_DEF, and KILL. There may be other pseudos that need handling but this was enough for the ExecutionEngine tests to pass on my test system. llvm-svn: 213513	2014-07-21 12:25:34 +00:00
Daniel Sanders	d7c2796045	[mips] Do not emit '.module [no]oddspreg' unless we really need to. We now emit this directive when we need to contradict the default value (e.g. -mno-odd-spreg is given) or an option changed the default value (e.g. -mfpxx is given). This restores support for the currently available head of binutils. However, at this point binutils 2.24 is still not sufficient since it does not support '.module fp=...'. llvm-svn: 213511	2014-07-21 10:45:47 +00:00
Chandler Carruth	efd14a62a3	FileCheck-ize a test. llvm-svn: 213508	2014-07-21 09:23:21 +00:00
Tim Northover	f7a02c1762	CodeGen: emit IR-level f16 conversion intrinsics as fptrunc/fpext This makes the first stage DAG for @llvm.convert.to.fp16 an fptrunc, and correspondingly @llvm.convert.from.fp16 an fpext. The legalisation path is now uniform, regardless of the input IR: fptrunc -> FP_TO_FP16 (if f16 illegal) -> libcall fpext -> FP16_TO_FP (if f16 illegal) -> libcall Each target should be able to select the version that best matches its operations and not be required to duplicate patterns for both fptrunc and FP_TO_FP16 (for example). As a result we can remove some redundant AArch64 patterns. llvm-svn: 213507	2014-07-21 09:13:56 +00:00
Chandler Carruth	3c0012beb6	[SDAG,cleanup] Switch the DAG combiner over to use the spelling 'Worklist' consistently rather than a deeply confusing mixture of 'WorkList' and 'Worklist'. Notably, the very 'WorkList' of the DAG combiner was exposed to target specific DAG combines under an interface 'AddToWorklist' which was implemented by in turn calling 'AddToWorkList' in the combiner. This has sent me circling with the wrong case in grep one too many times. I chose to normalize on 'Worklist' because that one won the grep-vote for llvm/lib/... by a hundered hits or so, and it is used in places relatively "canonical" such as InstCombine's Worklist. Let's all jsut pick this casing, whether "correct", "good", or "bad" and be consistent... llvm-svn: 213506	2014-07-21 08:56:44 +00:00
Chandler Carruth	24ceb0ce66	[SDAG] Rather than using a narrow test against the one dummy node on the stack, filter all handle nodes from the DAG combiner worklist. This will also handle cases where other handle nodes might be (erroneously) added to the worklist and then cause bugs and explosions when deleted. For example, when running the legalizer within the DAG combiner, there are times when other handle nodes are used and can end up here. llvm-svn: 213505	2014-07-21 08:32:31 +00:00
Andrea Di Biagio	0fb2013192	[DAGCombiner] Improve the shuffle-vector folding logic. Canonicalize shuffles according to rules: * shuffle(A, shuffle(A, B)) -> shuffle(shuffle(A,B), A) * shuffle(B, shuffle(A, B)) -> shuffle(shuffle(A,B), B) * shuffle(B, shuffle(A, Undef)) -> shuffle(shuffle(A, Undef), B) This patch helps identifying more shuffle pairs that could be combined reusing the already existing rules in the DAGCombiner. Added new test 'combine-vec-shuffle-5.ll' to verify that the canonicalized shuffles are now folded into a single shuffle node by the DAGCombiner. Added more test cases to 'combine-vec-shuffle-4.ll'. llvm-svn: 213504	2014-07-21 07:30:54 +00:00
Andrea Di Biagio	4d8bd41600	[DAG] Refactor some logic. No functional change. This patch removes function 'CommuteVectorShuffle' from X86ISelLowering.cpp and moves its logic into SelectionDAG.cpp as method 'getCommutedVectorShuffles'. This refactoring is in preperation of an upcoming change to the DAGCombiner. llvm-svn: 213503	2014-07-21 07:28:51 +00:00
Gerolf Hoflehner	ae1ec299df	Fix for regression: [Bug 20369] wrong code at -O3 on x86_64-linux-gnu in 64-bit mode Prevents hoisting of loads above stores and sinking of stores below loads in MergedLoadStoreMotion.cpp (rdar://15991737) llvm-svn: 213497	2014-07-21 03:02:46 +00:00
Ulrich Weigand	85d5df25de	[PowerPC] ELFv2 aggregate passing support This patch adds infrastructure support for passing array types directly. These can be used by the front-end to pass aggregate types (coerced to an appropriate array type). The details of the array type being used inform the back-end about ABI-relevant properties. Specifically, the array element type encodes: - whether the parameter should be passed in FPRs, VRs, or just GPRs/stack slots (for float / vector / integer element types, respectively) - what the alignment requirements of the parameter are when passed in GPRs/stack slots (8 for float / 16 for vector / the element type size for integer element types) -- this corresponds to the "byval align" field Using the infrastructure provided by this patch, a companion patch to clang will enable two features: - In the ELFv2 ABI, pass (and return) "homogeneous" floating-point or vector aggregates in FPRs and VRs (this is similar to the ARM homogeneous aggregate ABI) - As an optimization for both ELFv1 and ELFv2 ABIs, pass aggregates that fit fully in registers without using the "byval" mechanism The patch uses the functionArgumentNeedsConsecutiveRegisters callback to encode that special treatment is required for all directly-passed array types. The isInConsecutiveRegs / isInConsecutiveRegsLast bits set as a results are then used to implement the required size and alignment rules in CalculateStackSlotSize / CalculateStackSlotAlignment etc. As a related change, the ABI routines have to be modified to support passing floating-point types in GPRs. This is necessary because with homogeneous aggregates of 4-byte float type we can now run out of FPRs before we run out of the 64-byte argument save area that is shadowed by GPRs. Any extra floating-point arguments that no longer fit in FPRs must now be passed in GPRs until we run out of those too. Note that there was already code to pass floating-point arguments in GPRs used with vararg parameters, which was done by writing the argument out to the argument save area first and then reloading into GPRs. The patch re-implements this, however, in favor of code packing float arguments directly via extension/truncation, BITCAST, and BUILD_PAIR operations. This is required to support the ELFv2 ABI, since we cannot unconditionally write to the argument save area (which the caller might not have allocated). The change does, however, affect ELFv1 varags routines too; but even here the overall effect should be advantageous: Instead of loading the argument into the FPR, then storing the argument to the stack slot, and finally reloading the argument from the stack slot into a GPR, the new code now just loads the argument into the FPR, and subsequently loads the argument into the GPR (via BITCAST). That BITCAST might imply a save/reload from a stack temporary (in which case we're no worse than before); but it might be implemented more efficiently in some cases. The final part of the patch enables up to 8 FPRs and VRs for argument return in PPCCallingConv.td; this is required to support returning ELFv2 homogeneous aggregates. (Note that this doesn't affect other ABIs since LLVM wil only look for which register to use if the parameter is marked as "direct" return anyway.) Reviewed by Hal Finkel. llvm-svn: 213493	2014-07-21 00:13:26 +00:00

1 2 3 4 5 ...

105858 Commits