llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	becd656c7c	R600/SI: Remove i1 pseudo VALU ops Select i1 logical ops directly to 64-bit SALU instructions. Vector i1 values are always really in SGPRs, with each bit for each item in the wave. This saves about 4 instructions when and/or/xoring any condition, and also helps write conditions that need to be passed in vcc. This should work correctly now that the SGPR live range fixing pass works. More work is needed to eliminate the VReg_1 pseudo regclass and possibly the entire SILowerI1Copies pass. llvm-svn: 223206	2014-12-03 05:22:35 +00:00
Matt Arsenault	2f470c62cb	R600/SI: Fix suspicious indexing The loop is over the operands of an instruction, and checks the register with the sub reg index of the dest register. This probably meant to be checking the sub reg index of the same operand. llvm-svn: 223205	2014-12-03 05:22:32 +00:00
Matt Arsenault	691ae3d657	R600/SI: Fix running SILowerI1Copies a second time llvm-svn: 223204	2014-12-03 05:22:30 +00:00
Matt Arsenault	0d2832ae8d	R600/SI: Fix live range error hidden by SIFoldOperands m0 is treated as a virtual register class with a single register rather than the physical register it really is. This was updating the live range of the used virtual copy of m0 from the first ds_read instruction, and leaving the unused copy unchanged. This resulted in a "Live segment doesn't end at a valid instruction" verifier error because the erased instructions. Update the live range of the second copy (which should be dead). No test since I'm not sure how to trigger this with SIFoldOperands enabled. llvm-svn: 223203	2014-12-03 05:22:29 +00:00
Duncan P. N. Exon Smith	78116d7ace	ADT: Add SmallVector<>::emplace_back(): fixup Add missing `void` return type from `!LLVM_HAS_VARIADIC_TEMPLATES` case in r223201. llvm-svn: 223202	2014-12-03 04:49:16 +00:00
Duncan P. N. Exon Smith	f2396e6552	ADT: Add SmallVector<>::emplace_back() llvm-svn: 223201	2014-12-03 04:45:09 +00:00
Rui Ueyama	d31cf6065f	[PECOFF] Fix a bug in /export option handler. /export option can be given multiple times to specify multiple symbols to be exported. /export accepts both decorated and undecorated name. If you give both undecorated and decorated name of the same symbol to /export, they are resolved to the same symbol. In this case, we need to de-duplicate the exported names, so that we don't have duplicated items in the export symbol table in a DLL. We remove duplicate items from a vector. The bug was there. Because we had pointers pointing to elements of the vector, after an item is removed, they would point wrong elements. This patch is to remove these pointers. Added a test for that case. llvm-svn: 223200	2014-12-03 04:34:20 +00:00
Tom Stellard	1f0dded057	StructurizeCFG: Use LoopInfo analysis for better loop detection We were assuming that each back-edge in a region represented a unique loop, which is not always the case. We need to use LoopInfo to correctly determine which back-edges are loops. llvm-svn: 223199	2014-12-03 04:28:32 +00:00
Duncan P. N. Exon Smith	c280ff0e47	NVPTX: Delete dead code `MDNode` does not inherit from `User`, and it never has a name. llvm-svn: 223198	2014-12-03 04:13:23 +00:00
Tom Stellard	369308061b	R600/SI: Enable inline assembly We just needed to remove the assertion in AMDGPURegisterInfo::getFrameRegister(), which is called when initializing the parser for inline assembly. llvm-svn: 223197	2014-12-03 04:08:00 +00:00
Jason Molenda	286fd1aaac	Update setMCJITMemoryManager call to keep in line with llvm r223183. Patch from Ryan Goodfellow. llvm-svn: 223196	2014-12-03 04:02:03 +00:00
Peter Zotov	fcefcf96e3	[OCaml] [cmake] Disable OCaml bindings if ctypes >=0.3 is not found. llvm-svn: 223195	2014-12-03 03:39:01 +00:00
Matt Arsenault	fb13b22d9a	R600/SI: Change mubuf offsets to print as decimal This matches SC's behavior. llvm-svn: 223194	2014-12-03 03:12:13 +00:00
Nick Lewycky	2e8a6219fc	Emit the entry block first and the exit block second, then all the blocks in between afterwards. This is what gcc always does, and some out of tree tools depend on that. llvm-svn: 223193	2014-12-03 02:45:01 +00:00
NAKAMURA Takumi	0a64776cc0	GCRelocateOperands: Try to appease msc17. llvm-svn: 223192	2014-12-03 02:40:24 +00:00
Peter Collingbourne	6b46e381e6	Update test to check for prologue instead of prefix llvm-svn: 223191	2014-12-03 02:37:10 +00:00
Peter Collingbourne	1a0a9a3c75	UBSan now uses prologue data instead of prefix data As the semantics of prefix data has changed. See D6454. Patch by Ben Gamari! Test Plan: Testsuite Differential Revision: http://reviews.llvm.org/D6489 llvm-svn: 223190	2014-12-03 02:08:51 +00:00
Peter Collingbourne	51d2de7b9e	Prologue support Patch by Ben Gamari! This redefines the `prefix` attribute introduced previously and introduces a `prologue` attribute. There are a two primary usecases that these attributes aim to serve, 1. Function prologue sigils 2. Function hot-patching: Enable the user to insert `nop` operations at the beginning of the function which can later be safely replaced with a call to some instrumentation facility 3. Runtime metadata: Allow a compiler to insert data for use by the runtime during execution. GHC is one example of a compiler that needs this functionality for its tables-next-to-code functionality. Previously `prefix` served cases (1) and (2) quite well by allowing the user to introduce arbitrary data at the entrypoint but before the function body. Case (3), however, was poorly handled by this approach as it required that prefix data was valid executable code. Here we redefine the notion of prefix data to instead be data which occurs immediately before the function entrypoint (i.e. the symbol address). Since prefix data now occurs before the function entrypoint, there is no need for the data to be valid code. The previous notion of prefix data now goes under the name "prologue data" to emphasize its duality with the function epilogue. The intention here is to handle cases (1) and (2) with prologue data and case (3) with prefix data. References ---------- This idea arose out of discussions[1] with Reid Kleckner in response to a proposal to introduce the notion of symbol offsets to enable handling of case (3). [1] http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-May/073235.html Test Plan: testsuite Differential Revision: http://reviews.llvm.org/D6454 llvm-svn: 223189	2014-12-03 02:08:38 +00:00
NAKAMURA Takumi	4c71cc1dcb	ExceptionDemo: Let setMCJITMemoryManager() take unique_ptr, since r223183. llvm-svn: 223188	2014-12-03 02:05:51 +00:00
Ahmed Bougacha	d65f787a5f	[X86][MC] Intel syntax: accept implicit memory operand sizes larger than 80. The X86AsmParser intel handling was refactored in r216481, making it try each different memory operand size to see which one matches. Operand sizes larger than 80 ("[xyz]mmword ptr") were forgotten, which led to an "invalid operand" error for code such as: movdqa [rax], xmm0 llvm-svn: 223187	2014-12-03 02:03:26 +00:00
Nico Weber	736a993828	Add support for has_feature(cxx_alignof) and has_feature(c_alignof). r142020 added support for has_feature(cxx_alignas). This does the same for alignof. llvm-svn: 223186	2014-12-03 01:25:49 +00:00
Nico Weber	aad4af6d50	Fix incorrect codegen for devirtualized calls to virtual overloaded operators. Consider this program: struct A { virtual void operator-() { printf("base\n"); } }; struct B final : public A { virtual void operator-() override { printf("derived\n"); } }; int main() { B* b = new B; -static_cast<A&>(*b); } Before this patch, clang saw the virtual call to A::operator-(), figured out that it can be devirtualized, and then just called A::operator-() directly, without going through the vtable. Instead, it should've looked up which operator-() the call devirtualizes to and should've called that. For regular virtual member calls, clang gets all this right already. So instead of giving EmitCXXOperatorMemberCallee() all the logic that EmitCXXMemberCallExpr() already has, cut the latter function into two pieces, call the second piece EmitCXXMemberOrOperatorMemberCallExpr(), and use it also to generate code for calls to virtual member operators. This way, virtual overloaded operators automatically don't get devirtualized if they have covariant returns (like it was done for regular calls in r218602), etc. This also happens to fix (or at least improve) codegen for explicit constructor calls (`A a; a.A::A()`) in MS mode with -fsanitize-address-field-padding=1. (This adjustment for virtual operator calls seems still wrong with the MS ABI.) llvm-svn: 223185	2014-12-03 01:21:41 +00:00
Richard Smith	e8efd99b24	PR21706: -Wunsequenced was missing warnings when leaving a sequenced region that contained side effects. llvm-svn: 223184	2014-12-03 01:05:50 +00:00
Lang Hames	4a5697e659	[MCJIT] Unique-ptrify the RTDyldMemoryManager member of MCJIT. NFC. llvm-svn: 223183	2014-12-03 00:51:19 +00:00
Hal Finkel	01fa7701e6	[PowerPC] Fix readcyclecounter to be custom expanded for all 32-bit targets We need to use the custom expansion of readcyclecounter on all 32-bit targets (even those with 64-bit registers). This should fix the ppc64 buildbot. llvm-svn: 223182	2014-12-03 00:19:17 +00:00
Kostya Serebryany	c93c84e882	[asan] fix four asan tests to run in use-after-return mode llvm-svn: 223181	2014-12-03 00:08:41 +00:00
Tim Northover	4a8ac260cc	AArch64: strengthen Darwin ABI alignment assumptions A global variable without an explicit alignment specified should be assumed to be ABI-aligned according to its type, like on other platforms. This allows us to use better memory operations when accessing it. rdar://18533701 llvm-svn: 223180	2014-12-02 23:53:43 +00:00
David Majnemer	00973ce683	FullProduct should be _FullProduct llvm-svn: 223179	2014-12-02 23:44:40 +00:00
Pete Cooper	521b5a8ae6	Use a typed enum instead of 'unsigned char' for packed field. NFC. This makes it easier to debug Twine as the 'Kind' fields now show their enum values in lldb and not escaped characters. llvm-svn: 223178	2014-12-02 23:34:23 +00:00
Kaelyn Takata	999dd85e16	Ensure typos in the default values of template parameters get diagnosed. llvm-svn: 223177	2014-12-02 23:32:20 +00:00
David Majnemer	5450763dd8	Intrin: shrx_u64 should be _shrx_u64 llvm-svn: 223176	2014-12-02 23:30:26 +00:00
David Majnemer	5f9afc59f8	Intrin: Add _umul128 Implement _umul128; it provides the high and low halves of a 128-bit multiply. We can simply use our __int128 arithmetic to implement this, we generate great code for it: movq %rdx, %rax mulq %rcx movq %rdx, (%r8) retq Differential Revision: http://reviews.llvm.org/D6486 llvm-svn: 223175	2014-12-02 23:30:24 +00:00
Jason Molenda	22f58dffeb	Mark the armv7 q0-q3 and q8-q15 registers as volatile (not callee preserved) in the ABI. Realistically lldb isn't able to track register saves of any of the neon regs right now so we should probably mark all of the regs as unavailable when you're not on stack frame 0... <rdar://problem/19115127> llvm-svn: 223174	2014-12-02 23:21:05 +00:00
Justin Bogner	111c6533c2	InstrProf: Use the same names for variables as we use in the profile There's no need to use different names for the local variables than we use in the profile itself, and it's a bit simpler and easier to debug if we're consistent. llvm-svn: 223173	2014-12-02 23:15:30 +00:00
Tim Northover	ec7ebebe55	AArch64: don't be too greedy when folding :lo12: accesses into mem ops. This frequently leads to cases like: ldr xD, [xN, :lo12:var] add xA, xN, :lo12:var ldr xD, [xA, #8] where the ADD would have been needed anyway, and the two distinct addressing modes can prevent the formation of an ldp. Because of how we handle ADRP (aggressively forming an ADRP/ADD pseudo-inst at ISel time), this pattern also results in duplicated ADRP instructions (one on its own to cover the ldr, and one combined with the add). llvm-svn: 223172	2014-12-02 23:13:39 +00:00
Michael Zolotukhin	ea8327b80f	PR21302. Vectorize only bottom-tested loops. rdar://problem/18886083 llvm-svn: 223171	2014-12-02 22:59:06 +00:00
Michael Zolotukhin	540580ca06	Apply loop-rotate to several vectorizer tests. Such loops shouldn't be vectorized due to the loops form. After applying loop-rotate (+simplifycfg) the tests again start to check what they are intended to check. llvm-svn: 223170	2014-12-02 22:59:02 +00:00
Fariborz Jahanian	334becf46a	Another warning with no group name bites the dust. rdar://19116886 llvm-svn: 223168	2014-12-02 22:42:52 +00:00
Justin Bogner	7f8cf5bff7	InstrProf: Remove some pointless indirection (NFC) It doesn't make much sense to have std::unique_ptrs of std::string and std::vector. Avoid some useless indirection by using these types directly. llvm-svn: 223166	2014-12-02 22:38:52 +00:00
Simon Pilgrim	6b988ad8f2	[X86][SSE] Keep 4i32 vector insertions in integer domain on SSE4.1 targets 4i32 shuffles for single insertions into zero vectors lowers to X86vzmovl which was using (v)blendps - causing domain switch stalls. This patch fixes this by using (v)pblendw instead. The updated tests on test/CodeGen/X86/sse41.ll still contain a domain stall due to the use of insertps - I'm looking at fixing this in a future patch. Differential Revision: http://reviews.llvm.org/D6458 llvm-svn: 223165	2014-12-02 22:31:23 +00:00
Alexey Samsonov	656c29b08f	Replace InternalScopedBuffer<char> with InternalScopedString where applicable. Summary: No functionality change. Test Plan: make check-all Reviewers: kcc Reviewed By: kcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6472 llvm-svn: 223164	2014-12-02 22:20:11 +00:00
Chris Matthews	5618e73a45	Give lit a --xunit-xml-output option for saving results in xunit format --xunit-xml-output saves test results to disk in JUnit's xml format. This will allow Jenkins to report the details of a lit run. Based on a patch by David Chisnall. llvm-svn: 223163	2014-12-02 22:19:21 +00:00
Kaelyn Takata	c71dda2c38	Diagnose TypoExprs in a couple of error cases in ParsePostfixExpressionSuffix. Also have CorrectDelayedTyposInExpr check that the Expr* isn't null before trying to access its members. Fixes PR21679. llvm-svn: 223162	2014-12-02 22:05:35 +00:00
Hal Finkel	bbdee93638	[PowerPC] Implement readcyclecounter for PPC32 We've long supported readcyclecounter on PPC64, but it is easier there (the read of the 64-bit time-base register can be accomplished via a single instruction). This now provides an implementation for PPC32 as well. On PPC32, the time-base register is still 64 bits, but can only be read 32 bits at a time via two separate SPRs. The ISA manual explains how to do this properly (it involves re-reading the upper bits and looping if the counter has wrapped while being read). This requires PPC to implement a custom integer splitting legalization for the READCYCLECOUNTER node, turning it into a target-specific SDAG node, which then gets turned into a pseudo-instruction, which is then expanded to the necessary sequence (which has three SPR reads, the comparison and the branch). Thanks to Paul Hargrove for pointing out to me that this was still unimplemented. llvm-svn: 223161	2014-12-02 22:01:00 +00:00
Tom Stellard	b8fd6eff89	R600/SI: Emit amd_kernel_code_t header for AMDGPU environment llvm-svn: 223160	2014-12-02 22:00:07 +00:00
Eric Christopher	7398622eed	Make sure that the TargetOptions operator== is checking the full contents of the class. llvm-svn: 223159	2014-12-02 21:57:15 +00:00
Alexey Samsonov	3da2b06593	Add missing #include to fix Android build. llvm-svn: 223157	2014-12-02 21:40:41 +00:00
Lang Hames	a7395bf49b	[AArch64][Stackmaps] Optimize stackmap shadows on AArch64. Reduce the number of nops emitted for stackmap shadows on AArch64 by counting non-stackmap instructions up to the next branch target towards the requested shadow. <rdar://problem/14959522> llvm-svn: 223156	2014-12-02 21:36:24 +00:00
Zachary Turner	be40b2f1b2	Fix broken test suite on Windows after r223091. Differential Revision: http://reviews.llvm.org/D6484 Reviewed by: Oleksiy Vyalov llvm-svn: 223155	2014-12-02 21:32:44 +00:00
Tom Stellard	4df465bd5e	R600/SI: Move more information into SIProgramInfo struct llvm-svn: 223154	2014-12-02 21:28:53 +00:00

1 2 3 4 5 ...

187792 Commits All Branches Search

187792 Commits

All Branches