llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	75b9c7df4d	[test] Correct spelling in test case and explicitly enable invariant load hoisting for this test case. llvm-svn: 278668	2016-08-15 13:58:04 +00:00
Tobias Grosser	6e6264c142	[tests] Force invariant load hoisting for test cases that need it This will make it easier to switch the default of Polly's invariant load hoisting strategy and also makes it very clear that these test cases indeed require invariant code hoisting to work. llvm-svn: 278667	2016-08-15 13:27:49 +00:00
Roman Gareev	1c892e91e3	Perform replacement of access relations and creation of new arrays according to the packing transformation This is the third patch to apply the BLIS matmul optimization pattern on matmul kernels (http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf). BLIS implements gemm as three nested loops around a macro-kernel, plus two packing routines. The macro-kernel is implemented in terms of two additional loops around a micro-kernel. The micro-kernel is a loop around a rank-1 (i.e., outer product) update. In this change we perform replacement of the access relations and create empty arrays, which are steps to implement the packing transformation. In subsequent changes we will implement copying to created arrays. Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: http://reviews.llvm.org/D22187 llvm-svn: 278666	2016-08-15 12:22:54 +00:00
Valery Pykhtin	c761675ef4	[AMDGPU] fix failure on printing of non-existing instruction operands. Differential revision: https://reviews.llvm.org/D23323 llvm-svn: 278665	2016-08-15 10:56:48 +00:00
Pavel Labath	8749089c8c	Fix a race in Broadcaster/Listener interaction Summary: The following problem was occurring: - broadcaster B had two listeners: L1 and L2 (thread T1) - (T1) B has started to broadcast an event, it has locked a shared_ptr to L1 (in ListenerIterator()) - on another thread T2 the penultimate reference to L1 was destroyed (the transient object in B is now the last reference) - (T2) the last reference to L2 was destroyed as well - (T1) B has finished broadcasting the event to L1 and destroyed the last shared_ptr - (T1) this triggered the destructor, which called into B->RemoveListener() - (T1) all pointers in the m_listeners list were now stale, so RemoveListener emptied the list - (T1) Eventually control returned to the ListenerIterator() for doing broadcasting, which was still in the middle of iterating through the list - (T1) Only now, it was holding onto a dangling iterator. BOOM. I fix this issue by making sure nothing can interfere with the iterate-and-remove-expired-pointers loop, by moving this logic into a single function, which first locks (or clears) the whole list and then returns the list of valid and locked Listeners for further processing. Instead of std::list I use an llvm::SmallVector which should hopefully offset the fact that we create a copy of the list for the common case where we have only a few listeners (no heap allocations). A slight difference in behaviour is that now RemoveListener does not remove an element from the list -- it only sets its mask to 0, which means it will be removed during the next iteration of GetListeners(). This is purely an implementation detail and it should not be externally noticeable. I was not able to reproduce this bug reliably without inserting sleep statements into the code, so I do not add a test for it. Instead, I add some unit tests for the functions that I do modify. Reviewers: clayborg, jingham Subscribers: tberghammer, lldb-commits Differential Revision: https://reviews.llvm.org/D23406 llvm-svn: 278664	2016-08-15 09:53:08 +00:00
Eugene Leviant	b6f1bb13ae	[ELF] Linkerscript: fix bug in assignOffsets (check Sym for non-null) llvm-svn: 278663	2016-08-15 09:19:51 +00:00
Pavel Labath	b3d1290c44	Fixup r278524 for non-apple targets The commit started passing a nullptr port into GDBRemoteCommunication::StartDebugserverProcess. The function was mostly handling the null value correctly, but in one case it did not check its value before assigning to it. Fix that. llvm-svn: 278662	2016-08-15 09:17:13 +00:00
Sjoerd Meijer	58156715b4	MachineLoop: add methods findLoopControlBlock and findLoopPreheader This adds two new utility functions findLoopControlBlock and findLoopPreheader to MachineLoop and MachineLoopInfo. These functions are refactored and taken from the Hexagon target as they are target independent; thus this is intended to be a non-functional change. Differential Revision: https://reviews.llvm.org/D22959 llvm-svn: 278661	2016-08-15 08:22:42 +00:00
James Molloy	9a3c82f5cf	[SimplifyCFG] Rewrite SinkThenElseCodeToEnd The new version has several advantages: 1) IMSHO it's more readable and neater 2) It handles loads and stores properly 3) It can handle any number of incoming blocks rather than just two. I'll be taking advantage of this in a followup patch. With this change we can now finally sink load-modify-store idioms such as: if (a) return *b += 3; else return *b += 4; => %z = load i32, i32* %y %.sink = select i1 %a, i32 5, i32 7 %b = add i32 %z, %.sink store i32 %b, i32* %y ret i32 %b When this works for switches it'll be even more powerful. llvm-svn: 278660	2016-08-15 08:04:56 +00:00
Prakhar Bahuguna	a305a435a6	[Thumb] Validate branch target for CBZ/CBNZ instructions. Summary: The assembler currently does not check the branch target for CBZ/CBNZ instructions, which only permit branching forwards with a positive offset. This adds validation for the branch target to ensure negative PC-relative offsets are not encoded into the instruction, whether specified as a literal or as an assembler symbol. Reviewers: rengolin, t.p.northover Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D23312 llvm-svn: 278659	2016-08-15 07:57:44 +00:00
James Molloy	196ad0823e	[LSR] Don't try and create post-inc expressions on non-rotated loops If a loop is not rotated (for example when optimizing for size), the latch is not the backedge. If we promote an expression to post-inc form, we not only increase register pressure and add a COPY for that IV expression but for all IVs! Motivating testcase: void f(float *a, float *b, float *c, int n) { while (n-- > 0) *c++ = *a++ + *b++; } It's imperative that the pointer increments be located in the latch block and not the header block; if not, we cannot use post-increment loads and stores and we have to keep both the post-inc and pre-inc values around until the end of the latch which bloats register usage. llvm-svn: 278658	2016-08-15 07:53:03 +00:00
Eugene Leviant	a5c8a68535	[ELF] Do not add start and end symbols in case they are already defined llvm-svn: 278657	2016-08-15 07:24:20 +00:00
David Majnemer	b439dfe6ba	[CodeGen] Ignore unnamed bitfields before handling vector fields We processed unnamed bitfields after our logic for non-vector field elements in records larger than 128 bits. The vector logic would determine that the bit-field disqualifies the record from occupying a register despite the unnamed bit-field not participating in the record size nor its alignment. N.B. This behavior matches GCC and ICC. llvm-svn: 278656	2016-08-15 07:20:40 +00:00
David Majnemer	b229cb0a43	[CodeGen] Correctly implement the AVX512 psABI rules An __m512 vector type wrapped in a structure should be passed in a vector register. Our prior implementation was based on a draft version of the psABI. This fixes PR28975. N.B. The update to the ABI was made here: https://github.com/hjl-tools/x86-psABI/commit/30f9c9 llvm-svn: 278655	2016-08-15 06:39:18 +00:00
Craig Topper	f774de6d54	[X86] PADDUSB/W instructions should be commutable. llvm-svn: 278654	2016-08-15 06:31:57 +00:00
Craig Topper	80c8b80919	[X86] Mark some of the X86 SDNodes as commutative. llvm-svn: 278653	2016-08-15 04:47:30 +00:00
Craig Topper	dbc387cfc9	[X86] X86ISD::FANDN is not commutative or associative. llvm-svn: 278652	2016-08-15 04:47:28 +00:00
David Majnemer	3b47a5a562	[ScopedNoAliasAA] collectMDInDomain should be a free function collectMDInDomain doesn't use any class members, making it a free function is not a functional change. llvm-svn: 278651	2016-08-15 03:56:06 +00:00
Richard Smith	b1a268aae7	cxx_status: update features implemented in clang 3.9 from "svn" to "Clang 3.9" now that svn trunk is 4.0. llvm-svn: 278650	2016-08-15 02:47:23 +00:00
Richard Smith	6ec6b248ab	cxx_status: mark decomposition declarations as "partial": the implementation is essentially complete, other than parts where design questions have been raised (lambda capture, decomposition of arrays by copy). llvm-svn: 278649	2016-08-15 02:37:43 +00:00
Richard Smith	1879f1069b	Disable lambda-capture of decomposition declaration bindings for now, until CWG agrees on how they're supposed to work. llvm-svn: 278648	2016-08-15 02:34:23 +00:00
Richard Smith	39a7c4c65a	Add a triple to this test to make buildbots happier. llvm-svn: 278647	2016-08-15 02:24:00 +00:00
David Majnemer	8b8869f8ef	[ScopedNoAliasAA] Only collect noalias nodes if we have alias.scope nodes No functional change is intended. llvm-svn: 278646	2016-08-15 02:23:50 +00:00
David Majnemer	ddc7ab26fc	[ScopedNoAliasAA] Replace !ScopeNodes.size() with ScopeNodes.empty() No functional change is intended. llvm-svn: 278645	2016-08-15 02:23:48 +00:00
David Majnemer	c77a1390de	Revert "[ScopedNoAliasAA] Remove an unneccesary set" This reverts commit r278641. I'm not sure why but this has upset the multistage builders... llvm-svn: 278644	2016-08-15 02:23:46 +00:00
Eric Fiselier	921a3f1c8d	Check in SFINAE base class for use in optional/variant llvm-svn: 278643	2016-08-15 01:51:54 +00:00
Richard Smith	da38363784	P0217R3: code generation support for decomposition declarations. llvm-svn: 278642	2016-08-15 01:33:41 +00:00
David Majnemer	5ec9c58f13	[ScopedNoAliasAA] Remove an unneccesary set We are trying to prove that one group of operands is a subset of another. We did this by populating two Sets and determining that every element within one was inside the other. However, this is unnecessary. We can simply construct a single set and test if each operand is within it. llvm-svn: 278641	2016-08-15 00:13:04 +00:00
Richard Smith	97fcf4be9b	Explicitly generate a reference variable to hold the initializer for a tuple-like decomposition declaration. This significantly simplifies the semantics of BindingDecls for AST consumers (they can now always be evaluated at the point of use). llvm-svn: 278640	2016-08-14 23:15:52 +00:00
Sanjay Patel	52fe9ae990	[InstCombine] add test for missing vector icmp fold llvm-svn: 278639	2016-08-14 22:56:46 +00:00
Eric Fiselier	599a842522	Add private option to change build dialect from C++11 Although libc++ only requires C++11 to build, there are other reasons to turn on a newer dialect in the build. For example IDEs may not highlight any C++14/C++17 in the headers when configured for C++11. This patch adds a private option for changing this. llvm-svn: 278638	2016-08-14 22:51:54 +00:00
Sanjay Patel	7e57b00274	[InstCombine] add tests for vector icmp folds llvm-svn: 278637	2016-08-14 22:44:10 +00:00
Sanjay Patel	8554f70c07	[InstCombine] add test for potentially missing vector icmp fold llvm-svn: 278636	2016-08-14 22:30:07 +00:00
Sanjay Patel	beebe05af1	[InstCombine] add test for missing vector icmp fold llvm-svn: 278635	2016-08-14 22:29:27 +00:00
Sanjay Patel	ba1f9fbddc	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278634	2016-08-14 22:28:50 +00:00
Sanjay Patel	f6559404d5	[InstCombine] remove unnecessary function attributes from tests llvm-svn: 278633	2016-08-14 21:48:21 +00:00
Sanjay Patel	b44ca3bfa9	[InstCombine] add tests for missing vector icmp folds llvm-svn: 278632	2016-08-14 21:36:22 +00:00
Sanjay Patel	bbb3dffd0a	[InstCombine] add test for missing vector icmp fold llvm-svn: 278631	2016-08-14 21:05:08 +00:00
Sanjay Patel	66a3457a4c	[InstCombine] add test for missing vector icmp fold llvm-svn: 278630	2016-08-14 20:39:42 +00:00
Craig Topper	37e8c5443c	[AVX-512] Mark VPMADDWD as commutable to match SSE/AVX version. llvm-svn: 278629	2016-08-14 17:57:22 +00:00
Craig Topper	c677e97dff	[AVX-512] Add masked commutable floating point max/min instructions to folding tables. llvm-svn: 278628	2016-08-14 17:57:19 +00:00
Craig Topper	29fbdc309a	[AVX-512] Add masked logical operations to memory folding tables. llvm-svn: 278627	2016-08-14 17:57:16 +00:00
Igor Breger	505f2cc468	[AVX512] Fix VFPCLASSSD/VFPCLASSSS intrinsic lowering. The i1 result should be zero extended according to SPEC. Differential Revision: http://reviews.llvm.org/D23489 llvm-svn: 278626	2016-08-14 13:58:57 +00:00
Ed Maste	0767b40e25	elf-core: set powerpc vector byte order correctly It was being set to the fp reg set's byte order due to an apparent copy-and-pasteo. llvm-svn: 278625	2016-08-14 11:30:11 +00:00
Igor Breger	6fc00b0acf	autogenerate checks llvm-svn: 278624	2016-08-14 09:34:39 +00:00
Igor Breger	8672408db0	[AVX512] Fix insertelement i1 lowering. 1. Use shuffle to insert element i1 into vector. The previous implementation was incorrect (dest_bit OR src_bit; it doesn't clear the bit if src_bit=0). 2. Improve shuffle i1 vector: use CVT2MASK if supported instead of TRUNCATE. Differential Revision: http://reviews.llvm.org/D23347 llvm-svn: 278623	2016-08-14 05:25:07 +00:00
Saleem Abdulrasool	98541b09f4	Revert "gold: add a cast to appease std::max NFC" This was fixed differently by Teresa and this should no longer be needed. llvm-svn: 278622	2016-08-14 05:07:20 +00:00
Diana Picus	68be1eb885	Revert "CodeGen: If Convert blocks that would form a diamond when tail-merged." This reverts commit r278287. This commit broke the clang-cmake-thumbv7-a15-full-sh bot. See https://llvm.org/bugs/show_bug.cgi?id=28949 llvm-svn: 278621	2016-08-14 02:10:18 +00:00
Diana Picus	35ccf53e75	Revert "Codegen: Don't tail-duplicate blocks with un-analyzable fallthrough." This reverts commit r278288. r278287 broke the clang-cmake-thumbv7-a15-full-sh bot. Revert this so we can get to r278287. llvm-svn: 278620	2016-08-14 02:10:12 +00:00
Sanjoy Das	35459f0e34	[IRCE] Change variable grouping; NFC llvm-svn: 278619	2016-08-14 01:04:50 +00:00

239409 Commits