llvm-project

Commit Graph

Author	SHA1	Message	Date
Jatin Bhateja	2c139f77c7	[X86] Allow cross-lane permutations for sub targets supporting AVX2. Summary: Most instructions in AVX work “in-lane”, that is, each source element is applied only to other elements of the same lane, thus a cross lane permutation is costly and needs more than one instrution. AVX2 includes instructions to perform any-to-any permutation of words over a 256-bit register and vectorized table lookup. This should also Fix PR34369 Differential Revision: https://reviews.llvm.org/D37388 llvm-svn: 312608	2017-09-06 02:58:47 +00:00
Lang Hames	6dbf0876c1	[ORC] Fix some comments in JITSymbol. Patch by Breckin Loggins. Thanks Breckin! llvm-svn: 312607	2017-09-06 02:53:37 +00:00
Petr Hosek	53335d6d86	[libcxxabi] When built with ASan, __cxa_throw calls __asan_handle_no_return The ASan runtime on many systems intercepts cxa_throw just so it can call asan_handle_no_return first. Some newer systems such as Fuchsia don't use interceptors on standard library functions at all, but instead use sanitizer-instrumented versions of the standard libraries. When libc++abi is built with ASan, cxa_throw can just call asan_handle_no_return itself so no interceptor is required. This is a re-land of r311045, which has become safe after r311869 changed compiler-rt to declare __asan_handle_no_return. Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D37229 llvm-svn: 312606	2017-09-06 02:43:54 +00:00
Eric Beckmann	0aa4b7d4c5	Fix crbug 759265 by suppressing llvm mt warnings. Summary: Previous would throw warning whenever libxml2 is not installed. Now only give this warning if merging manifest fails. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37240 llvm-svn: 312604	2017-09-06 01:50:36 +00:00
Rafael Espindola	dc8b7a96bd	Use the section name if a STT_SECTION symbol has empty name. Without this we would have multiple relocations pointing to symbols with the same name: the empty string. There was no way for yaml2obj to be able to handle that. A more general solution would be to unique symbol names in a similar way to how we unique section names. In practice I think this covers all common cases and is a bit more user friendly than using names like sym1, sym2, sym3, etc. llvm-svn: 312603	2017-09-06 00:57:53 +00:00
Bruno Cardoso Lopes	bad2c4a000	Fix indentation mistake from r312595 llvm-svn: 312599	2017-09-06 00:44:10 +00:00
Yaxun Liu	fc5121a722	[AMDGPU] Transform __read_pipe_* and __write_pipe_* When packet size equals packet align and is power of 2, transform __read_pipe* and __write_pipe* to specialized library function. Differential Revision: https://reviews.llvm.org/D36831 llvm-svn: 312598	2017-09-06 00:30:27 +00:00
Evgeniy Stepanov	9566d28997	[msan] Remove a stale fixme (NFC). It was fixed in 312576. llvm-svn: 312597	2017-09-06 00:28:52 +00:00
Petr Hosek	4f4bdc3c20	[sanitizer_common][Fuchsia] Update Fuchsia sanitizer markup Include URLs to the markup format specification in code comments. Use sanitizer markup in the sancov message about a dump just produced. Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D37273 llvm-svn: 312596	2017-09-06 00:00:46 +00:00
Bruno Cardoso Lopes	c1843c2c85	[Darwin] Enable -fstack-protector (back) by default with -ffreestanding Go back to behavior prior to r289005. rdar://problem/32987198 llvm-svn: 312595	2017-09-05 23:50:58 +00:00
Nico Weber	a05cbb8b95	lld-link: Add --rsp-quoting= flag. This ports https://reviews.llvm.org/D19425 from clang / https://reviews.llvm.org/D22015 from the ELF port to COFF lld. This can be useful when linking COFF files on a posix host. https://reviews.llvm.org/D37452 llvm-svn: 312594	2017-09-05 23:46:45 +00:00
Kostya Serebryany	79cdf36a2c	[libFuzzer] remporary disable an unstable test llvm-svn: 312593	2017-09-05 23:45:54 +00:00
Sanjay Patel	6840c5ff75	[ValueTracking, InstCombine] canonicalize fcmp ord/uno with non-NAN ops to null constants This is a preliminary step towards solving the remaining part of PR27145 - IR for isfinite(): https://bugs.llvm.org/show_bug.cgi?id=27145 In order to solve that one more generally, we need to add matching for and/or of fcmp ord/uno with a constant operand. But while looking at those patterns, I realized we were missing a canonicalization for nonzero constants. Rather than limiting to just folds for constants, we're adding a general value tracking method for this based on an existing DAG helper. By transforming everything to 0.0, we can simplify the existing code in foldLogicOfFCmps() and pick up missing vector folds. Differential Revision: https://reviews.llvm.org/D37427 llvm-svn: 312591	2017-09-05 23:13:13 +00:00
Rafael Espindola	8db11a4f1c	Fix a use after free. llvm-svn: 312590	2017-09-05 23:00:51 +00:00
Eli Friedman	c22c699882	[ARM] Make ARMExpandPseudo add implicit uses for predicated instructions Missing these could potentially screw up post-ra scheduling. Issue found by inspection, so I don't have a real testcase. Included test just verifies the expected operands after expansion. Differential Revision: https://reviews.llvm.org/D35156 llvm-svn: 312589	2017-09-05 22:54:06 +00:00
Eli Friedman	06d0ee734a	[ARM] Register ARMExpandPseudo pass. This allows -run-pass etc. to refer to it. (Split off from D35156.) llvm-svn: 312587	2017-09-05 22:45:23 +00:00
Rafael Espindola	88ee57ebed	obj2yaml: Print unique section names. Without this patch passing a .o file with multiple sections with the same name to obj2yaml produces a yaml file that yaml2obj cannot handle. This is pr34162. The problem is that when specifying, for example, the section of a symbol, we get only Section: foo and don't know which of the sections whose name is foo we have to use. One alternative would be to use section numbers. This would work, but the output from obj2yaml would be very inconvenient to edit as deleting a section would invalidate all indexes. Another alternative would be to invent a unique section id that would exist only on yaml. This would work, but seems a bit heavy handed. We could make the id optional and default it to the section name. Since in the last alternative the id is basically what this patch uses as a name, it can be implemented as a followup patch if needed. llvm-svn: 312585	2017-09-05 22:30:00 +00:00
Lang Hames	4c74402601	[ORC] Convert null remote symbols to null JITSymbols. The existing code created a JITSymbol with an invalid materializer instead, guaranteeing a 'missing symbol' error when someone tried to materialize the symbol. llvm-svn: 312584	2017-09-05 22:24:40 +00:00
Zachary Turner	37c747498d	[CodeView] Don't output S_UDTs for nested typedefs. S_UDT records are basically the "bridge" between the debugger's expression evaluator and the type information. If you type (Foo)nullptr into the watch window, the debugger looks for an S_UDT record named Foo. If it can find one, it displays your type. Otherwise you get an error. We have always understood this to mean that if you have code like this: struct A { int X; }; struct B { typedef A AT; AT Member; }; that you will get 3 S_UDT records. "A", "B", and "B::AT". Because if you were to type (B::AT)nullptr into the debugger, it would need to find an S_UDT record named "B::AT". But "B::AT" is actually the S_UDT record that would be generated if B were a namespace, not a struct. So the debugger needs to be able to distinguish this case. So what it does is: 1. Look for an S_UDT named "B::AT". If it finds one, it knows that AT is in a namespace. 2. If it doesn't find one, split at the scope resolution operator, and look for an S_UDT named B. If it finds one, look up the type for B, and then look for AT as one of its members. With this algorithm, S_UDT records for nested typedefs are not just unnecessary, but actually wrong! The results of implementing this in clang are dramatic. It cuts our /DEBUG:FASTLINK PDB sizes by more than 50%, and we go from being ~20% larger than MSVC PDBs on average, to ~40% smaller. It also slightly speeds up link time. We get about 10% faster links than without this patch. Differential Revision: https://reviews.llvm.org/D37410 llvm-svn: 312583	2017-09-05 22:06:39 +00:00
Vedant Kumar	3ae4170480	Revert "[Decompression] Fail gracefully when out of memory" This reverts commit r312526. Revert "Fix test/DebugInfo/dwarfdump-decompression-invalid-size.test" This reverts commit r312527. It causes an ASan failure: http://lab.llvm.org:8080/green/job/clang-stage2-cmake-RgSan_check/4150 llvm-svn: 312582	2017-09-05 22:04:00 +00:00
Evgeniy Stepanov	29c7487167	Remove ld.config.txt for Android O. ld.config.txt defines linker namespaces in a way that is incompatible with ASan. Remove the file when installing ASan on an Android O (8.0.x) device. Patch by Jiyong Park. llvm-svn: 312581	2017-09-05 21:51:20 +00:00
Richard Smith	056bf77faf	Fix memory leak after r312467. The ModuleMap is the owner of the global module object until it's reparented under a real module. llvm-svn: 312580	2017-09-05 21:46:22 +00:00
Davide Italiano	f887406a7d	[unittest/ReverseIteration] Unbreak when compiling with GCC. llvm-svn: 312579	2017-09-05 21:27:23 +00:00
Sanjay Patel	18e126e5d4	[InstCombine] add nnan tests; NFC As suggested in D37427, we could have a value tracking function and folds that use it to simplify these cases. llvm-svn: 312578	2017-09-05 21:20:35 +00:00
Rui Ueyama	2ea27186b4	Use raw_string_ostream::str to get a result string. Looks like raw_string_ostream is buffered. If we do not call `flush` nor `str`, it is not guaranteed that a result string has all characters that were written to it. It wasn't failing on buildbots, but I could reproduce the issue on my Windows workstation. llvm-svn: 312577	2017-09-05 21:17:32 +00:00
Evgeniy Stepanov	8b80b328d1	[msan] Check sigset_t and sigaction arguments. Summary: Check sigset_t arguments in ppoll, sigwait, sigprocmask interceptors, and the entire "struct sigaction" in sigaction. This can be done because sigemptyset/sigfullset are intercepted and signal masks should be correctly marked as initialized. Reviewers: vitalybuka Subscribers: kubamracek, llvm-commits Differential Revision: https://reviews.llvm.org/D37367 llvm-svn: 312576	2017-09-05 21:08:56 +00:00
Davide Italiano	32504cf661	[GVNHoist] Move duplicated code to a helper function. NFCI. llvm-svn: 312575	2017-09-05 20:49:41 +00:00
Mandeep Singh Grang	9837e9945f	[unittests] Add reverse iteration unit test for pointer-like keys Reviewers: dblaikie, efriedma, mehdi_amini Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37241 llvm-svn: 312574	2017-09-05 20:39:01 +00:00
Reid Kleckner	d53c39ba46	Commit changes missing from r312572 llvm-svn: 312573	2017-09-05 20:38:29 +00:00
Reid Kleckner	30701edf76	[ms] Implement the __annotation intrinsic llvm-svn: 312572	2017-09-05 20:27:35 +00:00
Reid Kleckner	d4523689a6	Fix RST syntax in LangRef for llvm.codeview.annotation intrinsic llvm-svn: 312571	2017-09-05 20:26:25 +00:00
Rui Ueyama	888da8c232	Do not use invalid iterators to fix Windows build. std::vector::insert invalidates all iterators, so it was not safe to do Script->Opt.Commands.insert(++I, Make(ElfSym::End1)); Script->Opt.Commands.insert(++I, Make(ElfSym::End2)); because after the first line, `I` is no longer valid. This patch rewrites fixes the issue. I belive the new code without higher-order functions is a bit more readable than before. llvm-svn: 312570	2017-09-05 20:17:37 +00:00
Reid Kleckner	e33c94f1b0	Add llvm.codeview.annotation to implement MSVC __annotation Summary: This intrinsic represents a label with a list of associated metadata strings. It is modelled as reading and writing inaccessible memory so that it won't be removed as dead code. I think the intention is that the annotation strings should appear at most once in the debug info, so I marked it noduplicate. We are allowed to inline code with annotations as long as we strip the annotation, but that can be done later. Reviewers: majnemer Subscribers: eraman, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D36904 llvm-svn: 312569	2017-09-05 20:14:58 +00:00
Daniel Neilson	3f0e4ad833	[SCEV] Ensure ScalarEvolution::createAddRecFromPHIWithCastsImpl properly handles out of range truncations of the start and accum values Summary: When constructing the predicate P1 in ScalarEvolution::createAddRecFromPHIWithCastsImpl() it is possible for the PHISCEV from which the predicate is constructed to be a SCEVConstant instead of a SCEVAddRec. If this happens, then the cast<SCEVAddRec>(PHISCEV) in the code will assert. Such a PHISCEV is possible if either the start value or the accumulator value is a constant value that not equal to its truncated value, and if the truncated value is zero. This patch adds tests that demonstrate the cast<> assertion, and fixes this problem by checking whether the PHISCEV is a constant before constructing the P1 predicate; if it is, then P1 is equivalent to one of P2 or P3. Additionally, if we know that the start value or accumulator value are constants then we check whether the P2 and/or P3 predicates are known false at compile time; if either is, then we bail out of constructing the AddRec. Reviewers: sanjoy, mkazantsev, silviu.baranga Reviewed By: mkazantsev Subscribers: mkazantsev, llvm-commits Differential Revision: https://reviews.llvm.org/D37265 llvm-svn: 312568	2017-09-05 19:54:03 +00:00
Peter Collingbourne	d0e9c167d8	LTO: Try to open cache files before renaming them. It appears that a potential race between the cache client and the cache pruner that I thought was unlikely actually happened in practice [1]. Try to avoid the race condition by opening the temporary file before renaming it. Do this only on non-Windows platforms because we cannot rename open files on Windows using the sys::fs::rename function. [1] https://luci-logdog.appspot.com/v/?s=chromium%2Fbb%2Fchromium.memory%2FLinux_CFI%2F1610%2F%2B%2Frecipes%2Fsteps%2Fcompile%2F0%2Fstdout Differential Revision: https://reviews.llvm.org/D37410 llvm-svn: 312567	2017-09-05 19:51:38 +00:00
Michael Kruse	420c4863a9	[Simplify] Actually remove unsed instruction from region header. Since r312249 instructions of a entry block of region statements are not marked as root anymore and hence can theoretically be removed if unused. Theoretically, because the instruction list was not changed. Still, MemoryAccesses for unused instructions were removed. This lead to a failed assertion in the code generator when the MemoryAccess for the still listed instruction was not found. This hould fix the Assertion failed: ArrayAccess && "No array access found for instruction!", file ScopInfo.h, line 1494 compiler crashes. llvm-svn: 312566	2017-09-05 19:44:39 +00:00
Gor Nishanov	db419a6f7c	[coroutines] Make sure auto return type of await_resume is properly handled Reviewers: rsmith, EricWF Reviewed By: rsmith Subscribers: javed.absar, cfe-commits Differential Revision: https://reviews.llvm.org/D37454 llvm-svn: 312565	2017-09-05 19:31:52 +00:00
Craig Topper	784fa8a4e3	[X86] Remove unnecessary (v4f32 (X86vzmovl (v4f32 (scalar_to_vector FR32X)))) patterns We had already disabled the pattern for SSE4.1 and SSE4.2. But it got re-enabled for AVX and AVX512. With SSE41 we rely on a separate (v4f32 (X86vzmovl VR128)) pattern to select blendps with a xorps to create zeroess. And a separate (v4f32 (scalar_to_vector FR32X)) to select a COPY_TO_REG_CLASS to move FR32 to VR128 The same thing can happen for AVX with vblendps and those separate patterns already exist. For AVX512, (v4f32 (X86vzmov VR128)) will select a VMOVSS instruction instead of VBLENDPS due to their not being a EVEX VBLENDPS. This is what we were getting out of the larger pattern anyway. So the larger pattern is unneeded for AVX512 too. For SSE1-SSSE3 we can rely on (v4f32 (X86vzmov VR128)) selecting a MOVSS similar to AVX512. Again this is what the larger pattern did too. So the only real change here is that AVX1/2 now properly outputs a VBLENDPS during isel instead of a VMOVSS to match SSE41. Most tests didn't notice because the two address instruction pass knows how to turn VMOVSS into VBLENDPS to get an independent destination register. llvm-svn: 312564	2017-09-05 19:09:02 +00:00
Konstantin Zhuravlyov	80528702c9	AMDGPU: Cleanup/refactor SIMemoryLegalizer [3]: - Refactor SIMemOpInfo's constructors - Allow construction of NotAtomic SIMemOpInfo Differential Revision: https://reviews.llvm.org/D37396 llvm-svn: 312563	2017-09-05 19:01:10 +00:00
Jan Kratochvil	75a79e72ae	Fix DW_FORM_strp parsing Differential revision: https://reviews.llvm.org/D37441 llvm-svn: 312562	2017-09-05 19:01:01 +00:00
Matt Arsenault	22cdb61a78	AMDGPU: Fix not accounting for tail call resource usage If the only call in a function is a tail call, the function isn't considered to have a call since it's a type of return. llvm-svn: 312561	2017-09-05 18:36:36 +00:00
Zvi Rackover	2096893f34	X86 Tests: Adding missing AVX512 fptoui coverage tests. NFC. Some of the cases show missing pattern i intend to fix shortly. llvm-svn: 312560	2017-09-05 18:24:39 +00:00
Tony Jiang	61ef1c540c	[PPC][NFC] Renaming things with 'xxinsert' moniker to 'vecinsert' to make it more general. Commit on behalf of Graham Yiu (gyiu@ca.ibm.com) llvm-svn: 312547	2017-09-05 18:08:02 +00:00
Jonas Devlieghere	5dc87861d6	[diagtool] Change default tree behavior to print only flags This patch changes the default behavior of `diagtool tree` to only display warning flags and not the internal warnings flags. The latter is an implementation detail of the compiler and usually not what the users wants. Furthermore, flags that are enabled by default are now also printed in green. Originally, this was only the case for the diagnostic names. Differential revision: https://reviews.llvm.org/D37390 llvm-svn: 312546	2017-09-05 18:04:40 +00:00
Jonas Devlieghere	e4563d1733	[NFC] Loop modernization in diagtool Precommit for https://reviews.llvm.org/D37390 llvm-svn: 312545	2017-09-05 18:04:34 +00:00
Adam Nemet	9c35f6383b	Split opt-remark YAML and opt output testing on this test This prepares for https://reviews.llvm.org/D33514 llvm-svn: 312544	2017-09-05 18:03:39 +00:00
Craig Topper	33caeadd90	[AVX512] Remove patterns for (v8f32 (X86vzmovl (insert_subvector undef, (v4f32 (scalar_to_vector FR32X:)), (iPTR 0)))) and the same for v4f64. We don't have this same pattern for AVX2 so I don't believe we should have it for AVX512. We also didn't have it for v16f32. llvm-svn: 312543	2017-09-05 17:33:58 +00:00
Erich Keane	e916d54614	[Preprocessor] Correct internal token parsing of newline characters in CRLF Correct implementation: Apparently I managed in r311683 to submit the wrong version of the patch for this, so I'm correcting it now. Differential Revision: https://reviews.llvm.org/D37079 llvm-svn: 312542	2017-09-05 17:32:36 +00:00
Konstantin Zhuravlyov	1aa667fe64	AMDGPU/NFC: Cleanup/refactor SIMemoryLegalizer [2]: - Make SIMemOpInfo a class - Add accessor methods to SIMemOpInfo - Move get*Info methods to SIMemOpInfo Differential Revision: https://reviews.llvm.org/D37395 llvm-svn: 312541	2017-09-05 16:41:25 +00:00
Konstantin Zhuravlyov	844845ae06	AMDGPU/NFC: Cleanup/refactor SIMemoryLegalizer [1]: - Rename MemOpInfo -> SIMemOpInfo - Move SIMemOpInfo class out of SIMemoryLegalizer class Differential Revision: https://reviews.llvm.org/D37394 llvm-svn: 312540	2017-09-05 16:18:05 +00:00

1 2 3 4 5 ...

270941 Commits All Branches Search

270941 Commits

All Branches