llvm-project

Commit Graph

Author	SHA1	Message	Date
James Molloy	b7efa6c227	[SimplifyCFG] Fix bootstrap failure after r280220 We check that a sinking candidate is used by only one PHI node during our legality checks. However for instructions that are used by other sinking candidates our heuristic is less conservative. This can result in a candidate actually being illegal when we come to sink it because of how we sunk a predecessor. Do the used-by-only-one-PHI checks again during sinking to ensure we don't crash. llvm-svn: 280228	2016-08-31 12:33:48 +00:00
Sjoerd Meijer	f2392f69d5	Revision r280064 adds new options -fdenormal-fp-math and passes through option -ffast-math to CC1, but it included a wrong llvm regression tests which was removed in r280065. Although regression test noexceptionsfpmath.c makes sure -fno-trapping-math ends up as a function attribute, this adds a test that explicitly checks the driver output for -fno-trapping-math. llvm-svn: 280227	2016-08-31 12:31:03 +00:00
Rafael Espindola	a6c9744a6c	Delete DefinedBitcode. Given that we almost always want to handle it as DefinedRegular, just use DefinedRegular. llvm-svn: 280226	2016-08-31 12:30:34 +00:00
Davide Italiano	29fa6ab7b1	[LTO/InputFiles] Merge two ifs into one. NFCI. llvm-svn: 280225	2016-08-31 12:27:47 +00:00
Davide Italiano	30ed8106ad	[LTO] Simplify unnamed_addr handling logic. NFCI. llvm-svn: 280224	2016-08-31 12:20:46 +00:00
Simon Atanasyan	e5532a12f7	[ELF][MIPS] Support R_MIPS_HIGHER / R_MIPS_HIGHEST relocations calculation llvm-svn: 280223	2016-08-31 11:47:21 +00:00
Simon Atanasyan	97519cba2e	[ELF][MIPS] Inline function. NFC llvm-svn: 280222	2016-08-31 11:47:17 +00:00
Nikolay Haustov	eba808957e	AMDGPU/SI: Handle aliases in AMDGPUAlwaysInlinePass Summary: Simply replace usage of aliases to functions with aliasee. This came up when bitcode linking to builtin library and calls to aliases not being resolved. Also made minor improvements to existing test. Reviewers: tstellarAMD, alex-t, vpykhtin Subscribers: arsenm, wdng, rampitec Differential Revision: https://reviews.llvm.org/D24023 llvm-svn: 280221	2016-08-31 11:18:33 +00:00
James Molloy	75b1fb9b9e	Attempt to pacify buildbots after r280217 These clang tests check diagnostics from the backend by giving it an unvectorizable loop. This loop is now vectorized :/ Make it really unvectorizable by making it unprofitable to ifconvert. llvm-svn: 280220	2016-08-31 11:01:41 +00:00
James Molloy	171fdac7ce	[SimplifyCFG] Add a workaround to fix PR30188 We're sinking stores, which is a good thing, but in the process creating selects for the store address operand, which SROA/Mem2Reg can't look through, which caused serious regressions. The real fix is in SROA, which I'll be looking into. llvm-svn: 280219	2016-08-31 10:46:45 +00:00
James Molloy	8e69b032e5	[SimplifyCFG] Improve FoldValueComparisonIntoPredecessors to handle more cases A very important case is not handled here: multiple arcs to a single block with a PHI. Consider: a: %1 = icmp %b, 1 br %1, label %c, label %e c: %2 = icmp %b, 2 br %2, label %d, label %e d: br %e e: phi [0, %a], [1, %c], [2, %d] FoldValueComparisonIntoPredecessors will refuse to fold this, as it doesn't know how to deal with two arcs to a common destination with different PHI values. The answer is obvious - just split all conflicting arcs. llvm-svn: 280218	2016-08-31 10:46:39 +00:00
James Molloy	c53b40b509	[SimplifyCFG] Handle tail-sinking of more than 2 incoming branches This was a real restriction in the original version of SinkIfThenCodeToEnd. Now it's been rewritten, the restriction can be lifted. As part of this, we handle a very common and useful case where one of the incoming branches is actually conditional. Consider: if (a) x(1); else if (b) x(2); This produces the following CFG: [if] / \ [x(1)] [if] \| \| \ \| \| \ \| [x(2)] \| \ \| / [ end ] [end] has two unconditional predecessor arcs and one conditional. The conditional refers to the implicit empty 'else' arc. This same pattern can also be caused by an empty default block in a switch. We can't sink the call to x() down to end because no call to x() happens on the third incoming arc (assume that x() has sideeffects for the sake of argument; if something is safe to speculate we could indeed sink nevertheless but this cannot happen in the general case and causes many extra selects). We are now able to detect this case and split off the unconditional arcs to a common successor: [if] / \ [x(1)] [if] \| \| \ \| \| \ \| [x(2)] \| \ / \| [sink.split] \| \ / [ end ] Now we can sink the call to x() into %sink.split. This can cause significant code simplification in many testcases. llvm-svn: 280217	2016-08-31 10:46:33 +00:00
James Molloy	55bd04cd20	[SimplifyCFG] Change the algorithm in SinkThenElseCodeToEnd r279460 rewrote this function to be able to handle more than two incoming edges and took pains to ensure this didn't regress anything. This time we change the logic for determining if an instruction should be sunk. Previously we used a single pass greedy algorithm - sink instructions until one requires more than one PHI node or we run out of instructions to sink. This had the problem that sinking instructions that had non-identical but trivially the same operands needed extra logic so we sunk them aggressively. For example: %a = load i32* %b %d = load i32* %b %c = gep i32* %a, i32 0 %e = gep i32* %d, i32 1 Sinking %c and %e would naively require two PHI merges as %a != %d. But the loads are obviously equivalent (and maybe can't be hoisted because there is no common predecessor). This is why we implemented the fairly complex function areValuesTriviallySame(), to look through trivial differences like this. However it's just not clever enough. Instead, throw areValuesTriviallySame away, use pointer equality to check equivalence of operands and switch to a two-stage algorithm. In the "scan" stage, we look at every sinkable instruction in isolation from end of block to front. If it's sinkable, we keep track of all operands that required PHI merging. In the "sink" stage, we iteratively sink the last non-terminator in the source blocks. But when calculating how many PHIs are actually required to be inserted (to work out if we should stop or not) we remove any values that have already been sunk from the set of PHI-merges required, which allows us to be more aggressive. This turns an algorithm with potentially recursive lookahead (looking through GEPs, casts, loads and any other instruction potentially not CSE'd) to two linear scans. llvm-svn: 280216	2016-08-31 10:46:23 +00:00
James Molloy	923e98c232	[SimplifyCFG] Tail-merge calls with sideeffects This was deliberately disabled during my rewrite of SinkIfThenToEnd to keep behaviour at least vaguely consistent with the previous version and keep it as close to NFC as I could. There's no real reason not to merge sideeffect calls though, so let's do it! Small fixup along the way to ensure we don't create indirect calls. Should fix PR28964. llvm-svn: 280215	2016-08-31 10:46:16 +00:00
Simon Pilgrim	7b09af193a	[X86][SSE] Improve awareness of fptrunc implicit zeroing of upper 64-bits of xmm result Add patterns to avoid inserting unnecessary zeroing shuffles when lowering fptrunc to (v)cvtpd2ps Differential Revision: https://reviews.llvm.org/D23797 llvm-svn: 280214	2016-08-31 10:35:13 +00:00
Filipe Cabecinhas	453b55551f	Fix buildbot bug: Wasn't printing scariness for DoubleFree llvm-svn: 280213	2016-08-31 09:39:47 +00:00
George Rimar	20b6598c10	[ELF] - Remove VersionScriptParser class and move the members to ScriptParser Patch removes VersionScriptParser class and moves the members to ScriptParser It opens road for implementation of VERSION linkerscript command. Differential revision: https://reviews.llvm.org/D23774 llvm-svn: 280212	2016-08-31 09:08:26 +00:00
George Rimar	ebf1da565c	[ELF] - Fix (partial) for bug 28843 - Make sure we handle options with opposing meanings. As stated in PR28843: we should handle command lines with -target1-rel -target1-abs --demangle --no-demangle Patch implements this for specified options. There are probably other conflicting options can exist, so fix is called "partial". Differential revision: https://reviews.llvm.org/D23867 llvm-svn: 280211	2016-08-31 08:53:21 +00:00
Eugene Leviant	aa49819162	Add DT_REL(A)COUNT tag to .dynamic section This patch groups relative relocations in a single block in combrelocs mode and adds DT_RELCOUNT or DT_RELACOUNT tag to .dynamic section Differential revision: https://reviews.llvm.org/D23661 llvm-svn: 280210	2016-08-31 08:51:39 +00:00
George Rimar	9503f6d211	[ELF] - Introduce DiscardPolicy instead of 3 relative bool fields. DiscardPolicy is enum replacing several boolean options. This approach is not only consistent with what we use for unresolveds (UnresolvedPolicy), but also should help to solve a problem of options with opposing meanings, mentioned in PR28843 Differential revision: https://reviews.llvm.org/D23868 llvm-svn: 280209	2016-08-31 08:46:30 +00:00
Pavel Labath	b1c4b836b9	XFail new TestPyObjSynthProvider.py on linux until I can investigate the cause of the problem llvm-svn: 280208	2016-08-31 08:43:40 +00:00
Pavel Labath	b9739d4090	Revert r280137 and 280139 and subsequent build fixes The rewrite of StringExtractor::GetHexMaxU32 changes functionality in a way which makes lldb-server crash. The crash (assert) happens when parsing the "qRegisterInfo0" packet, because the function tries to drop_front more bytes than the packet contains. It's not clear to me whether we should consider this a bug in the caller or the callee, but it any case, it worked before, so I am reverting this until we can figure out what the proper interface should be. llvm-svn: 280207	2016-08-31 08:43:37 +00:00
George Rimar	f21aade0d8	[ELF] - Introduce StripPolicy instead of Config->StripAll/StripDebug flags. This approach is not only consistent with UnresolvedPolicy, but also should help to solve a problem of options with opposing meanings, mentioned in PR28843 Differential revision: https://reviews.llvm.org/D23869 llvm-svn: 280206	2016-08-31 08:38:11 +00:00
Eugene Leviant	20889c51b7	Allow adding start/end symbols to any section Allows adding start and/or end symbols to special output sections, like .eh_frame_hdr, which aren't lists of regular input sections. Differential revision: https://reviews.llvm.org/D23716 llvm-svn: 280205	2016-08-31 08:13:33 +00:00
Pavel Labath	1e3b086749	Revert r280200 and put it a proper fix PeekChar returns a character, we want the whole string there. llvm-svn: 280204	2016-08-31 07:49:37 +00:00
Eugene Leviant	e4f590faeb	Allow .eh_frame_hdr to be placed before .eh_frame Differential revision: https://reviews.llvm.org/D24041 llvm-svn: 280203	2016-08-31 07:43:50 +00:00
Pavel Labath	30ff4b4851	Fix lldb build on Mac. Summary: `e80f43fd78` greatly improved an API, but missed one more occurence of legacy usage. This leads to: if (extractor.GetHexBytes(&payload_bytes[0], payload_bytes.size(), '\xdd') != payload_bytes.size()) ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~ /lldb/include/lldb/Utility/StringExtractor.h:151:5: note: 'GetHexBytes' declared here Reviewers: zturner Subscribers: lldb-commits Differential Revision: https://reviews.llvm.org/D24064 Author: Taras Tsugrii <ttsugrii@fb.com> llvm-svn: 280202	2016-08-31 07:42:38 +00:00
Filipe Cabecinhas	b16672d91d	Reify ErrorDoubleFree Summary: Keep reifying other errors. Reviewers: kcc, samsonov Subscribers: llvm-commits, kubabrecka Differential Revision: https://reviews.llvm.org/D23717 llvm-svn: 280201	2016-08-31 07:38:09 +00:00
Sylvestre Ledru	2c07a069b8	Update the Linux code to reflect the changes done by zturner in r280139 llvm-svn: 280200	2016-08-31 07:16:56 +00:00
Igor Kudrin	fc05ee344c	[Coverage] Suppress creating a code region if the same area is covered by an expansion region. In most cases these code regions are just redundant, but sometimes they could be assigned to the counter of the parent code region instead of the counter of the nested block. Differential Revision: https://reviews.llvm.org/D23987 llvm-svn: 280199	2016-08-31 07:04:16 +00:00
Igor Kudrin	f3c8a9cfbb	[Coverage] Make sorting criteria for CounterMappingRegions local. Move the comparison function into the only place there it is used, i.e. the call to std::stable_sort in CoverageMappingWriter::write(). Add sorting by region kinds as it is required to ensure stable order in our tests and to simplify D23987. Differential Revision: https://reviews.llvm.org/D24034 llvm-svn: 280198	2016-08-31 07:01:17 +00:00
Craig Topper	a815f488d5	[AVX-512] Implement masked floating point logical operations with native IR and remove the builtins. llvm-svn: 280197	2016-08-31 05:38:58 +00:00
Craig Topper	d0681d528d	[X86] Use v2i64 vectors to implement _mm_and/andn/or/xor_pd. These will be reused when removing some builtins from avx512vldqintrin.h and this will make the tests for that change show a better number of vector elements. llvm-svn: 280196	2016-08-31 05:38:55 +00:00
Craig Topper	8f6827c945	[AVX-512] Add patterns to select masked logical operations if the select has a floating point type. This is needed in order to replace the masked floating point logical op intrinsics with native IR. llvm-svn: 280195	2016-08-31 05:37:52 +00:00
Craig Topper	0f8fb47637	[AVX-512] Add test cases for masked floating point logic operations with bitcasts between the logic ops and the select. We don't currently select masked operations for these cases. Test cases taken from optimized clang output after trying to convert the masked floating point logical op intrinsics to native IR. llvm-svn: 280194	2016-08-31 05:37:50 +00:00
Craig Topper	de8b1a0012	[X86] Regenerate a test using update_llc_test_checks.py. llvm-svn: 280193	2016-08-31 05:37:47 +00:00
Dean Michael Berris	047669f18c	[XRay] Support multiple return instructions in a single basic block Add a .mir test to catch this case, and fix the xray-instrumentation pass to handle it appropriately. llvm-svn: 280192	2016-08-31 05:20:08 +00:00
David Majnemer	a90e51e106	[Loads] Properly populate the visited set in isDereferenceableAndAlignedPointer There were paths where we wouldn't populate the visited set, causing us to recurse forever if an SSA variable was defined in terms of itself. This fixes PR30210. llvm-svn: 280191	2016-08-31 03:22:32 +00:00
Richard Smith	54f18e8a85	PR12298 et al: don't recursively instantiate a template specialization from within the instantiation of that same specialization. This could previously happen for eagerly-instantiated function templates, variable templates, exception specifications, default arguments, and a handful of other cases. We still have an issue here for default template arguments that recursively make use of themselves and likewise for substitution into the type of a non-type template parameter, but in those cases we're producing a different entity each time, so they should instead be caught by the instantiation depth limit. However, currently we will typically run out of stack before we reach it. :( llvm-svn: 280190	2016-08-31 02:15:21 +00:00
Richard Trieu	5ed6fe739f	Concatenate two FileCheck lines in a test. 'cc1' is a valid sequence of hexadecimal and sometimes can occur in the path when testing. This can lead to FileCheck matching the incorrect occurance of the 'cc1' string and causing a test failure. Join two adjacent flags together into one check to prevent this. llvm-svn: 280189	2016-08-31 01:57:12 +00:00
Hal Finkel	97a189c716	[PowerPC] Don't spill the frame pointer twice When a function contains something, such as inline asm, which explicitly clobbers the register used as the frame pointer, don't spill it twice. If we need a frame pointer, it will be saved/restored in the prologue/epilogue code. Explicitly spilling it again will reuse the same spill slot used by the prologue/epilogue code, thus clobbering the saved value. The same applies to the base-pointer or PIC-base register. Partially fixes PR26856. Thanks to Ulrich for his analysis and the small inline-asm reproducer. llvm-svn: 280188	2016-08-31 00:52:03 +00:00
NAKAMURA Takumi	3766d106c8	clangTooling: Update libdeps: LLVMOptions, since r280118. llvm-svn: 280187	2016-08-31 00:46:32 +00:00
NAKAMURA Takumi	6110c9aa02	clangTooling depends on ClangDriverOptions since r280118. llvm-svn: 280186	2016-08-31 00:46:25 +00:00
Kostya Serebryany	4fd30769c1	[sanitizer] remove kBatchClassID that is not used any more; NFC llvm-svn: 280185	2016-08-31 00:37:33 +00:00
Gor Nishanov	50d7fb974f	[Coroutines] Part 10: Add coroutine promise support. Summary: 1) CoroEarly now lowers llvm.coro.promise intrinsic that allows to obtain a coroutine promise pointer from a coroutine frame and vice versa. 2) CoroFrame now interprets Promise argument of llvm.coro.begin to place CoroutinPromise alloca at a deterministic offset from the coroutine frame. Now, the coroutine promise example from docs\Coroutines.rst compiles and produces expected result (see test/Transform/Coroutines/ex4.ll). Reviewers: majnemer Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23993 llvm-svn: 280184	2016-08-31 00:35:41 +00:00
Sanjay Patel	7d9ebaf337	[InstCombine] clean up InsertRangeTest; NFCI It's much less code and easier to read if we don't duplicate everything between the 'Inside' and not 'Inside' cases. As noted with the FIXME, the goal is to make this vector-friendly in a follow-up patch. llvm-svn: 280183	2016-08-31 00:19:35 +00:00
Jason Henline	ba65d4412e	[StreamExecutor] Add Stream::blockHostUntilDone Summary: Add the type-safe wrapper to the platform-specific implementation. Reviewers: jlebar Subscribers: jprice, parallel_libs-commits Differential Revision: https://reviews.llvm.org/D24063 llvm-svn: 280182	2016-08-31 00:11:14 +00:00
Vedant Kumar	8938f92a5e	[llvm-cov] Drop redundant "No." suffix in a column title llvm-svn: 280181	2016-08-31 00:09:44 +00:00
Piotr Padlewski	d57be707b8	[clang-tidy] modernize-make-{smart_ptr} private ctor bugfix Summary: Bugfix for 27321. When the constructor of stored pointer type is private then it is invalid to change it to make_shared or make_unique. Reviewers: alexfh, aaron.ballman, hokein Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23343 llvm-svn: 280180	2016-08-31 00:06:55 +00:00
Alina Sbirlea	3f8f7840bf	[LoadStoreVectorizer] Change VectorSet to Vector to match head and tail positions. Resolves PR29148. Summary: LSV was using two vector sets (heads and tails) to track pairs of adjiacent position to vectorize. A recent optimization is trying to obtain the longest chain to vectorize and assumes the positions in heads(H) and tails(T) match, which is not the case is there are multiple tails for the same head. e.g.: i1: store a[0] i2: store a[1] i3: store a[1] Leads to: H: i1 T: i2 i3 Instead of: H: i1 i1 T: i2 i3 So the positions for instructions that follow i3 will have different indexes in H/T. This patch resolves PR29148. This issue also surfaced the fact that if the chain is too long, and TLI returns a "not-fast" answer, the whole chain will be abandoned for vectorization, even though a smaller one would be beneficial. Added a testcase and FIXME for this. Reviewers: tstellarAMD, arsenm, jlebar Subscribers: mzolotukhin, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D24057 llvm-svn: 280179	2016-08-30 23:53:59 +00:00

1 2 3 4 5 ...

240840 Commits All Branches Search

240840 Commits

All Branches