llvm-project

Commit Graph

Author	SHA1	Message	Date
Kostya Serebryany	6cdb5a61b5	[libFuzzer] implement more correct way of computing feature index for Inline8bitCounters llvm-svn: 309647	2017-08-01 01:16:26 +00:00
Kostya Serebryany	4f2970037a	[libFuzzer] enable -fsanitize-coverage=pc-table for all tests llvm-svn: 309646	2017-08-01 00:48:44 +00:00
Alina Sbirlea	30d8a881e8	Default MemoryLocation passed to getModRefInfo should be None (D35441) llvm-svn: 309645	2017-08-01 00:47:17 +00:00
Kostya Serebryany	a1f12ba17e	[sanitizer-coverage] relax an assertion llvm-svn: 309644	2017-08-01 00:44:05 +00:00
Eli Friedman	37f41d1e7c	[ScheduleDAG] Don't schedule node with physical register interference https://reviews.llvm.org/D31536 didn't really solve the problem it was trying to solve; it got rid of the assertion failure, but we were still scheduling the DAG incorrectly (mixing together instructions from different calls), leading to a MachineVerifier failure. In order to schedule the DAG correctly, we have to make sure we don't schedule a node which should be blocked by an interference. Fix ScheduleDAGRRList::PickNodeToScheduleBottomUp so it doesn't pick a node like that. The added call to FindAvailableNode() is the key change here; this makes sure we don't try to schedule a call while we're in the middle of scheduling a different call. I'm not sure this is the right approach; in particular, I'm not sure how to prove we don't end up with an infinite loop of repeatedly backtracking. This also reverts the code change from D31536. It doesn't do anything useful: we should never schedule an ADJCALLSTACKDOWN unless we've already scheduled the corresponding ADJCALLSTACKUP. Differential Revision: https://reviews.llvm.org/D33818 llvm-svn: 309642	2017-08-01 00:28:40 +00:00
Alina Sbirlea	967e7966fc	Allow None as a MemoryLocation to getModRefInfo Summary: Adding part of the changes in D30369 (needed to make progress): Current patch updates AliasAnalysis and MemoryLocation, but does _not_ clean up MemorySSA. Original summary from D30369, by dberlin: Currently, we have instructions which affect memory but have no memory location. If you call, for example, MemoryLocation::get on a fence, it asserts. This means things specifically have to avoid that. It also means we end up with a copy of each API, one taking a memory location, one not. This starts to fix that. We add MemoryLocation::getOrNone as a new call, and reimplement the old asserting version in terms of it. We make MemoryLocation optional in the (Instruction, MemoryLocation) version of getModRefInfo, and kill the old one argument version in favor of passing None (it had one caller). Now both can handle fences because you can just use MemoryLocation::getOrNone on an instruction and it will return a correct answer. We use all this to clean up part of MemorySSA that had to handle this difference. Note that literally every actual getModRefInfo interface we have could be made private and replaced with: getModRefInfo(Instruction, Optional<MemoryLocation>) and getModRefInfo(Instruction, Optional<MemoryLocation>, Instruction, Optional<MemoryLocation>) and delegating to the right ones, if we wanted to. I have not attempted to do this yet. Reviewers: dberlin, davide, dblaikie Subscribers: sanjoy, hfinkel, chandlerc, llvm-commits Differential Revision: https://reviews.llvm.org/D35441 llvm-svn: 309641	2017-08-01 00:28:29 +00:00
Craig Topper	410d252f5b	[AVX-512] Add unmasked subvector inserts and extract to the execution domain tables. llvm-svn: 309632	2017-07-31 22:07:29 +00:00
David Blaikie	038e28a5a7	DebugInfo: Put range base specifier entry functionality behind a flag Chromium's gold build seems to have trouble with this (gold produces errors) - not sure if it's gold that's not coping with the valid representation, or a bug in the implementation in LLVM, etc. llvm-svn: 309630	2017-07-31 21:48:42 +00:00
Reid Kleckner	2de471dca9	[codeview] Ignore DBG_VALUEs when choosing a BB start source loc When the first instruction of a basic block has no location (consider a LEA materializing the address of an alloca for a call), we want to start the line table for the block with the first valid source location in the block. We need to ignore DBG_VALUE instructions during this scan to get decent line tables. llvm-svn: 309628	2017-07-31 21:03:08 +00:00
Sanjay Patel	dac0ab272c	[InstCombine] allow mask hoisting transform for vector types llvm-svn: 309627	2017-07-31 21:01:53 +00:00
Peter Collingbourne	bcd204b478	Update phi nodes in LowerTypeTests control flow simplification D33925 added a control flow simplification for -O2 --lto-O0 builds that manually splits blocks and reassigns conditional branches but does not correctly update phi nodes. If the else case being branched to had incoming phi nodes the control-flow simplification would leave phi nodes in that BB with an unhandled predecessor. Patch by Vlad Tsyrklevich! Differential Revision: https://reviews.llvm.org/D36012 llvm-svn: 309621	2017-07-31 20:43:07 +00:00
Kostya Serebryany	b2a1eba2f5	[libFuzzer] implement __sanitizer_cov_pcs_init and add pc-table to build flags for one test (for now) llvm-svn: 309615	2017-07-31 20:20:59 +00:00
Konstantin Belochapka	b77d0a5cf1	[X86][MMX] Added custom lowering action for MMX SELECT (PR30418) Fix for pr30418 - error in backend: Cannot select: t17: x86mmx = select_cc t2, Constant:i64<0>, t7, t8, seteq:ch Differential Revision: https://reviews.llvm.org/D34661 llvm-svn: 309614	2017-07-31 20:11:49 +00:00
Kostya Serebryany	bfc83fa8d7	[sanitizer-coverage] don't instrument available_externally functions llvm-svn: 309611	2017-07-31 20:00:22 +00:00
Kostya Serebryany	bb6f079a45	[sanitizer-coverage] ensure minimal alignment for coverage counters and guards llvm-svn: 309610	2017-07-31 19:49:45 +00:00
Zachary Turner	8d927b6bf9	[lld/pdb] Add an empty globals stream. We don't write any actual symbols to this stream yet, but for now we just create the stream and hook it up to the appropriate places and give it a valid header. Differential Revision: https://reviews.llvm.org/D35290 llvm-svn: 309608	2017-07-31 19:36:08 +00:00
Davide Italiano	e4c2782cba	[SLPVectorizer] Unbreak the build with -Werror. GCC was complaining about `&&` within `\|\|` without explicit parentheses. NFCI. llvm-svn: 309606	2017-07-31 19:14:19 +00:00
Craig Topper	317a51e886	[X86][InstCombine] Add some simplifications for BZHI intrinsics This intrinsic clears the upper bits starting at a specified index. If the index is a constant we can do some simplifications. This could be in InstSimplify, but we don't handle any target specific intrinsics there today. Differential Revision: https://reviews.llvm.org/D36069 llvm-svn: 309604	2017-07-31 18:52:15 +00:00
Craig Topper	8324003818	[X86][InstCombine] Add basic simplification support for BEXTR/BEXTRI intrinsics. This patch adds simplification support for the BEXTR/BEXTRI intrinsics to match gcc. This only supports cases that fold to 0 or can be fully constant folded. Theoretically we could support converting to AND if the shift part is unused or to only a shift if the mask doesn't modify any bits after an equivalent shl. gcc doesn't do these transformations either. I put this in InstCombine, but it could be done in InstSimplify. It would be the first target specific intrinsic in InstSimplify. Differential Revision: https://reviews.llvm.org/D36063 llvm-svn: 309603	2017-07-31 18:52:13 +00:00
Quentin Colombet	15f6ffbf7c	[TargetPassConfig] Feature generic options to setup start/stop-after/before This patch refactors the code used in llc such that all the users of the addPassesToEmitFile API have access to a homogeneous way of handling start/stop-after/before options right out of the box. In particular, just invoking addPassesToEmitFile will set the proper pipeline without additional effort (modulo parsing a .mir file if the start-before/after options are used. NFC. Differential Revision: https://reviews.llvm.org/D30913 llvm-svn: 309599	2017-07-31 18:24:07 +00:00
Sanjay Patel	fea731a4aa	[CGP] use subtract or subtract-of-cmps for result of memcmp expansion As noted in the code comment, transforming this in the other direction might require a separate transform here in CGP given the block-at-a-time DAG constraint. Besides that theoretical motivation, there are 2 practical motivations for the subtract-of-cmps form: 1. The codegen for both x86 and PPC is better for this IR (though PPC could be better still). There is discussion about canonicalizing IR to the select form ( http://lists.llvm.org/pipermail/llvm-dev/2017-July/114885.html ), so we probably need to add DAG transforms for those patterns anyway, but this improves the memcmp output without waiting for that step. 2. If we allow vector-sized chunks for the load and compare, x86 is better prepared to convert that to optimal code when using subtract-of-cmps, so another prerequisite patch is avoided if we choose to enable that. Differential Revision: https://reviews.llvm.org/D34904 llvm-svn: 309597	2017-07-31 18:08:24 +00:00
Spyridoula Gravani	70d35e102e	[DWARF] Added verification check for tags in accelerator tables. This patch verifies that the atom tag is actually the same with the tag of the DIE that we retrieve from the table. Differential Revision: https://reviews.llvm.org/D35963 llvm-svn: 309596	2017-07-31 18:01:16 +00:00
David Majnemer	91c6330c96	[IPSCCP] Guard a user of getInitializer with hasDefinitiveInitializer We are not allowed to reason about an initializer value without first consulting hasDefinitiveInitializer. llvm-svn: 309594	2017-07-31 17:47:07 +00:00
Craig Topper	cb0e74975a	[AVX-512] Remove patterns that select vmovdqu8/16 for unmasked loads. Prefer vmovdqa64/vmovdqu64 instead. These were taking priority over the aligned load instructions since there is no vmovda8/16. I don't think there is really a difference between aligned and unaligned on newer cpus so I don't think it matters which instructions we use. But with this change we reduce the size of the isel table a little and we allow the aligned information to pass through to the evex->vec pass and produce the same output has avx/avx2 in some cases. I also generally dislike patterns rooted in a bitcast which these were. Differential Revision: https://reviews.llvm.org/D35977 llvm-svn: 309589	2017-07-31 17:35:44 +00:00
Simon Pilgrim	7b89ab5887	Strip trailing whitespace. NFCI. llvm-svn: 309584	2017-07-31 17:09:27 +00:00
Simon Pilgrim	77bdbc197e	Fix typo in comment. llvm-svn: 309583	2017-07-31 17:06:55 +00:00
Aditya Nandakumar	02c602e18c	[GISel]: Support Widening G_ICMP's destination operand. Updated AArch64 to widen destination to s32. https://reviews.llvm.org/D35737 Reviewed by Tim llvm-svn: 309579	2017-07-31 17:00:16 +00:00
Amaury Sechet	4358c5217d	Do not recombine FMA when that is not needed. Summary: As per title. This creates useless recombines. Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33848 llvm-svn: 309578	2017-07-31 16:56:25 +00:00
Florian Hahn	a3ad61d874	Exclude more unused functions from release build. llvm-svn: 309576	2017-07-31 16:44:28 +00:00
Florian Hahn	1b09a37c07	Extend ifndef to printDebugLoc. GCC7 did not warn about that, but Clang does. llvm-svn: 309573	2017-07-31 16:29:00 +00:00
Florian Hahn	94b8a87c8e	Extend ifdefs to more unused helper functions. This fixes a buildbot failure with -Werror introduced by r309553 llvm-svn: 309572	2017-07-31 16:11:43 +00:00
Benjamin Kramer	d7b1e5af0a	[DebugInfo] Don't overwrite DWARFUnit fields if the CU DIE doesn't have them. DIEs are lazily deserialized so it's possible that the DWO CU is created before the DIE is parsed. DWO shares .debug_addr and .debug_ranges with the object file so overwriting the offset with 0 will make the CU unusable. No test case because I couldn't get clang to emit a non-zero range base. llvm-svn: 309570	2017-07-31 15:32:39 +00:00
Alexey Bataev	0ab22bb991	[SLP] Initial rework for min/max horizontal reduction vectorization, NFC. Summary: All getReductionCost() functions are renamed to getArithmeticReductionCost() + added basic infrastructure to handle non-binary reduction operations. Reviewers: spatel, mzolotukhin, Ayal, mkuper, gilr, hfinkel Subscribers: RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D29402 llvm-svn: 309566	2017-07-31 14:36:05 +00:00
Alexey Bataev	3e9b3eb91d	[Cost] Rename getReductionCost() to getArithmeticReductionCost(), NFC. llvm-svn: 309563	2017-07-31 14:19:32 +00:00
Simon Dardis	8930b4a049	[SelectionDAG][mips] Fix PR33883 PR33883 shows that calls to intrinsic functions should not have their vector arguments or returns subject to ABI changes required by the target. This resolves PR33883. Thanks to Alex Crichton for reporting the issue! Reviewers: zoran.jovanovic, atanasyan Differential Revision: https://reviews.llvm.org/D35765 llvm-svn: 309561	2017-07-31 14:06:58 +00:00
Ayal Zaks	e841b214b1	[LV] Avoid redundant operations manipulating masks The Loop Vectorizer generates redundant operations when manipulating masks: AND with true, OR with false, compare equal to true. Instead of relying on a subsequent pass to clean them up, this patch avoids generating them. Use null (no-mask) to represent all-one full masks, instead of a constant all-one vector, following the convention of masked gathers and scatters. Preparing for a follow-up VPlan patch in which these mask manipulating operations are modeled using recipes. Differential Revision: https://reviews.llvm.org/D35725 llvm-svn: 309558	2017-07-31 13:21:42 +00:00
Martin Storsjo	4a5764e3ca	[llvm-dlltool] Write correct weak externals Previously, the created object files for the import library were broken. Write the symbol table before the string table. Simplify the code by using a separate variable Prefix instead of duplicating a few lines. Also update the coff-weak-exports to actually check that the generated weak symbols can be found as intended. Differential Revision: https://reviews.llvm.org/D36065 llvm-svn: 309555	2017-07-31 11:18:41 +00:00
Florian Hahn	6b3216aad8	Guard print() functions only used by dump() functions. Summary: Since r293359, most dump() function are only defined when `!defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)` holds. print() functions only used by dump() functions are now unused in release builds, generating lots of warnings. This patch only defines some print() functions if they are used. Reviewers: MatzeB Reviewed By: MatzeB Subscribers: arsenm, mzolotukhin, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D35949 llvm-svn: 309553	2017-07-31 10:07:49 +00:00
George Rimar	e36d7a6d68	[Support/GlobPattern] - Do not crash when pattern has characters with int value < 0. Found it during work on LLD, it would crash on following linker script: SECTIONS { .foo : { ("®") } } That happens because ® has int value -82. And chars are used as array index in code, and are signed by default. Differential revision: https://reviews.llvm.org/D35891 llvm-svn: 309549	2017-07-31 09:26:50 +00:00
Florian Hahn	4284049dcc	[LoopInterchange] Do not interchange loops with function calls. Summary: Without any information about the called function, we cannot be sure that it is safe to interchange loops which contain function calls. For example there could be dependences that prevent interchanging between accesses in the called function and the loops. Even functions without any parameters could cause problems, as they could access memory using global pointers. For now, I think it is only safe to interchange loops with calls marked as readnone. With this patch, the LLVM test suite passes with `-O3 -mllvm -enable-loopinterchange` and LoopInterchangeProfitability::isProfitable returning true for all loops. check-llvm and check-clang also pass when bootstrapped in a similar fashion, although only 3 loops got interchanged. Reviewers: karthikthecool, blitz.opensource, hfinkel, mcrosier, mkuper Reviewed By: mcrosier Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D35489 llvm-svn: 309547	2017-07-31 09:00:52 +00:00
Guy Blank	b169d56dc3	[X86][AVX512] Add masked MOVS[S\|D] patterns Added patterns to recognize AND 1 on the mask of a scalar masked move is not needed since only the lower bit is relevant for the instruction. Differential Revision: https://reviews.llvm.org/D35897 llvm-svn: 309546	2017-07-31 08:26:14 +00:00
Hiroshi Inoue	5703fe37ab	[PowerPC] Change method names; NFC Changed method names based on the discussion in https://reviews.llvm.org/D34986: getInt64 -> selectI64Imm, getInt64Count -> selectI64ImmInstrCount. llvm-svn: 309541	2017-07-31 06:27:09 +00:00
Craig Topper	97e9fa7954	[X86] Add pattern to use bzhi for 64-bit 'and' with a mask when there is a load involved. We already had a pattern without load, but with a load we were falling back to a regular 'and' due to pattern complexity priority. llvm-svn: 309535	2017-07-31 05:55:54 +00:00
David Blaikie	4dd663752d	DebugInfo: Fix r309526, ensure resetting base address selection entries are used Missed the resetting base address selections when going from a base address version to zero base address for non-base-addressed entries. llvm-svn: 309529	2017-07-31 00:18:24 +00:00
David Blaikie	89c81a0b91	DebugInfo: Use base address selection entries in debug_ranges to reduce relocations (from comments in the test) Group ranges in a range list that apply to the same section and use a base address selection entry to reduce the number of relocations to one reloc per section per range list. DWARF5 debug_rnglist will be more efficient than this in terms of relocations, but it's still better than one reloc per entry in a range list. This is an object/executable size tradeoff - shrinking objects, but growing the linked executable. In one large binary tested, total object size (not just debug info) shrank by 16%, entirely relocation entries. Linked executable grew by 4%. This was with compressed debug info in the objects, uncompressed in the linked executable. Without compression in the objects, the win would be smaller (the growth of debug_ranges itself would be more significant). llvm-svn: 309526	2017-07-30 22:10:00 +00:00
David Blaikie	a62f1cb1fa	DebugInfo: Fix for CU index usage in 309507 Not sure quite how I failed so clearly to test this, but anyway. llvm-svn: 309514	2017-07-30 15:15:58 +00:00
Coby Tayree	48d67cdbb4	[x86][inline-asm][ms-compat] legalize the use of "jc/jz short <op>" MS ignores the keyword "short" when used after a jc/jz instruction, LLVM ought to do the same. Test: D35893 Differential Revision: https://reviews.llvm.org/D35892 llvm-svn: 309509	2017-07-30 11:12:47 +00:00
David Blaikie	ebac0b9c62	DebugInfo: Use DWP cu_index to speed up symbolizing (as intended) I was a bit lazy when I first implemented this & skipped the index lookup - obviously for large files this becomes pretty crucial, so here we go, do the index lookup. Speeds up large DWP symbolizing by... lots. (20m -> 20s, actually, maybe more in a release build (that was a release build without index lookup, compared to a debug/non-release build with the index usage)) llvm-svn: 309507	2017-07-30 08:12:07 +00:00
Craig Topper	951f0ca104	[X86] Add addsub intrinsics to the intrinsic lowering table so we have a single set of isel patterns. llvm-svn: 309502	2017-07-30 06:02:59 +00:00
Dehao Chen	95f003003d	Refactor the build{Module\|Function}SimplificationPipeline to expose optimization phase. Summary: This is in preparation of https://reviews.llvm.org/D36052 Reviewers: chandlerc, davidxl, tejohnson Reviewed By: chandlerc Subscribers: sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D36053 llvm-svn: 309500	2017-07-30 04:55:39 +00:00

1 2 3 4 5 ...

105133 Commits