llvm-project

Commit Graph

Author	SHA1	Message	Date
Justin Lebar	25eeb38acc	[ADT] Use memcpy for type punning in MathExtras. Summary: Previously we type-punned through a union, which is not safe. Reviewers: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28161 llvm-svn: 290715	2016-12-29 18:15:34 +00:00
Reid Kleckner	cd46c1df80	Revert "[COFF] Use 32-bit jump table entries in .rdata for Win64" This reverts commit r290694. It broke sanitizer tests on Win64. I'll probably bring this back, but the jump tables will just live in .text like they do for MSVC. llvm-svn: 290714	2016-12-29 17:07:10 +00:00
Sanjoy Das	00d76a5754	[TBAAVerifier] Be stricter around verifying scalar nodes This fixes the issue exposed in PR31393, where we weren't trying sufficiently hard to diagnose bad TBAA metadata. This does reduce the variety in the error messages we print out, but I think the tradeoff of verifying more, simply and quickly overrules the need for more helpful error messags here. llvm-svn: 290713	2016-12-29 15:47:05 +00:00
Sanjoy Das	600d2a5a6b	[TBAAVerifier] Make things const-consistent; NFC llvm-svn: 290712	2016-12-29 15:47:01 +00:00
Sanjoy Das	55f12d9de9	[TBAAVerifier] Memoize validity of scalar tbaa nodes; NFCI llvm-svn: 290711	2016-12-29 15:46:57 +00:00
Artem Tamazov	25478d821b	[AMDGPU][mc] Enable absolute expressions in .hsa_code_object_isa directive Among other stuff, this allows to use predefined .option.machine_version_major /minor/stepping symbols in the directive. Relevant test expanded at once (also file renamed for clarity). Differential Revision: https://reviews.llvm.org/D28140 llvm-svn: 290710	2016-12-29 15:41:52 +00:00
Igor Laevsky	fedab1572d	Fix documentation generator warnings after rL290708. llvm-svn: 290709	2016-12-29 15:08:57 +00:00
Igor Laevsky	4f31e52f94	Introduce element-wise atomic memcpy intrinsic This change adds a new intrinsic which is intended to provide memcpy functionality with additional atomicity guarantees. Please refer to the review thread or language reference for further details. Differential Revision: https://reviews.llvm.org/D27133 llvm-svn: 290708	2016-12-29 14:31:07 +00:00
Craig Topper	17b5568bc7	[InstCombine] Use getVectorNumElements instead of explicitly casting to VectorType and calling getNumElements. NFC llvm-svn: 290707	2016-12-29 07:03:18 +00:00
Craig Topper	62f06e241b	[InstCombine] Fix typo in comment. NFC llvm-svn: 290706	2016-12-29 05:38:31 +00:00
Craig Topper	2e18bcfc60	[InstCombine] Use a 32-bits instead of 64-bits for storing the number of elements in VectorType for a ShuffleVector. While there getVectorNumElements to avoid an explicit cast. NFC llvm-svn: 290705	2016-12-29 04:24:32 +00:00
Craig Topper	1a8a3377cc	[InstCombine][X86] If the lowest element of a scalar intrinsic isn't used make sure we add it to the worklist so we can DCE it sooner. We bypassed the intrinsic and returned the passthru operand, but we should also add the intrinsic to the worklist since its now dead. This can allow DCE to find it sooner and remove it. Similar was done for InsertElement when the inserted element isn't demanded. llvm-svn: 290704	2016-12-29 03:30:17 +00:00
Kostya Serebryany	d723804fa2	[libFuzzer] make __sanitizer_cov_trace_switch more predictable llvm-svn: 290703	2016-12-29 02:50:35 +00:00
Craig Topper	b57a84dace	[InstCombine] Fix some of the AVX-512 scalar arithmetic test cases to do a better job of testing what they intended to test. The accidentally had trivially dead code. Also needed to adjust the rounding mode to not CUR_DIRECTION so the intrinsics don't get converted to native operations before going through SimplifyDemandedVectorElts. llvm-svn: 290702	2016-12-29 02:29:04 +00:00
Mehdi Amini	fce3af0192	Remove BitstreamWriter::Emit64(), it was never called (NFC) llvm-svn: 290701	2016-12-29 01:40:53 +00:00
Reid Kleckner	32f171fec4	Fix mingw build by moving the static const data member before the bitfields Apparently GCC targeting Windows breaks bitfields on static data members: struct Foo { unsigned X : 16; static const int M = 42; unsigned Y : 16; }; static_assert(sizeof(Foo) == 4, "asdf"); // fails Who knew. llvm-svn: 290700	2016-12-29 01:14:41 +00:00
Daniel Berlin	6658cc9ead	NewGVN: Sort Dominator Tree in RPO order, and use that for generating order. Summary: The optimal iteration order for this problem is RPO order. We want to process as many preds of a backedge as we can before we process the backedge. At the same time, as we add predicate handling, we want to be able to touch instructions that are dominated by a given block by ranges (because a change in value numbering a predicate possibly affects all users we dominate that are using that predicate). If we don't do it this way, we can't do value inference over backedges (the paper covers this in depth). The newgvn branch currently overshoots the last part, and guarantees that it will touch at least the right set of instructions, but it does touch more. This is because the bitvector instruction ranges are currently generated in RPO order (so we take the max and the min of the ranges of dominated blocks, which means there are some in the middle we didn't have to touch that we did). We can do better by sorting the dominator tree, and then just using dominator tree order. As a preliminary, the dominator tree has some RPO guarantees, but not enough. It guarantees that for a given node, your idom must come before you in the RPO ordering. It guarantees no relative RPO ordering for siblings. We add siblings in whatever order they appear in the module. So that is what we fix. We sort the children array of the domtree into RPO order, and then use the dominator tree for ordering, instead of RPO, since the dominator tree is now a valid RPO ordering. Note: This would help any other pass that iterates a forward problem in dominator tree order. Most of them are single pass. It will still maximize whatever result they compute. We could also build the dominator tree in this order, but our incremental updates would still put it out of sort order, and recomputing the sort order is almost as hard as general incremental updates of the domtree. Also note that the sorting does not affect any tests, etc. Nothing depends on domtree order, including the verifier, the equals functions for domtree nodes, etc. How much could this matter, you ask? Here are the current numbers. This is generated by running NewGVN over all files in LLVM. Note that once we propagate equalities, the differences go up by an order of magnitude or two (IE instead of 29, the max ends up in the thousands, since the worst case we add a factor of N, where N is the number of branch predicates). So while it doesn't look that stark for the default ordering, it gets much much worse. There are also programs in the wild where the difference is already pretty stark (2 iterations vs hundreds). RPO ordering: 759040 Number of iterations is 1 112908 Number of iterations is 2 Default dominator tree ordering: 755081 Number of iterations is 1 116234 Number of iterations is 2 603 Number of iterations is 3 27 Number of iterations is 4 2 Number of iterations is 5 1 Number of iterations is 7 Dominator tree sorted: 759040 Number of iterations is 1 112908 Number of iterations is 2 <yay!> Really bad ordering (sort domtree siblings in postorder. not quite the worst possible, but yeah): 754008 Number of iterations is 1 21 Number of iterations is 10 8 Number of iterations is 11 6 Number of iterations is 12 5 Number of iterations is 13 2 Number of iterations is 14 2 Number of iterations is 15 3 Number of iterations is 16 1 Number of iterations is 17 2 Number of iterations is 18 96642 Number of iterations is 2 1 Number of iterations is 20 2 Number of iterations is 21 1 Number of iterations is 22 1 Number of iterations is 29 17266 Number of iterations is 3 2598 Number of iterations is 4 798 Number of iterations is 5 273 Number of iterations is 6 186 Number of iterations is 7 80 Number of iterations is 8 42 Number of iterations is 9 Reviewers: chandlerc, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28129 llvm-svn: 290699	2016-12-29 01:12:36 +00:00
Reid Kleckner	e9c8d7f87b	Add a static_assert about the sizeof(GlobalValue) I added one for Value back in r262045, and I'm starting to think we should have these for any class with bitfields whose memory efficiency really matters. llvm-svn: 290698	2016-12-29 00:55:51 +00:00
Daniel Berlin	7ad1ea0984	Update equalsStoreHelper for the fact that only one branch can be true llvm-svn: 290697	2016-12-29 00:49:32 +00:00
Justin Lebar	ddece375a1	[GlobalValue] Move HasLLVMReservedName into existing bitfield. NFC Summary: Follow-up to r290691, where I introduced HasLLVMReservedName. rnk pointed out that that patch added an extra word to GlobalValue on MSVC, because it doesn't pack bitfields with different types. This patch moves HasLLVMReservedName into the existing bitfield, where we appear to have plenty of bits to spare. Reviewers: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28149 llvm-svn: 290696	2016-12-29 00:30:46 +00:00
Justin Lebar	23a53501a4	[IR] Clarify that Value::getName() is not actually cheap. It involves a hashtable lookup when the Value has a name. llvm-svn: 290695	2016-12-29 00:30:42 +00:00
Reid Kleckner	c9e0a153cf	[COFF] Use 32-bit jump table entries in .rdata for Win64 Summary: We were already using 32-bit jump table entries, but this was a consequence of the default PIC model on Win64, and not an intentional design decision. This patch ensures that we always use 32-bit label difference jump table entries on Win64 regardless of the PIC model. This is a good idea because it saves executable size and object file size. Moving the jump tables to .rdata cleans up the disassembled object code and reduces the available ROP targets, but it requires adding one more RIP-relative lea to the code. COFF doesn't have relocations to express the difference between two arbitrary symbols, so we can't use the jump table label in the label difference like we do elsewhere. Fixes PR31488 Reviewers: majnemer, compnerd Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28141 llvm-svn: 290694	2016-12-29 00:12:39 +00:00
Mehdi Amini	5022bb7238	Change Metadata Index emission in the bitcode to use 2x32 bits for the placeholder The Bitstream reader and writer are limited to handle a "size_t" at most, which means that we can't backpatch and read back a 64bits value on 32 bits platform. llvm-svn: 290693	2016-12-28 23:45:54 +00:00
Piotr Padlewski	6c37d298d9	Revert "[NewGVN] replace emplace_back with push_back" llvm-svn: 290692	2016-12-28 23:24:02 +00:00
Justin Lebar	291abd3ebb	Speed up Function::isIntrinsic() by adding a bit to GlobalValue. NFC Summary: Previously isIntrinsic() called getName(). This involves a hashtable lookup, so is nontrivially expensive. And isIntrinsic() is called frequently, particularly by dyn_cast<IntrinsicInstr>. This patch steals a bit of IntID and uses that to store whether or not getName() starts with "llvm." Reviewers: bogner, arsenm, joker-eph Subscribers: sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D22949 llvm-svn: 290691	2016-12-28 22:59:45 +00:00
Mehdi Amini	e98f925834	Add an index for Module Metadata record in the bitcode This index record the position for each metadata record in the bitcode, so that the reader will be able to lazy-load on demand each individual record. We also make sure that every abbrev is emitted upfront so that the block can be skipped while reading. I don't plan to commit this before having the reader counterpart, but I figured this can be reviewed mostly independently. Recommit r290684 (was reverted in r290686 because a test was broken) after adding a threshold to avoid emitting the index when unnecessary (little amount of metadata). This optimization "hides" a limitation of the ability to backpatch in the bitstream: we can only backpatch safely when the position has been flushed. So if we emit an index for one metadata, it is possible that (part of) the offset placeholder hasn't been flushed and the backpatch will fail. Differential Revision: https://reviews.llvm.org/D28083 llvm-svn: 290690	2016-12-28 22:30:28 +00:00
Evgeniy Stepanov	c0e82d6bfc	Decrease kLargeMalloc block size in ASAN unit tests. Summary: Make kLargeMalloc big enough to be handled by secondary allocator and small enough to fit into quarantine for all configurations. It become too big to fit into quarantine on Android after D27873. Reviewers: eugenis Patch by Alex Shlyapnikov. Subscribers: danalbert, llvm-commits, kubabrecka Differential Revision: https://reviews.llvm.org/D28142 llvm-svn: 290689	2016-12-28 21:57:51 +00:00
Greg Clayton	8369b28da0	Fix the variable view in the "gui" curses mode so that variables whose children change will update correctly. Previously the variable view would update the children once and not change. If you were stepping through code where the dynamic type of a variable would change the value and its children, or a synthetic type (like say for a std::vector<int>), the variable view wouldn't update. Now it caches the children and uses the process stop ID to tell when the children need to be updated. llvm-svn: 290688	2016-12-28 21:22:37 +00:00
Greg Clayton	02f74b0861	Quiet a warning where we weren't checking if this was the same and rhs. llvm-svn: 290687	2016-12-28 21:19:42 +00:00
Saleem Abdulrasool	2b59eca1f7	Revert "Add an index for Module Metadata record in the bitcode" This reverts commit a0ca6ae2d38339e4ede0dfa588086fc23d87e836. Revert at Mehdi's request as it is breaking bots. llvm-svn: 290686	2016-12-28 20:37:22 +00:00
Piotr Padlewski	629a7f2cc0	[NewGVN] replace emplace_back with push_back emplace_back is not faster if it is equivalent to push_back. In this cases emplaced value had the same type that the one stored in container. It is ugly and it might be even slower (see Scott Meyers presentation about emplacement). llvm-svn: 290685	2016-12-28 20:36:08 +00:00
Mehdi Amini	32ca148198	Add an index for Module Metadata record in the bitcode Summary: This index record the position for each metadata record in the bitcode, so that the reader will be able to lazy-load on demand each individual record. We also make sure that every abbrev is emitted upfront so that the block can be skipped while reading. I don't plan to commit this before having the reader counterpart, but I figured this can be reviewed mostly independently. Reviewers: pcc, tejohnson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28083 llvm-svn: 290684	2016-12-28 19:44:19 +00:00
Piotr Padlewski	26dada79ff	[NewGVN] Simplyfy loop NFC llvm-svn: 290683	2016-12-28 19:42:49 +00:00
Mehdi Amini	cc7fbf718d	[ThinLTO] Honor -O{0,1,2,4} passed through the libLTO interface for ThinLTO This was hardcoded to be O3 till now, without any way to change it without changing the code. llvm-svn: 290682	2016-12-28 19:37:16 +00:00
Tobias Grosser	ba3ea97689	Update to isl-0.18-28-gccb9f33 Another set of isl coalesce changes. llvm-svn: 290681	2016-12-28 19:35:49 +00:00
Piotr Padlewski	e4047b89ad	[NewGVN] replace typedefs with usings llvm-svn: 290680	2016-12-28 19:29:26 +00:00
Piotr Padlewski	fc5727b2a2	[NewGVN] NFC fixes llvm-svn: 290679	2016-12-28 19:17:17 +00:00
Reid Kleckner	92647369fc	[WinEH] Don't assume endFunction is called while in .text Jump table emission can switch to .rdata before WinException::endFunction gets called. Just remember the appropriate text section we started in and reset back to it when we end the function. We were already switching sections back from .xdata anyway. Fixes the first problem in PR31488, so that now COFF switch tables can live in .rdata if we want them to. llvm-svn: 290678	2016-12-28 19:05:12 +00:00
Piotr Padlewski	a587ca51d5	[ItaniumABI] NFC changes llvm-svn: 290677	2016-12-28 18:26:08 +00:00
Piotr Padlewski	25bb130628	Revert "Mention devirtualization in release notes" Accidental commit. LLVM changes have not been pushed yet This reverts commit 592453413690a2d16784667d1644758b9af700c1. llvm-svn: 290676	2016-12-28 18:25:30 +00:00
Piotr Padlewski	a280f674b5	Mention devirtualization in release notes llvm-svn: 290675	2016-12-28 18:23:23 +00:00
Teresa Johnson	f9b17d4878	[ThinLTO] No need to rediscover imports in distributed backend Summary: We can simply import all external values with summaries included in the individual index file created for the distributed backend job, as only those are added to the individual index file created by the WriteIndexesThinBackend (in addition to summaries for the original module, which are skipped here). While computing the cross module imports on this index would come to the same conclusion as the original thin link import logic, it is unnecessary work. And when tuning, it avoids the need to pass the same function importing parameters (e.g. -import-instr-limit) to both the thin link and the backends (otherwise they won't make the same decisions). Reviewers: mehdi_amini, pcc Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28139 llvm-svn: 290674	2016-12-28 18:00:08 +00:00
Kelvin Li	26fd21ab80	Fix format. NFC llvm-svn: 290673	2016-12-28 17:57:07 +00:00
Reid Kleckner	8c19083f76	Revert "Driver: switch Windows to static RelocModel" This reverts commit r290569. It caused unforeseen codegen changes for switch jump tables described in PR31488. llvm-svn: 290672	2016-12-28 17:41:36 +00:00
Teresa Johnson	3f4c87d0b5	[ThinLTO] Add missing FileCheck invocation One of the intended checks was not being performed. llvm-svn: 290671	2016-12-28 16:45:37 +00:00
Davide Italiano	0e71480523	[NewGVN] Global sweep replacing NULL with nullptr. NFCI. llvm-svn: 290670	2016-12-28 14:00:11 +00:00
Davide Italiano	0fb3c7cde5	[NewGVN] Remove redundant code. NFCI. llvm-svn: 290669	2016-12-28 13:54:16 +00:00
Alexander Kornienko	dd0c0ba82c	[clang-tidy] google-explicit-constructor: ignore compiler-generated conversion operators. llvm-svn: 290668	2016-12-28 13:48:03 +00:00
Davide Italiano	b111409015	[NewGVN] equals() for loads/stores is the same. Unify. Differential Revision: https://reviews.llvm.org/D28116 llvm-svn: 290667	2016-12-28 13:37:17 +00:00
Eric Fiselier	99940720c8	Fix typo in comment llvm-svn: 290666	2016-12-28 11:09:18 +00:00

1 2 3 4 5 ...

250741 Commits All Branches Search

250741 Commits

All Branches