llvm-project

Commit Graph

Author	SHA1	Message	Date
Reid Kleckner	b034526853	Reland "Use ScopedPrinter in llvm-pdbdump" This reverts r268508 and reinstates r268506 with an additional cast from TypeLeafKind to unsigned to allow conversion to HexNumber. llvm-svn: 268517	2016-05-04 16:09:04 +00:00
Chad Rosier	89d9de5af7	Typo. NFC. llvm-svn: 268513	2016-05-04 15:42:29 +00:00
Chad Rosier	20dbbf3542	Revert "Use ScopedPrinter in llvm-pdbdump" This reverts commit r268506 due to build breakage. llvm-svn: 268508	2016-05-04 15:25:06 +00:00
Zachary Turner	cdd313ca19	Use ScopedPrinter in llvm-pdbdump When printing raw PDB file fields, streams, and records, use the ScopedPrinter class so we have consistency with llvm-readobj's output format. For the most part this is pretty mechanical, but I had to fix up the test file to conform to the new YAMLesque output format. i added a few additional helper functions to the ScopedPrinter such as one to print a dotted version, etc. Differential Revision: http://reviews.llvm.org/D19897 Reviewed By: rnk llvm-svn: 268506	2016-05-04 15:05:12 +00:00
Igor Laevsky	fb1811d3a0	[RS4GC] Use SetVector/MapVector instead of DenseSet/DenseMap to guarantee stable ordering Goal of this change is to guarantee stable ordering of the statepoint arguments and other newly inserted values such as gc.relocates. Previously we had explicit sorting in a couple of places. However for unnamed values ordering was partial and overall we didn't have any strong invariant regarding it. This change switches all data structures to use SetVector's and MapVector's which provide possibility for deterministic iteration over them. Explicit sorting is now redundant and was removed. Differential Revision: http://reviews.llvm.org/D19669 llvm-svn: 268502	2016-05-04 14:55:36 +00:00
Zachary Turner	e9bc5ce124	Make ListScope and DictScope re-use the same code. llvm-svn: 268472	2016-05-04 01:46:59 +00:00
Richard Smith	6cf4325cb8	Remove (unused) "using namespace llvm;" in Support header. This caused the modules build to break. llvm-svn: 268469	2016-05-04 00:21:33 +00:00
David Majnemer	95549497ec	[GlobalDCE, Misc] Don't remove functions referenced by ifuncs We forgot to consider the target of ifuncs when considering if a function was alive or dead. N.B. Also update a few auxiliary tools like bugpoint and verify-uselistorder. This fixes PR27593. llvm-svn: 268468	2016-05-04 00:20:48 +00:00
Vedant Kumar	27370a09dd	[Support] Add a free toString function for Error toString() consumes an Error and returns a string representation of its contents. This commit also adds a message() method to ErrorInfoBase for convenience. Differential Revision: http://reviews.llvm.org/D19883 llvm-svn: 268465	2016-05-03 23:32:31 +00:00
Kevin Enderby	a8e3ab0c56	Produce another specific error message for a malformed Mach-O file when a load command has a size less than 8 bytes. I think the existing test case in test/Object/macho-invalid.test for macho64-invalid-too-small-load-command was trying to test for this but that test case triggered a different error given how it was constructed. So I constructed a new test case that would trigger this specific error. I also changed the error message to be consistent with the other malformed Mach-O file error messages. I also removed object_error::macho_small_load_command from Object/Error.h as it is not needed and can just use object_error::parse_failed and let the error message string distinguish the error. llvm-svn: 268463	2016-05-03 23:13:50 +00:00
Zachary Turner	2d02ceefdc	Move CodeViewTypeStream to DebugInfo/CodeView Ability to parse codeview type streams is also needed by DebugInfoPDB for parsing PDBs, so moving this into a library gives us this option. Since DebugInfoPDB had already hand rolled some code to do this, that code is now convereted over to using this common abstraction. Differential Revision: http://reviews.llvm.org/D19887 Reviewed By: dblaikie, amccarth llvm-svn: 268454	2016-05-03 22:18:17 +00:00
Justin Bogner	d0d2341f30	PM: Port LoopRotation to the new loop pass manager llvm-svn: 268452	2016-05-03 22:02:31 +00:00
Justin Bogner	ab6a513b4e	PM: Port LoopSimplifyCFG to the new pass manager llvm-svn: 268446	2016-05-03 21:47:32 +00:00
Justin Bogner	e839c3e6ab	PM: Check that loop passes preserve a basic set of analyses A loop pass that didn't preserve this entire set of passes wouldn't play well with other loop passes, since these are generally a basic requirement to do any interesting transformations to a loop. Adds a helper to get the set of analyses a loop pass should preserve, and checks that any loop pass we run satisfies the requirement. llvm-svn: 268444	2016-05-03 21:35:08 +00:00
Vedant Kumar	845018d24a	[Support] Drop template line in favor of 'inline' Reviewed-by: lhames llvm-svn: 268438	2016-05-03 21:04:42 +00:00
Reid Kleckner	1e4ba37df1	[ADT] Add drop_front method to ArrayRef We have it for StringRef but not ArrayRef, and ArrayRef has drop_back, so I see no reason it shouldn't have drop_front. Splitting this out of a change that I have that will use this funcitonality. llvm-svn: 268434	2016-05-03 20:53:20 +00:00
Davide Italiano	66228c4cf1	[IPO/GlobalDCE] Port to the new pass manager. Differential Revision: http://reviews.llvm.org/D19782 llvm-svn: 268425	2016-05-03 19:39:15 +00:00
Davide Italiano	596b0c0f24	[Reassociate] Remove unneeded constructor. llvm-svn: 268417	2016-05-03 18:34:51 +00:00
Lang Hames	6307827e75	[ExecutionEngine] Add comment explainging that ExecutionEngine::addGlobalMapping can only be used on named values. https://llvm.org/bugs/PR23497 llvm-svn: 268415	2016-05-03 18:19:35 +00:00
Vedant Kumar	43cba7333c	[ProfileData] Add error codes for compression failures Be more specific in describing compression failures. Also, check for this kind of error in emitNameData(). This is part of a series of patches to transition ProfileData over to the stricter Error/Expected interface. llvm-svn: 268400	2016-05-03 16:53:17 +00:00
Mehdi Amini	b52eb8b226	Provide some default values for the ThinLTO Cache pruning This control how the cache is pruned. The cache still has to be explicitely enabled/disabled by providing a path. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268393	2016-05-03 15:17:50 +00:00
Daniel Sanders	fe98b2f54b	[mips] Use MipsMCExpr instead of MCSymbolRefExpr for all relocations. Summary: This is much closer to the way MIPS relocation expressions work (%hi(foo + 2) rather than %hi(foo) + 2) and removes the need for the various bodges in MipsAsmParser::evaluateRelocExpr(). Removing those bodges ensures that the constant stored in MCValue is the full 32 or 64-bit (depending on ABI) offset from the symbol. This will be used to correct the %hi/%lo matching needed to sort the relocation table correctly. As part of this: * Gave MCExpr::print() the ability to omit parenthesis when emitting a symbol reference inside a MipsMCExpr operator like %hi(X). Without this we print things like %lo(($L1)). * %hi(%neg(%gprel(X))) is now three MipsMCExpr's instead of one. Most of the related special cases have been removed or moved to MipsMCExpr. We can remove the rest as we gain support for the less common relocations when they are not part of this specific combination. * Renamed MipsMCExpr::VariantKind and the enum prefix ('VK_') to avoid confusion with MCSymbolRefExpr::VariantKind and its prefix (also 'VK_'). * fixup_Mips_GOT_Local and fixup_Mips_GOT_Global were found to be identical and merged into fixup_Mips_GOT. * MO_GOT16 and MO_GOT turned out to be identical and have been merged into MO_GOT. * VK_Mips_GOT and VK_Mips_GOT16 turned out to be the same thing so they have been merged into MEK_GOT Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19716 llvm-svn: 268379	2016-05-03 13:35:44 +00:00
Craig Topper	3fc0e668ff	[CodeGen] Add some space optimized forms of EmitNode and MorphNodeTo that implicitly indicate the number of result VTs. This shaves about 16K off the X86 matching table taking it down to about 470K. Overall this reduces the llc binary size with all in-tree targets by about 40K. llvm-svn: 268365	2016-05-03 05:54:13 +00:00
David Majnemer	3d90bb79c4	[LoopUnroll] Unroll loops which have exit blocks to EH pads We were overly cautious in our analysis of loops which have invokes which unwind to EH pads. The loop unroll transform is safe because it only clones blocks in the loop body, it does not try to split critical edges involving EH pads. Instead, move the necessary safety check to LoopUnswitch. N.B. The safety check for loop unswitch is covered by an existing test which fails without it. llvm-svn: 268357	2016-05-03 03:57:40 +00:00
Reid Kleckner	bc041b82d4	[codeview] Maintain the type enum-to-classname mapping in the .def file This way it will be easy to stamp out something like a type visitor. llvm-svn: 268347	2016-05-03 00:45:14 +00:00
Zachary Turner	f5c59654f7	Parse the TPI (type information) stream of PDB files. This parses the TPI stream (stream 2) from the PDB file. This stream contains some header information followed by a series of codeview records. There is some additional complexity here in that alongside this stream of codeview records is a serialized hash table in order to efficiently query the types. We parse the necessary bookkeeping information to allow us to reconstruct the hash table, but we do not actually construct it yet as there are still a few things that need to be understood first. Differential Revision: http://reviews.llvm.org/D19840 Reviewed By: ruiu, rnk llvm-svn: 268343	2016-05-03 00:28:21 +00:00
Zachary Turner	88bb163f81	Move llvm-readobj/StreamWriter to Support. We wish to re-use this from llvm-pdbdump, and it provides a nice way to print structured data in scoped format that could prove useful for many other dumping tools as well. Moving to support and changing name to ScopedPrinter to better reflect its purpose. llvm-svn: 268342	2016-05-03 00:28:04 +00:00
Mehdi Amini	5b85d8d67b	ThinLTO: do not import function whose linkage prevents inlining. There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268341	2016-05-03 00:27:28 +00:00
Matthias Braun	d1aabb2813	livePhysRegs: Pass MBB by reference in addLive{Ins\|Outs}(); NFC The block must no be nullptr for the addLiveIns()/addLiveOuts() function. llvm-svn: 268340	2016-05-03 00:24:32 +00:00
Matthias Braun	24f26e6d91	LivePhysRegs: Automatically determine presence of pristine regs. Remove the AddPristinesAndCSRs parameters from addLiveIns()/addLiveOuts(). We need to respect pristine registers after prologue epilogue insertion, Seeing that we got this wrong in at least two commits already, we should rather pay the small price to query MachineFrameInfo for it. There are three cases that did not set AddPristineAndCSRs to true even after register allocation: - ExecutionDepsFix: live-out registers are used as a hint that the register is used soon. This is not true for pristine registers so use the new addLiveOutsNoPristines() to maintain this behaviour. - SystemZShortenInst: Not setting AddPristineAndCSRs to true looks like a bug, should do the right thing automatically now. - StackMapLivenessAnalysis: Not adding pristine registers looks like a bug to me. Added a FIXME comment but maintain the current behaviour as a change may need to get coordinated with GC runtimes. llvm-svn: 268336	2016-05-03 00:08:46 +00:00
Reid Kleckner	97837b7b09	[MC] Create unique .pdata sections for every .text section Summary: This adds a unique ID to the COFF section uniquing map, similar to the one we have for ELF. The unique id is not currently exposed via the assembler because we don't have a use case for it yet. Users generally create .pdata with the .seh_* family of directives, and the assembler internally needs to produce .pdata and .xdata sections corresponding to the code section. The association between .text sections and the assembler-created .xdata and .pdata sections is maintained as an ID field of MCSectionCOFF. The CFI-related sections are created with the given unique ID, so if more code is added to the same text section, we can find and reuse the CFI sections that were already created. Reviewers: majnemer, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19376 llvm-svn: 268331	2016-05-02 23:22:18 +00:00
Quentin Colombet	4e1d389ac5	[X86] Model FAULTING_LOAD_OP as a terminator and branch. This operation may branch to the handler block and we do not want it to happen anywhere within the basic block. Moreover, by marking it "terminator and branch" the machine verifier does not wrongly assume (because of AnalyzeBranch not knowing better) the branch is analyzable. Indeed, the target was seeing only the unconditional branch and not the faulting load op and thought it was a simple unconditional block. The machine verifier was complaining because of that and moreover, other optimizations could have done wrong transformation! In the process, simplify the representation of the handler block in the faulting load op. Now, we directly reference the handler block instead of using a label. This has the benefits of: 1. MC knows how to issue a label for a BB, so leave that to it. 2. Accessing the target BB from its label is painful, whereas it is direct from a MBB operand. Note: The 2 bytes offset in implicit-null-check.ll comes from the fact the unconditional jumps are not removed anymore, as the whole terminator sequence is not analyzable anymore. Will fix it in a subsequence commit. llvm-svn: 268327	2016-05-02 22:58:54 +00:00
Wolfgang Pieb	56aa4b0629	DebugInfo: Avoid propagating incorrect debug locations in SelectionDAG via CSE. Summary: When SelectionDAG performs CSE it is possible that the context's source location is different from that of the selected node. This can lead to incorrect line number records. We update the debug location to the one that occurs earlier in the instruction sequence. This fixes PR21006. Reviewers: echristo, sdmitrouk Subscribers: jevinskie, asl, llvm-commits Differential Revision: http://reviews.llvm.org/D12094 llvm-svn: 268323	2016-05-02 22:50:51 +00:00
Mehdi Amini	1e918c9cb3	Revert "ThinLTO: do not import function whose linkage prevents inlining." This reverts commit r268315, the tests are not passing. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268317	2016-05-02 22:26:04 +00:00
Mehdi Amini	bda9b2ae9e	ThinLTO: do not import function whose linkage prevents inlining. There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268315	2016-05-02 22:11:27 +00:00
Vedant Kumar	58d1cd39be	Fix a typo, NFC llvm-svn: 268311	2016-05-02 21:32:07 +00:00
Kevin Enderby	7bd8d99497	Thread Expected<...> up from libObject’s getType() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s section index is more than the number of sections. The existing test case in test/Object/macho-invalid.test for macho-invalid-section-index-getSectionRawName now reports the error with the message indicating that a symbol at a specific index has a bad section index and that bad section index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: "// TODO: Actually report errors helpfully" and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. llvm-svn: 268298	2016-05-02 20:28:12 +00:00
John Regehr	e1c481dccf	[LVI] Add an API to LazyValueInfo so that it can export ConstantRanges that it computes. Currently this is used for testing and precision tuning, but it might be used by optimizations later. Differential Revision: http://reviews.llvm.org/D19179 llvm-svn: 268291	2016-05-02 19:58:00 +00:00
Reid Kleckner	bca59d2a43	Revert "[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for empty block including lifetime intrinsics" This reverts commit r268254. This change causes assertion failures while building Chromium. Reduced test case coming soon. llvm-svn: 268288	2016-05-02 19:43:22 +00:00
Zachary Turner	b56d904433	PDB - Instead of hardcoding stream numbers, use an enum. llvm-svn: 268270	2016-05-02 18:09:21 +00:00
Zachary Turner	0eace0bae5	Parse PDB Name Hash Table PDB has a lot of similar data structures. We already have code for parsing a Name Map, but PDB seems to have a different but very similar structure that is a hash table. This is the beginning of code needed in order to parse the name hash table, but it is not yet complete. It parses the basic metadata of the hash table, the bucket array, and the names buffer, but doesn't use any of these fields yet as the data structure requires a non-trivial amount of work to understand. llvm-svn: 268268	2016-05-02 18:09:14 +00:00
Derek Schuff	31680dd832	[WebAssembly] Rename memory_size intrinsic to current_memory This follows the recent renaming in the wasm spec. llvm-svn: 268255	2016-05-02 17:25:22 +00:00
Hans Wennborg	b7599329fc	[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for empty block including lifetime intrinsics Make it possible that TryToSimplifyUncondBranchFromEmptyBlock merges empty basic block including lifetime intrinsics as well as phi nodes and unconditional branch into its successor or predecessor(s). If successor of empty block has single predecessor, all contents including lifetime intrinsics are sinked into the successor. Otherwise, they are hoisted into its predecessor(s) and then merged into the predecessor(s). Patch by Josh Yoon <josh.yoon@samsung.com>! Differential Revision: http://reviews.llvm.org/D19257 llvm-svn: 268254	2016-05-02 17:22:54 +00:00
Rafael Espindola	694210cddc	Expose a getFullName for thin archive members. It will be used in lld. llvm-svn: 268226	2016-05-02 13:45:06 +00:00
Craig Topper	e3c1e225d7	[CodeGen] Add OPC_MoveChild0-OPC_MoveChild7 opcodes to isel matching tables to optimize table size. Shaves about 12K off the X86 matcher table. llvm-svn: 268209	2016-05-02 01:53:30 +00:00
Amaury Sechet	8a367d404f	Properly name LLVMSetIsInBounds's argument. NFC llvm-svn: 268176	2016-05-01 02:23:14 +00:00
Amaury Sechet	81243a73ef	Capitalize align argument in the C API as per convention. NFC llvm-svn: 268175	2016-05-01 01:42:34 +00:00
Lang Hames	2307f405cc	[ORC] Save AArch64 NEON state in the JIT reentry block. The earlier version of the resolver code did not save NEON state, so it would have broken any callees that used floating point. llvm-svn: 268173	2016-05-01 00:14:45 +00:00
Sriraman Tallam	c198d3344e	Differential Revision: http://reviews.llvm.org/D19753 Delete Target Option PositionIndependentExecutable as PIE is now part of module flags. llvm-svn: 268155	2016-04-30 04:18:52 +00:00
Lang Hames	df29078dc8	[Orc] Fix the AArch64 resolver size. llvm-svn: 268146	2016-04-30 00:50:26 +00:00
Vedant Kumar	62db78449f	Fix a typo (NFC) llvm-svn: 268144	2016-04-30 00:32:54 +00:00
Xinliang David Li	4b2fdccad9	Reapply r268107 after fixing a bug breaks debug build. Makes the new method to set data needed by debug dump. llvm-svn: 268130	2016-04-29 22:59:36 +00:00
Sanjay Patel	1d0ac7c5b8	clean up documentation comments; NFC llvm-svn: 268122	2016-04-29 22:03:27 +00:00
Xinliang David Li	0552521b03	Revert r268107 -- debug build failure llvm-svn: 268116	2016-04-29 21:43:28 +00:00
Lang Hames	f74e93b600	[Orc] Add ORC lazy-compilation support for AArch64. The ORC compile callbacks and indirect stubs APIs will now work for AArc64, allowing functions to be lazily compiled and/or updated. llvm-svn: 268112	2016-04-29 21:32:00 +00:00
Lang Hames	ae643ee2d1	[Orc] Make sure we don't drop the internal error in OrcRemoteTargetClient when the constructor fails, as this would lead to an 'unchecked error' crash. llvm-svn: 268111	2016-04-29 21:29:48 +00:00
Xinliang David Li	1ffa28a3f1	[inliner]: Refactor inline deferring logic into its own method /NFC The implemented heuristic has a large body of code which better sits in its own function for better readability. It also allows adding more heuristics easier in the future. llvm-svn: 268107	2016-04-29 21:21:44 +00:00
Sriraman Tallam	7da9b445ea	Differential Revision: http://reviews.llvm.org/D19733 llvm-svn: 268106	2016-04-29 21:19:16 +00:00
Matt Arsenault	dc4ebad6d4	AMDGPU: Add kernarg.segment.ptr intrinsic llvm-svn: 268105	2016-04-29 21:16:52 +00:00
Matt Arsenault	2c15956749	Fix comment llvm-svn: 268102	2016-04-29 21:02:04 +00:00
Easwaran Raman	dc7071226b	Move coverage related code into a separate library. Differential Revision: http://reviews.llvm.org/D19333 llvm-svn: 268089	2016-04-29 18:53:05 +00:00
Zachary Turner	2f09b5091c	Put PDB parsing code into a pdb namespace. llvm-svn: 268072	2016-04-29 17:28:47 +00:00
Zachary Turner	6ba65deeb9	Refactor the PDB Stream reading interface. The motivation for this change is that PDB has the notion of streams and substreams. Substreams often consist of variable length structures that are convenient to be able to treat as guaranteed, contiguous byte arrays, whereas the streams they are contained in are not necessarily so, as a single stream could be spread across many discontiguous blocks. So, when processing data from a substream, we want to be able to assume that we have a contiguous byte array so that we can cast pointers to variable length arrays and such. This leads to the question of how to be able to read the same data structure from either a stream or a substream using the same interface, which is where this patch comes in. We separate out the stream's read state from the underlying representation, and introduce a `StreamReader` class. Then we change the name of `PDBStream` to `MappedBlockStream`, and introduce a second kind of stream called a `ByteStream` which is simply a sequence of contiguous bytes. Finally, we update all of the std::vectors in `PDBDbiStream` to use `ByteStream` instead as a proof of concept. llvm-svn: 268071	2016-04-29 17:22:58 +00:00
Filipe Cabecinhas	7894938a45	Add operator- to Path's reverse_iterator. Needed for D19666 Reviewers: rafael, craig.topper, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19724 llvm-svn: 268062	2016-04-29 16:48:07 +00:00
Filipe Cabecinhas	0da9937517	Unify XDEBUG and EXPENSIVE_CHECKS (into the latter), and add an option to the cmake build to enable them. Summary: Historically, we had a switch in the Makefiles for turning on "expensive checks". This has never been ported to the cmake build, but the (dead-ish) code is still around. This will also make it easier to turn it on in buildbots. Reviewers: chandlerc Subscribers: jyknight, mzolotukhin, RKSimon, gberry, llvm-commits Differential Revision: http://reviews.llvm.org/D19723 llvm-svn: 268050	2016-04-29 15:22:48 +00:00
Sanjay Patel	0a36e301af	fix typo; NFC llvm-svn: 268045	2016-04-29 14:53:54 +00:00
David Majnemer	fadc6db036	[GlobalOpt] Propagate operand bundles We neglected to transfer operand bundles for some transforms. These were found via inspection, I'll try to come up with some test cases. llvm-svn: 268011	2016-04-29 08:07:22 +00:00
Zachary Turner	84c3a8ba3d	Read the rest of the DBI substreams, and parse source info. We now read out the rest of the substreams from the DBI streams. One of these substreams, the FileInfo substream, contains information about which source files contribute to each module (aka compiland). This patch additionally parses out the file information from that substream, and dumps it in llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19634 Reviewed by: ruiu llvm-svn: 267928	2016-04-28 20:05:18 +00:00
Krzysztof Parzyszek	7ea9a529aa	Reset the TopRPTracker's position in ScheduleDAGMILive::initQueues ScheduleDAGMI::initQueues changes the RegionBegin to the first non-debug instruction. Since it does not track register pressure, it does not affect any RP trackers. ScheduleDAGMILive inherits initQueues from ScheduleDAGMI, and it does reset the TopTPTracker in its schedule method. Any derived, target-specific scheduler will need to do it as well, but the TopRPTracker is only exposed as a "const" object to derived classes. Without the ability to modify the tracker directly, this leaves a derived scheduler with a potential of having the TopRPTracker out-of-sync with the CurrentTop. The symptom of the problem: void llvm::ScheduleDAGMILive::scheduleMI(llvm::SUnit *, bool): Assertion `TopRPTracker.getPos() == CurrentTop && "out of sync"' failed. Differential Revision: http://reviews.llvm.org/D19438 llvm-svn: 267918	2016-04-28 19:17:44 +00:00
Eugene Zelenko	f918b00490	Fix builds broken in r267910. llvm-svn: 267915	2016-04-28 18:27:35 +00:00
Sriraman Tallam	46d47b8ce2	Add "PIE Level" metadata to module flags. http://reviews.llvm.org/D19671 llvm-svn: 267911	2016-04-28 18:15:44 +00:00
Eugene Zelenko	5354a8aa4d	Fix some Clang-tidy modernize and Include What You Use warnings. Differential revision: http://reviews.llvm.org/D19673 llvm-svn: 267910	2016-04-28 18:04:41 +00:00
Chad Rosier	712b7d7630	[GVN] Minor code cleanup. NFC. Differential Revision: http://reviews.llvm.org/D18828 Patch by Aditya Kumar! llvm-svn: 267898	2016-04-28 16:00:15 +00:00
Matthias Braun	fbe85ae12e	CodeGen: Add DetectDeadLanes pass. The DetectDeadLanes pass performs a dataflow analysis of used/defined subregister lanes across COPY instructions and instructions that will get lowered to copies. It detects dead definitions and uses reading undefined values which are obscured by COPY and subregister usage. These dead definitions cause trouble in the register coalescer which cannot deal with definitions suddenly becoming dead after coalescing COPY instructions. For now the pass only adds dead and undef flags to machine operands. It should be possible to extend it in the future to remove the dead instructions and redo the analysis for the affected virtual registers. Differential Revision: http://reviews.llvm.org/D18427 llvm-svn: 267851	2016-04-28 03:07:16 +00:00
Matthias Braun	39d1fad554	TableGen: Produce CoveredBySubRegs summary for register classes This will be used in the upcoming "DetectDeadLanes" pass. llvm-svn: 267850	2016-04-28 03:07:11 +00:00
Matthias Braun	cd69bcf6d5	TargetRegisterInfo: Introduce reverseComposeSubRegIndexLaneMask() This function performs the reverse computation of composeSubRegIndexLaneMask(). It will be used in the upcoming "DetectDeadLanes" pass. llvm-svn: 267849	2016-04-28 03:07:07 +00:00
Matthias Braun	ff04541f4a	TableGen: Support lanemasks for classes without subregisters Previously using lanemasks on registers without any subregisters was not well defined. This commit extends TargetRegisterInfo/tablegen to: - Report a lanemask of 1 for regclasses without subregisters - Do the right thing when mapping a 0/1 lanemask from a class without subregisters into a class with subregisters in TargetRegisterInfo::composeSubRegIndexLaneMasks(). This will be used in the upcoming "DetectDeadLanes" patch. llvm-svn: 267848	2016-04-28 03:07:04 +00:00
Richard Smith	86ae84b0d9	Fix bogus documentation for StringRef::slice in the End < Start case. llvm-svn: 267831	2016-04-28 00:57:14 +00:00
Zachary Turner	1822af542f	Parse module information from DBI stream. This gets more data out of the DBI strema of the PDB. In particular it extracts the metadata for the list of modules (compilands) that this PDB contains info about, and adds support for dumping these fields to llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19570 Reviewed By: ruiu llvm-svn: 267818	2016-04-27 23:41:42 +00:00
Rong Xu	6e34c490ff	[PGO] Promote indirect calls to conditional direct calls with value-profile This patch implements the transformation that promotes indirect calls to conditional direct calls when the indirect-call value profile meta-data is available. Differential Revision: http://reviews.llvm.org/D17864 llvm-svn: 267815	2016-04-27 23:20:27 +00:00
Colin LeMahieu	485d905510	[MCAssembler] Allow backend to finalize layout post-relaxation. Differential revision: http://reviews.llvm.org/D19429 llvm-svn: 267796	2016-04-27 21:26:13 +00:00
Lang Hames	8959531c51	[RuntimeDyld] Plumb Error/Expected through the internals of RuntimeDyld. Also replaces a number of calls to report_fatal_error with Error returns. The plumbing will make it easier to return errors originating in libObject. Replacing report_fatal_errors with Error returns will give JIT clients the opportunity to recover gracefully when the JIT is unable to produce/relocate code, as well as providing meaningful error messages that can be used to file bug reports. llvm-svn: 267776	2016-04-27 20:24:48 +00:00
Justin Lebar	6e1089b3a7	Add TargetMachine::addEarlyAsPossiblePasses, and call it from opt. Summary: This is a hook to allow TargetMachine to install passes at the EP_EarlyAsPossible PassManagerBuilder extension point. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18614 llvm-svn: 267763	2016-04-27 19:08:24 +00:00
Ahmed Bougacha	b0624a2cb4	[TLI] Unify LibFunc attribute inference. NFCI. Now the pass is just a tiny wrapper around the util. This lets us reuse the logic elsewhere (done here for BuildLibCalls) instead of duplicating it. The next step is to have something like getOrInsertLibFunc that also sets the attributes. Differential Revision: http://reviews.llvm.org/D19470 llvm-svn: 267759	2016-04-27 19:04:40 +00:00
Ahmed Bougacha	d765a82b54	[TLI] Unify LibFunc signature checking. NFCI. I tried to be as close as possible to the strongest check that existed before; cleaning these up properly is left for future work. Differential Revision: http://reviews.llvm.org/D19469 llvm-svn: 267758	2016-04-27 19:04:35 +00:00
Gerolf Hoflehner	50426191d7	[DAGCombiner] Follow coding convention for function name (NFC) llvm-svn: 267745	2016-04-27 17:27:16 +00:00
Nicolai Haehnle	f66bdb5ea8	AMDGPU/SI: Add llvm.amdgcn.s.waitcnt.all intrinsic Summary: So it appears that to guarantee some of the ordering requirements of a GLSL memoryBarrier() executed in the shader, we need to emit an s_waitcnt. (We can't use an s_barrier, because memoryBarrier() may appear anywhere in the shader, in particular it may appear in non-uniform control flow.) Reviewers: arsenm, mareko, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19203 llvm-svn: 267729	2016-04-27 15:46:01 +00:00
Matthew Simpson	e5dfb08fcb	[TTI] Add hook for vector extract with extension This change adds a new hook for estimating the cost of vector extracts followed by zero- and sign-extensions. The motivating example for this change is the SMOV and UMOV instructions on AArch64. These instructions move data from vector to general purpose registers while performing the corresponding extension (sign-extend for SMOV and zero-extend for UMOV) at the same time. For these operations, TargetTransformInfo can assume the extensions are free and only report the cost of the vector extract. The SLP vectorizer has been updated to make use of the new hook. Differential Revision: http://reviews.llvm.org/D18523 llvm-svn: 267725	2016-04-27 15:20:21 +00:00
Teresa Johnson	02e98331c0	[ThinLTO] Use valueid instead of bitcode offsets in combined index file Summary: With the removal of support for lazy parsing of combined index summary records (e.g. r267344), we no longer need to include the summary record bitcode offset in the VST entries for definitions. Change the combined index format to be similar to the per-module index format in using value ids to cross-reference from the summary record to the VST entry (rather than the summary record bitcode offset to cross-reference in the other direction). The visible changes are: 1) Add the value id to the combined summary records 2) Remove the summary offset from the combined VST records, which has the following effects: - No longer need the VST_CODE_COMBINED_GVDEFENTRY record, as all combined index VST entries now only contain the value id and corresponding GUID. - No longer have duplicate VST entries in the case where there are multiple definitions of a symbol (e.g. weak/linkonce), as they all have the same value id and GUID. An implication of #2 above is that in order to hook up an alias to the correct aliasee based on the value id of the aliasee recorded in the combined index alias record, we need to scan the entries in the index for that GUID to find the one from the same module (i.e. the case where there are multiple entries for the aliasee). But the reader no longer has to maintain a special map to hook up the alias/aliasee. Reviewers: joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19481 llvm-svn: 267712	2016-04-27 13:28:35 +00:00
Artur Pilipenko	345f01481b	NFC. Introduce Value::getPointerDerferecnceableBytes Extract a part of isDereferenceableAndAlignedPointer functionality to Value::getPointerDerferecnceableBytes. Currently it's a NFC, but in future I'm going to accumulate all the logic about value dereferenceability in this function similarly to Value::getPointerAlignment function (D16144). Reviewed By: reames Differential Revision: http://reviews.llvm.org/D17572 llvm-svn: 267708	2016-04-27 12:51:01 +00:00
Artur Pilipenko	9bb6beabf4	isSafeToLoadUnconditionally support queries without a context This is required to use this function from isSafeToSpeculativelyExecute Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D16231 llvm-svn: 267692	2016-04-27 11:00:48 +00:00
Adam Nemet	d2fa414718	[LoopDist] Add llvm.loop.distribute.enable loop metadata Summary: D19403 adds a new pragma for loop distribution. This change adds support for the corresponding metadata that the pragma is translated to by the FE. As part of this I had to rethink the flag -enable-loop-distribute. My goal was to be backward compatible with the existing behavior: A1. pass is off by default from the optimization pipeline unless -enable-loop-distribute is specified A2. pass is on when invoked directly from opt (e.g. for unit-testing) The new pragma/metadata overrides these defaults so the new behavior is: B1. A1 + enable distribution for individual loop with the pragma/metadata B2. A2 + disable distribution for individual loop with the pragma/metadata The default value whether the pass is on or off comes from the initiator of the pass. From the PassManagerBuilder the default is off, from opt it's on. I moved -enable-loop-distribute under the pass. If the flag is specified it overrides the default from above. Then the pragma/metadata can further modifies this per loop. As a side-effect, we can now also use -enable-loop-distribute=0 from opt to emulate the default from the optimization pipeline. So to be precise this is the new behavior: C1. pass is off by default from the optimization pipeline unless -enable-loop-distribute or the pragma/metadata enables it C2. pass is on when invoked directly from opt unless -enable-loop-distribute=0 or the pragma/metadata disables it Reviewers: hfinkel Subscribers: joker.eph, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D19431 llvm-svn: 267672	2016-04-27 05:28:18 +00:00
Vaivaswatha Nagaraj	08efb0efcd	[Cloning] cloneLoopWithPreheader(): add assert to ensure no sub-loops Summary: cloneLoopWithPreheader() does not update LoopInfo for sub-loop of the original loop being cloned. Add assert to ensure no sub-loops for loop being cloned. Reviewers: anemet, ashutosh.nema, hfinkel Subscribers: mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D15922 llvm-svn: 267671	2016-04-27 05:25:09 +00:00
Mehdi Amini	c7b950171d	Revert "Support "preserving" the summary information when using setModule() API in LTOCodeGenerator" This reverts commit r267665. ASAN shows that there is a use of undefined value. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267668	2016-04-27 05:11:44 +00:00
Mehdi Amini	360ed847bc	Support "preserving" the summary information when using setModule() API in LTOCodeGenerator Another attempt at r267655... From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267665	2016-04-27 04:24:10 +00:00
Mehdi Amini	a1b8b6cd56	Revert "Support "preserving" the summary information when using setModule() API in LTOCodeGenerator" This reverts commit r267657, r267656, and r267655. The test does not pass on multiple bots, I'm unsure why yet but let's unbreak them. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267664	2016-04-27 03:34:28 +00:00
Mehdi Amini	e2a65fe5ec	Support "preserving" the summary information when using setModule() API in LTOCodeGenerator From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267655	2016-04-27 01:46:48 +00:00
Mehdi Amini	b4e1e8297b	ThinLTO: do not promote GlobalVariable that have a specific section. Differential Revision: http://reviews.llvm.org/D18298 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267646	2016-04-27 00:32:13 +00:00
Quentin Colombet	e7d1e99543	[MachineInstrBundle] Update the comment for PhysRegInfo::DeadDef. I missed read the comment when I commited r267621 and thought the comment did not need update. Matthias kindly proved me wrong. Fixing that. llvm-svn: 267638	2016-04-26 23:55:41 +00:00
Justin Bogner	c2bf63d29d	PM: Port Reassociate to the new pass manager llvm-svn: 267631	2016-04-26 23:39:29 +00:00
Quentin Colombet	3f19245015	[MachineInstrBundle] Improvement the recognition of dead definitions. Now, it is possible to know that partial definitions are dead definitions and recognize that clobbered registers are also dead. llvm-svn: 267621	2016-04-26 23:14:24 +00:00
Ahmed Bougacha	128f8732a5	[CodeGen] Add getBuildVector and getSplatBuildVector helpers. NFCI. Differential Revision: http://reviews.llvm.org/D17176 llvm-svn: 267606	2016-04-26 21:15:30 +00:00
Justin Bogner	4563a06cee	PM: Port Internalize to the new pass manager llvm-svn: 267596	2016-04-26 20:15:52 +00:00
Zachary Turner	ff788aa0ee	Fix warnings and -Werror build on clang. llvm-svn: 267589	2016-04-26 19:24:10 +00:00
Zachary Turner	53a65ba5c9	Parse and dump PDB DBI Stream Header Information The DBI stream contains a lot of bookkeeping information for other streams. In particular it contains information about section contributions and linked modules. This patch is a first attempt at parsing some of the information out of the DBI stream. It currently only parses and dumps the headers of the DBI stream, so none of the module data or section contribution data is pulled out. This is just a proof of concept that we understand the basic properties of the DBI stream's metadata, and followup patches will try to extract more detailed information out. Differential Revision: http://reviews.llvm.org/D19500 Reviewed By: majnemer, ruiu llvm-svn: 267585	2016-04-26 18:42:34 +00:00
Krzysztof Parzyszek	4773f647bd	[Tail duplication] Handle source registers with subregisters When a block is tail-duplicated, the PHI nodes from that block are replaced with appropriate COPY instructions. When those PHI nodes contained use operands with subregisters, the subregisters were dropped from the COPY instructions, resulting in incorrect code. Keep track of the subregister information and use this information when remapping instructions from the duplicated block. Differential Revision: http://reviews.llvm.org/D19337 llvm-svn: 267583	2016-04-26 18:36:34 +00:00
Justin Bogner	4d0dcb9891	Internalize: More consistent file header and include guards. NFC Match the style here to the other headers in Transforms/IPO. llvm-svn: 267581	2016-04-26 18:25:30 +00:00
Sanjay Patel	d66607bd8c	[CodeGenPrepare] use branch weight metadata to decide if a select should be turned into a branch This is part of solving PR27344: https://llvm.org/bugs/show_bug.cgi?id=27344 CGP should undo the SimplifyCFG transform for the same reason that earlier patches have used this same mechanism: it's possible that passes between SimplifyCFG and CGP may be able to optimize the IR further with a select in place. For the TLI hook default, >99% taken or not taken is chosen as the default threshold for a highly predictable branch. Even the most limited HW branch predictors will be correct on this branch almost all the time, so even a massive mispredict penalty perf loss would be overcome by the win from all the times the branch was predicted correctly. As a follow-up, we could make the default target hook less conservative by using the SchedMachineModel's MispredictPenalty. Or we could just let targets override the default by implementing the hook with that and other target-specific options. Note that trying to statically determine mispredict rates for close-to-balanced profile weight data is generally impossible if the HW is sufficiently advanced. Ie, 50/50 taken/not-taken might still be 100% predictable. Finally, note that this patch as-is will not solve PR27344 because the current __builtin_unpredictable() branch weight default values are 4 and 64. A proposal to change that is in D19435. Differential Revision: http://reviews.llvm.org/D19488 llvm-svn: 267572	2016-04-26 17:11:17 +00:00
Zachary Turner	ce36c1f2ec	Fix build broken due to order of initialization problem. llvm-svn: 267571	2016-04-26 16:57:53 +00:00
Zachary Turner	f34e01624a	Refactor some more PDB reading code into DebugInfoPDB. Differential Revision: http://reviews.llvm.org/D19445 Reviewed By: David Majnemer llvm-svn: 267564	2016-04-26 16:20:00 +00:00
Lang Hames	2bcc9ad88c	[ORC] Try to work around a GCC 4.7 bug triggered by r267457. llvm-svn: 267510	2016-04-26 01:27:54 +00:00
Junmo Park	3c65acf87e	Remove MinLatency in SchedMachineModel. NFC. Summary: We don't use MinLatency any more since r184032. Reviewers: atrick, hfinkel, mcrosier Differential Revision: http://reviews.llvm.org/D19474 llvm-svn: 267502	2016-04-26 00:37:46 +00:00
Justin Bogner	1a07501379	PM: Port GlobalOpt to the new pass manager llvm-svn: 267499	2016-04-26 00:28:01 +00:00
Sanjay Patel	82059090d3	Add check for "branch_weights" with prof metadata While we're here, fix the comment and variable names to make it clear that these are raw weights, not percentages. llvm-svn: 267491	2016-04-25 23:15:16 +00:00
Chris Bieneman	ed737d7881	[CMake] If set we should pass LLVM_VERSION_INFO into config.h Autoconf used to support setting LLVM_VERSION_INFO and there is some code filtered around llvm in Support/CommandLine.cpp and LTO/LTOCodeGenerator.cpp that uses it if it is set. We also shouldn't be explicitly setting it as a define on llvm-shlib. It is pointless there because there is no code using it in llvm-shlib, and it is better to have it as part of the generated config.h so that it is available everywhere. llvm-svn: 267490	2016-04-25 23:02:47 +00:00
Amaury Sechet	a9250d111a	Comment formating. NFC llvm-svn: 267483	2016-04-25 22:23:30 +00:00
Lang Hames	1fa0e0e006	[ORC] clang-format code that was touched in r267457. NFC. Commit r267457 made a lot of type-substitutions threw off code formatting and alignment. This patch should tidy those changes up. llvm-svn: 267475	2016-04-25 21:21:20 +00:00
Tim Northover	5c3140f745	ARM: put extern __thread stubs in a special section. The linker needs to know that the symbols are thread-local to do its job properly. llvm-svn: 267473	2016-04-25 21:12:04 +00:00
Teresa Johnson	c851d216e2	[ThinLTO] Introduce typedef for commonly-used map type (NFC) Add a typedef for the std::map<GlobalValue::GUID, GlobalValueSummary *> map that is passed around to identify summaries for values defined in a particular module. This shortens up declarations in a variety of places. llvm-svn: 267471	2016-04-25 21:09:51 +00:00
Lang Hames	ef5a0ee2c3	[ORC] Thread Error/Expected through the RPC library. This replaces use of std::error_code and ErrorOr in the ORC RPC support library with Error and Expected. This required updating the OrcRemoteTarget API, Client, and server code, as well as updating the Orc C API. This patch also fixes several instances where Errors were dropped. llvm-svn: 267457	2016-04-25 19:56:45 +00:00
Matt Arsenault	48ab526f12	AMDGPU: Add queue ptr intrinsic llvm-svn: 267451	2016-04-25 19:27:18 +00:00
Matt Arsenault	621d3675cb	Add useful helpers to AddrSpaceCastInst llvm-svn: 267450	2016-04-25 19:27:13 +00:00
Lang Hames	285639f803	[Support] Fix latent bugs in Expected and ExitOnError that were preventing them from working with reference types. llvm-svn: 267448	2016-04-25 19:21:57 +00:00
Zachary Turner	0a43efea95	Resubmit "Refactor raw pdb dumper into library" This fixes a number of endianness issues as well as an ODR violation that hopefully causes everything to be happy. llvm-svn: 267431	2016-04-25 17:38:08 +00:00
Chad Rosier	e2cbd13e56	[ValueTracking] Improve isImpliedCondition when the dominating cond is false. llvm-svn: 267430	2016-04-25 17:23:36 +00:00
Davide Italiano	63593f6202	[ELFRelocs] Other architectures do not have *_NUM reloc. It also seems to be unused. Get rid of it. Thanks to Rafael for pointing out. llvm-svn: 267428	2016-04-25 17:13:39 +00:00
David Majnemer	dd21523653	[WinEH] Update SplitAnalysis::computeLastSplitPoint to cope with multiple EH successors We didn't have logic to correctly handle CFGs where there was more than one EH-pad successor (these are novel with WinEH). There were situations where a register was live in one exceptional successor but not another but the code as written would only consider the first exceptional successor it found. This resulted in split points which were insufficiently early if an invoke was present. This fixes PR27501. N.B. This removes getLandingPadSuccessor. llvm-svn: 267412	2016-04-25 14:31:32 +00:00
Igor Kudrin	ed99a96f06	[Coverage] Restore the correct count value after processing a nested region in case of combined regions. If several regions cover the same area of code, we have to restore the combined value for that area when return from a nested region. This patch achieves that by combining regions before calling buildSegments. Differential Revision: http://reviews.llvm.org/D18610 llvm-svn: 267390	2016-04-25 09:43:37 +00:00
Igor Breger	fd63b80486	fix comments related to Differential Revision: http://reviews.llvm.org/D17913 llvm-svn: 267383	2016-04-25 08:30:28 +00:00
Michael Zuckerman	1bd66dd1c2	Fixing wrong mask size error. From __mmask8 to __mmask16. Was reviewed over the shoulder by AsafBadouh. Connected to review http://reviews.llvm.org/D19195. llvm-svn: 267379	2016-04-25 05:27:51 +00:00
Davide Italiano	22ac493a15	[Support/ELFRelocs] Add R_386_GOT32X. The new relocation recently defined in the Intel386 psABI was still missing from this file. A subsequent commit will add support for GOT32X in MC, together with a test. llvm-svn: 267378	2016-04-25 04:38:08 +00:00
Teresa Johnson	28e457bccd	[ThinLTO] Remove GlobalValueInfo class from index Summary: Remove the GlobalValueInfo and change the ModuleSummaryIndex to directly reference summary objects. The info structure was there to support lazy parsing of the combined index summary objects, which is no longer needed and not supported. Reviewers: joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19462 llvm-svn: 267344	2016-04-24 14:57:11 +00:00
Duncan P. N. Exon Smith	a425bbbfb8	ModuleSummaryIndex: Avoid enum bitfields for MSVC portability Enum bitfields have crazy portability issues with MSVC. Use unsigned instead of LinkageTypes here in the ModuleSummaryIndex to address Takumi's concerns from r267335. llvm-svn: 267342	2016-04-24 14:25:37 +00:00
Duncan P. N. Exon Smith	f265dae1a3	Revert "Declare GlobalValue::LinkageTypes based on unsigned." This reverts commit r267335. The build has been broken for hours because of it: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_build/23352/ The correct fix is avoid using any enum in a bitfield. llvm-svn: 267341	2016-04-24 14:13:17 +00:00
NAKAMURA Takumi	da8f1f4ebb	Declare GlobalValue::LinkageTypes based on unsigned. Or, "LinkageTypes Linkage : 4;" might be sign-extended on msc. llvm-svn: 267335	2016-04-24 10:11:45 +00:00
Mehdi Amini	ca2c54e04e	Add "hasSection" flag in the Summary Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19405 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267329	2016-04-24 05:31:43 +00:00
Gerolf Hoflehner	01b3a6184a	[MachineCombiner] Support for floating-point FMA on ARM64 (re-commit r267098) The original patch caused crashes because it could derefence a null pointer for SelectionDAGTargetInfo for targets that do not define it. Evaluates fmul+fadd -> fmadd combines and similar code sequences in the machine combiner. It adds support for float and double similar to the existing integer implementation. The key features are: - DAGCombiner checks whether it should combine greedily or let the machine combiner do the evaluation. This is only supported on ARM64. - It gives preference to throughput over latency: the heuristic used is to combine always in loops. The targets decides whether the machine combiner should optimize for throughput or latency. - Supports for fmadd, f(n)msub, fmla, fmls patterns - On by default at O3 ffast-math llvm-svn: 267328	2016-04-24 05:14:01 +00:00
Mehdi Amini	c3ed48c1bd	Reorganize GlobalValueSummary with a "Flags" bitfield. Right now it only contains the LinkageType, but will be extended with "hasSection", "isOptSize", "hasInlineAssembly", etc. Differential Revision: http://reviews.llvm.org/D19404 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267319	2016-04-24 03:18:18 +00:00
Mehdi Amini	8fe6936e18	Add a version field in the bitcode for the summary Differential Revision: http://reviews.llvm.org/D19456 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267318	2016-04-24 03:18:11 +00:00
Mehdi Amini	059464fe36	Add an internalization step to the ThinLTOCodeGenerator Keeping as much as possible internal/private is known to help the optimizer. Let's try to benefit from this in ThinLTO. Note: this is early work, but is enough to build clang (and all the LLVM tools). I still need to write some lit-tests... Differential Revision: http://reviews.llvm.org/D19103 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267317	2016-04-24 03:18:01 +00:00
Davide Italiano	f59b0da654	[MC/ELF] Implement support for GOTPCRELX/REX_GOTPCRELX. The option to control the emission of the new relocations is -relax-relocations (blatantly copied from GNU as). It can't be enabled by default because it breaks relatively recent versions of ld.bfd/ld.gold (late 2015). llvm-svn: 267307	2016-04-24 01:03:57 +00:00
Mehdi Amini	ae64eafd31	Store and emit original name in combined index Summary: As discussed in D18298, some local globals can't be renamed/promoted (because they have a section, or because they are referenced from inline assembly). To be able to detect naming collision, we need to keep around the "GUID" using their original name without taking the linkage into account. Reviewers: tejohnson Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19454 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267304	2016-04-23 23:38:17 +00:00
Duncan P. N. Exon Smith	a59d3e5af8	DebugInfo: Remove MDString-based type references Eliminate DITypeIdentifierMap and make DITypeRef a thin wrapper around DIType*. It is no longer legal to refer to a DICompositeType by its 'identifier:', and DIBuilder no longer retains all types with an 'identifier:' automatically. Aside from the bitcode upgrade, this is mainly removing logic to resolve an MDString-based reference to an actualy DIType. The commits leading up to this have made the implicit type map in DICompileUnit's 'retainedTypes:' field superfluous. This does not remove DITypeRef, DIScopeRef, DINodeRef, and DITypeRefArray, or stop using them in DI-related metadata. Although as of this commit they aren't serving a useful purpose, there are patchces under review to reuse them for CodeView support. The tests in LLVM were updated with deref-typerefs.sh, which is attached to the thread "[RFC] Lazy-loading of debug info metadata": http://lists.llvm.org/pipermail/llvm-dev/2016-April/098318.html llvm-svn: 267296	2016-04-23 21:08:00 +00:00
Sanjay Patel	dc88bd6e1f	replace duplicated static functions for profile metadata access with BranchInst member function; NFCI llvm-svn: 267295	2016-04-23 20:01:22 +00:00
Sanjay Patel	85ce0f1f1f	improve documentation comments; NFC llvm-svn: 267292	2016-04-23 16:31:48 +00:00
Duncan P. N. Exon Smith	4b1bc647f0	BitcodeReader: Avoid referencing unresolved nodes from distinct ones Each reference to an unresolved MDNode is expensive, since the RAUW support in MDNode uses a separate allocation and side map. Since a distinct MDNode doesn't require its operands on creation (unlike uniuqed nodes, there's no need to check for structural equivalence), use nullptr for any of its unresolved operands. Besides reducing the burden on MDNode maps, this can avoid allocating temporary MDNodes in the first place. We need some way to track operands. Invent DistinctMDOperandPlaceholder for this purpose, which is a Metadata subclass that holds an ID and points at its single user. DistinctMDOperandPlaceholder::replaceUseWith is just like RAUW, but its name highlights that there is only ever exactly one use. There is no support for moving (or, obviously, copying) these. Move support would be possible but expensive; leaving it unimplemented prevents user error. In the BitcodeReader I originally considered allocating on a BumpPtrAllocator and keeping a vector of pointers to them, and then I realized that std::deque implements exactly this. A couple of obvious follow-ups: - Change ValueEnumerator to emit distinct nodes first to take more advantage of this optimization. (How convenient... I think I might have a couple of patches for this.) - Change DIBuilder and its consumers (like CGDebugInfo in clang) to use something like this when constructing debug info in the first place. llvm-svn: 267270	2016-04-23 04:15:56 +00:00
Amaury Sechet	b130f43bfb	Style fix in Core.h / Core.cpp. NFC llvm-svn: 267257	2016-04-23 00:12:45 +00:00
Tim Northover	9e8eb418e5	MachO: remove weird ARM/Thumb interface from MachOObjectFile Only one consumer (llvm-objdump) actually cared about the fact that there were two triples. Others were actively working around the fact that the Triple returned by getArch might have been invalid. As for llvm-objdump, it needs to be acutely aware of both Triples anyway, so being generic in the exposed API is no benefit. Also rename the version of getArch returning a Triple. Users were having to pass an unwanted nullptr to disambiguate the two, which was nasty. The only functional change here is that armv7m and armv7em object files no longer crash llvm-objdump. llvm-svn: 267249	2016-04-22 23:21:13 +00:00
David Blaikie	e438cff475	llvm-symbolizer: Avoid infinite recursion walking dwos where the dwo contains a dwo_name attribute The dwo_name was added to dwo files to improve diagnostics in dwp, but it confuses tools that attempt to load any dwo named by a dwo_name, even ones inside dwos. Avoid this by keeping track of whether a unit is already a dwo unit, and if so, not loading further dwos. llvm-svn: 267241	2016-04-22 22:50:56 +00:00
Nicolai Haehnle	580471033e	Fix comment about IntrMem properties Summary: Follow up to D19291: it now makes sense to use two IntrMem properties, in particular IntrReadMem + IntrArgMemOnly is common. Pointed out by Mikael Holmén. Reviewers: uabelho, joker.eph, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19418 llvm-svn: 267238	2016-04-22 22:37:58 +00:00

1 2 3 4 5 ...

27336 Commits