llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	3cdc37c5bc	Move delinearization from SCEVAddRecExpr to ScalarEvolution The expressions we delinearize do not necessarily have to have a SCEVAddRecExpr at the outermost level. At this moment, the additional flexibility is not exploited in LLVM itself, but in Polly we will soon use this functionality. For LLVM, this change should not affect existing functionality (which is covered by test/Analysis/Delinearization/) llvm-svn: 240952	2015-06-29 14:42:48 +00:00
Rafael Espindola	6a1bfb2f9b	Factor out the checking of string tables. This moves the error checking for string tables to getStringTable which returns an ErrorOr<StringRef>. This improves error checking, makes it uniform across all string tables and makes it possible to check them once instead of once per name. llvm-svn: 240950	2015-06-29 14:39:25 +00:00
Rafael Espindola	f934a6a104	Convert an assert that can fail into error checking. llvm-svn: 240944	2015-06-29 14:02:24 +00:00
Rafael Espindola	719dc7c436	Remove Elf_Sym_Iter. It was a fairly broken concept for an ELF only class. An ELF file can have two symbol tables, but they have exactly the same format. There is no concept of a dynamic or a static symbol. Storing this on the iterator also makes us do more work per symbol than necessary. To fetch a name we would: * Find if we had a static or a dynamic symbol. * Look at the corresponding symbol table and find the string table section. * Look at the string table section to fetch its contents. * Compute the name as a substring of the string table. All but the last step can be done per symbol table instead of per symbol. This is a step in that direction. llvm-svn: 240939	2015-06-29 12:38:31 +00:00
Elena Demikhovsky	30bc4ca313	AVX-512: all forms of SCATTER instruction on SKX, encoding, intrinsics and tests. llvm-svn: 240936	2015-06-29 12:14:24 +00:00
Javed Absar	d5526303b7	[ARM]: Extend -mfpu options for half-precision and vfpv3xd Some of the permissible ARM -mfpu options, which are supported in GCC, are currently not present in llvm/clang. This patch adds the options: 'neon-fp16', 'vfpv3-fp16', 'vfpv3-d16-fp16', 'vfpv3xd' and 'vfpv3xd-fp16'. These are related to half-precision floating-point and single precision. Reviewers: rengolin, ranjeet.singh Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10645 llvm-svn: 240930	2015-06-29 09:32:29 +00:00
Igor Breger	a7a8e9a018	AVX-512: Implemented missing encoding and intrinsics for FMA instructions Added tests for DAG lowering, encoding and intrinsics Differential Revision: http://reviews.llvm.org/D10796 llvm-svn: 240926	2015-06-29 09:10:00 +00:00
Asaf Badouh	7ec4b7a8bb	[x86][AVX512] Add vscalef support including encoding and intrinsics review: http://reviews.llvm.org/D10730 llvm-svn: 240906	2015-06-28 14:30:39 +00:00
Elena Demikhovsky	6a1a357f1f	AVX-512: Added all SKX forms of GATHER instructions. Added intrinsics. Added encoding and tests. llvm-svn: 240905	2015-06-28 10:53:29 +00:00
Benjamin Kramer	5b455f0b62	[SDAG] Now that we have a way to communicate the exact bit on sdiv use it to simplify sdiv by a constant. We had a hack in SDAGBuilder in place to work around this but now we can avoid that. Call BuildExactSDIV from BuildSDIV so DAGCombiner can perform this trick automatically. The added check in DAGCombiner is necessary to prevent exact sdiv by pow2 from regressing as the target-specific pow2 lowering is not aware of exact bits yet. This is mostly covered by existing tests. One side effect is that we get the better lowering for exact vector sdivs now too :) llvm-svn: 240891	2015-06-27 20:33:26 +00:00
Duncan P. N. Exon Smith	203cbe7f6f	AsmPrinter: Document why DIEValueList uses a linked-list, NFC There are two main reasons why a linked-list makes sense for `DIEValueList`. 1. We want `DIE` to be on a `BumpPtrAllocator` to improve teardown efficiency. Making `DIEValueList` array-based would make that much more complicated. 2. The singly-linked list is fairly memory efficient. The histogram [1] shows that most DIEs have relatively few values, so we often pay less than the 2/3-pointer static overhead of a vector. Furthermore, we don't know ahead of time exactly how many values a `DIE` needs, so a vector-like scheme will on average over-allocate by ~50%. As it happens, that's the same memory overhead as the linked list node. [1]: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-May/085910.html The comment I added to the code is a little more succinct, but I think it's enough to give the idea. llvm-svn: 240868	2015-06-27 01:19:17 +00:00
Duncan P. N. Exon Smith	1f8a99a9ae	IR: Expose ModuleSlotTracker in Value::print() Allow callers of `Value::print()` and `Metadata::print()` to pass in a `ModuleSlotTracker`. This allows them to pay only once for calculating module-level slots (such as Metadata). This is related to PR23865, where there was a huge cost for `MachineFunction::print()`. Although I don't have a particular user in mind for this new code, I have hit big slowdowns before when running `opt -debug`, and I think this will be useful. Going forward, if someone hits a big slowdown with `print()` statements, they can create a `ModuleSlotTracker` and send it through. Similarly, adding support to `Value::dump()` and `Metadata::dump()` should be trivial. I added unit tests to be sure the `print()` functions actually behave the same way with and without the slot tracker. llvm-svn: 240867	2015-06-27 00:38:26 +00:00
Lang Hames	0000afd88c	[StackMaps] Add a lightweight parser for stackmap version 1 sections. The parser provides a convenient interface for reading llvm stackmap v1 sections in object files. This patch also includes a new option for llvm-readobj, '-stackmap', which uses the parser to pretty-print stackmap sections for debugging/testing purposes. llvm-svn: 240860	2015-06-26 23:56:53 +00:00
Duncan P. N. Exon Smith	6529ed40bc	CodeGen: Push the ModuleSlotTracker through Metadata For another 1% speedup on the testcase in PR23865, push the `ModuleSlotTracker` through to metadata-related printing in `MachineBasicBlock::print()`. llvm-svn: 240848	2015-06-26 22:28:47 +00:00
Duncan P. N. Exon Smith	f48e982706	CodeGen: Push the ModuleSlotTracker through MachineOperands Push `ModuleSlotTracker` through `MachineOperand`s, dropping the time for `llc -print-machineinstrs` on the testcase in PR23865 from ~13 seconds to ~9 seconds. Now `SlotTracker::processFunctionMetadata()` accounts for only 8% of the runtime, which seems reasonable. llvm-svn: 240845	2015-06-26 22:06:47 +00:00
Philip Reames	9818dd77b4	[Verifier] Follow on to 240836 Address one missed review comment and do the rename I left out of that patch to make it reviewable. llvm-svn: 240843	2015-06-26 22:04:34 +00:00
Duncan P. N. Exon Smith	3269215401	CodeGen: Use a single SlotTracker in MachineFunction::print() Expose enough of the IR-level `SlotTracker` so that `MachineFunction::print()` can use a single one for printing `BasicBlock`s. Next step would be to lift this through a few more APIs so that we can make other print methods faster. Fixes PR23865, changing the runtime of `llc -print-machineinstrs` from many minutes (killed after 3 minutes, but it wasn't very close) to 13 seconds for a 502185 line dump. llvm-svn: 240842	2015-06-26 22:04:20 +00:00
Philip Reames	a3c6f0048c	[Verifier] Verify invokes of intrinsics We support invoking a subset of llvm's intrinsics, but the verifier didn't account for this. We had previously added a special case to verify invokes of statepoints. By generalizing the code in terms of CallSite, we can verify invokes of other intrinsics as well. Interestingly, this found one test case which was invalid. Note: I'm deliberately leaving the naming change from CI to CS to a follow up change. That will happen shortly, I just wanted to reduce the diff to make it clear what was happening with this one. Differential Revision: http://reviews.llvm.org/D10118 llvm-svn: 240836	2015-06-26 21:39:44 +00:00
Tom Stellard	91efe9cebe	AMDGPU/SI: Set ELF OS/ABI to ELFOSABI_AMDGPU_HSA Reviewers: arsenm, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10708 llvm-svn: 240832	2015-06-26 21:15:11 +00:00
Mehdi Amini	c83ac464e6	DataLayout now returns a const ref to its member string representation There was no particular reason to return by value in the first place. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 240826	2015-06-26 20:44:16 +00:00
Benjamin Kramer	f2bbd9cf54	Add Value.def to the list of textual includes, excluding it from the modules build. llvm-svn: 240823	2015-06-26 20:16:44 +00:00
Nemanja Ivanovic	f502a428e6	Add missing builtins to the PPC back end for ABI compliance (vol. 1) This patch corresponds to review: http://reviews.llvm.org/D10638 This is the back end portion of patch http://reviews.llvm.org/D10637 It just adds the code gen and intrinsic functions necessary to support that patch to the back end. llvm-svn: 240820	2015-06-26 19:26:53 +00:00
Pete Cooper	9271ccc345	Convert a bunch of loops to foreach. NFC. This uses the new SDNode::op_values() iterator range committed in r240805. llvm-svn: 240817	2015-06-26 19:18:49 +00:00
Benjamin Kramer	31f6733af1	Make header parse standalone. NFC. llvm-svn: 240814	2015-06-26 19:04:11 +00:00
Rafael Espindola	bce4801943	Fix error handling in getString and simplify callers. llvm-svn: 240810	2015-06-26 18:42:17 +00:00
Rafael Espindola	e7d47dc48e	Delete dead code. NFC. llvm-svn: 240807	2015-06-26 18:32:53 +00:00
Pete Cooper	3af9a25b65	Add op_values() to iterate over the SDValue operands of an SDNode. SDNode already had ops() which would iterate over the operands and return SDUse*. This version instead gets the SDValue's out of the SDUse's so that we can use foreach in more places. Reviewed by David Blaikie. llvm-svn: 240805	2015-06-26 18:17:36 +00:00
David Blaikie	b447ac6435	Move VectorUtils from Transforms to Analysis to correct layering violation llvm-svn: 240804	2015-06-26 18:02:52 +00:00
David Blaikie	1213dbf1fd	Fix ODR violation waiting to happen by making static function definitions in VectorUtils.h non-static and defined out of line Patch by Ashutosh Nema Differential Revision: http://reviews.llvm.org/D10682 llvm-svn: 240794	2015-06-26 16:57:30 +00:00
Alex Lorenz	33f0aef32f	MIR Serialization: Serialize machine basic block operands. This commit serializes machine basic block operands. The machine basic block operands use the following syntax: %bb.<id>[.<name>] This commit also modifies the YAML representation for the machine basic blocks - a new, required field 'id' is added to the MBB YAML mapping. The id is used to resolve the MBB references to the actual MBBs. And while the name of the MBB can be included in a MBB reference, this name isn't used to resolve MBB references - as it's possible that multiple MBBs will reference the same BB and thus they will have the same name. If the name is specified, the parser will verify that it is equal to the name of the MBB with the specified id. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10608 llvm-svn: 240792	2015-06-26 16:46:11 +00:00
Rafael Espindola	58d6edc041	ELF: Simplify the rel/rela implementation. Now the rela class inherits from rel and just adds the addend. llvm-svn: 240790	2015-06-26 15:27:04 +00:00
Rafael Espindola	854038ed1a	Rename getObjectFile to getObject for consistency. llvm-svn: 240785	2015-06-26 14:51:16 +00:00
Rafael Espindola	b39953d2df	Implement elf_section_iterator and getELFType(). And with those, simplify getSymbolNMTypeChar. llvm-svn: 240780	2015-06-26 13:11:15 +00:00
Rafael Espindola	edd5f84419	Expose getFlags via ELFSectionRef. llvm-svn: 240779	2015-06-26 12:44:10 +00:00
Rafael Espindola	41401e9c80	Add an ELFSectionRef class and use it to expose getSectionType. llvm-svn: 240778	2015-06-26 12:33:37 +00:00
Rafael Espindola	2fa80cc5fd	Simplify getSymbolType. This is still a really odd function. Most calls are in object format specific contexts and should probably be replaced with a more direct query, but at least now this is not too obnoxious to use. llvm-svn: 240777	2015-06-26 12:18:49 +00:00
Rafael Espindola	eef7ffe2e9	Make getOther ELF only. No other format has this field. llvm-svn: 240774	2015-06-26 11:39:57 +00:00
Hao Liu	1c1e0c9e71	[InterleavedAccess] Add a pass InterleavedAccess to identify interleaved memory accesses and transform into target specific intrinsics. E.g. An interleaved load (Factor = 2): %wide.vec = load <8 x i32>, <8 x i32>* %ptr %v0 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <0, 2, 4, 6> %v1 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <1, 3, 5, 7> It can be transformed into a ld2 intrinsic in AArch64 backend or a vld2 intrinsic in ARM backend. E.g. An interleaved store (Factor = 3): %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr It can be transformed into a st3 intrinsic in AArch64 backend or a vst3 intrinsic in ARM backend. Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240751	2015-06-26 02:10:27 +00:00
Duncan P. N. Exon Smith	aaa68a60a9	AsmPrinter: More explicitly scope iterator for MSVC r240748 seems to be on the right path. Be more explicit. http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/1961/ llvm-svn: 240750	2015-06-26 00:53:44 +00:00
Duncan P. N. Exon Smith	8caacfabfc	AsmPrinter: Explicitly scope iterator for MSVC Try to placate bots by explicitly scoping a conversion constructor from `iterator` to `const_iterator`. http://lab.llvm.org:8011/builders/sanitizer-windows/builds/5931/ llvm-svn: 240748	2015-06-26 00:41:53 +00:00
Duncan P. N. Exon Smith	827200c822	AsmPrinter: Use an intrusively linked list for DIE::Children Replace the `std::vector<>` for `DIE::Children` with an intrusively linked list. This is a strict memory improvement: it requires no auxiliary storage, and reduces `sizeof(DIE)` by one pointer. It also factors out the DIE-related malloc traffic. This drops llc memory usage from 735 MB down to 718 MB, or ~2.3%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240736	2015-06-25 23:52:10 +00:00
Duncan P. N. Exon Smith	4fb1f9cda6	AsmPrinter: Convert DIE::Values to a linked list Change `DIE::Values` to a singly linked list, where each node is allocated on a `BumpPtrAllocator`. In order to support `push_back()`, the list is circular, and points at the tail element instead of the head. I abstracted the core list logic out to `IntrusiveBackList` so that it can be reused for `DIE::Children`, which also cares about `push_back()`. This drops llc memory usage from 799 MB down to 735 MB, about 8%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240733	2015-06-25 23:46:41 +00:00
Michael J. Spencer	7d313d72ef	[ELF] Move ELF{32,64}{L,B}E typedefs to llvm. llvm-svn: 240731	2015-06-25 23:41:23 +00:00
Michael J. Spencer	deeaa31ce5	[ELF] Add some accessors for lld. llvm-svn: 240730	2015-06-25 23:40:41 +00:00
Rafael Espindola	dbb6bd3345	Add an ELFSymbolRef type. This allows user code to say Sym.getSize() instead of having to manually fetch the object. llvm-svn: 240708	2015-06-25 22:10:04 +00:00
Michael J. Spencer	594c028183	[Object][ELF] Add support for dumping dynamic relocations when sections are stripped. llvm-svn: 240703	2015-06-25 21:47:32 +00:00
Rafael Espindola	101824d345	llvm-nm: Don't print mapping symbols. This matches the behavior of gnu nm. Fixes pr23930. llvm-svn: 240695	2015-06-25 21:00:51 +00:00
Douglas Katzman	eb283241e8	Add Arg::getValues method with const 'this' and const result llvm-svn: 240673	2015-06-25 18:48:26 +00:00
Jonathan Roelofs	4dd6173bac	Doxygen-ify a few comments. NFC llvm-svn: 240647	2015-06-25 15:06:47 +00:00
Rafael Espindola	b85e10c17f	Use range loop. NFC. llvm-svn: 240645	2015-06-25 15:00:38 +00:00
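
Commit 4fb1f9cda6 above describes `DIE::Values` as a singly linked list that is circular and points at the tail element so that `push_back()` stays cheap. The following is a minimal sketch of that idea only, not LLVM's actual `IntrusiveBackList` code: the names `Node` and `BackList` are hypothetical, and the nodes live on the stack here rather than on a `BumpPtrAllocator`.

```cpp
#include <cassert>
#include <vector>

// Hypothetical node type: the link is stored inside the element (intrusive),
// so the list itself needs no auxiliary storage.
struct Node {
  int Value;
  Node *Next = nullptr;
  explicit Node(int V) : Value(V) {}
};

// Circular singly linked list that keeps only a pointer to the tail.
// Tail->Next is the head, so both ends are reachable from one pointer.
class BackList {
  Node *Tail = nullptr;

public:
  bool empty() const { return Tail == nullptr; }

  // O(1) append: splice the new node between the old tail and the head.
  void push_back(Node &N) {
    if (Tail) {
      N.Next = Tail->Next; // new node points back at the head
      Tail->Next = &N;     // old tail now points at the new node
    } else {
      N.Next = &N;         // a single node points at itself
    }
    Tail = &N;
  }

  // Walk from head to tail and collect the values in insertion order.
  std::vector<int> values() const {
    std::vector<int> Out;
    if (!Tail)
      return Out;
    Node *Head = Tail->Next;
    Node *N = Head;
    do {
      Out.push_back(N->Value);
      N = N->Next;
    } while (N != Head);
    return Out;
  }
};
```

Storing the tail rather than the head is what makes this work with a single pointer: the head is always one hop away through `Tail->Next`, whereas a head-only singly linked list would need a full traversal to append.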

24132 Commits