llvm-project

Commit Graph

Author	SHA1	Message	Date
Vasileios Kalintiris	8fcb3986d0	[mips][FastISel] Implement srem/urem and sdiv/udiv instructions. Summary: Implement the LLVM assembly urem/srem and sdiv/udiv instructions in MIPS FastISel. Based on a patch by Reed Kotler. Test Plan: srem1.ll div1.ll test-suite at O0/O2 for mips32 r1/r2 Reviewers: dsanders, rkotler Subscribers: llvm-commits, rfuhler Differential Revision: http://reviews.llvm.org/D7028 llvm-svn: 238757	2015-06-01 16:17:37 +00:00
Vasileios Kalintiris	127f894b55	[mips][FastISel] Implement the select statement for MIPS FastISel. Summary: Implement the LLVM IR select statement for MIPS FastISelsel. Based on a patch by Reed Kotler. Test Plan: "Make check" test included now. Passes test-suite at O2/O0 mips32 r1/r2. Reviewers: dsanders, rkotler Subscribers: llvm-commits, rfuhler Differential Revision: http://reviews.llvm.org/D6774 llvm-svn: 238756	2015-06-01 15:56:40 +00:00
Vasileios Kalintiris	7f680e156e	[mips][FastISel] Clobber HI0/LO0 registers in MUL instructions. Summary: The contents of the HI/LO registers are unpredictable after the execution of the MUL instruction. In addition to implicitly defining these registers in the MUL instruction definition, we have to mark those registers as dead too. Without this the fast register allocator is running out of registers when the MUL instruction is followed by another one that tries to allocate the AC0 register. Based on a patch by Reed Kotler. Reviewers: dsanders, rkotler Subscribers: llvm-commits, rfuhler Differential Revision: http://reviews.llvm.org/D9825 llvm-svn: 238755	2015-06-01 15:48:09 +00:00
Hans Wennborg	9c806c432e	Drop remaining Dragonegg support in release scripts r236077 and r236081 dropped Dragonegg support from the release scripts but left some pieces. The most notable change is that Dragonegg won't be tagged any more. Patch by David Wiberg <dwiberg@gmail.com>. llvm-svn: 238753	2015-06-01 15:37:58 +00:00
Rafael Espindola	7f7caf9167	Fix relocation selection for foo-. on mips. This handles only the 32 bit case. llvm-svn: 238751	2015-06-01 15:10:51 +00:00
Rafael Espindola	ccb8d1a114	Simplify code, NFC. llvm-svn: 238750	2015-06-01 14:58:29 +00:00
Artur Pilipenko	a82f8db0b3	Add isConstant argument to MDBuilder::createTBAAStructTagNode According to the TBAA description struct-path tag node can have an optional IsConstant field. Add corresponding argument to MDBuilder::createTBAAStructTagNode. Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D10160 llvm-svn: 238749	2015-06-01 14:53:55 +00:00
Colin LeMahieu	a739a4b3c7	[Hexagon] Adding basic ELF relocation generation and testing advanced relaxation codepath. llvm-svn: 238748	2015-06-01 14:51:26 +00:00
Rafael Espindola	499c99c229	The fragment implies the section, don't store both. This reduces MCSymbol from 64 to 56 bytes on x86_64. llvm-svn: 238747	2015-06-01 14:34:40 +00:00
Asaf Badouh	f6289f24f7	First commit test. llvm-svn: 238745	2015-06-01 13:56:00 +00:00
Greg Bedwell	0100439162	[CMake] Revert commits r238740/r238741 for embedding Windows version info. The clang Windows bots are showing mysterious failures. Reverting until I can figure out what's going on. llvm-svn: 238744	2015-06-01 13:40:14 +00:00
Elena Demikhovsky	67afb630e1	AVX-512: Optimized vector shuffle for v16f32 and v16i32 types. llvm-svn: 238743	2015-06-01 13:26:18 +00:00
Luke Cheeseman	4c476858cc	Removing commited assembly file. llvm-svn: 238742	2015-06-01 13:18:53 +00:00
Greg Bedwell	1fec7e41df	remove the use of the LOCATION CMake variable from r238740. It caused the following failure: "Policy CMP0026 is not set: Disallow use of the LOCATION target property." llvm-svn: 238741	2015-06-01 13:06:10 +00:00
Greg Bedwell	e0539fc50c	In MSVC builds embed a VERSIONINFO resource in our exe and DLL files. This embeds Windows version information into our executables and DLLs. The most visible place to view this data is in the details tab of the file properties window in Windows explorer. Differential Revision: http://reviews.llvm.org/D7828 llvm-svn: 238740	2015-06-01 12:41:55 +00:00
Luke Cheeseman	85fd06d389	Re-commit of r238201 with fix for building with shared libraries. llvm-svn: 238739	2015-06-01 12:02:47 +00:00
Elena Demikhovsky	3582eb3b39	AVX-512: Implemented VRANGEPD and VRANGEPD instructions for SKX. Implemented DAG lowering for all these forms. Added tests for encoding. By Igor Breger (igor.breger@intel.com) llvm-svn: 238738	2015-06-01 11:05:34 +00:00
Elena Demikhovsky	0c41088ebf	AVX-512: Implemented vector shuffle lowering for v8i64 and v8f64 types. I removed the vector-shuffle-512-v8.ll, it is auto-generated test, not valid any more. llvm-svn: 238735	2015-06-01 09:49:53 +00:00
David Majnemer	279306cb0d	[WinCOFF] Ignore .safeseh for non-x86 architectures We don't want to bother with creating .sxdata sections on Win64; all the relevant information is already in the .pdata section. llvm-svn: 238730	2015-06-01 07:34:26 +00:00
Elena Demikhovsky	75ede68793	AVX-512: added all forms of VPSHUFD and VPSHUFHW, VPSHUFLW including encodings. llvm-svn: 238729	2015-06-01 07:17:23 +00:00
Elena Demikhovsky	42c96d9c0a	AVX-512: Implemented VFIXUPIMMPD and VFIXUPIMMPS instructions for KNL and SKX Implemented DAG lowering for all these forms. Added tests for encoding. by Igor Breger (igor.breger@intel.com) llvm-svn: 238728	2015-06-01 06:50:49 +00:00
Craig Topper	6548196c6f	[TableGen] Move a couple virtual methods out of line so vtable anchors can be removed. NFC llvm-svn: 238727	2015-06-01 06:44:18 +00:00
Craig Topper	8eb887fefc	[TableGen] Remove unnecessary explicit initialization to null of a unique_ptr. NFC llvm-svn: 238726	2015-06-01 06:44:16 +00:00
Craig Topper	a0303bf5ef	[TableGen] Remove unnecessary forward declarations. NFC llvm-svn: 238725	2015-06-01 06:44:14 +00:00
Elena Demikhovsky	dd68d0cb0f	AVX-512: Fixed a bug in compress and expand intrinsics. By Igor Breger (igor.breger@intel.com) llvm-svn: 238724	2015-06-01 06:30:13 +00:00
Matt Arsenault	bd7d80a4a6	Add address space argument to isLegalAddressingMode This is important because of different addressing modes depending on the address space for GPU targets. This only adds the argument, and does not update any of the uses to provide the correct address space. llvm-svn: 238723	2015-06-01 05:31:59 +00:00
David Blaikie	f5147ef0b9	[opaque pointer type] Explicitly store the pointee type of the result of a GEP Alternatively, this type could be derived on-demand whenever getResultElementType is called - if someone thinks that's the better choice (simple time/space tradeoff), I'm happy to give it a go. llvm-svn: 238716	2015-06-01 03:09:34 +00:00
Rafael Espindola	3fc422d50b	Try to fix the build of IntelJITEventListener. llvm-svn: 238709	2015-06-01 02:18:14 +00:00
Rafael Espindola	2641014812	Rename HasData to IsRegistered. There is no MCSectionData, so the old name is now meaningless. Also remove some asserts/checks that were there just because the information they used was in MCSectionData. llvm-svn: 238708	2015-06-01 01:52:18 +00:00
Rafael Espindola	cc91cc1f3a	Remove trivial forwarding function. llvm-svn: 238707	2015-06-01 01:39:15 +00:00
Rafael Espindola	63702e2bf7	Store a bit in MCSection saying if it was registered with MCAssembler. With this we can replace a SetVector with a plain std::vector. llvm-svn: 238706	2015-06-01 01:30:01 +00:00
Rafael Espindola	a66395e184	Use a bitfield. NFC. llvm-svn: 238705	2015-06-01 01:05:07 +00:00
Rafael Espindola	09d3ecc60d	Use a 32 bit field for the symbol index. Even 64 ELF uses a 32 bit field to refer to symbols. llvm-svn: 238704	2015-06-01 00:58:31 +00:00
Rafael Espindola	5eb02e45e3	Simplify another function that doesn't fail. llvm-svn: 238703	2015-06-01 00:27:26 +00:00
David Majnemer	7666be70e4	[PHITransAddr] Don't translate unreachable values Unreachable values may use themselves in strange ways due to their dominance property. Attempting to translate through them can lead to infinite recursion, crashing LLVM. Instead, claim that we weren't able to translate the value. This fixes PR23096. llvm-svn: 238702	2015-06-01 00:15:08 +00:00
David Majnemer	fc41f63d77	[PHITransAddr] Use std::find instead of std::count There is no need to visit all the elements if we are merely performing a membership check. NFCI. llvm-svn: 238701	2015-06-01 00:15:04 +00:00
Rafael Espindola	a4d22472f3	Simplify interface of function that doesn't fail. llvm-svn: 238700	2015-05-31 23:52:50 +00:00
Keno Fischer	c2c6018cce	[DWARF] Fix a bug in line info handling This fixes a bug in the line info handling in the dwarf code, based on a problem I when implementing RelocVisitor support for MachO. Since addr+size will give the first address past the end of the function, we need to back up one line table entry. Fix this by looking up the end_addr-1, which is the last address in the range. Note that this also removes a duplicate output from the llvm-rtdyld line table dump. The relevant line is the end_sequence one in the line table and has an offset of the first address part the end of the range and hence should not be included. Also factor out the common functionality into a separate function. This comes up on MachO much more than on ELF, since MachO doesn't store the symbol size separately, hence making said situation always occur. Differential Revision: http://reviews.llvm.org/D9925 llvm-svn: 238699	2015-05-31 23:37:04 +00:00
Rafael Espindola	a82ce1d97a	For COFF and MachO, compute the gap between to symbols. Before r238028 we used to do this in O(N^2), now we do it in O(N log N). llvm-svn: 238698	2015-05-31 23:15:35 +00:00
NAKAMURA Takumi	072a58a7fd	ARMConstantIslandPass.cpp: Prune an empty \brief. [-Wdocumentation] llvm-svn: 238697	2015-05-31 23:05:35 +00:00
Colin LeMahieu	a97365b8e0	[Hexagon] Including raw_ostream for debug builds. llvm-svn: 238695	2015-05-31 22:29:33 +00:00
Colin LeMahieu	b819d3c465	[Hexagon] classes are actually structs. llvm-svn: 238694	2015-05-31 22:18:42 +00:00
Rafael Espindola	da1762754b	Use a range loop. NFC. llvm-svn: 238693	2015-05-31 22:13:51 +00:00
Colin LeMahieu	b23c47bab3	[Hexagon] Adding MC packet shuffler. llvm-svn: 238692	2015-05-31 21:57:09 +00:00
Tim Northover	a603c4076c	ARM: recommit r237590: allow jump tables to be placed as constant islands. The original version didn't properly account for the base register being modified before the final jump, so caused miscompilations in Chromium and LLVM. I've fixed this and tested with an LLVM self-host (I don't have the means to build & test Chromium). The general idea remains the same: in pathological cases jump tables can be too far away from the instructions referencing them (like other constants) so they need to be movable. Should fix PR23627. llvm-svn: 238680	2015-05-31 19:22:07 +00:00
Benjamin Kramer	412c4dbbd9	[MC] Simplify code. No functionality change intended. llvm-svn: 238676	2015-05-31 18:49:28 +00:00
Davide Italiano	3dbd7ae0e3	Clarify how the binary file checked in was generated. llvm-svn: 238665	2015-05-30 22:43:36 +00:00
Colin LeMahieu	b510fb38f5	[Hexagon] Adding override specifier and removing erroneous assertion llvm-svn: 238664	2015-05-30 20:03:07 +00:00
Keno Fischer	281b6941cf	Add RelocVisitor support for MachO This commit adds partial support for MachO relocations to RelocVisitor. A simple test case is added to show that relocations are indeed being applied and that using llvm-dwarfdump on MachO files no longer errors. Correctness is not yet tested, due to an unrelated bug in DebugInfo, which will be fixed with appropriate testcase in a followup commit. Differential Revision: http://reviews.llvm.org/D8148 llvm-svn: 238663	2015-05-30 19:44:53 +00:00
Colin LeMahieu	86f218e7ec	[Hexagon] Adding basic relaxation functionality. llvm-svn: 238660	2015-05-30 18:55:47 +00:00
Colin LeMahieu	a01780facf	[MC] Allow backends to decide relaxation for unresolved fixups. Differential Revision: http://reviews.llvm.org/D8217 llvm-svn: 238659	2015-05-30 18:42:22 +00:00
Kostya Serebryany	2ea204e645	[lib/Fuzzer] make assertions more informative and update comments for the user-supplied mutator llvm-svn: 238658	2015-05-30 17:33:13 +00:00
Benjamin Kramer	977d598d78	[MC] Reorder MCSymbol members to reduce padding. sizeof(MCSymbol) goes from 72 to 64 bytes on x86_64. llvm-svn: 238655	2015-05-30 13:52:30 +00:00
Simon Pilgrim	f19ef9f741	Stripped trailing whitespace. NFC. llvm-svn: 238654	2015-05-30 13:01:42 +00:00
Renato Golin	5d78c9ce58	Comment change. NFC That comment misleads the current discussions in mentioned bug. Leave the discussions to the bug. Also, adding a future change FIXME. llvm-svn: 238653	2015-05-30 10:44:07 +00:00
Chandler Carruth	cb58910ce8	[x86] Unify the horizontal adding used for popcount lowering taking the best approach of each. For vNi16, we use SHL + ADD + SRL pattern that seem easily the best. For vNi32, we use the PUNPCK + PSADBW + PACKUSWB pattern. In some cases there is a huge improvement with this in IACA's estimated throughput -- over 2x higher throughput!!!! -- but the measurements are too good to be true. In one narrow case, the SHL + ADD + SHL + ADD + SRL pattern looks slightly faster, but I'm not sure I believe any of the measurements at this point. Both are the exact same uops though. Hard to be confident of anything past that. If anyone wants to collect very detailed (Agner-level) timings with the result of this patch, or with the i32 case replaced with SHL + ADD + SHl + ADD + SRL, I'd be very interested. Note that you'll need to test it on both Ivybridge and Haswell, with both SSE3, SSSE3, and AVX selected as I saw unique behavior in each of these buckets with IACA all of which should be checked against measured performance. But this patch is still a useful improvement by dropping duplicate work and getting the much nicer PSADBW lowering for v2i64. I'd still like to rephrase this in terms of generic horizontal sum. It's a bit lame to have a special case of that just for popcount. llvm-svn: 238652	2015-05-30 10:35:03 +00:00
Renato Golin	230d298320	[ARMTargetParser] Move IAS arch ext parser. NFC The plan was to move the whole table into the already existing ArchExtNames but some fields depend on a table-generated file, and we don't yet have this feature in the generic lib/Support side. Once the minimum target-specific table-generated files are available in a generic fashion to these libraries, we'll have to keep it in the ASM parser. llvm-svn: 238651	2015-05-30 10:30:02 +00:00
Chandler Carruth	11e6f8fed1	[x86] Split out the horizontal byte sum lowering component of the LUT lowering into a helper function. NFC. llvm-svn: 238650	2015-05-30 09:46:16 +00:00
Craig Topper	15864f1518	[TableGen] Merge RecTy::typeIsConvertibleTo and RecTy::baseClassOf. NFC typeIsConvertibleTo was just calling baseClassOf(this) on the argument passed to it, but there weren't different signatures for baseClassOf so passing 'this' didn't really do anything interesting. typeIsConvertibleTo could have just been a non-virtual method in RecTy. But since that would be kind of a silly method, I instead re-distributed the logic from baseClassOf into typeIsConvertibleTo. llvm-svn: 238648	2015-05-30 07:36:01 +00:00
Craig Topper	974ed6d3e7	Fix indentation. NFC. llvm-svn: 238647	2015-05-30 07:35:21 +00:00
Craig Topper	9581906983	[TableGen] Remove all the variations of RecTy::convertValue and just handle the conversions in convertInitializerTo directly. This saves a bunch of vtable entries. NFC llvm-svn: 238646	2015-05-30 07:34:51 +00:00
Chandler Carruth	3bedf4407b	[x86] Update the order of instructions after I switched to a bitcast helper that skips creating a cast when it isn't necessary. It's really somewhat concerning that this was caused by the the presence of a no-op bitcast, but... llvm-svn: 238642	2015-05-30 06:02:37 +00:00
David Majnemer	4eecd30d19	[WinCOFF] Add support for the .safeseh directive .safeseh adds an entry to the .sxdata section to register all the appropriate functions which may handle an exception. This entry is not a relocation to the symbol but instead the symbol table index of the function. llvm-svn: 238641	2015-05-30 04:56:02 +00:00
Chandler Carruth	9cc2516676	[x86] Replace the long spelling of getting a bitcast with the much shorter one. NFC. In addition to being much shorter to type and requiring fewer arguments, this change saves over 30 lines from this one file, all wasted on total boilerplate... llvm-svn: 238640	2015-05-30 04:23:13 +00:00
Chandler Carruth	060cdca996	[x86] Replace the long spelling of getting a bitcast with the new short spelling. NFC. llvm-svn: 238639	2015-05-30 04:19:57 +00:00
Chandler Carruth	502b23a7a9	[sdag] Add the helper I most want to the DAG -- building a bitcast around a value using its existing SDLoc. Start using this in just one function to save omg lines of code. llvm-svn: 238638	2015-05-30 04:14:10 +00:00
Chandler Carruth	2599da3cfd	[x86] Restore the bitcasts I removed when refactoring this to avoid shifting vectors of bytes as x86 doesn't have direct support for that. This removes a bunch of redundant masking in the generated code for SSE2 and SSE3. In order to avoid the really significant code size growth this would have triggered, I also factored the completely repeatative logic for shifting and masking into two lambdas which in turn makes all of this much easier to read IMO. llvm-svn: 238637	2015-05-30 04:05:11 +00:00
Chandler Carruth	6ba9730a4e	[x86] Implement a faster vector population count based on the PSHUFB in-register LUT technique. Summary: A description of this technique can be found here: http://wm.ite.pl/articles/sse-popcount.html The core of the idea is to use an in-register lookup table and the PSHUFB instruction to compute the population count for the low and high nibbles of each byte, and then to use horizontal sums to aggregate these into vector population counts with wider element types. On x86 there is an instruction that will directly compute the horizontal sum for the low 8 and high 8 bytes, giving vNi64 popcount very easily. Various tricks are used to get vNi32 and vNi16 from the vNi8 that the LUT computes. The base implemantion of this, and most of the work, was done by Bruno in a follow up to D6531. See Bruno's detailed post there for lots of timing information about these changes. I have extended Bruno's patch in the following ways: 0) I committed the new tests with baseline sequences so this shows a diff, and regenerated the tests using the update scripts. 1) Bruno had noticed and mentioned in IRC a redundant mask that I removed. 2) I introduced a particular optimization for the i32 vector cases where we use PSHL + PSADBW to compute the the low i32 popcounts, and PSHUFD + PSADBW to compute doubled high i32 popcounts. This takes advantage of the fact that to line up the high i32 popcounts we have to shift them anyways, and we can shift them by one fewer bit to effectively divide the count by two. While the PSHUFD based horizontal add is no faster, it doesn't require registers or load traffic the way a mask would, and provides more ILP as it happens on different ports with high throughput. 3) I did some code cleanups throughout to simplify the implementation logic. 4) I refactored it to continue to use the parallel bitmath lowering when SSSE3 is not available to preserve the performance of that version on SSE2 targets where it is still much better than scalarizing as we'll still do a bitmath implementation of popcount even in scalar code there. With #1 and #2 above, I analyzed the result in IACA for sandybridge, ivybridge, and haswell. In every case I measured, the throughput is the same or better using the LUT lowering, even v2i64 and v4i64, and even compared with using the native popcnt instruction! The latency of the LUT lowering is often higher than the latency of the scalarized popcnt instruction sequence, but I think those latency measurements are deeply misleading. Keeping the operation fully in the vector unit and having many chances for increased throughput seems much more likely to win. With this, we can lower every integer vector popcount implementation using the LUT strategy if we have SSSE3 or better (and thus have PSHUFB). I've updated the operation lowering to reflect this. This also fixes an issue where we were scalarizing horribly some AVX lowerings. Finally, there are some remaining cleanups. There is duplication between the two techniques in how they perform the horizontal sum once the byte population count is computed. I'm going to factor and merge those two in a separate follow-up commit. Differential Revision: http://reviews.llvm.org/D10084 llvm-svn: 238636	2015-05-30 03:20:59 +00:00
Chandler Carruth	c2e400de83	[x86] Restructure the parallel bitmath lowering of popcount into a separate routine, generalize it to work for all the integer vector sizes, and do general code cleanups. This dramatically improves lowerings of byte and short element vector popcount, but more importantly it will make the introduction of the LUT-approach much cleaner. The biggest cleanup I've done is to just force the legalizer to do the bitcasting we need. We run these iteratively now and it makes the code much simpler IMO. Other changes were minor, and mostly naming and splitting things up in a way that makes it more clear what is going on. The other significant change is to use a different final horizontal sum approach. This is the same number of instructions as the old method, but shifts left instead of right so that we can clear everything but the final sum with a single shift right. This seems likely better than a mask which will usually have to read the mask from memory. It is certaily fewer u-ops. Also, this will be temporary. This and the LUT approach share the need of horizontal adds to finish the computation, and we have more clever approaches than this one that I'll switch over to. llvm-svn: 238635	2015-05-30 03:20:55 +00:00
Jim Grosbach	13760bd152	MC: Clean up MCExpr naming. NFC. llvm-svn: 238634	2015-05-30 01:25:56 +00:00
Filipe Cabecinhas	14e686774d	[BitcodeReader] Change an assert to a call to a call to Error() It's reachable from user input. Bug found with AFL fuzz. llvm-svn: 238633	2015-05-30 00:17:20 +00:00
Fiona Glaser	b82e33106b	SelectionDAG: fix logic for promoting shift types r238503 fixed the problem of too-small shift types by promoting them during legalization, but the correct solution is to promote only the operands that actually demand promotion. This fixes a crash on an out-of-tree target caused by trying to promote an operand that can't be promoted. llvm-svn: 238632	2015-05-29 23:37:22 +00:00
Reid Kleckner	e6531a5588	[WinEH] Adjust the 32-bit SEH prologue to better match reality It turns out that _except_handler3 and _except_handler4 really use the same stack allocation layout, at least today. They just make different choices about encoding the LSDA. This is in preparation for lowering the llvm.eh.exceptioninfo(). llvm-svn: 238627	2015-05-29 22:57:46 +00:00
Jingyue Wu	2cfd9d574d	[docs] fix the declarations of the llvm.nvvm.ptr.gen.to.* intrinsics Summary: These intrinsics should take a generic input address space and outputs a non-generic address space. Test Plan: no Reviewers: jholewinski, eliben Reviewed By: eliben Subscribers: eliben, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10132 llvm-svn: 238620	2015-05-29 22:18:03 +00:00
Reid Kleckner	173a72524f	Disable FP elimination in funcs using 32-bit MSVC EH personalities The value in 'ebp' acts as an implicit argument to the outlined handlers, and is recovered with frameaddress(1). llvm-svn: 238619	2015-05-29 21:58:11 +00:00
Rafael Espindola	4d37b2a259	Remove getData. This completes the mechanical part of merging MCSymbol and MCSymbolData. llvm-svn: 238617	2015-05-29 21:45:01 +00:00
Reid Kleckner	5b8ebfbc25	Only add the EH state insertion pass on 32-bit Windows llvm-svn: 238612	2015-05-29 20:43:10 +00:00
Rafael Espindola	beb6060a51	Remove the MCSymbolData typedef. The getData member function is next. llvm-svn: 238611	2015-05-29 20:41:47 +00:00
Rafael Espindola	e45f0c1609	Merge MCSymbol and MCSymbolData. As a transition hack leave MCSymbolData as a typedef of MCSymbol. I will be removing that in a second. llvm-svn: 238609	2015-05-29 20:31:23 +00:00
Kostya Serebryany	3fe7682fb0	[lib/Fuzzer] relax an assertion llvm-svn: 238608	2015-05-29 20:31:17 +00:00
Rafael Espindola	b5d316bfc3	Rename getOrCreateSymbolData to registerSymbol and return void. Another step in merging MCSymbol and MCSymbolData. llvm-svn: 238607	2015-05-29 20:21:02 +00:00
Benjamin Kramer	f5e2fc474d	Replace push_back(Constructor(foo)) with emplace_back(foo) for non-trivial types If the type isn't trivially moveable emplace can skip a potentially expensive move. It also saves a couple of characters. Call sites were found with the ASTMatcher + some semi-automated cleanup. memberCallExpr( argumentCountIs(1), callee(methodDecl(hasName("push_back"))), on(hasType(recordDecl(has(namedDecl(hasName("emplace_back")))))), hasArgument(0, bindTemporaryExpr( hasType(recordDecl(hasNonTrivialDestructor())), has(constructExpr()))), unless(isInTemplateInstantiation())) No functional change intended. llvm-svn: 238602	2015-05-29 19:43:39 +00:00
Rafael Espindola	2229d33a9c	Move Flags from MCSymbolData to MCSymbol. llvm-svn: 238598	2015-05-29 19:07:51 +00:00
Rafael Espindola	d31c0e2673	Fix build without asserts. llvm-svn: 238597	2015-05-29 19:04:38 +00:00
Rafael Espindola	e3b2acf274	Pass MCSymbols to the helper functions in MCELF.h. llvm-svn: 238596	2015-05-29 18:47:23 +00:00
Chris Bieneman	7c445dd6c7	[CMake] Bug 23468 - LLVM_OPTIMIZED_TABLEGEN does not work with Visual Studio Summary: Multi-configuration builds put their binaries into ${CMAKE_BINARY_DIR}/Release/bin/. The table-gen cross-compilation support needs to take that into account. Reviewers: yaron.keren Reviewed By: yaron.keren Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10102 llvm-svn: 238592	2015-05-29 18:34:41 +00:00
Rafael Espindola	e19faeed71	Use an explicitly defaulted constructor. llvm-svn: 238591	2015-05-29 18:31:17 +00:00
Rafael Espindola	ece40ca43d	Pass a MCSymbol to needsRelocateWithSymbol. llvm-svn: 238589	2015-05-29 18:26:09 +00:00
Matthias Braun	165d467125	MachineCopyPropagation: Remove the copies instead of using KILL instructions. For some history here see the commit messages of r199797 and r169060. The original intent was to fix cases like: %EAX<def> = COPY %ECX<kill>, %RAX<imp-def> %RCX<def> = COPY %RAX<kill> where simply removing the copies would have RCX undefined as in terms of machine operands only the ECX part of it is defined. The machine verifier would complain about this so 169060 changed such COPY instructions into KILL instructions so some super-register imp-defs would be preserved. In r199797 it was finally decided to always do this regardless of super-register defs. But this is wrong, consider: R1 = COPY R0 ... R0 = COPY R1 getting changed to: R1 = KILL R0 ... R0 = KILL R1 It now looks like R0 dies at the first KILL and won't be alive until the second KILL, while in reality R0 is alive and must not change in this part of the program. As this only happens after register allocation there is not much code still performing liveness queries so the issue was not noticed. In fact I didn't manage to create a testcase for this, without unrelated changes I am working on at the moment. The fix is simple: As of r223896 the MachineVerifier allows reads from partially defined registers, so the whole transforming COPY->KILL thing is not necessary anymore. This patch also changes a similar (but more benign case as the def and src are the same register) case in the VirtRegRewriter. Differential Revision: http://reviews.llvm.org/D10117 llvm-svn: 238588	2015-05-29 18:19:25 +00:00
Frederic Riss	3733c03d3b	YAML traits need to be in the llvm::yaml namespace. Hope this fixes the bits, eg: http://lab.llvm.org:8011/builders/clang-hexagon-elf/builds/27147 llvm-svn: 238586	2015-05-29 18:14:55 +00:00
Frederic Riss	4939e6a1b8	[YAMLIO] Make line-wrapping configurable and test it. Summary: We would wrap flow mappings and sequences when they go over a hardcoded 70 characters limit. Make the wrapping column configurable (and default to 70 co the change should be NFC for current users). Passing 0 allows to completely suppress the wrapping which makes it easier to handle in tools like FileCheck. Reviewers: bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10109 llvm-svn: 238584	2015-05-29 17:56:28 +00:00
Rafael Espindola	14672508b1	Move common symbol related information from MCSectionData to MCSymbol. llvm-svn: 238583	2015-05-29 17:48:04 +00:00
Rafael Espindola	66ccf49a0c	Store MCSymbols in PendingLabels. llvm-svn: 238582	2015-05-29 17:41:59 +00:00
Rafael Espindola	7c23cba65c	Move SymbolSize from MCSymbolData to MCSymbol. llvm-svn: 238580	2015-05-29 17:24:52 +00:00
Pete Cooper	c5a7177772	Fix crash in MCExpr::print. Symbols are no longer required to be named, but this leads to a crash here if an unnamed symbol checks that its first character is '$'. Change the code to first check for a name, then check its first character. No test case i'm afraid as this is debugging code, but any test case with temp labels and 'llc --debug --filetype=obj' would have crashed. llvm-svn: 238579	2015-05-29 17:19:11 +00:00
Nemanja Ivanovic	376e17364f	Add support for VSX FMA single-precision instructions to the PPC back end This patch corresponds to review: http://reviews.llvm.org/D9941 It adds the various FMA instructions introduced in the version 2.07 of the ISA along with the testing for them. These are operations on single precision scalar values in VSX registers. llvm-svn: 238578	2015-05-29 17:13:25 +00:00
Alex Lorenz	09b832cac5	MIR Serialization: use correct line and column numbers for LLVM IR errors. This commit translates the line and column numbers for LLVM IR errors from the numbers in the YAML block scalar to the numbers in the MIR file so that the MIRParser users can report LLVM IR errors with the correct line and column numbers. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10108 llvm-svn: 238576	2015-05-29 17:05:41 +00:00
Reid Kleckner	1d3d4adbb9	[WinEH] Emit EH tables for __CxxFrameHandler3 on 32-bit x86 Small (really small!) C++ exception handling examples work on 32-bit x86 now. This change disables the use of .seh_* directives in WinException when CFI is not in use. It also uses absolute symbol references in the tables instead of imagerel32 relocations. Also fixes a cache invalidation bug in MMI personality classification. llvm-svn: 238575	2015-05-29 17:00:57 +00:00
Jingyue Wu	995dde2799	[NVPTXFavorNonGenericAddrSpaces] recursively trace into GEP and BitCast Summary: This patch allows NVPTXFavorNonGenericAddrSpaces to remove addrspacecast from longer chains consisting of GEPs and BitCasts. For example, it can now optimize %0 = addrspacecast [10 x float] addrspace(3)* @a to [10 x float]* %1 = gep [10 x float]* %0, i64 0, i64 %i %2 = bitcast float* %1 to i32* %3 = load i32* %2 ; emits ld.u32 to %0 = gep [10 x float] addrspace(3)* @a, i64 0, i64 %i %1 = bitcast float addrspace(3)* %0 to i32 addrspace(3)* %3 = load i32 addrspace(3)* %1 ; emits ld.shared.f32 Test Plan: @ld_int_from_global_float in access-non-generic.ll Reviewers: broune, eliben, jholewinski, meheff Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10074 llvm-svn: 238574	2015-05-29 17:00:27 +00:00
Jingyue Wu	a84feb1727	[DependenceAnalysis] Extend unifySubscriptType for handling coupled subscript groups. Summary: In continuation to an earlier commit to DependenceAnalysis.cpp by jingyue (r222100), the type for all subscripts in a coupled group need to be the same since constraints from one subscript may be propagated to another during testing. During testing, new SCEVs may be created and the operands for these need to be the same. This patch extends unifySubscriptType() to work on lists of subscript pairs, ensuring a common extended type for all of them. Test Plan: Added a test case to NonCanonicalizedSubscript.ll which causes dependence analysis to crash without this fix. All regression tests pass. Reviewers: spop, sebpop, jingyue Reviewed By: jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9698 llvm-svn: 238573	2015-05-29 16:58:08 +00:00
Rafael Espindola	f4b4430f8c	Simplify now that symbols contain the correct section. The complexity in here was because before r233995 variable symbols would report the incorrect section. llvm-svn: 238559	2015-05-29 15:07:27 +00:00
Colin LeMahieu	35436a2634	[Objdump] Removing unused parameter. llvm-svn: 238557	2015-05-29 14:48:25 +00:00
Colin LeMahieu	68d967d92e	[Hexagon] Disassembling, printing, and emitting instructions a whole-bundle at a time which is the semantic unit for Hexagon. Fixing tests to use the new format. Disabling tests in the direct object emission path for a followup patch. llvm-svn: 238556	2015-05-29 14:44:13 +00:00
Rafael Espindola	10d238751e	Fix ELFObjectWriter::isLocal for signature symbols. And with that simplify the logic for inserting them in ExternalSymbolData or LocalSymbolData. No functionality change overall since the old code avoided the isLocal bug. llvm-svn: 238555	2015-05-29 14:20:40 +00:00
Toma Tabacu	b45fb36f20	[mips] Remove 2 unused variables in MipsTargetStreamer.cpp. NFC. llvm-svn: 238554	2015-05-29 13:52:56 +00:00
Aaron Ballman	1196ca2113	Removing a switch statement that only contains a default; NFC. llvm-svn: 238552	2015-05-29 13:00:07 +00:00
Craig Topper	2af5e6fbf9	[TableGen] Remove convertValue functions for UnOpInit, BinOpInit, and TernOpInit as they weren't able to be called. I don't think converting the inputs to the Ops was the right behavior anyway. llvm-svn: 238543	2015-05-29 05:51:32 +00:00
Matthias Braun	27a6cfd823	This should have been a reference llvm-svn: 238540	2015-05-29 02:59:59 +00:00
Matthias Braun	e41e146c16	CodeGen: Use mop_iterator instead of MIOperands/ConstMIOperands MIOperands/ConstMIOperands are classes iterating over the MachineOperand of a MachineInstr, however MachineInstr::mop_iterator does the same thing. I assume these two iterators exist to have a uniform interface to iterate over the operands of a machine instruction bundle and a single machine instruction. However in practice I find it more confusing to have 2 different iterator classes, so this patch transforms (nearly all) the code to use mop_iterators. The only exception being MIOperands::anlayzePhysReg() and MIOperands::analyzeVirtReg() still needing an equivalent, I leave that as an exercise for the next patch. Differential Revision: http://reviews.llvm.org/D9932 This version is slightly modified from the proposed revision in that it introduces MachineInstr::getOperandNo to avoid the extra counting variable in the few loops that previously used MIOperands::getOperandNo. llvm-svn: 238539	2015-05-29 02:56:46 +00:00
Quentin Colombet	5f834c2260	Add a test for the MachineCopyPropagation change landed in r238518. llvm-svn: 238537	2015-05-29 01:40:00 +00:00
Ahmed Bougacha	eb4dbd8552	[TableGen][AsmMatcherEmitter] Only parse isolated tokens as registers. Fixes PR23455, where, when TableGen generates the matcher from the AsmString, it splits "cmp${cc}ss" into tokens, and the "ss" suffix is recognized as the SS register. I can't think of a situation where that's a feature, not a bug, hence: when a token is "isolated", i.e., it is followed and preceded by separators, it shouldn't be parsed as a register. Differential Revision: http://reviews.llvm.org/D9844 llvm-svn: 238536	2015-05-29 01:03:37 +00:00
Ahmed Bougacha	d8dc2acda2	[TableGen][AsmMatcherEmitter] Factor out AsmOperand creation. NFC. llvm-svn: 238534	2015-05-29 00:55:55 +00:00
Ahmed Bougacha	0ea9d1e753	[IR] fptrunc-of-fptrunc isn't an EliminableCastPair. Double and single rounding can produce different results. This is the IR counterpart to r228911. llvm-svn: 238531	2015-05-29 00:04:30 +00:00
Matthias Braun	111f5d88fb	MachineFrameInfo: Simplify pristine register calculation. About pristine regsiters: Pristine registers "hold a value that is useless to the current function, but that must be preserved - they are callee saved registers that have not been saved." This concept saves compile time as it frees the prologue/epilogue inserter from adding every such register to every basic blocks live-in list. However the current code in getPristineRegs is formulated in a complicated way: Inside the function prologue and epilogue all callee saves are considered pristine, while in the rest of the code only the non-saved ones are considered pristine. This requires logic to differentiate between prologue/epilogue and the rest and in the presence of shrink-wrapping this even becomes complicated/expensive. It's also unnecessary because the prologue epilogue inserters already mark callee-save registers that are saved/restores properly in the respective blocks in the prologue/epilogue (see updateLiveness() in PrologueEpilogueInserter.cpp). So only declaring non-saved/restored callee saved registers as pristine just works. Differential Revision: http://reviews.llvm.org/D10101 llvm-svn: 238524	2015-05-28 23:20:35 +00:00
Eric Christopher	536f0a95e5	Fix typos in variable/grammar names. llvm-svn: 238523	2015-05-28 23:07:39 +00:00
Reid Kleckner	60b640bb80	Rename Win64Exception.(cpp\|h) to WinException.(cpp\|h) This is in preparation for reusing this for 32-bit x86 EH table emission. Also updates the type name for consistency. NFC llvm-svn: 238521	2015-05-28 22:47:01 +00:00
Chandler Carruth	39691c41bf	[x86] Move the vector popcount tests into non-ISA files, and instead organize them by the width of vector. This makes it a lot easier to see that we're covering all of the vector types but not doing so excessively. This also adds tests across the spectrum of SSE versions in addition to the AVX versions. If you're really tired of seeing the massive sprawl of scalarized code for this, don't worry, I'm just about to land Bruno's patch that dramatically improve the situation for SSSE3 and newer. llvm-svn: 238520	2015-05-28 22:46:48 +00:00
Alex Lorenz	78d7831b0f	MIR Serialization: print and parse machine function names. This commit introduces a serializable structure called 'llvm::yaml::MachineFunction' that stores the machine function's name. This structure will mirror the machine function's state in the future. This commit prints machine functions as YAML documents containing a YAML mapping that stores the state of a machine function. This commit also parses the YAML documents that contain the machine functions. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D9841 llvm-svn: 238519	2015-05-28 22:41:12 +00:00
Quentin Colombet	75afbfd4a1	[MachineCopyPropagation] Fix a bug with undef handling when the value is actualy alive. Test case will follow. llvm-svn: 238518	2015-05-28 22:38:40 +00:00
Chris Bieneman	2607a0d655	Fixing broken bots after r238505. Need non-const iterator inserts too. These failures seem to be due to differences in the versions of libstdc++ on various operating systems. llvm-svn: 238516	2015-05-28 22:18:34 +00:00
David Majnemer	4e6438c534	Add testcase for r238503. llvm-svn: 238515	2015-05-28 22:12:27 +00:00
Reid Kleckner	fe4d491bd9	[WinEH] Start inserting state number stores for C++ EH This moves all the state numbering code for C++ EH to WinEHPrepare so that we can call it from the X86 state numbering IR pass that runs before isel. Now we just call the same state numbering machinery and insert a bunch of stores. It also populates MachineModuleInfo with information about the current function. llvm-svn: 238514	2015-05-28 22:00:24 +00:00
Rafael Espindola	bb35ebd189	Don't special case undefined symbol when deciding the symbol order. ELF has no restrictions on where undefined symbols go relative to other defined symbols. In fact, gas just sorts them together. Do the same. This was there since r111174 probably just because the MachO writer has it. llvm-svn: 238513	2015-05-28 21:59:34 +00:00
Diego Novillo	6555adb110	Update documentation for llvm-profdata. These options have been present for a while, but I had never updated the documentation. Fixed. llvm-svn: 238511	2015-05-28 21:57:17 +00:00
Chris Bieneman	fa150d2aee	Fixing the polly build. I broke the polly build in r238505. This fixes the failure by adding non-const iterator erase methods to cl::list_storage. llvm-svn: 238509	2015-05-28 21:51:52 +00:00
Andy Ayers	b63298e0c8	Revise test to run llc and llvm-mc separately. Differential Revision: http://reviews.llvm.org/D10066 llvm-svn: 238508	2015-05-28 21:49:50 +00:00
Wei Mi	e2538b5639	Enable exitValue rewrite only when the cost of expansion is low. The patch evaluates the expansion cost of exitValue in indVarSimplify pass, and only does the rewriting when the expansion cost is low or loop can be deleted with the rewriting. It provides an option "-replexitval=" to control the default aggressiveness of the exitvalue rewriting. It also fixes some missing cases in SCEVExpander::isHighCostExpansionHelper to enhance the evaluation of SCEV expansion cost. Differential Revision: http://reviews.llvm.org/D9800 llvm-svn: 238507	2015-05-28 21:49:07 +00:00
Rafael Espindola	3a5d3cce80	Remove a trivial forwarding function. NFC. llvm-svn: 238506	2015-05-28 21:36:02 +00:00
Chris Bieneman	72ea707fe7	Re-landing "Refactoring cl::list_storage from "is a" to "has a" std::vector." Originally landed r238485 MSVC resolves identifiers differently from Clang and GCC, this resulted in build bot failures. This pach re-lands r238485 and fixes the build failures. llvm-svn: 238505	2015-05-28 21:31:22 +00:00
David Majnemer	22d2b02706	[SelectionDAG] Scalar shift amounts may require legalization The shift amount may be too small to cope with promoted left hand side, make sure to promote it as well. This fixes PR23664. llvm-svn: 238503	2015-05-28 21:29:59 +00:00
Reid Kleckner	bfcad2f181	Remove debug prints from r238487 llvm-svn: 238501	2015-05-28 21:23:53 +00:00
Colin LeMahieu	0b5890d411	[llvm] Adding vdtor to fix warning. llvm-svn: 238494	2015-05-28 20:59:08 +00:00
Rafael Espindola	5e9ed90279	Inline trivial method. NFC. llvm-svn: 238492	2015-05-28 20:53:09 +00:00
Chris Bieneman	bcb6ddc0a4	Revert "Refactoring cl::list_storage from "is a" to "has a" std::vector." This reverts commit 117715ca0613d3db144241499401f2ec5398f1d5. llvm-svn: 238491	2015-05-28 20:47:02 +00:00
Reid Kleckner	80956a0142	Disable x86 tail call optimizations that jump through GOT For x86 targets, do not do sibling call optimization when materializing the callee's address would require a GOT relocation. We can still do tail calls to internal functions, hidden functions, and protected functions, because they do not require this kind of relocation. It is still possible to get GOT relocations when the user explicitly asks for it with musttail or -tailcallopt, both of which are supposed to guarantee TCO. Based on a patch by Chih-hung Hsieh. Reviewers: srhines, timmurray, danalbert, enh, void, nadav, rnk Subscribers: joerg, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D9799 llvm-svn: 238487	2015-05-28 20:44:28 +00:00
Chris Bieneman	f2d1c73cf3	Refactoring cl::list_storage from "is a" to "has a" std::vector. Summary: This isn't necessarily an ideal change, and I want to at least reduce the API surface area, but for the new API we really shouldn't be relying on cl::list being a std::vector. Reviewers: chandlerc Reviewed By: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10093 llvm-svn: 238485	2015-05-28 20:38:12 +00:00
Daniel Sanders	b34dab3d00	Revert r238427 - [mips] Make TTypeEncoding indirect to allow .eh_frame to be read-only. It caused a smaller number of failures than the previous attempt at committing but still caused a couple on the llvm-linux-mips builder. Reverting while I investigate the remainder. llvm-svn: 238483	2015-05-28 20:30:32 +00:00
Alexey Samsonov	6ecbd064e1	Object, ELF: Use error code instead of calling report_fatal_error() Make createELFObjectFile() return object_error::parse_failed on encountering invalid ELF file, instead of crashing the program. llvm-svn: 238481	2015-05-28 20:25:42 +00:00
Rafael Espindola	e48421f6fc	Remove structure field that can be computed just before use. llvm-svn: 238480	2015-05-28 20:25:29 +00:00
Rafael Espindola	d7f10f0576	Avoid warnings when building without asserts. llvm-svn: 238479	2015-05-28 20:19:31 +00:00
Rafael Espindola	cfbd35c9ad	Move these vectors to the only function where they are used. llvm-svn: 238477	2015-05-28 20:11:34 +00:00
Peter Collingbourne	450fbee6b2	Thumb2: Modify codegen for memcpy intrinsic to prefer LDM/STM. We were previously codegen'ing these as regular load/store operations and hoping that the register allocator would allocate registers in ascending order so that we could apply an LDM/STM combine after register allocation. According to the commit that first introduced this code (r37179), we planned to teach the register allocator to allocate the registers in ascending order. This never got implemented, and up to now we've been stuck with very poor codegen. A much simpler approach for achiveing better codegen is to create LDM/STM instructions with identical sets of virtual registers, let the register allocator pick arbitrary registers and order register lists when printing an MCInst. This approach also avoids the need to repeatedly calculate offsets which ultimately ought to be eliminated pre-RA in order to decrease register pressure. This is implemented by lowering the memcpy intrinsic to a series of SD-only MCOPY pseudo-instructions which performs a memory copy using a given number of registers. During SD->MI lowering, we lower MCOPY to LDM/STM. This is a little unusual, but it avoids the need to encode register lists in the SD, and we can take advantage of SD use lists to decide whether to use the _UPD variant of the instructions. Fixes PR9199. Differential Revision: http://reviews.llvm.org/D9508 llvm-svn: 238473	2015-05-28 20:02:45 +00:00
Reid Kleckner	e2e57faa7d	[WinEH] Remove debugging dump() call llvm-svn: 238472	2015-05-28 20:02:05 +00:00
Rafael Espindola	0cbea2997c	Merge redundant loops. NFC. llvm-svn: 238471	2015-05-28 20:00:13 +00:00
Duncan P. N. Exon Smith	8d3197f657	AsmPrinter: Stop exposing underlying DIE children list, NFC Update `DIE` API to hide the implementation of `DIE::Children` so we can swap it out. llvm-svn: 238468	2015-05-28 19:56:34 +00:00
Rafael Espindola	b32552faf6	Simplify LastLocalSymbolIndex computation. NFC. llvm-svn: 238465	2015-05-28 19:46:36 +00:00
Rafael Espindola	dcda9979ba	Use range loops. NFC. llvm-svn: 238463	2015-05-28 19:43:20 +00:00
Pete Cooper	b9d2e34a4a	Add BranchProbabilityInfo::releaseMemory to clear the Weights field. BranchProbabilityInfo was leaking 3MB of memory when running 'opt -O2 verify-uselistorder.lto.bc'. This was due to the Weights member not being cleared once the pass is no longer needed. This adds the releaseMemory override to clear that field. The other fields are cleared at the end of runOnFunction so can stay there. llvm-svn: 238462	2015-05-28 19:43:06 +00:00
Rafael Espindola	1fd36275a1	Remove temporary FileSymbolData. NFC. llvm-svn: 238461	2015-05-28 19:29:15 +00:00
Colin LeMahieu	fb76b007d3	[Objdump] Allow instruction pretty printing to be specialized by the target triple. Differential Revision: http://reviews.llvm.org/D8427 llvm-svn: 238457	2015-05-28 19:07:14 +00:00

1 2 3 4 5 ...

117816 Commits