llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	284a750c5f	Make all temporary symbols unnamed. What this does is make all symbols that would otherwise start with a .L (or L on MachO) unnamed. Some of these symbols still show up in the symbol table, but we can just make them unnamed. In order to make sure we produce identical results when going thought assembly, all .L (not just the compiler produced ones), are now unnamed. Running llc on llvm-as.opt.bc, the peak memory usage goes from 208.24MB to 205.57MB. llvm-svn: 240130	2015-06-19 12:16:55 +00:00
Ahmed Bougacha	9a9094260d	[ARM] Look through concat when lowering in-place shuffles (VZIP, ..) Currently, we canonicalize shuffles that produce a result larger than their operands with: shuffle(concat(v1, undef), concat(v2, undef)) -> shuffle(concat(v1, v2), undef) because we can access quad vectors (see PerformVECTOR_SHUFFLECombine). This is useful in the general case, but there are special cases where native shuffles produce larger results: the two-result ops. We can look through the concat when lowering them: shuffle(concat(v1, v2), undef) -> concat(VZIP(v1, v2):0, :1) This lets us generate the native shuffles instead of scalarizing to dozens of VMOVs. Differential Revision: http://reviews.llvm.org/D10424 llvm-svn: 240118	2015-06-19 02:32:35 +00:00
Ahmed Bougacha	7dbea8cec9	[ARM] Add D-sized vtrn/vuzp/vzip tests, and cleanup. NFC. llvm-svn: 240114	2015-06-19 02:15:34 +00:00
Eric Christopher	572e03a396	Fix "the the" in comments. llvm-svn: 240112	2015-06-19 01:53:21 +00:00
Alex Lorenz	82a9a7e42c	MIR Serialization: Reenable one of the MIRParser tests by reverting r239805. The test 'llvm/test/CodeGen/MIR/machine-function.mir' was disabled on x86 msc18 in r239805 as it failed. My commit r240054 have fixed the problem, so this commit reverts the commit that disabled the test as it should pass now. llvm-svn: 240074	2015-06-18 22:46:27 +00:00
Rafael Espindola	9ac06a0e6b	Improve the --expand-relocs handling of MachO. In a relocation target can take 3 basic forms * A r_value in scattered relocations. * A symbol in external relocations. * A section is non-external relocations. Have the dump reflect that. With this change we go from CHECK-NEXT: Extern: 0 CHECK-NEXT: Type: X86_64_RELOC_SUBTRACTOR (5) CHECK-NEXT: Symbol: 0x2 CHECK-NEXT: Scattered: 0 To just // CHECK-NEXT: Type: X86_64_RELOC_SUBTRACTOR (5) // CHECK-NEXT: Section: __data (2) Since the relocation is with a section, we print the seciton name and don't need to say that it is not scattered or external. Someone motivated can add further special cases for things like ARM64_RELOC_ADDEND and ARM_RELOC_PAIR. llvm-svn: 240073	2015-06-18 22:38:20 +00:00
Yi Jiang	e0b3499db7	Avoid redundant select node in early if-conversion pass llvm-svn: 240072	2015-06-18 22:34:09 +00:00
Hans Wennborg	67d492a544	Switch lowering: enable whole-switch jump tables at -O0. To same compile time, the analysis to find dense case-clusters in switches is not done at -O0. However, when the whole switch is dense enough, it is easy to turn it into a jump table, resulting in much faster code with no extra effort. llvm-svn: 240071	2015-06-18 22:22:30 +00:00
Rafael Espindola	cf022ba270	Pass --expand-relocs to a few more tests. llvm-svn: 240069	2015-06-18 22:12:47 +00:00
Sanjay Patel	c3e018e6fd	add test to show suboptimal load merging behavior llvm-svn: 240063	2015-06-18 21:34:26 +00:00
Simon Pilgrim	de94fa6438	[X86][SSE][CostModel] Fixed uitofp/sitofp cost target tests to specify sse2/avx2/avx512f directly instead of via a cpu model. llvm-svn: 240062	2015-06-18 21:26:01 +00:00
Sanjay Patel	9fce2bc7b1	fixed to test attributes and use better checks 1. Used update_llc_test_checks.py to tighten checks 2. Fixed triple (nothing Darwin-specific here) 3. Replaced CPU specifiers with attributes 4. Fixed comments 5. Removed IvyBridge run because it did not add any coverage llvm-svn: 240058	2015-06-18 21:12:24 +00:00
Rafael Espindola	aaaa575f71	Use --expand-relocs in a test. It will make the next change easier to read. llvm-svn: 240053	2015-06-18 20:57:35 +00:00
Colin LeMahieu	d2158755eb	[Hexagon] Printing packet brackets when asm printing and adding a number of tests that test packet brackets. llvm-svn: 240051	2015-06-18 20:43:50 +00:00
Sanjoy Das	c65d43e649	[CallGraph] Teach the CallGraph about non-leaf intrinsics. Summary: Currently intrinsics don't affect the creation of the call graph. This is not accurate with respect to statepoint and patchpoint intrinsics -- these do call (or invoke) LLVM level functions. This change fixes this inconsistency by adding a call to the external node for call sites that call these non-leaf intrinsics. This coupled with the fact that these intrinsics also escape the function pointer they call gives us a conservatively correct call graph. Reviewers: reames, chandlerc, atrick, pgavlin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10526 llvm-svn: 240039	2015-06-18 19:28:26 +00:00
David Majnemer	46c852e438	[CodeGen] Don't emit a random reference to the personality function This should fix issues we've been seeing with Darwin. llvm-svn: 240036	2015-06-18 18:31:46 +00:00
James Y Knight	f90346f8f6	[SPARC] Repair GOT references to internal symbols. They had been getting emitted as a section + offset reference, which is bogus since the value needs to be the offset within the GOT, not the actual address of the symbol's object. Differential Revision: http://reviews.llvm.org/D10441 llvm-svn: 240020	2015-06-18 15:05:15 +00:00
Rafael Espindola	f14eec8d78	Convert a few tests to use llvm-mc. llvm-svn: 240017	2015-06-18 13:39:07 +00:00
Simon Pilgrim	1739421893	[X86][AVX2] Added AVX2 SINT_TO_FP/UINT_TO_FP tests llvm-svn: 240013	2015-06-18 12:32:28 +00:00
Asaf Badouh	81f03c30a5	[AVX512] add instructions: VPAVGB and VPAVGW review http://reviews.llvm.org/D10504 llvm-svn: 240012	2015-06-18 12:30:53 +00:00
Elena Demikhovsky	d3057e5e37	AVX-512: (fixed) Added encoding of all forms of VPERMT2W/D/Q/PS/PD and VPERMI2W/D/Q/PS/PD. Intrinsics and tests for them are comming in the next patch. llvm-svn: 240003	2015-06-18 08:56:19 +00:00
Elena Demikhovsky	4f13f3f9b8	reverted 239999 due to test failures llvm-svn: 240001	2015-06-18 08:06:49 +00:00
Elena Demikhovsky	975a637cd9	AVX-512: Added encoding of all forms of VPERMT2W/D/Q/PS/PD and VPERMI2W/D/Q/PS/PD. Intrinsics and tests for them are comming in the next patch. llvm-svn: 239999	2015-06-18 07:29:40 +00:00
Benjamin Kramer	c6e8bfc41d	[AsmPrinter] Make isRepeatedByteSequence smarter about odd integer types - zext the value to alloc size first, then check if the value repeats with zero padding included. If so we can still emit a .space - Do the checking with APInt.isSplat(8), which handles non-pow2 types - Also handle large constants (bit width > 64) - In a ConstantArray all elements have the same type, so it's sufficient to check the first constant recursively and then just compare if all following constants are the same by pointer compare llvm-svn: 239977	2015-06-17 23:55:17 +00:00
Simon Pilgrim	3aa039a4a8	[X86][SSE] Improved support for vector i16 to float conversions. Added explicit sign extension for v4i16/v8i16 to v4i32/v8i32 before conversion to floats. Matches existing support for v4i8/v8i8. Follow up to D10433 llvm-svn: 239966	2015-06-17 22:43:34 +00:00
Jingyue Wu	cd3afea451	Add NVPTXLowerAlloca pass to convert alloca'ed memory to local address Summary: This is done by first adding two additional instructions to convert the alloca returned address to local and convert it back to generic. Then replace all uses of alloca instruction with the converted generic address. Then we can rely NVPTXFavorNonGenericAddrSpace pass to combine the generic addresscast and the corresponding Load, Store, Bitcast, GEP Instruction together. Patched by Xuetian Weng (xweng@google.com). Test Plan: test/CodeGen/NVPTX/lower-alloca.ll Reviewers: jholewinski, jingyue Reviewed By: jingyue Subscribers: meheff, broune, eliben, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10483 llvm-svn: 239964	2015-06-17 22:31:02 +00:00
David Majnemer	7fddeccb8b	Move the personality function from LandingPadInst to Function The personality routine currently lives in the LandingPadInst. This isn't desirable because: - All LandingPadInsts in the same function must have the same personality routine. This means that each LandingPadInst beyond the first has an operand which produces no additional information. - There is ongoing work to introduce EH IR constructs other than LandingPadInst. Moving the personality routine off of any one particular Instruction and onto the parent function seems a lot better than have N different places a personality function can sneak onto an exceptional function. Differential Revision: http://reviews.llvm.org/D10429 llvm-svn: 239940	2015-06-17 20:52:32 +00:00
Ahmed Bougacha	f32991461f	[CodeGenPrepare] Generalize inserted set from truncs to any inst. It's been used before to avoid infinite loops caused by separate CGP optimizations undoing one another. We found one more such issue caused by r238054. To avoid it, generalize the "InsertedTruncs" set to any inst, and use it to avoid touching those again. llvm-svn: 239938	2015-06-17 20:44:32 +00:00
Colin LeMahieu	bb71f7d251	[Hexagon] Adding a number of other tests for min/max instructions and loading i1s. llvm-svn: 239935	2015-06-17 20:29:33 +00:00
Peter Collingbourne	4fc603ded3	LowerBitSets: Do not assign names to aliases of unnamed bitset element objects. The restriction on unnamed aliases was removed in r239921. Mostly reverts r239590, but we keep the test. llvm-svn: 239923	2015-06-17 18:31:02 +00:00
Rafael Espindola	54fc298bbc	Allow aliases to be unnamed. If globals can be unnamed, there is no reason for aliases to be different. The restriction was there since the original implementation in r36435. I can only guess it was there because of the old bison parser for the old alias syntax. llvm-svn: 239921	2015-06-17 17:53:31 +00:00
Colin LeMahieu	ca8a82d5c7	[Hexagon] Adding some compare tests, fixing existing XFAILed tests, and removing mcpu=hexagonv4 since that's the minimum version anyway. llvm-svn: 239917	2015-06-17 17:19:05 +00:00
Diego Novillo	8c49a57266	Add documentation for new backedge mass propagation in irregular loops. Tweak test cases and rename headerIndexFor -> getHeaderIndex. llvm-svn: 239915	2015-06-17 16:28:22 +00:00
Benjamin Kramer	58675d4f84	[MC/Dwarf] Encode DW_CFA_advance_loc in target endianess. This matches GNU as output. llvm-svn: 239911	2015-06-17 15:14:35 +00:00
Toma Tabacu	f712ede932	[mips] [IAS] Add support for expanding LASym with a source register operand. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9348 llvm-svn: 239910	2015-06-17 14:31:51 +00:00
Toma Tabacu	1a1083285c	[mips] [IAS] Add support for the B{L,G}{T,E}(U) branch pseudo-instructions. Summary: This does not include support for the immediate variants of these pseudo-instructions. Fixes llvm.org/PR20968. Reviewers: dsanders Reviewed By: dsanders Subscribers: seanbruno, emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D8537 llvm-svn: 239905	2015-06-17 13:20:24 +00:00
Toma Tabacu	9e7b90c244	[mips] [IAS] Fix LA with relative label operands. Summary: Call MCSymbolRefExpr::create() with a MCSymbol* argument, not with a StringRef of the Symbol's name, in order to avoid creating invalid temporary symbols for relative labels (e.g. {$,.L}tmp00, {$,.L}tmp10 etc.). Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10498 llvm-svn: 239901	2015-06-17 12:30:37 +00:00
Toma Tabacu	6a1e0eb27d	[mips] [IAS] Add test for SW with relative label operands. NFC. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10497 llvm-svn: 239899	2015-06-17 11:46:37 +00:00
Toma Tabacu	07c97b3b7e	[mips] [IAS] Fix LW with relative label operands. Summary: Previously, MCSymbolRefExpr::create() was called with a StringRef of the symbol name, which it would then search for in the Symbols StringMap (from MCContext). However, relative labels (which are temporary symbols) are apparently not stored in the Symbols StringMap, so we end up creating a new {$,.L}tmp symbol ({$,.L}tmp00, {$,.L}tmp10 etc.) each time we create an MCSymbolRefExpr by passing in the symbol name as a StringRef. Fortunately, there is a version of MCSymbolRefExpr::create() which takes an MCSymbol* and we already have an MCSymbol* at that point, so we can just pass that in instead of the StringRef. I also removed the local StringRef calls to MCSymbolRefExpr::create() from expandMemInst(), as those cases can be handled by evaluateRelocExpr() anyway. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9938 llvm-svn: 239897	2015-06-17 10:43:45 +00:00
Igor Breger	dfcc3d31a7	AVX-512: cvtusi2ss/d intrinsics. Change builtin function name and signature ( add third parameter - rounding mode ). Added tests for intrinsics. Differential Revision: http://reviews.llvm.org/D10473 llvm-svn: 239888	2015-06-17 07:23:57 +00:00
Matthias Braun	8321006d44	Revert "AArch64: Use CMP;CCMP sequences for and/or/setcc trees." The patch triggers a miscompile on SPEC 2006 403.gcc with the (ref) 200.i and scilab.i inputs. I opened PR23866 to track analysis of this. This reverts commit r238793. llvm-svn: 239880	2015-06-17 04:02:32 +00:00
Colin LeMahieu	be99a02b1b	[Hexagon] Adding MC ELF streamer and updating addend relocation test which shows correct ELF symbol. llvm-svn: 239876	2015-06-17 03:06:16 +00:00
Sanjay Patel	0848a8be92	Add some tests based on PR21711 These were originally added in r227242, but that patch was reverted because it caused a failure on AArch64. llvm-svn: 239860	2015-06-16 22:37:50 +00:00
Simon Atanasyan	6e07e9305b	[llvm-readobj] Print MIPS .reginfo section content llvm-svn: 239856	2015-06-16 21:47:43 +00:00
Simon Pilgrim	cae7b94cbd	[X86][SSE] Vectorize v2i32 to v2f64 conversions This patch enables support for the conversion of v2i32 to v2f64 to use the CVTDQ2PD xmm instruction and stay on the SSE unit instead of scalarizing, sign extending to i64 and using CVTSI2SDQ scalar conversions. Differential Revision: http://reviews.llvm.org/D10433 llvm-svn: 239855	2015-06-16 21:40:28 +00:00
Philip Reames	c25df11614	Reapply 239795 - [InstCombine] Propagate non-null facts to call parameters The original change broke clang side tests. I will be submitting those momentarily. This change includes post commit feedback on the original change from from Pete Cooper. Original Submission comments: If a parameter to a function is known non-null, use the existing parameter attributes to record that fact at the call site. This has no optimization benefit by itself - that I know of - but is an enabling change for http://reviews.llvm.org/D9129. Differential Revision: http://reviews.llvm.org/D9132 llvm-svn: 239849	2015-06-16 20:24:25 +00:00
Rafael Espindola	c6afe0d4e9	Improve handling of end of file in the bitcode reader. Before this patch the bitcode reader would read a module from a file that contained in order: * Any number of non MODULE_BLOCK sub blocks. * One MODULE_BLOCK * Any number of non MODULE_BLOCK sub blocks. * 4 '\n' characters to handle OS X's ranlib. Since we support lazy reading of modules, any information that is relevant for the module has to be in the MODULE_BLOCK or before it. We don't gain anything from checking what is after. This patch then changes the reader to stop once the MODULE_BLOCK has been successfully parsed. This avoids the ugly special case for .bc files in an archive and makes it easier to embed bitcode files. llvm-svn: 239845	2015-06-16 20:03:39 +00:00
Diego Novillo	9a779623d9	Fix PR 23525 - Separate header mass propagation in irregular loops. Summary: When propagating mass through irregular loops, the mass flowing through each loop header may not be equal. This was causing wrong frequencies to be computed for irregular loop headers. Fixed by keeping track of masses flowing through each of the headers in an irregular loop. To do this, we now keep track of per-header backedge weights. After the loop mass is distributed through the loop, the backedge weights are used to re-distribute the loop mass to the loop headers. Since each backedge will have a mass proportional to the different branch weights, the loop headers will end up with a more approximate weight distribution (as opposed to the current distribution that assumes that every loop header is the same). Reviewers: dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10348 llvm-svn: 239843	2015-06-16 19:10:58 +00:00
Igor Laevsky	8f3fa0ec63	[Statepoints] Test only change. Check that statepoint lowering didn't generate more than expected amount of spills. See http://reviews.llvm.org/D10402 for related discussion. llvm-svn: 239842	2015-06-16 19:07:05 +00:00
Frederic Riss	40baa0aad4	Have MachOObjectFile::isValidArch() accept armv7 llvm-svn: 239833	2015-06-16 17:37:03 +00:00
Alex Lorenz	5ef16b8a7c	MIR Parser: Report an error when a machine function doesn't have a corresponding function. This commit reports an error when a machine function from a MIR file that contains LLVM IR can't find a function with the same name in the loaded LLVM IR module. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10468 llvm-svn: 239831	2015-06-16 17:06:29 +00:00
Rafael Espindola	35f6faed67	Add a test for padded bitcode files. llvm-svn: 239829	2015-06-16 16:36:15 +00:00
Kit Barton	4f79f96fd7	Properly handle the mftb instruction. The mftb instruction was incorrectly marked as deprecated in the PPC Backend. Instead, it should not be treated as deprecated, but rather be implemented using the mfspr instruction. A similar patch was put into GCC last year. Details can be found at: https://sourceware.org/ml/binutils/2014-11/msg00383.html. This change will replace instances of the mftb instruction with the mfspr instruction for all CPUs except 601 and pwr3. This will also be the default behaviour. Additional details can be found in: https://llvm.org/bugs/show_bug.cgi?id=23680 Phabricator review: http://reviews.llvm.org/D10419 llvm-svn: 239827	2015-06-16 16:01:15 +00:00
Matt Arsenault	ed891b5561	Revert "Revert "Fix merges of non-zero vector stores"" Reapply r239539. Don't assume the collected number of stores is the same vector size. Just take the first N stores to fill the vector. llvm-svn: 239825	2015-06-16 15:51:48 +00:00
Benjamin Kramer	1ee59cba5d	[InstSimplify] Allow folding of fdiv X, X with just NaNs ignored Any combination of +-inf/+-inf is NaN so it's already ignored with nnan and we can skip checking for ninf. Also rephrase logic in comments a bit. llvm-svn: 239821	2015-06-16 14:57:29 +00:00
Daniel Sanders	58405d856e	[mips][ias] Expand on r238751 to cover as many relocs as possible. Summary: Relocs that can be converted from absolute to PC-relative now do so if IsPCRel is true. Relocs that require PC-relative now call llvm_unreachable() if IsPCRel is false and similarly those that require absolute assert that IsPCRel is false. Note that while it looks like some relocs (e.g. R_MIPS_26) can be converted into the MIPS32r6/MIPS64r6 relocs (R_MIPS_PC*_S2), it isn't actually valid to do so. Placeholders have been left in the testcase for unsupported relocs and relocs that cannot be generated at the moment. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: llvm-commits, rafael Differential Revision: http://reviews.llvm.org/D10184 llvm-svn: 239817	2015-06-16 13:46:26 +00:00
Daniel Sanders	c535d93b47	[llvm-mc] The object form of the GNU triple should be the same as the string form. Summary: GetTarget() may modify TripleName without also updating TheTriple. This can lead to situations where the MCObjectStreamer has a different triple to the rest of LLVM. This inconsistency caused sparc-little-endian.s to pass on Windows because most of LLVM had sparcel-pc-win32 while MCObjectStreamer had "". I believe the same kind of thing was also true of Darwin. Reviewers: rengolin Reviewed By: rengolin Subscribers: llvm-commits, rengolin, rafael Differential Revision: http://reviews.llvm.org/D10450 llvm-svn: 239808	2015-06-16 09:57:38 +00:00
Asaf Badouh	02d126cb9d	[AVX512] add integer min/max intrinsics support. review: http://reviews.llvm.org/D10439 llvm-svn: 239806	2015-06-16 08:39:27 +00:00
NAKAMURA Takumi	f68c7a27f4	Disable llvm/test/CodeGen/MIR/machine-function.mir on x86 msc18 for now. Investigating. The emission was as below; --- name: foo alignment: 31428584 exposesReturnsTwice: true hasInlineAsm: false ... --- name: bar alignment: 1701667182 exposesReturnsTwice: false hasInlineAsm: false ... --- name: func alignment: 8 exposesReturnsTwice: false hasInlineAsm: false ... --- name: func2 alignment: 16 exposesReturnsTwice: true hasInlineAsm: true ... llvm-svn: 239805	2015-06-16 06:57:35 +00:00
Elena Demikhovsky	77f0e9f662	X86: optimized i64 vector multiply with constant When we multiply two 64-bit vectors, we extract lower and upper part and use the PMULUDQ instruction. When one of the operands is a constant, the upper part may be zero, we know this at compile time. Example: %a = mul <4 x i64> %b, <4 x i64> < i64 5, i64 5, i64 5, i64 5>. I'm checking the value of the upper part and prevent redundant "multiply", "shift" and "add" operations. llvm-svn: 239802	2015-06-16 06:07:24 +00:00
Philip Reames	1a6305f313	Revert 239795 I forgot to update some clang test cases. I'll fix and resubmit tomorrow. llvm-svn: 239800	2015-06-16 01:20:53 +00:00
Ahmed Bougacha	8c7754b965	[AArch64] Generalize extract-high DUP extension to MOVI/MVNI. These are really immediate DUPs, and suffer from the same problem with long instructions with a high/2 variant (e.g. smull). By extending a MOVI (or DUP, before this patch), we can avoid an ext on the other operand of the long instruction, e.g. turning: ext.16b v0, v0, v0, #8 movi.4h v1, #0x53 smull.4s v0, v0, v1 into: movi.8h v1, #0x53 smull2.4s v0, v0, v1 While there, add a now-necessary combine to fold (VT NVCAST (VT x)). llvm-svn: 239799	2015-06-16 01:18:14 +00:00
Ahmed Bougacha	d300722b93	[AArch64] Robustize neon-2velem-high test. NFC. llvm-svn: 239798	2015-06-16 01:05:39 +00:00
Philip Reames	dfc29fba60	[InstCombine] Propagate non-null facts to call parameters If a parameter to a function is known non-null, use the existing parameter attributes to record that fact at the call site. This has no optimization benefit by itself - that I know of - but is an enabling change for http://reviews.llvm.org/D9129. Differential Revision: http://reviews.llvm.org/D9132 llvm-svn: 239795	2015-06-16 00:43:54 +00:00
Alex Lorenz	5b5f97537f	MIR Serialization: Print and parse simple machine function attributes. This commit serializes the simple, scalar attributes from the 'MachineFunction' class. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10449 llvm-svn: 239790	2015-06-16 00:10:47 +00:00
Alex Lorenz	8e7a58d7cc	MIR Serialization: Create dummy functions when the MIR file doesn't have LLVM IR. This commit creates a dummy LLVM IR function with one basic block and an unreachable instruction for each parsed machine function when the MIR file doesn't have LLVM IR. This change is required as the machine function analysis pass creates machine functions only for the functions that are defined in the current LLVM module. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10135 llvm-svn: 239778	2015-06-15 23:07:38 +00:00
Alex Lorenz	fe2aa97bab	MIR Serialization: Report an error when machine functions have the same name. This commit reports an error when the MIR parser encounters a machine function with the name that is the same as the name of a different machine function. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10130 llvm-svn: 239774	2015-06-15 22:23:23 +00:00
Peter Collingbourne	58af6d1594	Add safestack attribute to LLVMAttribute enum and Go bindings. Correct constants in commented-out part of LLVMAttribute enum. Add tests that verify that the safestack attribute is only allowed as a function attribute. llvm-svn: 239772	2015-06-15 22:16:51 +00:00
Colin LeMahieu	ded2e90600	[Hexagon] Using readobj rather than objdump. llvm-svn: 239770	2015-06-15 21:57:41 +00:00
Colin LeMahieu	a071a8e5b6	[Hexagon] PC-relative offsets are relative to packet start rather than the offset of the relocation. Set relocation addend and check it's correct in the ELF. llvm-svn: 239769	2015-06-15 21:52:13 +00:00
Simon Pilgrim	aa9f712967	[X86][SSE] Added tests for vector i8/i16 to f32/f64 conversions llvm-svn: 239767	2015-06-15 21:49:31 +00:00
Peter Collingbourne	82437bf7a5	Protection against stack-based memory corruption errors using SafeStack This patch adds the safe stack instrumentation pass to LLVM, which separates the program stack into a safe stack, which stores return addresses, register spills, and local variables that are statically verified to be accessed in a safe way, and the unsafe stack, which stores everything else. Such separation makes it much harder for an attacker to corrupt objects on the safe stack, including function pointers stored in spilled registers and return addresses. You can find more information about the safe stack, as well as other parts of or control-flow hijack protection technique in our OSDI paper on code-pointer integrity (http://dslab.epfl.ch/pubs/cpi.pdf) and our project website (http://levee.epfl.ch). The overhead of our implementation of the safe stack is very close to zero (0.01% on the Phoronix benchmarks). This is lower than the overhead of stack cookies, which are supported by LLVM and are commonly used today, yet the security guarantees of the safe stack are strictly stronger than stack cookies. In some cases, the safe stack improves performance due to better cache locality. Our current implementation of the safe stack is stable and robust, we used it to recompile multiple projects on Linux including Chromium, and we also recompiled the entire FreeBSD user-space system and more than 100 packages. We ran unit tests on the FreeBSD system and many of the packages and observed no errors caused by the safe stack. The safe stack is also fully binary compatible with non-instrumented code and can be applied to parts of a program selectively. This patch is our implementation of the safe stack on top of LLVM. The patches make the following changes: - Add the safestack function attribute, similar to the ssp, sspstrong and sspreq attributes. - Add the SafeStack instrumentation pass that applies the safe stack to all functions that have the safestack attribute. This pass moves all unsafe local variables to the unsafe stack with a separate stack pointer, whereas all safe variables remain on the regular stack that is managed by LLVM as usual. - Invoke the pass as the last stage before code generation (at the same time the existing cookie-based stack protector pass is invoked). - Add unit tests for the safe stack. Original patch by Volodymyr Kuznetsov and others at the Dependable Systems Lab at EPFL; updates and upstreaming by myself. Differential Revision: http://reviews.llvm.org/D6094 llvm-svn: 239761	2015-06-15 21:07:11 +00:00
Alex Lorenz	735c47ec3e	MIR Serialization: Connect the machine function analysis pass to the MIR parser. This commit connects the machine function analysis pass (which creates machine functions) to the MIR parser, which will initialize the machine functions with the state from the MIR file and reconstruct the machine IR. This commit introduces a new interface called 'MachineFunctionInitializer', which can be used to provide custom initialization for the machine functions. This commit also introduces a new diagnostic class called 'DiagnosticInfoMIRParser' which is used for MIR parsing errors. This commit modifies the default diagnostic handling in LLVMContext - now the the diagnostics are printed directly into llvm::errs() so that the MIR parsing errors can be printed with colours. Reviewers: Justin Bogner Differential Revision: http://reviews.llvm.org/D9928 llvm-svn: 239753	2015-06-15 20:30:22 +00:00
Sanjoy Das	784582f116	Add "REQUIRES: asserts" to test case that uses -debug-only llvm-svn: 239748	2015-06-15 20:05:38 +00:00
Sanjoy Das	69fad0799e	[CodeGen] Add a pass to fold null checks into nearby memory operations. Summary: This change adds an "ImplicitNullChecks" target dependent pass. This pass folds null checks into memory operation using the FAULTING_LOAD pseudo-op introduced in previous patches. Depends on D10197 Depends on D10199 Depends on D10200 Reviewers: reames, rnk, pgavlin, JosephTremoulet, atrick Reviewed By: atrick Subscribers: ab, JosephTremoulet, llvm-commits Differential Revision: http://reviews.llvm.org/D10201 llvm-svn: 239743	2015-06-15 18:44:27 +00:00
Evgeny Astigeevich	ff1f4be4c7	On behalf of Alexandros Lamprineas: LLVM targeting aarch64 doesn't correctly produce aligned accesses for non-aligned data at -O0/fast-isel (-mno-unaligned-access). The root cause seems to be in fast-isel not producing unaligned access correctly for -mno-unaligned-access. The patch just aborts fast-isel for loads and stores when -mno-unaligned-access is present. The regression test is updated to check this new test case (-mno-unaligned-access together with fast-isel). Differential Revision: http://reviews.llvm.org/D10360 llvm-svn: 239732	2015-06-15 15:48:44 +00:00
Rafael Espindola	92200d237a	gold-plugin: save the .o when given -save-temps. The plugin now save the bitcode before and after optimizations and the .o that is passed to the linker. llvm-svn: 239726	2015-06-15 13:36:27 +00:00
Jingyue Wu	12b0c2835e	[ValueTracking] do not overwrite analysis results already computed Summary: ValueTracking used to overwrite the analysis results computed from assumes and dominating conditions. This patch fixes this issue. Test Plan: test/Analysis/ValueTracking/assume.ll Reviewers: hfinkel, majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10283 llvm-svn: 239718	2015-06-15 05:46:29 +00:00
Hao Liu	1c2e89a57a	[AArch64] Delete two empty files, which should be removed by r239713. llvm-svn: 239715	2015-06-15 02:56:40 +00:00
Hao Liu	d0ca8d7edd	[AArch64] Revert r239711 again. We need to discuss how to share code between AArch64 and ARM backend. llvm-svn: 239713	2015-06-15 01:56:40 +00:00
Hao Liu	cb070e3833	[AArch64] Match interleaved memory accesses into ldN/stN instructions. Re-commit after adding "-aarch64-neon-syntax=generic" to fix the failure on OS X. This patch was firstly committed in r239514, then reverted in r239544 because of a syntax incompatible failure on OS X. llvm-svn: 239711	2015-06-15 01:35:49 +00:00
Benjamin Kramer	228680ded8	[InstSimplify] fsub nnan x, x -> 0.0 is valid without ninf Both inf - inf and (-inf) - (-inf) are NaN, so it's already covered by nnan. llvm-svn: 239702	2015-06-14 21:01:20 +00:00
Benjamin Kramer	4f0524614e	[InstSimplify] Add self-fdiv identities for -ffinite-math-only. When NaNs and Infs are ignored we can fold X / X -> 1.0 -X / X -> -1.0 X / -X -> -1.0 llvm-svn: 239701	2015-06-14 18:53:58 +00:00
Igor Breger	5e49697138	AVX-512: Implemented DAG lowering for shuff62x2/shufi62x2 instuctions ( Shuffle Packed Values at 128-bit Granularity ) Tests added , vector-shuffle-512-v8.ll test re-generated. Differential Revision: http://reviews.llvm.org/D10300 llvm-svn: 239697	2015-06-14 13:07:47 +00:00
Michael Kuperstein	e3de07a529	Add support for parsing the XOR operator in Intel syntax inline assembly. Differential Revision: http://reviews.llvm.org/D10385 Patch by marina.yatsina@intel.com llvm-svn: 239695	2015-06-14 12:59:45 +00:00
Igor Breger	abe4a79b75	AVX-512: Implemented cvtsi2ss/d cvtusi2ss/d instructions with round control for KNL. Added intrinsics for cvtsi2ss/d instructions. Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D10430 llvm-svn: 239694	2015-06-14 12:44:55 +00:00
Colin LeMahieu	b8575b14be	[Hexagon] Adding some codegen tests and updating some to match spec. llvm-svn: 239690	2015-06-13 21:46:39 +00:00
Simon Pilgrim	d3f6427446	[DAGCombiner] Added BSWAP(BSWAP(x)) -> x combine pattern. llvm-svn: 239682	2015-06-13 16:25:12 +00:00
Simon Pilgrim	011381d48b	[DAGCombiner] Added BSWAP vector constant folding support. llvm-svn: 239675	2015-06-13 14:08:15 +00:00
Tom Stellard	45bb48ea19	R600 -> AMDGPU rename llvm-svn: 239657	2015-06-13 03:28:10 +00:00
Tim Northover	02cfdbb7f1	AArch64: map bare-metal arm64-macho triple to MachO MC layer. Far better than an assertion about expecting ELF. llvm-svn: 239647	2015-06-12 23:37:11 +00:00
Tom Stellard	12a1910e87	R600/SI: Add assembler support for FLAT instructions - Add glc, slc, and tfe operands to flat instructions - Add missing flat instructions - Fix the encoding of flat_load_dwordx3 and flat_store_dwordx3. llvm-svn: 239637	2015-06-12 20:47:06 +00:00
Colin LeMahieu	79ec06525e	[Hexagon] Making intrinsic tests agnostic to register allocation. Narrowing intrinsic parameters to appropriate width. llvm-svn: 239634	2015-06-12 19:57:32 +00:00
Rafael Espindola	de28b7375f	Don't depend on the interleaving of stdout and stderr. That can change as we change the buffering. llvm-svn: 239602	2015-06-12 12:20:03 +00:00
John Brawn	d9e39d53b6	[ARM] Disabling vfp4 should disable fp16 ARMTargetParser::getFPUFeatures should disable fp16 whenever it disables vfp4, as otherwise something like -mcpu=cortex-a7 -mfpu=none leaves us with fp16 enabled (though the only effect that will have is a wrong build attribute). Differential Revision: http://reviews.llvm.org/D10397 llvm-svn: 239599	2015-06-12 09:38:51 +00:00
Peter Collingbourne	005354b1f4	LowerBitSets: Give names to aliases of unnamed bitset element objects. It is valid for globals to be unnamed, but aliases must have a name. To avoid creating invalid IR, we need to assign names to any aliases we create that point to unnamed objects that have been moved into combined globals. llvm-svn: 239590	2015-06-12 03:25:05 +00:00
Alexey Samsonov	9947e48cd1	[GVN] Use a simpler form of IRBuilder constructor. Summary: A side effect of this change is that it IRBuilder now automatically created debug info locations for new instructions, which is the same as debug location of insertion point. This is fine for the functions in questions (GetStoreValueForLoad and GetMemInstValueForLoad), as they are used in two situations: * GVN::processLoad, which tries to eliminate a load. In this case new instructions would have the same debug location as the load they eventually replace; * MaterializeAdjustedValue, which adds new instructions to the end of the basic blocks, which could later be used to replace the load definition. In this case we don't yet know the way the load would be eventually replaced (either by assembling the precomputed values via PHI, or by using them directly), so just using the basic block strategy seems to be reasonable. There is also a special case in the code that would adjust the location of the last instruction replacing the load definition to the location of the load. Test Plan: regression test suite Reviewers: echristo, dberlin, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10405 llvm-svn: 239585	2015-06-12 01:39:48 +00:00
Reid Kleckner	81d1cc00b7	[WinEH] Put finally pointers in the handler scope table field We were putting them in the filter field, which is correct for 64-bit but wrong for 32-bit. Also switch the order of scope table entry emission so outermost entries are emitted first, and fix an obvious state assignment bug. llvm-svn: 239574	2015-06-11 23:37:18 +00:00
Reid Kleckner	a9d6253572	[WinEH] Create an llvm.x86.seh.exceptioninfo intrinsic This intrinsic is like framerecover plus a load. It recovers the EH registration stack allocation from the parent frame and loads the exception information field out of it, giving back a pointer to an EXCEPTION_POINTERS struct. It's designed for clang to use in SEH filter expressions instead of accessing the EXCEPTION_POINTERS parameter that is available on x64. This required a minor change to MC to allow defining a label variable to another absolute framerecover label variable. llvm-svn: 239567	2015-06-11 22:32:23 +00:00
Peter Collingbourne	82e657b509	Object: Prepend __imp_ when mangling a dllimport symbol in IRObjectFile. We cannot prepend __imp_ in the IR mangler because a function reference may be emitted unmangled in a constant initializer. The linker is expected to resolve such references to thunks. This is covered by the new test case. Strictly speaking we ought to emit two undefined symbols, one with __imp_ and one without, as we cannot know which symbol the final object file will refer to. However, this would require rather intrusive changes to IRObjectFile, and lld works fine without it for now. This reimplements r239437, which was reverted in r239502. Differential Revision: http://reviews.llvm.org/D10400 llvm-svn: 239560	2015-06-11 21:42:18 +00:00

1 2 3 4 5 ...

30569 Commits