llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	2d5d23d41d	llvm-nm: treat weak undefined as undefined. This matches the behavior of gnu ld. llvm-svn: 241512	2015-07-06 21:36:23 +00:00
Reid Kleckner	f2ea7e1d02	[WinEH] Add some test cases I forgot to add to previous commits llvm-svn: 241510	2015-07-06 21:13:53 +00:00
Reid Kleckner	da76bd444f	[WinEH] Insert the EH code load before the block terminator The previous code put the load after the terminator, leading to invalid IR and downstream crashes. This caused http://crbug.com/506446. llvm-svn: 241509	2015-07-06 21:13:43 +00:00
Simon Pilgrim	d85cae3d52	[X86][SSE4A] Shuffle lowering using SSE4A EXTRQ/INSERTQ instructions This patch adds support for v8i16 and v16i8 shuffle lowering using the immediate versions of the SSE4A EXTRQ and INSERTQ instructions. Although rather limited (they can only act on the lower 64-bits of the source vectors, leave the upper 64-bits of the result vector undefined and don't have VEX encoded variants), the instructions are still useful for the zero extension of any lane (EXTRQ) or inserting a lane into another vector (INSERTQ). Testing demonstrated that it wasn't typically worth it to use these instructions for v2i64 or v4i32 vector shuffles although they are capable of it. As well as adding specific pattern matching for the shuffles, the patch uses EXTRQ for zero extension cases where SSE41 isn't available and it's more efficient than the SSE2 'unpack' default approach. It also adds shuffle decode support for the EXTRQ / INSERTQ cases when the instructions are handling full byte-sized extractions / insertions. From this foundation, future patches will be able to make use of the instructions for situations that use their ability to extract/insert at the bit level. Differential Revision: http://reviews.llvm.org/D10146 llvm-svn: 241508	2015-07-06 20:46:41 +00:00
Rafael Espindola	e511051f4b	When sorting by address, undefined symbols go first. This matches gnu nm. llvm-svn: 241488	2015-07-06 19:21:04 +00:00
Reid Kleckner	fc0f93832b	[llvm-extract] Drop comdats from declarations The verifier rejects comdats on declarations. llvm-svn: 241483	2015-07-06 18:48:02 +00:00
Rafael Espindola	80c3354634	Fix printing of common symbols. Printing the symbol size matches the behavior of both gnu nm and freebsd nm. llvm-svn: 241480	2015-07-06 18:18:44 +00:00
Alex Lorenz	e2d75239d1	llc: Add a 'run-pass' option. This commit adds a 'run-pass' option to llc, which instructs the compiler to run one specific code generation pass only. Llc already has the 'start-after' and the 'stop-after' options, and this new option complements the other two by making it easier to write tests that want to invoke a single pass only. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10776 llvm-svn: 241476	2015-07-06 17:44:26 +00:00
Matt Arsenault	706f930b72	AMDGPU/SI: Add debugging subtarget feature for DS offsets We don't have a good way to detect most situations where DS offsets are usable on SI, so add an option to force using them even if unsafe for debugging performance problems. llvm-svn: 241462	2015-07-06 16:01:58 +00:00
James Y Knight	89ac11de32	[Sparc] Add more instruction aliases. These are mostly from the chart in the SparcV8 spec, section "A.3 Synthetic Instructions". Differential Revision: http://reviews.llvm.org/D9834 llvm-svn: 241461	2015-07-06 16:01:07 +00:00
James Y Knight	7208a12eef	[Sparc] Add support for flush instruction. Differential Revision: http://reviews.llvm.org/D9833 llvm-svn: 241460	2015-07-06 16:01:04 +00:00
Rafael Espindola	76d650e8d7	Check that COFF .obj files have sections with zero virtual address spaces. When talking about the virtual address of sections the coff spec says: ... for simplicity, compilers should set this to zero. Otherwise, it is an arbitrary value that is subtracted from offsets during relocation. We don't currently subtract it, so check that it is zero. If some producer does create such files, we can change getRelocationOffset instead. llvm-svn: 241447	2015-07-06 14:26:07 +00:00
Simon Pilgrim	a092fd8fc4	[X86][SSE] Added missing stack folding test for SQRTSD and SQRTSS instructions. llvm-svn: 241445	2015-07-06 14:15:02 +00:00
Asaf Badouh	c6f3c82ffc	[X86][AVX512] Multiply Packed Unsigned Integers with Round and Scale pmulhrsw review: http://reviews.llvm.org/D10948 llvm-svn: 241443	2015-07-06 14:03:40 +00:00
Petar Jovanovic	0326a06c15	[Mips] Add support for MCJIT for MIPS32r6 Add support for resolving MIPS32r6 relocations in MCJIT. Patch by Vladimir Radosavljevic. Differential Revision: http://reviews.llvm.org/D10687 llvm-svn: 241442	2015-07-06 12:50:55 +00:00
Rafael Espindola	5504eb79b4	Fix handling of ELF::R_MIPS_32 on Mips64. Thanks to Aboud, Amjad for reporting the regression and providing the testcase. llvm-svn: 241440	2015-07-06 12:18:44 +00:00
Rafael Espindola	37d8b67426	Make this test a bit more interesting. Before every test was using a section with an address of zero. llvm-svn: 241427	2015-07-06 02:45:01 +00:00
Sanjay Patel	1a6a58caf5	change CHECK to CHECK-LABEL for more precision llvm-svn: 241422	2015-07-05 23:19:16 +00:00
Sanjay Patel	787e12aec1	remove unnecessary test specifications llvm-svn: 241419	2015-07-05 22:37:51 +00:00
Sanjay Patel	8ee056c5e7	minimize test case and remove unnecessary opt passes llvm-svn: 241418	2015-07-05 22:30:12 +00:00
Peter Collingbourne	46eb0f539c	Verifier: Forbid comdats on linker declarations. Differential Revision: http://reviews.llvm.org/D10945 llvm-svn: 241414	2015-07-05 20:52:40 +00:00
Simon Pilgrim	08049fedc7	[X86][SSE3] Just use an explicit SSE3 target attribute - not a cpu type. Merged arch/target into a specific triple - we had i686 and x86_64 targets overriding each other.... llvm-svn: 241410	2015-07-05 19:06:32 +00:00
Simon Pilgrim	434cb684a8	[X86][SSE2] Just use an explicit SSE2 target attribute - not a cpu type. corei7 is capable of a lot more than just SSE2.... llvm-svn: 241409	2015-07-05 19:03:51 +00:00
Asaf Badouh	73f26f8ffc	[x86][AVX512] add Multiply High Op include encoding and intrinsics tests. review http://reviews.llvm.org/D10896 llvm-svn: 241406	2015-07-05 12:23:20 +00:00
Michael Kuperstein	5f05153fbb	[X86] Fix incorrect/inefficient pushw encodings for x86-64 targets Correctly support assembling "pushw $imm8" on x86-64 targets. Also some cleanup of the PUSH instructions (PUSH64i16 and PUSHi16 actually represent the same instruction) This fixes PR23996 Patch by: david.l.kreitzer@intel.com Differential Revision: http://reviews.llvm.org/D10878 llvm-svn: 241404	2015-07-05 10:25:41 +00:00
Nemanja Ivanovic	d358b8f80d	Add missing builtins to the PPC back end for ABI compliance (vol. 2) This patch corresponds to review: http://reviews.llvm.org/D10874 Back end portion of the second round of additions to altivec.h. llvm-svn: 241398	2015-07-05 06:03:51 +00:00
Simon Pilgrim	ea1b6ee366	[X86][SSE] Improved i8/i16 to f64 uint2fp vector conversions Followup to D10433 and D10589 that fixes i8/i16 uint2fp vector conversions by zero extending to i32 and using the sint2fp path (unless the target does actually support uint2fp). llvm-svn: 241394	2015-07-04 15:33:34 +00:00
Lang Hames	78937c2ae5	[RuntimeDyld] Skip relocations for external symbols with 64-bit address ~0ULL. Requested by Eugene Rozenfeld of the LLILC team, this feature allows JIT clients to skip relocations for selected external symbols by returning ~0ULL from their symbol resolver. If this value is returned for a given symbol, RuntimeDyld will skip all relocations for that symbol. The client will be responsible for applying the skipped relocations manually before the code is executed. llvm-svn: 241383	2015-07-04 01:35:26 +00:00
Craig Topper	de8395229a	[X86] Add proper 64-bit mode checks to jrcxz and jcxz. llvm-svn: 241381	2015-07-04 00:01:07 +00:00
Simon Atanasyan	5db0276925	[ELFYAML] Fix handling SHT_NOBITS sections by obj2yaml/yaml2obj tools SHT_NOBITS sections do not have content in an object file. Now the yaml2obj tool does not accept `Content` field for such sections, and the obj2yaml tool does not attempt to read the section content from a file. Restore r241350 and r241352. llvm-svn: 241377	2015-07-03 23:00:54 +00:00
Simon Pilgrim	c36cfe7af0	[X86] Added 32-bit builds to fp<->int tests. Ensure that i686 x87/SSE/SSE2 targets all build. llvm-svn: 241368	2015-07-03 20:07:57 +00:00
Rafael Espindola	e9da9aa4f3	This reverts commit r241350 and r241352. r241350 broke lld tests. r241352 depends on r241350. Original messages: "[ELFYAML] Fix handling SHT_NOBITS sections by obj2yaml/yaml2obj tools" "[ELFYAML] Make the Size field for .bss section optional" llvm-svn: 241354	2015-07-03 14:54:02 +00:00
Simon Atanasyan	d0f7b425a7	[ELFYAML] Make the Size field for .bss section optional It's a common case to have a zero-size .bss section in an object file. llvm-svn: 241352	2015-07-03 14:19:06 +00:00
Simon Atanasyan	b776eaed2e	[ELFYAML] Fix handling SHT_NOBITS sections by obj2yaml/yaml2obj tools SHT_NOBITS sections do not have content in an object file. Now yaml2obj tool does not accept `Content` field for such sections, and obj2yaml tool does not attempt to read the section content from a file. llvm-svn: 241350	2015-07-03 14:07:06 +00:00
NAKAMURA Takumi	7779f75cc8	llvm/test/CodeGen/ARM/fnattr-trap.ll: Add -mtriple, to appease targeting *-win32. LLVM ERROR: CPU: 'generic' does not support ARM mode execution! llvm-svn: 241329	2015-07-03 08:21:38 +00:00
Simon Pilgrim	6a8e75c735	whitespace tidyup. NFC. llvm-svn: 241326	2015-07-03 08:02:12 +00:00
Simon Pilgrim	b504263e4a	[X86][SSE] Sign extension for target vector sizes less than 128 bits (pt2) Add support for v2i8/v2i16 to v2f64 by using a sign extension to v2i32 before conversion to v2f64. Differential Revision: http://reviews.llvm.org/D10589 llvm-svn: 241325	2015-07-03 08:01:36 +00:00
Simon Pilgrim	385bf00ea2	[X86][SSE] Sign extension for target vector sizes less than 128 bits (pt1) This patch adds support for sign extension for sub 128-bit vectors, such as to v2i32. It concatenates with UNDEF subvectors up to 128-bits, performs the sign extension (i.e. as v4i32) and then extracts the target subvector. Patch 1/2 of D10589 - the second patch covers the conversion of v2i8/v2i16 to v2f64. llvm-svn: 241323	2015-07-03 07:51:01 +00:00
Nadav Rotem	754eb7c563	Fix an overly aggressive assertion in getCopyFromPartsVector. The assertion in getCopyFromPartsVector assumed that the vector 'part' must match the type of argument (arguments are potentially split into multiple parts). However, in some cases the targets return a 'part' of the right size but with a different type. We already handle this case correctly later on and generate a bitcast. This commit just makes sure that we are actually checking the property that we care about. llvm-svn: 241312	2015-07-02 23:23:52 +00:00
Akira Hatanaka	56c70441dc	Use function attribute "trap-func-name" and remove TargetOptions::TrapFuncName. This commit changes normal isel and fast isel to read the user-defined trap function name from function attribute "trap-func-name" attached to llvm.trap or llvm.debugtrap instead of from TargetOptions::TrapFuncName. This is needed to use clang's command line option "-ftrap-function" for LTO and enable changing the trap function name on a per-call-site basis. Out-of-tree projects currently using TargetOptions::TrapFuncName to specify the trap function name should attach attribute "trap-func-name" to the call sites of llvm.trap and llvm.debugtrap instead. rdar://problem/21225723 Differential Revision: http://reviews.llvm.org/D10832 llvm-svn: 241305	2015-07-02 22:13:27 +00:00
Bill Schmidt	a1c30053e7	[PPC64LE] Remove implicit-subreg restriction from VSX swap removal In r241285, I removed the SUBREG_TO_REG restriction from VSX swap removal, determining that this was overly conservative. We have another form of the same restriction in that we check for the presence of implicit subregs in vector operations. As with SUBREG_TO_REG for partial register conversions, an implicit subreg is safe in and of itself, provided no other operation makes a lane-sensitive assumption about the result. This patch removes that restriction, by removing the HasImplicitSubreg flag and all code that relies on it. I've added a test case that fails to optimize before this patch is applied, and optimizes properly with the patch. Test based on a report from Anton Blanchard. llvm-svn: 241290	2015-07-02 19:01:22 +00:00
Bill Schmidt	7c691fee1c	[PPC64LE] Teach swap optimization about the doubleword splat idiom With a previous patch, the VSX swap optimization is able to recognize the doubleword load-splat idiom that can be implemented using lxvdsx. However, that does not cover a doubleword splat where the source is a register. We can implement this using xxspltd (a special form of xxpermdi). This patch teaches the swap optimization pass about this idiom. As a prerequisite, it also permits swap optimization to succeed for all forms of SUBREG_TO_REG. Previously we were conservative and only allowed SUBREG_TO_REG when it copied a full register. However, on reflection any form of SUBREG_TO_REG is safe in and of itself, so long as an unsafe operation is not performed on its result. In particular, a widening SUBREG_TO_REG often occurs as an input to a doubleword splat idiom, particularly in auto-vectorized code. The doubleword splat idiom is an XXPERMDI operation where both source registers are identical, and the selection mask is either 0 (splat the first element) or 3 (splat the second element). To determine whether the registers are identical, we use the existing mechanism for looking through "copy-like" operations. That mechanism has a side effect of marking the XXPERMDI operation as using a physical register, which would invalidate its presence in a swap-optimized region. This is correct for the form of XXPERMDI that performs a swap and hence would be removed, but is not what we want for a doubleword-splat variety of XXPERMDI. Therefore we reset the physical-register flag on the XXPERMDI when it represents a splat. A simple test case is added to verify that we generate the splat and that we also remove the xxswapd instructions that would otherwise be associated with the load and store of another operand. llvm-svn: 241285	2015-07-02 17:03:06 +00:00
Gabor Ballabas	5fe650c5e1	Reworking the test part of r241149 The test part of r241149 has been reverted in r241451, due to misplaced test cases. This patch splits those test cases among the appropriate targets. Differential Revision: http://reviews.llvm.org/D10897 llvm-svn: 241283	2015-07-02 16:53:23 +00:00
Rafael Espindola	4e7212177f	Fix for PR23310: llvm-dis crashes when trying to upgrade an intrinsic. When trying to upgrade @llvm.x86.sse2.psrl.dq while parsing a module, BitcodeReader adds the function to its worklist twice, resulting in a crash when accessing it the second time. This patch replaces the worklist vector by a map. Patch by Philip Pfaffe. llvm-svn: 241281	2015-07-02 16:22:40 +00:00
Michael Kuperstein	16d307fb80	[X86] Convert an instruction relaxation test to use objdump instead of readobj Patch by: david.l.kreitzer@intel.com llvm-svn: 241270	2015-07-02 14:27:35 +00:00
Rafael Espindola	2119a96279	Improve error message. Thanks to Sean Silva for the suggestion. llvm-svn: 241255	2015-07-02 11:48:48 +00:00
Pawel Bylica	c52eabb285	Reapply r240291: Fix shl folding in DAG combiner. The code responsible for shl folding in the DAGCombiner was assuming incorrectly that all constants are less than 64 bits. This patch simply changes the way values are compared. It has been reverted previously because of some problems with comparing APInt with raw uint64_t. That has been fixed/changed with r241204. llvm-svn: 241254	2015-07-02 11:44:54 +00:00
Sanjoy Das	7869d4b846	[LazyCallGraph] Port test case from r240039 to LCG. Summary: r240039 adds a test case to check that CallGraph does the right thing with respect to non-leaf intrinsics like statepoint and patchpoint. This ports the same test case to LazyCallGraph. LazyCallGraph already does the right thing with respect to escaping function pointers so there is no need to change any code. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10582 llvm-svn: 241226	2015-07-02 02:03:58 +00:00
Eric Christopher	ced3032be5	Make an X86 specific directory and put the recent X86 tti specific inlining test into it. llvm-svn: 241223	2015-07-02 01:36:31 +00:00
Eric Christopher	e100226879	Implement TargetTransformInfo::hasCompatibleFunctionAttributes for X86. This checks subtarget feature compatibility for inlining by verifying that the callee is a strict subset of the caller's features. This includes the cpu as part of the subtarget we can get via the incoming functions as the backend takes CPUs as feature sets. This allows us to inline things like: int foo() { return baz(); } int __attribute__((target("sse4.2"))) bar() { return foo(); } so that generic code can be inlined into specialized functions. llvm-svn: 241221	2015-07-02 01:11:50 +00:00
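
The feature-compatibility rule described in e100226879 (a callee may be inlined when its subtarget features are contained in the caller's) can be illustrated with a small standalone sketch. This is not LLVM's implementation: `FeatureSet` and `calleeFeaturesAreSubset` are hypothetical names, features are modelled as plain strings, and the sketch assumes that identical feature sets also count as compatible.

```cpp
#include <algorithm>
#include <set>
#include <string>

// Hypothetical stand-in for a function's subtarget features (not an LLVM type).
using FeatureSet = std::set<std::string>;

// Returns true when every feature the callee relies on is also enabled in the
// caller, i.e. the callee's features are a subset of the caller's.
// std::includes needs sorted ranges, which std::set guarantees.
bool calleeFeaturesAreSubset(const FeatureSet &caller, const FeatureSet &callee) {
  return std::includes(caller.begin(), caller.end(),
                       callee.begin(), callee.end());
}
```

Under these assumptions, a generic callee with features {"sse2"} passes the check against a caller with {"sse2", "sse4.2"}, which corresponds to inlining foo() into bar() in the commit message. The reverse direction fails, because the specialized function needs a feature the generic caller lacks.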


30774 Commits