llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	427154db2a	[X86][AVX512] Converted the VSHUFPD intrinsics to generic IR llvm-svn: 274523	2016-07-04 21:30:47 +00:00
Simon Pilgrim	30db811526	[X86][AVX512] Converted the VPERMPD/VPERMQ intrinsics to generic IR llvm-svn: 274502	2016-07-04 13:34:44 +00:00
Simon Pilgrim	17388f2569	[X86][AVX512] Converted the VPERMILPD/VPERMILPS intrinsics to generic IR llvm-svn: 274492	2016-07-04 11:06:15 +00:00
Craig Topper	ac1823f6e9	[AVX512] Modify what indices we emit for the zero vector we use for zero extension of the result of a v2i1 or v4i1 masked compare. This way we emit something that the backend easily interprets as a concatenation rather than a true shuffle. This delivers slightly better codegen with the current backend capabilities. llvm-svn: 274484	2016-07-04 07:09:46 +00:00
Simon Pilgrim	275d721485	[X86][AVX512] Converted the MOVDDUP/MOVSLDUP/MOVSHDUP masked intrinsics to generic IR llvm companion patch imminent llvm-svn: 274442	2016-07-02 17:16:25 +00:00
Craig Topper	b3a4477b13	[X86] Replace 128-bit and 256 masked vpermilps/vpermilpd builtins with native IR. llvm-svn: 274425	2016-07-02 05:36:43 +00:00
Pirama Arumuga Nainar	54a213d280	Add TargetInfo for 32-bit and 64-bit RenderScript Summary: The TargetInfo for 'renderscript32' and 'renderscript64' ArchTypes are subclasses of ARMleTargetInfo and AArch64leTargetInfo respectively. RenderScript32TargetInfo modifies the ARM ABI to set LongWidth and LongAlign to be 64-bits. Other than this modification, the underlying TargetInfo base classes is initialized as if they have "armv7" and "aarch64" architecture type respectively. Reviewers: rsmith, echristo Subscribers: aemerson, tberghammer, cfe-commits, danalbert, mehdi_amini, srhines Differential Revision: http://reviews.llvm.org/D21334 llvm-svn: 274409	2016-07-02 00:05:42 +00:00
Tim Shen	53547d95ca	Removes CHECKs for symbolic label names (as Debug Clang will generate). Differential Revision: http://reviews.llvm.org/D20499 llvm-svn: 274396	2016-07-01 22:50:00 +00:00
Tim Shen	ff12edbff4	Remove unnecessary CHECKs from r274385 llvm-svn: 274387	2016-07-01 21:16:58 +00:00
Tim Shen	421119fd89	[Temporary, Lifetime] Add lifetime marks for temporaries With all MaterializeTemporaryExprs coming with a ExprWithCleanups, it's easy to add correct lifetime.end marks into the right RunCleanupsScope. Differential Revision: http://reviews.llvm.org/D20499 llvm-svn: 274385	2016-07-01 21:08:47 +00:00
Matt Arsenault	f652caea65	Emit more intrinsics for builtin functions This is important for building libclc. Since r273039 tests are failing due to now emitting calls to these functions instead of emitting the DAG node. The libm function names are implemented for OpenCL, and should call the locally defined versions, so -fno-builtin is used. The IR Some functions use the __builtins and expect the intrinsics to be emitted. Without this we end up with nobuiltin calls to intrinsics or to unsupported library calls. llvm-svn: 274370	2016-07-01 17:38:14 +00:00
Michael Zuckerman	3f316abdce	[Clang][Intrinsics][AVX512][BuiltIn] adding intrinsics for vrangesd instruction set Differential Revision: http://reviews.llvm.org/D21734 llvm-svn: 274218	2016-06-30 08:05:46 +00:00
David Majnemer	b4b671e4a8	[CodeView] Implement support for bitfields in Clang Emit the underlying storage offset in addition to the starting bit position of the field. This fixes PR28162. Differential Revision: http://reviews.llvm.org/D21783 llvm-svn: 274201	2016-06-30 03:01:59 +00:00
Simon Pilgrim	6350054017	[X86][SSE2] Updated tests to match llvm\test\CodeGen\X86\sse2-intrinsics-fast-isel-x86_64.ll llvm-svn: 274126	2016-06-29 14:04:08 +00:00
Igor Breger	2c880cf9b1	[AVX512] Zero extend cmp intrinsic return value. Differential Revision: http://reviews.llvm.org/D21746 llvm-svn: 274110	2016-06-29 08:14:17 +00:00
Artur Pilipenko	70d4bb566c	Update the expected masked load/store intrinsics names in tests The mangling of their names was changed in order to support arbitrary addrspace pointers as arguments in rL274043. llvm-svn: 274044	2016-06-28 18:28:45 +00:00
Chris Dewhurst	7cc4cfe4fc	[SPARC] Allows inlining of atomics for Sparc32 with appropriate store barrier. The final change is required to extend the back-end's AtomicExpandPass that was implemented for Sparc (64 bit) and later extended for Sparc (32 bit). llvm-svn: 274012	2016-06-28 12:55:55 +00:00
Asaf Badouh	57819aa185	[X86] add _mm_loadu_si64 Differential Revision: http://reviews.llvm.org/D21504 llvm-svn: 273812	2016-06-26 13:51:54 +00:00
Craig Topper	50e3dfe9d0	[X86] Fix pslldq/psrldq intrinsics to not fail compilation with immediates larger than 16. This was accidentally broken in r272246. llvm-svn: 273775	2016-06-25 07:31:14 +00:00
Rafael Espindola	0fa668072f	Add support for musl-libc on ARM Linux. Patch by Lei Zhang! llvm-svn: 273735	2016-06-24 21:35:06 +00:00
Peter Collingbourne	8dd14da0dc	CodeGen: Update Clang to use the new type metadata. Differential Revision: http://reviews.llvm.org/D21054 llvm-svn: 273730	2016-06-24 21:21:46 +00:00
Strahinja Petrovic	7ba5bf5dc7	Fix make-check issues Fixing build issue for test test/CodeGen/struct-union-BE.c. llvm-svn: 273675	2016-06-24 13:11:15 +00:00
Strahinja Petrovic	515a1eb44c	This patch fixes problem with passing structures and unions smaller than register as argument in variadic functions on big endian architectures. Differential Revision: http://reviews.llvm.org/D21611 llvm-svn: 273665	2016-06-24 12:12:41 +00:00
Dehao Chen	bd3ed3c55b	Invoke simplifycfg and sroa before instcombine. Summary: InstCombine needs to be performed after simplifycfg and sroa, otherwise it may make bad optimization decisions. Reviewers: davidxl, wmi, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21568 llvm-svn: 273606	2016-06-23 20:13:10 +00:00
Saleem Abdulrasool	6e9e88b30a	CodeGen: support linker options on Windows ARM We would incorrectly emit the directive sections due to the missing overridden methods. We now emit the expected "/DEFAULTLIB" rather than "-l" options for requested linkage llvm-svn: 273558	2016-06-23 13:45:33 +00:00
Craig Topper	79f53ca0b5	[AVX512] Replace masked unpack builtins with shufflevector and selects. llvm-svn: 273533	2016-06-23 06:36:42 +00:00
Hans Wennborg	44d061a471	Add support for /Ob1 and -finline-hint-functions flags Add support for /Ob1 (and equivalent -finline-hint-functions), which enable inlining only for functions marked inline, either explicitly (via inline keyword, for example), or implicitly (function definition in class body, for example). This works by enabling inlining pass, and adding noinline attribute to every function not marked inline. Patch by Rudy Pons <rudy.pons@ilod.org>! Differential Revision: http://reviews.llvm.org/D20647 llvm-svn: 273440	2016-06-22 16:56:16 +00:00
Hans Wennborg	9565cf581e	Widen EHScope::CleanupBitFields::FixupDepth to avoid overflowing it (PR23490) It currently only takes 2048 gotos to overflow the FixupDepth bitfield, causing silent miscompilation. Apparently some parser generators run into this (see PR). I don't know that that data structure is terribly size sensitive anyway, and since there's no room to widen the bitfield, let's just use a separate word in EHCatchScope for it. Differential Revision: http://reviews.llvm.org/D21566 llvm-svn: 273434	2016-06-22 16:21:14 +00:00
Michael Zuckerman	716859aa64	[Clang][bmi][intrinsics] Adding _mm_tzcnt_64 _mm_tzcnt_32 intrinsics to clang. Differential Revision: http://reviews.llvm.org/D21373 llvm-svn: 273401	2016-06-22 12:32:43 +00:00
Craig Topper	08181f795f	[AVX512] Fix _mm_setzero_di to not require avx512vl since its used by the avx512dqintrin.h. Also update the avx512dq test to not enable avx512vl feature so we can ensure correct dependencies. llvm-svn: 273388	2016-06-22 06:36:21 +00:00
Craig Topper	d1691c7026	[AVX512] Replace masked integer cmp and ucmp builtins with native IR. llvm-svn: 273378	2016-06-22 04:47:58 +00:00
Craig Topper	c56f0f8485	[AVX512] Use correct types for mask parameters in avx512vlbw cmp builtin tests. llvm-svn: 273377	2016-06-22 04:47:55 +00:00
Peter Collingbourne	aa463c2a18	Require an x86 target for the thinlto_backend.ll test. llvm-svn: 273361	2016-06-22 01:40:47 +00:00
Peter Collingbourne	2ff9c25d93	Specify a target triple to fix the test on non-Linux. llvm-svn: 273356	2016-06-22 01:17:30 +00:00
Peter Collingbourne	91227f2195	CodeGen: Replace test/CodeGen/thinlto_backend.c with a functional test. This new test tests that functions are capable of being imported, rather than that the import pass is run. This new test is compatible with the approach being developed in D20268 which runs the importer on its own rather than in a pass. Differential Revision: http://reviews.llvm.org/D21542 llvm-svn: 273347	2016-06-22 00:57:26 +00:00
Pirama Arumuga Nainar	a7484c9180	Emit the DWARF tag for the RenderScript language Summary: If the RenderScript LangOpt is set, either via '-x renderscript' or the '.rs' file extension, set the DWARF language tag to be that of RenderScript. Reviewers: rsmith Subscribers: cfe-commits, srhines Differential Revision: http://reviews.llvm.org/D21451 llvm-svn: 273321	2016-06-21 21:35:11 +00:00
Sanjay Patel	a4d156980e	[x86] AVX FP compare builtins should require AVX target feature (PR28112) This is a fix for PR28112: https://llvm.org/bugs/show_bug.cgi?id=28112 The FP comparison intrinsics that take an immediate parameter (rather than specifying a comparison predicate in the function name) were added with AVX; these are macros in avxintrin.h. This patch makes clang behavior match gcc (error if a program tries to use these without -mavx) and matches the Intel documentation, eg: VCMPPS: __m128 _mm_cmp_ps(__m128 a, __m128 b, const int imm) 'V' means this is intended to only work with the AVX form of the instruction. Differential Revision: http://reviews.llvm.org/D21306 llvm-svn: 273311	2016-06-21 20:22:55 +00:00
Dehao Chen	1997d8684f	Invoke PruneEH pass before Sample Profile pass. Summary: We need to call PruneEH pass before AutoFDO pass so that some EH-related calls can get inlined in Sample Profile pass. Reviewers: davidxl, dnovillo Subscribers: junbuml, llvm-commits Differential Revision: http://reviews.llvm.org/D21197 llvm-svn: 273298	2016-06-21 19:16:41 +00:00
Artem Belevich	4987dc85b4	[aarch64] Update datalayout for aarch64 tests This brings the tests in sync with the changes in r273280. llvm-svn: 273289	2016-06-21 17:35:31 +00:00
Craig Topper	879b0978f4	[AVX512] Move the 128-bit and 256-bit lzcnt intrinsics to avx512vlcdintrin.h where they belong. llvm-svn: 273249	2016-06-21 06:53:58 +00:00
Simon Pilgrim	03a899957f	[X86][XOP] Refreshed builtin tests ready for creation of llvm fast-isel tests llvm-svn: 273090	2016-06-18 18:20:14 +00:00
Simon Pilgrim	c44a3b9599	[X86][TBM] Refreshed builtin tests ready for creation of llvm fast-isel tests llvm-svn: 273086	2016-06-18 17:09:40 +00:00
David Majnemer	3370c20c7e	[CodeGen] Use pointer-sized integers for ptrtoint sources Given something like: void *v = (void *)100; We need to synthesize a ptrtoint operation from 100. During constant emission, we choose i64 as the type for our constant because it guaranteed not to drop any bits from our CharUnits representation of the value. However, this is suboptimal for 32-bit targets: LLVM passes like GlobalOpt will get confused by these sorts of casts resulting in pessimization. Instead, make sure the ptrtoint operand has a pointer-sized integer type. llvm-svn: 273020	2016-06-17 17:47:24 +00:00
Simon Pilgrim	d39d026324	[X86][SSE4A] Use native IR for mask movntsd/movntss intrinsics. Depends on llvm side commit r273002. llvm-svn: 273003	2016-06-17 14:28:16 +00:00
Ranjeet Singh	ca2b3e7b5c	[ARM] Add mrrc/mrrc2 intrinsics and update existing mcrr/mcrr2 intrinsics. Reapplying patch in r272777 which was reverted because the llvm patch which added support for generating the mcrr/mcrr2 instructions from the intrinsic was causing an assertion failure. This has now been fixed in llvm. llvm-svn: 272983	2016-06-17 00:59:41 +00:00
George Burgess IV	419996ccb5	[CodeGen] Fix a segfault caused by pass_object_size. This patch fixes a bug where we'd segfault (in some cases) if we saw a variadic function with one or more pass_object_size arguments. Differential Revision: http://reviews.llvm.org/D17462 llvm-svn: 272971	2016-06-16 23:06:04 +00:00
Sanjay Patel	dbd68dd09d	[x86] generate IR for AVX2 integer min/max builtins Sibling patch to r272932: http://reviews.llvm.org/rL272932 llvm-svn: 272933	2016-06-16 18:45:01 +00:00
Marcin Koscielnicki	a46fade624	[Builtin] Make __builtin_thread_pointer target-independent. This is now supported for ARM, AArch64, PowerPC, SystemZ, SPARC, Mips. Differential Revision: http://reviews.llvm.org/D19589 llvm-svn: 272893	2016-06-16 13:41:54 +00:00
Sanjay Patel	280cfd1a69	[x86] translate SSE packed FP comparison builtins to IR As noted in the code comment, a potential follow-on would be to remove the builtins themselves. Other than ord/unord, this already works as expected. Eg: typedef float v4sf __attribute__((__vector_size__(16))); v4sf fcmpgt(v4sf a, v4sf b) { return a > b; } Differential Revision: http://reviews.llvm.org/D21268 llvm-svn: 272840	2016-06-15 21:20:04 +00:00
Sanjay Patel	7495ec026e	[x86] generate IR for SSE integer min/max builtins Sibling patch to r272806: http://reviews.llvm.org/rL272806 llvm-svn: 272807	2016-06-15 17:18:50 +00:00
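
Note on commit 9565cf581e (FixupDepth, PR23490): the message says 2048 gotos are enough to overflow the bitfield, which suggests an 11-bit counter. The following is a minimal sketch of that failure mode and of the "separate word" fix, not the actual Clang data structure. The struct names, the `Kind` field, the 11-bit width, and the helpers `depthAfter` and `checkFixupDepthWidths` are all hypothetical and chosen only for illustration.

```cpp
#include <cassert>

// Hypothetical stand-in for a packed scope header. The 11-bit width is an
// assumption inferred from the "2048 gotos" figure in the commit message.
struct PackedScopeBits {
  unsigned Kind : 3;
  unsigned FixupDepth : 11; // can hold 0..2047 only
};

// Hypothetical layout after the fix: the depth lives in its own full word.
struct WideScopeBits {
  unsigned Kind : 3;
  unsigned FixupDepth; // full unsigned range
};

// Simulate recording one fixup per goto and return the stored depth.
template <typename Bits>
unsigned depthAfter(unsigned Gotos) {
  Bits B{};
  for (unsigned I = 0; I < Gotos; ++I)
    B.FixupDepth = B.FixupDepth + 1; // the narrow field silently wraps
  return B.FixupDepth;
}

// Checks the behaviour described above (wrapping of an unsigned bit-field
// is what mainstream compilers do; C++20 guarantees it).
void checkFixupDepthWidths() {
  assert(depthAfter<PackedScopeBits>(2047) == 2047);
  assert(depthAfter<PackedScopeBits>(2048) == 0);    // silent wrap-around
  assert(depthAfter<WideScopeBits>(2048) == 2048);   // no overflow
}
```

Calling `checkFixupDepthWidths()` from a test driver exercises both layouts. The wrap to zero happens without any diagnostic, which is why the commit describes the result as silent miscompilation.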

3762 Commits