llvm-project

Commit Graph

Author	SHA1	Message	Date
Joey Gouly	fa76b49cef	[OpenCL] Add missing subgroup builtins This adds get_kernel_max_sub_group_size_for_ndrange and get_kernel_sub_group_count_for_ndrange. llvm-svn: 309678	2017-08-01 13:27:09 +00:00
Joey Gouly	53160cdc45	[OpenCL] Enable subgroup extension in tests This fixes the test, so that it can be run on different hosts that may have different OpenCL extensions enabled. llvm-svn: 309571	2017-07-31 15:50:27 +00:00
Joey Gouly	84ae3364df	[OpenCL] Add extension Sema check for subgroup builtins Check the subgroup extension is enabled, before doing other Sema checks. llvm-svn: 309567	2017-07-31 15:15:59 +00:00
Alexey Sotkin	7d7f0dc08b	[OpenCL] Fix access qualifiers metadata for kernel arguments with typedef Subscribers: cfe-commits, yaxunl, Anastasia Differential Revision: https://reviews.llvm.org/D35420 llvm-svn: 309155	2017-07-26 18:49:54 +00:00
Egor Churaev	53f9a30543	[OpenCL] Added extended tests on metadata generation for half data type and arrays. Reviewers: Anastasia Reviewed By: Anastasia Subscribers: bader, cfe-commits, yaxunl Differential Revision: https://reviews.llvm.org/D35000 llvm-svn: 308266	2017-07-18 06:04:01 +00:00
Yaxun Liu	25d1b4341f	[AMDGPU] Fix size and alignment of size_t and pointer types Differential Revision: https://reviews.llvm.org/D34995 llvm-svn: 307121	2017-07-05 04:58:24 +00:00
Yaxun Liu	3ba4a720ad	[AMDGPU] Fix regressions on mesa/clover with libclc due to address space Currently AMDGPUTargetInfo does not initialize AddrSpaceMap in constructor, which causes regressions in mesa/clover with libclc. This patch fixes that. Differential Revision: https://reviews.llvm.org/D34987 llvm-svn: 307105	2017-07-04 19:57:18 +00:00
Yaxun Liu	e9e5c4f975	CodeGen: Fix invalid bitcast for coerced function argument Clang assumes coerced function argument is in address space 0, which is not always true and results in invalid bitcasts. This patch fixes failure in OpenCL conformance test api/get_kernel_arg_info with amdgcn---amdgizcl triple, where non-zero alloca address space is used. Differential Revision: https://reviews.llvm.org/D34777 llvm-svn: 306721	2017-06-29 18:47:45 +00:00
Alexey Bader	364a11651e	[OpenCL] Fix OpenCL and SPIR version metadata generation. Summary: OpenCL and SPIR version metadata must be generated once per module instead of once per mangled global value. Reviewers: Anastasia, yaxunl Reviewed By: Anastasia Subscribers: ahatanak, cfe-commits Differential Revision: https://reviews.llvm.org/D34235 llvm-svn: 305796	2017-06-20 14:30:18 +00:00
Pekka Jaaskelainen	2eb0bcc9e6	[OpenCL] spir_kern by default: fix old test cases llvm-svn: 304396	2017-06-01 08:19:43 +00:00
Pekka Jaaskelainen	fc2629a65a	[OpenCL] Makes kernels use the SPIR_KERNEL CC by default. Rationale: OpenCL kernels are called via an explicit runtime API with arguments set with clSetKernelArg(), not as normal sub-functions. Return SPIR_KERNEL by default as the kernel calling convention to ensure the fingerprint is fixed in such a way that each OpenCL argument gets one matching argument in the produced kernel function argument list to enable feasible implementation of clSetKernelArg() with aggregates etc. If we used the default C calling conv here, clSetKernelArg() might break depending on the target-specific conventions; different targets might split structs passed as values to multiple function arguments etc. https://reviews.llvm.org/D33639 llvm-svn: 304389	2017-06-01 07:18:49 +00:00
Javed Absar	0841d620c5	Fix issue with test that caused buildbot failure These tests did not specify the target. The failure was triggered by change - https://reviews.llvm.org/D33205 http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15-full/builds/7314 which sets vector alignment to 8-byte for arm-targets (except for Android). So, fixing the test to make it target specific. llvm-svn: 304210	2017-05-30 13:34:26 +00:00
Egor Churaev	dd7d82c408	[OpenCL] Test on half immediate support. Reviewers: Anastasia Reviewed By: Anastasia Subscribers: yaxunl, cfe-commits, bader Differential Revision: https://reviews.llvm.org/D33592 llvm-svn: 304134	2017-05-29 07:44:22 +00:00
Mehdi Amini	6aa9e9b41a	IRGen: Add optnone attribute on function during O0 Among other things, this will help LTO to correctly handle/honor files compiled with O0, helping debugging failures. It also seems in line with how we handle other options, like how -fnoinline adds the appropriate attribute as well. Differential Revision: https://reviews.llvm.org/D28404 llvm-svn: 304127	2017-05-29 05:38:20 +00:00
Konstantin Zhuravlyov	1f144a18ff	Resubmit r303861. [AMDGPU] add __builtin_amdgcn_s_getpc Patch by Tim Corringham llvm-svn: 304033	2017-05-26 21:08:20 +00:00
Reid Kleckner	581a6c5d56	Revert "[AMDGPU] add __builtin_amdgcn_s_getpc" This reverts commit r303861, the LLVM intrinsic was reverted. llvm-svn: 303908	2017-05-25 20:28:26 +00:00
Tim Corringham	702fe45bcd	[AMDGPU] add __builtin_amdgcn_s_getpc Summary: Added the builtin corresponding to the s_getpc intrinsic added in llvm D32862 Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D33276 llvm-svn: 303861	2017-05-25 14:16:11 +00:00
Yaxun Liu	af3d4db64b	[AMDGPU] Do not require opencl triple environment for OpenCL A recent change requires opencl triple environment for compiling OpenCL program, which causes regressions in libclc. This patch fixes that. Instead of deducing language based on triple environment, it checks LangOptions. Differential Revision: https://reviews.llvm.org/D33445 llvm-svn: 303644	2017-05-23 16:15:53 +00:00
Yaxun Liu	6d96f16347	CodeGen: Cast alloca to expected address space Alloca always returns a pointer in alloca address space, which may be different from the type defined by the language. For example, in C++ the auto variables are in the default address space. Therefore cast alloca to the expected address space when necessary. Differential Revision: https://reviews.llvm.org/D32248 llvm-svn: 303370	2017-05-18 18:51:09 +00:00
Yaxun Liu	4f33b3d396	[OpenCL] Emit function-scope variable in constant address space as static variable Differential Revision: https://reviews.llvm.org/D32977 llvm-svn: 303072	2017-05-15 14:47:47 +00:00
Xiuli Pan	be6da4bbdb	[OpenCL] Add intel_reqd_sub_group_size attribute support Summary: Add intel_reqd_sub_group_size attribute support as intel extension cl_intel_required_subgroup_size from https://www.khronos.org/registry/OpenCL/extensions/intel/cl_intel_required_subgroup_size.txt Reviewers: Anastasia, bader, hfinkel, pxli168 Reviewed By: Anastasia, bader, pxli168 Subscribers: cfe-commits, yaxunl Differential Revision: https://reviews.llvm.org/D30805 llvm-svn: 302125	2017-05-04 07:31:20 +00:00
Adrian Prantl	c3782a1a6f	Debug Info: Remove special-casing of indirect function argument handling. LLVM has changed the semantics of dbg.declare for describing function arguments. After this patch a dbg.declare always takes the address of a variable as the first argument, even if the argument is not an alloca. https://bugs.llvm.org/show_bug.cgi?id=32382 rdar://problem/31205000 llvm-svn: 300523	2017-04-18 01:22:01 +00:00
Yaxun Liu	d7523283a7	CodeGen: Let byval parameter use alloca address space Differential Revision: https://reviews.llvm.org/D32133 llvm-svn: 300487	2017-04-17 20:10:44 +00:00
Yaxun Liu	7f7f323e4f	CodeGen: Let lifetime intrinsic use alloca address space Differential Revision: https://reviews.llvm.org/D31717 llvm-svn: 300485	2017-04-17 20:03:11 +00:00
Konstantin Zhuravlyov	e668b1cd1e	[AMDGPU][GFX9] Set +fp32-denormals for >=gfx900 unless -cl-denorms-are-zero is set Differential Revision: https://reviews.llvm.org/D31482 llvm-svn: 300306	2017-04-14 05:33:57 +00:00
Yaxun Liu	b34ec829be	[OpenCL] Map default address space to alloca address space For OpenCL, the private address space qualifier is 0 in AST. Before this change, 0 address space qualifier is always mapped to target address space 0. As now target private address space is specified by alloca address space in data layout, address space qualifier 0 needs to be mapped to alloca addr space specified by the data layout. This change has no impact on targets whose alloca addr space is 0. With contributions from Matt Arsenault, Tony Tye and Wen-Heng (Jack) Chung Differential Revision: https://reviews.llvm.org/D31404 llvm-svn: 299965	2017-04-11 17:24:23 +00:00
Yaxun Liu	b122ed9181	[AMDGPU] Temporarily change constant address space from 4 to 2 for the new address space mapping Change constant address space from 4 to 2 for the new address space mapping in Clang. Differential Revision: https://reviews.llvm.org/D31771 llvm-svn: 299691	2017-04-06 19:18:36 +00:00
Stanislav Mekhanoshin	921a42314b	[AMDGPU] Translate reqd_work_group_size into amdgpu_flat_work_group_size These two attributes specify the same info in a different way. AMDGPU BE only checks the latter as a target specific attribute as opposed to language specific reqd_work_group_size. This change produces amdgpu_flat_work_group_size out of reqd_work_group_size if specified. Differential Revision: https://reviews.llvm.org/D31728 llvm-svn: 299678	2017-04-06 18:15:44 +00:00
Egor Churaev	a8d2451533	[OpenCL] Enables passing sampler initializer to function argument Reviewers: Anastasia, cfe-commits Reviewed By: Anastasia Subscribers: yaxunl, bader Differential Revision: https://reviews.llvm.org/D31594 llvm-svn: 299524	2017-04-05 09:02:56 +00:00
Jin-Gu Kang	e7cdcdea73	Preserve vec3 type. Summary: Preserve vec3 type with CodeGen option. Reviewers: Anastasia, bruno Reviewed By: Anastasia Subscribers: bruno, ahatanak, cfe-commits Differential Revision: https://reviews.llvm.org/D30810 llvm-svn: 299445	2017-04-04 16:40:25 +00:00
Egor Churaev	ba8b84d7fb	[OpenCL] Do not generate "kernel_arg_type_qual" metadata for non-pointer args Summary: "kernel_arg_type_qual" metadata should contain const/volatile/restrict tags only for pointer types to match the corresponding requirement of the OpenCL specification. OpenCL 2.0 spec 5.9.3 Kernel Object Queries: CL_KERNEL_ARG_TYPE_VOLATILE is returned if the argument is a pointer and the referenced type is declared with the volatile qualifier. [...] Similarly, CL_KERNEL_ARG_TYPE_CONST is returned if the argument is a pointer and the referenced type is declared with the restrict or const qualifier. [...] CL_KERNEL_ARG_TYPE_RESTRICT will be returned if the pointer type is marked restrict. Reviewers: Anastasia, cfe-commits Reviewed By: Anastasia Subscribers: bader, yaxunl Differential Revision: https://reviews.llvm.org/D31321 llvm-svn: 299192	2017-03-31 10:14:52 +00:00
Egor Churaev	45c26ee0bf	[OpenCL] Extended mapping of parsing CodeGen arguments Summary: Enable cl_mad_enable and cl_no_signed_zeros options when user turns on cl_unsafe_math_optimizations or cl_fast_relaxed_math options. Reviewers: Anastasia, cfe-commits Reviewed By: Anastasia Subscribers: bader, yaxunl Differential Revision: https://reviews.llvm.org/D31324 llvm-svn: 298838	2017-03-27 10:38:01 +00:00
Yaxun Liu	3464f92e23	[AMDGPU] Switch address space mapping by triple environment amdgiz For target environment amdgiz and amdgizcl (giz means Generic Is Zero), AMDGPU will use new address space mapping where generic address space is 0 and private address space is 5. The data layout is also changed correspondingly. Differential Revision: https://reviews.llvm.org/D31210 llvm-svn: 298767	2017-03-25 03:46:25 +00:00
Konstantin Zhuravlyov	9c1e310c16	Fix array sizes where address space is not yet known For variables in generic address spaces, for example: ``` unsigned char V[6442450944]; ... ``` the address space is not yet known when we get into getConstantArrayType, it is 0. AMDGCN target's address space 0 has 32 bits pointers, so when we call getPointerWidth with 0, the array size is trimmed to 32 bits, which is not right. Differential Revision: https://reviews.llvm.org/D30845 llvm-svn: 298420	2017-03-21 18:55:39 +00:00
Egor Churaev	c217f37cb6	[OpenCL] Added implicit conversion rank for overloading functions with vector data type in OpenCL Summary: I added a new rank to ImplicitConversionRank enum to resolve the function overload ambiguity with vector types. Rank of scalar types conversion is lower than vector splat. So, we can choose which function we should call. See test for more details. Reviewers: Anastasia, cfe-commits Reviewed By: Anastasia Subscribers: bader, yaxunl Differential Revision: https://reviews.llvm.org/D30816 llvm-svn: 298366	2017-03-21 12:55:55 +00:00
Matt Arsenault	bf5e3e4391	AMDGPU: Make 0 the private nullptr value We can't actually pretend that 0 is valid for address space 0. r295877 added a workaround to stop allocating user objects there, so we can use 0 as the invalid pointer. Some of the tests seemed to be using private as the non-0 null test address space, so add copies using local to make sure this is still stressed. llvm-svn: 297659	2017-03-13 19:47:53 +00:00
Yaxun Liu	4d86799219	[AMDGPU] Add builtin functions readlane ds_permute mov_dpp Differential Revision: https://reviews.llvm.org/D30551 llvm-svn: 297436	2017-03-10 01:30:46 +00:00
Konstantin Zhuravlyov	2b4917fcc9	[DebugInfo] Append extended dereferencing mechanism to variables' DIExpression for targets that support more than one address space Differential Revision: https://reviews.llvm.org/D29673 llvm-svn: 297397	2017-03-09 18:06:23 +00:00
Konstantin Zhuravlyov	d1ba16e762	[DebugInfo] Add address space when creating DIDerivedTypes Differential Revision: https://reviews.llvm.org/D29671 llvm-svn: 297321	2017-03-08 23:56:48 +00:00
Jan Vesely	9488560bb8	AMDGPU: export s_sendmsg{halt} intrinsics Differential Revision: https://reviews.llvm.org/D30366 llvm-svn: 296241	2017-02-25 04:20:24 +00:00
Jan Vesely	c255097517	AMDGPU: export l1 cache invalidation intrinsics Differential Revision: https://reviews.llvm.org/D30360 llvm-svn: 296240	2017-02-25 04:20:22 +00:00
Jan Vesely	d26dbb389f	AMDGPU: export s_waitcnt builtin Differential Revision: https://reviews.llvm.org/D30359 llvm-svn: 296239	2017-02-25 04:20:20 +00:00
Matt Arsenault	a0c6dca15b	AMDGPU: Add fmed3 half builtin llvm-svn: 295874	2017-02-22 20:55:59 +00:00
Jan Vesely	a6f369c727	[OpenCL] r600 needs OpenCL kernel calling convention Differential Revision: https://reviews.llvm.org/D30236 llvm-svn: 295843	2017-02-22 15:01:42 +00:00
Anastasia Stulova	58984e7087	[OpenCL] Correct ndrange_t implementation Removed ndrange_t as Clang builtin type and added as a struct type in the OpenCL header. Use type name to do the Sema checking in enqueue_kernel and modify IR generation accordingly. Review: D28058 Patch by Dmitry Borisenkov! llvm-svn: 295311	2017-02-16 12:27:47 +00:00
Matt Arsenault	77ce553891	AMDGPU: Add a test checking alignments of emitted globals/allocas Make sure the spec required type alignments are used in preparation for a possible change which may break this. llvm-svn: 294278	2017-02-07 04:28:02 +00:00
Matt Arsenault	a274b209f5	AMDGPU: Add builtin for fmed3 intrinsic llvm-svn: 293600	2017-01-31 03:42:07 +00:00
Anastasia Stulova	af0a7bbbe2	[OpenCL] Add missing address spaces in IR generation of blocks Modify ObjC blocks impl wrt address spaces as follows: - keep default private address space for blocks generated as local variables (with captures); - add global address space for global block literals (no captures); - make the block invoke function and enqueue_kernel prototype with the generic AS block pointer parameter to accommodate both private and global AS cases from above; - add block handling into default AS because it's implemented as a special pointer type (BlockPointer) in the frontend and therefore it is used as a pointer everywhere. This is also needed to accommodate both private and global AS blocks for the two cases above. - removes ObjC RT specific symbols (NSConcreteStackBlock and NSConcreteGlobalBlock) in the OpenCL mode. Review: https://reviews.llvm.org/D28814 llvm-svn: 293286	2017-01-27 15:11:34 +00:00
Matt Arsenault	09cca093a3	AMDGPU: Update for changed subtarget feature name llvm-svn: 292838	2017-01-23 22:31:14 +00:00
Matt Arsenault	24b5ae4497	AMDGPU: Add builtin for getreg intrinsic llvm-svn: 292636	2017-01-20 19:24:22 +00:00
Egor Churaev	28f00aab73	[OpenCL] Align fake address space map with the SPIR target maps. Summary: We compile user opencl kernel code with spir triple. But built-ins are written in OpenCL and we compile it with triple x86_64 to be able to use x86 intrinsics. And we need address spaces to match in both cases. So, we change fake address space map in OpenCL for matching with spir. On CPU address spaces are not really important but we'd like to preserve address space information in order to perform optimizations relying on this info like enhanced alias analysis. Reviewers: pekka.jaaskelainen, Anastasia Subscribers: pekka.jaaskelainen, yaxunl, bader, cfe-commits Differential Revision: https://reviews.llvm.org/D28048 llvm-svn: 290436	2016-12-23 16:11:25 +00:00
Egor Churaev	89831421af	Fix problems in "[OpenCL] Enabling the usage of CLK_NULL_QUEUE as compare operand." Summary: Fixed warnings in commit: https://reviews.llvm.org/rL290171 Reviewers: djasper, Anastasia Subscribers: yaxunl, cfe-commits, bader Differential Revision: https://reviews.llvm.org/D27981 llvm-svn: 290431	2016-12-23 14:55:49 +00:00
Chandler Carruth	fcd33149b4	Clean up the handling of noinline function attributes, -fno-inline, -fno-inline-functions, -O0, and optnone. These were really, really tangled together: - We used the noinline LLVM attribute for -fno-inline - But not for -fno-inline-functions (breaking LTO) - But we did use it for -finline-hint-functions (yay, LTO is happy!) - But we didn't for -O0 (LTO is sad yet again...) - We had weird structuring of CodeGenOpts with both an inlining enumeration and a boolean. They interacted in weird ways and needlessly. - A lot of set smashing went on with setting these, and then got worse when we considered optnone and other inlining-affecting attributes. - A bunch of inline affecting attributes were managed in a completely different place from -fno-inline. - Even with -fno-inline we failed to put the LLVM noinline attribute onto many generated function definitions because they didn't show up as AST-level functions. - If you passed -O0 but -finline-functions we would run the normal inliner pass in LLVM despite it being in the O0 pipeline, which really doesn't make much sense. - Lastly, we used things like '-fno-inline' to manipulate the pass pipeline which forced the pass pipeline to be much more parameterizable than it really needs to be. Instead we can just use the optimization level to select a pipeline and control the rest via attributes. Sadly, this causes a bunch of churn in tests because we don't run the optimizer in the tests and check the contents of attribute sets. It would be awesome if attribute sets were a bit more FileCheck friendly, but oh well. I think this is a significant improvement and should remove the semantic need to change what inliner pass we run in order to comply with the requested inlining semantics by relying completely on attributes. It also cleans up the optnone and related handling a bit. One unfortunate aspect of this is that for generating alwaysinline routines like those in OpenMP we end up removing noinline and then adding alwaysinline. I tried a bunch of other approaches, but because we recompute function attributes from scratch and don't have a declaration here I couldn't find anything substantially cleaner than this. Differential Revision: https://reviews.llvm.org/D28053 llvm-svn: 290398	2016-12-23 01:24:49 +00:00
George Burgess IV	e37633713d	Add the alloc_size attribute to clang, attempt 2. This is a recommit of r290149, which was reverted in r290169 due to msan failures. msan was failing because we were calling `isMostDerivedAnUnsizedArray` on an invalid designator, which caused us to read uninitialized memory. To fix this, the logic of the caller of said function was simplified, and we now have a `!Invalid` assert in `isMostDerivedAnUnsizedArray`, so we can catch this particular bug more easily in the future. Fingers crossed that this patch sticks this time. :) Original commit message: This patch does three things: - Gives us the alloc_size attribute in clang, which lets us infer the number of bytes handed back to us by malloc/realloc/calloc/any user functions that act in a similar manner. - Teaches our constexpr evaluator that evaluating some `const` variables is OK sometimes. This is why we have a change in test/SemaCXX/constant-expression-cxx11.cpp and other seemingly unrelated tests. Richard Smith okay'ed this idea some time ago in person. - Uniques some Blocks in CodeGen, which was reviewed separately at D26410. Lack of uniquing only really shows up as a problem when combined with our new eagerness in the face of const. llvm-svn: 290297	2016-12-22 02:50:20 +00:00
Daniel Jasper	9068938eb0	Revert "[OpenCL] Enabling the usage of CLK_NULL_QUEUE as compare operand." This reverts commit r290171. It triggers a bunch of warnings, because the new enumerator isn't handled in all switches. We want a warning-free build. Replied on the commit with more details. llvm-svn: 290173	2016-12-20 10:05:04 +00:00
Egor Churaev	67c3f3ec68	[OpenCL] Enabling the usage of CLK_NULL_QUEUE as compare operand. Summary: Enabling the comparison of CLK_NULL_QUEUE to variable of type queue_t. Reviewers: Anastasia Subscribers: cfe-commits, yaxunl, bader Differential Revision: https://reviews.llvm.org/D27569 llvm-svn: 290171	2016-12-20 09:15:21 +00:00
Chandler Carruth	d7738fe6ad	Revert r290149: Add the alloc_size attribute to clang. This commit fails MSan when running test/CodeGen/object-size.c in a confusing way. After some discussion with George, it isn't really clear what is going on here. We can make the MSan failure go away by testing for the invalid bit, but why things are invalid isn't clear. And yet, other code in the surrounding area is doing precisely this and testing for invalid. George is going to take a closer look at this to better understand the nature of the failure and recommit it, for now backing it out to clean up MSan builds. llvm-svn: 290169	2016-12-20 08:28:19 +00:00
George Burgess IV	a747027bc6	Add the alloc_size attribute to clang. This patch does three things: - Gives us the alloc_size attribute in clang, which lets us infer the number of bytes handed back to us by malloc/realloc/calloc/any user functions that act in a similar manner. - Teaches our constexpr evaluator that evaluating some `const` variables is OK sometimes. This is why we have a change in test/SemaCXX/constant-expression-cxx11.cpp and other seemingly unrelated tests. Richard Smith okay'ed this idea some time ago in person. - Uniques some Blocks in CodeGen, which was reviewed separately at D26410. Lack of uniquing only really shows up as a problem when combined with our new eagerness in the face of const. Differential Revision: https://reviews.llvm.org/D14274 llvm-svn: 290149	2016-12-20 01:05:42 +00:00
Yaxun Liu	5b74665a41	Recommit r289979 [OpenCL] Allow disabling types and declarations associated with extensions Fixed undefined behavior due to cast integer to bool in initializer list. llvm-svn: 290056	2016-12-18 05:18:55 +00:00
Yaxun Liu	35f6d66b0d	Revert r289979 due to regressions llvm-svn: 289991	2016-12-16 21:23:55 +00:00
Yaxun Liu	2e8331cab6	[OpenCL] Allow disabling types and declarations associated with extensions Added a map to associate types and declarations with extensions. Refactored existing diagnostic for disabled types associated with extensions and extended it to declarations for generic situation. Fixed some bugs for types associated with extensions. Allow users to use pragma to declare types and functions for supported extensions, e.g. #pragma OPENCL EXTENSION the_new_extension_name : begin // declare types and functions associated with the extension here #pragma OPENCL EXTENSION the_new_extension_name : end Differential Revision: https://reviews.llvm.org/D21698 llvm-svn: 289979	2016-12-16 19:22:08 +00:00
Yaxun Liu	402804b6d6	Re-commit r289252 and r289285, and fix PR31374 llvm-svn: 289787	2016-12-15 08:09:08 +00:00
Nico Weber	7849eeb035	Revert 289252 (and follow-up 289285), it caused PR31374 llvm-svn: 289713	2016-12-14 21:38:18 +00:00
Neil Hickey	c881be1c23	Fixing build failure by adding triple option to new test condition. Adding -triple option to ensure target supports double for fpmath test. llvm-svn: 289552	2016-12-13 17:04:33 +00:00
Neil Hickey	88c0fac534	Improve handling of floating point literals in OpenCL to only use double precision if the target supports fp64. This change makes sure single-precision floating point types are used if the cl_fp64 extension is not supported by the target. Also removed the check to see whether the OpenCL version is >= 1.2, as this has been incorporated into the extension setting code. Differential Revision: https://reviews.llvm.org/D24235 llvm-svn: 289544	2016-12-13 16:22:50 +00:00
Egor Churaev	24939d479e	[OpenCL] Enable unroll hint for OpenCL 1.x. Summary: Although the feature was introduced only in OpenCL C v2.0 spec., it's useful for OpenCL 1.x too and doesn't require HW support. Reviewers: Anastasia Subscribers: yaxunl, cfe-commits, bader Differential Revision: https://reviews.llvm.org/D27453 llvm-svn: 289535	2016-12-13 14:02:35 +00:00
Yaxun Liu	8f66b4b44a	Add support for non-zero null pointer for C and OpenCL In amdgcn target, null pointers in global, constant, and generic address space take value 0 but null pointers in private and local address space take value -1. Currently LLVM assumes all null pointers take value 0, which results in incorrectly translated IR. To work around this issue, instead of emitting null pointers in local and private address space, a null pointer in generic address space is emitted and cast to local and private address space. Tentative definition of global variables with non-zero initializer will have weak linkage instead of common linkage since common linkage requires zero initializer and does not have explicit section to hold the non-zero value. Virtual member functions getNullPointer and performAddrSpaceCast are added to TargetCodeGenInfo, which by default return ConstantPointerNull and emit an addrspacecast instruction. A virtual member function getNullPointerValue is added to TargetInfo which by default returns 0. Each target can override these virtual functions to get target specific null pointer and the null pointer value for specific address space, and perform specific translations for addrspacecast. Wrapper function getNullPointer is added to CodegenModule and getTargetNullPointerValue is added to ASTContext to facilitate getting the target specific null pointers and their values. This change has no effect on other targets except amdgcn target. Other targets can provide support of non-zero null pointer in a similar way. This change only provides support for non-zero null pointer for C and OpenCL. Support for other languages will be added later incrementally. Differential Revision: https://reviews.llvm.org/D26196 llvm-svn: 289252	2016-12-09 19:01:11 +00:00
Alexey Bader	a60db59d6f	[OpenCL] Added a LIT test for ensuring address space mangling is done the same both in OpenCL1.2 and OpenCL2.0. Patch by Egor Churaev (echuraev). Reviewers: Anastasia Subscribers: yaxunl, cfe-commits, bader Differential Revision: https://reviews.llvm.org/D27403 llvm-svn: 288891	2016-12-07 08:43:49 +00:00
Alexey Bader	b3190829e5	[OpenCL] Fix SPIR version generation. Patch by Egor Churaev (echuraev). Reviewers: Anastasia Subscribers: bader, yaxunl, cfe-commits Differential Revision: https://reviews.llvm.org/D27300 llvm-svn: 288890	2016-12-07 08:38:24 +00:00
Anastasia Stulova	e4a1c38109	[OpenCL] Prevent generation of globals in non-constant AS for OpenCL. Avoid using shortcut for const qualified non-constant address space aggregate variables while generating them on the stack such that the alloca object is used instead of a global variable containing initializer. Review: https://reviews.llvm.org/D27109 llvm-svn: 288163	2016-11-29 17:01:19 +00:00
Konstantin Zhuravlyov	62ae8f671c	[AMDGPU] Change frexp.exp builtin to return i16 for f16 input Differential Revision: https://reviews.llvm.org/D26863 llvm-svn: 287390	2016-11-18 22:31:51 +00:00
Stanislav Mekhanoshin	cd433d2811	[AMDGPU] Add wave barrier builtin The wave barrier represents the discardable barrier. Its main purpose is to carry convergent attribute, thus preventing illegal CFG optimizations. All lanes in a wave come to convergence point simultaneously with SIMT, thus no special instruction is needed in the ISA. The barrier is discarded during code generation. Differential Revision: https://reviews.llvm.org/D26584 llvm-svn: 287006	2016-11-15 18:58:03 +00:00
Anastasia Stulova	0df4ac3f94	[OpenCL] Fix for integer parameters of enqueue_kernel Make handling integer parameters more flexible: - For the number of events argument, allow passing integers larger than 32 bits as long as the compiler can prove that the range fits in 32 bits. If not, the diagnostic will be given. - Change type of the arguments specifying the sizes of the corresponding block arguments to be size_t. Review: https://reviews.llvm.org/D26509 llvm-svn: 286849	2016-11-14 17:39:58 +00:00
Anastasia Stulova	2b46120a09	[OpenCL] Change to clk_event parameter in enqueue_kernel. - Accept NULL pointer as a valid parameter value for clk_event. - Generate clk_event_t arguments of internal __enqueue_kernel_XXX function as pointers in generic address space. Review: https://reviews.llvm.org/D26507 llvm-svn: 286836	2016-11-14 15:34:01 +00:00
Pekka Jaaskelainen	5136dd81ad	Fix r286819 (accidentally patched multiple times). llvm-svn: 286821	2016-11-14 13:14:38 +00:00
Pekka Jaaskelainen	2a1cc587bf	[OpenCL] always use SPIR address spaces for kernel_arg_addr_space MD It doesn't make sense to use the target's address space ids in this context as this is metadata that should be referring to the "logical" OpenCL address spaces. For flat AS machines like all "CPUs" in general, the logical AS info gets lost as there's only one address space (0). This commit changes the logic such that we always use the SPIR address space ids for the argument metadata. It thus allows implementing the clGetKernelArgInfo() and the other detection needs. https://reviews.llvm.org/D26157 llvm-svn: 286819	2016-11-14 13:08:30 +00:00
Renato Golin	6a051ba614	Revert "Improve handling of floating point literals in OpenCL to only use double precision if the target supports fp64." This reverts commit r286815, as it broke all ARM and AArch64 bots. llvm-svn: 286818	2016-11-14 12:19:18 +00:00
Neil Hickey	f603672b5c	Improve handling of floating point literals in OpenCL to only use double precision if the target supports fp64. This change makes sure single-precision floating point types are used if the cl_fp64 extension is not supported by the target. Also removed the check to see whether the OpenCL version is >= 1.2, as this has been incorporated into the extension setting code. Differential Revision: https://reviews.llvm.org/D24235 llvm-svn: 286815	2016-11-14 11:15:51 +00:00
Konstantin Zhuravlyov	81a78bb864	[AMDGPU] Add f16 builtin functions (VI+) Differential Revision: https://reviews.llvm.org/D26476 llvm-svn: 286741	2016-11-13 02:37:05 +00:00
NAKAMURA Takumi	5a8949caa2	clang/test/CodeGenOpenCL/convergent.cl: Satisfy -Asserts with "opt -instnamer". llvm-svn: 285733	2016-11-01 20:08:17 +00:00
Yaxun Liu	7d07ae7c85	[OpenCL] Mark group functions as convergent in opencl-c.h Certain OpenCL builtin functions are supposed to be executed by all threads in a work group or sub group. Such functions should not be made divergent during transformation. It makes sense to mark them with convergent attribute. The adding of convergent attribute is based on Ettore Speziale's work and the original proposal and patch can be found at https://www.mail-archive.com/cfe-commits@lists.llvm.org/msg22271.html. Differential Revision: https://reviews.llvm.org/D25343 llvm-svn: 285725	2016-11-01 18:45:32 +00:00
Alexey Bader	abdcfc1809	[OpenCL] Setting constant address space for array initializers Summary: Setting constant address space for global constants used for memcpy-initialization of arrays. Patch by Alexey Sotkin. Reviewers: bader, yaxunl, Anastasia Subscribers: cfe-commits, AlexeySotkin Differential Revision: https://reviews.llvm.org/D25305 llvm-svn: 285557	2016-10-31 10:26:31 +00:00
Yaxun Liu	a91da4ba47	[OpenCL] Allow partial initializer for array and struct Currently Clang allows partial initializer for C99 but not for OpenCL, e.g. float a[16][16] = {1.0f, 2.0f}; is allowed in C99 but not allowed in OpenCL. This patch fixes that. Differential Revision: https://reviews.llvm.org/D25335 llvm-svn: 283891	2016-10-11 15:53:28 +00:00
Yaxun Liu	ea6b796e0e	[OpenCL] Fix bug in __builtin_astype causing invalid LLVM cast instructions __builtin_astype is used to cast OpenCL opaque types to other types, as such, it needs to be able to handle casting from and to pointer types correctly. Currently it cannot handle 1) casting between pointers of different addr spaces 2) casting between pointer type and non-pointer types. This patch fixes that. Differential Revision: https://reviews.llvm.org/D25123 llvm-svn: 283114	2016-10-03 14:41:50 +00:00
Konstantin Zhuravlyov	5b48d725a0	[AMDGPU] Expose flat work group size, register and wave control attributes __attribute__((amdgpu_flat_work_group_size(<min>, <max>))) - request minimum and maximum flat work group size __attribute__((amdgpu_waves_per_eu(<min>[, <max>]))) - request minimum and/or maximum waves per execution unit Differential Revision: https://reviews.llvm.org/D24513 llvm-svn: 282371	2016-09-26 01:02:57 +00:00
Alexey Bader	465c18973d	[OpenCL] Augment pipe built-ins with pipe packet size and alignment. Reviewers: Anastasia, vpykhtin Subscribers: dmitry, cfe-commits Differential Revision: https://reviews.llvm.org/D23992 llvm-svn: 282252	2016-09-23 14:20:00 +00:00
Neil Hickey	eb62b17d8f	Reverting r281714 due to causing an assert when calling builtins that expect a double, from CL llvm-svn: 281899	2016-09-19 11:42:14 +00:00
Neil Hickey	ddfb093b72	Improve handling of floating point literals in OpenCL to only use double precision if the target supports fp64 https://reviews.llvm.org/D24235 llvm-svn: 281714	2016-09-16 10:15:06 +00:00
Yaxun Liu	d3e85b98be	AMDGPU: Fix target options fp32/64-denormals Fix target options for fp32/64-denormals so that +fp64-denormals is set if fp64 is supported -fp32-denormals if fp32 denormals is not supported, or -cl-denorms-are-zero is set +fp32-denormals if fp32 denormals is supported and -cl-denorms-are-zero is not set If target feature fp32/64-denormals is explicitly set, they will override default options and options deduced from -cl-denorms-are-zero. Differential Revision: https://reviews.llvm.org/D24512 llvm-svn: 281357	2016-09-13 17:37:09 +00:00
Alexey Bader	af17c7959e	[OpenCL] Fix pipe built-in functions return type. By default return type of call expressions calling built-in functions is set to bool. Fixes https://llvm.org/bugs/show_bug.cgi?id=30219. Reviewers: Anastasia Subscribers: dmitry, cfe-commits, yaxunl Differential Revision: https://reviews.llvm.org/D24136 llvm-svn: 280800	2016-09-07 10:32:03 +00:00
Alexey Bader	3e0b817b91	[OpenCL] Remove access qualifiers on images in arg info metadata. Summary: Remove access qualifiers on images in arg info metadata: * kernel_arg_type * kernel_arg_base_type Image access qualifiers are inseparable from type in clang implementation, but OpenCL spec provides a special query to get access qualifier via clGetKernelArgInfo with CL_KERNEL_ARG_ACCESS_QUALIFIER. Besides that OpenCL conformance test_api get_kernel_arg_info expects image types without access qualifier. Patch by Evgeniy Tyurin. Reviewers: bader, yaxunl, Anastasia Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23915 llvm-svn: 280699	2016-09-06 10:10:28 +00:00
Matt Arsenault	88d7da01ca	AMDGPU: Handle structs directly in AMDGPUABIInfo Structs are currently handled as pointer + byval, which makes AMDGPU LLVM backend generate incorrect code when structs are used. This patch changes struct argument to be handled directly and without flattening, which Clover (Mesa 3D Gallium OpenCL state tracker) will be able to handle. Flattening would expand the struct to individual elements and pass each as a separate argument, which Clover cannot handle. Furthermore, such expansion does not fit the OpenCL programming model, which requires explicitly specifying each argument index, size and memory location. Patch by Vedran Miletić llvm-svn: 279463	2016-08-22 19:25:59 +00:00
Valery Pykhtin	4b5d9d16d3	[AMDGPU] add s_incperflevel/s_decperflevel builtins Differential revision: https://reviews.llvm.org/D23668 llvm-svn: 279235	2016-08-19 12:54:31 +00:00
Yaxun Liu	26f7566ff8	Re-commit [OpenCL] AMDGCN: Fix size_t type There was a premature cast to pointer type in emitPointerArithmetic which caused assertion in tests with assertion enabled. llvm-svn: 279206	2016-08-19 05:17:25 +00:00
Changpeng Fang	03bdd8f797	AMDGPU: Add clang builtin for ds_swizzle. Summary: int __builtin_amdgcn_ds_swizzle (int a, int imm); while imm is a constant. Differential Revision: http://reviews.llvm.org/D23682 llvm-svn: 279165	2016-08-18 22:04:54 +00:00
Yaxun Liu	dea5ccb04b	Revert [OpenCL] AMDGCN: Fix size_t type due to regressions in test/CodeGen/exprs.c on certain platforms. llvm-svn: 279127	2016-08-18 20:01:06 +00:00
Yaxun Liu	6305f8a351	[OpenCL] AMDGCN: Fix size_t type Pointers of certain GPUs in AMDGCN target in private address space is 32 bit but pointers in other address spaces are 64 bit. size_t type should be defined as 64 bit for these GPUs so that it could hold pointers in all address spaces. Also fixed issues in pointer arithmetic codegen by using pointer specific intptr type. Differential Revision: https://reviews.llvm.org/D23361 llvm-svn: 279121	2016-08-18 19:34:04 +00:00
Joey Gouly	b95e36027f	[OpenCL] Fix typo in test that I accidentally introduced in my previous commit. llvm-svn: 278235	2016-08-10 16:04:14 +00:00
Joey Gouly	ddbda40245	[OpenCL] Change block descriptor address space to constant. The block descriptor is a GlobalVariable in the LLVM IR, so it shouldn't be in the private address space. llvm-svn: 278234	2016-08-10 15:57:02 +00:00
Yaxun Liu	ffb60901fe	[OpenCL] Handle -cl-fp32-correctly-rounded-divide-sqrt Let the driver pass the option to frontend. Do not set precision metadata for division instructions when this option is set. Set function attribute "correctly-rounded-divide-sqrt-fp-math" based on this option. Differential Revision: https://reviews.llvm.org/D22940 llvm-svn: 278155	2016-08-09 20:10:18 +00:00
Yaxun Liu	2c17e82bc7	[OpenCL][AMDGPU] Add support for -cl-denorms-are-zero Adjust target features for amdgcn target when -cl-denorms-are-zero is set. Denormal support is controlled by feature strings fp32-denormals fp64-denormals in amdgcn target. If -cl-denorms-are-zero is not set and the command line does not set fp32/64-denormals feature string, +fp32-denormals +fp64-denormals will be on for GPU's supporting them. A new virtual function virtual void TargetInfo::adjustTargetOptions(const CodeGenOptions &CGOpts, TargetOptions &TargetOpts) const is introduced to allow adjusting target option by codegen option. Differential Revision: https://reviews.llvm.org/D22815 llvm-svn: 278151	2016-08-09 19:43:38 +00:00
Wei Ding	91c8450967	AMDGPU : Add Clang builtin intrinsics for compare with the full wavefront result. Differential Revision: http://reviews.llvm.org/D22934 llvm-svn: 277824	2016-08-05 15:38:46 +00:00
Yaxun Liu	c8acb4f37b	[OpenCL] Add the lit test for image size which was omitted by r277647. llvm-svn: 277756	2016-08-04 19:35:17 +00:00
Alexey Bader	d81623261a	[OpenCL] Added underscores to the names of 'to_addr' OpenCL built-ins. Summary: In order to re-define OpenCL built-in functions 'to_{private,local,global}' in OpenCL run-time library LLVM names must be different from the clang built-in function names. Reviewers: yaxunl, Anastasia Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23120 llvm-svn: 277743	2016-08-04 18:06:27 +00:00
Yaxun Liu	0bc4b2d337	[OpenCL] Generate opaque type for sampler_t and function call for the initializer Currently Clang use int32 to represent sampler_t, which have been a source of issue for some backends, because in some backends sampler_t cannot be represented by int32. They have to depend on kernel argument metadata and use IPA to find the sampler arguments and global variables and transform them to target specific sampler type. This patch uses opaque pointer type opencl.sampler_t* for sampler_t. For each use of file-scope sampler variable, it generates a function call of __translate_sampler_initializer. For each initialization of function-scope sampler variable, it generates a function call of __translate_sampler_initializer. Each builtin library can implement its own __translate_sampler_initializer(). Since the real sampler type tends to be architecture dependent, allowing it to be initialized by a library function simplifies backend design. A typical implementation of __translate_sampler_initializer could be a table lookup of real sampler literal values. Since its argument is always a literal, the returned pointer is known at compile time and easily optimized to finally become some literal values directly put into image read instructions. This patch is partially based on Alexey Sotkin's work in Khronos Clang (`3d4eec6162`). Differential Revision: https://reviews.llvm.org/D21567 llvm-svn: 277024	2016-07-28 19:26:30 +00:00
Yaxun Liu	37ceedeabd	[OpenCL] AMDGCN target will generate images in constant address space Allows AMDGCN target to generate images (such as %opencl.image2d_t) in constant address space. Images will still be generated in global address space by default. Added tests to existing opencl-types.cl in test\CodeGenOpenCL. Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D22523 llvm-svn: 276161	2016-07-20 19:21:11 +00:00
David Majnemer	24547108d6	Let FuncAttrs infer the 'returned' argument attribute This reverts commit r275756. llvm-svn: 276014	2016-07-19 19:59:24 +00:00
Yaxun Liu	f2e8ab2566	[OpenCL] Fixes bug of missing OCL version metadata on the AMDGCN target Added the opencl.ocl.version metadata to be emitted with amdgcn. Created a static function emitOCLVerMD which is shared between triple spir and target amdgcn. Also added new testcases to existing test file, spir_version.cl inside test/CodeGenOpenCL. Patch by Aaron En Ye Shi. Differential Revision: https://reviews.llvm.org/D22424 llvm-svn: 276010	2016-07-19 19:39:45 +00:00
NAKAMURA Takumi	966bde50c3	Revert r275678, "Revert "Revert r275027 - Let FuncAttrs infer the 'returned' argument attribute"" This reverts also r275029, "Update Clang tests after adding inference for the returned argument attribute" It broke LTO build. Seems miscompilation. llvm-svn: 275756	2016-07-18 03:23:25 +00:00
Hal Finkel	81cdef31e6	Revert "Revert r275029 - Update Clang tests after adding inference for the returned argument attribute" This reverts commit r275043 after reapplying the underlying LLVM commit. llvm-svn: 275679	2016-07-16 07:22:09 +00:00
Matt Arsenault	c7536a5d60	AMDGPU: Remove legacy ldexp builtin llvm-svn: 275623	2016-07-15 21:33:06 +00:00
Matt Arsenault	c86671da09	AMDGPU: Update for rsq intrinsic changes llvm-svn: 275622	2016-07-15 21:33:02 +00:00
Wei Ding	ea41f356bb	AMDGPU: Add Clang Builtin for v_lerp_u8 Differential Revision: http://reviews.llvm.org/D22380 llvm-svn: 275577	2016-07-15 16:43:03 +00:00
Alexey Bader	10e9e59898	[OpenCL] Fix code generation of kernel pipe parameters. Improved test with user define structure pipe type case. Reviewers: Anastasia, pxli168 Subscribers: yaxunl, cfe-commits Differential revision: http://reviews.llvm.org/D21744 llvm-svn: 275259	2016-07-13 10:28:13 +00:00
Hal Finkel	9a17d7ac6e	Revert r275029 - Update Clang tests after adding inference for the returned argument attribute The associated backend change is causing miscompiles from the AArch64 backend. llvm-svn: 275043	2016-07-11 04:52:07 +00:00
Jan Vesely	d7e03a5bd9	AMDGPU: Export workitem builtins Reviewers: tstellardAMD Differential Revision: http://reviews.llvm.org/D20299 llvm-svn: 275030	2016-07-10 22:38:04 +00:00
Hal Finkel	617c962752	Update Clang tests after adding inference for the returned argument attribute Adjusting tests after r275027. llvm-svn: 275029	2016-07-10 22:26:52 +00:00
Yaxun Liu	79c99fb7eb	[OpenCL] Add missing -cl-no-signed-zeros option into driver Add OCL option -cl-no-signed-zeros to driver options. Also added to opencl.cl testcases. Patch by Aaron En Ye Shi. Differential Revision: http://reviews.llvm.org/D22067 llvm-svn: 274923	2016-07-08 20:28:29 +00:00
Alexey Bader	c813c8113d	[OpenCL] Fix access qualifiers handling for typedefs OpenCL s6.6: "Access qualifier must be used with image object arguments of kernels and of user-defined functions [...] If no qualifier is provided, read_only is assumed". This does not define the behavior for image types used in typedef declaration, but following the spec logic, we should allow access qualifiers specification in typedefs, e.g.: typedef write_only image1d_t img1d_wo; Unlike cv-qualifiers, user cannot add access qualifier to a typedef type, i.e. this is not allowed: typedef image1d_t img1d; // note: previously declared 'read_only' here void foo(write_only img1d im) {} // error: multiple access qualifier Patch by Andrew Savonichev. Reviewers: Anastasia Stulova. Differential revision: http://reviews.llvm.org/D20948 llvm-svn: 274858	2016-07-08 15:34:59 +00:00
Anastasia Stulova	db7a31cce7	[OpenCL] An implementation of device side enqueue (DSE) from OpenCL v2.0 s6.13.17. - Added new Builtins: enqueue_kernel, get_kernel_work_group_size and get_kernel_preferred_work_group_size_multiple. These Builtins use custom check to diagnose parameters of the passed Blocks i. e. variable number of 'local void*' type params, and check different overloads specified in Table 6.31 of OpenCL v2.0. - IR is generated as an internal library call for each OpenCL Builtin, reusing ObjC Block implementation. Review: http://reviews.llvm.org/D20249 llvm-svn: 274540	2016-07-05 11:31:24 +00:00
Nikolay Haustov	8c6538b86d	AMDGPU: Set amdgpu_kernel calling convention for OpenCL kernels. Summary: Change Clang calling convention SpirKernel to OpenCLKernel. Set calling convention OpenCLKernel for amdgcn as well. Add virtual method .getOpenCLKernelCallingConv() to TargetCodeGenInfo and use it to set target calling convention for AMDGPU and SPIR. Update tests. Reviewers: rsmith, tstellarAMD, Anastasia, yaxunl Subscribers: kzhuravl, cfe-commits Differential Revision: http://reviews.llvm.org/D21367 llvm-svn: 274220	2016-06-30 09:06:33 +00:00
Alexey Bader	56fac57b57	[OpenCL] Fix typo in as_type test. Reset astype variable in f6 function to avoid matching with wrong value from f5 function. llvm-svn: 274120	2016-06-29 12:25:58 +00:00
Matt Arsenault	64665bc50d	AMDGPU: Add builtin to read exec mask llvm-svn: 273965	2016-06-28 00:13:17 +00:00
Daniel Sanders	e6ca7b6a6b	Attempt to fix MIPS buildbots after r273425. MIPS has a 'signext' attribute that was causing the check to fail. llvm-svn: 273552	2016-06-23 09:29:38 +00:00
Yaxun Liu	ba28cba882	[OpenCL] Use function metadata to represent kernel attributes This patch uses function metadata to represent reqd_work_group_size, work_group_size_hint and vector_type_hint kernel attributes and kernel argument info. Differential Revision: http://reviews.llvm.org/D20979 llvm-svn: 273425	2016-06-22 14:56:35 +00:00
Peter Collingbourne	bcf909d737	Update clang for D20348 Differential Revision: http://reviews.llvm.org/D20339 llvm-svn: 272710	2016-06-14 21:02:05 +00:00
Yaxun Liu	c564701fbd	[OpenCL] Fix __builtin_astype for vec3 types. __builtin_astype does not generate correct LLVM IR for vec3 types. This patch inserts bitcasts to/from vec4 when necessary in addition to generating vector shuffle. Sema and codegen tests are added. Differential Revision: http://reviews.llvm.org/D20133 llvm-svn: 272153	2016-06-08 15:11:21 +00:00
Matt Arsenault	250024f905	AMDGPU: Verify subtarget specific builtins Cleanup setup of subtarget features. llvm-svn: 272091	2016-06-08 01:56:42 +00:00
Xiuli Pan	244e3f69e4	[OPENCL] Fix wrongly vla error for OpenCL array. Summary: OpenCL should support array with const value size length, those const variable in global and constant address space and variable in constant address space. Fixed test case error. Reviewers: Anastasia, yaxunl, bader Subscribers: bader, cfe-commits Differential Revision: http://reviews.llvm.org/D20090 llvm-svn: 271978	2016-06-07 04:34:00 +00:00
Xiuli Pan	a219552ca8	Revert "[OPENCL] Fix wrongly vla error for OpenCL array." Test case break on system-z. This reverts commit 9a7212e1e87f1396952d74f8c62314a775ccbb1c. llvm-svn: 271975	2016-06-07 03:41:07 +00:00
Xiuli Pan	bdfbaaaefe	[OPENCL] Fix wrongly vla error for OpenCL array. Summary: OpenCL should support array with const value size length, those const variable in global and constant address space and variable in constant address space. Reviewers: Anastasia, yaxunl, bader Subscribers: bader, cfe-commits Differential Revision: http://reviews.llvm.org/D20090 llvm-svn: 271971	2016-06-07 03:13:39 +00:00
Matt Arsenault	2d51059ebb	AMDGPU: Add fract builtin llvm-svn: 271080	2016-05-28 00:43:27 +00:00
Yaxun Liu	f7449a179b	[OpenCL] Add to_{global|local|private} builtin functions. OpenCL builtin functions to_{global|local|private} accept an argument of pointer type to arbitrary pointee type, and return a pointer to the same pointee type in a different addr space, i.e. global gentype *to_global(gentype *p); It is not desirable to declare it as global void *to_global(void *); in opencl header file since it misses diagnostics. This patch implements these builtin functions as Clang builtin functions. In the builtin def file they are defined to have signature void*(void*). When handling call expressions, their declarations are re-written to have correct parameter type and return type corresponding to the call argument. In codegen call to addr void *to_addr(void*) is generated with addrcasts or bitcasts to facilitate implementation in builtin library. Differential Revision: http://reviews.llvm.org/D19932 llvm-svn: 270261	2016-05-20 19:54:38 +00:00
Yaxun Liu	c537c8a72b	[OpenCL] Allow explicit cast of 0 to event_t. Patch by Aaron Enye Shi. Differential Revision: http://reviews.llvm.org/D17578 llvm-svn: 270238	2016-05-20 17:18:16 +00:00
Yaxun Liu	39cf40f6b4	[OpenCL] Add supported OpenCL extensions to target info. Add supported OpenCL extensions to target info. It serves as default values to save the users of the burden setting each supported extensions and optional core features in command line. Re-commit after fixing build error due to missing override attribute. Differential Revision: http://reviews.llvm.org/D19484 llvm-svn: 269670	2016-05-16 17:06:34 +00:00
Yaxun Liu	fa1df45c0d	Revert "[OpenCL] Add supported OpenCL extensions to target info." Revert r269431 due to build failure caused by warning msg: llvm/tools/clang/lib/Basic/Targets.cpp:2090:9: error: 'setSupportedOpenCLOpts' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override] void setSupportedOpenCLOpts() { llvm-svn: 269435	2016-05-13 17:16:26 +00:00
Yaxun Liu	64936ce91d	[OpenCL] Add supported OpenCL extensions to target info. Add supported OpenCL extensions to target info. It serves as default values to save the users of the burden setting each supported extensions and optional core features in command line. Differential Revision: http://reviews.llvm.org/D19484 llvm-svn: 269431	2016-05-13 15:44:37 +00:00
Nikolay Haustov	1771948d72	Revert "AMDGPU/SI: Use amdgpu_kernel calling convention for OpenCL kernels." This reverts commit f7053ec90d0fc56f0837e43c2c759e85b56c21a1. It broke calling OpenCL kernel from another kernel. llvm-svn: 268740	2016-05-06 15:00:51 +00:00
Nikolay Haustov	4961ea85d7	AMDGPU/SI: Use amdgpu_kernel calling convention for OpenCL kernels. Reviewers: tstellarAMD, arsenm Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D19918 llvm-svn: 268718	2016-05-06 09:15:24 +00:00
Yaxun Liu	ab93394a29	[OpenCL] Fix bug in mergeTypes which causes equivalent types treated as different. When comparing unqualified types, canonical types should be used, otherwise equivalent types may be treated as different type. Differential Revision: http://reviews.llvm.org/D19662 llvm-svn: 267906	2016-04-28 17:34:57 +00:00
Matt Arsenault	1dd2752e7a	AMDGPU: Add test for generic builtin behavior llvm-svn: 266383	2016-04-14 22:34:39 +00:00
Yaxun Liu	a1a87adf59	PR19957: [OpenCL] Incorrectly accepts implicit address space conversion with ternary operator. Generates addrspacecast instead of bitcast for ternary operator when necessary, and diagnose ternary operator with incompatible second and third operands. https://llvm.org/bugs/show_bug.cgi?id=19957 Differential Revision: http://reviews.llvm.org/D17412 llvm-svn: 266111	2016-04-12 19:43:36 +00:00
Yaxun Liu	b7b6d0fc66	[OpenCL] Handle AddressSpaceConversion when target address space does not change. In codegen different address spaces may be mapped to the same address space for a target, e.g. in x86/x86-64 all address spaces are mapped to 0. Therefore AddressSpaceConversion should be translated by CreatePointerBitCastOrAddrSpaceCast instead of CreateAddrSpaceCast. Differential Revision: http://reviews.llvm.org/D18713 llvm-svn: 266107	2016-04-12 19:03:49 +00:00
Yaxun Liu	c5cec39c0e	Verify commit right by adding a blank line to test/CodeGenOpenCL/address-spaces-conversions.cl. llvm-svn: 266083	2016-04-12 15:46:24 +00:00
Alexey Bader	954ba21f85	[OpenCL] Complete image types support. I. Current implementation of images is not conformant to spec in the following points: 1. It makes no distinction with respect to access qualifiers and therefore allows to use images with different access type interchangeably. The following code would compile just fine: void write_image(write_only image2d_t img); kernel void foo(read_only image2d_t img) { write_image(img); } // Accepted code which is disallowed according to s6.13.14. 2. It discards access qualifier on generated code, which leads to generated code for the above example: call void @write_image(%opencl.image2d_t* %img); In OpenCL2.0 however we can have different calls into write_image with read_only and write_only images. Also generally following compiler steps have no easy way to take different path depending on the image access: linking to the right implementation of image types, performing IR opts and backend codegen differently. 3. Image types are language keywords and can't be redeclared s6.1.9, which can happen currently as they are just typedef names. 4. Default access qualifier read_only is to be added if not provided explicitly. II. This patch corrects the above points as follows: 1. All images are encapsulated into a separate .def file that is inserted in different points where image handling is required. This avoids a lot of code repetition as all images are handled the same way in the code with no distinction of their exact type. 2. The Cartesian product of image types and image access qualifiers is added to the builtin types. This simplifies a lot handling of access type mismatch as no operations are allowed by default on distinct Builtin types. Also spec intended access qualifier as special type qualifier that are combined with an image type to form a distinct type (see statement above - images can't be created w/o access qualifiers). 3. Improves testing of images in Clang. Author: Anastasia Stulova Reviewers: bader, mgrang. Subscribers: pxli168, pekka.jaaskelainen, yaxunl. Differential Revision: http://reviews.llvm.org/D17821 llvm-svn: 265783	2016-04-08 13:40:33 +00:00
Matt Arsenault	3fb963389e	AMDGPU: Add frexp_mant + frexp_exp builtins llvm-svn: 264960	2016-03-30 22:57:40 +00:00
Xiuli Pan	972bea8a2e	[OpenCL] Add ocl and spir version for spir target Summary: Add opencl.spir.version and opencl.ocl.version metadata for CodeGen to identify OpenCL version. Reviewers: yaxunl, Anastasia Subscribers: cfe-commits, pekka.jaaskelainen Differential Revision: http://reviews.llvm.org/D17596 llvm-svn: 264241	2016-03-24 03:57:17 +00:00
Matt Arsenault	39edcd0e1d	AMDGPU: Add builtins for recently added intrinsics llvm-svn: 262126	2016-02-27 09:54:43 +00:00
Matt Arsenault	b015d623d6	AMDGPU: Fix inconsistent register name for flat_scratch llvm-svn: 262123	2016-02-27 09:06:22 +00:00
Paul Robinson	65ab102be3	Fix Clang tests that used CHECK-NEXT-NOT and CHECK-DAG-NOT. FileCheck actually doesn't support combo suffixes. Differential Revision: http://reviews.llvm.org/D17589 llvm-svn: 262052	2016-02-26 19:34:01 +00:00
Xiuli Pan	379554ac5b	[OpenCL] Add Sema checks for types Summary: Add Sema checks for opencl type: image, pipe.... This patch is partitioned from http://reviews.llvm.org/D16047 Reviewers: Anastasia, yaxunl Subscribers: pekka.jaaskelainen, cfe-commits Differential Revision: http://reviews.llvm.org/D17437 llvm-svn: 261818	2016-02-25 03:34:20 +00:00
Anastasia Stulova	6bdbcbb3d9	[OpenCL] Generate metadata for opencl_unroll_hint attribute Add support for opencl_unroll_hint attribute from OpenCL v2.0 s6.11.5. Reusing most of metadata generation from CGLoopInfo helper class. The code is based on Khronos OpenCL compiler: https://github.com/KhronosGroup/SPIR/tree/spirv-1.0 Patch by Liu Yaxun (Sam)! Differential Revision: http://reviews.llvm.org/D16686 llvm-svn: 261350	2016-02-19 18:30:11 +00:00
Matt Arsenault	9b277b4ad4	AMDGPU: Add sin/cos builtins llvm-svn: 260783	2016-02-13 01:21:09 +00:00
Matt Arsenault	f5c1f47181	AMDGPU: Update builtin for intrinsic change llvm-svn: 260781	2016-02-13 01:03:09 +00:00
Ulrich Weigand	1a0c1804b3	Add target triple to CodeGenOpenCL/pipe_types.cl test case The test is failing on SystemZ since different IR is being generated due to platform ABI differences. Add a target triple. Fix suggested by Anastasia Stulova. llvm-svn: 259183	2016-01-29 10:45:23 +00:00
Matt Arsenault	cf70cb9d00	AMDGPU: Add amdgcn cube builtins llvm-svn: 258794	2016-01-26 06:37:54 +00:00
Xiuli Pan	bb4d8d30b1	Recommit: R258773 [OpenCL] Pipe builtin functions Fix arc patch fuzz error. Summary: Support for the pipe built-in functions for OpenCL 2.0. The pipe builtin functions may have infinite kinds of element types, one approach would be to just generate calls that would always use generic types such as void*. This patch is based on bader's opencl support patch on SPIR-V branch. Reviewers: Anastasia, pekka.jaaskelainen Subscribers: keryell, bader, cfe-commits Differential Revision: http://reviews.llvm.org/D15914 llvm-svn: 258782	2016-01-26 04:03:48 +00:00
David Majnemer	747f168e8d	Revert "[OpenCL] Pipe builtin functions" This reverts commit r258773, it broke the build bots: http://bb.pgr.jp/builders/cmake-clang-x86_64-linux/builds/43853 llvm-svn: 258775	2016-01-26 02:22:31 +00:00
Xiuli Pan	3a9952c9e7	[OpenCL] Pipe builtin functions Summary: Support for the pipe built-in functions for OpenCL 2.0. The pipe builtin functions may have infinite kinds of element types, one approach would be to just generate calls that would always use generic types such as void*. This patch is based on bader's opencl support patch on SPIR-V branch. Reviewers: Anastasia, pekka.jaaskelainen Subscribers: keryell, bader, cfe-commits Differential Revision: http://reviews.llvm.org/D15914 llvm-svn: 258773	2016-01-26 02:06:04 +00:00
Matt Arsenault	721d21b821	AMDGPU: Add barrier builtin llvm-svn: 258564	2016-01-22 21:56:30 +00:00
Matt Arsenault	8a4078c741	AMDGPU: Rename builtins to use amdgcn prefix Keep the ones still used by libclc around for now. Emit the new amdgcn intrinsic name if not targeting r600, in which case the old AMDGPU name is still used. llvm-svn: 258560	2016-01-22 21:30:53 +00:00
George Burgess IV	df1ed0099b	[Bugfix] Fix ICE on constexpr vector splat. In {CG,}ExprConstant.cpp, we weren't treating vector splats properly. This patch makes us treat splats more properly. Additionally, this patch adds a new cast kind which allows a bool->int cast to result in -1 or 0, instead of 1 or 0 (for true and false, respectively), so we can sanely model OpenCL bool->int casts in the AST. Differential Revision: http://reviews.llvm.org/D14877 llvm-svn: 257559	2016-01-13 01:52:39 +00:00
Xiuli Pan	9c14e28211	[OpenCL] Pipe type support Summary: Support for OpenCL 2.0 pipe type. This is a bug-fix version for bader's patch reviews.llvm.org/D14441 Reviewers: pekka.jaaskelainen, Anastasia Subscribers: bader, Anastasia, cfe-commits Differential Revision: http://reviews.llvm.org/D15603 llvm-svn: 257254	2016-01-09 12:53:17 +00:00
Anastasia Stulova	784fb78274	[OpenCL 2.0] Apply default address space (AS). If AS of a variable/parameter declaration is not set by the source, OpenCL v2.0 s6.5 defines explicit rules for default ASes: - The AS of global and local static variables defaults to global; - All pointers point to generic AS. http://reviews.llvm.org/D13168 llvm-svn: 253863	2015-11-23 11:14:44 +00:00
Anastasia Stulova	b02e7835c5	[OpenCL] Fix casting a true boolean to an integer vector. OpenCL v1.1 s6.2.2: for the boolean value true, every bit in the result vector should be set. This change treats the i1 value as signed for the purposes of performing the cast to integer, and therefore sign extend into the result. Patch by Neil Hickey! http://reviews.llvm.org/D13349 llvm-svn: 249301	2015-10-05 11:27:41 +00:00
Simon Pilgrim	4034b9f0b9	Fix invalid shufflevector operands This patch fixes bug 23800 ( https://llvm.org/bugs/show_bug.cgi?id=23800#c2 ). There existed a case where the index operand from extractelement was directly used to create a shufflevector mask. Since the index can be of any integral type but the mask must only contain 32 bit integers a 64 bit index operand led to an assertion error later on. Committed on behalf of mpflanzer (Moritz Pflanzer) Differential Revision: http://reviews.llvm.org/D10838 llvm-svn: 243851	2015-08-02 15:28:10 +00:00
David Blaikie	ea3e51d73f	Account for calling convention specifiers in function definitions in IR test cases Several tests wouldn't pass when executed on an armv7a_pc_linux triple due to the non-default arm_aapcs calling convention produced on the function definitions in the IR output. Account for this with the application of a little regex. Patch by Ying Yi. llvm-svn: 240971	2015-06-29 17:29:50 +00:00
NAKAMURA Takumi	74a0022fde	clang/test/CodeGenOpenCL/opencl_types.cl: Tweak expressions according to r237548. With MS mangler, the signature is: x86: define void @"\01?bad1@@$$J0YAXPAPAUocl_image1d@@PAPAUocl_image2d@@1@Z" (%opencl.image1d_t %b, %opencl.image2d_t %c, %opencl.image2d_t %d) nounwind x64: define void @"\01?bad1@@$$J0YAXPEAPAUocl_image1d@@PEAPAUocl_image2d@@1@Z"(%opencl.image1d_t %b, %opencl.image2d_t %c, %opencl.image2d_t %d) nounwind llvm-svn: 237551	2015-05-18 03:58:27 +00:00
Sanjay Patel	a932bb93b0	Remove the cl-no-signed-zeros cc1 option Use the driver flag -fno-signed-zeros instead. This was recommended but not implemented in D6873: http://reviews.llvm.org/D6873 which was checked in at r226915: http://reviews.llvm.org/rL226915 llvm-svn: 234093	2015-04-04 14:54:24 +00:00
Tom Stellard	b919c7d9eb	Sema: Accept pointers to any address space for builtin functions As long as they don't have an address space explicitly defined. This allows builtins with pointer arguments to be used with OpenCL. llvm-svn: 233706	2015-03-31 16:39:02 +00:00
Ahmed Bougacha	6ba3831ebe	[CodeGen] Support native half inc/dec amounts. We previously defaulted to long double, but it's also possible to have a half inc/dec amount, when LangOpts NativeHalfType is set. Currently, that's only true for OpenCL. llvm-svn: 233135	2015-03-24 23:44:42 +00:00
David Blaikie	bdf40a62a7	Test case updates for explicit type parameter to the gep operator llvm-svn: 232187	2015-03-13 18:21:46 +00:00
Sameer Sahasrabuddhe	a75db66eee	Restores r228382, which was reverted in r228406. The original commit failed to handle "shift assign" (<<=), which broke the test mentioned in r228406. This is now fixed and the test added to the lit tests under SemaOpenCL. * Original commit message from r228382 * OpenCL: handle shift operator with vector operands Introduce a number of checks: 1. If LHS is a scalar, then RHS cannot be a vector. 2. Operands must be of integer type. 3. If both are vectors, then the number of elements must match. Relax the requirement for "usual arithmetic conversions": When LHS is a vector, a scalar RHS can simply be expanded into a vector; OpenCL does not require that its rank be lower than the LHS. For example, the following code is not an error even if the implicit type of the constant literal is "int". char2 foo(char2 v) { return v << 1; } Consolidate existing tests under CodeGenOpenCL, and add more tests under SemaOpenCL. llvm-svn: 230464	2015-02-25 05:48:23 +00:00
Matt Arsenault	b8e668a52f	Revert r229409: "Hack to try deleting file from build bots" llvm-svn: 229411	2015-02-16 18:03:59 +00:00
Matt Arsenault	6f8c4cdcf6	Hack to try deleting file from build bots llvm-svn: 229409	2015-02-16 17:33:12 +00:00
Matt Arsenault	e7f4f86dff	Don't create output file in test llvm-svn: 229407	2015-02-16 17:11:58 +00:00
Matt Arsenault	a03280de75	OpenCL: Accept -cl-strict-aliasing This was in 1.0, but deprecated in 1.1. Accept it and do nothing for compatibility. llvm-svn: 229403	2015-02-16 16:43:13 +00:00
Tom Stellard	96d5dc77fa	Revert "OpenCL: handle shift operator with vector operands" This reverts commit r228382. This breaks the following case: Reported by Jeroen Ketema: http://lists.cs.uiuc.edu/pipermail/cfe-commits/Week-of-Mon-20150202/122961.html typedef __attribute__((ext_vector_type(3))) char char3; void foo() { char3 v = {1,1,1}; char3 w = {1,2,3}; w <<= v; } If I compile with: clang -x cl file.c Then an error is produced: file.c:10:5: error: expression is not assignable w <<= v; ~ ^ 1 error generated. llvm-svn: 228406	2015-02-06 17:30:04 +00:00
Sameer Sahasrabuddhe	c65605d008	OpenCL: handle shift operator with vector operands Introduce a number of checks: 1. If LHS is a scalar, then RHS cannot be a vector. 2. Operands must be of integer type. 3. If both are vectors, then the number of elements must match. Relax the requirement for "usual arithmetic conversions": When LHS is a vector, a scalar RHS can simply be expanded into a vector; OpenCL does not require that its rank be lower than the LHS. For example, the following code is not an error even if the implicit type of the constant literal is "int". char2 foo(char2 v) { return v << 1; } Consolidate existing tests under CodeGenOpenCL, and add more tests under SemaOpenCL. llvm-svn: 228382	2015-02-06 05:44:55 +00:00
Alexander Kornienko	21de0ae3d4	Re-apply "r226548 - Introduce SPIR calling conventions" reverted in r226558. The test was fixed after a discussion with the revision author: the check pattern was made more flexible as the "%call" part is not what we actually want to check strictly there. The original patch description: === Introduce SPIR calling conventions. This implements Section 3.7 from the SPIR 1.2 spec: SPIR kernels should use "spir_kernel" calling convention. Non-kernel functions use "spir_func" calling convention. All other calling conventions are disallowed. The patch works only for OpenCL source. Any other uses will need to ensure that kernels are assigned the spir_kernel calling convention correctly. === llvm-svn: 226561	2015-01-20 11:20:41 +00:00
Alexander Kornienko	22c9d67e34	Reverting r226548 as one of the tests fails in some configurations. Here's the fail log from our internal setup: === .../tools/clang/clang -cc1 -internal-isystem .../tools/clang/staging/include -nostdsysteminc .../tools/clang/test/CodeGenOpenCL/spir-calling-conv.cl -triple spir-unknown-unknown -emit-llvm -o - FileCheck .../tools/clang/test/CodeGenOpenCL/spir-calling-conv.cl .../tools/clang/test/CodeGenOpenCL/spir-calling-conv.cl:11:12: error: expected string not found in input // CHECK: %call = tail call spir_func i32 @get_dummy_id(i32 0) ^ <stdin>:6:52: note: scanning from here define spir_kernel void @foo(i32 addrspace(1)* %A) #0 { ^ <stdin>:7:2: note: possible intended match here %1 = tail call spir_func i32 @get_dummy_id(i32 0) #2 ^ === Here's a failure on a public CI server: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_check/1183/ llvm-svn: 226558	2015-01-20 10:55:33 +00:00
Sameer Sahasrabuddhe	450a58b8af	Introduce SPIR calling conventions. This implements Section 3.7 from the SPIR 1.2 spec: SPIR kernels should use "spir_kernel" calling convention. Non-kernel functions use "spir_func" calling convention. All other calling conventions are disallowed. The patch works only for OpenCL source. Any other uses will need to ensure that kernels are assigned the spir_kernel calling convention correctly. llvm-svn: 226548	2015-01-20 06:44:32 +00:00
Matt Arsenault	6365ffea3e	Add __builtin_amdgpu_class llvm-svn: 225314	2015-01-06 23:14:57 +00:00
Tom Stellard	d8e38a3206	R600: Handle amdgcn triple For now there is no difference between amdgcn and r600. llvm-svn: 225294	2015-01-06 20:34:47 +00:00
Pekka Jaaskelainen	e94b0e1870	Fix an address space id reset with array decay's implicit conversion. The issue was produced with OpenCL C code that called a function with a constant string literal argument. llvm-svn: 224592	2014-12-19 18:04:27 +00:00
Duncan P. N. Exon Smith	b3a66691f8	IR: Make metadata typeless in assembly, clang side Match LLVM changes from r224257. llvm-svn: 224259	2014-12-15 19:10:08 +00:00
Pekka Jaaskelainen	3701450b06	OpenCL C: Add support for a set of floating point arithmetic relaxation flags: -cl-no-signed-zeros -cl-unsafe-math-optimizations -cl-finite-math-only -cl-fast-relaxed-math Propagate the info to FP instruction flags as well as function attributes where they are available. llvm-svn: 223928	2014-12-10 16:41:14 +00:00
Matt Arsenault	43fae6c855	Add attributes for AMDGPU register limits. This is a performance hint that can be applied to kernels to attempt to limit the number of used registers. llvm-svn: 223384	2014-12-04 20:38:18 +00:00
Sameer Sahasrabuddhe	c6093fea03	Always emit kernel arg info for SPIR. http://llvm.org/bugs/show_bug.cgi?id=21555 Currently, kernel argument metadata is omitted unless the "-cl-kernel-arg-info" option is specified. But the SPIR 1.2 spec requires that all metadata except kernel_arg_name should always be emitted, and kernel_arg_name is only emitted when "-cl-kernel-arg-info" is specified. Patch ported by Ryan Burn from the Khronos SPIR generator. https://github.com/KhronosGroup/SPIR llvm-svn: 223340	2014-12-04 05:30:58 +00:00
NAKAMURA Takumi	ddd125a3f9	clang/test/CodeGenOpenCL/opencl_types.cl: Appease i686-msvc. llvm-svn: 222969	2014-11-30 00:32:02 +00:00
NAKAMURA Takumi	c3ee2c5a77	Tweak clang/test/CodeGenOpenCL/opencl_types.cl to appease msvc since r222941. llvm-svn: 222956	2014-11-29 17:27:07 +00:00
David Majnemer	eea02eefe7	AST: Consider pseudo-struct builtin types as substitutable We didn't consider types like ObjCSel as a substitution candidate. This fixes PR21688. llvm-svn: 222941	2014-11-28 22:22:46 +00:00
Tim Northover	8603941057	OpenCL: fix test for lack of names in release builds llvm-svn: 222853	2014-11-26 22:33:04 +00:00
Aaron Ballman	eda0373900	Adding an explicit triple to this test to get it to pass all build bots. llvm-svn: 222837	2014-11-26 16:17:20 +00:00
Anastasia Stulova	5d8ad8a7b8	[OpenCL] Implemented restrictions for pointer conversions specified in OpenCL v2.0. OpenCL v2.0 s6.5.5 restricts conversion of pointers to different address spaces: - the named address spaces (__global, __local, and __private) => __generic - implicitly converted; - __generic => named - with an explicit cast; - named <=> named - disallowed; - __constant <=> any other - disallowed. llvm-svn: 222834	2014-11-26 15:36:41 +00:00
Matt Arsenault	3f6469b4c6	Emit OpenCL local global variables without zeroinitializer Local variables are not initialized, and every target has been (incorrectly) ignoring the unnecessary request for zero initialization. llvm-svn: 221162	2014-11-03 16:51:53 +00:00
NAKAMURA Takumi	729be14435	Prune CRLF. llvm-svn: 220678	2014-10-27 12:37:26 +00:00
Matt Arsenault	2174a9dc28	R600: Update for div_fmas intrinsic change llvm-svn: 220339	2014-10-21 22:21:41 +00:00
Tom Stellard	ade13b2a4e	OpenCL: Emit global variables in the constant addr space as constant globals llvm-svn: 219929	2014-10-16 15:29:19 +00:00
Tom Stellard	f414fb75b0	Driver: Implement -cl-denorms-are-zero This is currently a no-op, which is allowed by the OpenCL specification. llvm-svn: 216179	2014-08-21 13:58:36 +00:00
Matt Arsenault	dbb84916d9	R600: Add ldexp intrinsic llvm-svn: 215738	2014-08-15 17:44:32 +00:00
Fraser Cormack	dadc371e85	Add OpenCL/SPIR kernel_arg_base_type metadata node As defined in the SPIR 1.2 specification, this node behaves similarly to kernel_arg_type but will print the underlying type name, e.g., without typedefs. Example: typedef unsigned int myunsignedint; would report: 'myunsignedint' in the kernel_arg_type node 'uint' in the kernel_arg_base_type node llvm-svn: 214308	2014-07-30 14:39:53 +00:00
Fraser Cormack	152493b635	Fix OpenCL/SPIR kernel_arg_type metadata node This fixes a bug where kernel_arg_type was always changing 'unsigned ' to 'u' for any parameter type, including non-canonical types. Example: typedef unsigned int myunsignedint; would report: "myunt" llvm-svn: 214305	2014-07-30 13:41:12 +00:00
Matt Arsenault	8587711164	Add codegen for more R600 builtins llvm-svn: 213079	2014-07-15 17:23:46 +00:00
Matt Arsenault	56f008d538	Add R600 builtin codegen. llvm-svn: 211631	2014-06-24 20:45:01 +00:00
Rafael Espindola	df540cbf92	Update for llvm api change. llvm-svn: 210303	2014-06-06 01:20:47 +00:00
Matt Arsenault	10e3ef8d2d	Bug 18567: Fix constantexpr pointer casts with address spaces. Getting a pointer into a struct at a non-zero offset would try to use the default address space. llvm-svn: 206478	2014-04-17 17:45:37 +00:00
Joey Gouly	92a47442f4	When printing types for the OpenCL kernel metadata, use the PrintingPolicy. This allows 'half' to be printed as 'half' and not as '__fp16'. Patch by Fraser Cormack! llvm-svn: 205624	2014-04-04 13:43:57 +00:00
Hans Wennborg	c9bd88e681	Remove the -cxx-abi command-line flag. This makes the C++ ABI depend entirely on the target: MS ABI for -win32 triples, Itanium otherwise. It's no longer possible to do weird combinations. To be able to run a test with a specific ABI without constraining it to a specific triple, new substitutions are added to lit: %itanium_abi_triple and %ms_abi_triple can be used to get the current target triple adjusted to the desired ABI. For example, if the test suite is running with the i686-pc-win32 target, %itanium_abi_triple will expand to i686-pc-mingw32. Differential Revision: http://llvm-reviews.chandlerc.com/D2545 llvm-svn: 199250	2014-01-14 19:35:09 +00:00
Hans Wennborg	9125b08b52	Update tests in preparation for using the MS ABI for Win32 targets In preparation for making the Win32 triple imply MS ABI mode, make all tests pass in this mode, or make them use the Itanium mode explicitly. Differential Revision: http://llvm-reviews.chandlerc.com/D2401 llvm-svn: 199130	2014-01-13 19:48:13 +00:00
Pekka Jaaskelainen	3587b32e1c	The OpenCL specification states that images are allocated from the global address space (6.5.1 of the OpenCL 1.2 specification). This makes clang construct the image arguments in the global address space and generate the argument metadata with the correct address space descriptor. Patch by Pedro Ferreira! llvm-svn: 198868	2014-01-09 13:37:30 +00:00
Joey Gouly	cf4143b55e	Fix a crash in EmitStoreThroughExtVectorComponentLValue for vectors of odd sizes. In OpenCL a vector of 3 elements, acts like a vector of four elements. So for a vector of size 3 the '.hi' and '.odd' accessors, would access the elements {2, 3} and {1, 3} respectively. However, in EmitStoreThroughExtVectorComponentLValue we are still operating on a vector of size 3, so we should only access {2} and {1}. We do this by checking the last element to be accessed, and ignore it if it is out-of-bounds. EmitLoadOfExtVectorElementLValue doesn't have a similar problem, because it does a direct shufflevector with undef, so an out-of-bounds access just gives an undef value. Patch by Anastasia Stulova! llvm-svn: 195367	2013-11-21 17:09:05 +00:00
Joey Gouly	561bba2e9f	[OpenCL] Make sure we put string literals in the constant address space. llvm-svn: 194717	2013-11-14 18:26:10 +00:00
David Tweed	31d09b0cef	Certain multi-platform languages, such as OpenCL, have the concept of address spaces which is both (1) a "semantic" concept and (2) possibly a hardware level restriction. It is desirable to be able to discard/merge the LLVM-level address spaces on arguments for which there is no difference to the current backend while keeping track of the semantic address spaces in a function prototype. To do this enable addition of the address space into the name-mangling process. Add some tests to document this behaviour against inadvertent changes. Patch by Michele Scandale! llvm-svn: 190684	2013-09-13 12:04:22 +00:00
Rafael Espindola	ff7cea8c1a	Don't pass -O0 to clang_cc1, it is the default. llvm-svn: 189910	2013-09-04 04:12:25 +00:00
Stephen Lin	4362261b00	CHECK-LABEL-ify some code gen tests to improve diagnostic experience when tests fail. llvm-svn: 188447	2013-08-15 06:47:53 +00:00
Justin Holewinski	368374308d	Use kernel metadata to differentiate between kernel and device functions for the NVPTX target. llvm-svn: 178418	2013-03-30 14:38:24 +00:00
Guy Benyei	fb36ede52e	Generate metadata to implement the -cl-kernel-arg-info option. OpenCL 1.2 spec. 5.7.3. llvm-svn: 177839	2013-03-24 13:58:12 +00:00
Guy Benyei	3832bfd557	Fix indirect byval passing of records in address spaced memory. Allocate memory on stack, and memcpy the actual value before the call. llvm-svn: 176786	2013-03-10 12:59:00 +00:00
Joey Gouly	aba589cceb	Add support for the OpenCL attribute 'vec_type_hint'. Patch by Murat Bolat! llvm-svn: 176686	2013-03-08 09:42:32 +00:00
Joey Gouly	c975cdcc58	Add a 64-bit triple to these tests, to fix 32-bit bots. llvm-svn: 175736	2013-02-21 13:42:33 +00:00
Joey Gouly	15eeddebdc	Fix an OpenCL test case. Pointer arguments to kernels must be declared with the __global, __constant or __local qualifier. llvm-svn: 175735	2013-02-21 12:06:32 +00:00
Joey Gouly	7d00f00f1d	Add support to Sema and CodeGen for floating point vector types in OpenCL. llvm-svn: 175734	2013-02-21 11:49:56 +00:00
Tanya Lattner	60e93a6390	Use the target address space value when mangling names. llvm-svn: 174688	2013-02-08 01:07:32 +00:00
Guy Benyei	610541989a	Add OpenCL samplers as Clang builtin types and check sampler related restrictions. llvm-svn: 174601	2013-02-07 10:55:47 +00:00
Joey Gouly	dd7f4566b1	Add a new LangOpt NativeHalfType. This option allows for native half/fp16 operations (as opposed to storage only half/fp16). Also add some semantic checks for OpenCL half types. llvm-svn: 173254	2013-01-23 11:56:20 +00:00
Guy Benyei	1b4fb3e08b	Implement OpenCL event_t as Clang builtin type, including event_t related OpenCL restrictions (OpenCL 1.2 spec 6.9) llvm-svn: 172973	2013-01-20 12:31:11 +00:00
David Tweed	c38f11de40	r172047 lacked a test (due to incomplete OpenCL support in clang). Use a modified version of a test by Joey Gouly to use attributes to materialise the unsupported types and test vector shifts. llvm-svn: 172053	2013-01-10 10:42:08 +00:00
NAKAMURA Takumi	160087b1f6	clang/test/CodeGenOpenCL/shifts.cl: Fixup for -Asserts. llvm-svn: 171820	2013-01-08 00:15:53 +00:00
David Tweed	042e0883cb	Scalar shifts in the OpenCL specification (as of v. 1.2) are defined to be with respect to the lower "left-hand-side bitwidth" bits, even when negative); see OpenCL spec 6.3j. This patch both implements this behaviour in the code generator and "constant folding" bits of Sema, and also prevents tests to detect undefinedness in terms of the weaker C99 or C++ specifications from being applied. llvm-svn: 171755	2013-01-07 16:43:27 +00:00
Guy Benyei	d8a08ea98d	Re-commit r170428 changes with Linux style file endings. Add OpenCL images as clang builtin types. llvm-svn: 170432	2012-12-18 14:38:23 +00:00
Guy Benyei	11169dded0	Revert changes from r170428, as I accidentally changed the line endings of these files to Windows style. llvm-svn: 170431	2012-12-18 14:30:41 +00:00
Guy Benyei	b13abb952a	Add OpenCL images as clang builtin types. llvm-svn: 170428	2012-12-18 12:30:03 +00:00
Richard Trieu	29a2ffc7aa	Fix line endings in tests. No functional change. llvm-svn: 169947	2012-12-12 00:52:15 +00:00
Guy Benyei	b798fc9849	Add SPIR32/SPIR64 targets to Clang llvm-svn: 169917	2012-12-11 21:38:14 +00:00
NAKAMURA Takumi	18fc445af5	FP_CONTRACT: Fix two tests for -Asserts. llvm-svn: 165024	2012-10-02 16:36:54 +00:00
Lang Hames	5de91cc35f	Add FP_CONTRACT support for clang. Clang will now honor the FP_CONTRACT pragma and emit LLVM fmuladd intrinsics for expressions of the form A * B + C (when they occur in a single statement). llvm-svn: 164989	2012-10-02 04:45:10 +00:00
Tanya Lattner	bd837a8bde	Remove names from the CHECK lines. llvm-svn: 162003	2012-08-16 00:22:16 +00:00
Tanya Lattner	a9dd49fe5b	Convert loads and stores of vec3 to vec4 to achieve better code generation. Add test case. llvm-svn: 162002	2012-08-16 00:10:13 +00:00
Simon Atanasyan	38c63ed5f8	Fix the test case. Now it does not depend on the method used to pass vector arguments to the function. Reviewed by Anton Lokhmotov. llvm-svn: 161597	2012-08-09 17:49:22 +00:00
Tanya Lattner	7445ada9c8	Add OpenCL metadata for kernel arg names. This output is controlled via a flag as noted in the OpenCL Spec. Includes a test case. llvm-svn: 160092	2012-07-11 23:02:10 +00:00
Tanya Lattner	bcffcdfd18	Patch by Anton Lokhmotov to add OpenCL work group size attributes. llvm-svn: 159965	2012-07-09 22:06:01 +00:00
Justin Holewinski	83e9668133	Replace PTX back-end with NVPTX back-end in all places where Clang cares NV_CONTRIB llvm-svn: 157403	2012-05-24 17:43:12 +00:00
Duncan Sands	6fc461984a	Rename "fpaccuracy" metadata to the more generic "fpmath". That's because I'm thinking of generalizing it to be able to specify other freedoms beyond accuracy (such as that NaN's don't have to be respected). I'd like the 3.1 release (the first one with this metadata) to have the more generic name already rather than having to auto-upgrade it in 3.2. llvm-svn: 154745	2012-04-14 12:37:26 +00:00
Duncan Sands	e81111ca71	Express the number of ULPs in fpaccuracy metadata as a real rather than a rational number, eg as 2.5 rather than 5, 2. OK'd by Peter Collingbourne. llvm-svn: 154388	2012-04-10 08:23:07 +00:00
Tanya Lattner	3dd33b296a	A few style changes. Change CheckVectorLogicalOperands to pass params by ref. Add another test case. llvm-svn: 148452	2012-01-19 01:16:16 +00:00
Eli Friedman	2dd5d653f2	Fix test so it doesn't depend on the host's calling convention lowering code. llvm-svn: 147545	2012-01-04 20:43:57 +00:00
Eli Friedman	b9c7129012	Support constant evaluation for OpenCL nested vector literals. Patch by Anton Lokhmotov. llvm-svn: 147496	2012-01-03 23:24:20 +00:00
Peter Collingbourne	95fd2ca69f	Annotate imprecise FP division with fpaccuracy metadata The OpenCL single precision division operation is only required to be accurate to 2.5ulp. Annotate the fdiv instruction with metadata which signals to the backend that an imprecise divide instruction may be used. llvm-svn: 143136	2011-10-27 19:19:51 +00:00
Justin Holewinski	38031978b5	PTX: Set proper calling conventions for PTX in OpenCL mode. llvm-svn: 141193	2011-10-05 17:58:44 +00:00

458 Commits