llvm-project

Commit Graph

Author	SHA1	Message	Date
Jan Vesely	17e8679493	popcount: Provide function implementation rather than intrinsic redirect amdgcn will need to override this Reviewer: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 327042	2018-03-08 18:58:00 +00:00
Jan Vesely	a02d0e2c50	integer/sub_sat: Use clang builtin instead of llvm asm reviewer: Tom Stellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314703	2017-10-02 18:39:03 +00:00
Jan Vesely	1964df8fad	integer/add_sat: Use clang builtin instead of llvm asm reviewer: Tom Stellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314702	2017-10-02 18:39:00 +00:00
Jan Vesely	943057a288	integer/clz: Use clang builtin instead of llvm asm The generated llvm IR mostly identical. char/uchar case is a bit worse. reviewer: Tom Stellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314701	2017-10-02 18:38:57 +00:00
Jan Vesely	ef513d392b	Implement generic mad_sat v2: Fix trailing whitespace Fix signed long overflow improve comment v3: fix typo Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 216923	2014-09-02 17:55:02 +00:00
Aaron Watry	9447097636	Revert "Implement generic mad_sat" This reverts commit cf62eded8b623a1c10d3692d25e5882b7939f564. I didn't mean to commit this... Jan has a v3 incoming llvm-svn: 216322	2014-08-23 14:06:01 +00:00
Aaron Watry	6bfac7ae69	Implement generic mad_sat v2: Fix trailing whitespace Fix signed long overflow improve comment Signed-off-by: Jan Vesely <jan.vesely at rutgers.edu> llvm-svn: 216320	2014-08-23 14:04:33 +00:00
Jeroen Ketema	526fe2d501	Move clcmacro.h to avoid cluttering user namespace v2 v2: - use quotes instead of <> - add include to r600/lib/math/nextafter.c changed Reviewed-by: Tom Stellard <tom@stellard.net> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 211576	2014-06-24 09:36:32 +00:00
Tom Stellard	3a9632d544	s/_CLC_DECL/_CLC_DEF/ Some function definitions were using _CLC_DECL, which meant that they weren't being marked as always_inline. Reviewed-by and Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 193754	2013-10-31 15:50:53 +00:00
Aaron Watry	fbe439f8c0	Add mul_hi implementation [v2] Everything except long/ulong is handled by just casting to the next larger type, doing the math and then shifting/casting the result. For 64-bit types, we break the high/low parts of each operand apart, and do a FOIL-based multiplication. v2: Discard the stack-overflow implementation due to copyright concerns. - The implementation is still FOIL-based, but discards the previous code. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188684	2013-08-19 18:31:49 +00:00
Aaron Watry	8548725f29	Add rhadd builtin rhadd = (x+y+1)>>1 Implemented as: (x>>1) + (y>>1) + ((x&1)\|(y&1)) This prevents us having to do assembly addition and overflow detection Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188477	2013-08-15 19:21:10 +00:00
Aaron Watry	7659157f1b	Add hadd builtin (x + y) >> 1 gets changed to: (x>>1) + (y>>1) + (x&y&1) Saves us having to do any llvm assembly and overflow checking in the addition. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188476	2013-08-15 19:21:07 +00:00
Aaron Watry	1769b1fca9	Implement generic upsample() Reduces all vector upsamples down to its scalar components, so probably not the most efficient thing in the world, but it does what the spec says it needs to do. Another possible implementation would be to convert/cast everything as unsigned if necessary, upsample the input vectors, create the upsampled value, and then cast back to signed if required. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 186691	2013-07-19 16:44:37 +00:00
Tom Stellard	6f33168bb7	Implement mad24() and mul24() builtins Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 185839	2013-07-08 17:27:13 +00:00
Tom Stellard	d768ac0395	Add __CLC_ prefix to all macro definitions in headers libclc was defining and undefing GENTYPE and several other macros with common names in its header files. This was preventing applications from defining macros with identical names as command line arguments to the compiler, because the definitions in the header files were masking the macros defined as compiler arguements. Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 185838	2013-07-08 17:27:02 +00:00
Tom Stellard	e78344dfae	libclc: Implement clz() builtin Squashed commit of the following: commit a0df0a0e86c55c1bdc0b9c0f5a739e5adef4b056 Author: Aaron Watry <awatry@gmail.com> Date: Mon Apr 15 18:42:04 2013 -0500 libclc: Rename clz.ll to clz_if.ll to ensure it gets built. configure.py treats files that have the same name with the .cl and .ll extensions as overriding eachother. E.g. If you have clz.cl and clz.ll both specified to be built in the same SOURCES file, only the first file listed will actually be built. Since the contents of clz.ll were an interface that is implemented in clz_impl.ll, rename clz.ll to clz_if.ll to make sure that the interface is built. commit 931b62bed05c58f737de625bd415af09571a6a5a Author: Aaron Watry <awatry@gmail.com> Date: Sat Apr 13 12:32:54 2013 -0500 libclc: llvm assembly implementation of clz Untested... currently crashes in the same manner as add_sat. commit 6ef0b7b0b6d2e5584086b4b9a9243743b2e0538f Author: Aaron Watry <awatry@gmail.com> Date: Sat Mar 23 12:35:27 2013 -0500 libclc: Add stub clz builtin For scalar int/uint, attempt to use the clz llvm builtin.. for all others return 0 until an actual implementation is finished. Patch by: Aaron Watry llvm-svn: 185004	2013-06-26 18:21:55 +00:00
Tom Stellard	29b5b9816b	libclc: Rename [add\|sub]_sat.ll to [add\|sub]_sat_if.ll configure.py allows overloading .cl with .ll, but will only ever build the first file listed in SOURCES of ${file}.cl and ${file}.ll add_sat, sub_sat, (and the soon to be submitted clz) all define interfaces in ${function_name}.ll which are implemented in ${function_name}_impl.ll. Renaming the interface files is enough to get them to build again, fixing CL usage of these functions. Tested on clover/r600g. Patch by: Aaron Watry llvm-svn: 185000	2013-06-26 18:21:31 +00:00
Tom Stellard	a30713710c	Add a another TODO note. Patch by: Aaron Watry llvm-svn: 184999	2013-06-26 18:21:25 +00:00
Tom Stellard	4974f6c6d0	Add a TODO note. Patch by: Aaron Watry llvm-svn: 184998	2013-06-26 18:21:22 +00:00
Tom Stellard	8c1e72f46a	Simplify rotate implementation a bit.. Much more understandable/readable as a result, and probably more efficient. Patch by: Aaron Watry llvm-svn: 184997	2013-06-26 18:21:18 +00:00
Tom Stellard	0bb381eaec	libclc: implement rotate builtin This implementation does a lot of bit shifting and masking. Suffice to say, this is somewhat suboptimal... but it does look to produce correct results (after the piglit tests were corrected for sign extension issues). Someone who knows LLVM better than I could re-write this more efficiently. Patch by: Aaron Watry llvm-svn: 184996	2013-06-26 18:21:13 +00:00
Tom Stellard	cb133c9322	libclc: Move max builtin to shared/ Max(x,y) is available for all integer/floating types. Patch by: Aaron Watry llvm-svn: 184995	2013-06-26 18:21:06 +00:00
Tom Stellard	cd88a4ebb6	libclc: Fix abs_diff builtin integer function Patch by: Aaron Watry llvm-svn: 184993	2013-06-26 18:20:50 +00:00
Tom Stellard	ec87fb0b0c	libclc: Add max() builtin function Adds this function for both int and floating data types. Patch by: Aaron Watry llvm-svn: 184992	2013-06-26 18:20:46 +00:00
Peter Collingbourne	3cd1bcb358	Do not use linkonce_odr linkage in .ll files. This prevented them from being linked into the library under lazy linkage. llvm-svn: 161314	2012-08-05 22:25:48 +00:00
Peter Collingbourne	1e373f07af	Implement sub_sat builtin. Patch by Lei Mou! llvm-svn: 161312	2012-08-05 22:25:12 +00:00
Peter Collingbourne	49b4d8cc65	Fix declarations of __clc_add_sat_*. Patch by Lei Mou! llvm-svn: 161311	2012-08-05 22:24:36 +00:00
Peter Collingbourne	d5395fbf03	Initial commit. llvm-svn: 147756	2012-01-08 22:09:58 +00:00

28 Commits