llvm-project

History

Justin Lebar 2e4ecfdebe [CUDA] Implement __ldg using intrinsics. Summary: Previously it was implemented as inline asm in the CUDA headers. This change allows us to use the [addr+imm] addressing mode when executing ld.global.nc instructions. This translates into a 1.3x speedup on some benchmarks that call this instruction from within an unrolled loop. Reviewers: tra, rsmith Subscribers: jhen, cfe-commits, jholewinski Differential Revision: http://reviews.llvm.org/D19990 llvm-svn: 270150		2016-05-19 22:49:13 +00:00
..
clang	[CUDA] Implement __ldg using intrinsics.	2016-05-19 22:49:13 +00:00
clang-c	Enable support for __float128 in Clang and enable it on pertinent platforms	2016-05-09 08:52:33 +00:00
CMakeLists.txt	…