llvm-project

Commit Graph

Author	SHA1	Message	Date
Adam Nemet	286ae08e7d	Implement AVX1 vbroadcast intrinsics with vector initializers These intrinsics are special because they directly take a memory operand (AVX2 adds the register counterparts). Typically, other non-memop intrinsics take registers and then it's left to isel to fold memory operands. In order to LICM intrinsics directly reading memory, we require that no stores are in the loop (LICM) or that the folded load accesses constant memory (MachineLICM). When neither is the case we fail to hoist a loop-invariant broadcast. We can work around this limitation if we expose the load as a regular load and then just implement the broadcast using the vector initializer syntax. This exposes the load to LICM and other optimizations. At the IR level this is translated into a series of insertelements. The sequence is already recognized as a broadcast so there is no impact on the quality of codegen. _mm256_broadcast_pd and _mm256_broadcast_ps are not updated by this patch because right now we lack the DAG-combiner smartness to recover the broadcast instructions. This will be tackled in a follow-on. There will be completing changes on the LLVM side to remove the LLVM intrinsics and to auto-upgrade bitcode files. Fixes <rdar://problem/16494520> llvm-svn: 209846	2014-05-29 20:47:29 +00:00
Craig Topper	26e74e50b6	Convert vperm2f128 and vperm2i128 intrinsics back to using llvm intrinsics. Unfortunately, these instructions have behavior that can't be modeled with shuffle vector. llvm-svn: 154906	2012-04-17 05:16:56 +00:00
Craig Topper	678a53c350	Fix shuffle vector calculation for mm_permute_ps. Fixes PR 12401. llvm-svn: 153724	2012-03-30 05:09:18 +00:00
Craig Topper	e5ea3b0239	Remove vperm2f* and vperm2i builtins. Same effect can be achieved with builtin_shufflevector. llvm-svn: 150064	2012-02-08 07:33:36 +00:00
Craig Topper	fec9f8edb7	Remove vpermilp* builtins. Same effect can be achieved with builtin_shufflevector. llvm-svn: 150056	2012-02-08 05:16:54 +00:00
Bruno Cardoso Lopes	fbb8b84f5f	Add testcase for r138411 llvm-svn: 138422	2011-08-24 01:35:04 +00:00

6 Commits