llvm-project/llvm
Andrea Di Biagio ae47bc6ab9 [X86][DAG] Disable target specific combine on INSERTPS dag nodes at -O0.
This patch disables target specific combine on X86ISD::INSERTPS dag nodes
if optlevel is CodeGenOpt::None.

The backend currently implements a target specific combine rule that converts
a vector load used by an INSERTPS dag node into a scalar load plus a
scalar_to_vector. This allows ISel to select a single INSERTPSrm instead of
two instructions (i.e. a vector load plus INSERTPSrr).

However, the existing target combine rule on INSERTPS nodes only works under
the assumption that ISel will always be able to match an INSERTPSrm. This is
not true in general at -O0, since the backend only allows folding a load into
the memory operand of an instruction if the optimization level is not
CodeGenOpt::None.

In the example below:

//
__m128 test(__m128 a, __m128 *b) {
  __m128 c = _mm_insert_ps(a, *b, 1 << 6);
  return c;
}
//

Before this patch, at -O0, the backend would have canonicalized the load to 'b'
into a scalar load plus scalar_to_vector. Later on, ISel would have selected an
INSERTPSrr leaving the insertps mask in an inconsistent state:

  movss 4(%rdi), %xmm1
  insertps  $64, %xmm1, %xmm0 # xmm0 = xmm1[1],xmm0[1,2,3].

With this patch, the backend avoids folding the vector load into the operand of
the INSERTPS. The new codegen at -O0 is:

  movaps (%rdi), %xmm1
  insertps  $64, %xmm1, %xmm0 # %xmm1[1],xmm0[1,2,3].

llvm-svn: 226277
2015-01-16 14:55:26 +00:00
..
autoconf Trunk is now 3.7.0svn 2015-01-14 17:38:03 +00:00
bindings [OCaml] Allow out-of-tree builds of LLVM bindings. 2015-01-13 12:17:56 +00:00
cmake [OCaml] Use $CAMLORIGIN, an rpath-$ORIGIN-like mechanism in OCaml. 2015-01-13 09:47:59 +00:00
docs [PowerPC] Adjust PatchPoints for ppc64le 2015-01-16 04:40:58 +00:00
examples Manually specify the folder that Kaleidescope should reside in for CMake-produced solutions that care about such things (like MSVC). This takes the Kaleidescope target out of the root solution folder and places it into the Examples folder where it belongs. 2015-01-07 14:26:07 +00:00
include Use report_fatal_error instead of llvm_unreachable, so we don't crash on user input 2015-01-16 04:54:12 +00:00
lib [X86][DAG] Disable target specific combine on INSERTPS dag nodes at -O0. 2015-01-16 14:55:26 +00:00
projects
test [X86][DAG] Disable target specific combine on INSERTPS dag nodes at -O0. 2015-01-16 14:55:26 +00:00
tools Add the option, -archive-headers, used with -macho to print the Mach-O archive headers to llvm-objdump. 2015-01-15 23:19:11 +00:00
unittests Fix the C-API MCJIT test for 32-bit big endian machines. 2015-01-15 15:36:04 +00:00
utils Replace size method call of containers to empty method where appropriate 2015-01-15 11:41:30 +00:00
.arcconfig
.clang-format
.clang-tidy
.gitignore .gitignore: add some rules for tagging programs 2015-01-10 19:11:29 +00:00
CMakeLists.txt Trunk is now 3.7.0svn 2015-01-14 17:38:03 +00:00
CODE_OWNERS.TXT Add myself as SystemZ code owner 2014-12-18 19:27:50 +00:00
CREDITS.TXT
LICENSE.TXT
LLVMBuild.txt
Makefile
Makefile.common
Makefile.config.in Add a check for misbehaving -Wcomment from gcc-4.7 and add 2014-11-05 00:35:15 +00:00
Makefile.rules Add a check for misbehaving -Wcomment from gcc-4.7 and add 2014-11-05 00:35:15 +00:00
README.txt Test commit. 2015-01-07 22:07:33 +00:00
configure Trunk is now 3.7.0svn 2015-01-14 17:38:03 +00:00
llvm.spec.in

README.txt

Low Level Virtual Machine (LLVM)
================================

This directory and its subdirectories contain source code for LLVM,
a toolkit for the construction of highly optimized compilers,
optimizers, and runtime environments.

LLVM is open source software. You may freely distribute it under the terms of
the license agreement found in LICENSE.txt.

Please see the documentation provided in docs/ for further
assistance with LLVM, and in particular docs/GettingStarted.rst for getting
started with LLVM and docs/README.txt for an overview of LLVM's
documentation setup.

If you're writing a package for LLVM, see docs/Packaging.rst for our
suggestions.