llvm-project

History

Adam Nemet 053c4e825c [AVX512] Fix miscompile for unpack r189189 implemented AVX512 unpack by essentially performing a 256-bit unpack between the low and the high 256 bits of src1 into the low part of the destination and another unpack of the low and high 256 bits of src2 into the high part of the destination. I don't think that's how unpack works. AVX512 unpack simply has more 128-bit lanes but other than it works the same way as AVX. So in each 128-bit lane, we're always interleaving certain parts of both operands rather different parts of one of the operands. E.g. for this: __v16sf a = { 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 }; __v16sf b = { 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 }; __v16sf c = __builtin_shufflevector(a, b, 0, 8, 1, 9, 4, 12, 5, 13, 16, 24, 17, 25, 20, 28, 21, 29); we generated punpcklps (notice how the elements of a and b are not interleaved in the shuffle). In turn, c was set to this: 0 16 1 17 4 20 5 21 8 24 9 25 12 28 13 29 Obviously this should have just returned the mask vector of the shuffle vector. I mostly reverted this change and made sure the original AVX code worked for 512-bit vectors as well. Also updated the tests because they matched the logic from the code. llvm-svn: 217602		2014-09-11 16:51:10 +00:00
..
Analysis	Make use @llvm.assume for loop guards in ScalarEvolution	2014-09-07 21:37:59 +00:00
Assembler	[inline asm] Add a check in InlineAsm::ConstraintInfo::Parse to make sure '{'	2014-09-05 22:30:32 +00:00
Bindings	Restore the ability to check if LLVMCreateObjectFile was successful	2014-09-05 21:22:09 +00:00
Bitcode	Teach llvm-bcanalyzer to use one stream's BLOCKINFO to read another stream.	2014-08-30 17:07:55 +00:00
BugPoint	llvm/test/BugPoint/compile-custom.ll: Use explicit %python to invoke a test script, compile-custom.ll.py, for shebang-incapable hosts.	2014-07-11 14:44:10 +00:00
CodeGen	[AVX512] Fix miscompile for unpack	2014-09-11 16:51:10 +00:00
DebugInfo	DebugInfo: Do not use DW_FORM_GNU_addr_index in skeleton CUs, GDB 7.8 errors on this.	2014-09-07 17:31:42 +00:00
ExecutionEngine	[MCJIT] Make sure eh-frame fixups use the target's pointer type, not the host's.	2014-09-04 04:53:03 +00:00
Feature	Use "weak alias" instead of "alias weak"	2014-07-30 22:51:54 +00:00
FileCheck	FileCheck: Add a flag to allow checking empty input	2014-08-07 18:40:37 +00:00
Instrumentation	[asan-assembly-instrumentation] Added CFI directives to the generated instrumentation code.	2014-09-10 09:45:49 +00:00
Integer	…
JitListener	…
LTO	Change the default input for llvm-nm to be a.out instead of standard input	2014-06-23 20:27:53 +00:00
Linker	Merge alignment of common GlobalValue.	2014-09-09 17:48:18 +00:00
MC	Object: Add support for bigobj	2014-09-10 12:51:52 +00:00
Object	Nuke MCAnalysis.	2014-09-02 22:32:20 +00:00
Other	Teach llvm-bcanalyzer to use one stream's BLOCKINFO to read another stream.	2014-08-30 17:07:55 +00:00
TableGen	Tablegen fixes for new syntax when initializing bits from variables.	2014-08-29 19:41:04 +00:00
Transforms	[AlignmentFromAssumptions] Don't crash just because the target is 32-bit	2014-09-11 08:40:17 +00:00
Unit	Let test/Unit/lit.cfg add config.shlibdir to $PATH on DLL platforms like cygming.	2014-07-04 05:11:55 +00:00
Verifier	Verifier: Don't reject varargs callee cleanup functions	2014-08-29 21:25:28 +00:00
YAMLParser	…
tools	Remember to eraseFromParent after replaceAllUsesWith.	2014-09-10 19:39:41 +00:00
.clang-format	…
CMakeLists.txt	Add LLVMgold target to test dependencies.	2014-09-10 22:20:49 +00:00
Makefile	Delete support for AuroraUX.	2014-08-14 15:15:09 +00:00
Makefile.tests	…
TestRunner.sh	…
lit.cfg	Reinstate "Nuke the old JIT."	2014-09-02 22:28:02 +00:00
lit.site.cfg.in	Add missing Interpreter intrinsic lowering for sin, cos and ceil	2014-08-08 15:00:12 +00:00