llvm-project

Commit Graph

Author	SHA1	Message	Date
Adrian Prantl	58c1910642	[LLParser] Make the line field of DIMacro(File) optional. Otherwise these records do not survive roundtrips. llvm-svn: 290291	2016-12-22 00:29:00 +00:00
Alexander Kornienko	b5ca17f817	[clang-tidy] Ignore `size() == 0` in the container implementation. llvm-svn: 290289	2016-12-21 23:44:23 +00:00
Adrian Prantl	ec9ebba778	Legalize metadata in legacy testcases llvm-svn: 290288	2016-12-21 23:38:17 +00:00
Adrian Prantl	762e4b72c6	Legalize metadata in legacy testcases llvm-svn: 290287	2016-12-21 23:36:06 +00:00
Adrian Prantl	aad5df484c	Legalize metadata in legacy testcases llvm-svn: 290286	2016-12-21 23:30:35 +00:00
Adrian Prantl	b767f31290	Legalize metadata in legacy testcases llvm-svn: 290285	2016-12-21 23:28:49 +00:00
Ahmed Bougacha	36f7035bd7	[GlobalISel] Add basic Selector-emitter tblgen backend. This adds a basic tablegen backend that analyzes the SelectionDAG patterns to find simple ones that are eligible for GlobalISel-emission. That's similar to FastISel, with one notable difference: we're not fed ISD opcodes, so we need to map the SDNode operators to generic opcodes. That's done using GINodeEquiv in TargetGlobalISel.td. Otherwise, this is mostly boilerplate, and lots of filtering of any kind of "complicated" pattern. On AArch64, this is sufficient to match G_ADD up to s64 (to ADDWrr/ADDXrr) and G_BR (to B). Differential Revision: https://reviews.llvm.org/D26878 llvm-svn: 290284	2016-12-21 23:26:20 +00:00
Ahmed Bougacha	aa9fe53278	[AsmWriter] Remove redundant cast<>s. NFC. llvm-svn: 290283	2016-12-21 23:26:13 +00:00
Sean Callanan	756cb33b6a	specify -DNDEBUG for BNI builds of all targets in the Xcode build llvm-svn: 290282	2016-12-21 23:21:11 +00:00
Dan Gohman	a2b9b349e7	[WebAssembly] Fix the opcode value for i64.rotr. llvm-svn: 290281	2016-12-21 23:09:42 +00:00
Peter Collingbourne	1b4137a7f9	IR: Function summary representation for type tests. Each function summary has an attached list of type identifier GUIDs. The idea is that during the regular LTO phase we would match these GUIDs to type identifiers defined by the regular LTO module and store the resolutions in a top-level "type identifier summary" (which will be implemented separately). Differential Revision: https://reviews.llvm.org/D27967 llvm-svn: 290280	2016-12-21 23:03:45 +00:00
Evgeniy Stepanov	76a60a8ccd	Increase the treshold in unit test to accomodate for qurantine size increase. Reviewers: eugenis Patch by Alex Shlyapnikov. Subscribers: llvm-commits, kubabrecka Differential Revision: https://reviews.llvm.org/D28029 llvm-svn: 290279	2016-12-21 22:50:08 +00:00
Mike Aizatsky	bfe5045b9c	[sancov] skip duplicated points llvm-svn: 290278	2016-12-21 22:10:01 +00:00
Mike Aizatsky	987f6420ac	[sancov] hash prefix results in huge merge files, use shorter prefix llvm-svn: 290277	2016-12-21 22:09:57 +00:00
Richard Smith	52e624f3ec	Perform type-checking for a converted constant expression in a template argument even if the expression is value-dependent (we need to suppress the final portion of the narrowing check, but the rest of the checking can still be done eagerly). This affects template template argument validity and partial ordering under p0522r0. llvm-svn: 290276	2016-12-21 21:42:57 +00:00
Haicheng Wu	6bb0e39321	[AArch64] Remove a redundant check. NFC. The case AM.Scale == 0 is already handled by the code right above. Differential Revision: https://reviews.llvm.org/D28003 llvm-svn: 290275	2016-12-21 21:40:47 +00:00
Greg Clayton	78a07bfa66	Add the ability for DWARFDie objects to get the parent DWARFDie. In order for the llvm DWARF parser to be used in LLDB we will need to be able to get the parent of a DIE. This patch adds that functionality by changing the DWARFDebugInfoEntry class to store a depth field instead of a sibling index. Using a depth field allows us to easily calculate the sibling and the parent without increasing the size of DWARFDebugInfoEntry. I tested llvm-dsymutil on a debug version of clang where this fully parses DWARF in over 1200 .o files to verify there was no serious regression in performance. Added a full suite of unit tests to test this functionality. Differential Revision: https://reviews.llvm.org/D27995 llvm-svn: 290274	2016-12-21 21:37:06 +00:00
Chris Bieneman	e9ce09b89f	[CMake] Support distribution install for LLDB.framework This patch adds the last bit of support to get LLVM_DISTRIBUTION_COMPONENTS working with libLLDB when built as a framework. This patch adds dummy install targets for binaries built into the framework's Resources directory, and makes the framework's install target depend on all the binaries that get installed with the framework. llvm-svn: 290273	2016-12-21 21:23:27 +00:00
Andrey Churbanov	76d4285460	Fix for the __kmpc_global_num_threads function to return the value of the __kmp_all_nth global var. Patch by Yonghong Yan. Differential Revision: https://reviews.llvm.org/D27975 llvm-svn: 290272	2016-12-21 21:20:20 +00:00
Justin Bogner	c11760d4ed	cmake: Don't build llvm-config and tblgen concurrently in cross builds This sets USES_TERMINAL for the native llvm-config build, so that it doesn't run at the same time as builds of other native tools (namely, tablegen). Without this, if you're very unlucky with the timing it's possible to be relinking libSupport as one of the tools is linking, causing a spurious failure. The tablegen build adopted USES_TERMINAL for this same reason in r280748. llvm-svn: 290271	2016-12-21 21:19:00 +00:00
Ed Maste	084062803e	Update mailing list post URL and add libunwind reference RTDyldMemoryManager.cpp describes the differing __register_frame API between libunwind and libgcc, with a mailing list posting URL. The original link was 404; replace it with what I believe is the intended post, as well as a reference to the "OS X" implementation in libunwind. Differential Revision: https://reviews.llvm.org/D27965 llvm-svn: 290269	2016-12-21 20:51:42 +00:00
Tim Northover	c67803fb14	ARM: define a macro for the FPv5 FPU in ARM mode. FPv5 is in Cortex-M7 and the 64-bit CPUs when running in 32-bit mode. The name is from the Cortex-M7 TRM. llvm-svn: 290268	2016-12-21 20:49:43 +00:00
Simon Pilgrim	081abbb164	[X86][SSE] Improve lowering of vXi64 multiplies As mentioned on PR30845, we were performing our vXi64 multiplication as: AloBlo = pmuludq(a, b); AloBhi = pmuludq(a, psrlqi(b, 32)); AhiBlo = pmuludq(psrlqi(a, 32), b); return AloBlo + psllqi(AloBhi, 32)+ psllqi(AhiBlo, 32); when we could avoid one of the upper shifts with: AloBlo = pmuludq(a, b); AloBhi = pmuludq(a, psrlqi(b, 32)); AhiBlo = pmuludq(psrlqi(a, 32), b); return AloBlo + psllqi(AloBhi + AhiBlo, 32); This matches the lowering on gcc/icc. Differential Revision: https://reviews.llvm.org/D27756 llvm-svn: 290267	2016-12-21 20:00:10 +00:00
David Majnemer	b0761a0c1b	Revert "[InstCombine] New opportunities for FoldAndOfICmp and FoldXorOfICmp" This reverts commit r289813, it caused PR31449. llvm-svn: 290266	2016-12-21 19:21:59 +00:00
Tom Stellard	d8ea85aced	AMDGPU/SI: Fix file header llvm-svn: 290265	2016-12-21 19:06:24 +00:00
Peter Collingbourne	35f3f7cdc7	TypeMetadataUtils: Simplify; spotted by Mehdi. llvm-svn: 290264	2016-12-21 19:00:47 +00:00
Zachary Turner	ab266cf95b	Add missing includes on Windows. Patch by Andrey Khalyavin Differential Revision: https://reviews.llvm.org/D27915 llvm-svn: 290263	2016-12-21 18:50:52 +00:00
Paul Robinson	80ba2929e6	Make some diagnostic tests C++11 clean. Differential Revision: http://reviews.llvm.org/D27794 llvm-svn: 290262	2016-12-21 18:33:17 +00:00
Michael Kuperstein	88f15eedbb	[LLParser] Parse vector GEP constant expression correctly The constantexpr parsing was too constrained and rejected legal vector GEPs. This relaxes it to be similar to the ones for instruction parsing. This fixes PR30816. Differential Revision: https://reviews.llvm.org/D28013 llvm-svn: 290261	2016-12-21 18:29:47 +00:00
Michael Kuperstein	dd92c78669	[ConstantFolding] Fix vector GEPs harder For vector GEPs, CastGEPIndices can end up in an infinite recursion, because we compare the vector type to the scalar pointer type, find them different, and then try to cast a type to itself. Differential Revision: https://reviews.llvm.org/D28009 llvm-svn: 290260	2016-12-21 17:34:21 +00:00
Daniel Jasper	083d1700a0	clang-format: Fix bug in handling of single-column lists. Members that are themselves wrapped in fake parentheses would lead to AvoidBinPacking be set on the wrong ParenState. After: vector<int> aaaa = { aaaaaa.aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa, aaaaaa.aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa, aaaaaa.aaaaaaa, aaaaaa.aaaaaaa, aaaaaa.aaaaaaa, aaaaaa.aaaaaaa, }; Before we were falling back to bin-packing these. llvm-svn: 290259	2016-12-21 17:02:06 +00:00
Simon Pilgrim	12c3f33650	Wdocumentation fix llvm-svn: 290258	2016-12-21 16:39:09 +00:00
Simon Pilgrim	c93cd30fac	[CostModel] Pass shuffle mask args with ArrayRef. NFCI. llvm-svn: 290257	2016-12-21 15:49:01 +00:00
Roman Gareev	be5299af0b	Change the determination of parameters of macro-kernel Typically processor architectures do not include an L3 cache, which means that Nc, the parameter of the micro-kernel, is, for all practical purposes, redundant ([1]). However, its small values can cause the redundant packing of the same elements of the matrix A, the first operand of the matrix multiplication. At the same time, big values of the parameter Nc can cause segmentation faults in case the available stack is exceeded. This patch adds an option to specify the parameter Nc as a multiple of the parameter of the micro-kernel Nr. In case of Intel Core i7-3820 SandyBridge and the following options, clang -O3 gemm.c -I utilities/ utilities/polybench.c -DPOLYBENCH_TIME -march=native -mllvm -polly -mllvm -polly-pattern-matching-based-opts=true -DPOLYBENCH_USE_SCALAR_LB -mllvm -polly-target-cache-level-associativity=8,8 -mllvm -polly-target-cache-level-sizes=32768,262144 -mllvm -polly-target-latency-vector-fma=8 it helps to improve the performance from 11.303 GFlops/sec (39,247% of theoretical peak) to 17.896 GFlops/sec (62,14% of theoretical peak). Refs.: [1] - http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D28019 llvm-svn: 290256	2016-12-21 12:51:12 +00:00
Michael Zuckerman	85e12d2851	revert first commit . removing empty line in X86.h llvm-svn: 290255	2016-12-21 12:48:01 +00:00
Michael Zuckerman	58838cf29d	First commit adding new line to X86.h llvm-svn: 290254	2016-12-21 12:44:47 +00:00
Roman Gareev	bd5c6039c6	Align newly created arrays to the first level cache line boundary Aligning data to cache lines boundaries helps to avoid overheads related to an access to it ([1]). This patch aligns newly created arrays and adds an option to specify the first level cache line size. By default we use 64 bytes, which is a typical cache-line size ([2]). In case of Intel Core i7-3820 SandyBridge and the following options, clang -O3 gemm.c -I utilities/ utilities/polybench.c -DPOLYBENCH_TIME -march=native -mllvm -polly -mllvm -polly-pattern-matching-based-opts=true -DPOLYBENCH_USE_SCALAR_LB -mllvm -polly-target-cache-level-associativity=8,8 -mllvm -polly-target-cache-level-sizes=32768,262144 -mllvm -polly-target-latency-vector-fma=8 it helps to improve the performance from 11.303 GFlops/sec (39,247% of theoretical peak) to 12.63 GFlops/sec (43,8542% of theoretical peak). Refs.: [1] - http://www.alexonlinux.com/aligned-vs-unaligned-memory-access [2] - http://igoro.com/archive/gallery-of-processor-cache-effects/ Differential Revision: https://reviews.llvm.org/D28020 Reviewed-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 290253	2016-12-21 12:37:36 +00:00
Davide Italiano	7116dc908c	[ELF/tests] Use cpio -it instead of cpio -t. OpenBSD's cpio does not accept the -t option without -i. Apparently some systems implement cpio -t as a shortcut for cpio -it, the latter is the only thing that's documented. This change avoids test failures on OpenBSD. Patch by Mark Kettenis! Differential Revision: https://reviews.llvm.org/D28002 llvm-svn: 290252	2016-12-21 12:22:19 +00:00
Roman Gareev	92c446016a	[Polly] Use three-dimensional arrays to store packed operands of the matrix multiplication Previously we had two-dimensional accesses to store packed operands of the matrix multiplication for the sake of simplicity of the packed arrays. However, addition of the third dimension helps to simplify the corresponding memory access, reduce the execution time of isl operations applied to it, and consequently reduce the compile-time of Polly. For example, in case of Intel Core i7-3820 SandyBridge and the following options, clang -O3 gemm.c -I utilities/ utilities/polybench.c -DPOLYBENCH_TIME -march=native -mllvm -polly -mllvm -polly-pattern-matching-based-opts=true -DPOLYBENCH_USE_SCALAR_LB -mllvm -polly-target-cache-level-associativity=8,8 -mllvm -polly-target-cache-level-sizes=32768,262144 -mllvm -polly-target-latency-vector-fma=7 it helps to reduce the compile-time from about 361.456 seconds to about 0.816 seconds. Reviewed-by: Michael Kruse <llvm@meinersbur.de>, Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D27878 llvm-svn: 290251	2016-12-21 11:18:42 +00:00
Elena Demikhovsky	7c7bf1b432	Added a template for building target specific memory node in DAG. I added API for creation a target specific memory node in DAG. Today, all memory nodes are common for all targets and their constructors are located in SelectionDAG.cpp. There are some cases in X86 where we need to create a special node - truncation-with-saturation store, float-to-half-store. In the current patch I added truncation-with-saturation nodes and I'm using them for intrinsics. In the future I plan to implement DAG lowering for truncation-with-saturation pattern. Differential Revision: https://reviews.llvm.org/D27899 llvm-svn: 290250	2016-12-21 10:43:36 +00:00
Davide Italiano	c96272c47c	[AMDGPU] Garbage collect dead code. NFCI. llvm-svn: 290249	2016-12-21 10:19:00 +00:00
Oren Ben Simhon	cb692157b7	[X86] Vectorcall Calling Convention - Adding CodeGen Complete Support Fixing a warning. llvm-svn: 290248	2016-12-21 09:47:31 +00:00
George Rimar	d450065308	[ELF] - Linkerscript: Fall back to search paths when INCLUDE not found From https://sourceware.org/binutils/docs/ld/File-Commands.html: The file will be searched for in the current directory, and in any directory specified with the -L option. Patch done by Alexander Richardson. Differential revision: https://reviews.llvm.org/D27831 llvm-svn: 290247	2016-12-21 09:42:25 +00:00
Oren Ben Simhon	cecc4af496	[X86] Vectorcall Calling Convention - Adding CodeGen Complete Support Fixing failing test. llvm-svn: 290246	2016-12-21 09:18:37 +00:00
Oren Ben Simhon	c11addb506	Reverting last change. llvm-svn: 290245	2016-12-21 09:04:08 +00:00
Oren Ben Simhon	de2eea7298	[X86] Vectorcall Calling Convention - Adding CodeGen Complete Support Fixing build issues. llvm-svn: 290244	2016-12-21 08:59:42 +00:00
George Rimar	cc9302d0b7	[ELF] - Removed trailing whitespaces. NFC. llvm-svn: 290243	2016-12-21 08:58:36 +00:00
Oren Ben Simhon	016f2af3c7	[X86] Vectorcall Calling Convention - Adding CodeGen Complete Support Fixing build issues. llvm-svn: 290242	2016-12-21 08:58:19 +00:00
Rui Ueyama	4f2f50dc64	De-template DefinedSynthetic. DefinedSynthetic is not created for a real ELF object, so it doesn't have to be a template function. It has a virtual st_value, which is either 32 bit or 64 bit, but we can simply use 64 bit. llvm-svn: 290241	2016-12-21 08:40:09 +00:00
Oren Ben Simhon	3b95157090	[X86] Vectorcall Calling Convention - Adding CodeGen Complete Support The vectorcall calling convention specifies that arguments to functions are to be passed in registers, when possible. vectorcall uses more registers for arguments than fastcall or the default x64 calling convention use. The vectorcall calling convention is only supported in native code on x86 and x64 processors that include Streaming SIMD Extensions 2 (SSE2) and above. The current implementation does not handle Homogeneous Vector Aggregates (HVAs) correctly and this review attempts to fix it. This aubmit also includes additional lit tests to cover better HVAs corner cases. Differential Revision: https://reviews.llvm.org/D27392 llvm-svn: 290240	2016-12-21 08:31:45 +00:00

1 2 3 4 5 ...

250327 Commits All Branches Search

250327 Commits

All Branches