llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	7091a743b4	[InstCombine] Support (X \| C1) & C2 --> (X & C2^(C1&C2)) \| (C1&C2) for vector splats Note the original code I deleted incorrectly listed this as (X \| C1) & C2 --> (X & C2^(C1&C2)) \| C1 Which is only valid if C1 is a subset of C2. This relied on SimplifyDemandedBits to remove any extra bits from C1 before we got to that code. My new implementation avoids relying on that behavior so that it can be naively verified with alive. Differential Revision: https://reviews.llvm.org/D36384 llvm-svn: 310272	2017-08-07 18:10:39 +00:00
Kuba Mracek	b0d208a0ab	[sanitizer] Remove use of task_for_pid from sanitizer_stoptheworld_mac.cc Using task_for_pid to get the "self" task is not necessary, and it can fail (e.g. for sandboxed processes). Let's just use mach_task_self(). Differential Revision: https://reviews.llvm.org/D36284 llvm-svn: 310271	2017-08-07 18:07:20 +00:00
Abhishek Aggarwal	a8af18ac1d	Fixed build failure for revision r310261 -- Was failing for Linux llvm-svn: 310270	2017-08-07 17:15:26 +00:00
Matt Arsenault	aac47c1c00	AMDGPU: Use a custom areInlineCompatible Fixes not inlining OpenCL library functions on AMDGPU, which don't have an explicitly set target-cpu. llvm-svn: 310269	2017-08-07 17:08:44 +00:00
Simon Pilgrim	0242cead2c	[X86][AVX] Add full test coverage of subvector_broadcasts from registers X86SubVBroadcast is for memory subvector broadcasts, but we must test that it handles all cases without the load as well just in case. This was noticed while I was triaging the test cases from PR34041. llvm-svn: 310268	2017-08-07 16:49:09 +00:00
Simon Dardis	b1b52c0200	[DebugInfo][DWARF] Address paulr's comment on rL310253. llvm-svn: 310267	2017-08-07 16:08:11 +00:00
Abhishek Aggarwal	96bea51234	Fixed build failure for revision r310261 -- Build was failing for freebsd llvm-svn: 310266	2017-08-07 15:53:30 +00:00
Simon Pilgrim	8b9c4f2855	[X86][AVX] Cleanup subvector broadcast tests - remove old prefixes. llvm-svn: 310265	2017-08-07 15:50:43 +00:00
Sanjay Patel	807f92b8ff	[x86] revert r310208 to investigate test-suite failures (PR34105 / PR34097) llvm-svn: 310264	2017-08-07 15:47:48 +00:00
Gheorghe-Teodor Bercea	47e0cf378c	[OpenMP] Add flag for specifying the target device architecture for OpenMP device offloading Summary: OpenMP has the ability to offload target regions to devices which may have different architectures. A new -fopenmp-target-arch flag is introduced to specify the device architecture. In this patch I use the new flag to specify the compute capability of the underlying NVIDIA architecture for the OpenMP offloading CUDA tool chain. Only a host-offloading test is provided since full device offloading capability will only be available when [[ https://reviews.llvm.org/D29654 \| D29654 ]] lands. Reviewers: hfinkel, Hahnfeld, carlo.bertolli, caomhin, ABataev Reviewed By: hfinkel Subscribers: guansong, cfe-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D34784 llvm-svn: 310263	2017-08-07 15:39:11 +00:00
Simon Dardis	02d9945e6f	[DebugInfo][DWARF] Correct some usages of PRIx32 to PRIx64 These lead to tests failing spuriously as the values after being rendered to a string were incorrect. Reviewers: clayborg Differential Revision: https://reviews.llvm.org/D36319 llvm-svn: 310262	2017-08-07 15:37:57 +00:00
Abhishek Aggarwal	307db0f897	Tool for using Intel(R) Processor Trace hardware feature Summary: 1. Provide single library for all Intel specific hardware features instead of individual libraries for each feature 2. Added Intel(R) Processor Trace hardware feature in this single library. Details about the tool implementing this feature is as follows: Tool developed on top of LLDB to provide its users the execution trace of the debugged inferiors. Tool's API are exposed as C++ object oriented interface in a shared library. API are designed especially to be easily integrable with IDEs providing LLDB as an application debugger. Entire API is also available as Python functions through a script bridging interface allowing development of python modules. This patch also provides a CLI wrapper to use the Tool through LLDB's command line. Highlights of the Tool and the wrapper are given below: **************************** Intel(R) Processor Trace Tool: ************************** - Provides execution trace of the debugged application - Uses Intel(R) Processor Trace hardware feature (already implemented inside LLDB) for this purpose -- Collects trace packets generated by this feature from LLDB, decodes and post-processes them -- Constructs the execution trace of the application -- Presents execution trace as a list of assembly instructions - Provides 4 APIs (exposed as C++ object oriented interface) -- start trace with configuration options for a thread/process, -- stop trace for a thread/process, -- get the execution flow (assembly instructions) for a thread, -- get trace specific information for a thread - Easily integrable into IDEs providing LLDB as application debugger - Entire API available as Python functions through script bridging interface -- Allows developing python apps on top of Tool - README_TOOL.txt provides more details about the Tool, its dependencies, building steps and API usage - Tool ready to use through LLDB's command line -- CLI wrapper has been developed on top of the Tool for this purpose ***************************** CLI wrapper: cli-wrapper-pt.cpp ******************************* - Provides 4 commands (syntax similar to LLDB's CLI commands): -- processor-trace start -- processor-trace stop -- processor-trace show-trace-options -- processor-trace show-instr-log - README_CLI.txt provides more details about commands and their options Signed-off-by: Abhishek Aggarwal <abhishek.a.aggarwal@intel.com> Reviewers: clayborg, jingham, lldb-commits, labath Reviewed By: clayborg Subscribers: ravitheja, emaste, krytarowski, mgorny Differential Revision: https://reviews.llvm.org/D33035 llvm-svn: 310261	2017-08-07 15:26:11 +00:00
Alexey Bataev	9581b42589	[SLP] General improvements of SLP vectorization process. Patch tries to improve two-pass vectorization analysis, existing in SLP vectorizer. What it does: 1. Defines key nodes, that are the vectorization roots. Previously vectorization started if StoreInst or ReturnInst is found. For now, the vectorization started for all Instructions with no users and void types (Terminators, StoreInst) + CallInsts. 2. CmpInsts, InsertElementInsts and InsertValueInsts are stored in the array. This array is processed only after the vectorization of the first-after-these instructions key node is finished. Vectorization goes in reverse order to try to vectorize as much code as possible. Reviewers: mzolotukhin, Ayal, mkuper, gilr, hfinkel, RKSimon Subscribers: ashahid, anemet, RKSimon, mssimpso, llvm-commits Differential Revision: https://reviews.llvm.org/D29826 llvm-svn: 310260	2017-08-07 15:25:49 +00:00
Matt Arsenault	faeac6b15e	Fix typo in comment llvm-svn: 310259	2017-08-07 14:58:43 +00:00
Matt Arsenault	8728c5f2db	AMDGPU: Cleanup subtarget features Try to avoid mutually exclusive features. Don't use a real default GPU, and use a fake "generic". The goal is to make it easier to see which set of features are incompatible between feature strings. Most of the test changes are due to random scheduling changes from not having a default fullspeed model. llvm-svn: 310258	2017-08-07 14:58:04 +00:00
Alexey Bataev	53d523c9eb	Revert "[SLP] General improvements of SLP vectorization process." This reverts commit r310255. llvm-svn: 310257	2017-08-07 14:51:52 +00:00
Nirav Dave	3d3bde7682	[DAG] Extend visitSCALAR_TO_VECTOR optimization to truncated vector. Relanding after case to insert explicit truncation as necessary. Allow SCALAR_TO_VECTOR of EXTRACT_VECTOR_ELT to reduce to EXTRACT_SUBVECTOR of vector shuffle when output is smaller. Marginally improves vector shuffle computations. Reviewers: efriedma, RKSimon, spatel Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D35566 llvm-svn: 310256	2017-08-07 14:07:49 +00:00
Alexey Bataev	faace8f1f1	[SLP] General improvements of SLP vectorization process. Summary: Patch tries to improve two-pass vectorization analysis, existing in SLP vectorizer. What it does: 1. Defines key nodes, that are the vectorization roots. Previously vectorization started if StoreInst or ReturnInst is found. For now, the vectorization started for all Instructions with no users and void types (Terminators, StoreInst) + CallInsts. 2. CmpInsts, InsertElementInsts and InsertValueInsts are stored in the array. This array is processed only after the vectorization of the first-after-these instructions key node is finished. Vectorization goes in reverse order to try to vectorize as much code as possible. Reviewers: mzolotukhin, Ayal, mkuper, gilr, hfinkel, RKSimon Subscribers: ashahid, anemet, RKSimon, mssimpso, llvm-commits Differential Revision: https://reviews.llvm.org/D29826 llvm-svn: 310255	2017-08-07 14:03:17 +00:00
Nirav Dave	b2f3fad40c	[TableGen] AsmMatcher: fix OpIdx computation when HasOptionalOperands is true Relanding after fixing UB issue with DefaultOffsets. Consider the following instruction: "inst.eq $dst, $src" where ".eq" is an optional flag operand. The $src and $dst operands are registers. If we parse the instruction "inst r0, r1", the flag is not present and it will be marked in the "OptionalOperandsMask" variable. After the matching is complete we call the "convertToMCInst" method. The current implementation works only if the optional operands are at the end of the array. The "Operands" array looks like [token:"inst", reg:r0, reg:r1]. The first operand that must be added to the MCInst is the destination, the r0 register. The "OpIdx" (in the Operands array) for this register is 2. However, since the flag is not present in the Operands, the actual index for r0 should be 1. The flag is not present since we rely on the default value. This patch removes the "NumDefaults" variable and replaces it with an array (DefaultsOffset). This array contains an index for each operand (excluding the mnemonic). At each index, the array contains the number of optional operands that should be subtracted. For the previous example, this array looks like this: [0, 1, 1]. When we need to access the r0 register, we compute its index as 2 - DefaultsOffset[1] = 1. Patch by Alexandru Guduleasa! Reviewers: SamWot, nhaustov, niravd Reviewed By: niravd Subscribers: vitalybuka, llvm-commits Differential Revision: https://reviews.llvm.org/D35998 llvm-svn: 310254	2017-08-07 13:55:27 +00:00
Simon Dardis	ec4ea99766	[DebugInfo][DWARF] Use PRIx64 explicitly in output. llvm-svn: 310253	2017-08-07 13:30:03 +00:00
Michael Zuckerman	680ac10aa7	[X86][LLVM]Expanding Supports lowerInterleavedStore() in X86InterleavedAccess (VF16 stride 4). This patch expands the support of lowerInterleavedStore to 16x8i stride 4. LLVM creates suboptimal shuffle code-gen for AVX2. In overall, this patch is a specific fix for the pattern (Strid=4 VF=16) and we plan to include more patterns in the future. The patch goal is to optimize the following sequence: At the end of the computation, we have ymm2, ymm0, ymm12 and ymm3 holding each 16 chars: c0, c1, , c16 m0, m1, , m16 y0, y1, , y16 k0, k1, ., k16 And these need to be transposed/interleaved and stored like so: c0 m0 y0 k0 c1 m1 y1 k1 c2 m2 y2 k2 c3 m3 y3 k3 .... Differential Revision: https://reviews.llvm.org/D35829 llvm-svn: 310252	2017-08-07 13:22:39 +00:00
Dmitry Preobrazhensky	50805a0b83	[AMDGPU][MC] Corrected VOP3 version of v_interp_* instructions for VI See bug 32621: https://bugs.llvm.org//show_bug.cgi?id=32621 Reviewers: vpykhtin, SamWot, arsenm Differential Revision: https://reviews.llvm.org/D35902 llvm-svn: 310251	2017-08-07 13:14:12 +00:00
Simon Dardis	3886705689	[llvm-objdump] Use PRIx64 for output of ARM64_RELOC_ADDEND llvm-svn: 310250	2017-08-07 12:29:38 +00:00
Simon Pilgrim	e2bb83a4ee	[X86][AVX] Added test for broadcast shuffle with undefs (PR34041) llvm-svn: 310249	2017-08-07 12:24:33 +00:00
Kamil Rytarowski	dc213718db	Add NetBSD support in sanitizer_test_utils.h Summary: NetBSD ships with printf_l(3) like FreeBSD. NetBSD does not ship with memalign, pvalloc, malloc with "usable size" and is the same here as Darwin, Android, FreeBSD and Windows. Sponsored by <The NetBSD Foundation> Reviewers: joerg, vitalybuka, kcc, fjricci, filcab Reviewed By: vitalybuka Subscribers: srhines, llvm-commits, emaste, kubamracek, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D36373 llvm-svn: 310248	2017-08-07 10:59:44 +00:00
Kamil Rytarowski	b0ca299cfe	Add NetBSD support in asan_errors.cc Summary: Part of the code inspired by the original work on libsanitizer in GCC 5.4 by Christos Zoulas. Sponsored by <The NetBSD Foundation> Reviewers: joerg, fjricci, vitalybuka, filcab, kcc Reviewed By: vitalybuka Subscribers: llvm-commits, kubamracek, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D36374 llvm-svn: 310247	2017-08-07 10:58:48 +00:00
Kamil Rytarowski	767960bf86	Add NetBSD support in asan_interceptors.h Summary: Part of the code inspired by the original work on libsanitizer in GCC 5.4 by Christos Zoulas. Sponsored by <The NetBSD Foundation> Reviewers: joerg, filcab, kcc, fjricci, vitalybuka Reviewed By: vitalybuka Subscribers: kubamracek, llvm-commits, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D36375 llvm-svn: 310246	2017-08-07 10:57:58 +00:00
Kamil Rytarowski	9d1f5dc37b	Enable LLVM asan support for NetBSD/i386 Summary: Verified to work and useful to run check-asan, as this target tests 32-bit and 64-bit execution. Sponsored by <The NetBSD Foundation> Reviewers: joerg, filcab, dim, vitalybuka Reviewed By: vitalybuka Subscribers: #sanitizers, cfe-commits Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D36378 llvm-svn: 310245	2017-08-07 10:57:03 +00:00
Vitaly Buka	bdd455f0d5	[asan] Return sizeof missed by r309914 llvm-svn: 310244	2017-08-07 09:08:44 +00:00
Andre Vieira	7dffb9bfa6	[ARM] Fix assembly and disassembly for VMRS/VMSR This patch addresses two issues with assembly and disassembly for VMRS/VMSR: 1.currently VMRS/VMSR instructions accessing fpsid, mvfr{0-2} and fpexc, are accepted for non ARMv8-A targets. 2. all VMRS/VMSR instructions accept writing/reading to PC and SP, when only ARMv7-A and ARMv8-A should be allowed to write/read to SP and none to PC. This patch addresses those issues and adds tests for these cases. Differential Revision: https://reviews.llvm.org/D36306 llvm-svn: 310243	2017-08-07 08:41:05 +00:00
Vitaly Buka	5d432ec929	[asan] Fix asan dynamic shadow check before copyArgsPassedByValToAllocas llvm-svn: 310242	2017-08-07 07:35:33 +00:00
Vitaly Buka	629047de8e	[asan] Disable checking of arguments passed by value for --asan-force-dynamic-shadow Fails with "Instruction does not dominate all uses!" llvm-svn: 310241	2017-08-07 07:12:34 +00:00
Vitaly Buka	f20b9bc218	Add -asan-force-dynamic-shadow test llvm-svn: 310240	2017-08-07 07:12:33 +00:00
Guy Blank	5ca01695f7	[SelectionDAG] reset NewNodesMustHaveLegalTypes flag between basic blocks The NewNodesMustHaveLegalTypes flag is set to false at the beginning of CodeGenAndEmitDAG, and set to true after legalizing types. But before calling CodeGenAndEmitDAG we build the DAG for the basic block. So for the first basic block NewNodesMustHaveLegalTypes would be 'false' during the SDAG building, and for all other basic blocks it would be 'true'. This patch sets the flag to false before SDAG building each basic block. Differential Revision: https://reviews.llvm.org/D33435 llvm-svn: 310239	2017-08-07 05:51:14 +00:00
Davide Italiano	b53b075bb1	[Reassociate] Use a range loop for clarity. NFCI. While here, rename `i` to `Rank` as the latter is more self-explanatory (and this code also uses `I` two lines below to identify an Instruction). llvm-svn: 310238	2017-08-07 01:57:21 +00:00
Davide Italiano	a5cdc22e70	[Reassociate] Try to bail out early when canonicalizing. This commit rearranges the checks to avoid calls to getRank() when not needed (e.g. when RHS == LHS). llvm-svn: 310237	2017-08-07 01:49:09 +00:00
Tobias Grosser	305d3164f2	[ScopInfo] Make Scop::canAlwaysBeHoisted a member function llvm-svn: 310236	2017-08-07 00:10:11 +00:00
Tobias Grosser	e69b272260	[ScopInfo] Move Scop::addInvariantLoads to isl++ [NFC] llvm-svn: 310235	2017-08-06 23:50:25 +00:00
Craig Topper	576fb91aef	[InstCombine] Remove shift handling from OptAndOp. Summary: This is all handled by SimplifyDemandedBits. Reviewers: spatel, davide Reviewed By: davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D36382 llvm-svn: 310234	2017-08-06 23:30:49 +00:00
Craig Topper	a1693a2ed3	[InstCombine] Support (X ^ C1) & C2 --> (X & C2) ^ (C1&C2) for vector splats. llvm-svn: 310233	2017-08-06 23:11:49 +00:00
Craig Topper	9cbdbefd0f	[InstCombine] Support '(C - X) ^ signmask -> (C + signmask - X)' and '(X + C) ^ signmask -> (X + C + signmask)' for vector splats. llvm-svn: 310232	2017-08-06 22:17:21 +00:00
Tobias Grosser	61bd3a4840	[ScopInfo] Move Scop::getPwAffOnly to isl++ [NFC] llvm-svn: 310231	2017-08-06 21:42:38 +00:00
Tobias Grosser	31df6f31c0	[ScopInfo] Move Scop::getDomains to isl++ [NFC] llvm-svn: 310230	2017-08-06 21:42:25 +00:00
Tobias Grosser	04ec2eb8c9	[ScopInfo] Move Scop::getInvalidContext to isl++ [NFC] llvm-svn: 310229	2017-08-06 21:42:16 +00:00
Tobias Grosser	e127033f98	[ScopInfo] Move Scop::getAssumedContext to isl++ [NFC] llvm-svn: 310228	2017-08-06 21:42:09 +00:00
Simon Pilgrim	7c53c0f586	[SLPVectorizer][X86] Cleanup test case. NFCI Remove excess attributes/metadata llvm-svn: 310227	2017-08-06 20:50:19 +00:00
Erik Pilkington	39dc8800c1	[demangler] Fix another oss-fuzz bug llvm-svn: 310226	2017-08-06 20:46:33 +00:00
Tobias Grosser	232fdad4f2	[ScopInfo] Move Scop::addNonEmptyDomainConstraints to isl++ [NFC] llvm-svn: 310225	2017-08-06 20:19:26 +00:00
Tobias Grosser	b65ccc4302	[ScopInfo] Translate Scop::getParamSpace to isl++ [NFC] llvm-svn: 310224	2017-08-06 20:11:59 +00:00
Martin Storsjo	c9263f4e49	[llvm-dlltool] Map the "arm64" machine type Differential Revision: https://reviews.llvm.org/D36365 llvm-svn: 310223	2017-08-06 19:58:13 +00:00

1 2 3 4 5 ...

269056 Commits All Branches Search

269056 Commits

All Branches