llvm-project

Commit Graph

Author	SHA1	Message	Date
Zvi Rackover	b26530cd69	[Doc][LangRef] Fix typo-ish error in description of Masked Gather Summary: Fix the example of equivalent expansion for when mask is all ones. Reviewers: delena Reviewed By: delena Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29179 llvm-svn: 293206	2017-01-26 20:29:15 +00:00
Sanjay Patel	0ca3f64c4d	[InstCombine] add tests for shift-shift folds; NFC llvm-svn: 293205	2017-01-26 20:10:55 +00:00
Balaram Makam	b73d2962ba	[AArch64] Refine Kryo Machine Model Summary: Refine floating point SQRT and DIV with accurate latency information. Reviewers: mcrosier Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D29191 llvm-svn: 293204	2017-01-26 20:10:41 +00:00
Kyle Butt	c4614b3e76	[IfConversion] Use reverse_iterator to simplify. NFC This simplifies skipping debug instructions and shrinking ranges. llvm-svn: 293202	2017-01-26 20:02:47 +00:00
Kuba Mracek	6393aa3a62	[tsan] Fix os_id of main thread Currently, os_id of the main thread contains the PID instead of a thread ID. Let's fix this. Differential Revision: https://reviews.llvm.org/D29106 llvm-svn: 293201	2017-01-26 19:20:30 +00:00
Sean Fertile	3c8c385a77	[PPC] cleanup of mayLoad/mayStore flags and memory operands. 1) Explicitly sets mayLoad/mayStore property in the tablegen files on load/store instructions. 2) Updated the flags on a number of intrinsics indicating that they write memory. 3) Added SDNPMemOperand flags for some target dependent SDNodes so that they propagate their memory operand Review: https://reviews.llvm.org/D28818 llvm-svn: 293200	2017-01-26 18:59:15 +00:00
Akira Hatanaka	c05c42567e	Turn on -Wblock-capture-autoreleasing by default. Turning on the warning by default helps the users as it's a common mistake to capture out-parameters in a block without ensuring the object assigned doesn't get released. rdar://problem/30200058 llvm-svn: 293199	2017-01-26 18:51:10 +00:00
Daniel Berlin	66e3a3d0ac	NewGVN: Fix output of pr31578 testcase now that we mark unreachable blocks as unreachable llvm-svn: 293198	2017-01-26 18:49:03 +00:00
Dimitry Andric	83dca5c3d1	Disable thread safety analysis for some functions in __thread_support Many thread-related libc++ test cases fail on FreeBSD, due to the following -Werror warnings: In file included from test/std/thread/thread.threads/thread.thread.this/sleep_until.pass.cpp:17: In file included from include/thread:97: In file included from include/__mutex_base:17: include/__threading_support:222:1: error: mutex '__m' is still held at the end of function [-Werror,-Wthread-safety-analysis] } ^ include/__threading_support:221:10: note: mutex acquired here return pthread_mutex_lock(__m); ^ include/__threading_support:231:10: error: releasing mutex '__m' that was not held [-Werror,-Wthread-safety-analysis] return pthread_mutex_unlock(__m); ^ include/__threading_support:242:1: error: mutex '__m' is still held at the end of function [-Werror,-Wthread-safety-analysis] } ^ include/__threading_support:241:10: note: mutex acquired here return pthread_mutex_lock(__m); ^ include/__threading_support:251:10: error: releasing mutex '__m' that was not held [-Werror,-Wthread-safety-analysis] return pthread_mutex_unlock(__m); ^ include/__threading_support:272:10: error: calling function 'pthread_cond_wait' requires holding mutex '__m' exclusively [-Werror,-Wthread-safety-analysis] return pthread_cond_wait(__cv, __m); ^ include/__threading_support:278:10: error: calling function 'pthread_cond_timedwait' requires holding mutex '__m' exclusively [-Werror,-Wthread-safety-analysis] return pthread_cond_timedwait(__cv, __m, __ts); ^ 6 errors generated. This is because on FreeBSD, the pthread functions have lock annotations. Since the functions in __thread_support are internal to libc++ only, add no_thread_safety_analysis attributes to suppress these warnings. Reviewers: mclow.lists, EricWF, delesley, aaron.ballman Reviewed By: aaron.ballman Subscribers: ed, aaron.ballman, joerg, emaste, cfe-commits Differential Revision: https://reviews.llvm.org/D28520 llvm-svn: 293197	2017-01-26 18:37:18 +00:00
Daniel Berlin	2b83492eee	NewGVN: Make unreachable blocks be marked with unreachable llvm-svn: 293196	2017-01-26 18:30:29 +00:00
Oleg Ranevskyy	41abca4355	[Compiler-rt] Broken compiler-rt CMake configuring on Windows Summary: Hi Michal, Would you be able to review this simple fix, please? Since r291504 compiler-rt uses `llvm-config --cmakedir` to get the path to the LLVM CMake modules. On Windows this option returns Windows style path with backslashes. CMake treats backslashes as beginning of an escaped character and thus fails to append the path to `CMAKE_MODULE_PATH`. Reviewers: compnerd, mgorny Reviewed By: mgorny Subscribers: compnerd, llvm-commits, dberris Differential Revision: https://reviews.llvm.org/D28908 llvm-svn: 293195	2017-01-26 18:16:02 +00:00
Akira Hatanaka	5d55a6c69d	[Sema][ObjC] Make sure -Wblock-capture-autoreleasing issues a warning even in the presence of nullability qualifiers. This commit fixes bugs in r285031 where -Wblock-capture-autoreleasing wouldn't issue warnings when the function parameters were annotated with nullability qualifiers. Specifically, look through the sugar and see if there is an AttributedType of kind attr_objc_ownership to determine whether __autoreleasing was explicitly specified or implicitly added by the compiler. rdar://problem/30193488 llvm-svn: 293194	2017-01-26 18:13:06 +00:00
Stanislav Mekhanoshin	61da067393	Use TargetMachine adjustPassManager hook Differential Revision: https://reviews.llvm.org/D28340 llvm-svn: 293190	2017-01-26 16:49:21 +00:00
Stanislav Mekhanoshin	81598117b6	Replace addEarlyAsPossiblePasses callback with adjustPassManager This change introduces adjustPassManager target callback giving a target an opportunity to tweak PassManagerBuilder before pass managers are populated. This generalizes and replaces addEarlyAsPossiblePasses target callback. In particular that can be used to add custom passes to extension points other than EP_EarlyAsPossible. Differential Revision: https://reviews.llvm.org/D28336 llvm-svn: 293189	2017-01-26 16:49:08 +00:00
Nirav Dave	d32a421f75	Revert "In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled." This reverts commit r293184 which is failing in LTO builds llvm-svn: 293188	2017-01-26 16:46:13 +00:00
Eric Liu	9122916ee5	[change-namespace] correctly shorten namespace when references have leading '::' Reviewers: bkramer Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D29182 llvm-svn: 293187	2017-01-26 16:31:32 +00:00
Serge Rogatch	c4540b371d	[XRay][Arm32] Reduce the portion of the stub and implement more staging for tail calls - in compiler-rt Summary: This patch provides more staging for tail calls in XRay Arm32 . When the logging part of XRay is ready for tail calls, its support in the core part of XRay Arm32 may be as easy as changing the number passed to the handler from 1 to 2. Coupled patch: - https://reviews.llvm.org/D28673 Reviewers: dberris, rengolin Reviewed By: dberris, rengolin Subscribers: llvm-commits, iid_iunknown, aemerson Differential Revision: https://reviews.llvm.org/D28674 llvm-svn: 293186	2017-01-26 16:18:13 +00:00
Serge Rogatch	e09ba748cf	[XRay][Arm32] Reduce the portion of the stub and implement more staging for tail calls - in LLVM Summary: This patch provides more staging for tail calls in XRay Arm32 . When the logging part of XRay is ready for tail calls, its support in the core part of XRay Arm32 may be as easy as changing the number passed to the handler from 1 to 2. Coupled patch: - https://reviews.llvm.org/D28674 Reviewers: dberris, rengolin Reviewed By: dberris Subscribers: llvm-commits, iid_iunknown, aemerson, rengolin, dberris Differential Revision: https://reviews.llvm.org/D28673 llvm-svn: 293185	2017-01-26 16:17:03 +00:00
Nirav Dave	de6516c466	In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled. * Simplify Consecutive Merge Store Candidate Search Now that address aliasing is much less conservative, push through simplified store merging search and chain alias analysis which only checks for parallel stores through the chain subgraph. This is cleaner as the separation of non-interfering loads/stores from the store-merging logic. When merging stores search up the chain through a single load, and finds all possible stores by looking down from through a load and a TokenFactor to all stores visited. This improves the quality of the output SelectionDAG and the output Codegen (save perhaps for some ARM cases where we correctly constructs wider loads, but then promotes them to float operations which appear but requires more expensive constant generation). Some minor peephole optimizations to deal with improved SubDAG shapes (listed below) Additional Minor Changes: 1. Finishes removing unused AliasLoad code 2. Unifies the chain aggregation in the merged stores across code paths 3. Re-add the Store node to the worklist after calling SimplifyDemandedBits. 4. Increase GatherAllAliasesMaxDepth from 6 to 18. That number is arbitrary, but seems sufficient to not cause regressions in tests. 5. Remove Chain dependencies of Memory operations on CopyfromReg nodes as these are captured by data dependence 6. Forward loads-store values through tokenfactors containing {CopyToReg,CopyFromReg} Values. 7. Peephole to convert buildvector of extract_vector_elt to extract_subvector if possible (see CodeGen/AArch64/store-merge.ll) 8. Store merging for the ARM target is restricted to 32-bit as some in some contexts invalid 64-bit operations are being generated. This can be removed once appropriate checks are added. This finishes the change Matt Arsenault started in r246307 and jyknight's original patch. Many tests required some changes as memory operations are now reorderable, improving load-store forwarding. One test in particular is worth noting: CodeGen/PowerPC/ppc64-align-long-double.ll - Improved load-store forwarding converts a load-store pair into a parallel store and a memory-realized bitcast of the same value. However, because we lose the sharing of the explicit and implicit store values we must create another local store. A similar transformation happens before SelectionDAG as well. Reviewers: arsenm, hfinkel, tstellarAMD, jyknight, nhaehnle llvm-svn: 293184	2017-01-26 16:02:24 +00:00
Arpith Chacko Jacob	cca61a3a74	[OpenMP] Codegen support for 'target teams' on the NVPTX device. This is a simple patch to teach OpenMP codegen to emit the construct in Generic mode. Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29143 llvm-svn: 293183	2017-01-26 15:43:27 +00:00
Eric Liu	bc715504da	[change-namespace] add leading '::' to references in new namespace when name conflict is possible. Summary: For example, when we change 'na' to "nb::nc", we need to add leading '::' to references "::nc::X" in the changed namespace. Reviewers: bkramer Reviewed By: bkramer Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D29176 llvm-svn: 293182	2017-01-26 15:08:44 +00:00
Rafael Espindola	82149a1aa9	Use shouldAssumeDSOLocal in classifyGlobalReference. And teach shouldAssumeDSOLocal that ppc has no copy relocations. The resulting code handle a few more case than before. For example, it knows that a weak symbol can be resolved to another .o file, but it will still be in the main executable. llvm-svn: 293180	2017-01-26 15:02:31 +00:00
Marshall Clow	a98b5fd999	Fixed a couple of invalid statuses for 2665 and 2758 llvm-svn: 293179	2017-01-26 14:36:14 +00:00
Simon Pilgrim	027bb453d9	[X86][SSE] Add support for combining ANDNP byte masks with target shuffles llvm-svn: 293178	2017-01-26 14:31:12 +00:00
Rafael Espindola	0b034d6f25	Fix -r when the input has a relocation with no symbol. Should fix a few freebsd packages with dtrace. llvm-svn: 293177	2017-01-26 14:09:18 +00:00
Daniil Fukalov	b09dac59fc	[SCEV] Introduce add operation inlining limit Inlining in getAddExpr() can cause abnormal computational time in some cases. New parameter -scev-addops-inline-threshold is intruduced with default value 500. Reviewers: sanjoy Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D28812 llvm-svn: 293176	2017-01-26 13:33:17 +00:00
Simon Pilgrim	3057fd53f9	[X86][SSE] Pull out target shuffle resolve code into helper. NFCI. Pulled out code that removed unused inputs from a target shuffle mask into a helper function to allow it to be reused in a future commit. llvm-svn: 293175	2017-01-26 13:06:02 +00:00
Daniel Sanders	f69fe68628	Remove a '#if 0' that wasn't intended for commit in r293173. The '#if 0' contained the code I had intended to use but clang rejects it (possibly incorrectly). llvm-svn: 293174	2017-01-26 12:10:43 +00:00
Daniel Sanders	b222431144	Attempt to fix windows buildbots after r293172. llvm-svn: 293173	2017-01-26 11:23:49 +00:00
Daniel Sanders	dc662ff047	[globalisel] Re-factor ISel matchers into a hierarchy. NFC Summary: This should make it possible to easily add everything needed to import all the existing SelectionDAG rules. It should also serve the likely kinds of GlobalISel rules (some of which are not currently representable in SelectionDAG) once we've nailed down the tablegen definition for that. The hierarchy is as follows: MatcherRule - A matching rule. Currently used to emit C++ ISel code but will \| also be used to emit test cases and tablegen definitions in the \| near future. \|- Instruction(s) - Represents the instruction to be matched. \|- Instruction Predicate(s) - Test the opcode, arithmetic flags, etc. of an \| instruction. \- Operand(s) - Represents a particular operand of the instruction. In the \| future, there may be subclasses to test the same predicates \| on multiple operands (including for variadic instructions). \ Operand Predicate(s) - Test the type, register bank, etc. of an operand. This is where the ComplexPattern equivalent will be represented. It's also nested-instruction matching will live as a predicate that follows the DefUse chain to the Def and tests a MatcherRule from that position. Support for multiple instruction matchers in a rule has been retained from the existing code but has been adjusted to assert when it is used. Previously it would silently drop all but the first instruction matcher. The tablegen-erated file is not functionally changed but has more parentheses and no longer attempts to format the if-statements since keeping track of the indentation is tricky in the presence of the matcher hierarchy. It would be nice to have CMakes tablegen() run the output through clang-format (when available) so we don't have to complicate TableGen with pretty-printing. It's also worth mentioning that this hierarchy will also be able to emit TableGen definitions and test cases in the near future. This is the reason for favouring explicit emit*() calls rather than the << operator. Reviewers: aditya_nandakumar, rovka, t.p.northover, qcolombet, ab Reviewed By: ab Subscribers: igorb, dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D28942 llvm-svn: 293172	2017-01-26 11:10:14 +00:00
Valery Pykhtin	75d1de903f	[AMDGPU] Fix typo in GCNSchedStrategy Differential revision: https://reviews.llvm.org/D28980 llvm-svn: 293171	2017-01-26 10:51:47 +00:00
Simon Dardis	5b67a4f75f	Revert "[mips] N64 static relocation model support" This reverts commit r293164. There are multiple tests failing. llvm-svn: 293170	2017-01-26 10:46:07 +00:00
Tobias Grosser	77363965c0	[ScopDetectionDiagnostic] Add meaningfull enduser message for regions with entry block Before this change the user only saw "Unspecified Error", when a region contained the entry block. Now we report: "Scop contains function entry (not yet supported)." llvm-svn: 293169	2017-01-26 10:41:37 +00:00
Chandler Carruth	6f4ed077d0	[LV] Fix an issue where forming LCSSA in the place that we did would change the set of uniform instructions in the loop causing an assert failure. The problem is that the legalization checking also builds data structures mapping various facts about the loop body. The immediate cause was the set of uniform instructions. If these then change when LCSSA is formed, the data structures would already have been built and become stale. The included test case triggered an assert in loop vectorize that was reduced out of the new PM's pipeline. The solution is to form LCSSA early enough that no information is cached across the changes made. The only really obvious position is outside of the main logic to vectorize the loop. This also has the advantage of removing one case where forming LCSSA could mutate the loop but we wouldn't track that as a "Changed" state. If it is significantly advantageous to do some legalization checking prior to this, we can do a more careful positioning but it seemed best to just back off to a safe position first. llvm-svn: 293168	2017-01-26 10:41:09 +00:00
Asiri Rathnayake	e246350467	Fix chromium build (libcxx) Remove the reference to pthread_mach_thread_np() in libcxx headers. llvm-svn: 293167	2017-01-26 10:40:17 +00:00
Asiri Rathnayake	085b612c31	Fix chromium build (libcxxabi) Pull the dependency on pthread_mach_thread_np() back into libcxxabi. llvm-svn: 293166	2017-01-26 10:38:03 +00:00
Tobias Grosser	64bbb1357f	ScopDetectionDiagnostics: Also emit diagnostics in case no debug info is available In this case, we just use the start of the scop as the debug location. llvm-svn: 293165	2017-01-26 10:30:55 +00:00
Simon Dardis	09e65efd09	[mips] N64 static relocation model support This patch makes one change to GOT handling and two changes to N64's relocation model handling. Furthermore, the jumptable encodings have been corrected for static N64. Big GOT handling is now done via a new SDNode MipsGotHi - this node is unconditionally lowered to an lui instruction. The first change to N64's relocation handling is the lifting of the restriction that N64 always uses PIC. Now it is possible to target static environments. The second change adds support for 64 bit symbols and enables them by default. Previously N64 had patterns for sym32 mode only. In this mode all symbols are assumed to have 32 bit addresses. sym32 mode support is selectable with attribute 'sym32'. A follow on patch for clang will add the necessary frontend parameter. This partially resolves PR/23485. Thanks to Brooks Davis for reporting the issue! Reviewers: dsanders, seanbruno, zoran.jovanovic, vkalintiris Differential Revision: https://reviews.llvm.org/D23652 llvm-svn: 293164	2017-01-26 10:19:02 +00:00
Diana Picus	278c722e6d	[ARM] GlobalISel: Load i1, i8 and i16 args from stack Add support for loading i1, i8 and i16 arguments from the stack, with or without the ABI extension flags. When the ABI extension flags are present, we load a 4-byte value, otherwise we preserve the size of the load and let the instruction selector replace it with a LDRB/LDRH. This generates the same thing as DAGISel. Differential Revision: https://reviews.llvm.org/D27803 llvm-svn: 293163	2017-01-26 09:20:47 +00:00
Alexey Bataev	7a7510ea97	[SLP] Add one more reduction operation for extra argument test to make it vectorizable. llvm-svn: 293162	2017-01-26 09:18:41 +00:00
Sean Callanan	1912d9633f	Removed an unneccesary #if now that debugserver-mini links Foundation. llvm-svn: 293161	2017-01-26 08:51:32 +00:00
Chandler Carruth	41421df02b	[PM] Use PoisoningVH correctly when merely deleting entries in a map with it. This code was dereferencing the PoisoningVH which isn't allowed once it is poisoned. But the code itself really doesn't need to access the pointer, it is just doing the safe stuff of clearing out data structures keyed on the pointer value. Change the code to use iterators to erase directly from a DenseMap. This is also substantially more efficient as it avoids lots of hashing and lookups to do the erasure. DenseMap supports iterating behind the iteration which is fairly easy to implement. Sadly, I don't have a test case here. I'm not even close and I don't know that I ever will be. The issue is that several of the tricky aspects of fixing this only show up when you cause the stack's SmallVector to be in EXACTLY the right location. I only ever got a reproduction for those with Clang, and only with exactly the right command line flags. Any adjustment, even to seemingly unrelated flags, would make partial and half-way solutions magically start to "work". In good news, all of this was caught with the LLVM test suite. Also, there is no specific code here that is untested, just that the old pattern of code won't immediately fail on any test case I've managed to contrive. llvm-svn: 293160	2017-01-26 08:31:54 +00:00
NAKAMURA Takumi	949d54ebd9	Chapter3/KaleidoscopeJIT.h: Fix a warning. [-Wunused-lambda-capture] "this", aka class members, is not referred in the body. llvm-svn: 293159	2017-01-26 08:31:14 +00:00
Craig Topper	05078de912	[TargetTransformInfo] Add override keywords to supporess -Winconsistent-missing-override. llvm-svn: 293158	2017-01-26 08:04:27 +00:00
Craig Topper	bad53cce26	[AVX-512] Move the combine that runs combineBitcastForMaskedOp to the last DAG combine phase where I had originally meant to put it. llvm-svn: 293157	2017-01-26 07:17:58 +00:00
Craig Topper	f0bab7b739	[X86] When bitcasting INSERT_SUBVECTOR/EXTRACT_SUBVECTOR to match masked operations, use the correct type for the immediate operand. llvm-svn: 293156	2017-01-26 07:17:53 +00:00
Jonas Paulsson	8e2f948ef0	[TargetTransformInfo] Refactor and improve getScalarizationOverhead() Refactoring to remove duplications of this method. New method getOperandsScalarizationOverhead() that looks at the present unique operands and add extract costs for them. Old behaviour was to just add extract costs for one operand of the type always, which still happens in getArithmeticInstrCost() if no operands are provided by the caller. This is a good start of improving on this, but there are more places that can be improved by using getOperandsScalarizationOverhead(). Review: Hal Finkel https://reviews.llvm.org/D29017 llvm-svn: 293155	2017-01-26 07:03:25 +00:00
Marshall Clow	3a3c09c5dd	Use the new __has_feature(cxx_constexpr_string_builtins) for detection of the C-string intrinsics for constexpr support in std::char_traits. Thanks to Richard for the intrisic support. llvm-svn: 293154	2017-01-26 06:58:29 +00:00
Alexey Bataev	7046a852b3	[SLP] Fixed test for extra arguments in horizontal reductions. llvm-svn: 293153	2017-01-26 06:19:52 +00:00
Craig Topper	001aad7da7	[DAGCombiner] Fold extract_subvector of undef to undef. Fold away inserting undef subvectors. llvm-svn: 293152	2017-01-26 05:38:46 +00:00

1 2 3 4 5 ...

253037 Commits All Branches Search

253037 Commits

All Branches