llvm-project

Commit Graph

Author	SHA1	Message	Date
Austin Kerbow	a4f35ab232	[AMDGPU] Fix mai hazard VALU to LD/ST Fixes: SWDEV-251863 Differential Revision: https://reviews.llvm.org/D89079	2020-10-08 17:13:02 -07:00
Richard Smith	d1751d14a6	PR47175: Ensure type-dependent function-style casts have dependent types. Previously, a type-dependent cast to a deduced class template specialization type would end up with a non-dependent class template specialization type, leading to confusion downstream.	2020-10-08 17:00:22 -07:00
Yuanfang Chen	caedf7937c	[NFC] Fix a comment in MachinePassManager.h Fix "warning: '\returns' command used in a comment that is not attached to a function or method declaration [-Wdocumentation] 1 warning generated."	2020-10-08 15:38:57 -07:00
Fangrui Song	e36a41b3cf	[X86] Fix some clang-tidy bugprone-argument-comment issues	2020-10-08 15:26:50 -07:00
Jim Ingham	a68ffb19d3	Change the default handling of SIGCONT to nosuppress/nostop/notify Except for the few people actually debugging shells, stopping on a SIGCONT doesn't add any value. And for people trying to run tests under the debugger, stopping here is actively inconvenient. So this patch switches the default behavior to not stop. Differential Revision: https://reviews.llvm.org/D89019	2020-10-08 15:24:19 -07:00
Mircea Trofin	4cfc4025cc	[NFC][MC] MCRegister API typing. Mostly LiveIntervals, with their effects (users). Differential Revision: https://reviews.llvm.org/D89018	2020-10-08 15:08:34 -07:00
Thomas Raoux	19119dda16	[mlir][vector] Add integration test for vector distribute transformation Differential Revision: https://reviews.llvm.org/D89062	2020-10-08 14:45:56 -07:00
Thomas Raoux	cf402a1987	[mlir][vector] Add unit test for vector distribute by block When distributing a vector larger than the given multiplicity, we can distribute it by block where each id gets a chunk of consecutive element along the dimension distributed. This adds a test for this case and adds extra checks to make sure we don't distribute for cases not multiple of multiplicity. Differential Revision: https://reviews.llvm.org/D89061	2020-10-08 14:44:03 -07:00
Arthur Eubanks	afff74e5c2	[HWAsan][NewPM] Handle hwasan like other sanitizers Move it as an EP callback (-O[123]) or in addSanitizersAtO0. This makes it not run in ThinLTO pre-link (like the other sanitizers), so don't check LTO runs in hwasan-new-pm.c. Changing its position also seems to change the generated IR. I think we just need to make sure the pass runs. Reviewed By: leonardchan Differential Revision: https://reviews.llvm.org/D88936	2020-10-08 14:43:21 -07:00
Alexandre Ganea	97e7fbb343	[LLDB] More Windows non-English locales fixes This is a follow-up for https://reviews.llvm.org/D88975	2020-10-08 17:22:42 -04:00
Simon Pilgrim	d9f064dc0b	[InstCombine] visitTrunc - trunc(shl(X, C)) --> shl(trunc(X),trunc(C)) vector support Annoyingly vectors aren't supported by shouldChangeType(), but we have precedents for always performing this on vector types (e.g. narrowBinOp). Differential Revision: https://reviews.llvm.org/D89067	2020-10-08 22:07:51 +01:00
Quentin Colombet	fd8275e04a	[GlobalISel] Add missing pass dependencies for IRTranslator The IRTranslator depends on the branch probability info pass when the optimization level is different than None and it depends all the time on the StackProtector pass. We have to explicitly call out pass dependencies otherwise the pass manager may not be able to schedule the IRTranslator. Before this patch, we were lucky because previous passes depend on the branch probability info pass (like the Global Variable Optimization) and the stack protector pass is initialized in initializeCodeGen. However, if the target has a custom pipeline without any passes like Global Variable Optimization, the pipeline creation will fail, at least because of the branch probability info pass dependency (it is unlikely that initializeCodeGen is not called). This patch adds the missing dependencies to the IRTranslator. Differential Revision: https://reviews.llvm.org/D89063	2020-10-08 13:57:21 -07:00
Paula Toth	f60686f35c	[libc] Update buildbot worker version to 2.8.4. Tested locally by connecting to LLVM master. (: http://lab.llvm.org:8011/#/builders/78/builds/1 Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D89069	2020-10-08 13:43:53 -07:00
Simon Pilgrim	e1b5fcb942	[InstCombine] Add additional trunc(shl(x,c)) -> shl(trunc(x),trunc(c)) vector tests	2020-10-08 21:11:48 +01:00
Sanjay Patel	f688ae7a0e	[InstCombine] allow vector splats for add+xor with low-mask This can be allowed with undef elements too, but that can be another step: https://alive2.llvm.org/ce/z/hnC4Z-	2020-10-08 15:53:38 -04:00
Mehdi Amini	69efcd03bd	Fix typo `DenseElementAttr`-> `DenseElementsAttr` in some comments (NFC)	2020-10-08 19:40:48 +00:00
Simon Pilgrim	6aa10ae5bf	[Transforms] visitCmpBlock - don't dereference a dyn_cast<>. NFCI. Use cast<> as we immediately dereference the pointer afterwards - cast<> will assert if we fail. Prevents clang static analyzer warning that we could deference a null pointer.	2020-10-08 20:18:32 +01:00
Sanjay Patel	5ac89add1e	[InstCombine] remove unnecessary one-use check from add-xor transform Pre-conditions seem to be optimal, but we don't need a use check because we are only replacing an add with a sub. https://rise4fun.com/Alive/hzN Pre: (~C1 \| C2 == -1) && isPowerOf2(C2+1) %m = and i8 %x, C1 %f = xor i8 %m, C2 %r = add i8 %f, C3 => %r = sub i8 C2 + C3, %m	2020-10-08 15:08:51 -04:00
Sanjay Patel	a52159a1c3	[InstCombine] add tests for add-xor; NFC	2020-10-08 15:08:51 -04:00
Simon Pilgrim	8568113101	Fix Wparentheses warning. NFCI. Wrap the containErrors() calls together - assert we have any containErrors cases in the conditional operator.	2020-10-08 20:02:19 +01:00
Simon Pilgrim	0716805c02	[SLP] optimizeGatherSequence - assert every Instruction in the worklist is non-null. Fixes clang static analyzer warning.	2020-10-08 20:02:18 +01:00
Heejin Ahn	750b3ddd80	[WebAssembly] Handle indirect uses of longjmp In LowerEmscriptenEHSjLj, `longjmp` used to be replaced with `emscripten_longjmp_jmpbuf(jmp_buf, i32)`, which will eventually be lowered to `emscripten_longjmp(i32, i32)`. The reason we used two different names was because they had different signatures in the IR pass. D88697 fixed this by only using `emscripten_longjmp(i32, i32)` and adding a `ptrtoint` cast to its first argument, so ``` longjmp(buf, 0) ``` becomes ``` emscripten_longjmp((i32)buf, 0) ``` But this assumed all uses of `longjmp` was a direct call to it, which was not the case. This patch handles indirect uses of `longjmp` by replacing ``` longjmp ``` with ``` (i32()(jmp_buf*, i32))emscripten_longjmp ``` Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D89032	2020-10-08 11:37:19 -07:00
Quentin Colombet	d421e0484a	[KnownBits] Add a sextOrTrunc method We already offer zextOrTrunc and it seems natural to offer the same capability for sign extension. This patch is a preparatory addition useful for future computeKnownBits developments. Differential Revision: https://reviews.llvm.org/D88937	2020-10-08 11:33:06 -07:00
Quentin Colombet	9431f8ad2e	[KnownBits] Add a computeForMul method This patch refactors the logic in ValueTracking.cpp so that computeKnownBitsForMul now uses a helper function from KnownBits. NFC Differential Revision: https://reviews.llvm.org/D88935	2020-10-08 11:33:06 -07:00
Quentin Colombet	f1f31eb2da	[unittests] Add a few tests for computeKnownBits with ranges These tests make sure that the range information is properly understood during computeKnownBits analysis. NFC Differential Revision: https://reviews.llvm.org/D88934	2020-10-08 11:33:06 -07:00
Louis Dionne	504bc07d1a	[runtimes] Use int main(int, char**) consistently in tests This is needed when running the tests in Freestanding mode, where main() isn't treated specially. In Freestanding, main() doesn't get mangled as extern "C", so whatever runtime we're using fails to find the entry point. One way to solve this problem is to define a symbol alias from __Z4mainiPPc to _main, however this requires all definitions of main() to have the same mangling. Hence this commit.	2020-10-08 14:28:13 -04:00
Rahman Lavaee	2b0c5d76a6	Introduce and use a new section type for the bb_addr_map section. This patch lets the bb_addr_map (renamed to __llvm_bb_addr_map) section use a special section type (SHT_LLVM_BB_ADDR_MAP) instead of SHT_PROGBITS. This would help parsers, dumpers and other tools to use the sh_type ELF field to identify this section rather than relying on string comparison on the section name. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D88199	2020-10-08 11:13:19 -07:00
Simon Pilgrim	8f0658ae67	[Transforms] CodeExtractor::verifyAssumptionCache - don't dereference a dyn_cast<>. NFCI. Use cast<> as we immediately dereference the pointer afterwards - cast<> will assert if we fail. Prevents clang static analyzer warning that we could deference a null pointer.	2020-10-08 19:04:30 +01:00
Simon Pilgrim	119a143699	[Analysis] ScalarEvolution::getUMinFromMismatchedTypes - assert we've found the max type. NFCI. Found by clang static analyzer.	2020-10-08 19:04:29 +01:00
Simon Pilgrim	df9ae806bb	[AVR] Fix null dereference warning. NFCI. We were checking if the ConstantSDNode was null but then immediately dereferencing it afterward - fold these both into a single check. Use the APInt::ult() helper as well. Found by clang static analyzer.	2020-10-08 19:04:29 +01:00
Joseph Huber	3cc1f1fc1d	[OpenMP] Replace OpenMP RTL Functions With OMPIRBuilder and OMPKinds.def Summary: Replace the OpenMP Runtime Library functions used in CGOpenMPRuntimeGPU for OpenMP device code generation with ones in OMPKinds.def and use OMPIRBuilder for generating runtime calls. This allows us to consolidate more OpenMP code generation into the OMPIRBuilder. Future additions to the GPU runtime functions should now go in OMPKinds.def Reviewers: jdoerfert Subscribers: aaron.ballman cfe-commits guansong llvm-commits sstefan1 yaxunl Tags: #OpenMP #LLVM #clang Differential Revision: https://reviews.llvm.org/D88430	2020-10-08 14:00:22 -04:00
Dan Liew	295d4e420f	[lit] Try to remove the flakeyness of `shtest-timeout.py` and `googletest-timeout.py`. The tests previously relied on the `short.py` and `FirstTest.subTestA` script being executed on a machine within a short time window (1 or 2 seconds). While this "seems to work" it can fail on resource constrained machines. We could bump the timeout a little bit (bumping it too much would mean the test would take a long time to execute) but it wouldn't really solve the problem of the test being prone to failures. This patch tries to remove this flakeyness by separating testing into two separate parts: 1. Testing if a test can hit a timeout. 2. Testing if a test can run to completion in the presence of a timeout. This way we can give (1.) a really short timeout (to make the test run as fast as possible) and (2.) a really long timeout. This means for (2.) we are no longer trying to rely on the "short" test executing within some short time window. Instead the window is now 3600 seconds which should be long enough even for a heavily resource constrained machine to execute the "short" test. Thanks to Julian Lettner for suggesting this approach. This superseeds my original approach in https://reviews.llvm.org/D88807. This patch also changes the command line override test to run the quick test rather than the slow one to make the test run faster. Differential Revision: https://reviews.llvm.org/D89020	2020-10-08 10:46:18 -07:00
Teresa Johnson	f775cb8994	[sanitizer] Fix Fuchsia bot failure Fixes bot failure from 4d5b1de40eccc7ffcfb859cef407e5f30bee77f8: https://luci-milo.appspot.com/p/fuchsia/builders/ci/clang-linux-x64/b8867057367989385504 Updates the version of RenderFrame used by Fuchsia and adds a version of the new RenderNeedsSymbolization.	2020-10-08 10:44:40 -07:00
David Green	a15bd0bfc2	[AIX] Add REQUIRES for powerpc test. NFC	2020-10-08 18:40:09 +01:00
Amara Emerson	283b4d6ba3	[GlobalISel] Add G_VECREDUCE_* opcodes for vector reductions. These mirror the IR and SelectionDAG intrinsics & nodes. Opcodes added: G_VECREDUCE_SEQ_FADD G_VECREDUCE_SEQ_FMUL G_VECREDUCE_FADD G_VECREDUCE_FMUL G_VECREDUCE_FMAX G_VECREDUCE_FMIN G_VECREDUCE_ADD G_VECREDUCE_MUL G_VECREDUCE_AND G_VECREDUCE_OR G_VECREDUCE_XOR G_VECREDUCE_SMAX G_VECREDUCE_SMIN G_VECREDUCE_UMAX G_VECREDUCE_UMIN Differential Revision: https://reviews.llvm.org/D88750	2020-10-08 10:33:19 -07:00
Leonard Chan	64c0792946	[clang][feature] Add cxx_abi_relative_vtable feature This will be enabled if relative vtables is enabled. Differential revision: https://reviews.llvm.org/D85924	2020-10-08 10:30:54 -07:00
MaheshRavishankar	4a1682e931	[mlir][Linalg] Add some depedence query methods to LinalgDependenceGraph. The methods allow to check - if an operation has dependencies, - if there is a dependence from one operation to another. Differential Revision: https://reviews.llvm.org/D88993	2020-10-08 10:17:18 -07:00
peter klausler	3e86eda18c	[flang] Allow "name: value" in compiler directives Some legacy compiler directives use colons rather than equals signs. Differential revision: https://reviews.llvm.org/D89017	2020-10-08 10:01:37 -07:00
Pavel Labath	19d64138e6	[lldb] Fix "frame var" for large bitfields The problem here is in the "sliding" code in ValueObjectChild::UpdateValue. It modifies m_bitfield_bit_offset and m_value to ensure the bitfield value fits the window given by the underlying type. However, this is broken next time UpdateValue is called, because it updates the m_value value from the parent. However, the value cannot be slid again because the m_bitfield_bit_offset is already modified. It seems this can happen only under specific circumstances. One way to trigger is is to run an expression which can be interpreted (jitting it causes a new StackFrame and ValueObject variables to be created). I fix this bug by modifying m_byte_offset instead of m_scalar, and ensuring the changes are folded into m_scalar regardless of how many times UpdateValue is called. Differential Revision: https://reviews.llvm.org/D88992	2020-10-08 18:42:50 +02:00
Pavel Labath	d4a7c70751	[lldb] Add a cmake warning about the python/swig incompatibility Raise awareness of the fact that some versions of swig and python (and build types) just don't mix. One day this will be a reason to require swig>=4.0, but this version is too hot off the press right now.. Differential Revision: https://reviews.llvm.org/D88967	2020-10-08 18:42:50 +02:00
Petr Hosek	4424d2428a	[libcxx] Fix the thousands_sep test failure This fixes the issue introduced in `80ef4126b`.	2020-10-08 09:14:52 -07:00
Joseph Huber	d564409946	[OpenMP] Change CMake Configuration to Build for Highest CUDA Architecture by Default Summary: This patch changes the CMake files for Clang and Libomptarget to query the system for its supported CUDA architecture. This makes it much easier for the user to build optimal code without needing to set the flags manually. This relies on the now deprecated FindCUDA method in CMake, but full support for architecture detection is only availible in CMake >3.18 Reviewers: jdoerfert ye-luo Subscribers: cfe-commits guansong mgorny openmp-commits sstefan1 yaxunl Tags: #clang #OpenMP Differential Revision: https://reviews.llvm.org/D87946	2020-10-08 12:09:34 -04:00
Alexandre Ganea	79809f58b0	[LLDB] On Windows, fix tests This patch fixes a few issues seen when running `ninja check-lldb` in a Release build with VS2017: - Some binaries couldn't be found (such as lldb-vscode.exe), because .exe wasn't appended to the file name. - Many tests used to fail since our installed locale is in French - the OS error messages are not emitted in English. - Our codepage being Windows-1252, python failed to decode some error messages with accentuations. Differential Revision: https://reviews.llvm.org/D88975	2020-10-08 11:46:59 -04:00
Geoff Levner	b9225543e8	DeferredDiagnosticsEmitter crashes Patch VisitCXXDeleteExpr() in clang::UsedDeclVisitor to avoid it crashing when the expression's destroyed type is null. According to the comments in CXXDeleteExpr::getDestroyedType(), this can happen when the type to delete is a dependent type. Patch by Geoff Levner. Differential Revision: https://reviews.llvm.org/D88949	2020-10-08 11:42:21 -04:00
Fangrui Song	db1988f038	[ELF] Don't change binding to STB_WEAK for an undefined specified by -u Similar to D66992. In GNU ld, a -u specified symbol is a STB_DEFAULT undefined. It cannot be changed to STB_WEAK by a later STB_WEAK undefined in a regular object file. The behavior is consistent with our model because -u means "we need to fetch a lazy definition". It should not be altered just because there is also a STB_WEAK undefined. Note, our -u semantics are still different from GNU ld (https://github.com/ClangBuiltLinux/linux/issues/515): we don't force the specified symbol to appear in .symtab This is a deliberate decision. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D88945	2020-10-08 08:31:34 -07:00
Sanjay Patel	b57451b011	[InstCombine] allow vector splats for add+xor with signmask	2020-10-08 10:46:34 -04:00
Sanjay Patel	395963cbe6	[InstCombine] add vector splat tests for add of signmask; NFC	2020-10-08 10:46:33 -04:00
Jay Foad	7238faa4ae	[AMDGPU] Add patterns for mad/mac legacy f32 instructions Note that all subtargets up to GFX10.1 have v_mad_legacy_f32, but GFX8/9 lack v_mac_legacy_f32. GFX10.3 has no mad/mac f32 instructions at all. Differential Revision: https://reviews.llvm.org/D88890	2020-10-08 15:20:06 +01:00
Nico Weber	02e4800eeb	[gn build] (manually) port `9b58b0c06e` better	2020-10-08 10:13:54 -04:00
Nico Weber	c78fecba32	[gn build] (manually) port `9b58b0c06e`	2020-10-08 10:08:45 -04:00

1 2 3 4 5 ...

368449 Commits All Branches Search

368449 Commits

All Branches