llvm-project

Commit Graph

Author	SHA1	Message	Date
Bradley Smith	9f37980d45	[AArch64][SVE] Fold insert(zero, extract(X, 0), 0) -> X, when X is known to zero lanes 1-N Specifically, this allows us to rely on the lane-zeroing behaviour of SVE reduce instructions. Co-authored-by: Paul Walker <paul.walker@arm.com> Differential Revision: https://reviews.llvm.org/D101369	2021-05-04 15:05:05 +01:00
Simon Pilgrim	b04148f777	Local.cpp - Avoid DebugLoc copies - use const reference from getDebugLoc. NFCI.	2021-05-04 14:31:50 +01:00
Anastasia Stulova	64911eec75	[OpenCL] Allow pipe as a valid identifier prior to OpenCL 2.0. Pipe has not been a reserved keyword in the earlier OpenCL standards. However, we failed to allow its use as an identifier in the original commit. This issue is fixed now and testing is improved accordingly. Differential Revision: https://reviews.llvm.org/D101052	2021-05-04 14:30:42 +01:00
Jan Svoboda	00895831ab	[clang][cli][docs] Clarify marshalling infrastructure documentation	2021-05-04 15:16:32 +02:00
Simon Pilgrim	2bb41851a1	[Utils] recognizeBSwapOrBitReverseIdiom - support matching from funnel shift roots (PR40058) We were missing bitreverse matches in cases where InstCombine had seen a byte-level rotation at the end of a bitreverse sequence (replacing or() with fshl()), hindering the exhaustive bitreverse matching in CodeGenPrepare later on.	2021-05-04 13:46:45 +01:00
Simon Pilgrim	e0dd708f40	[CodeGenPrepare][X86] Add bitreverse detection tests Initially only test for XOP which is the only thing that supports scalar bitreverse - we can add vector tests later.	2021-05-04 13:29:19 +01:00
Simon Pilgrim	bd82cec016	[X86] Update PR20841 test description to make it clear we SHOULDN'T be folding EFLAGS with XADD	2021-05-04 13:29:19 +01:00
Jan Svoboda	d0e3a15e36	[clang][cli] NFC: Remove confusing `EmptyKPM` variable	2021-05-04 14:27:57 +02:00
Adrian Kuegel	93537fabce	[mlir] Add lowering from math.expm1 to LLVM. Differential Revision: https://reviews.llvm.org/D96776	2021-05-04 14:22:10 +02:00
David Stuttard	8f2948731e	[AMDGPU][AsmParser] Correct the order of optional operands to mimg Ordering of operands was incorrect meaning that a16 operand was treated as tfe Differential Revision: https://reviews.llvm.org/D101618 Change-Id: I3b15e71ef5ff625f19f52823414ab684d76aca33	2021-05-04 13:14:48 +01:00
Florian Hahn	d65e5f60f1	[IndVarSimplify] Add additional tests using isImpliedViaMerge.	2021-05-04 13:11:47 +01:00
Alexey Bataev	369cd2ae52	Revert "[SLP]Allow masked gathers only if allowed by target." This reverts commit `fd18547e07`. Need to add a check for the size of the vectorization tree to avoid some extra vectorization.	2021-05-04 04:53:22 -07:00
Dávid Bolvanský	80b897e21b	[InstCombine] ctpop(X) ^ ctpop(Y) & 1 --> ctpop(X^Y) & 1 (PR50094) Original pattern: (__builtin_parity(x) ^ __builtin_parity(y)) LLVM rewrites it as: (__builtin_popcount(x) ^ __builtin_popcount(y)) & 1 Optimized form: __builtin_popcount(X^Y) & 1 Alive proof: https://alive2.llvm.org/ce/z/-GdWFr Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D101802	2021-05-04 13:16:18 +02:00
Georgy Komarov	c2e9baf2e8	[clang-tidy] Fix cppcoreguidelines-pro-type-vararg false positives with __builtin_ms_va_list This commit fixes cppcoreguidelines-pro-type-vararg false positives on 'char *' variables. The incorrect warnings generated by clang-tidy can be illustrated with the following minimal example: ``` void foo(char* in) { char* tmp = in; } ``` The problem is that __builtin_ms_va_list is desugared as 'char *', which leads to false positives. Fixes bugzilla issue 48042. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D101259	2021-05-04 13:49:20 +03:00
Kirill Bobyrev	34593ae998	Introduce clangd-server-monitor tool Reviewed By: kadircet Differential Revision: https://reviews.llvm.org/D101516	2021-05-04 12:48:21 +02:00
Luis Penagos	8fa56f7ede	[clang-format] Prevent extraneous space insertion in bitshift operators This serves to augment the improvements made in https://reviews.llvm.org/D86581. It prevents clang-format from interpreting bitshift operators as template arguments in certain circumstances. This is an attempt at fixing https://bugs.llvm.org/show_bug.cgi?id=49868 Reviewed By: MyDeveloperDay, krasimir Differential Revision: https://reviews.llvm.org/D100778	2021-05-04 12:28:49 +02:00
Jessica Clarke	fb92cf9208	[RISCV] Pre-commit tests for D101342 These tests show inefficient sign extension for AMOs on RISC-V. The normal CodeGen tests use anyext return values, but if marked signext then we end up generating unnecessary sign extension instructions. This can be seen when compiling C that returns an i32 (signed or unsigned), where the calling convention results in a signext return value.	2021-05-04 11:12:43 +01:00
David Zarzycki	05146fe517	[llvm] Unbreak no-assertion testing	2021-05-04 06:05:38 -04:00
LLVM GN Syncbot	caa8431404	[gn build] Port `1db4dbba24`	2021-05-04 09:56:46 +00:00
Bjorn Pettersson	3655f0757f	Make dependency between certain analysis passes transitive LazyBlockFrequencyInfoPass, LazyBranchProbabilityInfoPass and LoopAccessLegacyAnalysis all cache pointers to their nested required analysis passes. One needs to use addRequiredTransitive to describe that the nested passes can't be freed until those analysis passes are no longer used themselves. There is still a bit of a mess considering the getLazyBPIAnalysisUsage and getLazyBFIAnalysisUsage functions. Those functions are used from Transform, CodeGen and Analysis passes alike. I figure it is OK to use addRequiredTransitive also when being used from Transform and CodeGen passes. On the other hand, I figure we must do it when used from other Analysis passes. So using addRequiredTransitive should be more correct here. An alternative solution would be to add a bool option in those functions to let the user tell whether it is an analysis pass or not. Since those lazy passes will be obsolete when the new PM has conquered the world, I figure we can leave it like this right now. The intention of the patch is to fix PR49950. It at least solves the problem for the reproducer in PR49950. However, that reproducer needs five passes in a specific order, so there are lots of various "solutions" that could avoid the crash without actually fixing the root cause. Differential Revision: https://reviews.llvm.org/D100958	2021-05-04 11:50:08 +02:00
Simon Moll	1db4dbba24	Recommit "[VP,Integer,#2] ExpandVectorPredication pass" This reverts the revert `02c5ba8679` Fix: Pass was registered as DUMMY_FUNCTION_PASS causing the newpm-pass functions to be doubly defined. Triggered in -DLLVM_ENABLE_MODULE=1 builds. Original commit: This patch implements expansion of llvm.vp.* intrinsics (https://llvm.org/docs/LangRef.html#vector-predication-intrinsics). VP expansion is required for targets that do not implement VP code generation. Since expansion is controllable with TTI, targets can switch on the VP intrinsics they do support in their backend offering a smooth transition strategy for VP code generation (VE, RISC-V V, ARM SVE, AVX512, ..). Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D78203	2021-05-04 11:47:52 +02:00
Kadir Cetinkaya	f800ac8309	[clangd] Fix hover crash on broken code Differential Revision: https://reviews.llvm.org/D101743	2021-05-04 11:42:31 +02:00
serge-sans-paille	b83b23275b	Introduce -Wreserved-identifier Warn when a declaration uses an identifier that doesn't obey the reserved identifier rule from C and/or C++. Differential Revision: https://reviews.llvm.org/D93095	2021-05-04 11:19:01 +02:00
Fraser Cormack	46fa214a6f	[RISCV] Lower splats of non-constant i1s as SETCCs This patch adds support for splatting i1 types to fixed-length or scalable vector types. It does so by lowering the operation to a SETCC of the equivalent i8 type. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D101465	2021-05-04 09:14:05 +01:00
David Green	18883a3fec	[TTI] Replace ceil lambdas with divideCeil. NFCI As pointed out in D101726, this function already exists in MathExtras. It uses different types, but with the values used here I believe that should not make a functional difference.	2021-05-04 09:04:44 +01:00
LLVM GN Syncbot	97f4789c38	[gn build] Port `ed51156084`	2021-05-04 06:39:48 +00:00
Reshabh Sharma	ed51156084	[ModuleUtils] NFC: Add unit tests for appendToUsedList This patch adds initial unit tests for appendToUsedList in the ModuleUtils. It specifically tests changes from https://reviews.llvm.org/D101363 which intends to allow insertion of globals in non-zero address spaces into the llvm used lists. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D101746	2021-05-04 12:05:50 +05:30
Greg McGary	27b426b0c8	[lld-macho] Implement builtin section renaming ld64 automatically renames many sections depending on output type and assorted flags. Here, we implement the most common configs. We can add more obscure flags and behaviors as needed. Depends on D101393 Differential Revision: https://reviews.llvm.org/D101395	2021-05-03 21:26:51 -07:00
Shivam Gupta	959eec1fdd	[NFC] Give a better diagnostic on clang-format not found error Contributors are often confused by whether this is a server or local issue.	2021-05-04 09:22:06 +05:30
Alex Lorenz	2669abaecf	[clang][CodeGen] Use llvm::stable_sort for multi version resolver options The use of llvm::sort causes periodic failures on the bot with EXPENSIVE_CHECKS enabled, as the regular sort pre-shuffles the array in the expensive checks mode, leading to a non-deterministic test result which causes the CodeGenCXX/attr-cpuspecific-outoflinedefs.cpp testcase to fail on the bot (http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-expensive/).	2021-05-03 20:07:00 -07:00
Matthias Springer	aa58281979	[mlir] Fix bug in TransferOpReduceRank when all dims are broadcasts TransferReadOps that are a scalar read + broadcast are handled by TransferReadToVectorLoadLowering. Differential Revision: https://reviews.llvm.org/D101808	2021-05-04 11:21:44 +09:00
natashaknk	07ce5c99d7	[mlir][tosa] Add lowerings for tosa.equal and tosa.arithmetic_right_shift Lowerings equal and arithmetic_right_shift for elementwise ops to linalg dialect using linalg.generic Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D101804	2021-05-03 18:26:49 -07:00
Philip Reames	11326cbcdb	[IndVarSimplify][NFC] Removed mayThrow from if-condition in predicateLoopExits of IndVarSimplify Instruction has a mayHaveSideEffects method that returns true if mayThrow returns true, because mayThrow is called internally in the first method. As such, the call being removed is redundant. Patch By: vdsered (Daniil Seredkin) Differential Revision: https://reviews.llvm.org/D101685	2021-05-03 18:25:07 -07:00
Sam Clegg	808fcddae4	[lld][WebAssembly] Fix crash with `-pie` without `--allow-undefined` `shouldImport` was not returning true in PIC mode even though our assumption elsewhere (in Relocations.cpp:scanRelocations) is that we don't report undefined symbols in PIC mode today. This was resulting in functions that were undefined but also not imported, which hits an assert later on that all functions have valid indexes. Differential Revision: https://reviews.llvm.org/D101716	2021-05-03 18:04:55 -07:00
Juneyoung Lee	24ce194cfe	[InstCombine] generalize select + select/and/or folding using implied conditions This patch optimizes the remaining possible cases in D101191 by generalizing isImpliedCondition()-based foldings. Assume that there is `op a, (select b, _, _)` where op is one of `and i1`, `or i1` or their select forms. We can do the following optimization based on the result of `isImpliedCondition(a, b)`: If a = true implies… - b = true: - select a, (select b, A, B), false => select a, A, false : https://alive2.llvm.org/ce/z/WCnZYh - and a, (select b, A, B) => select a, A, false : https://alive2.llvm.org/ce/z/uZhcMG - b = false: - select a, (select b, A, B), false => select a, B, false : https://alive2.llvm.org/ce/z/c2hJpV - and a, (select b, A, B) => select a, B, false : https://alive2.llvm.org/ce/z/5ggwMM If a = false implies… - b = true: - select a, true, (select b, A, B) => select a, true, A : https://alive2.llvm.org/ce/z/tidKvH - or a, (select b, A, B) => select a, true, A : https://alive2.llvm.org/ce/z/cC-uyb - b = false: - select a, true, (select b, A, B) => select a, true, B : https://alive2.llvm.org/ce/z/ZXpJq9 - or a, (select b, A, B) => select a, true, B : https://alive2.llvm.org/ce/z/hnDrJj Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D101720	2021-05-04 09:42:06 +09:00
Juneyoung Lee	012f01e227	Precommit tests for D101720 (NFC)	2021-05-04 09:30:04 +09:00
Jim Ingham	60ad0fd3c8	Clarify the help for "breakpoint command add" and "watchpoint command add". These two commands add a list of commands to the breakpoint/watchpoint. The current implementation only supports replacing the current command list. I started with that as overwrite seems to be the most common operation. But using "add" will allow us to later offer other add-modes: "prepend", "append" and "insert". That and "overwrite" then make up a useful set of options for this operation.	2021-05-03 17:22:43 -07:00
Arthur Eubanks	d14d84af2f	[NewPM] Only invalidate modified functions' analyses in CGSCC passes Previously, any change in any function in an SCC would cause all analyses for all functions in the SCC to be invalidated. With this change, we now manually invalidate analyses for functions we modify, then let the pass manager know that all function analyses should be preserved. So far this only touches the inliner, argpromotion, funcattrs, and updateCGAndAnalysisManager(), since they are the most used. Slight compile time improvements: http://llvm-compile-time-tracker.com/compare.php?from=326da4adcb8def2abdd530299d87ce951c0edec9&to=8942c7669f330082ef159f3c6c57c3c28484f4be&stat=instructions Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D100917	2021-05-03 17:21:44 -07:00
Sam Clegg	4ef1f90e4d	[lld][WebAssembly] Convert more tests to asm format. NFC Two of these are trivial. The third (shared.s) did have some expectations changes but only due to two data symbols being re-ordered. Differential Revision: https://reviews.llvm.org/D101711	2021-05-03 17:16:31 -07:00
Raman Tenneti	a72499e475	[libc] Introduce asctime, asctime_r to LLVM libc asctime and asctime_r share the same common code. They call asctime_internal, a static inline function. asctime uses snprintf to return the string representation in a buffer. It uses the following format (26 characters is the buffer size) as per section 7.27.3.1 in http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2478.pdf. The buf parameter for asctime_r shall point to a buffer of at least 26 bytes. snprintf(buf, 26, "%.3s %.3s%3d %.2d:%.2d:%.2d %d\n",...) Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D99686	2021-05-03 17:15:00 -07:00
LLVM GN Syncbot	47ee47e1e8	[gn build] Port `7310403e3c`	2021-05-04 00:04:57 +00:00
Tomasz Miąsko	7310403e3c	[demangler] Initial support for the new Rust mangling scheme Add demangling support for a small subset of the new Rust mangling scheme, with complete support planned as follow-up work. Integrate Rust demangling into llvm-cxxfilt and use llvm-cxxfilt for end-to-end testing. The new Rust mangling scheme uses "_R" as a prefix, which makes it easy to disambiguate it from other mangling schemes. The public API is modeled after __cxa_demangle / llvm::itaniumDemangle, since potential candidates for further integration use those. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D101444	2021-05-03 16:44:30 -07:00
Sam Clegg	73332d73e1	[lld][WebAssembly] Do not merge comdat data segments When running in relocatable mode any input data segments that are part of a comdat group should not be merged with other segments of the same name. This is because the final linker needs to keep them separate so they can be included/excluded individually. Often this is not a problem since normally only one section with a given name `foo` ends up in the output object file. However, the problem occurs when one input contains `foo` which is part of a comdat and another object contains a local symbol `foo`: we were attempting to merge them. This behaviour matches (I believe) that of the ELF linker. See `LinkerScript.cpp:addInputSec`. Fixes: https://github.com/emscripten-core/emscripten/issues/9726 Differential Revision: https://reviews.llvm.org/D101703	2021-05-03 16:43:29 -07:00
Philip Reames	e38ccb729b	Recommit "Generalize getInvertibleOperand recurrence handling slightly" This was reverted because of a reported problem. It turned out this patch didn't introduce said problem, it just exposed it more widely. `15a4233` fixes the root issue, so this simply a) rebases over that, and b) adds a much more extensive comment explaining why that weakened assert is correct. Original commit message follows: Follow up to D99912, specifically the revert, fix, and reapply thereof. This generalizes the invertible recurrence logic in two ways: * By allowing mismatching operand numbers of the phi, we can recurse through a pair of phi recurrences whose operand orders have not been canonicalized. * By allowing recurrences through operand 1, we can invert these odd (but legal) recurrences. Differential Revision: https://reviews.llvm.org/D100884	2021-05-03 16:40:56 -07:00
Arthur Eubanks	2df3426fd1	[NewPM] Invalidate AAManager after populating GlobalsAA GlobalsAA is only created at the beginning of the inliner pipeline. If an AAManager is cached from previous passes, it won't get rebuilt to include the newly created GlobalsAA. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D101379	2021-05-03 16:37:32 -07:00
Joseph Huber	182831258b	[Attributor] Add AAExecutionDomainInfo interface to OpenMPOpt Summary: Add the AAExecutionDomainInfo attributor instance to OpenMPOpt. This will infer information about the execution domain that an instruction might be expected to run in. Right now this only includes a very crude check for instructions that will be executed by the master thread by comparing a thread-id function with a constant zero. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101578	2021-05-03 19:24:19 -04:00
Philip Reames	2d6aff84c9	One more test case inspired by PR50191	2021-05-03 16:23:04 -07:00
Eugene Zhulenev	9b67096fe9	[mlir] Linalg: add vector transfer lowering patterns to the contraction lowering This fixes a performance regression in vec-mat vectorization Reviewed By: asaadaldien Differential Revision: https://reviews.llvm.org/D101795	2021-05-03 16:21:51 -07:00
Peyton, Jonathan L	9982f33e2c	[OpenMP] Refactor/Rework topology discovery code This patch does the following: 1) Introduce kmp_topology_t as the runtime-friendly structure (the corresponding global variable is __kmp_topology) to determine the exact machine topology which can vary widely among current and future architectures. The current design is not easy to expand beyond the assumed three layer topology: sockets, cores, and threads so a rework capable of using the existing KMP_AFFINITY mechanisms is required. This new topology structure has: * The depth and types of the topology * Ratio count for each consecutive level (e.g., number of cores per socket, number of threads per core) * Absolute count for each level (e.g., 2 sockets, 16 cores, 32 threads) * Equivalent topology layer map (e.g., Numa domain is equivalent to socket, L1/L2 cache equivalent to core) * Whether it is uniform or not The hardware threads are represented with the kmp_hw_thread_t structure. This structure contains the ids (e.g., socket 0, core 1, thread 0) and other information grabbed from the previous Address structure. The kmp_topology_t structure contains an array of these. 2) Generalize the KMP_HW_SUBSET envirable for the new kmp_topology_t structure. The algorithm doesn't assume any order with tiles, numa domains, sockets, cores, threads. Instead it just parses the envirable, makes sure it is consistent with the detected topology (including taking into account equivalent layers) and then trims away the unneeded subset of hardware threads. To enable this, a new kmp_hw_subset_t structure is introduced which contains a vector of items (hardware type, number user wants, offset). Any keyword within __kmp_hw_get_keyword() can be used as a name and can be shortened as well. e.g., KMP_HW_SUBSET=1s,2numa,4tile,2c,3t can be used on the KNL SNC-4 machine. 3) Simplify topology detection functions so they only do the singular task of detecting the machine's topology. Printing, and all canonicalizing functionality is now done afterwards. So many lines of duplicated code are eliminated. 4) Add new ll_caches and numa_domains to OMP_PLACES, and consequently, KMP_AFFINITY's granularity setting. All the names within __kmp_hw_get_keyword() are available for use in OMP_PLACES or KMP_AFFINITY's granularity setting. 5) Simplify and future-proof code where explicit lists of allowed affinity settings keywords inside if() conditions. 6) Add x86 CPUID leaf 4 cache detection to existing x2apic id method so equivalent caches could be detected (in particular for the ll_caches place). Differential Revision: https://reviews.llvm.org/D100997	2021-05-03 18:00:24 -05:00
Philip Reames	32b500431c	Add some additional test cases inspired by PR50191	2021-05-03 15:56:37 -07:00
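
The InstCombine fold in commit 80b897e21b relies on the identity that (popcount(X) ^ popcount(Y)) & 1 equals popcount(X ^ Y) & 1. The following is a minimal Python sketch, not taken from the LLVM sources, that checks the identity exhaustively for small bit widths. The helper names `popcount`, `parity_before`, `parity_after`, and `check_fold` are hypothetical and exist only for this illustration; the authoritative proof is the Alive2 link in the commit message.

```python
def popcount(x: int) -> int:
    """Count the set bits of a non-negative integer."""
    return bin(x).count("1")


def parity_before(x: int, y: int) -> int:
    """Pattern before the fold: (popcount(x) ^ popcount(y)) & 1."""
    return (popcount(x) ^ popcount(y)) & 1


def parity_after(x: int, y: int) -> int:
    """Pattern after the fold: popcount(x ^ y) & 1."""
    return popcount(x ^ y) & 1


def check_fold(bits: int = 8) -> bool:
    """Compare both patterns for every pair of `bits`-wide unsigned values."""
    limit = 1 << bits
    return all(
        parity_before(x, y) == parity_after(x, y)
        for x in range(limit)
        for y in range(limit)
    )
```

Calling `check_fold(8)` compares all 65,536 pairs of 8-bit values. The identity holds because popcount(x ^ y) = popcount(x) + popcount(y) - 2 * popcount(x & y), so the low bit of the sum matches the low bit of the XOR of the two counts.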


387374 Commits
