llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	80f9fd4ce3	[ELF][test] Rework non-preemptible ifunc tests	2021-07-15 11:31:05 -07:00
Nikita Popov	c03d25860a	[Verifier] Use isIntrinsic() (NFC) Call Function::isIntrinsic() instead of manually checking the function name for an "llvm." prefix.	2021-07-15 20:30:42 +02:00
Simon Pilgrim	ecf5813c82	[InstCombine] Add select(cond,gep(gep(x,y),z),gep(x,y)) tests from PR51069	2021-07-15 19:26:24 +01:00
Sam Tebbs	ff0ef6a518	[ARM][LowOverheadLoops] Make some stack spills valid for tail predication This patch makes vector spills valid for tail predication when all loads from the same stack slot are within the loop Differential Revision: https://reviews.llvm.org/D105443	2021-07-15 19:23:52 +01:00
Quinn Pham	de3956605a	[PowerPC] Fix popcntb XL Compat Builtin for 32bit This patch implements the `__popcntb` XL compatibility builtin for 32bit in the frontend and backend. This patch also updates tests for `__popcntb` and other XL Compat sync related builtins. Reviewed By: #powerpc, nemanjai, amyk Differential Revision: https://reviews.llvm.org/D105360	2021-07-15 13:19:47 -05:00
Simon Pilgrim	0a614ca225	Fix "unknown pragma 'GCC'" MSVC warning. NFCI.	2021-07-15 18:50:19 +01:00
Simon Pilgrim	d2cd3f88e7	[InstCombine] Add 3-operand gep test with different ptr and same indices	2021-07-15 18:50:19 +01:00
Dmitry Vyukov	7b302fc9b0	tsan: strip top inlined internal frames The new GET_CURRENT_PC() can lead to spurious top inlined internal frames. Here are 2 examples from bots, in both cases the malloc is supposed to be the top frame (#0): WARNING: ThreadSanitizer: signal-unsafe call inside of a signal #0 __sanitizer::StackTrace::GetNextInstructionPc(unsigned long) #1 malloc Location is heap block of size 99 at 0xbe3800003800 allocated by thread T1: #0 __sanitizer::StackTrace::GetNextInstructionPc(unsigned long) #1 malloc Let's strip these internal top frames from reports. With other code changes I also observed some top frames from __tsan::ScopedInterceptor, proactively remove these as well. Differential Revision: https://reviews.llvm.org/D106081	2021-07-15 19:37:44 +02:00
Philip Reames	a99d420a93	[SCEV] Fix unsound reasoning in howManyLessThans This is split from D105216, it handles only a subset of the cases in that patch. Specifically, the issue being fixed is that the code incorrectly assumed that (Start-Stide) < End implied that the backedge was taken at least once. This is not true when e.g. Start = 4, Stride = 2, and End = 3. Note that we often do produce the right backedge taken count despite the flawed reasoning. The fix chosen here is to use an alternate form of uceil (ceiling of unsigned divide) lowering which is safe when max(RHS,Start) > Start - Stride. (Note that signedness of both max expression and comparison depend on the signedness of the comparison being analyzed, and that overflow in the Start - Stride expression is allowed.) Note that this is weaker than proving the backedge is taken because it allows start - stride < end < start. Some cases which can't be proven safe are sent down the generic path, and we do end up generating less optimal expressions in a few cases. Credit for coming up with the approach goes entirely to Eli. I just split it off, tweaked the comments a bit, and did some additional testing. Differential Revision: https://reviews.llvm.org/D105942	2021-07-15 10:32:47 -07:00
Louis Dionne	4628ff4c31	[libc++] NFC: Reindent the run-buildbot script	2021-07-15 13:29:58 -04:00
Fangrui Song	aa3df8ddcd	[test] Avoid llvm-readelf/llvm-readobj one-dash long options and deprecated aliases (e.g. --file-headers)	2021-07-15 10:26:21 -07:00
Vy Nguyen	a35480f859	[llvm-exegesis] Fix missing-headers build errors. Details: Switch all #includes to use <> because that is consistent with what happens in the cmake checks. Otherwise, we could be in the situation where cmake checks see that headers exist at <perfmon/...> but in llvm-exegesis code, we use "perfmon/...", which may not exist. Related PR/revisions: D84076, PR51017+D105615 Differential Revision: https://reviews.llvm.org/D105861	2021-07-15 13:20:25 -04:00
Arthur Eubanks	99cb2507f3	Revert "[SLP]Workaround for InsertSubVector cost." This reverts commit `2eb50baf05`. Causes hangs, see comments on D105827.	2021-07-15 10:19:41 -07:00
Jessica Paquette	5da0f9ab61	[GlobalISel] Fix infinite loop in reassociationCanBreakAddressingModePattern It didn't update the opcode while walking through G_INTTOPTR/G_PTRTOINT. Differential Revision: https://reviews.llvm.org/D106080	2021-07-15 10:09:07 -07:00
Wouter van Oortmerssen	4157b6033d	[WebAssembly] Fixed LLD generation of 64-bit __wasm_apply_data_relocs Differential Revision: https://reviews.llvm.org/D105863	2021-07-15 10:02:02 -07:00
Leonard Grey	c931ff72bd	[lld-macho] Add LTO cache support This adds support for the lld-only `--thinlto-cache-policy` option, as well as implementations for ld64's `-cache_path_lto`, `-prune_interval_lto`, `-prune_after_lto`, and `-max_relative_cache_size_lto`. Test is adapted from lld/test/ELF/lto/cache.ll Differential Revision: https://reviews.llvm.org/D105922	2021-07-15 12:56:13 -04:00
Stanislav Mekhanoshin	c46d99e4ba	[AMDGPU] Refine -O0 and -O1 passes. Differential Revision: https://reviews.llvm.org/D105579	2021-07-15 09:51:54 -07:00
Fangrui Song	96e9bc4244	[llvm-nm] Remove one-dash long options except -arch The documentation and help messages have recommended the double-dash forms for quite a while. Remove one-dash long options which are not recognized by GNU style `getopt_long`. `-arch` is kept as it is in the manpage of classic nm https://keith.github.io/xcode-man-pages/nm.1.html Note: the dyldinfo related options don't have a test. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105948	2021-07-15 09:50:37 -07:00
Fangrui Song	7299c6f635	[test] Avoid llvm-nm one-dash long options	2021-07-15 09:50:36 -07:00
Aart Bik	2b6e433230	[mlir][sparse] add shift ops support Arbitrary shifts have some complications, but shift by invariants (viz. tensor index exp only at left hand side) can be easily handled with the conjunctive rule. Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D106002	2021-07-15 09:43:12 -07:00
Andrzej Warzynski	9f6ff37a36	[flang][driver] Randomise the names of the unparsed files This patch makes sure that the base name of the temporary unparsed files (generated by the `flang` bash script) are randomised and unique to a particular invocation of the script. Otherwise, we cannot reliably run the script in parallel. Differential Revision: https://reviews.llvm.org/D106052	2021-07-15 17:17:50 +01:00
Andrzej Warzynski	47f846f8c5	Enable Flang by default in the test-release.sh script I've also brought this up on llvm-dev: https://lists.llvm.org/pipermail/llvm-dev/2021-July/151744.html Differential Revision: https://reviews.llvm.org/D105885	2021-07-15 17:17:49 +01:00
Aart Bik	e6e79b3f0b	[mlir][sparse] remove linalg-to-loops from integration tests With the migration from linalg.copy to memref.copy, this pass (which was there solely to handle the linalg.copy op) is no longer required for the end-to-end path for sparse compilation. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D106073	2021-07-15 09:14:46 -07:00
Louis Dionne	1f8e286cdc	[libc++] Add a CMake target to re-generate files and revamp CONTRIBUTING.rst As we automate more and more things in the library, it becomes useful for contributors to have a single target for running all the automation as part of their workflow. This commit adds a new `libcxx-generate-files` target that should re-generate all the auto-generated files in the library. As a fly-by, I also revamped the documentation on Contributing to account for this new target and present it as a bullet list of things to check before committing. I also added a few things that are often overlooked to that list, such as updating the synopsis and the status files. Differential Revision: https://reviews.llvm.org/D106067	2021-07-15 12:07:26 -04:00
Nikita Popov	c191035f42	[IR] Add elementtype attribute This implements the elementtype attribute specified in D105407. It just adds the attribute and the specified verifier rules, but doesn't yet make use of it anywhere. Differential Revision: https://reviews.llvm.org/D106008	2021-07-15 18:04:26 +02:00
Nikita Popov	1fd23a065b	[LangRef] Add elementtype attribute This adds an elementtype(<ty>) attribute, which can be used to attach an element type to a pointer typed argument. It is similar to byval/byref in purpose, but unlike those does not carry any specific semantics by itself. However, certain intrinsics may require it and interpret it in specific ways. The in-tree use cases for this that I'm currently aware of are: call ptr @llvm.preserve.array.access.index.p0.p0(ptr elementtype(%ty) %base, i32 %dim, i32 %index) call ptr @llvm.preserve.struct.access.index.p0.p0(ptr elementtype(%ty) %base, i32 %gep_index, i32 %di_index) call token @llvm.experimental.gc.statepoint.p0(i64 0, i32 0, ptr elementtype(void ()) @foo, i32 0, i32 0, i32 0, i32 0, ptr addrspace(1) %obj) Notably, the gc.statepoint case needs a function as element type, in which case the workaround of adding a separate %ty undef argument would not work, as arguments cannot be unsized. Differential Revision: https://reviews.llvm.org/D105407	2021-07-15 18:04:25 +02:00
Arthur Eubanks	04b75c05b0	[InstCombine] Look through invariant group intrinsics when removing malloc Fixes some regressions with -fstrict-vtable-pointers in llvm-test-suite. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D106017	2021-07-15 09:02:40 -07:00
Geoffrey Martin-Noble	ab03ef124a	[Bazel] Update for `01bdb0f75e` Update the build files for https://github.com/llvm/llvm-project/commit/01bdb0f75efb Tested: bazel query //... + @llvm-project//... \| xargs bazel test --config=generic_clang --config=rbe --test_output=errors --test_ta g_filters=-nobuildkite --build_tag_filters=-nobuildkite Differential Revision: https://reviews.llvm.org/D106075	2021-07-15 09:00:40 -07:00
Philip Reames	95346ba877	[LV] Enable vectorization of multiple exit loops w/computable exit counts This change enables vectorization of multiple exit loops when the exit count is statically computable. That requirement - shared with the rest of LV - in turn requires each exit to be analyzeable and to dominate the latch. The majority of work to support this was done in a set of previous patches. In particular,, `72314466` avoids having multiple edges from the middle block to the exits, and `4b33b2387` which added support for non-latch single exit and multiple exits with a single exiting block. As a result, this change is basically just removing a bailout and adjusting some tests now that the prerequisite work is done and has stuck in tree for a bit. Differential Revision: https://reviews.llvm.org/D105817	2021-07-15 08:53:51 -07:00
Nikita Popov	f59209a86e	[AsmParser] Unify parsing of attributes Continuing on from D105780, this should be the last major bit of attribute cleanup. Currently, LLParser implements attribute parsing for functions, parameters and returns separately, enumerating all supported (and unsupported) attributes each time. This patch extracts the common parsing logic, and performs a check afterwards whether the attribute is valid in the given position. Parameters and returns are handled together, while function attributes need slightly different logic to support attribute groups. Differential Revision: https://reviews.llvm.org/D105938	2021-07-15 17:51:11 +02:00
Dmitry Vyukov	e33446ea58	tsan: make obtaining current PC faster We obtain the current PC is all interceptors and collectively common interceptor code contributes to overall slowdown (in particular cheaper str/mem* functions). The current way to obtain the current PC involves: 4493e1: e8 3a f3 fe ff callq 438720 <_ZN11__sanitizer10StackTrace12GetCurrentPcEv> 4493e9: 48 89 c6 mov %rax,%rsi and the called function is: uptr StackTrace::GetCurrentPc() { 438720: 48 8b 04 24 mov (%rsp),%rax 438724: c3 retq The new way uses address of a local label and involves just: 44a888: 48 8d 35 fa ff ff ff lea -0x6(%rip),%rsi I am not switching all uses of StackTrace::GetCurrentPc to GET_CURRENT_PC because it may lead some differences in produced reports and break tests. The difference comes from the fact that currently we have PC pointing to the CALL instruction, but the new way does not yield any code on its own so the PC points to a random instruction in the function and symbolizing that instruction can produce additional inlined frames (if the random instruction happen to relate to some inlined function). Reviewed By: melver Differential Revision: https://reviews.llvm.org/D106046	2021-07-15 17:34:00 +02:00
Victor Huang	d40e8091bd	[PowerPC] Add PowerPC rotate related builtins and emit target independent code for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtins and emit target independent code for rotate related operations. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D104744	2021-07-15 10:23:54 -05:00
Shilei Tian	a70ef3f568	Revert "[AbstractAttributor] Fold function calls to `__kmpc_is_spmd_exec_mode` if possible" This reverts commit `1100e4aafe`.	2021-07-15 11:19:28 -04:00
Simon Pilgrim	0aece73aba	[DAG] Fold select(cond,binop(x,y),binop(x,z)) -> binop(x,select(cond,y,z)) Similar to the folds performed in InstCombinerImpl::foldSelectOpOp, this attempts to push a select further up to help merge a pair of binops. I'm primarily interested in select(cond,add(x,y),add(x,z)) folds to help expose pointer math (see https://bugs.llvm.org/show_bug.cgi?id=51069 etc.) but I've tried to use the more generic isBinOp(). Differential Revision: https://reviews.llvm.org/D106058	2021-07-15 16:08:30 +01:00
Aart Bik	68ac2e53ff	[mlir][sparse] replace linalg.copy with memref.copy Note, this revision relies on the following revision for a bugfix in the memref copy library in order for all sparse integration tests to pass. https://reviews.llvm.org/D106036 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D106038	2021-07-15 07:56:50 -07:00
Simon Pilgrim	3cc38703d5	[NVPTX] Tweak fast-math tests to avoid select(binop(x,y),binop(x,z)) fold As suggested on D106058, tweak the tests to keep the combineRepeatedFPDivisors test coverage.	2021-07-15 15:42:25 +01:00
Gabor Marton	d0d37fcc4e	[Analyzer][solver] Remove unused functions ../../git/llvm-project/clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:2395:17: warning: 'clang::ento::ProgramStateRef {anonymous}::RangeConstraintManager::setRange(clang::ento::ProgramStateRef, {anonymous}::EquivalenceClass, clang::ento::RangeSet)' defined but not used [-Wunused-function] ../../git/llvm-project/clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:2384:10: warning: 'clang::ento::RangeSet {anonymous}::RangeConstraintManager::getRange(clang::ento::ProgramStateRef, {anonymous}::EquivalenceClass)' defined but not used [-Wunused-function] Differential Revision: https://reviews.llvm.org/D106063	2021-07-15 16:36:01 +02:00
Sander de Smalen	a607f64118	Revert "[LV] Print remark when loop cannot be vectorized due to invalid costs." This reverts commit `efaf3099c8`. This reverts commit `dc7bdc1e71`. Reverting patches due to buildbot failures.	2021-07-15 15:21:57 +01:00
Anton Zabaznov	05eb59e1d0	[OpenCL] Add support of __opencl_c_program_scope_global_variables feature macro Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D103191	2021-07-15 17:21:19 +03:00
Tim Northover	f24335c69e	MachO: fix Clang test broken by dropping private labels in LLVM. LLVM changed to not emit L... labels for things marked "do_not_dead_strip" because the linker can sometimes drop the flag if there's no proper symbol. This Clang test checked for the old behaviour, but doesn't actually care about that bit.	2021-07-15 15:05:08 +01:00
Nathan Sidwell	b36c4bb3ec	[docs] More CMAKE variable documentation This breaks out some (more) common llvm-specific variables. Controlling the subprojects and target architectures, along with clues about restricting build parallelism when linking. 'more common' is somewhat subjective, of course. Differential Revision: https://reviews.llvm.org/D105822	2021-07-15 06:56:49 -07:00
David Green	dad506bd4e	[ARM] Expand types handled in VQDMULH recognition We have a DAG combine for recognizing the sequence of nodes that make up an MVE VQDMULH, but only currently handles specifically legal types. This patch expands that to other power-2 vector types. For smaller than legal types this means any_extending the type and casting it to a legal type, using a VQDMULH where we only use some of the lanes. The result is sign extended back to the original type, to properly set the invalid lanes. Larger than legal types are split into chunks with extracts and concat back together. Differential Revision: https://reviews.llvm.org/D105814	2021-07-15 14:47:53 +01:00
Tim Northover	5d7632ee72	MachO: don't emit L... private symbols in do_not_dead_strip sections. The linker can sometimes drop the do_not_dead_strip if it can't associate the atom with a symbol (the other place to specify no dead-stripping in MachO files).	2021-07-15 14:40:43 +01:00
liuke	034b94bb71	Fix documentation; NFC The documentation about ignoringImpCasts is wrong, which can cause misunderstandings. This patch fixes it.	2021-07-15 09:38:05 -04:00
Roman Lebedev	3e6c383dc6	[SimplifyCFG] Rerun PHI deduplication after common code sinkinkg (PR51092) `SinkCommonCodeFromPredecessors()` doesn't itself ensure that duplicate PHI nodes aren't created. I suppose, we could teach it to do that on-the-fly (& account for the already-existing PHI nodes, & adjust costmodel), the diff will be bigger than this. The alternative is to schedule a new EarlyCSE pass invocation somewhere later in the pipeline. Clearly, we don't have any EarlyCSE runs in module optimization passline, so this pattern isn't cleaned up... That would perhaps better, but it will again have some compile time impact. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D106010	2021-07-15 16:34:34 +03:00
Adrian Kuegel	74b88807ae	[mlir][rocdl] Add math::Exp2Op lowering to ROCDL Differential Revision: https://reviews.llvm.org/D106057	2021-07-15 14:33:04 +02:00
Simon Pilgrim	91e151476c	[TTI] Consistently make getMinVectorRegisterBitWidth() methods const. NFCI. The underlying getMinVectorRegisterBitWidth() methods are const, but it was missed in a couple of TargetTransformInfo wrappers. Noticed while working on D103925	2021-07-15 13:27:55 +01:00
Sander de Smalen	dc7bdc1e71	[LV] Fix determinism for failing scalable-call.ll test. The sort function for emitting an OptRemark was not deterministic, which caused scalable-call.ll to fail on some buildbots. This patch fixes that. This patch also fixes an issue where `Instruction::comesBefore()` is called when two Instructions are in different basic blocks, which would otherwise cause an assertion failure.	2021-07-15 13:16:59 +01:00
Nicolas Vasilache	01bdb0f75e	[mlir][linalg] Improve implementation of hoist padding. Instead of relying on adhoc bounds calculations, use a projection-based implementation. This simplifies the implementation and finds more static constant sizes than previously/ Differential Revision: https://reviews.llvm.org/D106054	2021-07-15 12:10:31 +00:00
Louis Dionne	5024fe9306	[libc++] Mark failing rel_ops test as XFAIL in back-deployment The test triggers availability errors.	2021-07-15 08:04:33 -04:00

... 2 3 4 5 6 ...

393878 Commits All Branches Search

393878 Commits

All Branches