llvm-project

Commit Graph

Author	SHA1	Message	Date
Siva Chandra Reddy	804dc3dcf2	[libc] Clear all exceptions before setting in fesetexceptflag. Previously, exceptions from the flag were being added. This patch changes it such that only the exceptions in the flag will be set. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D105085	2021-06-30 17:29:48 +00:00
Philip Reames	0c2f40f916	[instcombine] Precommit tests for umin(a,b) ne/eq 0 fold	2021-06-30 10:29:19 -07:00
Siva Chandra Reddy	9474ddc3ac	[libc] Fix feclearexcept for x86_64. Previously, feclearexcept cleared all exceptions irrespective of the argument. This change brings it in line with the aarch64 flavors wherein only those exceptions listed in the argument will be cleared. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D105081	2021-06-30 17:28:06 +00:00
Philip Reames	c4fc2cb5b2	[instcombine] umin(x, 1) == zext(x != 0) We already implemented this for the select form, but the intrinsic form was missing. Note that this doesn't change poison behavior as 1 is non-poison, and the optimized form is still poison exactly when x is.	2021-06-30 10:20:01 -07:00
Tomas Matheson	f617ab1044	[NPM] Resolve llvmGetPassPluginInfo to the plugin being loaded Dynamically loaded plugins for the new pass manager are initialised by calling llvmGetPassPluginInfo. This is defined as a weak symbol so that it is continually redefined by each plugin that is loaded. When loading a plugin from a shared library, the intention is that llvmGetPassPluginInfo will be resolved to the definition in the most recent plugin. However, using a global search for this resolution can fail in situations where multiple plugins are loaded. Currently: * If a plugin does not define llvmGetPassPluginInfo, then it will be silently resolved to the previous plugin's definition. * If loading the same plugin twice with another in between, e.g. plugin A/plugin B/plugin A, then the second load of plugin A will resolve to llvmGetPassPluginInfo in plugin B. * The previous case can also occur when a dynamic library defines both NPM and legacy plugins; the legacy plugins are loaded first and then with `-fplugin=A -fpass-plugin=B -fpass-plugin=A`: A will be loaded as a legacy plugin and define llvmGetPassPluginInfo; B will be loaded and redefine it; and finally when A is loaded as an NPM plugin it will be resolved to the definition from B. Instead of searching globally, restrict the symbol lookup to the library that is currently being loaded. Differential Revision: https://reviews.llvm.org/D104916	2021-06-30 18:11:28 +01:00
Yaxun (Sam) Liu	434bd5bf54	[AMDGPU] Add builtin functions image_bvh_intersect_ray Reviewed by: Stanislav Mekhanoshin, Matt Arsenault Differential Revision: https://reviews.llvm.org/D104946	2021-06-30 13:10:47 -04:00
Nico Weber	f6db88535c	[gn build] add dep needed after `b56e5f8a10`	2021-06-30 12:58:59 -04:00
Nico Weber	b56e5f8a10	[clangd] Unbreak mac build after `0c96a92d86` That commit removed the include of Features.inc from ClangdLSPServer.h, but ClangdMain.cpp relied on this include to pull in Features.inc for the #if at the bottom of Transport.h. Since the include is needed in Transport.h, just add it to there directly.	2021-06-30 12:53:38 -04:00
Fangrui Song	7b06bfc49e	[ELF] -pie: produce dynamic relocations for absolute relocations referencing undef weak See the comment for my understanding of -no-pie and -shared expectation. -no-pie has freedom on choices. We choose dynamic relocations to be consistent with the handling of GOT-generating relocations. Note: GNU ld has arch-varying behaviors and its x86 -pie has a very complex rule: if there is at least one GOT-generating or PLT-generating relocation and -z dynamic-undefined-weak (enabled by default) is in effect, generate a dynamic relocation. We don't emulate its rule. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D105164	2021-06-30 09:43:28 -07:00
David Goldman	570984204f	[clangd] Fix highlighting for implicit ObjC property refs Objective-C lets you use the `self.prop` syntax as sugar for both `[self prop]` and `[self setProp:]`, but clangd previously did not provide a semantic token for `prop`. Now, we provide a semantic token, treating it like a normal property except it's backed by a `ObjCMethodDecl` instead of a `ObjCPropertyDecl`. Differential Revision: https://reviews.llvm.org/D104117	2021-06-30 12:31:50 -04:00
Caroline Tice	05915400b7	[lldb] Replace SVE_PT* macros in NativeRegisterContextLinux_arm64.{cpp,h} with their equivalent defintions in LinuxPTraceDefines_arm64sve.h Commit `090306fc80` (August 2020) changed most of the arm64 SVE_PT* macros, but apparently did not make the changes in the NativeRegisterContextLinux_arm64.* files (or those files were pulled over from someplace else after that commit). This change replaces the macros NativeRegisterContextLinux_arm64.cpp with the replacement definitions in LinuxPTraceDefines_arm64sve.h. It also includes LinuxPTraceDefines_arm64sve.h in NativeRegisterContextLinux_arm64.h. Differential Revision: https://reviews.llvm.org/D104826	2021-06-30 09:26:20 -07:00
thomasraoux	0298f2cfb1	[mlir] Fix wrong type in WmmaConstantOpToNVVMLowering InsertElement takes a scalar integer attribute not an array of integer. Differential Revision: https://reviews.llvm.org/D105174	2021-06-30 09:10:02 -07:00
thomasraoux	4392841949	[mlir][VectorToGPU] Support converting vetor.broadcast to MMA op Differential Revision: https://reviews.llvm.org/D105175	2021-06-30 09:08:55 -07:00
LLVM GN Syncbot	0596f7d828	[gn build] Port `0c96a92d86`	2021-06-30 15:57:43 +00:00
Jeremy Morse	4955544162	[LiveDebugValues][InstrRef][1/2] Recover more clobbered variable locations In various circumstances, when we clobber a register there may be alternative locations that the value is live in. The classic example would be a value loaded from the stack, and then clobbered: the value is still available on the stack. InstrRefBasedLDV was coping with this at block starts where it's forced to pick a location, however it wasn't searching for alternative locations when values were clobbered. This patch notifies the "Transfer Tracker" object when clobbers occur, and it's able to find alternatives and issue DBG_VALUEs for that location. See: the added test. Differential Revision: https://reviews.llvm.org/D88405	2021-06-30 16:56:25 +01:00
Joseph Huber	ecabc6684f	[OpenMP] Change analysis remarks to not emit on cold functions The remarks will trigger on some functions that are marked cold, such as the `__muldc3` intrinsic functions. Change the remarks to avoid these functions. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105196	2021-06-30 11:54:24 -04:00
Sam McCall	b447445eaa	[clangd] Show padding following a field on field hover. This displays as: `Size: 4 bytes (+4 padding)` Also stop showing (byte) offset/size for bitfields. They're not meaningful and using them to calculate padding is dangerous! Differential Revision: https://reviews.llvm.org/D98377	2021-06-30 17:50:59 +02:00
Sam McCall	0c96a92d86	[clangd] Log feature configuration (linux+asan+grpc) of the clangd build Included in logs, --version, remote index queries, and LSP serverInfo. Differential Revision: https://reviews.llvm.org/D100553	2021-06-30 17:49:29 +02:00
Sam McCall	bb41f85691	[clangd] Correct SelectionTree behavior around anonymous field access. struct A { struct { int b; }; }; A().^b; This should be considered a reference to b, but currently it's considered a reference to the anonymous struct field. Fixes https://github.com/clangd/clangd/issues/798 Differential Revision: https://reviews.llvm.org/D104376	2021-06-30 17:34:48 +02:00
Philip Reames	14d8f1546a	[SCEV] Fold (0 udiv %x) to 0 We have analogous rules in instsimplify, etc.., but were missing the same in SCEV. The fold is near trivial, but came up in the context of a larger change.	2021-06-30 08:31:13 -07:00
Philip Reames	5e630a9202	[test] precommit a test for missing (0 /u %x) SCEV fold	2021-06-30 08:26:34 -07:00
Louis Dionne	c69cfbfd71	[libc++] Remove broken links and outdated information in the docs The various design docs have been moved to RST, and the linked blog post does not apply anymore since libc++ is the default library used by Clang on Apple platforms.	2021-06-30 11:12:37 -04:00
Craig Topper	0f1f92156f	[ARM] Fix incorrect assignment of Changed variable in MVEGatherScatterLowering::optimiseOffsets. I believe this Changed flag should be initialized to false, otherwise the if (!Changed) is always dead. This doesn't manifest in a functional issue because the PHINode checks will fail if nothing changed. They are identical to the earlier checks that must have already failed to get into this else block. While there remove an else after return to reduce indentation. Differential Revision: https://reviews.llvm.org/D105159	2021-06-30 07:52:57 -07:00
Louis Dionne	fec521a7b2	[lit] Add the ability to parse regexes in Lit boolean expressions This patch augments Lit with the ability to parse regular expressions in boolean expressions. This includes REQUIRES:, XFAIL:, UNSUPPORTED:, and all other special Lit markup that evaluates to a boolean expression. Regular expressions can be specified by enclosing them in {{...}}, similarly to how FileCheck handles such regular expressions. The regular expression can either be on its own, or it can be part of an identifier. For example, a match expression like {{.+}}-apple-darwin{{.+}} would match the following variables: x86_64-apple-darwin20.0 arm64-apple-darwin20.0 arm64-apple-darwin22.0 etc... In the long term, this could be used to remove the need to handle the target triple specially when parsing boolean expressions. Differential Revision: https://reviews.llvm.org/D104572	2021-06-30 10:52:16 -04:00
Florian Mayer	90f244c843	[hwasan] Add missing newline in report. Reviewed By: glider Differential Revision: https://reviews.llvm.org/D105190	2021-06-30 15:48:45 +01:00
Simon Pilgrim	47941d601d	[CostModel][X86] Adjust fp<->int vXi32 AVX1+ costs based on llvm-mca reports Based off the worse case numbers generated by D103695, the AVX1/2/512 sitofp/uitofp/fptosi/fptoui costs were higher than necessary (based off instruction counts instead of actual throughput). The SSE costs still need further fixes, but I hit an issue with the order in which SSE costs are checked - we need to check CUSTOM costs (with non-legal types) first, and then fallback to LEGALIZED types. I'm looking at this now, and this should let us start thinning out a lot of the duplicates in the costs tables. Then we can finally start work on vXi64 / vXi16 / vXi8 / vXi1 integers, which should let us look at sub-128-bit vectorization (D103925).	2021-06-30 15:23:34 +01:00
Nico Weber	db86e5c914	Revert "[Coroutine] Add statistics for the number of elided coroutine" This reverts commit `1d9539cf49`. Test fails in LLVM_ENABLE_ASSERTIONS=OFF builds (such as regular release builds).	2021-06-30 10:22:45 -04:00
William S. Moses	ad4152d1b8	[MLIR] Update description of SCF.execute_region op See https://reviews.llvm.org/D104865	2021-06-30 10:09:42 -04:00
William S. Moses	dfb34c0df9	[MLIR][SCF] Inline ExecuteRegion if parent can contain multiple blocks The executeregionop is used to allow multiple blocks within SCF constructs. If the container allows multiple blocks, inline the region Differential Revision: https://reviews.llvm.org/D104960	2021-06-30 10:03:22 -04:00
Melanie Blower	e773216f46	[clang][patch] Add builtin __arithmetic_fence and option fprotect-parens This patch adds a new clang builtin, __arithmetic_fence. The purpose of the builtin is to provide the user fine control, at the expression level, over floating point optimization when -ffast-math (-ffp-model=fast) is enabled. The builtin prevents the optimizer from rearranging floating point expression evaluation. The new option fprotect-parens has the same effect on parenthesized expressions, forcing the optimizer to respect the parentheses. Reviewed By: aaron.ballman, kpn Differential Revision: https://reviews.llvm.org/D100118	2021-06-30 09:58:06 -04:00
Joseph Huber	0edb87773b	[OpenMP] Add additional remarks for OpenMPOpt This patch adds additional remarks, suggesting the use of `noescape` for failed globalization and indicating when internalization failed. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105150	2021-06-30 09:49:25 -04:00
William S. Moses	0cd8422e8c	[MLIR] Eliminate unnecessary affine stores Deduce circumstances where an affine load could not possibly be read by an operation (such as an affine load), and if so, eliminate the load Differential Revision: https://reviews.llvm.org/D105041	2021-06-30 09:45:26 -04:00
Florian Hahn	8db9cb262f	[Matrix] Add tests for hoisting address computations.	2021-06-30 14:31:00 +01:00
Peter Smith	fc1cb3104b	[LLD][ELF][ARM] Tidy up test to hook up missing filecheck patterns [NFC] A couple of filecheck patterns had not been hooked up with the patterns suffering from some drift. As this test is old and llvm-objdump has improved a lot, take this opportunity to hide the instruction encoding. I've also taken out a lot of the explanatory comments that llvm-objdump improvements make redundant, as these comments oftern don't get updated when addresses change. Differential Revision: https://reviews.llvm.org/D104907	2021-06-30 14:16:40 +01:00
alex-t	e585b332e4	[AMDGPU] PHI node cost should not be counted for the size and latency. Details: https://reviews.llvm.org/D96805 changed the GCNTTIImpl::getCFInstrCost to return 1 for the PHI nodes for the TTI::TCK_CodeSize and TTI::TCK_SizeAndLatency. This is incorrect because the value moves that are the result of the PHI lowering are inserted into the basic block predecessors - not into the block itself. As a result of this change LoopRotate and LoopUnroll were broken because of the incorrect Loop header and loop body size/cost estimation. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D105104	2021-06-30 16:11:17 +03:00
Peter Smith	dd4d3f7406	[LLD][ELF][ARM] Fix case of patched unrelocated BLX There are a couple of problems with the code to patch unrelocated BLX instructions: 1. The calculation of the PC needs to take into account the alignment of the instruction. The Thumb BLX uses alignDown(PC, 4) for the source address. 2. The calculation of the PC bias is hard-coded to 4 which works for Thumb, but when there is a BLX the branch will be in Arm state so it needs an 8 byte PC bias. No asssembler generates an unrelocated BLX instruction so these problems do not affect real world programs. However we should still fix them. Differential Revision: https://reviews.llvm.org/D104905	2021-06-30 14:07:35 +01:00
Bradley Smith	002911503f	[TargetLowering][AArch64][SVE] Take into account accessed type when clamping address When clamping the index for a memory access to a stacked vector we must take into account the entire type being accessed, not just assume that we are accessing only a single element. Differential Revision: https://reviews.llvm.org/D105016	2021-06-30 13:30:18 +01:00
Tobias Gysi	42d99bc376	[mlir][linalg][python] Update the OpDSL doc (NFC). Update the OpDSL documentation to reflect recent changes. In particular, the updated documentation discusses: - Attributes used to parameterize index expressions - Shape-only tensor support - Scalar parameters Differential Revision: https://reviews.llvm.org/D105123	2021-06-30 12:27:17 +00:00
Saiyedul Islam	f7ce532d62	[clang-offload-bundler] Add unbundling of archives containing bundled object files into device specific archives This patch adds unbundling support of an archive file. It takes an archive file along with a set of offload targets as input. Output is a device specific archive for each given offload target. Input archive contains bundled code objects bundled using clang-offload-bundler. Each generated device specific archive contains a set of device code object files which are named as <Parent Bundle Name>-<CodeObject-GPUArch>. Entries in input archive can be of any binary type which is supported by clang-offload-bundler, like *.bc. Output archives will contain files in same type. Example Usuage: clang-offload-bundler --unbundle --inputs=lib-generic.a -type=a -targets=openmp-amdgcn-amdhsa--gfx906,openmp-amdgcn-amdhsa--gfx908 -outputs=devicelib-gfx906.a,deviceLib-gfx908.a Reviewed By: jdoerfert, yaxunl Differential Revision: https://reviews.llvm.org/D93525	2021-06-30 17:55:50 +05:30
Simon Pilgrim	fcd0cb3921	Fix MSVC "32-bit shift implicitly converted to 64 bits" warning.	2021-06-30 13:23:53 +01:00
Alexey Bataev	7fab1146e4	[OPENMP]Fix PR50929: Ignored initializer clause in user-defined reduction. No need to try to create the default constructor for private copy, it will be called automatically in the initializer of the declare reduction. Fixes balance between constructors/destructors calls. Differential Revision: https://reviews.llvm.org/D105143	2021-06-30 04:55:38 -07:00
Zhouyi Zhou	2fd75507d1	[clang] NFC: add line break at the end of if expressions Hi, In function TransformTemplateArgument, would it be better to add line break at the end of "if" expressions? I use clang-format to do the job for me. Thanks a lot Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D104604	2021-06-30 19:48:24 +08:00
madhur13490	a7ed55f64c	[AMDGPU] Simplify getReservedNumSGPRs This is a followup patch on D103636 where it seemed checking on amdgpu-calls and amdgpu-stack-objects is unnecessary. Removing these checks didn't regress any tests functionally. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D104513	2021-06-30 16:19:39 +05:30
Florian Hahn	611a02cce5	[ConstantRanges] Use APInt for constant case for urem/srem. Currently UREM & SREM on constant ranges produces overly pessimistic results for single element constant ranges. Delegate to APInt's implementation if both operands are single element constant ranges. We already do something similar for other binary operators, like binary AND. Fixes PR49731. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D105115	2021-06-30 11:18:20 +01:00
Florian Mayer	ad8494c021	[hwasan] Make sure we retag with a new tag on free. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D105021	2021-06-30 11:13:38 +01:00
David Sherwood	7b7b5b5a26	[NFC] Rename shadowed variable in InnerLoopVectorizer::createInductionVariable Avoid creating a IRBuilder stack variable with the same name as the class member.	2021-06-30 11:11:49 +01:00
Florian Mayer	a24f104645	[MTE] Remove redundant helper function. Looking at PostDominatorTree::dominates, we can see that has the same logic (with the addition of handling Phi nodes - which are not used as inputs in this pass) as the helper function. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D105141	2021-06-30 11:11:26 +01:00
Jay Foad	2da58826a5	[TableGen] Allow identical MnemonicAliases with no predicate My use case for this is illustrated in the test case: I want to define the same instruction twice with different (disjoint) predicates, because the instruction has different operands on different subtargets. It's convenient to do this with a multiclass that also defines an alias for the instruction. Previously tablegen would complain if this alias was defined twice with no predicate. One way to fix this would be to add a predicate on each definition of the alias, matching the predicate on the instruction. But this (a) is slightly awkward to do in the real world use case I had, and (b) leads to an inefficient matcher that will do something like this: if (Mnemonic == "foo_alias") { if (Features.test(Feature_Subtarget1Bit)) Mnemonic == "foo"; else if (Features.test(Feature_Subtarget2Bit)) Mnemonic == "foo"; return; } It would be more efficient to skip the feature tests and return "foo" unconditionally. Overall it seems better to allow multiple definitions of the identical alias with no predicate. Differential Revision: https://reviews.llvm.org/D105033	2021-06-30 10:53:39 +01:00
Valeriy Savchenko	c818cb96ad	[analyzer][satest][NFC] Relax dependencies requirements	2021-06-30 12:50:21 +03:00
Igor Kudrin	657e067bb5	[ARMInstPrinter] Print the target address of a branch instruction This follows other patches that changed printing immediate values of branch instructions to target addresses, see D76580 (x86), D76591 (PPC), D77853 (AArch64). As observing immediate values might sometimes be useful, they are printed as comments for branch instructions. // llvm-objdump -d output (before) 000200b4 <_start>: 200b4: ff ff ff fa blx #-4 <thumb> 000200b8 <thumb>: 200b8: ff f7 fc ef blx #-8 <_start> // llvm-objdump -d output (after) 000200b4 <_start>: 200b4: ff ff ff fa blx 0x200b8 <thumb> @ imm = #-4 000200b8 <thumb>: 200b8: ff f7 fc ef blx 0x200b4 <_start> @ imm = #-8 // GNU objdump -d. 000200b4 <_start>: 200b4: faffffff blx 200b8 <thumb> 000200b8 <thumb>: 200b8: f7ff effc blx 200b4 <_start> Differential Revision: https://reviews.llvm.org/D104701	2021-06-30 16:35:28 +07:00

... 3 4 5 6 7 ...

392681 Commits All Branches Search

392681 Commits

All Branches