llvm-project

Commit Graph

Author	SHA1	Message	Date
Tatyana Krasnukha	eecef3af2c	[lldb][test] Enable fix-its for the test case that expects them enabled Fix-its were intentionally disabled by TestBase.setUp so that incorrect expressions in tests don't pass just because Clang thinks it has a fix-it.	2020-03-05 10:31:07 +03:00
Tatyana Krasnukha	a31130f6fc	[lldb][testsuite] Create a SBDebugger instance for each test Some tests set settings and don't clean them up, this leads to side effects in other tests. The patch removes a global debugger instance with a per-test debugger to avoid such effects. From what I see, lldb.DBG was needed to determine the platform before a test is run, lldb.selected_platform is used for this purpose now. Though, this required adding a new function to the SBPlatform interface. Differential Revision: https://reviews.llvm.org/D74903	2020-03-05 10:12:54 +03:00
Jun Ma	b10deb9487	[Coroutines] Optimized coroutine elision based on reachability Differential Revision: https://reviews.llvm.org/D75440	2020-03-05 14:43:50 +08:00
David Blaikie	7a6878a72e	X86AsmBackend.cpp: #ifndef NDEBUG some only-used-in-asserts variables to fix the -Werror non-asserts build	2020-03-04 22:36:24 -08:00
Lang Hames	4b15decb60	[ORC] Remove hard dependency on libobjc when using MachOPlatform with LLJIT. The LLJIT::MachOPlatformSupport class used to unconditionally attempt to register __objc_selrefs and __objc_classlist sections. If libobjc had not been loaded this resulted in an assertion, even if no objc sections were actually present. This patch replaces this unconditional registration with a check that no objce sections are present if libobjc has not been loaded. This will allow clients to use MachOPlatform with LLJIT without requiring libobjc for non-objc code.	2020-03-04 21:49:28 -08:00
Sameer Sahasrabuddhe	42febbab91	StructurizeCFG: simplify phi nodes when possible After structurization, some phi nodes can have a single incoming edge and can be simplified away. This change runs a simplify query on all phis that are either modified or added by the structurizer. This also moves some phis closer to their use as a side benefit. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D75500	2020-03-05 10:33:15 +05:30
Craig Topper	4c7c87f245	[X86] Simplify the code at the end of lowerShuffleAsBroadcast. The original code could create a bitcast from f64 to i64 and back on 32-bit targets. This was only working because getBitcast was able to fold the casts away to avoid leaving the illegal i64 type. Now we handle the scalar case directly by broadcasting using the scalar type as the element type. Then bitcasting to the final VT. This works since we ensure the scalar type is the same size as the final VT element type. No more casts to i64. For the vector case, we cast to VT or subvector of VT. And then do the broadcast. I think this all matches what we generated before, just in a more readable way.	2020-03-04 20:45:02 -08:00
Matt Arsenault	c64ca93053	clang: Treat ieee mode as the default for denormal-fp-math The IR hasn't switched the default yet, so explicitly add the ieee attributes. I'm still not really sure how the target default denormal mode should interact with -fno-unsafe-math-optimizations. The target may have selected the default mode to be non-IEEE based on the flags or based on its true behavior, but we don't know which is the case. Since the only users of a non-IEEE mode without a flag still support IEEE mode, just reset to IEEE.	2020-03-04 23:34:02 -05:00
Philip Reames	c94a4133bb	Consistently capitalize a variable [NFC] One instance in a copy paste was pointed out in a review, fix all instances at once.	2020-03-04 20:00:08 -08:00
Michael Trent	df058699d3	Fix dyld opcode *_ADD_ADDR_IMM_SCALED error detection. Summary: Move the check for malformed REBASE_OPCODE_ADD_ADDR_IMM_SCALED and BIND_OPCODE_DO_BIND_ADD_ADDR_IMM_SCALED opcodes after the immediate has been applied to the SegmentOffset. This fixes specious errors where SegmentOffset is pointing between two sections when trying to correct the SegmentOffset value. Update the regression tests to verify the proper error message. Reviewers: pete, ab, lhames, steven_wu, jhenderson Reviewed By: pete Subscribers: hiraditya, dexonsmith, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75629	2020-03-04 19:57:45 -08:00
Igor Kudrin	cc61283bf6	[DebugInfo] Avoid crashing on an invalid section identifier. A DWARFSectionKind is read from input. It is not validated on parsing, so an unexpected value may result in reaching llvm_unreachable() in DWARFUnitIndex::getColumnHeader() when dumping the index section. Differential Revision: https://reviews.llvm.org/D75609	2020-03-05 10:54:43 +07:00
QingShan Zhang	3906ae387f	[DAGCombine] Check the uses of negated floating constant and remove the hack PowerPC hits an assertion due to somewhat the same reason as https://reviews.llvm.org/D70975. Though there are already some hack, it still failed with some case, when the operand 0 is NOT a const fp, it is another fma that with const fp. And that const fp is negated which result in multi-uses. A better fix is to check the uses of the negated const fp. If there are already use of its negated value, we will have benefit as no extra Node is added. Differential revision: https://reviews.llvm.org/D75501	2020-03-05 03:42:50 +00:00
Jim Lin	ea6eb813c7	[AVR][NFC] Use Register instead of unsigned Summary: Use Register type for variables instead of unsigned type. Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75595	2020-03-05 11:38:24 +08:00
Greg Clayton	ffe6695acf	Fix buildbots with merge that didn't happen for `4050b01ba9`.	2020-03-04 19:28:24 -08:00
Greg Clayton	4050b01ba9	Fix GSYM tests to run the yaml files and fix test failures on some machines. YAML files were not being run during lit testing as there was no lit.local.cfg file. Once this was fixed, some buildbots would fail due to a StringRef that pointed to a std::string inside of a temporary llvm::Triple object. These issues are fixed here by making a local triple object that stays around long enough so the StringRef points to valid data. Fixed memory sanitizer bot bugs as well. Differential Revision: https://reviews.llvm.org/D75390	2020-03-04 19:14:08 -08:00
hsmahesha	3fda1fde8f	AMDGPU/GlobalISel: Support llvm.trap and llvm.debugtrap intrinsics Summary: Lower trap and debugtrap intrinsics to AMDGPU machine instruction(s). Reviewers: arsenm, nhaehnle, kerbowa, cdevadas, t-tye, kzhuravl Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, yaxunl, rovka, dstuttard, tpr, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74688	2020-03-05 08:16:57 +05:30
Shengchen Kan	b3722dea3b	[X86] Add a private member function determinePaddingPrefix for X86AsmBackend Summary: X86 can reduce the bytes of NOP by padding instructions with prefixes to get a better peformance in some cases. So a private member function `determinePaddingPrefix` is added to determine which prefix is the most suitable. Reviewers: annita.zhang, reames, MaskRay, craig.topper, LuoYuanke, jyknight Reviewed By: reames Subscribers: llvm-commits, dexonsmith, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D75357	2020-03-05 09:26:33 +08:00
MaheshRavishankar	755c050200	[mlir][Linalg] Fix load/store operations generated while lower loops when output has zero rank. While lowering to loops, no indices should be used in the load/store operation if the buffer is zero-rank. Differential Revision: https://reviews.llvm.org/D75391	2020-03-04 17:04:30 -08:00
Philip Reames	f708c823f0	[X86] Relax existing instructions to reduce the number of nops needed for alignment purposes If we have an explicit align directive, we currently default to emitting nops to fill the space. As discussed in the context of the prefix padding work for branch alignment (D72225), we're allowed to play other tricks such as extending the size of previous instructions instead. This patch will convert near jumps to far jumps if doing so decreases the number of bytes of nops needed for a following align. It does so as a post-pass after relaxation is complete. It intentionally works without moving any labels or doing anything which might require another round of relaxation. The point of this patch is mainly to mock out the approach. The optimization implemented is real, and possibly useful, but the main point is to demonstrate an approach for implementing such "pad previous instruction" approaches. The key notion in this patch is to treat padding previous instructions as an optional optimization, not as a core part of relaxation. The benefit to this is that we avoid the potential concern about increasing the distance between two labels and thus causing further potentially non-local code grown due to relaxation. The downside is that we may miss some opportunities to avoid nops. For the moment, this patch only implements a small set of existing relaxations.. Assuming the approach is satisfactory, I plan to extend this to a broader set of instructions where there are obvious "relaxations" which are roughly performance equivalent. Note that this patch doesn't change which instructions are relaxable. We may wish to explore that separately to increase optimization opportunity, but I figured that deserved it's own separate discussion. There are possible downsides to this optimization (and all "pad previous instruction" variants). The major two are potentially increasing instruction fetch and perturbing uop caching. (i.e. the usual alignment risks) Specifically: * If we pad an instruction such that it crosses a fetch window (16 bytes on modern X86-64), we may cause the decoder to have to trigger a fetch it wouldn't have otherwise. This can effect both decode speed, and icache pressure. * Intel's uop caching have particular restrictions on instruction combinations which can fit in a particular way. By moving around instructions, we can both cause misses an change misses into hits. Many of the most painful cases are around branch density, so I don't expect this to be too bad on the whole. On the whole, I expect to see small swings (i.e. the typical alignment change problem), but nothing major or systematic in either direction. Differential Revision: https://reviews.llvm.org/D75203	2020-03-04 16:52:35 -08:00
Louis Dionne	2b2a1a42c0	[libc++] Mark deprecation test as UNSUPPORTED on Clang 6	2020-03-04 19:32:40 -05:00
Sam McCall	2cd33e6fe6	[clangd] Track document versions, include them with diags, enhance logs Summary: This ties to an LSP feature (diagnostic versioning) but really a lot of the value is in being able to log what's happening with file versions and queues more descriptively and clearly. As such it's fairly invasive, for a logging patch :-\ Key decisions: - at the LSP layer, we don't reqire the client to provide versions (LSP makes it mandatory but we never enforced it). If not provided, versions start at 0 and increment. DraftStore handles this. - don't propagate magically using contexts, but rather manually: addDocument -> ParseInputs -> (ParsedAST, Preamble, various callbacks) Context-propagation would hide the versions from ClangdServer, which would make producing good log messages hard - within ClangdServer, treat versions as opaque and unordered. std::string is a convenient type for this, and allows richer versions for embedders. They're "mandatory" but "null" is a reasonable default. Subscribers: ilya-biryukov, javed.absar, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75582	2020-03-05 01:22:32 +01:00
Sam McCall	e6d9b2cb92	[clangd] Remove unused+broken InvalidationError class.	2020-03-05 01:08:40 +01:00
Petr Hosek	ea086d10ce	Revert "[clang-doc] Improving Markdown Output" This reverts commit `45499f3801`, it's still failing on Windows bots.	2020-03-04 16:00:22 -08:00
Matt Arsenault	b2dcde08ad	Add constexpr to DenormalMode constructors This will allow their use in member initializers in a future commit.	2020-03-04 18:46:46 -05:00
Matt Arsenault	7459781bd9	X86: Generate mir checks in sqrt test	2020-03-04 18:46:46 -05:00
Stefan Gränitz	76c59a63bc	[ORC] Decompose LazyCallThroughManager::callThroughToSymbol() Summary: Decompose callThroughToSymbol() into findReexport(), resolveSymbol(), notifyResolved() and reportCallThroughError(). This allows derived classes to reuse the functionality while adding their own code in between. Reviewers: lhames Reviewed By: lhames Subscribers: hiraditya, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75084	2020-03-05 00:24:23 +01:00
Sam McCall	c627b120eb	[clangd] Cancel certain operations if the file changes before we start. Summary: Otherwise they can force us to build lots of snapshots that we don't need. Particularly, try to do this for operations that are frequently generated by editors without explicit user interaction, and where editing the file makes the result less useful. (Code action enumeration is a good example). https://github.com/clangd/clangd/issues/298 This doesn't return the "right" LSP error code (ContentModified) to the client, we need to teach the cancellation API to distinguish between different causes. Reviewers: kadircet Subscribers: ilya-biryukov, javed.absar, MaskRay, jkorous, arphaman, jfb, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75602	2020-03-05 00:10:07 +01:00
Craig Topper	eadea7868f	[X86] Convert vXi1 vectors to xmm/ymm/zmm types via getRegisterTypeForCallingConv rather than using CCPromoteToType in the td file Previously we tried to promote these to xmm/ymm/zmm by promoting in the X86CallingConv.td file. But this breaks when we run out of xmm/ymm/zmm registers and need to fall back to memory. We end up trying to create a non-sensical scalar to vector. This lead to an assertion. The new tests in avx512-calling-conv.ll all trigger this assertion. Since we really want to treat these types like we do on avx2, it seems better to promote them before the calling convention code gets involved. Except when the calling convention is one that passes the vXi1 type in a k register. The changes in avx512-regcall-Mask.ll are because we indicated that xmm/ymm/zmm types should be passed indirectly for the Win64 ABI before we go to the common lines that promoted the vXi1 types. This caused the promoted types to be picked up by the default calling convention code. Now we promote them earlier so they get passed indirectly as though they were xmm/ymm/zmm. Differential Revision: https://reviews.llvm.org/D75154	2020-03-04 15:02:32 -08:00
Sam McCall	2be4569719	[clangd] Fix isInsideMainFile to be aware of preamble. Reviewers: kadircet Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75460	2020-03-04 23:57:21 +01:00
shafik	37549464c1	[dsymutil] Fix template stripping in getDIENames(...) to account for overloaded operators Currently dsymutil when generating accelerator tables will attempt to strip the template parameters from names for subroutines. For some overload operators which contain < in their names e.g. operator< the current method ends up stripping the operator name as well, we just end up with the name operator in the table for each case. Differential Revision: https://reviews.llvm.org/D75545	2020-03-04 14:54:31 -08:00
Martijn Vels	b019c5c037	Partially inline basic_string copy constructor in UNSTABLE Summary: This is a recommit of https://reviews.llvm.org/D73223 where the added function accidentally ended up inside an idef block. This change splits the copy constructor up inlining short initialization, and explicitly outlining long initialization into __init_copy_ctor_external() which is the externally instantiated slow path. For unstable ABI, this has the following changes: remove basic_string(const basic_string&) remove basic_string(const basic_string&, const Allocator&) add __init_copy_ctor_external(const value_type*, size_type) Quick local benchmark for Copy: Master ``` --------------------------------------------------------------- Benchmark Time CPU Iterations --------------------------------------------------------------- BM_StringCopy_Empty 3.50 ns 3.51 ns 199326720 BM_StringCopy_Small 3.50 ns 3.51 ns 199510016 BM_StringCopy_Large 15.7 ns 15.7 ns 45230080 BM_StringCopy_Huge 1503 ns 1503 ns 464896 ``` With this change ``` --------------------------------------------------------------- Benchmark Time CPU Iterations --------------------------------------------------------------- BM_StringCopy_Empty 1.99 ns 2.00 ns 356471808 BM_StringCopy_Small 3.29 ns 3.30 ns 203425792 BM_StringCopy_Large 13.3 ns 13.3 ns 52948992 BM_StringCopy_Huge 1472 ns 1472 ns 475136 ``` Subscribers: libcxx-commits Tags: #libc Differential Revision: https://reviews.llvm.org/D75639	2020-03-04 17:52:46 -05:00
Petr Hosek	45499f3801	[clang-doc] Improving Markdown Output This change has two components. The moves the generated file for a namespace to the directory named after the namespace in a file named 'index.<format>'. This greatly improves the browsing experience since the index page is shown by default for a directory. The second improves the markdown output by adding the links to the referenced pages for children objects and the link back to the source code. Patch By: Clayton Differential Revision: https://reviews.llvm.org/D72954	2020-03-04 14:42:07 -08:00
Craig Topper	6ca96765c7	[X86] Disable commuting for the first source operand of zero masked scalar fma intrinsic instructions. I believe this is the correct fix for D75506 rather than disabling all commuting. We can still commute the remaining two sources. Differential Revision:m https://reviews.llvm.org/D75526	2020-03-04 14:35:53 -08:00
Kostya Kortchinsky	a0e86420ae	[scudo][standalone] Do not fill 32b regions at once Summary: For the 32b primary, whenever we created a region, we would fill it all at once (eg: create all the transfer batches for all the blocks in that region). This wasn't ideal as all the potential blocks in a newly created region might not be consummed right away, and it was using extra memory (and release cycles) to keep all those free blocks. So now we keep track of the current region for a given class, and how filled it is, carving out at most `MaxNumBatches` worth of blocks at a time. Additionally, lower `MaxNumBatches` on Android from 8 to 4. This lowers the randomness of blocks, which isn't ideal for security, but keeps things more clumped up for PSS/RSS accounting purposes. Subscribers: #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D75551	2020-03-04 14:22:24 -08:00
Matt Arsenault	15bf916b54	AMDGPU: Remove VOP3OpSelMods0 complex pattern Use default operand of 0 instead.	2020-03-04 17:18:22 -05:00
Frank Laub	cdc5cba721	[MLIR][Affine][NFC] Expose expandAffineMap Summary: Expose expandAffineMap so that it can be used by lowerings defined outside of MLIR core. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D75589	2020-03-04 14:17:17 -08:00
Nikita Popov	c6ff3c9bad	[InstSimplify] Constant fold icmp of gep InstSimplify can fold icmps of gep where the base pointers are the same and the offsets are constant. It does so by constructing a constant expression icmp and assumes that it gets folded -- but this doesn't actually happen, because GEP expressions can usually only be folded by the target-dependent constant folding layer. As such, we need to explicitly invoke it here. Differential Revision: https://reviews.llvm.org/D75407	2020-03-04 23:16:52 +01:00
Lei Zhang	f6981ac595	[mlir][vulkan-runner] Add basic timing for compute pipeline This commit adds timestamp query commands in Vulkan runner's compute pipeline to gain insights into how long it takes to run the compute shader. This commit also adds timing from CPU side for VkQueueSubmit and vkQueueWaitIdle. Differential Revision: https://reviews.llvm.org/D75531	2020-03-04 17:13:28 -05:00
Muhammad Omair Javaid	5583c2f2fb	Revert "[GlobalISel][Localizer] Enable intra-block localization of already-local uses." This reverts commit `e91e1df6ab`.	2020-03-05 03:12:28 +05:00
Eric Fiselier	50b8088ba2	[libc++] Un-xfail GCC test for new version	2020-03-04 17:09:42 -05:00
Matt Arsenault	9e1d2afc13	AMDGPU/GlobalISel: Don't use vector G_EXTRACT in arg lowering Create a wider source vector, and unmerge with dead defs like the legalizer. The legalization handling for G_EXTRACT is incomplete, and it's preferrable to keep everything in 32-bit pieces. We should probably start moving these functions into utils, since we have a growing number of places that do almost the same thing.	2020-03-04 16:49:01 -05:00
Matt Arsenault	f70e7dc17d	AMDGPU/GlobalISel: Switch target in argument test Since this is still largely relying on the DAG argument type lowering code, this has inherited the problem where i16 vectors have a different ABI on targets with and without legal i16. Switch to using a target with legal i16, so the i16 vector argument tests are more useful.	2020-03-04 16:40:06 -05:00
Matt Arsenault	b71203a751	GlobalISel: Move some legalizer functions to utils	2020-03-04 16:40:00 -05:00
Matt Arsenault	fb0c35fa34	GlobalISel: Set alignment on function argument stack load/store	2020-03-04 16:38:46 -05:00
Alexey Bataev	b27ff4d07d	[OPENMP50]Codegen for 'destroy' clause in depobj directive. If the destroy clause is appplied, the previously allocated memory for the dependency object must be destroyed.	2020-03-04 16:30:34 -05:00
Richard Smith	f545ede91c	Fix regression in bdad0a1: force rebuilding of StmtExpr nodes in TreeTransform if the 'dependent' flag would change.	2020-03-04 13:19:49 -08:00
Frank Laub	c4119a5b90	[MLIR][Affine][NFC] Remove obsolete and ambiguous definitions Summary: Looks like a refactor that was never completed. This change removes some unused and ambiguous definitions. Reviewed By: bondhugula, nicolasvasilache, rriddle Differential Revision: https://reviews.llvm.org/D75586	2020-03-04 13:14:25 -08:00
Louis Dionne	a27f29c6e4	[libc++] Fix typo in REQUIRES that broke the GCC-tot bot configuration Explained in https://reviews.llvm.org/D70117#inline-688897.	2020-03-04 16:11:08 -05:00
Raphael Isemann	70b8f95813	[lldb][NFC] Use C++11 for loop in ClangASTSource::FindExternalLexicalDecls	2020-03-04 12:57:05 -08:00
Zola Bridges	aa3f791fa9	[x86][SLH] Rm liveness check from data invariance check SLH had two functions named isDataInvariant and isDataInvariantLoad that checked whether the passed instruction was data invariant. For some instructions, if the EFLAGS were dead then they were considered data invariant, otherwise they were not considered data invariant. In this patch, I extracted that EFLAGS liveness check and made it explicit at every call to isDataInvariant and isDataInvariantLoad. This makes the isDataInvariant function behave more generally and preserves the liveness check behavior that SLH would like to have. Tested via llvm-lit llvm/test/CodeGen/X86/speculative-load-hardening* This is the first step in making these two data invariance checks available for non-SLH passes. The second step is to move the passes from SLH to X86InstrInfo.cpp. I'll follow up with a patch that does that. Differential Revision: https://reviews.llvm.org/D70283	2020-03-04 21:49:49 +01:00

... 2 3 4 5 6 ...

344608 Commits All Branches Search

344608 Commits

All Branches