llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	68c8b6c4cd	[mlir] Use getDenseElementBitwidth instead of Type::getElementTypeBitWidth. Summary: Some data values have a different storage width than the corresponding MLIR type, e.g. bfloat is currently stored as a double. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D72478	2020-01-09 14:51:44 -08:00
Jonas Devlieghere	58b3dec6c1	[lldb/Lua] Add lua typemaps for INOUT params	2020-01-09 14:51:13 -08:00
Matt Arsenault	10edb1d0d4	TableGen/GlobalISel: Fix pattern matching of immarg literals For arguments that are not expected to be materialized with G_CONSTANT, this was emitting predicates which could never match. It was first adding a meaningless LLT check, which would always fail due to the operand not being a register. Infer the cases where a literal should check for an immediate operand, instead of a register This avoids needing to invent a special way of representing timm literal values. Also handle immediate arguments in GIM_CheckLiteralInt. The comments stated it handled isImm() and isCImm(), but that wasn't really true. This unblocks work on the selection of all of the complicated AMDGPU intrinsics in future commits.	2020-01-09 17:37:52 -05:00
Matt Arsenault	b4a647449f	TableGen/GlobalISel: Add way for SDNodeXForm to work on timm The current implementation assumes there is an instruction associated with the transform, but this is not the case for timm/TargetConstant/immarg values. These transforms should directly operate on a specific MachineOperand in the source instruction. TableGen would assert if you attempted to define an equivalent GISDNodeXFormEquiv using timm when it failed to find the instruction matcher. Specially recognize SDNodeXForms on timm, and pass the operand index to the render function. Ideally this would be a separate render function type that looks like void renderFoo(MachineInstrBuilder, const MachineOperand&), but this proved to be somewhat mechanically painful. Add an optional operand index which will only be passed if the transform should only look at the one source operand. Theoretically it would also be possible to only ever pass the MachineOperand, and the existing renderers would check the parent. I think that would be somewhat ugly for the standard usage which may want to inspect other operands, and I also think MachineOperand should eventually not carry a pointer to the parent instruction. Use it in one sample pattern. This isn't a great example, since the transform exists to satisfy DAG type constraints. This could also be avoided by just changing the MachineInstr's arbitrary choice of operand type from i16 to i32. Other patterns have nontrivial uses, but this serves as the simplest example. One flaw this still has is if you try to use an SDNodeXForm defined for imm, but the source pattern uses timm, you still see the "Failed to lookup instruction" assert. However, there is now a way to avoid it.	2020-01-09 17:37:52 -05:00
Matt Arsenault	0ea3c7291f	GlobalISel: Handle llvm.read_register Compared to the attempt in `bdcc6d3d26`, this uses intermediate generic instructions.	2020-01-09 17:37:52 -05:00
Matt Arsenault	f33f3d98e9	DAG: Don't use unchecked dyn_cast	2020-01-09 17:37:52 -05:00
Matt Arsenault	ac53a5f1dc	GlobalISel: Fix else after return	2020-01-09 17:37:52 -05:00
Matt Arsenault	255cc5a760	CodeGen: Use LLT instead of EVT in getRegisterByName Only PPC seems to be using it, and only checks some simple cases and doesn't distinguish between FP. Just switch to using LLT to simplify use from GlobalISel.	2020-01-09 17:37:52 -05:00
Amara Emerson	cc95bb1f57	[AArch64][GlobalISel] Implement selection of <2 x float> vector splat. Also requires making G_IMPLICIT_DEF of v2s32 legal. Differential Revision: https://reviews.llvm.org/D72422	2020-01-09 14:05:35 -08:00
Eric Schweitz	016bf03ef6	[mlir] add a missing dependency for Linalg conversion We were seeing some occasional build failures that would come and go. It appeared to be this missing dependence. Differential Revision: https://reviews.llvm.org/D72419	2020-01-09 23:00:41 +01:00
Kern Handa	ea67737b16	[mlir] mlir-cpu-runner test's cblas_interface should export functions on Windows This change fixes the build on Windows, so that cblas_interface.dll exports functions correctly and an implib is created and installed correctly. Currently, LLVM cannot be consumed on Windows after it has been installed in a location because cblas_interface.lib is not created/installed, thus failing the import check in `LLVMExports.cmake`. Differential Revision: https://reviews.llvm.org/D72384	2020-01-09 22:55:46 +01:00
Alex Richardson	8c387cbea7	Add builtins for aligning and checking alignment of pointers and integers This change introduces three new builtins (which work on both pointers and integers) that can be used instead of common bitwise arithmetic: __builtin_align_up(x, alignment), __builtin_align_down(x, alignment) and __builtin_is_aligned(x, alignment). I originally added these builtins to the CHERI fork of LLVM a few years ago to handle the slightly different C semantics that we use for CHERI [1]. Until recently these builtins (or sequences of other builtins) were required to generate correct code. I have since made changes to the default C semantics so that they are no longer strictly necessary (but using them does generate slightly more efficient code). However, based on our experience using them in various projects over the past few years, I believe that adding these builtins to clang would be useful. These builtins have the following benefit over bit-manipulation and casts via uintptr_t: - The named builtins clearly convey the semantics of the operation. While checking alignment using __builtin_is_aligned(x, 16) versus ((x & 15) == 0) is probably not a huge win in readably, I personally find __builtin_align_up(x, N) a lot easier to read than (x+(N-1))&~(N-1). - They preserve the type of the argument (including const qualifiers). When using casts via uintptr_t, it is easy to cast to the wrong type or strip qualifiers such as const. - If the alignment argument is a constant value, clang can check that it is a power-of-two and within the range of the type. Since the semantics of these builtins is well defined compared to arbitrary bit-manipulation, it is possible to add a UBSAN checker that the run-time value is a valid power-of-two. I intend to add this as a follow-up to this change. - The builtins avoids int-to-pointer casts both in C and LLVM IR. In the future (i.e. once most optimizations handle it), we could use the new llvm.ptrmask intrinsic to avoid the ptrtoint instruction that would normally be generated. - They can be used to round up/down to the next aligned value for both integers and pointers without requiring two separate macros. - In many projects the alignment operations are already wrapped in macros (e.g. roundup2 and rounddown2 in FreeBSD), so by replacing the macro implementation with a builtin call, we get improved diagnostics for many call-sites while only having to change a few lines. - Finally, the builtins also emit assume_aligned metadata when used on pointers. This can improve code generation compared to the uintptr_t casts. [1] In our CHERI compiler we have compilation mode where all pointers are implemented as capabilities (essentially unforgeable 128-bit fat pointers). In our original model, casts from uintptr_t (which is a 128-bit capability) to an integer value returned the "offset" of the capability (i.e. the difference between the virtual address and the base of the allocation). This causes problems for cases such as checking the alignment: for example, the expression `if ((uintptr_t)ptr & 63) == 0` is generally used to check if the pointer is aligned to a multiple of 64 bytes. The problem with offsets is that any pointer to the beginning of an allocation will have an offset of zero, so this check always succeeds in that case (even if the address is not correctly aligned). The same issues also exist when aligning up or down. Using the alignment builtins ensures that the address is used instead of the offset. While I have since changed the default C semantics to return the address instead of the offset when casting, this offset compilation mode can still be used by passing a command-line flag. Reviewers: rsmith, aaron.ballman, theraven, fhahn, lebedev.ri, nlopes, aqjune Reviewed By: aaron.ballman, lebedev.ri Differential Revision: https://reviews.llvm.org/D71499	2020-01-09 21:48:29 +00:00
Christian Sigg	0f5f28d000	Add gdb pretty printer for MutableArrayRef, remove ConstArrayRef. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: merge_guards_bot, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72136	2020-01-09 22:43:45 +01:00
Matt Arsenault	595ac8c46e	GlobalISel: Move getLLTForMVT/getMVTForLLT As an intermediate step, some TLI functions can be converted to using LLT instead of MVT. Move this somewhere out of GlobalISel so DAG functions can use these.	2020-01-09 16:32:51 -05:00
Matt Arsenault	f937b43fdb	TableGen/GlobalISel: Address fixme Don't call computeAvailableFunctionFeatures for every instruction.	2020-01-09 16:29:44 -05:00
Matt Arsenault	fba1fbb9c7	GlobalISel: Don't assert on MoreElements creating vectors If the original type was a scalar, it should be valid to add elements to turn it into a vector. Tests included with following legalization change.	2020-01-09 16:29:44 -05:00
Matt Arsenault	767aa507a4	AMDGPU/GlobalISel: Fix argument lowering for vectors of pointers When these arguments are broken down by the EVT based callbacks, the pointer information is lost. Hack around this by coercing the register types to be the expected pointer element type when building the remerge operations.	2020-01-09 16:29:44 -05:00
Matt Arsenault	35ad66fae8	AMDGPU/GlobalISel: Widen 16-bit shift amount sources This should be legal, but will require future selection work. 16-bit shift amounts were already removed from being legal, but this didn't adjust the transformation rules.	2020-01-09 16:29:44 -05:00
Sylvestre Ledru	0c195ef7c6	phab doc: Replace or remove references to svn	2020-01-09 22:29:20 +01:00
Sylvestre Ledru	dbfc516d1f	phab doc: also document 'arc land'	2020-01-09 22:17:06 +01:00
Sylvestre Ledru	004ae200a5	phab doc: remove the svn section	2020-01-09 22:17:06 +01:00
Nathan Ridge	6a69d3c6b3	[clangd] Handle DeducedTemplateSpecializationType in TargetFinder Summary: This is a workaround for https://bugs.llvm.org/show_bug.cgi?id=42914. Once that is fixed, the handling in VisitDeducedTyped() should be sufficient. Fixes https://github.com/clangd/clangd/issues/242 Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D72119	2020-01-09 16:14:11 -05:00
Sylvestre Ledru	c348a2674b	[clang-tidy] Refresh the add_new_check.py now that we use a table + autofix Reviewers: alexfh Reviewed By: alexfh Subscribers: njames93, xazax.hun, mgorny, cfe-commits Tags: #clang, #clang-tools-extra Differential Revision: https://reviews.llvm.org/D72421	2020-01-09 22:11:18 +01:00
Eric Fiselier	61bd19206f	[libc++] Explicitly enumerate std::string external instantiations. The external instantiation of std::string is a problem for libc++. Additions and removals of inline functions in string can cause ABI breakages, including introducing new symbols. This patch aims to: (1) Make clear which functions are explicitly instatiated. (2) Prevent new functions from being accidentally instantiated. (3) Allow a migration path for adding or removing functions from the explicit instantiation over time. Although this new formulation is uglier, it is preferable from a maintainability and readability standpoint because it explicitly enumerates the functions we've chosen to expose in our ABI. Changing this list is non-trivial and requires thought and planning. (3) is achieved by making it possible to control the extern template declaration separately from it's definition. Meaning we could add a new definition to the dylib, wait for it to roll out, then add the extern template declaration to the header. Similarly, we could remove existing extern template declarations while still keeping the definition to prevent ABI breakages.	2020-01-09 15:51:02 -05:00
River Riddle	fc3367dd5e	[mlir] NFC: Move the state for managing SSA value names out of OperationPrinter and into a new class SSANameState. Summary: This reduces the complexity of OperationPrinter and simplifies the code by quite a bit. The SSANameState is now held by ModuleState. This is in preparation for a future revision that molds ModuleState into something that can be used by users for caching the printer state, as well as for implementing printAsOperand style methods. Depends On D72292 Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D72293	2020-01-09 12:48:35 -08:00
Alex Richardson	646ca7d7e7	MipsDelaySlotFiller: Update registers def-uses for BUNDLE instructions Summary: In commit `b91f239485` I updated the MipsDelaySlotFiller to skip BUNDLE instructions. However, in addition to not considering BUNDLE instructions for the delay slot, we also need to ensure that the register def-use information is updated. Not updating this information caused run-time crashes (when using the out-of-tree CHERI backend) since later definitions could be overwritten with earlier register values. Reviewers: atanasyan Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D72254	2020-01-09 20:46:02 +00:00
Alex Richardson	1444e6e2e6	Re-apply "[ELF] Allow getErrPlace() to work before Out::bufferStart is set" This time with a fix for the UBSAN failure. Differential Revision: https://reviews.llvm.org/D70659	2020-01-09 20:26:31 +00:00
Craig Topper	4e003aad42	[X86] Add ueq/one fp128 quiet compare tests. NFC The ONE expansion calls OGT/OLT libcalls which will signal for QNAN. The UEQ expansion uses unord and eq libcalls which won't signal. We should probably use those libcalls for ONE with appropriate logic. Quiet OGT/OLT/OLE/OGE have similar issue, but not sure how to fix those yet.	2020-01-09 12:22:39 -08:00
Jessica Paquette	9949b1a175	[GlobalISel][AArch64] Import + select LDRroW and STRroW patterns This adds support for selecting a large chunk of the load/store roW patterns. This is pretty much a straight port of AArch64DAGToDAGISel::SelectAddrModeWRO into GISel. The code is very similar to the XRO code. The main difference is that in the roW patterns, we want to try and fold in an extend, and possibly a shift along with it. A good portion of this patch is refactoring the existing XRO code. - Add selectAddrModeWRO - Factor out the code from selectAddrModeShiftedExtendXReg which is used by both selectAddrModeXRO and selectAddrModeWRO into selectExtendedSHL. This is similar to the function of the same name in AArch64DAGToDAGISel. - Add support for extends to the factored out code in selectExtendedSHL. - Teach getExtendTypeForInst how to handle AND masks that are intended to be used in loads/stores (necessary for this addressing mode.) - Make getExtendTypeForInst not static because moving it made an annoying diff and I wanted to have the WRO/XRO functions close to each other while I was writing the code. Differential Revision: https://reviews.llvm.org/D72426	2020-01-09 12:15:56 -08:00
serge-sans-paille	b35f5d4914	[clang] Enforce triple in mempcpy test Fixes http://lab.llvm.org:8011/builders/llvm-clang-win-x-armv7l/builds/2597	2020-01-09 21:09:15 +01:00
Eric Astor	1c545f6dbc	[ms] [X86] Use "P" modifier on all branch-target operands in inline X86 assembly. Summary: Extend D71677 to apply to all branch-target operands, rather than special-casing call instructions. Also add a regression test for llvm.org/PR44272, since this finishes fixing it. Reviewers: thakis, rnk Reviewed By: thakis Subscribers: merge_guards_bot, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72417	2020-01-09 14:55:03 -05:00
Ganesh Gopalasubramanian	3408940f73	[X86] AMD Znver2 (Rome) Scheduler enablement The patch gives out the details of the znver2 scheduler model. There are few improvements with respect to execution units, latencies and throughput when compared with znver1. The tests that were present for znver1 for llvm-mca tool were replicated. The latencies, execution units, timeline and throughput information are updated for znver2. Reviewers: craig.topper, Simon Pilgrim Differential Revision: https://reviews.llvm.org/D66088	2020-01-10 00:44:59 +05:30
Sean Fertile	1a1dbea24d	[PowerPC] The VK_PLT symbolref modifier is only used on 32-bit ELF. [NFC] Fix a conditional that guarded code for execution only on 32-bit ELF by checking that the Subtarget was not 64-bit and not-Darwin. By adding a new target ABI (AIX), the condition is no longer correct. This code is dead for AIX, due to a 'report_fatal_error' for thread local storage usage earlier in the pipeline, but needs to be modifed as part of Darwins removal from the PowerPC backend.	2020-01-09 14:07:07 -05:00
Erik Pilkington	183b5d38d7	[demangle] Copy back some NFC commits from LLVM	2020-01-09 10:27:24 -08:00
Craig Topper	b705fe5686	[TargetLowering][X86] TeachSimplifyDemandedBits to handle cases where only the sign bit is demanded from a SETCC and can be passed through If we're doing a compare that only tests the sign bit and only the sign bit is demanded, we can just bypass the node. This removes one of the blend dependencies in our v2i64->v2f32 uint_to_fp codegen on pre-sse4.2 targets. Differential Revision: https://reviews.llvm.org/D72356	2020-01-09 10:21:25 -08:00
Bruno Ricci	002be6cfa2	[Support][NFC] Add an explicit unit test for Process::getPageSize() It turns out that it was only tested indirectly. For now test only on Linux X86-64 and aarch64.	2020-01-09 18:14:05 +00:00
Sanjay Patel	460cbabe17	[x86] add tests for 2-way splat copy; NFC Based on code in PR42024: https://bugs.llvm.org/show_bug.cgi?id=42024	2020-01-09 13:08:16 -05:00
Ulrich Weigand	b51fa8670f	[SystemZ] Fix matching another pattern for nxgrk (PR44496) SystemZDAGToDAGISel::Select will attempt to split logical instruction with a large immediate constant. This must not happen if the result matches one of the z15 combined operations, so the code checks for those. However, one of them was missed, causing invalid code to be generated in the test case for PR44496.	2020-01-09 19:06:22 +01:00
Jonas Devlieghere	7bbd4076c1	[lldb/SWIG] Undo incorrect substitution The Python directory for the script interpreter is still capitalized.	2020-01-09 09:55:39 -08:00
Bruno Ricci	2fe45e029d	[Support][NFC] Make some helper functions "static" in Memory.inc	2020-01-09 17:46:21 +00:00
Simon Moll	356b33516c	[NFC,format] Sort switch cases alphabetically This patch brings the switch cases of `llvm/lib/Support/Triple.cpp` back into alphabetical order. This was noted during the the review of https://reviews.llvm.org/D69103 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D72452	2020-01-09 18:37:24 +01:00
Bruno Ricci	ed6daa2e1d	[Support][NFC] Add a comment about the semantics of MF_HUGE_HINT flag	2020-01-09 17:34:18 +00:00
Simon Tatham	06d07ec4a3	[Clang] Handle target-specific builtins returning aggregates. Summary: A few of the ARM MVE builtins directly return a structure type. This causes an assertion failure at code-gen time if you try to assign the result of the builtin to a variable, because the `RValue` created in `EmitBuiltinExpr` from the `llvm::Value` produced by codegen is always made by `RValue::get()`, which creates a non-aggregate `RValue` that will fail an assertion when `AggExprEmitter::withReturnValueSlot` calls `Src.getAggregatePointer()`. A similar failure occurs if you try to use the struct return value directly to extract one field, e.g. `vld2q(address).val[0]`. The existing code-gen tests for those MVE builtins pass the returned structure type directly to the C `return` statement, which apparently managed to avoid that particular code path, so we didn't notice the crash. Now `EmitBuiltinExpr` checks the evaluation kind of the builtin's return value, and does the necessary handling for aggregate returns. I've added two extra test cases, both of which crashed before this change. Reviewers: dmgreen, rjmccall Reviewed By: rjmccall Subscribers: kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D72271	2020-01-09 17:28:37 +00:00
Jonas Devlieghere	5e0bf6772e	[lldb/SWIG] Fix capitalization for case sensitive file systems. When moving the Python directory I renamed it to python (lowercase) but didn't update the python.swig file.	2020-01-09 09:23:01 -08:00
Kadir Cetinkaya	189aa5b7a4	[clangd] Adjust diagnostic range to be inside main file Summary: LSP requires diagnostics to lay inside main file. In clangd we keep diagnostics in three different cases: - already in main file - adjusted to a header included in main file - has a note covering some range in main file In the last case, we were not adjusting the diagnostics range to be in main file, therefore these diagnostics ended up pointing some arbitrary locations. This patch fixes that issue by adjusting the range of diagnostics to be the first note inside main file when converting to LSP. Reviewers: ilya-biryukov Subscribers: MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D72458	2020-01-09 18:02:33 +01:00
Jonas Devlieghere	6498aff249	[lldb/Bindings] Move bindings into their own subdirectory All the code required to generate the language bindings for Python and Lua lives under scripts, even though the majority of this code aren't scripts at all, and surrounded by scripts that are totally unrelated. I've reorganized these files and moved everything related to the language bindings into a new top-level directory named bindings. This makes the corresponding files self contained and much more discoverable. Differential revision: https://reviews.llvm.org/D72437	2020-01-09 08:44:34 -08:00
Sven van Haastregt	241f335b26	[OpenCL][Docs] Rename C++ for OpenCL label To avoid potential confusion with OpenCL C++.	2020-01-09 16:31:45 +00:00
serge-sans-paille	cee4a1c957	Improve support of GNU mempcpy - Lower to the memcpy intrinsic - Raise warnings when size/bounds are known Differential Revision: https://reviews.llvm.org/D71374	2020-01-09 17:31:00 +01:00
Whitney Tsang	d27a15fed7	[NFCI][LoopUnrollAndJam] Changing LoopUnrollAndJamPass to a function pass. Summary: This patch changes LoopUnrollAndJamPass to a function pass, and keeps the loops traversal order same as defined in FunctionToLoopPassAdaptor LoopPassManager.h. The next patch will change the loop traversal to outer to inner order, so more loops can be transform. Discussion in llvm-dev mailing list: https://groups.google.com/forum/#!topic/llvm-dev/LF4rUjkVI2g Reviewer: dmgreen, jdoerfert, Meinersbur, kbarton, bmahjour, etiotto Reviewed By: dmgreen Subscribers: hiraditya, zzheng, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D72230	2020-01-09 16:18:36 +00:00
Jonas Devlieghere	45c971f7ee	[lldb/Lua] Make lldb.debugger et al available to Lua The Python script interpreter makes the current debugger, target, process, thread and frame available to interactive scripting sessions through convenience variables. This patch does the same for Lua. Differential revision: https://reviews.llvm.org/D71801	2020-01-09 08:15:41 -08:00

1 2 3 4 5 ...

338964 Commits All Branches Search

338964 Commits

All Branches