llvm-project

Commit Graph

Author	SHA1	Message	Date
Petar Avramovic	2b66d32f3f	[MIPS GlobalISel] Select count leading zeros llvm.ctlz.<type> intrinsic has additional i1 argument is_zero_undef, it tells whether zero as the first argument produces a defined result. MIPS clz instruction returns 32 for zero input. G_CTLZ is generated from llvm.ctlz.<type> (<type> <src>, i1 false) intrinsics, clang generates these intrinsics from __builtin_clz and __builtin_clzll. G_CTLZ_ZERO_UNDEF can also be generated from llvm.ctlz with true as second argument. It is also traditionally part of and many algorithms that are now predicated on avoiding zero-value inputs. Add narrow scalar for G_CTLZ (algorithm uses G_CTLZ_ZERO_UNDEF). Lower G_CTLZ_ZERO_UNDEF and select G_CTLZ for MIPS32. Differential Revision: https://reviews.llvm.org/D73214	2020-01-27 09:43:38 +01:00
Fangrui Song	941f20c3bd	[MachineVerifier] Simplify and delete LLVM_VERIFY_MACHINEINSTRS from a comment. NFC The environment variable has been unused since r228079.	2020-01-27 00:31:23 -08:00
Martin Storsjö	b780df052d	[libunwind] Treat assembly files as C on mingw When targeting mingw, current CMake (3.16) fails to get the right flags for assembly source files for windows gnu/clang targets (see https://gitlab.kitware.com/cmake/cmake/merge_requests/4287 for a fix), causing builds to fail due to `-fPIC` being unsupported in clang for mingw targets. In the meantime, restore the behaviour from before `c48974ffd7` selectively on mingw targets, treating the assembly files as C. Differential Revision: https://reviews.llvm.org/D73436	2020-01-27 09:04:58 +02:00
Qiu Chaofan	59d690850e	[NFC] Fix typo in Clang docs	2020-01-27 11:37:43 +08:00
Wang, Pengfei	17b8f96d65	[FPEnv] Divide macro INSTRUCTION into INSTRUCTION and DAG_INSTRUCTION, and macro FUNCTION likewise. NFCI. Some functions like fmuladd don't really have a node; we should separate their declarations from those that have a node to avoid introducing fake nodes. Differential Revision: https://reviews.llvm.org/D72871	2020-01-27 10:38:05 +08:00
Saar Raz	9c24fca2a3	[Concepts] Fix incorrect TemplateArgs for introduction of local parameters The wrong set of TemplateArgs was being provided to addInstantiatedParametersToScope. Caused bug #44658.	2020-01-27 00:59:37 +02:00
Lei Zhang	29e411b3d6	[mlir] Expose getNearestSymbolTable as SymbolTable class method This is a generally useful utility function for interacting with symbol tables. Differential Revision: https://reviews.llvm.org/D73433	2020-01-26 17:35:26 -05:00
Saar Raz	a8d096aff6	[Concepts] Add missing null check to transformConstructor Caused bug 44671 when transforming a constructor with a type-constraint with no explicit template args.	2020-01-27 00:15:42 +02:00
Martin Storsjö	0e0c65264a	[libunwind] Fix building standalone after `c48974ffd7` After this change, we need to explicitly list the languages the project uses, otherwise the assembly source files won't get built at all. Previously (before that commit), the assembly source files were simply treated as C. The toplevel llvm CMakeLists.txt adds these three languages, so when building libunwind integrated as part of that, it works fine.	2020-01-26 22:12:40 +02:00
Roman Lebedev	76fcf900d5	[X86][BdVer2] Polish LEA instruction scheduling info Based on exhaustive llvm-exegesis measurements. There may still be some imperfections for LEA16r/LEA32r. Much as was observed in D68646, I'm also measuring some outliers with some specific registers.	2020-01-26 22:17:27 +03:00
Roman Lebedev	31019dfdf5	[NFC][MCA] Re-autogenerate all check lines in all X86 MCA tests Some whitespace issues have crept in, and some znver2 check lines were missing.	2020-01-26 22:17:26 +03:00
Simon Pilgrim	f99ef5455a	[InstCombine] Add extra shift(c1,add(c2,y)) tests for PR15141	2020-01-26 19:04:12 +00:00
Simon Pilgrim	fa19d67a2a	[X86][AVX] Extend combineCommutableSHUFP to handle v8f32 and v16f32 commutable shufps patterns	2020-01-26 19:04:12 +00:00
Saar Raz	5043962dd3	[Concepts] Fix parsing of scope specifier in compound-requirements, add more tests for scope specifiers in type-constraints The code for parsing of type-constraints in compound-requirements was not adapted for the new TryAnnotateTypeConstraint which caused compound-requirements with scope specifiers to ignore them. Also add regression tests for scope specifiers in type-constraints in more contexts.	2020-01-26 20:46:53 +02:00
Stephen Kelly	f29204d388	NFC: Implement AST node skipping in ParentMapContext Summary: This allows ASTContext to store only one parent map, rather than storing an entire parent map for each traversal mode used. This is therefore a partial revert of commit `0a717d5b` (Make it possible control matcher traversal kind with ASTContext, 2019-12-06). Reviewers: aaron.ballman, rsmith, rnk Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73388	2020-01-26 18:35:44 +00:00
Guillaume Chatelet	cc034a5883	[IR] masked gather/scatter alignment should be set Summary: masked_load and masked_store instructions require the alignment to be specified and a power of two. It seems to me that this requirement applies to masked_gather and masked_scatter as well. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73179	2020-01-26 18:51:36 +01:00
Lei Zhang	8d6884a15e	[mlir][spirv] Create builtin variable in nearest symbol table This commit changes the logic of `getBuiltinVariableValue` to get or create the builtin variable in the nearest symbol table. This will allow us to use this function in other partial conversion cases where we haven't created the spv.module yet. Differential Revision: https://reviews.llvm.org/D73416	2020-01-26 11:00:49 -05:00
Lei Zhang	09f9deaff2	[mlir][spirv] NFC: simplify load/store builder call sites This commit introduces default values for load/store builders to simplify builder call sites. Differential Revision: https://reviews.llvm.org/D73419	2020-01-26 10:45:42 -05:00
Lei Zhang	91d6655a29	[mlir][spirv] NFC: expose builtin func op conversion pattern This commit exposes the func op conversion pattern via a new `populateBuiltinFuncToSPIRVPatterns` function from the standard to SPIR-V conversion pass. This is structurally better given that func op belongs to the builtin dialect. More importantly, this makes the pattern reusable by other dialect-to-SPIR-V conversions, as other dialects may well adopt the builtin func op instead of having their own. Besides, it's very common to use func ops as test wrappers in lit tests, so test passes will need to handle func ops too. Differential Revision: https://reviews.llvm.org/D73421	2020-01-26 10:42:06 -05:00
Lei Zhang	60d541e1b9	[mlir][spirv] Relax verification to allow flexible placement Thus far certain SPIR-V ops have been required to be in spv.module. While this provides strong verification to catch unexpected errors, it's quite rigid and makes progressive lowering difficult. Sometimes we would like to partially lower ops from other dialects, which may involve creating ops like global variables that should be placed in other module-like ops. So this commit relaxes the requirement of such SPIR-V ops' scope to module-like ops. Similarly for function-like ops. Differential Revision: https://reviews.llvm.org/D73415	2020-01-26 10:39:45 -05:00
Lei Zhang	ae21e37eb4	[mlir][spirv] Add spv.GroupNonUniformElect and spv.GroupNonUniformIAdd Differential Revision: https://reviews.llvm.org/D73349	2020-01-26 10:20:40 -05:00
Simon Pilgrim	377e86d12e	[X86][AVX] Add tests showing combineCommutableSHUFP failure to handle v8f32 and v16f32 commutable shufps patterns	2020-01-26 14:36:24 +00:00
Simon Pilgrim	1a81b296cd	[X86][SSE] combineCommutableSHUFP - permilps(shufps(load(),x)) --> permilps(shufps(x,load())) Pull out combineTargetShuffle code added in rG3fd5d1c6e7db into a helper function and extend it to handle shufps(shufps(load(),x),y) and shufps(y,shufps(load(),x)) cases as well.	2020-01-26 14:36:23 +00:00
Serge Pavlov	4aea70ed32	[FPEnv] Extended FPOptions with new attributes This change added two new attributes, rounding mode and exception behavior, to the structure FPOptions. These attributes allow more flexible treatment of a specific floating point environment than is provided by #pragma STDC FENV_ACCESS. Differential Revision: https://reviews.llvm.org/D65994	2020-01-26 21:03:53 +07:00
Simon Pilgrim	4a5f9d9faf	[TargetLowering] Respect recursive depth in SimplifyDemandedBits call to ComputeNumSignBits	2020-01-26 10:01:56 +00:00
Maheaha Shivamallappa	66f93071cd	AMDGPU/GlobalISel: Clean-up code around ISel for Intrinsics. Summary: A minor code clean-up around ISel for intrinsic llvm.amdgcn.end.cf() Reviewers: arsenm, mshivama Reviewed By: arsenm Tags: #llvm Differential Revision: https://reviews.llvm.org/D73358	2020-01-26 14:09:31 +05:30
Fangrui Song	70389be7a0	[ELF][PPC32] Support range extension thunks with addends * Generalize the code added in D70637 and D70937. We should eventually remove the EM_MIPS special case. * Handle R_PPC_LOCAL24PC the same way as R_PPC_REL24. Reviewed By: Bdragon28 Differential Revision: https://reviews.llvm.org/D73424	2020-01-25 22:32:42 -08:00
George Burgess IV	2f45a93edf	[Support] `const`ify a method; NFC Pointed out by Stepan on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2020-January/138617.html	2020-01-25 21:48:04 -08:00
Mehdi Amini	308571074c	Mass update the MLIR license header to mention "Part of the LLVM project" This is an artifact from merging MLIR into LLVM, the file headers are now aligned with the rest of the project.	2020-01-26 03:58:30 +00:00
Craig Topper	3fdd435a4b	[X86] Use a macro to convert X86ISD names to strings in getTargetNodeName. Every case in the switch had a string version of themselves. Two of them had a typo that used : instead of :: By using a macro we can automate the string creation and avoid the possibility of typos like this. This is similar to what is done on the AMDGPU target.	2020-01-25 18:27:29 -08:00
Fangrui Song	837e8a9c0c	[ELF][PPC32] Support canonical PLT -fno-pie produces a pair of non-GOT-non-PLT relocations R_PPC_ADDR16_{HA,LO} (R_ABS) referencing external functions. ``` lis 3, func@ha la 3, func@l(3) ``` In a -no-pie/-pie link, if func is not defined in the executable, a canonical PLT entry (st_value>0, st_shndx=0) will be needed. References to func in shared objects will be resolved to this address. -fno-pie -pie should fail with "can't create dynamic relocation ... against ...", so we just need to think about -no-pie. On x86, the PLT entry passes the JMP_SLOT offset to the rtld PLT resolver. On x86-64: the PLT entry passes the JUMP_SLOT index to the rtld PLT resolver. On ARM/AArch64: the PLT entry passes &.got.plt[n]. The PLT header passes &.got.plt[fixed-index]. The rtld PLT resolver can compute the JUMP_SLOT index from the two addresses. For these targets, the canonical PLT entry can just reuse the regular PLT entry (in PltSection). On PPC32: PltSection (.glink) consists of `b PLTresolve` instructions and `PLTresolve`. The rtld PLT resolver depends on r11 having been set up to the .plt (GotPltSection) entry. On PPC64 ELFv2: PltSection (.glink) consists of `__glink_PLTresolve` and `bl __glink_PLTresolve`. The rtld PLT resolver depends on r12 having been set up to the .plt (GotPltSection) entry. We cannot reuse a `b PLTresolve`/`bl __glink_PLTresolve` in PltSection as a canonical PLT entry. PPC64 ELFv2 avoids the problem by using TOC for any external reference, even in non-pic code, so the canonical PLT entry scenario should not happen in the first place. For PPC32, we have to create a PLT call stub as the canonical PLT entry. The code sequence sets up r11. Reviewed By: Bdragon28 Differential Revision: https://reviews.llvm.org/D73399	2020-01-25 17:56:37 -08:00
Saar Raz	713562f548	[Concepts] Transform constraints of non-template functions to ConstantEvaluated We would previously try to evaluate atomic constraints of non-template functions as-is, and since they are now unevaluated at first, this would cause incorrect evaluation (bugs #44657, #44656). Substitute into atomic constraints of non-template functions as we would atomic constraints of template functions, in order to rebuild the expressions in a constant-evaluated context.	2020-01-25 23:00:24 +02:00
Simon Pilgrim	3daa71ee00	[SelectionDAG] ComputeNumSignBits - add DemandedElts support for MIN/MAX ops	2020-01-25 20:21:14 +00:00
Fangrui Song	deb5819d62	[ELF] Rename relocateOne() to relocate() and pass `Relocation` to it Symbol information can be used to improve out-of-range/misalignment diagnostics. It also helps R_ARM_CALL/R_ARM_THM_CALL which has different behaviors with different symbol types. There are many (67) relocateOne() call sites used in thunks, {Arm,AArch64}errata, PLT, etc. Rename them to `relocateNoSym()` to be clearer that there is no symbol information. Reviewed By: grimar, peter.smith Differential Revision: https://reviews.llvm.org/D73254	2020-01-25 12:00:18 -08:00
Simon Pilgrim	481b79668c	[X86] Add tests showing ComputeNumSignBits's failure to use DemandedElts for MIN/MAX opcodes	2020-01-25 19:28:57 +00:00
Simon Pilgrim	3f8916b2e8	[SelectionDAG] ComputeNumSignBits - add support for rotate non-uniform vector amounts	2020-01-25 19:15:05 +00:00
Simon Pilgrim	e3c26a9d1b	[SelectionDAG] ComputeNumSignBits - add support for rotate uniform vector amounts	2020-01-25 18:55:47 +00:00
Simon Pilgrim	435a60a5af	[X86] Add tests showing ComputeNumSignBits's failure to see through rotate vector amounts	2020-01-25 18:24:51 +00:00
Jacques Pienaar	e47b561398	[mlir] Revert MSVC specific part of whole_archive_link Revert the MSVC specific parts in whole_archive_link to previous form to potentially address https://bugs.llvm.org/show_bug.cgi?id=44660.	2020-01-25 09:56:04 -08:00
Simon Pilgrim	c8de7c8f50	[TargetLowering] SimplifyDemandedBits - Remove ashr if all our demandedbits already match the sign bit Differential Revision: https://reviews.llvm.org/D73412	2020-01-25 17:36:46 +00:00
Jacques Pienaar	e298e21650	[mlir] Bootstrap doxygen config Add basic doxygen config following clang and llvm example with minimal changes.	2020-01-25 09:31:59 -08:00
Francis Visoiu Mistrih	0f34ea5dc3	[perf-training] Update ' (in-process)' prefix handling A recent change added a new line after the prefix, so it's now part of the prefix list.	2020-01-25 09:14:24 -08:00
serge-sans-paille	6d485ff455	Improve static checks for sprintf and __builtin___sprintf_chk Implement a pessimistic evaluator of the minimal required size for a buffer based on the format string, and couple that with the fortified version to emit a warning when the buffer size is lower than the lower bound computed from the format string. Differential Revision: https://reviews.llvm.org/D71566	2020-01-25 18:10:34 +01:00
Sam McCall	d08563486e	[clangd] Make Notification a little safer. I just fixed a test involving a similar Notification class: `18e6a65bae` The pattern (notify() on one thread, wait() and then destroy the Notification on the other) seems innocuous enough. I'm not sure we actually use it in clangd, but better safe than sorry.	2020-01-25 15:31:55 +01:00
Sam McCall	18e6a65bae	[Support] Fix race in threading test, found by TSan	2020-01-25 15:22:12 +01:00
Tom Stellard	cb297050bb	AMDGPU/SILoadStoreOptimizer: Fix uninitialized variable error This was introduced by `86c944d790` and caught by the sanitizer-x86_64-linux-fast bot.	2020-01-24 21:53:05 -08:00
Jonas Devlieghere	1c90ce0c76	[lldb/Test] Disable hardware check on arm/aarch64 BreakpointSites know they're backed by hardware based on whether the "hardware index" is set. This does not appear to be done for arm/aarch64. https://llvm.org/PR44659	2020-01-24 20:54:18 -08:00
Jonas Devlieghere	1ed561aa4b	[lldb/Test] Update minidebuginfo-set-and-hit-breakpoint.test Update test to account for the new 'hardware' field between 'resolved' and 'hit count'.	2020-01-24 20:47:31 -08:00
Matt Arsenault	fe9765762c	AMDGPU: Generate test checks	2020-01-24 23:25:57 -05:00
Tom Stellard	86c944d790	AMDGPU/SILoadStoreOptimizer: Improve merging of out of order offsets Summary: This improves merging of sequences like: store a, ptr + 4; store b, ptr + 8; store c, ptr + 12; store d, ptr + 16; store e, ptr + 20; store f, ptr. Prior to this patch the basic block was scanned in order to find instructions to merge and the above sequence would be transformed to: store4 <a, b, c, d>, ptr + 4; store e, ptr + 20; store f, ptr. With this change, we now sort all the candidate merge instructions by their offset, so instructions are visited in offset order rather than in the order they appear in the basic block. We now transform this sequence into: store4 <f, a, b, c>, ptr; store2 <d, e>, ptr + 16. Another benefit of this change is that since we have sorted the mergeable lists by offset, we can easily check if an instruction is mergeable by checking the offset of the instruction that comes before or after it in the sorted list. Once we determine an instruction is not mergeable we can remove it from the list and avoid having to do the more expensive mergeability checks. Reviewers: arsenm, pendingchaos, rampitec, nhaehnle, vpykhtin Reviewed By: arsenm, nhaehnle Subscribers: kerbowa, merge_guards_bot, kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65966	2020-01-24 19:45:56 -08:00
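
The offset-sorted merging described in commit `86c944d790` can be illustrated with a small standalone sketch. This is not the SILoadStoreOptimizer code: the `Store` and `MergedStore` types, the `mergeByOffset` function, the fixed 4-byte element size, and the greedy grouping up to four elements are all assumptions made for illustration, and the real pass has to respect many more constraints (aliasing, instruction classes, target limits).

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical model of a scalar store: one 4-byte value written at a byte
// offset from a shared base pointer.
struct Store {
  char name;           // label of the stored value (a, b, c, ...)
  std::int64_t offset; // byte offset from the base pointer
};

// Hypothetical model of a merged store: values written back to back,
// starting at `offset`.
struct MergedStore {
  std::int64_t offset;
  std::vector<char> values;
};

// Sort candidates by offset, then greedily extend the current group while the
// next store is exactly adjacent and the group has room left.
std::vector<MergedStore> mergeByOffset(std::vector<Store> stores,
                                       std::size_t maxWidth = 4,
                                       std::int64_t elemSize = 4) {
  std::stable_sort(stores.begin(), stores.end(),
                   [](const Store &a, const Store &b) {
                     return a.offset < b.offset;
                   });

  std::vector<MergedStore> merged;
  for (const Store &s : stores) {
    if (!merged.empty()) {
      MergedStore &last = merged.back();
      const std::int64_t nextOffset =
          last.offset + elemSize * static_cast<std::int64_t>(last.values.size());
      if (last.values.size() < maxWidth && s.offset == nextOffset) {
        last.values.push_back(s.name); // adjacent: extend the current group
        continue;
      }
    }
    merged.push_back(MergedStore{s.offset, {s.name}}); // start a new group
  }
  return merged;
}
```

Feeding the six stores from the commit message (a at offset 4 through e at offset 20, then f at offset 0) into `mergeByOffset` yields one group of four starting at offset 0 and one group of two starting at offset 16, which corresponds to the `store4 <f, a, b, c>, ptr` and `store2 <d, e>, ptr + 16` result the message describes.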

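Commit `2b66d32f3f` in the table above mentions narrowing G_CTLZ with an algorithm that uses G_CTLZ_ZERO_UNDEF. The sketch below shows one plausible way a 64-bit count-leading-zeros can be built from 32-bit pieces; it is plain C++ rather than GlobalISel code, and the helper names `ctlz32` and `ctlz64` are hypothetical. The 32-bit helper returns 32 for a zero input, matching what the commit message says about the MIPS clz instruction.

```cpp
#include <cstdint>

// Portable 32-bit count-leading-zeros, defined for zero (returns 32).
unsigned ctlz32(std::uint32_t x) {
  unsigned n = 0;
  for (std::uint32_t mask = 0x80000000u; mask != 0 && (x & mask) == 0;
       mask >>= 1) {
    ++n;
  }
  return n;
}

// 64-bit count-leading-zeros assembled from two 32-bit halves.
unsigned ctlz64(std::uint64_t x) {
  const std::uint32_t hi = static_cast<std::uint32_t>(x >> 32);
  const std::uint32_t lo = static_cast<std::uint32_t>(x);
  if (hi != 0) {
    // The high half is known to be non-zero here, so a "zero is undefined"
    // count would already be sufficient on this path.
    return ctlz32(hi);
  }
  // The high half contributes 32 leading zeros; the low half may itself be
  // zero, so this path needs the variant that is defined for zero.
  return 32 + ctlz32(lo);
}
```

For example, `ctlz64(0)` evaluates to 64 and `ctlz64(1)` to 63. The split shows why the zero-undefined form is enough for the high half: it is only consulted after being checked against zero.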

340539 Commits

All Branches