llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	26fa33755f	[InstCombine] Simplify select of cmpxchg transform Rather than converting to a dummy select with equal true and false ops, just directly return the resulting value. As a side-effect, this fixes missing DCE of the previously replaced operand.	2020-03-29 18:57:32 +02:00
Florian Hahn	99913ef3d1	[OpenMP] set_bits iterator yields unsigned elements, no reference (NFC). BitVector::set_bits() returns an iterator range yielding unsinged elements, which always will be copied while const & gives the impression that there will be no copy. Newer version of clang complain: warning: loop variable 'SetBitsIt' is always a copy because the range of type 'iterator_range<llvm::BitVector::const_set_bits_iterator>' (aka 'iterator_range<const_set_bits_iterator_impl<llvm::BitVector> >') does not return a reference [-Wrange-loop-analysis] Reviewers: jdoerfert, rnk Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D77010	2020-03-29 17:08:13 +01:00
Nikita Popov	28f67bd5c5	[InstCombine] Fix worklist management in varargs transform Add a replaceUse() helper to mirror replaceOperand() for the rare cases where we're working directly on uses. NFC apart from worklist order changes.	2020-03-29 18:04:12 +02:00
Nikita Popov	6f07a9e80a	[InstCombine] Erase original add when creating saddo Usually when we replaceInstUsesWith() we also return the original instruction, and InstCombine will take care of erasing it. Here we don't do that, so we need to manually erase it. NFC apart from worklist order changes.	2020-03-29 18:01:32 +02:00
Nikita Popov	1e363023b8	[InstCombine] Use replaceOperand() in a few more places To make sure the old operands get DCEd. NFC apart from worklist order changes.	2020-03-29 18:01:00 +02:00
Simon Pilgrim	7734e4b3a3	[X86][AVX] Combine 128-bit lane shuffles with a zeroable upper half to EXTRACT_SUBVECTOR (PR40720) As explained on PR40720, EXTRACTF128 is always as good/better than VPERM2F128, and we can use the implicit zeroing of the upper half. I've added some extra tests to vector-shuffle-combining-avx2.ll to make sure we don't lose coverage.	2020-03-29 16:41:59 +01:00
Simon Pilgrim	da4c7db793	[X86] Rename matchShuffleAsByteRotate to matchShuffleAsElementRotate. NFC. This was an inner helper function for the real matchShuffleAsByteRotate function, but it is more generic and is used directly for VALIGN lowering which doesn't work at the byte level.	2020-03-29 16:41:58 +01:00
Simon Pilgrim	10439f9e32	[X86][AVX] Add X86ISD::VALIGN target shuffle decode support Allows us to combine VALIGN instructions with other shuffles - the combiner doesn't create VALIGN yet though.	2020-03-29 16:41:58 +01:00
Florian Hahn	49d00824bb	[VPlan] Use one VPWidenRecipe per original IR instruction. (NFC). This patch changes VPWidenRecipe to only store a single original IR instruction. This is the first required step towards modeling it's operands as VPValues and also towards breaking it up into a VPInstruction. Discussed as part of D74695. Reviewers: Ayal, gilr, rengolin Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D76988	2020-03-29 13:47:28 +01:00
Nikita Popov	6ba6351072	[PostOrderIterator] Use SmallVector to store stack; NFC We use a SmallPtrSet to track visited nodes, use a SmallVector of the same size for the stack.	2020-03-29 14:29:02 +02:00
Simon Pilgrim	a7115d51be	[X86] X86CallFrameOptimization - generalize slow push code path Replace the explicit isAtom() \|\| isSLM() test with the more general (and more specific) slowTwoMemOps() check to avoid the use of the PUSHrmm push from memory case. This is actually very tricky to test in anything but quite complex code, but the atomic-idempotent.ll tests seem to be the most straightforward to use. Differential Revision: https://reviews.llvm.org/D76239	2020-03-29 11:01:59 +01:00
Richard Diamond	4bf015c035	[AlignmentFromAssumptions] Fix a SCEV assertion resulting from address space differences. Summary: On targets with different pointer sizes, -alignment-from-assumptions could attempt to create SCEV expressions which use different effective SCEV types. The provided test illustrates the issue. In `getNewAlignment`, AASCEV would be the (only) alloca, which would have an effective SCEV type of i32. But PtrSCEV, the GEP in this case, due to being in the flat/default address space, will have an effective SCEV of i64. This patch resolves the issue by truncating PtrSCEV to AASCEV's effective type. Reviewers: hfinkel, jdoerfert Reviewed By: jdoerfert Subscribers: jvesely, nhaehnle, hiraditya, javed.absar, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75471	2020-03-29 01:26:31 -05:00
Craig Topper	c0aa97b632	[X86] Add cost model test cases for fmin/fmax reduction.	2020-03-28 17:12:49 -07:00
Fangrui Song	fc93787d7e	[MC][PowerPC] Make .reloc support arbitrary relocation types Generalizes `ad7199f3e6` (R_PPC_NONE/R_PPC64_NONE).	2020-03-28 17:04:31 -07:00
Matt Arsenault	9564f46766	AMDGPU: Make use of default operands	2020-03-28 17:33:29 -04:00
Benjamin Kramer	dd030036f0	Put back initializers that were dropped in `0ab5b5b858` Found by msan.	2020-03-28 22:06:12 +01:00
Benjamin Kramer	ba2e72c54e	[MDBuilder] Don't use stable sort for sorting integers.	2020-03-28 21:19:46 +01:00
Nikita Popov	2215dcf1d7	[InstCombine] Remove unreachable blocks before DCE Dropping unreachable code may reduce use counts on other instructions, so it's better to do this earlier rather than later. NFC-ish, may only impact worklist order.	2020-03-28 21:19:16 +01:00
Nikita Popov	97cc1275c7	[InstCombine] Merge two functions; NFC Merge AddReachableCodeToWorklist() into prepareICWorklistFromFunction(). It's one logical step, and this makes it easier to move code.	2020-03-28 21:19:16 +01:00
Benjamin Kramer	d3b6e1f1f9	[ADT] Automatically forward llvm::sort to array_pod_sort if safe This is safe if the iterator type is a pointer and the comparator is stateless. The enable_if pattern I'm adding here only uses array_pod_sort for the default comparator (std::less). Using array_pod_sort has a potential performance impact, but I didn't notice anything when testing clang. Sorting doesn't seem to be on the hot path anywhere in LLVM. Shrinks Release+Asserts clang by 73k.	2020-03-28 20:20:14 +01:00
Benjamin Kramer	2d24d74b85	[AMDGPU] Stabilize sort order Found by the expensive checks in llvm::sort.	2020-03-28 20:20:14 +01:00
Yonghong Song	ced0d1f42b	[BPF] support 128bit int explicitly in layout spec Currently, bpf does not specify 128bit alignment in its layout spec. So for a structure like struct ipv6_key_t { unsigned pid; unsigned __int128 saddr; unsigned short lport; }; clang will generate IR type %struct.ipv6_key_t = type { i32, [12 x i8], i128, i16, [14 x i8] } Additional padding is to ensure later IR->MIR can generate correct stack layout with target layout spec. But it is common practice for a tracing program to be first compiled with target flag (e.g., x86_64 or aarch64) through clang to generate IR and then go through llc to generate bpf byte code. Tracing program often refers to kernel internal data structures which needs to be compiled with non-bpf target. But such a compilation model may cause a problem on aarch64. The bcc issue https://github.com/iovisor/bcc/issues/2827 reported such a problem. For the above structure, since aarch64 has "i128:128" in its layout string, the generated IR will have %struct.ipv6_key_t = type { i32, i128, i16 } Since bpf does not have "i128:128" in its spec string, the selectionDAG assumes alignment 8 for i128 and computes the stack storage size for the above is 32 bytes, which leads incorrect code later. The x86_64 does not have this issue as it does not have "i128:128" in its layout spec as it does permits i128 to be alignmented at 8 bytes at stack. Its IR type looks like %struct.ipv6_key_t = type { i32, [12 x i8], i128, i16, [14 x i8] } The fix here is add i128 support in layout spec, the same as aarch64. The only downside is we may have less optimal stack allocation in certain cases since we require 16byte alignment for i128 instead of 8. But this is probably fine as i128 is not used widely and in most cases users should already have proper alignment. Differential Revision: https://reviews.llvm.org/D76587	2020-03-28 11:46:29 -07:00
Benjamin Kramer	4065e92195	Upgrade some instances of std::sort to llvm::sort. NFC.	2020-03-28 19:23:29 +01:00
Reid Kleckner	e5bf5037d8	[CodeGen] Fix sinking local values in lpads with phis There was already a test case for landingpads to handle this case, but I had forgotten to consider PHI instructions preceding the EH_LABEL in the landingpad. PR45261	2020-03-28 11:10:33 -07:00
Nikita Popov	30d712103f	[InstCombine] Use replaceOperand() API in GEP transforms To make sure that replaced operands get DCEd. This drops one iteration from gepphigep.ll, which is still not optimal. This was the last test case performing more than 3 iterations. NFC-ish, only worklist order should change.	2020-03-28 19:07:25 +01:00
Nikita Popov	b1f78baeaa	[InstCombine] Reduce code duplication in GEP of PHI transform; NFC The `NewGEP->setOperand(DI, NewPN)` call was duplicated, and the insertion of NewGEP is the same in both if/else, so we can extract it.	2020-03-28 19:07:25 +01:00
Alexandre Ganea	3ab3f3c5d5	After `09158252f7`, fix build when -DLLVM_ENABLE_THREADS=OFF Tested on Linux with Clang 9, and on Windows with Visual Studio 2019 16.5.1 with -DLLVM_ENABLE_THREADS=ON and OFF.	2020-03-28 13:54:58 -04:00
Nikita Popov	672e8bfbfc	[InstCombine] Fix worklist management in foldXorOfICmps() Because this code does not use the IC-aware replaceInstUsesWith() helper, we need to manually push users to the worklist. This is NFC-ish, in that it may only change worklist order.	2020-03-28 18:25:21 +01:00
Nikita Popov	337b671b0d	[InstCombine] Change limit-max-iterations test case; NFC This particular case will stop needing multiple iterations in a followup change.	2020-03-28 18:25:20 +01:00
Enna1	03bc311a16	[CorrelatedValuePropagation] Remove redundant if statement in processSelect() This statement if (ReplaceWith == S) ReplaceWith = UndefValue::get(S->getType()); is introduced in https://reviews.llvm.org/rG35609d97ae89b8e13f40f4e6b9b056954f8baa83 to fix a case where unreachable code can cause select instruction simplification to fail. In https://reviews.llvm.org/rGd10480657527ffb44ea213460fb3676a6b1300aa, we begin to perform a depth-first walk of basic blocks. This means we will not visit unreachable blocks. So we do not need this the special check any more. Differential Revision: https://reviews.llvm.org/D76753	2020-03-28 18:01:17 +01:00
Martin Storsjö	e6112a56dd	[AsmPrinter] Emit .weak directive for weak linkage on COFF for symbols without a comdat MC already knows how to emulate the .weak directive (with its ELF semantics; i.e., an undefined weak symbol resolves to 0, and a defined weak symbol has lower link precedence than a strong symbol of the same name) using COFF weak externals. Plumb this through the ASM printer too, so that definitions marked with __attribute__((weak)) at the language level (which gets translated to weak linkage at the IR level) have the corresponding .weak directive emitted. Note that declarations marked with __attribute__((weak)) at the language level (which translates to extern_weak at the IR level) already have .weak directives emitted. Weak/linkonce symbols without an associated comdat (in particular, ones generated with __attribute__((weak)) in C/C++) were earlier emitted as normal unique globals, as the comdat is required to provide the linkonce semantics. This change makes sure they are emitted as .weak instead, allowing other symbols to override them. Rename the existing coff-weak.ll test to coff-linkonce.ll. I'm not quite sure what that test covers, since the behavior being tested in it (the emission of a one_only section) is just a result of passing -function-sections to llc; the linkonce_odr makes no difference. Add a new coff-weak.ll which tests the new directive emission. Based on an previous patch by Shoaib Meenai. Differential Revision: https://reviews.llvm.org/D44543	2020-03-28 18:48:58 +02:00
Florian Hahn	81f173ed0e	[SCCP] Remove LatticeVal alias now that transition is done (NFC). The LatticeVal alias was introduced to reduce the diff size for the transition to ValueLatticeElement, which is done now. This patch removes the unnecessary alias and updates some very verbose type uses with auto.	2020-03-28 15:40:24 +00:00
Florian Hahn	a44bf59c93	[SCCP] Remove unused toLatticeValue helper (NFC). LatticeVal is an alias for ValueLatticeElement and the function is not used any longer.	2020-03-28 15:40:24 +00:00
Michael Liao	d2dd0fac48	Fix `-Wsign-compare` warning. NFC.	2020-03-28 10:20:27 -04:00
Martin Storsjö	8330dcadb8	[llvm-rc] Allow -1 for menu item IDs This seems to be used in some resource files, e.g. `f3217573d7/include/wx/msw/wx.rc (L28)`. MSVC rc.exe and GNU windres both allow any value here, and silently just truncate to uint16_t range. This just explicitly allows the -1 value and errors out on others - the same was done for control IDs in dialogs in `c1a67857ba`. Differential Revision: https://reviews.llvm.org/D76951	2020-03-28 14:32:08 +02:00
Simon Pilgrim	8c1dbd5c1e	[X86][SSE] Add testnzc(~X,Y) -> testnzc(X,Y) test cases	2020-03-28 10:56:57 +00:00
Simon Pilgrim	d34d2ec28b	[X86][SSE] Add original PR38522 test case	2020-03-28 10:56:57 +00:00
Simon Pilgrim	8d85da5f5a	[X86][SSE] Add combine tests for PTEST/TESTPS/TESTPD instructions Including some test coverage for PR38522	2020-03-28 10:56:57 +00:00
Evan LeClercq	37943e518c	[docs] Added solutions to slow build under common problems. I added a list of options to configure should someone have issues with long build time or running out of memory. This was added under common problems in the getting started section of the documentation. Reviewed By: Meinersbur, dim, e-leclercq Differential Revision: https://reviews.llvm.org/D75425	2020-03-28 04:19:45 -05:00
Uday Bondhugula	06066c4003	[NFC] Attributor comment updates / cast cleanup Minor update/fixes to comments for the Attributor pass, and dyn_cast -> cast. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76972	2020-03-28 13:36:43 +05:30
Serge Pavlov	f398739152	[FEnv] Constfold some unary constrained operations This change implements constant folding to constrained versions of intrinsics, implementing rounding: floor, ceil, trunc, round, rint and nearbyint. Differential Revision: https://reviews.llvm.org/D72930	2020-03-28 12:28:33 +07:00
Jonas Devlieghere	190df4a5bc	Revert "[FileCollector] Add a method to add a whole directory and it contents." This reverts commit `8913769e35` because the unit test is failing on the Windows bot.	2020-03-27 19:21:48 -07:00
Jessica Paquette	98d05f88d5	[GlobalISel] Fix equality for copies from physregs in matchEqualDefs When we see this: ``` %a = COPY $physreg ... SOMETHING implicit-def $physreg ... %b = COPY $physreg ``` The two copies are not equivalent, and so we shouldn't perform any folding on them. When we have two instructions which use a physical register check that they define the same virtual register(s) as well. e.g., if we run into this case ``` %a = COPY $physreg ... %b = COPY %a ``` we can say that the two copies are the same, and can be folded. Differential Revision: https://reviews.llvm.org/D76890	2020-03-27 17:52:21 -07:00
Jonas Devlieghere	a67f057f69	[FileCollector] Devirtualize FileCollector (NFC) This is not (yet) necessary.	2020-03-27 17:47:18 -07:00
Jonas Devlieghere	8913769e35	[FileCollector] Add a method to add a whole directory and it contents. Extend the FileCollector's API with addDirectory which adds a directory and its contents to the VFS mapping. Differential revision: https://reviews.llvm.org/D76671	2020-03-27 17:38:24 -07:00
Kamlesh Kumar	aabc24acf0	[RISCV] Support llvm.thread.pointer Fixes https://bugs.llvm.org/show_bug.cgi?id=45303 (clang crashed on __builtin_thread_pointer) Reviewed By: lenary, MaskRay, luismarques Differential Revision: https://reviews.llvm.org/D76828	2020-03-27 17:30:12 -07:00
David Blaikie	cbce88dd3a	FunctionRef: Strip cv qualifiers in the converting constructor Without this some instances of copy construction would use the converting constructor & lead to the destination function_ref referring to the source function_ref instead of the underlying functor. Discovered in feedback from `857bf5da35` Thanks to Johannes Doerfert, Arthur O'Dwyer, and Richard Smith for the discussion and debugging.	2020-03-27 16:31:58 -07:00
Nemanja Ivanovic	4821411347	[DAGCombine] Fix splitting indexed loads in ForwardStoreValueToDirectLoad() In DAGCombiner::visitLOAD() we perform some checks before breaking up an indexed load. However, we don't do the same checking in ForwardStoreValueToDirectLoad() which can lead to failures later during combining (see: https://bugs.llvm.org/show_bug.cgi?id=45301). This patch just adds the same checks to this function as well. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45301 Differential revision: https://reviews.llvm.org/D76778	2020-03-27 18:03:47 -05:00
Francesco Petrogalli	4b3d94051c	[llvm][Type] Return fixed size for scalar types. [NFC] Summary: It is safe to assume that the TypeSize associated to scalar types has a fixed size. This avoids an implicit cast of TypeSize to integer inside `Type::getScalarSizeInBits()`, as such implicit cast is deprecated. Reviewers: efriedma, sdesmalen Reviewed By: efriedma Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76892	2020-03-27 22:23:46 +00:00
Florian Hahn	9ce198d6ed	[Darwin] Respect -fno-unroll-loops during LTO. Currently -fno-unroll-loops is ignored when doing LTO on Darwin. This patch adds a new -lto-no-unroll-loops option to the LTO code generator and forwards it to the linker if -fno-unroll-loops is passed. Reviewers: thegameg, steven_wu Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D76916	2020-03-27 22:19:03 +00:00
Jonas Devlieghere	3ef33e69de	[VirtualFileSystem] Support directory entries in the YAMLVFSWriter The current implementation of the JSONWriter does not support writing out directory entries. Earlier today I added a unit test to illustrate the problem. When an entry is added to the YAMLVFSWriter and the path is a directory, it will incorrectly emit the directory as a file, and any files inside that directory will not be found by the VFS. It's possible to partially work around the issue by only adding "leaf nodes" (files) to the YAMLVFSWriter. However, this doesn't work for representing empty directories. This is a problem for clients of the VFS that want to iterate over a directory. The directory not being there is not the same as the directory being empty. This is not just a hypothetical problem. The FileCollector for example does not differentiate between file and directory paths. I temporarily worked around the issue for LLDB by ignoring directories, but I suspect this will prove problematic sooner rather than later. This patch fixes the issue by extending the JSONWriter to support writing out directory entries. We store whether an entry should be emitted as a file or directory. Differential revision: https://reviews.llvm.org/D76670	2020-03-27 15:16:52 -07:00
Sam McCall	6b3bedec99	Add BitWidth trait to BitmaskEnum, and use for clang DependenceFlags. NFC Reviewers: hokein Subscribers: dexonsmith, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76942	2020-03-27 22:40:21 +01:00
Sanjay Patel	0f56bbc1a5	[InstCombine] reduce FP-casted and bitcasted signbit check PR45305: https://bugs.llvm.org/show_bug.cgi?id=45305 Alive2 proofs: http://volta.cs.utah.edu:8080/z/bVyrko http://volta.cs.utah.edu:8080/z/Vxpz9q	2020-03-27 17:33:59 -04:00
Sanjay Patel	e72730ee3a	[InstCombine] add tests for FP cast+bitcast signbit checks; NFC PR45305: https://bugs.llvm.org/show_bug.cgi?id=45305	2020-03-27 17:25:25 -04:00
Michael Liao	5be9b8cbe2	[cuda][hip] Add CUDA builtin surface/texture reference support. Summary: - Re-commit after fix Sema checks on partial template specialization. Reviewers: tra, rjmccall, yaxunl, a.sidorin Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76365	2020-03-27 17:18:49 -04:00
Francesco Petrogalli	c66d1f38f6	[llvm][Support] Add isZero method for TypeSize. [NFC] Summary: The method is used where TypeSize is implicitly cast to integer for being checked against 0. Reviewers: sdesmalen, efriedma Reviewed By: sdesmalen, efriedma Subscribers: efriedma, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76748	2020-03-27 21:03:44 +00:00
Matt Arsenault	a8cc9047de	CodeGen: Add -denormal-fp-math-f32 flag Make the set of FP related attributes and command flags closer.	2020-03-27 14:00:39 -07:00
Jay Foad	a6dfd827e5	[AMDGPU] Fix getEUsPerCU for gfx10 in CU mode Summary: "Per CU" is a bit simplistic for gfx10, but I couldn't think of a better name. Reviewers: arsenm, rampitec, nhaehnle, dstuttard, tpr Subscribers: kzhuravl, jvesely, wdng, yaxunl, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76861	2020-03-27 20:36:49 +00:00
Fangrui Song	152d14da64	[MC][X86] Make .reloc support arbitrary relocation types Generalizes D62014 (R_386_NONE/R_X86_64_NONE). Unlike ARM (D76746) and AArch64 (D76754), we cannot delete FK_NONE from getFixupKindSize because FK_NONE is still used by R_386_TLS_DESC_CALL/R_X86_64_TLSDESC_CALL.	2020-03-27 13:33:15 -07:00
Matt Arsenault	0fd8030be3	Fix line endings in test	2020-03-27 16:26:06 -04:00
diggerlin	9c20f09985	[AIX] Address comment https://reviews.llvm.org/D76162#inline-701237 SUMMARY: Address clang format issue: "clang format this block, I don't think the spaces are aligned correctly." Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D76162	2020-03-27 16:21:53 -04:00
Matt Arsenault	348735b723	AMDGPU: Stop setting attributes based on TargetOptions Having arbitrary passes looking at the TargetOptions is pretty messy. This was also disregarding if a function already had an explicit attribute setting on it. opt/llc now add the attributes to functions that don't specify the attribute. clang and lld do not call the function to do this, which they maybe should. This was also treating unsafe-fp-math as implying the others, and setting the other attributes based on it. This is not done anywhere else, and I'm not sure is correct based on the current description of the option bit. Effectively reverts `1d8cf2be89`	2020-03-27 13:13:43 -07:00
Matt Arsenault	0ab5b5b858	Fix denormal-fp-math flag and attribute interaction Make these behave the same way unsafe-fp-math and co. The command line flag should add the attribute to functions that do not already have it, and leave existing attributes. The attribute is the actual implementation, but the flag is useful in some testing situations. AMDGPU has a variety of tests with denormals enabled/disabled that would require a painful level of test duplication without a flag. This doesn't expose setting the separate input/output modes, or add a flag for the f32 version yet. Tests will be included in future patch.	2020-03-27 12:48:58 -07:00
Fangrui Song	34d77516b8	[MC][AArch64] Make .reloc support arbitrary relocation types Depends on D76746. Generalizes D61973. Differential Revision: https://reviews.llvm.org/D76754	2020-03-27 12:30:52 -07:00
Fangrui Song	c389526171	[MC][ARM] Make .reloc support arbitrary relocation types Generalizes D61992. In GNU as, the .reloc directive supports arbitrary relocation types. A MCFixupKind value `V` larger than or equal to FirstLiteralRelocationKind is used to represent the relocation type whose number is V-FirstLiteralRelocationKind. This is useful for linker tests. Without the feature the assembler cannot produce certain relocation records (e.g. R_ARM_ALU_PC_G0/R_ARM_LDR_PC_G0) This helps move forward D75349 and D76575. Differential Revision: https://reviews.llvm.org/D76746	2020-03-27 12:29:49 -07:00
Julian Lettner	8896d12315	[lit] Avoid global imports in module declaration A previous attempt to cleanup module imports broke installing via pip/setup.py [1]. This should be fixed now. [1] `cf252240e8` Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D76940	2020-03-27 12:07:19 -07:00
Lang Hames	cb84e4827e	[ORC] Introduce JITSymbolFlags::HasMaterializeSideEffectsOnly flag. This flag can be used to mark a symbol as existing only for the purpose of enabling materialization. Such a symbol can be looked up to trigger materialization with the lookup returning only once materialization is complete. Symbols with this flag will never resolve however (to avoid permanently polluting the symbol table), and should only be looked up using the SymbolLookupFlags::WeaklyReferencedSymbol flag. The primary use case for this flag is initialization symbols.	2020-03-27 11:02:54 -07:00
Lang Hames	d38d06e649	[ORC] Don't create MaterializingInfo entries unnecessarily.	2020-03-27 11:02:54 -07:00
Craig Topper	cdd1cd7120	[X86] Don't form masked instructions if the operation has an additional user. This will cause the operation to be repeated in both a mask and another masked or unmasked form. This can a wasted of execution resources. Differential Revision: https://reviews.llvm.org/D60940	2020-03-27 10:44:22 -07:00
Simon Pilgrim	763c87309d	[X86][SSE] Add some additional v8i16 'truncation' style shuffle tests	2020-03-27 17:29:29 +00:00
Dennis Felsing	aa0be69e74	Export Segment.IsGapRegion to JSON Summary: So that external tools can make use of that information and not display such lines as uncovered. Fixes https://bugs.llvm.org/show_bug.cgi?id=45300 Reviewers: vsk Reviewed By: vsk Differential Revision: https://reviews.llvm.org/D76763	2020-03-27 18:05:01 +01:00
Simon Pilgrim	950ea61653	[X86] Remove orphan LowerSTRICT_FSETCC declaration. NFCI. LowerSETCC handles strict cases as well, we don't have a separate function.	2020-03-27 17:03:19 +00:00
Artem Belevich	fe8063e1a0	Revert "[cuda][hip] Add CUDA builtin surface/texture reference support." This reverts commit `6a9ad5f3f4`. The patch breaks CUDA copmilation. Differential Revision: https://reviews.llvm.org/D76365	2020-03-27 10:01:38 -07:00
LLVM GN Syncbot	f8c25945e5	[gn build] Port `d60d7d69de`	2020-03-27 16:07:06 +00:00
jasonliu	d60d7d69de	[llvm-objdump][XCOFF][AIX] Implement -r option Summary: Implement several XCOFF hooks to get '-r' option working for llvm-objdump -r. Reviewer: DiggerLin, hubert.reinterpretcast, jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D75131	2020-03-27 16:05:42 +00:00
Guillaume Chatelet	74eac9031a	[Alignment][NFC] MachineMemOperand::getAlign/getBaseAlign Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, dschuff, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, jrtc27, atanasyan, jfb, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76925	2020-03-27 15:49:13 +00:00
Sam Parker	d7084fa34a	[ARM][LowOverheadLoops] DoubleWidthResult instructions canGenerateZeros Given that some instructions generate wider result elements than their inputs, flag them as being able to generate non zeros in the false lanes. Differential Revision: https://reviews.llvm.org/D76766	2020-03-27 15:26:13 +00:00
Alexandre Ganea	4bc8882b89	Fix build after `09158252f7`	2020-03-27 11:23:11 -04:00
Simon Pilgrim	f4f4a8bfef	[InstCombine][X86] Add repeated ops demanded elts tests for SSE intrinsics (PR24523)	2020-03-27 14:51:09 +00:00
Simon Pilgrim	ec3bb6c3e7	[InstCombine][X86] Regenerate SSE2 tests	2020-03-27 14:51:09 +00:00
Alexandre Ganea	09158252f7	[ThinLTO] Allow usage of all hardware threads in the system Before this patch, it wasn't possible to extend the ThinLTO threads to all SMT/CMT threads in the system. Only one thread per core was allowed, instructed by usage of llvm::heavyweight_hardware_concurrency() in the ThinLTO code. Any number passed to the LLD flag /opt:lldltojobs=..., or any other ThinLTO-specific flag, was previously interpreted in the context of llvm::heavyweight_hardware_concurrency(), which means SMT disabled. One can now say in LLD: /opt:lldltojobs=0 -- Use one std::thread / hardware core in the system (no SMT). Default value if flag not specified. /opt:lldltojobs=N -- Limit usage to N threads, regardless of usage of heavyweight_hardware_concurrency(). /opt:lldltojobs=all -- Use all hardware threads in the system. Equivalent to /opt:lldltojobs=$(nproc) on Linux and /opt:lldltojobs=%NUMBER_OF_PROCESSORS% on Windows. When an affinity mask is set for the process, threads will be created only for the cores selected by the mask. When N > number-of-hardware-threads-in-the-system, the threads in the thread pool will be dispatched equally on all CPU sockets (tested only on Windows). When N <= number-of-hardware-threads-on-a-CPU-socket, the threads will remain on the CPU socket where the process started (only on Windows). Differential Revision: https://reviews.llvm.org/D75153	2020-03-27 10:20:58 -04:00
Sam Parker	0e6aa08381	[ARM][MVE] Add DoubleWidthResult flag Add a flag for those instructions which read from the top/bottom halves of their inputs and produce a vector of results with double width elements. Differential Revision: https://reviews.llvm.org/D76762	2020-03-27 13:44:04 +00:00
Louis Dionne	08d5426981	[lit] NFC: Move the flaky test logic to _runShTest This minor refactoring allows reducing the amount of processing that is duplicated when we re-run a flaky test. It also has the nice side effect that libc++'s current test format supports flaky .sh.cpp tests, because those are built on top of _runShTest, not executeShTest.	2020-03-27 09:32:58 -04:00
Louis Dionne	faf415a1de	[lit] Recursively expand substitutions This allows defining substitutions in terms of other substitutions. For example, a %build substitution could be defined in terms of a %cxx substitution as '%cxx %s -o %t.exe' and the script would be properly expanded. Differential Revision: https://reviews.llvm.org/D76178	2020-03-27 09:25:26 -04:00
Sjoerd Meijer	401a324c51	[LV] Refactor widenIntOrFpInduction. NFC. This untangles the logic in widenIntOrFpInduction in order to make more explicit and visible how exactly the induction variable is lowered. Differential Revision: https://reviews.llvm.org/D76686	2020-03-27 12:58:50 +00:00
Guillaume Chatelet	e2ef6127d9	[Alignment] Fix overaligning bug Summary: This was discovered while converting to Align type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76914	2020-03-27 12:57:50 +00:00
Simon Pilgrim	e15ade4781	Revert rGa3c715e9788d829031989b0a5ea4eb43c7288be9 "Twine - fix uninitialized variable warnings. NFCI." @dblaikie noticed that this may interfere with msan analysis	2020-03-27 11:44:04 +00:00
Simon Pilgrim	d6ddabd7ef	Revert rG6ff1ea3244c543ad24fc99c7f4979db2f2078593 "Fix "use of uninitialized variable" static analyzer warning. NFCI." @dblaikie noticed that this may interfere with msan analysis	2020-03-27 11:44:03 +00:00
Jonas Paulsson	35173dddd1	[SystemZ] Fix typos in comments.	2020-03-27 12:31:48 +01:00
David Green	8689f98e9b	[ARM] Fix MVE VCMPr f16 pattern This patterns seemed to be using the f32 instruction, not f16. Fix it to use the correct one. Differential Revision: https://reviews.llvm.org/D76841	2020-03-27 11:18:24 +00:00
Georgii Rymar	30c1f9a558	[llvm-readobj] - Fix a crash when DT_STRTAB is broken. We might have a crash scenario when we have an invalid DT_STRTAB value that is larger than the file size. I've added a test case to demonstrate. Differential revision: https://reviews.llvm.org/D76706	2020-03-27 13:18:08 +03:00
Guillaume Chatelet	a98662f4c1	[Alignment][NFC] Update MachineMemOperand implementation to use MaybeAlign Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Reviewed By: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76625	2020-03-27 08:06:10 +00:00
Fangrui Song	6728a9ae19	[MCInstPrinter] Add parameter `Address` to printCustomAliasOperand. NFC Follow-up of D72172 and llvmorg-11-init-6896-gb3cc5dcef0f.	2020-03-27 00:38:20 -07:00
Johannes Doerfert	095cecbe0d	[OpenMP] `omp begin/end declare variant` - part 1, parsing This is the first part extracted from D71179 and cleaned up. This patch provides parsing support for `omp begin/end declare variant`, as defined in OpenMP technical report 8 (TR8) [0]. A major purpose of this patch is to provide proper math.h/cmath support for OpenMP target offloading. See PR42061, PR42798, PR42799. The current code was developed with this feature in mind, see [1]. [0] https://www.openmp.org/wp-content/uploads/openmp-TR8.pdf [1] https://reviews.llvm.org/D61399#change-496lQkg0mhRN Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D74941	2020-03-27 02:30:58 -05:00
Fangrui Song	b3cc5dcef0	[MCInstPrinter] Add parameter `Address` to MCInstPrinter::printAliasInstr. NFC Follow-up of D72172.	2020-03-27 00:03:32 -07:00
Shengchen Kan	1fb4f99a21	[X86][MC] Fix the bug for prefix padding support Summary: There is a tiny logic error of D75300, making branch is not correctly aligned with option -x86-pad-max-prefix-size Reviewers: reames, MaskRay, craig.topper, LuoYuanke, jyknight Reviewed By: reames Subscribers: hiraditya, llvm-commits, annita.zhang Tags: #llvm Differential Revision: https://reviews.llvm.org/D76285	2020-03-27 14:16:09 +08:00
Kai Luo	26b46b67d8	[PowerPC] Fix test for PR45297 to adapt build without asserts. NFC.	2020-03-27 05:28:34 +00:00
Kai Luo	351b192315	[PowerPC] Enhance test for PR45297. NFC.	2020-03-27 04:45:21 +00:00
Juneyoung Lee	1bcc500b48	[DAGCombine] Add basic optimizations for FREEZE in SelDag Summary: This patch is the first effort to adding basic optimizations for FREEZE in SelDag. Reviewers: spatel, lebedev.ri Reviewed By: spatel Subscribers: xbolva00, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76707	2020-03-27 12:20:39 +09:00
David Blaikie	324f5a14d7	Make llvm::function_ref's operator bool explicit This can avoid all sorts of mistakes with implicit conversion (indirectly) to int, etc. I'm quite surprise there aren't any things to fixup with this - but I guess most uses of function_ref aren't optional/nullable.	2020-03-26 20:09:57 -07:00

1 2 3 4 5 ...

194052 Commits