llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	874bdc8e61	[Driver] Clean up Debian multiarch /usr/include/<triplet> madness Debian multiarch additionally adds /usr/include/<triplet> and somehow Android borrowed the idea. (Note /usr/<triplet>/include is already an include dir...). On Debian, we should just assume a GCC installation is available and use its triple.	2021-03-21 22:40:38 -07:00
Stella Laurenzo	bdf4e93b2c	Fix extraneous context parameter in templated helper function. (missed in lattner's overall updates related to D99028)	2021-03-22 05:08:44 +00:00
Bing1 Yu	113f077f80	[X86] Pass to transform tdpbf16ps intrinsics to scalar operation. In previous patch https://reviews.llvm.org/D93594, we only scalarize tilezero, tileload, tilestore and tiledpbssd. In this patch we scalarize tdpbf16ps intrinsic. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D96110	2021-03-22 13:00:40 +08:00
Max Kazantsev	8fab9f824f	[IndVars] Sharpen context in eliminateIVComparison When eliminating comparisons, we can use common dominator of all its users as context. This gives better results when ICMP is not computed right before the branch that uses it. Differential Revision: https://reviews.llvm.org/D98924 Reviewed By: lebedev.ri	2021-03-22 11:55:57 +07:00
Lang Hames	fc36a511c6	[JITLink][ELF/x86-64] Add support for R_X86_64_GOTPC64 and R_X86_64_GOT64. Start adding support for ELF x86-64 large code model, PIC relocations.	2021-03-21 21:52:54 -07:00
Siva Chandra	b22f448c21	[libc] Add a target "install-llvmlibc" to install LLVM libc static archive.	2021-03-22 04:45:14 +00:00
Lang Hames	0a74ec3299	[JITLink] Start laying the groundwork for ELF x86-64 large code model support. Introduces DefineExternalSectionStartAndEndSymbols.h, which defines a template for a JITLink pass that transforms external symbols meeting a user-supplied predicate into defined symbols pointing at the start and end of a Section identified by the predicate. JITLink.h is updated with a new makeAbsolute function to support this pass. Also renames BasicGOTAndStubsBuilder to PerGraphGOTAndPLTStubsBuilder -- the new name better describes the intent of this GOT and PLT stubs builder, and will help to distinguish it from future GOT and PLT stub builders that build entries that may be shared between multiple graphs.	2021-03-21 20:56:47 -07:00
Lang Hames	209ceed745	[JITLink][ELF/x86-64] Add Delta32, NegDelta32, NegDelta64 support. These were missing, but are used in eh-frame section support.	2021-03-21 20:15:40 -07:00
Chuanqi Xu	55486161fa	[ASTMatcher] Add AST Matcher support for C++20 coroutine keywords Summary: Try to enable the support for C++20 coroutine keywords for AST Matchers. Reviewers: sammccall, njames93, aaron.ballman Differential Revision: https://reviews.llvm.org/D96316	2021-03-22 10:27:46 +08:00
Luo, Yuanke	661c016f68	[X86][AMX] Add test cases for AMX load/store lowering. Differential Revision: https://reviews.llvm.org/D99030	2021-03-22 09:14:52 +08:00
Fangrui Song	74933efeb6	[Driver] Detect Debian hack g++-multiarch-incdir.diff to simplify addLibStdCXXIncludePaths call sites	2021-03-21 17:33:31 -07:00
Fangrui Song	6a4fbf14ef	[test] Add test for cross compiling on Linux	2021-03-21 15:37:35 -07:00
Fangrui Song	72ac988dc7	[test] Delete obsoleted debian_multiarch_tree and ubuntu_13.04_multiarch_tree They are quite outdated. Delete them to avoid unnecessary test churn.	2021-03-21 15:37:34 -07:00
Jacques Pienaar	113baa2b9f	Update examples post OwningRewritePatternList change	2021-03-21 15:15:54 -07:00
Nico Weber	2554b95db5	Revert "[lld-macho] Implement -dependency_info (partially - more opcodes needed)" This reverts commit `c53a1322f3`. Test only passes depending on build dir having a lexicographically later name than the source dir, and doesn't link on mac/win. See https://reviews.llvm.org/D98559#2640265 onward.	2021-03-21 16:35:38 -04:00
Roman Lebedev	be87321280	[clang][Codegen] EmitBranchOnBoolExpr(): emit prof branch counts even at -O0 This restores the original behaviour before i unadvertedly broke it in `e3a4701627` and clang/test/Profile/ caught it.	2021-03-21 23:24:27 +03:00
Roman Lebedev	e3a4701627	[clang][CodeGen] Lower Likelihood attributes to @llvm.expect intrin instead of branch weights `08196e0b2e` exposed LowerExpectIntrinsic's internal implementation detail in the form of LikelyBranchWeight/UnlikelyBranchWeight options to the outside. While this isn't incorrect from the results viewpoint, this is suboptimal from the layering viewpoint, and causes confusion - should transforms also use those weights, or should they use something else, D98898? So go back to status quo by making LikelyBranchWeight/UnlikelyBranchWeight internal again, and fixing all the code that used it directly, which currently is only clang codegen, thankfully, to emit proper @llvm.expect intrinsics instead.	2021-03-21 22:50:21 +03:00
Roman Lebedev	37d6be9052	Revert "[BranchProbability] move options for 'likely' and 'unlikely'" Upon reviewing D98898 i've come to realization that these are implementation detail of LowerExpectIntrinsicPass, and they should not be exposed to outside of it. This reverts commit `ee8b53815d`.	2021-03-21 22:50:21 +03:00
Fangrui Song	bcaca360f8	[Driver] Gnu.cpp: fix libstdc++ search path for multilib With this change, on Debian x86-64 (with a MULTILIB_OSDIRNAMES local patch ../lib64 -> ../lib; this does not matter because /usr/lib64/crt{1,i,n}.o do not exist), `clang++ --target=aarch64-linux-gnu a.cc -Wl,--dynamic-linker=/usr/aarch64-linux-gnu/lib/ld-linux-aarch64.so.1 -Wl,-rpath,/usr/aarch64-linux-gnu/lib` built executable can run under qemu-user. Previously this failed with `/usr/lib/gcc-cross/aarch64-linux-gnu/10/../../../../include/c++/10/iostream:38:10: fatal error: 'bits/c++config.h' file not found` On Arch Linux, due to the MULTILIB_OSDIRNAMES patch and the existence of /usr/lib64/crt{1,i,n}.o, clang driver may pick /usr/lib64/crt{1,i,n}.o and cause a linker error. -B can work around the problem. `clang++ --target=aarch64-linux-gnu -B /usr/aarch64-linux-gnu/lib a.cc -Wl,--dynamic-linker=/usr/aarch64-linux-gnu/lib/ld-linux-aarch64.so.1 -Wl,-rpath,/usr/aarch64-linux-gnu/lib64:/usr/aarch64-linux-gnu/lib`	2021-03-21 12:01:44 -07:00
Vy Nguyen	c53a1322f3	[lld-macho] Implement -dependency_info (partially - more opcodes needed) Bug: https://bugs.llvm.org/show_bug.cgi?id=49278 The flag is not well documented, so this implementation is based on observed behaviour. When specified, `-dependency_info <path>` produced a text file containing information pertaining to the current linkage, such as input files, output file, linker version, etc. This file's layout is also not documented, but it seems to be a series of null ('\0') terminated strings in the form `<op code><path>` `<op code>` could be: `0x00` : linker version `0x10` : input `0x11` : files not found(??) `0x40` : output `<path>` : is the file path, except for the linker-version case. (??) This part is a bit unclear. I think it means all the files the linker attempted to look at, but could not find. Differential Revision: https://reviews.llvm.org/D98559	2021-03-21 14:35:46 -04:00
Craig Topper	30080b003e	[DAGCombiner] Minor compile time improvement to (sext_in_reg (sign_extend_vector_inreg x)) optimization. Don't bother calling ComputeNumSignBits if N00Bits < ExtVTBits. No matter what answer we get back this will be true: (N00Bits - DAG.ComputeNumSignBits(N00, DemandedSrcElts)) < ExtVTBits) So we might as well save the computation. This makes the code more consistent with the similar (sext_in_reg (sext x)) handling above.	2021-03-21 11:16:41 -07:00
Nikita Popov	d11d5d1c5f	[ValueTracking] Improve mul handling in isKnownNonEqual() X != X * C is true if: * C is not 0 or 1 * X is not 0 * mul is nsw or nuw Proof: https://alive2.llvm.org/ce/z/uwF29z This is motivated by one of the cases in D98422.	2021-03-21 18:41:35 +01:00
Nikita Popov	f5bbdf2a67	[ValueTracking] Add more tests for isKnownNonEqual() of mul (NFC) This is for the case of (x * C) == x, rather than the (x * C1) == (x * C2) variant that we already cover.	2021-03-21 18:41:35 +01:00
Chris Lattner	1d909c9a35	Remove the extraneous MLIRContext argument from populateWithGenerated. NFC.	2021-03-21 10:38:35 -07:00
Matt Arsenault	20a24af01d	MIR: Fix missing serialization for HasTailCall	2021-03-21 13:14:04 -04:00
Matt Arsenault	a0f5aad6d7	AMDGPU: Fix allowing immediates for tail call pseudo. The pseudo was using SSrc_b64, so it allowed folding immediates into the destination operand for a tail call to null. However, this is not a valid operand for the s_setpc_b64 this will be lowered to. Avoids printing the operand as an invalid immediate. Avoids a regression when tail calls are enabled in GlobalISel (somehow tail calls to null get deleted in the DAG).	2021-03-21 13:14:04 -04:00
Chris Lattner	ffde3acb1b	[ShapeDialect] Silence a build warning, NFC mlir/lib/Dialect/Shape/IR/Shape.cpp:573:26: warning: loop variable 'shape' is always a copy because the range of type '::mlir::Operation::operand_range' (aka 'mlir::OperandRange') does not return a reference [-Wrange-loop-analysis] for (const auto &shape : shapes()) { ^	2021-03-21 10:10:38 -07:00
Chris Lattner	3a506b31a3	Change OwningRewritePatternList to carry an MLIRContext with it. This updates the codebase to pass the context when creating an instance of OwningRewritePatternList, and starts removing extraneous MLIRContext parameters. There are many many more to be removed. Differential Revision: https://reviews.llvm.org/D99028	2021-03-21 10:06:31 -07:00
Nikita Popov	9f864d2025	Reapply [ConstantFold] Handle vectors in ConstantFoldLoadThroughBitcast() There seems to be an impedance mismatch between what the type system considers an aggregate (structs and arrays) and what constants consider an aggregate (structs, arrays and vectors). Adjust the type check to consider vectors as well. The previous version of the patch dropped the type check entirely, but it turns out that getAggregateElement() does require the constant to be an aggregate in some edge cases: For Poison/Undef the getNumElements() API is called, without checking in advance that we're dealing with an aggregate. Possibly the implementation should avoid doing that, but for now I'm adding an assert so the next person doesn't fall into this trap.	2021-03-21 17:48:21 +01:00
Nikita Popov	59dbf4d516	[InstSimplify] Add load of undef aggregate test (NFC) To make sure this doesn't crash the following commit.	2021-03-21 17:42:26 +01:00
Nikita Popov	b32f5d5045	[InstSimplify] Regenerate test checks (NFC)	2021-03-21 17:41:21 +01:00
Nikita Popov	ece1403aca	[InstSimplify] Add additional select operand replacement tests (NFC) This tests for binops with identity elements.	2021-03-21 15:30:30 +01:00
Nikita Popov	daae927f9c	[InstSimplify] Clean up SimplifyReplacedWithOp implementation (NFCI) Replace Op with RepOp up-front, and then always work with the new operands, rather than checking for replacement in various places.	2021-03-21 15:30:30 +01:00
Matt Arsenault	1098acd46d	GlobalISel: Avoid unnecessary truncation to i64 We can just directly pass through the APInt to create a new constant.	2021-03-21 10:07:41 -04:00
Matt Arsenault	6314a72730	AMDGPU/GlobalISel: Enable CSE in pre-legalizer combiner	2021-03-21 10:07:37 -04:00
Simon Pilgrim	64c2641c89	[DAG] Limit (sext_in_reg (zero_extend_vector_inreg x)) to exact sign extension As commented by @craig.topper on rG1ba5c550d418, we can't guarantee that we'll be extending zero bits, just sign bit. So, revert to the old code for zero_extend_vector_inreg cases.	2021-03-21 14:01:37 +00:00
Jez Ng	8757616de3	[lld-macho][nfc] Format Options.td Summary: A good chunk of it was mis-indented. Fixed by using the formatting settings from llvm/utils/vim.	2021-03-21 09:33:04 -04:00
Simon Pilgrim	3179588947	[X86][AVX] ComputeNumSignBitsForTargetNode - add X86ISD::VBROADCAST handling for scalar sources The target shuffle code handles vector sources, but X86ISD::VBROADCAST can also accept a scalar source for splatting. Added as an extension to PR49658	2021-03-21 12:22:51 +00:00
Simon Pilgrim	dc51cc3293	[X86] Add 'mulhs' variant of PR49658 test case	2021-03-21 12:09:05 +00:00
David Green	6d9d2049c8	[ARM] VINS f16 pattern This adds an extra pattern for inserting an f16 into a odd vector lane via an VINS. If the dual-insert-lane pattern does not happen to apply, this can help with some simple cases. Differential Revision: https://reviews.llvm.org/D95471	2021-03-21 12:00:06 +00:00
luxufan	02ffbac844	[RISCV] remove redundant instruction when eliminate frame index The reason for generating mv a0, a0 instruction is when the stack object offset is large then int<12>. To deal this situation, in the elimintateFrameIndex function, it will create a virtual register, which needs the register scavenger to scavenge it. If the machine instruction that contains the stack object and the opcode is ADDI(the addi was generated by frameindexNode), and then this instruction's destination register was the same as the register that was generated by the register scavenger, then the mv a0, a0 was generated. So to eliminnate this instruction, in the eliminateFrameIndex function, if the instrution opcode is ADDI, then the virtual register can't be created. Differential Revision: https://reviews.llvm.org/D92479	2021-03-21 18:54:00 +08:00
Simon Pilgrim	297b9bc3fa	[X86][AVX] computeKnownBitsForTargetNode - add X86ISD::VBROADCAST handling for scalar sources The target shuffle code handles vector sources, but X86ISD::VBROADCAST can also accept a scalar source for splatting. Suggested by @craig.topper on PR49658	2021-03-21 10:40:57 +00:00
Simon Pilgrim	613157dd67	[X86] Add PR49658 test case	2021-03-21 10:16:55 +00:00
Simon Pilgrim	54a05f2ec8	[X86] computeKnownBitsForTargetNode - add X86ISD::PMULUDQ handling Reuse the existing KnownBits multiplication code to handle what is effectively a ISD::UMUL_LOHI varient	2021-03-21 09:57:20 +00:00
Fangrui Song	2288a75d9e	[Driver] Linux.cpp: add -internal-isystem lib/../$triple/include With this change, for `#include <ar.h>`, `clang --target=aarch64-linux-gnu` will read `/usr/lib/gcc/aarch64-linux-gnu/10/../../../../aarch64-linux-gnu/include/ar.h` (on Debian gcc->gcc-cross) instead of `/usr/include/ar.h`. Some glibc headers (e.g. gnu/stubs.h) are different across architectures.	2021-03-21 00:56:03 -07:00
Fangrui Song	c2f9086b61	[Driver] Gnu.cpp: drop an unneeded special rule related to sysroot	2021-03-20 21:37:49 -07:00
Fangrui Song	56700e9379	[Driver] Gnu.cpp: drop an unneeded special rule related to sysroot Seem unnecessary to diverge from GCC here. Beside, lib/../$OSLibDir can be considered closer to the GCC installation then the system root. The comment should not apply.	2021-03-20 21:32:55 -07:00
Fangrui Song	0ad0c476ef	[Driver] Gnu.cpp: remove unneeded -L detection hack for -mx32 Removing the hack actually improves our compatibility with gcc -mx32.	2021-03-20 20:12:45 -07:00
Fangrui Song	775a294820	[Driver] Gnu.cpp: remove unneeded -L detection for libc++ If clang is installed in the system, the other -L suffice; otherwise $ccc_install_dir/../lib below suffices.	2021-03-20 18:56:40 -07:00
Fangrui Song	06d6b1471e	[Driver] Gnu.cpp: remove unneeded -L lib/gcc/$triple/$version/../../../$triple After path resolution, it duplicates a subsequent -L entry. The entry below (lib/gcc/$triple/$version/../../../../$OSLibDir) usually does not exist (e.g. Arch Linux; Debian cross gcc). When it exists, it typically just has ld.so (e.g. Debian native gcc) which cannot cause collision. Removing the -L (similar to reordering it) is therefore justified.	2021-03-20 18:50:14 -07:00

1 2 3 4 5 ...

383325 Commits All Branches Search

383325 Commits

All Branches