llvm-project

Commit Graph

Author	SHA1	Message	Date
Kai Luo	3d72a58981	[PowerPC][NFC] Precommit a test case where ppc-mi-peepholes miscompiles extswsli Added a test case to show codegen differences. llvm-svn: 366666	2019-07-22 05:32:20 +00:00
Serguei Katkov	c6c31da867	[Loop Peeling] Fix the handling of branch weights of peeled off branches. Current algorithm to update branch weights of latch block and its copies is based on the assumption that number of peeling iterations is approximately equal to trip count. However it is not correct. According to profitability check in one case we can decide to peel in case it helps to reduce the number of phi nodes. In this case the number of peeled iteration can be less then estimated trip count. This patch introduces another way to set the branch weights to peeled of branches. Let F is a weight of the edge from latch to header. Let E is a weight of the edge from latch to exit. F/(F+E) is a probability to go to loop and E/(F+E) is a probability to go to exit. Then, Estimated TripCount = F / E. For I-th (counting from 0) peeled off iteration we set the the weights for the peeled latch as (TC - I, 1). It gives us reasonable distribution, The probability to go to exit 1/(TC-I) increases. At the same time the estimated trip count of remaining loop reduces by I. As a result after peeling off N iteration the weights will be (F - N * E, E) and trip count of loop becomes F / E - N or TC - N. The idea is taken from the review of the patch D63918 proposed by Philip. Reviewers: reames, mkuper, iajbar, fhahn Reviewed By: reames Subscribers: hiraditya, zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D64235 llvm-svn: 366665	2019-07-22 05:15:34 +00:00
Fangrui Song	6ef23e6581	[utils] Clean up UpdateTestChecks/common.py llvm-svn: 366664	2019-07-22 04:59:01 +00:00
Fangrui Song	1a1af4392a	[analyzer] Fix -Wunused-function in NDEBUG builds with #ifdef LLVM_DUMP_METHOD llvm-svn: 366663	2019-07-22 04:14:09 +00:00
Craig Topper	ee5dc7e7ad	[InstCombine] Add foldAndOfICmps test cases inspired by PR42691. icmp ne %x, INT_MIN can be treated similarly to icmp sgt %x, INT_MIN. icmp ne %x, INT_MAX can be treated similarly to icmp slt %x, INT_MAX. icmp ne %x, UINT_MAX can be treated similarly to icmp ult %x, UINT_MAX. We already treat icmp ne %x, 0 similarly to icmp ugt %x, 0 llvm-svn: 366662	2019-07-22 02:43:43 +00:00
Nemanja Ivanovic	3d68adebc5	[PowerPC][NFC] Precomit test case for upcoming patch Just committing a test case for an upcoming patch so that the review can show only the codegen differences. llvm-svn: 366661	2019-07-21 21:03:45 +00:00
Simon Pilgrim	86fa3270ef	[X86] SimplifyDemandedVectorEltsForTargetNode - Move SUBV_BROADCAST narrowing handling. NFCI. Move the narrowing of SUBV_BROADCAST to where we handle all the other opcodes. llvm-svn: 366660	2019-07-21 19:04:44 +00:00
Nemanja Ivanovic	73d641a23c	[PowerPC][NFC] Regenerate test using script This test case ended up as a hybrid of generated checks and manually inserted checks. Regenerate using script to make it consistent. llvm-svn: 366659	2019-07-21 18:42:29 +00:00
Craig Topper	e6cd20ba53	[InstCombine] Update comment I missed in r366649. NFC llvm-svn: 366658	2019-07-21 16:15:03 +00:00
Simon Pilgrim	630be14ac6	[SmallBitVector] Fix bug in find_next_unset for small types with indices >=32 We were creating a bitmask from a shift of unsigned instead of uintptr_t, meaning we couldn't create masks for indices above 31. Noticed due to a MSVC analyzer warning. llvm-svn: 366657	2019-07-21 16:06:26 +00:00
Aditya Nandakumar	d7504a1569	[GISel]: Attach missing range metadata while translating G_LOADs https://reviews.llvm.org/D65048 Attach range information to G_LOAD when only defining one register. reviewed by: arsenm llvm-svn: 366656	2019-07-21 14:07:54 +00:00
David Green	c38899fc26	[ARM] Move MVE VPT block tests into the Thumb2 directory. NFC llvm-svn: 366655	2019-07-21 13:09:19 +00:00
Raphael Isemann	ca9dfdfaec	[lldb] Fix crash when looking up type coming from the ClangModuleDeclVendor Summary: We assume in LLDB that every type comes from an ASTContext with an associated ClangASTContext. However the types inside the ClangModuleDeclVendor don't have a ClangASTContext so we end up crashing whenever we create a CompilerType for one of these types. Simplest way to trigger this bug is to just look up NSObject from a module: (lldb) expr @import Foundation (lldb) type lookup NSObject Assertion failed: (m_type_system != nullptr), function CompilerType, file /Users/teemperor/llvm1/llvm-project/lldb/source/Symbol/CompilerType.cpp, line 39. This patch just creates a ClangASTContext for the ASTContext used by ClangModuleDeclVendor. Reviewers: davide, shafik Reviewed By: davide Subscribers: lldb-commits Tags: #lldb Differential Revision: https://reviews.llvm.org/D64989 llvm-svn: 366653	2019-07-21 10:31:13 +00:00
Roman Lebedev	8a431874e9	[NFC][InstCombine] Add a few extra srem-by-power-of-two tests - extra uses llvm-svn: 366652	2019-07-21 09:05:49 +00:00
Nathan Lanza	fe1b8a0911	[NativePDB] Make GetOrCreateDeclForUid return an lldb CompilerDecl We intend to make PdbAstBuilder abstract and implement PdbAstBuilderClang along with any other languages that wish to use PDBs. Thus, change GetOrCreateDeclForUid from returning a clang decl to a lldb_private::CompilerDecl. llvm-svn: 366650	2019-07-21 07:46:18 +00:00
Craig Topper	1d149d08d3	[InstCombine] Remove insertRangeTest code that handles the equality case. For equality, the function called getTrue/getFalse with the VT of the comparison input. But getTrue/getFalse need the boolean VT. So if this code ever executed, it would assert. I believe these cases are removed by InstSimplify so we don't get here. So this patch just fixes up an assert to exclude the equality possibility and removes the broken code. llvm-svn: 366649	2019-07-21 06:43:38 +00:00
Craig Topper	8fabdfe9fc	[InstCombine] Don't use AddOne/SubOne to see if two APInts are 1 apart. Use APInt operations instead. NFCI AddOne/SubOne create new Constant objects. That seems heavy for comparing ConstantInts which wrap APInts. Just do the math on on the APInts and compare them. llvm-svn: 366648	2019-07-21 05:26:05 +00:00
Yuanfang Chen	b4e06328fc	[lldb] Fix buildbot build fail caused by r366645 llvm-svn: 366647	2019-07-21 00:07:39 +00:00
Nico Weber	b910956202	gn build: Merge r366622 llvm-svn: 366646	2019-07-21 00:03:55 +00:00
Yuanfang Chen	ff22ec3d70	[Clang] Replace cc1 options '-mdisable-fp-elim' and '-momit-leaf-frame-pointer' with '-mframe-pointer' After D56351 and D64294, frame pointer handling is migrated to tri-state (all, non-leaf, none) in clang driver and on the function attribute. This patch makes the frame pointer handling cc1 option tri-state. Reviewers: chandlerc, rnk, t.p.northover, MaskRay Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D56353 llvm-svn: 366645	2019-07-20 22:50:50 +00:00
Petr Hosek	ae4c30a4be	[ELF] Support explicitly overriding relocation model in LTO lld currently selects the relocation model automatically depending on the link flags specified, but in some cases it'd be useful to allow explicitly overriding the relocation model using a flag. llvm-svn: 366644	2019-07-20 21:59:47 +00:00
Roman Lebedev	a2dd672c5f	[NFC][InstCombine] Autogenerate a few tests llvm-svn: 366643	2019-07-20 21:34:00 +00:00
Roman Lebedev	056640f8b3	[NFC][InstCombine] Add srem-by-signbit tests - still can fold to bittest https://rise4fun.com/Alive/IIeS llvm-svn: 366642	2019-07-20 21:33:50 +00:00
Roman Lebedev	7f0c23576f	[NFC][Codegen][X86][AArch64] Add "(x s% C) == 0" tests Much like with `urem`, the same optimization (albeit with slightly different algorithm) applies for the signed case, too. I'm simply copying the test coverage from `urem` case for now, i believe it should be (close to?) sufficient. llvm-svn: 366640	2019-07-20 19:25:44 +00:00
Saleem Abdulrasool	e97f2f33e7	build: allow the user to specify `llvm-tblgen` This follows the same pattern as Clang and permits the user to specify the tablegen to use via `-DLLVM_TABLEGEN=`. This allows for cross-compiling LLDB for a foreign target (e.g. Windows ARM64 on Windows X64). The LLVM dependency for LLDB in that case must be a Windows ARM64 build which cannot cross-compile llvm-tblgen due to the way that Visual Studio works. Instead, permit the user to have a separate tablegen build which can be used during the build. llvm-svn: 366639	2019-07-20 17:59:08 +00:00
Serge Guelton	a30a4a35ec	Fix asan infinite loop on undefined symbol Fix llvm#39641 Recommit of r366413 Differential Revision: https://reviews.llvm.org/D63877 > llvm-svn: 366632 llvm-svn: 366638	2019-07-20 17:44:30 +00:00
Roman Lebedev	cd9b19484b	[Codegen][SelectionDAG] X u% C == 0 fold: non-splat vector improvements Summary: Four things here: 1. Generalize the fold to handle non-splat divisors. Reasonably trivial. 2. Unban power-of-two divisors. I don't see any reason why they should be illegal. * There is no ban in Hacker's Delight * I think the ban came from the same bug that caused the miscompile in the base patch - in `floor((2^W - 1) / D)` we were dividing by `D0` instead of `D`, and we were ensuring that `D0` is not `1`, which made sense. 3. Unban `1` divisors. I no longer believe Hacker's Delight actually says that the fold is invalid for `D = 0`. Further considerations: * We know that * `(X u% 1) == 0` can be constant-folded to `1`, * `(X u% 1) != 0` can be constant-folded to `0`, * Also, we know that * `X u<= -1` can be constant-folded to `1`, * `X u> -1` can be constant-folded to `0`, * https://godbolt.org/z/7jnZJX https://rise4fun.com/Alive/oF6p * We know will end up with the following: `(setule/setugt (rotr (mul N, P), K), Q)` * Therefore, for given new DAG nodes and comparison predicates (`ule`/`ugt`), we will still produce the correct answer if: `Q` is a all-ones constant; and both `P` and `K` are anything other than `undef`. * The fold will indeed produce `Q = all-ones`. 4. Try to re-splat the `P` and `K` vectors - we don't care about their values for the lanes where divisor was `1`. Reviewers: RKSimon, hermord, craig.topper, spatel, xbolva00 Reviewed By: RKSimon Subscribers: hiraditya, javed.absar, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63963 llvm-svn: 366637	2019-07-20 16:33:15 +00:00
Simon Pilgrim	adec0f2252	[X86][SSE] Use PSADBW to improve vXi8 sum reduction (PR42674) As detailed on PR42674, we can reduce a vXi8 down until we have the final <8 x i8>, and then use PSADBW with zero, to sum those values. We then extract the bottom i8, discarding any overflow from the upper bits of the i16 result. llvm-svn: 366636	2019-07-20 15:20:11 +00:00
Simon Pilgrim	12b48b1607	Fix cppcheck reduce scope variable warnings. NFCI Move a couple of variables inside the block where they are actually needed. llvm-svn: 366635	2019-07-20 13:01:16 +00:00
Serge Guelton	7a3d4c15a7	Revert "Fix asan infinite loop on undefined symbol" This reverts commit `cbd28cd05b`. Buildbot fail: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/22434/steps/64-bit%20check-asan-dynamic/ llvm-svn: 366634	2019-07-20 13:00:12 +00:00
Florian Hahn	0a7faa4e3d	[Local] Zap blockaddress without users in ConstantFoldTerminator. If the blockaddress is not destoryed, the destination block will still be marked as having its address taken, limiting further transformations. I think there are other places where the dead blockaddress constants are kept around, I'll look into that as follow up. Reviewers: craig.topper, brzycki, davide Reviewed By: brzycki, davide Differential Revision: https://reviews.llvm.org/D64936 llvm-svn: 366633	2019-07-20 12:25:47 +00:00
Serge Guelton	cbd28cd05b	Fix asan infinite loop on undefined symbol Fix llvm#39641 Recommit of r366413 Differential Revision: https://reviews.llvm.org/D63877 llvm-svn: 366632	2019-07-20 12:01:18 +00:00
Stefan Granitz	fc0d766511	[CMake] Align debugserver with lldb-server on Darwin Summary: Make debugserver a tool like lldb-server, so it can be included/excluded via `LLDB_TOOL_DEBUGSERVER_BUILD`. This replaces the old `LLDB_NO_DEBUGSERVER` flag. Doing the same for darwin-debug while I am here. Reviewers: xiaobai, JDevlieghere, davide Reviewed By: xiaobai, JDevlieghere Subscribers: mgorny, lldb-commits, #lldb Tags: #lldb Differential Revision: https://reviews.llvm.org/D64994 llvm-svn: 366631	2019-07-20 11:18:31 +00:00
Richard Smith	6a38205039	[c++20] P1161R3: a[b,c] is deprecated. llvm-svn: 366630	2019-07-20 09:32:27 +00:00
Aaron Ballman	7017a6d3a3	Mark P1301R4 in C++2a as being SVN instead. llvm-svn: 366629	2019-07-20 08:57:08 +00:00
Aaron Ballman	1358af27c0	We support P1301R4 in C++2a as of r366626. llvm-svn: 366628	2019-07-20 08:24:56 +00:00
Richard Smith	1f8aa536f3	[cxx_status] Update status page for WG21 Cologne meeting motions. Note that many of the paper links will be dead until the post-meeting mailing is released. llvm-svn: 366627	2019-07-20 08:20:54 +00:00
Aaron Ballman	3bef014e7d	Implement P1301R4, which allows specifying an optional message on the [[nodiscard]] attribute. This also bumps the attribute feature test value and introduces the notion of a C++2a extension warning. llvm-svn: 366626	2019-07-20 07:56:34 +00:00
Jessica Paquette	41affad967	[GlobalISel][AArch64] Contract trivial same-size cross-bank copies into G_STOREs Sometimes, you can end up with cross-bank copies between same-sized GPRs and FPRs, which feed into G_STOREs. When these copies feed only into stores, they aren't necessary; we can just store using the original register bank. This provides some minor code size savings for some floating point SPEC benchmarks. (Around 0.2% for 453.povray and 450.soplex) This issue doesn't seem to show up due to regbankselect or anything similar. So, this patch introduces an early select function, `contractCrossBankCopyIntoStore` which performs the contraction when possible. The selector then continues normally and selects the correct store opcode, eliminating needless copies along the way. Differential Revision: https://reviews.llvm.org/D65024 llvm-svn: 366625	2019-07-20 01:55:35 +00:00
Guanzhong Chen	5204f7611f	[WebAssembly] Compute and export TLS block alignment Summary: Add immutable WASM global `__tls_align` which stores the alignment requirements of the TLS segment. Add `__builtin_wasm_tls_align()` intrinsic to get this alignment in Clang. The expected usage has now changed to: __wasm_init_tls(memalign(__builtin_wasm_tls_align(), __builtin_wasm_tls_size())); Reviewers: tlively, aheejin, sbc100, sunfish, alexcrichton Reviewed By: tlively Subscribers: dschuff, jgravelle-google, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D65028 llvm-svn: 366624	2019-07-19 23:34:16 +00:00
Teresa Johnson	604f802fd3	[LTO] Always mark regular LTO units with EnableSplitLTOUnit=1 Summary: Regular LTO modules do not need LTO Unit splitting, only ThinLTO does (they must be consistently split into regular and Thin units for optimizations such as whole program devirtualization and lower type tests). In order to avoid spurious errors from LTO when combining with split ThinLTO modules, always set this flag for regular LTO modules. Reviewers: pcc Subscribers: mehdi_amini, Prazek, inglorion, steven_wu, dexonsmith, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D65009 llvm-svn: 366623	2019-07-19 23:02:58 +00:00
Daniel Sanders	578e8fa833	Re-commit: r366610 and r366612: Expand pseudo-components before embedding in llvm-config There were two main problems: * The 'nativecodegen' pseudo-component was unconditionally adding ${native_tgt}CodeGen even though it conditionally added ${native_tgt}Info and ${native_tgt}Desc. This has been fixed by making ${native_tgt}CodeGen conditional too * The 'all' pseudo-component was causing library names like LLVMLLVMDemangle as the expansion was to a library name and not a component. There doesn't seem to be a list of available components anywhere so this has been fixed by moving the expansion of 'all' back where it was before. This manifested in different ways on different builders but it was the same root cause llvm-svn: 366622	2019-07-19 22:46:47 +00:00
Matt Arsenault	f3bfb85bce	AMDGPU/GlobalISel: Legalize GEP for other 32-bit address spaces llvm-svn: 366621	2019-07-19 22:28:44 +00:00
Denis Bakhvalov	a29002e59b	[NFC] Remove unused variable Change-Id: I5aee24dcdf6eebfbf788e52be22463387f23d927 llvm-svn: 366620	2019-07-19 21:59:42 +00:00
Stanislav Mekhanoshin	05d9e6a2a3	[AMDGPU] Autogenerate register sequences in tuples Differential Revision: https://reviews.llvm.org/D65007 llvm-svn: 366619	2019-07-19 21:43:42 +00:00
Matthew Voss	407e837540	Revert "Fix asan infinite loop on undefined symbol" This reverts commit `8e46275488`. This was failing on sanitizer-x86_64-linux and our internal CI. llvm-svn: 366618	2019-07-19 21:41:07 +00:00
Sunil Srivastava	f4038e75d2	Disallow most calling convention attributes on PS4 PS4 now only allows "cdecl", and its equivalent on PS4, "sysv_abi". Differential Revision: https://reviews.llvm.org/D64780 llvm-svn: 366617	2019-07-19 21:38:34 +00:00
Stanislav Mekhanoshin	7b5a54e369	[AMDGPU] Fixed occupancy calculation for gfx10 Differential Revision: https://reviews.llvm.org/D65010 llvm-svn: 366616	2019-07-19 21:29:51 +00:00
Daniel Sanders	34da8dfba0	Revert r366610 and r366612: Expand pseudo-components before embedding in llvm-config Some targets are missing LLVMDemangle, one is adding the LLVM prefix twice, and two are hitting the very error this patch fixes for my target. Reverting while I work through the reports. llvm-svn: 366615	2019-07-19 21:11:05 +00:00
Craig Topper	3a3c58f045	[InstCombine] Fix copy/paste mistake in the test cases I added for PR42691. NFC llvm-svn: 366614	2019-07-19 21:09:21 +00:00

1 2 3 4 5 ...

322265 Commits All Branches Search

322265 Commits

All Branches