llvm-project

Commit Graph

Author	SHA1	Message	Date
Roman Lebedev	cae2d871c0	[NFCI][Thumb2] Regenerate MVE tests i missed in `59560e8589`	2020-12-14 21:01:00 +03:00
Tony	90b951dd68	[NFC] Remove trailing whitespace in llvm/CMakeLists.txt Differential Revision: https://reviews.llvm.org/D93234	2020-12-14 17:48:16 +00:00
Cameron Desrochers	d784845de1	[TableGen] Fixed 64-bit filters being sliced to 32 bits in FixedLenDecoderEmitter When using the FixedLenDecoderEmitter, llvm-tblgen emits tables with (OPC_ExtractField, OPC_ExtractFilterValue) opcode sequences to match the contiguous fixed bits of a given instruction's encoding. This encoding is represented in a 64-bit integer. However, the filter values were represented in a 32-bit integer. As such, instructions with fixed 64-bit encodings resulted in a table with an OPC_ExtractField for all 64 bits, followed by an OPC_ExtractFilterValue containing just the low 32 bits of their encoding, causing the filter never to match. The exact point at which the slicing occurred was during the map insertion at line 630. Differential Revision: https://reviews.llvm.org/D92423	2020-12-14 12:42:35 -05:00
LemonBoy	92c6141ce6	lld/ELF: Parse MSP430 BFD/emulation names Follow the naming set by TI's own GCC-based toolchain. Also, force the `osabi` field to `ELFOSABI_STANDALONE`, this matches GNU LD's output (the patching is done in `elf32_msp430_post_process_headers`). Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D92931	2020-12-14 09:38:12 -08:00
Nemanja Ivanovic	bfdc19e778	[PowerPC] Restore stack ptr from frame ptr with setjmp If a function happens to: - call setjmp - do a 16-byte stack allocation - call a function that sets up a stack frame and longjmp's back The stack pointer that is restored by setjmp will no longer point to a valid back chain. According to the ABI, stack accesses in such a function are to be frame pointer based - so it is an error (quite obviously) to restore the stack from the back chain. We already restore the stack from the frame pointer when there are calls to fast_cc functions. We just need to also do that when there are calls to setjmp. This patch simply does that. This was pointed out by the Julia team. Differential revision: https://reviews.llvm.org/D92906	2020-12-14 11:34:16 -06:00
ergawy	ecab63894b	[MLIR][SPIRV] Refactoring serialization and deserialization This commit splits SPIR-V's serialization and deserialization code into separate libraries. The motivation being that the serializer is used more often than the deserializer and therefore lumping them together unnecessarily increases binary size for the most common case. This commit also moves these libraries into the Target/ directory to follow MLIR convention. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D91548	2020-12-14 12:28:16 -05:00
Gabor Marton	68f53960e1	[ASTImporter] Fix import of a typedef that has an attribute The import of a typedef with an attribute uses clang::Decl::setAttrs(). But that needs the ASTContext which we can get only from the TranslationUnitDecl. But we can get the TUDecl only through the DeclContext, which is not set by the time of the setAttrs call. Fix: import the attributes only after the DC is surely imported. Btw, having the attribute import initiated from GetImportedOrCreateDecl was fundamentally flawed. Now that is implicitly fixed. Differential Revision: https://reviews.llvm.org/D92962	2020-12-14 18:27:05 +01:00
Roman Lebedev	59560e8589	[SimplifyCFG] FoldBranchToCommonDest(): temporarily put back restrictions on liveout uses of bonus instructions (PR48450) Even though `d38205144f` was mostly a correct fix for the external non-PHI users, it's not a generally correct fix, because the 'placeholder' values in those trivial PHI's we create shouldn't be always 'undef', but the PHI itself for the backedges, else we end up with wrong value, as the `@pr48450_2` test shows. But we can't just do that, because we can't check that the PHI can be its own incoming value when coming from certain predecessor, because we don't have a dominator tree. So until we can address this correctness problem properly, ensure that we don't perform the transformation if there are such problematic external uses. Making dominator tree available there is going to be involved, since `-simplifycfg` pass currently does not preserve/update domtree...	2020-12-14 20:14:31 +03:00
Roman Lebedev	e8360a8e1e	[NFC][SimplifyCFG] FoldBranchToCommonDest(): pull out 'common successor' into a variable Makes it easier to use it elsewhere	2020-12-14 20:14:31 +03:00
Roman Lebedev	effbbdec6e	[NFC][SimplifyCFG] Add another miscompiled test for PR48450	2020-12-14 20:14:31 +03:00
Arthur O'Dwyer	3c8e31e17b	[libc++] ADL-proof <functional> by adding _VSTD:: qualification on calls. - std::reference_wrapper - std::function - std::mem_fn While I'm here, remove _VSTD:: qualification from calls to `declval` because it takes no arguments and thus isn't susceptible to ADL. Differential Revision: https://reviews.llvm.org/D92884	2020-12-14 12:08:34 -05:00
Arthur O'Dwyer	be4c657b01	[libc++] Consistently replace `::new(__p) T` with `::new ((void*)__p) T`. NFCI. Everywhere, normalize the whitespace to `::new (EXPR) T`. Everywhere, normalize the spelling of the cast to `(void*)EXPR`. Without the cast to `(void*)`, the expression triggers ADL on GCC. (I think this is a GCC bug: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98249) Even if it doesn't trigger ADL, it still seems incorrect to use any argument that's not exactly `(void*)` because that opens the possibility of overload resolution picking a user-defined overload of `operator new`, which would be wrong. Differential Revision: https://reviews.llvm.org/D93153	2020-12-14 12:08:34 -05:00
Sylvain Audi	640ad76911	[clang-scan-deps] Support clang-cl clang-scan-deps contains some command line parsing and modifications. This patch adds support for clang-cl command options. Differential Revision: https://reviews.llvm.org/D92191	2020-12-14 12:06:05 -05:00
Siva Chandra Reddy	9ad2091e78	[libc][Obvious] Include <fenv.h> from DummyFenv.h.	2020-12-14 08:51:54 -08:00
Stanislav Mekhanoshin	87d7757bbe	[SLP] Control maximum vectorization factor from TTI D82227 has added a proper check to limit PHI vectorization to the maximum vector register size. That unfortunately resulted in at least a couple of regressions on SystemZ and x86. This change reverts PHI handling from D82227 and replaces it with a more general check in SLPVectorizerPass::tryToVectorizeList(). Moved to tryToVectorizeList() it allows to restart vectorization if initial chunk fails. However, this function is more general and handles not only PHI but everything which SLP handles. If vectorization factor would be limited to maximum vector register size it would limit much more vectorization than before leading to further regressions. Therefore a new TTI callback getMaximumVF() is added with the default 0 to preserve current behavior and limit nothing. Then targets can decide what is better for them. The callback gets ElementSize just like a similar getMinimumVF() function and the main opcode of the chain. The latter is to avoid regressions at least on the AMDGPU. We can have loads and stores up to 128 bit wide, and <2 x 16> bit vector math on some subtargets, where the rest shall not be vectorized. I.e. we need to differentiate based on the element size and operation itself. Differential Revision: https://reviews.llvm.org/D92059	2020-12-14 08:49:40 -08:00
Raul Tambre	c21df2a79c	Revert "Re-apply "[CMake][compiler-rt][AArch64] Avoid preprocessing LSE builtins separately"" This reverts commit 03ebe1937192c247c4a7b8ec19dde2cf9845c914. It's still breaking bots, e.g. http://green.lab.llvm.org/green/job/clang-stage1-RA/17027/console although it doesn't change any actual code. The compile errors don't make much sense either. Revert for now. Differential Revision: https://reviews.llvm.org/D93228	2020-12-14 18:43:55 +02:00
Jay Foad	07e92e6b60	[AMDGPU] Make use of HasSMemRealTime predicate. NFC. We have this subtarget feature so it makes sense to use it here. This is NFC because it's always defined by default on GFX8+. Differential Revision: https://reviews.llvm.org/D93202	2020-12-14 16:34:57 +00:00
Kazushi (Jam) Marukawa	aefedb1707	[VE] Add logical mask intrinsic instructions Add andm, orm, xorm, eqvm, nndm, negm, pcvm, lzvm, and tovm intrinsic instructions, a few pseudo instructions to expand logical intrinsic using VM512, a mechanism to expand such pseudo instructions, and regression tests. Also, assign vector mask types and vector mask register classes correctly. This is required to use VM512 registers as function arguments. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93093	2020-12-15 01:34:31 +09:00
Simon Pilgrim	5f5a2547c1	[X86] LowerBUILD_VECTOR - track zero/nonzero elements with APInt masks. NFCI. Prep work for undef/zero 'upper elements' handling as proposed in D92645.	2020-12-14 16:28:45 +00:00
Marek Kurdej	59c72a7012	[libc++] [P1164] Add tests for create_directories. NFC. That's a follow-up patch after D92769. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D93026	2020-12-14 17:27:18 +01:00
Kazushi (Jam) Marukawa	c9213e1b29	[VE] Correct addRegisterClass calls Correct addRegisterClass calls for vector mask registers. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93212	2020-12-15 01:16:56 +09:00
Andrzej Warzynski	6bbbe4a574	[flang][driver] Fix a small bug (auto vs auto&) This bug hasn't affected us yet as our usage is too basic, i.e. we don't rely on the defaults provided by `SetDefaultFortranOpts` just yet. This will change shortly.	2020-12-14 16:10:07 +00:00
diggerlin	15f2d4f198	[AIX] Fixed "comparison of unsigned expression >= 0 is always true" gcc warnings. Summary: Fixed "comparison of unsigned expression >= 0 is always true" gcc warnings. http://lab.llvm.org:8011/#/builders/5/builds/2407/steps/2/logs/stdio The error was caused by patch https://reviews.llvm.org/D92398	2020-12-14 11:08:40 -05:00
Markus Lavin	2a6782bb9f	Reland [DebugInfo] Improve dbg preservation in LSR. Use SCEV to salvage additional @llvm.dbg.value that have turned into referencing undef after transformation (and traditional salvageDebugInfo). Before rewrite (but after introduction of new induction variables) use SCEV to compute an equivalent set of values for each @llvm.dbg.value in the loop body (among the loop header PHI-nodes). After rewrite (and dead PHI elimination) update those @llvm.dbg.value now referencing undef by picking a remaining value from its equivalence set. Allow match with offset by inserting compensation code in the DIExpression. Fixes : PR38815 Differential Revision: https://reviews.llvm.org/D87494	2020-12-14 16:15:18 +01:00
Arthur O'Dwyer	2664f5d436	generate_header_tests.py: Sort the header files ASCIIbetically. Otherwise they come out in random (inode?) order. Also `chmod +x` the generator, and re-run it. Somehow on Marek's machine it produced \r\n line endings?! Open all files with `newline='\n'` so that (if the Python3 docs are correct) that won't happen again. Differential Revision: https://reviews.llvm.org/D93137	2020-12-14 09:56:07 -05:00
Arthur O'Dwyer	b6f1917415	[libc++] Fix some one-off typos in comments. NFCI.	2020-12-14 09:54:58 -05:00
Arthur O'Dwyer	ce9ac549c9	[libc++] Remove __is_construct::__nat. NFCI. This type has been unused since commit `5b4cc84b87`.	2020-12-14 09:54:58 -05:00
Arthur O'Dwyer	e9eb99999f	[libc++] s/insertible/insertable/g. NFCI.	2020-12-14 09:54:58 -05:00
Arthur O'Dwyer	1d7c39e14e	[libc++] s/Birdirectional/Bidirectional/g. NFCI.	2020-12-14 09:54:57 -05:00
Raul Tambre	d0797e62fa	Re-apply "[CMake][compiler-rt][AArch64] Avoid preprocessing LSE builtins separately" `aa772fc85e` (D92530) has landed fixing Apple builds. Previous quick-fix `d9697c2e6b` (D93198) included in this commit. Invoking the preprocessor ourselves is fragile and would require us to replicate CMake's handling of definitions, compiler flags, etc for proper compatibility. In my toolchain builds this notably resulted in a bunch of warnings from unused flags as my CMAKE_C_FLAGS includes CPU-specific optimization options. Notably this part was already duplicating the logic for VISIBILITY_HIDDEN define. Instead, symlink the files and set the proper set of defines on each. This should also be faster as we avoid invoking the compiler multiple times. Fixes https://llvm.org/PR48494 Differential Revision: https://reviews.llvm.org/D93211	2020-12-14 16:45:48 +02:00
Kuba Mracek	aa772fc85e	[compiler-rt] [builtins] Make lse.S compile on Darwin Reviewed By: ilinpv Differential Revision: https://reviews.llvm.org/D92530	2020-12-14 16:38:48 +02:00
Florian Hahn	e42e5263bd	[VPlan] Make VPWidenMemoryInstructionRecipe a VPDef. This patch updates VPWidenMemoryInstructionRecipe to use VPDef to manage the value it produces instead of inheriting from VPValue. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D90563	2020-12-14 14:13:59 +00:00
David Spickett	aabaca3363	[llvm-objdump] Use "--" for long options in --help text Single dash for these options is not recognised. Changes found by running this on the --help output and the user guide: grep -e ' -[a-zA-Z]\{2,\}' The user guide was updated in https://reviews.llvm.org/D92305 so no change there. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D92310	2020-12-14 13:11:29 +00:00
Raphael Isemann	22ccdb7870	Revert "Consider reference, pointer, and pointer-to-member TemplateArguments to be different if they have different types." This reverts commit `05cdf4acf4`. It breaks stage-2 compilation of LLVM, see https://reviews.llvm.org/D91488#2451534	2020-12-14 14:03:38 +01:00
Anton Afanasyev	fac7c7ec3c	[SLP] Fix vector element size for the store chains Vector element size could be different for different store chains. This patch prevents wrong computation of maximum number of elements for that case. Differential Revision: https://reviews.llvm.org/D93192	2020-12-14 15:51:43 +03:00
Simon Pilgrim	6c8ded0d8c	[TableGen] Don't dereference from dyn_cast<> - use cast<> instead. NFCI. dyn_cast<> can return null if the cast fails, resulting in null dereferences and static analyzer warnings. We should use cast<> instead.	2020-12-14 12:12:08 +00:00
Simon Pilgrim	5a02bf4f95	[IRCE] Add test case for PR48051	2020-12-14 12:01:19 +00:00
Kerry McLaughlin	c5ced82c8e	[SVE][CodeGen] Lower scalable floating-point vector reductions Changes in this patch: - Minor changes to the LowerVECREDUCE_SEQ_FADD function added by @cameron.mcinally to also work for scalable types - Added TableGen patterns for FP reductions with unpacked types (nxv2f16, nxv4f16 & nxv2f32) - Asserts added to expandFMINNUM_FMAXNUM & expandVecReduceSeq for scalable types Reviewed By: cameron.mcinally Differential Revision: https://reviews.llvm.org/D93050	2020-12-14 11:45:42 +00:00
David Green	1de3e7fd62	[ARM] Improve handling of empty VPT blocks in tail predicated loops A vpt block that just contains either VPST;VCTP or VPT;VCTP, once the VCTP is removed will become invalid. This fixed the first by removing the now empty block and bails out for the second, as we have no simple way of converting a VPT to a VCMP. Differential Revision: https://reviews.llvm.org/D92369	2020-12-14 11:17:01 +00:00
Carl Ritson	62c246eda2	[AMDGPU][NFC] Rename opsel/opsel_hi/neg_lo/neg_hi with suffix 0 These parameters set a default value of 0, so I believe they should include a 0 suffix. This allows for versions which do not set a default value in future. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D93187	2020-12-14 20:01:56 +09:00
Carl Ritson	af4570cd3a	[AMDGPU][NFC] Remove unused VOP3Mods0Clamp This is unused and the selection function does not exist. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D93188	2020-12-14 20:00:58 +09:00
Raul Tambre	55f07a3400	[XRay] Remove unnecessary <x86intrin.h> include It hasn't been necessary since commit `4d4ed0e288` (D43278). Reviewed By: dberris Differential Revision: https://reviews.llvm.org/D93196	2020-12-14 12:36:35 +02:00
Frederik Gossen	75d9a46090	[MLIR] Add atan and atan2 lowerings to CUDA intrinsics Differential Revision: https://reviews.llvm.org/D93124	2020-12-14 10:45:28 +01:00
Sebastian Neubauer	5733167f54	[AMDGPU] Mark amdgpu_gfx functions as module entry function - Allows lds allocations - Writes resource usage into COMPUTE_PGM_RSRC1 registers in PAL metadata Differential Revision: https://reviews.llvm.org/D92946	2020-12-14 10:43:39 +01:00
Frederik Gossen	1c6bc2c0b5	[MLIR] Add lowerings for atan and atan2 to ROCDL intrinsics Differential Revision: https://reviews.llvm.org/D93123	2020-12-14 10:43:19 +01:00
Raul Tambre	617cd01a4b	Revert "[CMake][compiler-rt][AArch64] Avoid preprocessing LSE builtins separately" Causing issues on Apple buildbots. http://green.lab.llvm.org/green/job/clang-stage1-RA/17019/console This reverts commit `33b740f8dc`. This reverts commit `d9697c2e6b`. Differential Revision: https://reviews.llvm.org/D93199	2020-12-14 11:42:28 +02:00
Raul Tambre	d9697c2e6b	[compiler-rt][CMake] Define HAS_ASM_LSE on Apple if available Should hopefully fix `33b740f8dc` (D93178) failing on bots. Differential Revision: https://reviews.llvm.org/D93198	2020-12-14 11:26:24 +02:00
Jan Svoboda	16aa00b622	[clang][cli] Port FileSystem options to new option parsing system Depends on D84187 Reviewed By: dexonsmith Original patch by Daniel Grumberg. Differential Revision: https://reviews.llvm.org/D84188	2020-12-14 10:17:23 +01:00
Georgii Rymar	98a4289810	[llvm-readobj] - For SHT_REL relocations, don't display an addend. This is https://bugs.llvm.org/show_bug.cgi?id=44257. In LLVM style we always print `0` as addend when dumping SHT_REL relocations. It is confusing, this patch stops printing it as the first comment on the bug page suggests. Differential revision: https://reviews.llvm.org/D93033	2020-12-14 12:03:00 +03:00
Jan Svoboda	e2fc85c69b	[clang][cli] Better defaults for MarshallingInfoString Depends on D84018 Reviewed By: Bigcheese Original patch by Daniel Grumberg. Differential Revision: https://reviews.llvm.org/D84185	2020-12-14 09:59:56 +01:00
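
The truncation described in `d784845de1` can be illustrated in isolation. This is a minimal sketch, not the FixedLenDecoderEmitter code: the helper names `sliceTo32` and `filterMatches` and the sample encoding are hypothetical, chosen only to show why a 64-bit extracted field cannot match a filter value that was stored in a 32-bit integer once any high bit is set.

```cpp
#include <cstdint>

// Hypothetical stand-in for storing a filter value in a 32-bit integer:
// the upper 32 bits of the encoding are silently dropped.
inline uint32_t sliceTo32(uint64_t Encoding) {
  return static_cast<uint32_t>(Encoding);
}

// Compares the full 64-bit extracted field against a filter value.
// A 32-bit filter value is zero-extended to 64 bits for the comparison.
inline bool filterMatches(uint64_t ExtractedField, uint64_t FilterValue) {
  return ExtractedField == FilterValue;
}

// Made-up 64-bit encoding with bits set in the upper half.
constexpr uint64_t SampleEncoding = 0xABCD000012345678ULL;
```

With these definitions, `filterMatches(SampleEncoding, sliceTo32(SampleEncoding))` is false because the sliced value keeps only the low 32 bits, while comparing against the full 64-bit value succeeds.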

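The `auto` vs `auto&` class of bug named in `6bbbe4a574` can be sketched as follows. The commit message does not show the affected code, so everything here is an assumption made for illustration: `Options`, `Invocation`, `getOpts`, and the two setter functions are hypothetical and are not flang's API. The point is only that `auto` deduces a value type and therefore copies the object returned by reference, so writes to the copy are lost.

```cpp
// Hypothetical option bundle and owner; not the flang driver's types.
struct Options {
  bool fixedForm = false;
};

struct Invocation {
  Options opts;
  Options &getOpts() { return opts; }
};

// Buggy pattern: `auto` drops the reference, so `o` is a copy and the
// write never reaches `inv.opts`.
inline void setDefaultByCopy(Invocation &inv) {
  auto o = inv.getOpts();
  o.fixedForm = true;
}

// Fixed pattern: `auto &` binds to the original object.
inline void setDefaultByRef(Invocation &inv) {
  auto &o = inv.getOpts();
  o.fixedForm = true;
}
```

After `setDefaultByCopy(inv)` the invocation's `opts.fixedForm` is still false; after `setDefaultByRef(inv)` it is true.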
374842 Commits, All Branches