llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	7c89d046ea	Use SmallPtrSet instead of SmallSet in places where we iterate over the set. SmallSet forwards to SmallPtrSet for pointer types. SmallPtrSet supports iteration, but a normal SmallSet doesn't. So if it wasn't for the forwarding, this wouldn't work. These places were found by hiding the begin/end methods in the SmallSet forwarding. llvm-svn: 334339	2018-06-09 00:30:45 +00:00
Alex Shlyapnikov	643600566d	[MSan] Report proper error on allocator failures instead of CHECK(0)-ing Summary: Following up on and complementing D44404. Currently many allocator specific errors (OOM, for example) are reported as a text message and CHECK(0) termination, not stack, no details, not too helpful nor informative. To improve the situation, detailed and structured errors were defined and reported under the appropriate conditions. Reviewers: eugenis Subscribers: srhines, delcypher, #sanitizers, llvm-commits Differential Revision: https://reviews.llvm.org/D47793 llvm-svn: 334338	2018-06-08 23:31:42 +00:00
Daniel Sanders	646d06fce0	[tablegen] Improve performance on *GenRegisterInfo.inc by replacing SparseVector with BitVector. NFC Summary: Generating X86GenRegisterInfo.inc and AArch64GenRegisterInfo.inc is 8-9% faster on my build. Reviewers: bogner, javed.absar Reviewed By: bogner Subscribers: llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D47907 llvm-svn: 334337	2018-06-08 23:12:29 +00:00
Alex Langford	6500c693a5	Remove more dead code from NativeProcessLinux This should have been removed in r334333. llvm-svn: 334336	2018-06-08 22:28:41 +00:00
Jonathan Peyton	e92ae43be8	[OpenMP] Fix formatting issues in kmp_stats.h llvm-svn: 334335	2018-06-08 22:27:53 +00:00
Craig Topper	a4b20e1869	[X86] Add avx512 feature flags to __builtin_ia32_select*. There are many masked intrinsics that just wrap a select around a legacy intrinsic from a pre-avx512 instruciton set. If that intrinsic is implemented as a macro, nothing prevents it from being used when only the older feature was enabled. This likely generates very poor code since we don't have a good way to convert from the scalar masked type used by the intrinsic into a vector control for a legacy blend instruction. If we even have a blend instruction to use. By adding a feature to the select builtins we can prevent and diagnose misuse of these intrinsics. llvm-svn: 334334	2018-06-08 22:19:42 +00:00
Alex Langford	bd620effc8	Delete dead code in NativeProcessLinux As far as I can tell, this code has always been guarded by `#if 0`. If this is useful code, it can be added back. llvm-svn: 334333	2018-06-08 22:14:29 +00:00
Alex Shlyapnikov	84f8ec5c72	[Sanitizers] Use proper substitution in common allocator test. Use %env_tool_ops instead of %tool_options. llvm-svn: 334332	2018-06-08 21:58:22 +00:00
Craig Topper	88097d9355	[X86] Add back some masked vector truncate builtins. Custom IRgen a a few others. I'd like to make the select builtins require an avx512f, avx512bw, or avx512vl fature to match what is normally required to get masking. Truncate is special in that there are instructions with a 128/256-bit masked result even without avx512vl. By using special buitlins we can emit a select without using the 128/256-bit select builtins. llvm-svn: 334331	2018-06-08 21:50:08 +00:00
Craig Topper	5f50f33806	[X86] Fold masking into subvector extract builtins. I'm looking into making the select builtins require avx512f, avx512bw, or avx512vl since masking operations generally require those features. The extract builtins are funny because the 512-bit versions return a 128 or 256 bit vector with masking even when avx512vl is not supported. llvm-svn: 334330	2018-06-08 21:50:07 +00:00
Walter Lee	5a780ee9a9	[asan, myriad] Use local pool for new/delete when ASan run-time is not up This can happen on Myriad RTEMS so needs to be handled. Differential Revision: https://reviews.llvm.org/D47916 llvm-svn: 334329	2018-06-08 21:49:38 +00:00
Craig Topper	e53fa05763	[X86] Remove GCCBuiltin from some intrinsics so we can do custom IR generation from clang. llvm-svn: 334328	2018-06-08 21:49:09 +00:00
Krzysztof Parzyszek	fb3ed4f409	[Polly] Fix a testcase after LLVM commit r334318 ScalarEvolution has become slightly more intelligent, so obfuscate the exit condition in the testcase some more to keep it working. llvm-svn: 334327	2018-06-08 21:39:55 +00:00
Eli Friedman	c065bb2953	[LangRef] fptosi and fptoui return poison on overflow. I think we assume poison, not undef, for certain transforms we currently do. In any case, we should clarify the language here. (This sort of conversion is undefined behavior according to the C and C++ standards. And in practice, hardware implementations handle overflow inconsistently, so it would be difficult to define the result here.) Differential Revision: https://reviews.llvm.org/D47851 llvm-svn: 334326	2018-06-08 21:33:33 +00:00
Eli Friedman	2c7a81b2f8	[LangRef] insertelement/extractelement return poison for out of range. We need to clarify the language here. I think poison makes more sense than undef, since it's an undefined operation rather than uninitialized memory. I don't think anything depends on the difference at the moment, though. Differential Revision: https://reviews.llvm.org/D47859 llvm-svn: 334325	2018-06-08 21:23:09 +00:00
Ryan Prichard	70783e5f35	Test commit: remove a blank line Test commit access llvm-svn: 334324	2018-06-08 21:21:55 +00:00
Sam McCall	8330589d59	[clangd] Deduplicate CompletionItemKind conversion. llvm-svn: 334323	2018-06-08 21:17:19 +00:00
Eli Friedman	864df22307	[ARM] Allow CMPZ transforms even if the input has multiple uses. It looks like this got left in by accident in r289794; I can't think of any reason this check would be necessary. (Maybe it was meant to be a check that the AND has one use? But we check that a few lines earlier.) Differential Revision: https://reviews.llvm.org/D47921 llvm-svn: 334322	2018-06-08 21:16:56 +00:00
Florian Hahn	79510be7cf	[SmallSet] Add some simple unit tests. Reviewers: craig.topper, dblaikie Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D47940 llvm-svn: 334321	2018-06-08 21:14:49 +00:00
Alex Langford	9391061fd7	Delete some dead code llvm-svn: 334320	2018-06-08 21:13:26 +00:00
Vitaly Buka	683e2d636d	[asan] Re-enable no_asan_gen_globals for 32-bit PR22682. Bug in gold was fixed 3 years ago. llvm-svn: 334319	2018-06-08 21:04:44 +00:00
Krzysztof Parzyszek	b10ea39270	[SCEV] Look through zero-extends in howFarToZero An expression like (zext i2 {(trunc i32 (1 + %B) to i2),+,1}<%while.body> to i32) will become zero exactly when the nested value becomes zero in its type. Strip injective operations from the input value in howFarToZero to make the value simpler. Differential Revision: https://reviews.llvm.org/D47951 llvm-svn: 334318	2018-06-08 20:43:07 +00:00
Davide Italiano	189c2cf114	[InstCombine] Skip dbg.value(s) when looking at stack{save,restore}. Fixes PR37713. llvm-svn: 334317	2018-06-08 20:42:36 +00:00
Alex Shlyapnikov	dcf0097962	[Sanitizers] Check alignment != 0 for aligned_alloc and posix_memalign Summary: Move the corresponding tests to the common folder (as all of the sanitizer allocators will support this feature soon) and add the checks specific to aligned_alloc to ASan and LSan allocators. Reviewers: vitalybuka Subscribers: srhines, kubamracek, delcypher, #sanitizers, llvm-commits Differential Revision: https://reviews.llvm.org/D47924 llvm-svn: 334316	2018-06-08 20:40:35 +00:00
Sam McCall	9e5eafbaa8	[clangd] Fix fuzzer after r333993 llvm-svn: 334315	2018-06-08 20:25:05 +00:00
Sanjay Patel	afcf39e1f9	[InstCombine] add llvm.assume + debuginfo test (PR37726); NFC llvm-svn: 334314	2018-06-08 18:47:33 +00:00
Reid Kleckner	0bab222084	[asan] Instrument comdat globals on COFF targets Summary: If we can use comdats, then we can make it so that the global metadata is thrown away if the prevailing definition of the global was uninstrumented. I have only tested this on COFF targets, but in theory, there is no reason that we cannot also do this for ELF. This will allow us to re-enable string merging with ASan on Windows, reducing the binary size cost of ASan on Windows. Reviewers: eugenis, vitalybuka Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47841 llvm-svn: 334313	2018-06-08 18:33:16 +00:00
Sanjay Patel	498564e6fb	[DAGCombiner] clean up comments; NFC llvm-svn: 334312	2018-06-08 18:00:46 +00:00
Craig Topper	03f4f04b91	[X86] Add builtins for vpermq/vpermpd instructions to enable target feature checking. llvm-svn: 334311	2018-06-08 18:00:25 +00:00
Craig Topper	9d3962f4f1	[X86] Change immediate type for some builtins from char to int. These builtins are all handled by CGBuiltin.cpp so it doesn't much matter what the immediate type is, but int matches the intrinsic spec. llvm-svn: 334310	2018-06-08 18:00:22 +00:00
Simon Pilgrim	5c32989c91	[X86][SSE] Support v8i16/v16i16 rotations Extension to D46954 (PR37426), this patch adds support for v8i16/v16i16 rotations in a similar manner - the conversion of the shift/rotate amount to a multiplication factor and the use of PMULLW to shift left and PMULHUW (ISD::MULHU) to shift the wrapped bits back around to be ORd together. Differential Revision: https://reviews.llvm.org/D47822 llvm-svn: 334309	2018-06-08 17:58:42 +00:00
Sanjay Patel	70314bd61c	[x86] add tests for node-level FMF; NFC These cases should be optimized using the change from D47911. llvm-svn: 334308	2018-06-08 17:54:28 +00:00
Sanjay Patel	9995a00a94	[x86] regenerate test checks; NFC llvm-svn: 334307	2018-06-08 17:42:35 +00:00
Michael Berg	bf90d1f263	Utilize new SDNode flag functionality to expand current support for fsub Summary: This patch originated from D46562 and is a proper subset, with some issues addressed for fsub. Reviewers: spatel, hfinkel, wristow, arsenm Reviewed By: spatel Subscribers: wdng Differential Revision: https://reviews.llvm.org/D47910 llvm-svn: 334306	2018-06-08 17:39:50 +00:00
Florian Hahn	45e5d5b4be	[VPlan] Move recipe construction to VPRecipeBuilder. This patch moves the recipe-creation functions out of LoopVectorizationPlanner, which should do the high-level orchestration of the transformations. Reviewers: dcaballe, rengolin, hsaito, Ayal Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D47595 llvm-svn: 334305	2018-06-08 17:30:45 +00:00
Zaara Syeda	4455b37666	[PPC64] Add support for local-exec TLS model This patch adds the relocations needed support the local-exec TLS model: R_PPC64_TPREL16 R_PPC64_TPREL16_HA R_PPC64_TPREL16_LO R_PPC64_TPREL16_HI R_PPC64_TPREL16_DS R_PPC64_TPREL16_LO_DS R_PPC64_TPREL16_HIGHER R_PPC64_TPREL16_HIGHERA R_PPC64_TPREL16_HIGHEST R_PPC64_TPREL16_HIGHESTA Differential Revision: https://reviews.llvm.org/D47598 llvm-svn: 334304	2018-06-08 17:04:09 +00:00
Simon Pilgrim	89deac6694	[X86][BtVer2] Add support for all SUB/XOR 32/64 scalar instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions), these instructions are dependency breaking and fast-path zero the destination register (and appropriate EFLAGS bits). llvm-svn: 334303	2018-06-08 17:00:45 +00:00
Simon Pilgrim	59e915c691	[X86] Fix schedule-x86_64.s tests to use different registers in reg-reg cases Same fix as rL334110: I noticed while working on zero-idiom + dependency-breaking support (PR36671) that most of our binary instruction schedule tests were reusing the same src registers, which would cause the tests to fail once we enable scalar zero-idiom support on btver2. llvm-svn: 334302	2018-06-08 16:40:15 +00:00
Daniil Fukalov	c9a098b314	[AMDGPU] Inline asm - added i16, half and i128 types support AMDGPU inline assembler support i16, half and i128 typed variables in constraints, but they were reported as error. Needed to fix https://github.com/RadeonOpenCompute/ROCm/issues/341, e.g. to be able to load with global_load_dwordx4 to a 128bit integer variable Differential Revision: https://reviews.llvm.org/D44920 llvm-svn: 334301	2018-06-08 16:29:04 +00:00
Daniil Fukalov	37433dc2e1	reapply r334209 with fixes for harfbuzz in Chromium r334209 description: [LSR] Check yet more intrinsic pointer operands the patch fixes another assertion in isLegalUse() Differential Revision: https://reviews.llvm.org/D47794 llvm-svn: 334300	2018-06-08 16:22:52 +00:00
Roman Lebedev	f87321a2dc	[NFC][InstSimplify] SimplifyAddInst(): coding style: variable names. llvm-svn: 334299	2018-06-08 15:44:53 +00:00
Roman Lebedev	b060ce45ca	[InstSimplify] add nuw %x, -1 -> -1 fold. Summary: `%ret = add nuw i8 %x, C` From [[ https://llvm.org/docs/LangRef.html#add-instruction \| langref ]]: nuw and nsw stand for “No Unsigned Wrap” and “No Signed Wrap”, respectively. If the nuw and/or nsw keywords are present, the result value of the add is a poison value if unsigned and/or signed overflow, respectively, occurs. So if `C` is `-1`, `%x` can only be `0`, and the result is always `-1`. I'm not sure we want to use `KnownBits`/`LVI` here, because there is exactly one possible value (all bits set, `-1`), so some other pass should take care of replacing the known-all-ones with constant `-1`. The `test/Transforms/InstCombine/set-lowbits-mask-canonicalize.ll` change is confusing. What happening is, before this: (omitting `nuw` for simplicity) 1. First, InstCombine D47428/rL334127 folds `shl i32 1, %NBits`) to `shl nuw i32 -1, %NBits` 2. Then, InstSimplify D47883/rL334222 folds `shl nuw i32 -1, %NBits` to `-1`, 3. `-1` is inverted to `0`. But now: 1. This InstSimplify fold `%ret = add nuw i32 %setbit, -1` -> `-1` happens first, before InstCombine D47428/rL334127 fold could happen. Thus we now end up with the opposite constant, and it is all good: https://rise4fun.com/Alive/OA9 https://rise4fun.com/Alive/sldC Was mentioned in D47428 review. Follow-up for D47883. Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47908 llvm-svn: 334298	2018-06-08 15:44:47 +00:00
Simon Pilgrim	efb4806bb9	[X86][BtVer2] Remove SBB tests that were accidentally added in rL334296 These aren't true zero-idiom instructions (just dependency breaking). llvm-svn: 334297	2018-06-08 15:43:00 +00:00
Simon Pilgrim	53766a986d	[X86][BtVer2] Add tests for scalar SUB/XOR instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions). llvm-svn: 334296	2018-06-08 15:28:43 +00:00
Alexander Kornienko	f084913223	commandLineFitsWithinSystemLimits Overestimates System Limits Summary: The function `llvm::sys::commandLineFitsWithinSystemLimits` appears to be overestimating the system limits. This issue was discovered while attempting to enable response files in the Swift compiler. When the compiler submits its frontend jobs, those jobs are subjected to the system limits on command line length. `commandLineFitsWithinSystemLimits` is used to determine if the job's arguments need to be wrapped in a response file. There are some cases where the argument size for the job passes `commandLineFitsWithinSystemLimits`, but actually exceeds the real system limit, and the job fails. `clang` also uses this function to decide whether or not to wrap it's job arguments in response files. See: https://github.com/llvm-mirror/clang/blob/master/lib/Driver/Driver.cpp#L1341. Clang will also fail for response files who's size falls within a certain range. I wrote a script that should find a failure point for `clang++`. All that is needed to run it is Python 2.7, and a simple "hello world" program for `test.cc`. It should run on Linux and on macOS. The script is available here: https://gist.github.com/dabelknap/71bd083cd06b91c5b3cef6a7f4d3d427. When it hits a failure point, you should see a `clang: error: unable to execute command: posix_spawn failed: Argument list too long`. The proposed solution is to mirror the behavior of `xargs` in `commandLinefitsWithinSystemLimits`. `xargs` defaults to 128k for the command line length size (See: https://fossies.org/dox/findutils-4.6.0/buildcmd_8c_source.html#l00551). It adjusts this depending on the value of `ARG_MAX`. Reviewers: alexfh Reviewed By: alexfh Subscribers: llvm-commits Tags: #clang Patch by Austin Belknap! Differential Revision: https://reviews.llvm.org/D47795 llvm-svn: 334295	2018-06-08 15:19:16 +00:00
Zachary Turner	66ef5d3cd6	Clean up some code in Program. NFC here, this just raises some platform specific ifdef hackery out of a class and creates proper platform-independent typedefs for the relevant things. This allows these typedefs to be reused in other places without having to reinvent this preprocessor logic. llvm-svn: 334294	2018-06-08 15:16:25 +00:00
Zachary Turner	6edfecb883	Add a file open flag that disables O_CLOEXEC. O_CLOEXEC is the right default, but occasionally you don't want this. This is especially true for tools like debuggers where you might need to spawn the child process with specific files already open, but it's occasionally useful in other scenarios as well, like when you want to do some IPC between parent and child. llvm-svn: 334293	2018-06-08 15:15:56 +00:00
Simon Pilgrim	aafcf9e4a1	[X86][BtVer2] Limit zero idiom tests to a single iteration. Reduces output size and we're only wanting to check that the instructions are fast-path'd (just Dispatch+Retire) anyhow llvm-svn: 334292	2018-06-08 15:01:40 +00:00
Simon Pilgrim	c246d8dda0	Fix Wdocumentation warning for unknown param. NFCI. llvm-svn: 334291	2018-06-08 14:53:52 +00:00
Simon Pilgrim	eab9d20424	[X86][SSE] Add SSE2/AVX2 vector rotate tests Now that we're custom lowering vector rotates for SSE in general we should be testing the combines with them as well. llvm-svn: 334290	2018-06-08 14:07:21 +00:00

1 2 3 4 5 ...

291511 Commits All Branches Search

291511 Commits

All Branches