llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	06ea4206ad	[InstCombine] allow more shuffle-binop folds with safe constants The case with 2 variables is more complicated than the case where we eliminate the shuffle entirely because a shuffle with an undef mask element creates an undef result. I'm not aware of any current analysis/transform that recognizes that undef propagating to a div/rem/shift, but we have to guard against the possibility. llvm-svn: 336668	2018-07-10 13:33:26 +00:00
Anastasis Grammenos	612bf7cac5	[DebugInfo][LoopVectorize] Preserve DL in induction PHI and Add Differential Revision: https://reviews.llvm.org/D48968 llvm-svn: 336667	2018-07-10 13:29:50 +00:00
Eric Fiselier	9cc6d3b305	Remove BUILD file from google-benchmark llvm-svn: 336666	2018-07-10 13:25:26 +00:00
Louis Dionne	be4d99dd21	[libc++] Declare <compare> operators with the proper visibility attribute Summary: Many operators in <compare> were _defined_ with the proper visibility attribute, but they were _declared_ without any. This is not a problem until we change the definition of _LIBCPP_INLINE_VISIBILITY to something that requires the declaration to be decorated. I also marked `strong_equality::operator weak_equality()` as `_LIBCPP_INLINE_VISIBILITY`, since it seems like it had been forgotten. This came up while trying to get rid of `__attribute__((__always_inline__))` in favor of `__attribute__((internal_linkage))`. Reviewers: EricWF, mclow.lists Subscribers: christof, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D49104 llvm-svn: 336665	2018-07-10 13:21:03 +00:00
Simon Pilgrim	641097d561	[DAGCombiner] visitREM - call visitSDIVLike/visitUDIVLike directly to avoid recursive combining. As suggested by @efriedma on D48975 use the visitSDIVLike/visitUDIVLike functions introduced at rL336656. llvm-svn: 336664	2018-07-10 13:18:16 +00:00
Dan Liew	b1f95697c1	[CMake] Add compiler-rt header files to the list of sources for targets when building with an IDE so that header files show up in the UI. This massively improves the development workflow in IDEs. To implement this a new function `compiler_rt_process_sources(...)` has been added that adds header files to the list of sources when the generator is an IDE. For non-IDE generators (e.g. Ninja/Makefile) no changes are made to the list of source files. The function can be passed a list of headers via the `ADDITIONAL_HEADERS` argument. For each runtime library a list of explicit header files has been added and passed via `ADDITIONAL_HEADERS`. For `tsan` and `sanitizer_common` a list of headers was already present but it was stale and has been updated to reflect the current state of the source tree. The original version of this patch used file globbing (`.{h,inc,def}`) to find the headers but the approach was changed due to this being a CMake anti-pattern (if the list of headers changes CMake won't automatically re-generate if globbing is used). The LLVM repo contains a similar function named `llvm_process_sources()` but we don't use it here for several reasons: It depends on the `LLVM_ENABLE_OPTION` cache variable which is not set in standalone compiler-rt builds. * We would have to `include(LLVMProcessSources)` which I'd like to avoid because it would include a bunch of stuff we don't need. Differential Revision: https://reviews.llvm.org/D48422 llvm-svn: 336663	2018-07-10 13:00:17 +00:00
Krzysztof Parzyszek	c052451a02	[Hexagon] Add implicit uses even when untied explicit uses are present An explicit untied use is not sufficient to maintain liveness of a register redefined in a predicated instruction. For example %1 = COPY %0 ... %1 = A2_paddif %2, %1, 1 could become $r1 = COPY $r0 ... $r1 = A2_paddif $p0, $r1, 1 and later $r1 = COPY $r0 ;; this is not really dead! ... $r1 = A2_paddif $p0, $r0, 1 llvm-svn: 336662	2018-07-10 12:57:49 +00:00
Filipe Cabecinhas	6a9c719ee1	[compiler-rt] Get rid of "%T" expansions Summary: Original patch by Kuba Mracek The %T lit expansion expands to a common directory shared between all the tests in the same directory, which is unexpected and unintuitive, and more importantly, it's been a source of subtle race conditions and flaky tests. In https://reviews.llvm.org/D35396, it was agreed that it would be best to simply ban %T and only keep %t, which is unique to each test. When a test needs a temporary directory, it can just create one using mkdir %t. This patch removes %T in compiler-rt. Differential Revision: https://reviews.llvm.org/D48618 llvm-svn: 336661	2018-07-10 12:53:46 +00:00
Yuka Takahashi	a15364152c	[modules] Fix 37878; Autoload subdirectory modulemaps with specific LangOpts Summary: Reproducer and errors: https://bugs.llvm.org/show_bug.cgi?id=37878 lookupModule was falling back to loadSubdirectoryModuleMaps when it couldn't find ModuleName in (proper) search paths. This was causing iteration over all files in the search path subdirectories for example "/usr/include/foobar" in bugzilla case. Users don't expect Clang to load modulemaps in subdirectories implicitly, and also the disk access is not cheap. if (AllowExtraModuleMapSearch) true with ObjC with @import ModuleName. Reviewers: rsmith, aprantl, bruno Subscribers: cfe-commits, teemperor, v.g.vassilev Differential Revision: https://reviews.llvm.org/D48367 llvm-svn: 336660	2018-07-10 12:17:34 +00:00
Karl-Johan Karlsson	1ffeb5d7f0	[LowerSwitch] Fixed faulty PHI nodes Summary: Fixed two cases of where PHI nodes need to be updated by lowerswitch. When lowerswitch find out that the switch default branch is not reachable it remove the old default and replace it with the most popular block from the cases, but it forget to update the PHI nodes in the default block. The PHI nodes also need to be updated when the switch is replaced with a single branch. Reviewers: hans, reames, arsenm Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D47203 llvm-svn: 336659	2018-07-10 12:06:16 +00:00
Omer Paparo Bivas	d2f5a6f7a2	Fixing builtin __atomic_fetch_min declaration Differential Revision: http://reviews.llvm.org/D49068 llvm-svn: 336658	2018-07-10 12:04:04 +00:00
Sam McCall	e6057bc689	[Support] Harded JSON against invalid UTF-8. Parsing invalid UTF-8 input is now a parse error. Creating JSON values from invalid UTF-8 now triggers an assertion, and (in no-assert builds) substitutes the unicode replacement character. Strings retrieved from json::Value are always valid UTF-8. llvm-svn: 336657	2018-07-10 11:51:26 +00:00
Simon Pilgrim	ce5c19b623	[DAGCombiner] Split SDIV/UDIV optimization expansions from the rest of the combines. NFCI. As suggested by @efriedma on D48975, this patch separates the BuildDiv/Pow2 style optimizations from the rest of the visitSDIV/visitUDIV to make it easier to reuse the combines and will allow us to avoid some rather nasty node recursive combining in visitREM. llvm-svn: 336656	2018-07-10 11:38:00 +00:00
Martin Storsjo	006bffe25a	[MinGW] Skip adding default win32 api libraries if -lwindowsapp is specified In this setup, skip adding all the default windows import libraries, if linking to windowsapp (which replaces them, when targeting the windows store/UWP api subset). With GCC, the same is achieved by using a custom spec file, but since clang doesn't use spec files, we have to allow other means of overriding what default libraries to use (without going all the way to using -nostdlib, which would exclude everything). The same approach, in detecting certain user specified libraries and omitting others from the defaults, was already used in SVN r314138. Differential Revision: https://reviews.llvm.org/D49059 llvm-svn: 336655	2018-07-10 10:46:51 +00:00
Martin Storsjo	995e596168	[MinGW] Treat any -lucrt* as replacing -lmsvcrt Since SVN r314138, we check if the user has specified any particular alternative msvcrt/ucrt version, and skip the default -lmsvcrt in those cases. In addition to the existing names checked, we should also treat a plain -lucrt in the same way, mingw-w64 has now added a separate import library named libucrt.a, in addition to libucrtbase.a. Differential Revision: https://reviews.llvm.org/D49054 llvm-svn: 336654	2018-07-10 10:46:45 +00:00
Florian Hahn	8263975c32	[VPlan] Add VPlanTestBase.h with helper class to build VPlan for tests. Reviewers: dcaballe, hsaito, rengolin Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D49032 llvm-svn: 336653	2018-07-10 10:45:46 +00:00
Martin Storsjo	474be005db	[COFF] Store import symbol pointers as pointers to the base class Future symbol insertions can potentially change the type of these symbols - keep pointers to the base class to reflect this, and use dynamic casts to inspect them before using as the subclass type. This fixes crashes that were possible before, by touching these symbols that now are populated as e.g. a DefinedRegular, via the old pointers with DefinedImportThunk type. Differential Revision: https://reviews.llvm.org/D48953 llvm-svn: 336652	2018-07-10 10:40:11 +00:00
George Rimar	bc6702a424	[ELF] - Improve call graph pasing error reporting. This adds a file name to the error message, adds a missing test case and refactors code a bit. llvm-svn: 336651	2018-07-10 10:28:55 +00:00
George Rimar	20b92c4d0d	[ELF] - Report call graph profile file names in error messages. We did not report file names for some reason. llvm-svn: 336650	2018-07-10 10:16:48 +00:00
Simon Pilgrim	c048599f22	Fix MSVC "signed/unsigned mismatch" warning. NFCI. llvm-svn: 336649	2018-07-10 09:46:57 +00:00
Dean Michael Berris	b251f62428	[XRay][compiler-rt] Fixup build breakage Changes: - Remove static assertion on size of a structure, fails on systems where pointers aren't 8 bytes. - Use size_t instead of deducing type of arguments to `nearest_boundary`. Follow-up to D48653. llvm-svn: 336648	2018-07-10 08:58:12 +00:00
Chandler Carruth	148861f579	[PM/Unswitch] Fix unused variable in r336646. llvm-svn: 336647	2018-07-10 08:57:04 +00:00
Chandler Carruth	47dc3a346e	[PM/Unswitch] Fix a collection of closely related issues with trivial switch unswitching. The core problem was that the way we handled unswitching trivial exit edges through the default successor of a switch. For some reason I thought the right way to do this was to add a block containing unreachable and point the default successor at this block. In retrospect, this has an amazing number of problems. The first issue is the one that this pass has always worked around -- we have to detect such edges and avoid unswitching them again. This seemed pretty easy really. You juts look for an edge to a block containing unreachable. However, this pattern is woefully unsound. So many things can break it. The amazing thing is that I found a test case where simple-loop-unswitch itself breaks this! When we do a non-trivial unswitch of a switch we will end up splitting this exit edge. The result will be a default successor that is an exit and terminates in ... a perfectly normal branch. So the first test case that I started trying to fix is added to the nontrivial test cases. This is a ridiculous example that did just amazing things previously. With just unswitch, it would create 10+ copies of this stuff stamped out. But if you combine it just right with a bunch of other passes (like simplify-cfg, loop rotate, and some LICM) you can get it to do this infinitely. Or at least, I never got it to finish. =[ This, in turn, uncovered another related issue. When we are manipulating these switches after doing a trivial unswitch we never correctly updated PHI nodes to reflect our edits. As soon as I started changing how these edges were managed, it became obvious there were more issues that I couldn't realistically leave unaddressed, so I wrote more test cases around PHI updates here and ensured all of that works now. And this, in turn, required some adjustment to how we collect and manage the exit successor when it is the default successor. That showed a clear bug where we failed to include it in our search for the outer-most loop reached by an unswitched exit edge. This was actually already tested and the test case didn't work. I (wrongly) thought that was due to SCEV failing to analyze the switch. In fact, it was just a simple bug in the code that skipped the default successor. While changing this, I handled it correctly and have updated the test to reflect that we now get precise SCEV analysis of trip counts for the outer loop in one of these cases. llvm-svn: 336646	2018-07-10 08:36:05 +00:00
Mikhail Dvoretckii	89c919c20b	[X86] Fast-isel tests for lowered truncation intrinsics This patch adds fast-isel tests for the IR patterns produced for truncation intrinsics in rC336643. Differential Revision: https://reviews.llvm.org/D48822 llvm-svn: 336645	2018-07-10 08:26:54 +00:00
Dean Michael Berris	0dd4f9f22f	[XRay][compiler-rt] xray::Array Freelist and Iterator Updates Summary: We found a bug while working on a benchmark for the profiling mode which manifests as a segmentation fault in the profiling handler's implementation. This change adds unit tests which replicate the issues in isolation. We've tracked this down as a bug in the implementation of the Freelist in the `xray::Array` type. This happens when we trim the array by a number of elements, where we've been incorrectly assigning pointers for the links in the freelist of chunk nodes. We've taken the chance to add more debug-only assertions to the code path and allow us to verify these assumptions in debug builds. In the process, we also took the opportunity to use iterators to implement both `front()` and `back()` which exposes a bug in the iterator decrement operation. In particular, when we decrement past a chunk size boundary, we end up moving too far back and reaching the `SentinelChunk` prematurely. This change unblocks us to allow for contributing the non-crashing version of the benchmarks in the test-suite as well. Reviewers: kpw Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D48653 llvm-svn: 336644	2018-07-10 08:25:44 +00:00
Mikhail Dvoretckii	d1bf9ef0c7	[X86] Lowering integer truncation intrinsics to native IR This patch lowers the _mm[256\|512]_cvtepi{64\|32\|16}_epi{32\|16\|8} intrinsics to native IR in cases where the result's length is less than 128 bits. The resulting IR for 256-bit inputs is folded into VPMOV instructions, while for 128-bit inputs the vpshufb (or, in the 64-to-32-bit case, vinsertps) instructions are generated instead Differential Revision: https://reviews.llvm.org/D48712 llvm-svn: 336643	2018-07-10 08:22:44 +00:00
Simon Pilgrim	d32ca2c0b7	[X86][SSE] Prefer BLEND(SHL(v,c1),SHL(v,c2)) over MUL(v, c3) Now that rL336250 has landed, we should prefer 2 immediate shifts + a shuffle blend over performing a multiply. Despite the increase in instructions, this is quicker (especially for slow v4i32 multiplies), avoid loads and constant pool usage. It does mean however that we increase register pressure. The code size will go up a little but by less than what we save on the constant pool data. This patch also adds support for v16i16 to the BLEND(SHIFT(v,c1),SHIFT(v,c2)) combine, and also prevents blending on pre-SSE41 shifts if it would introduce extra blend masks/constant pool usage. Differential Revision: https://reviews.llvm.org/D48936 llvm-svn: 336642	2018-07-10 07:58:33 +00:00
Craig Topper	5fd020c082	[X86] Regenerate vector-shuffle-512-v8.ll so the script will merge the 32 and 64 bit checks together. NFC llvm-svn: 336641	2018-07-10 07:17:41 +00:00
Stephen Kelly	8613c0b5d2	Test commit Add redundant doc. llvm-svn: 336640	2018-07-10 06:55:12 +00:00
Craig Topper	08b81a5508	[X86] Use IsProfitableToFold to block vinsertf128rm in favor of insert_subreg instead of artifically increasing pattern complexity to give priority. This is a much more direct way to solve the issue than just giving extra priority. llvm-svn: 336639	2018-07-10 06:19:54 +00:00
Craig Topper	db73f56489	[X86] Remove some seemingly unnecessary patterns. We're missing the EVEX equivalents of these patterns and seem to get along fine. I think we end up with X86vzload for the obvious IR cases that would produce this DAG. llvm-svn: 336638	2018-07-10 05:31:42 +00:00
Craig Topper	36ab775cc1	[X86] Use masked the masked scalar fma builtins to implement the default rounding version of the fma intrinsics. The rounding mode is checked in CGBuiltin.cpp to generate the correct intrinsic call. Making this switch switchs the masking to use the i8 bitcast to <8 x i1> and extract i1 version of the IR for the mask. Previously we ended up with a scalar 'and' plus an icmp. llvm-svn: 336637	2018-07-10 04:38:29 +00:00
Eric Fiselier	09701324a1	Add new string benchmarks llvm-svn: 336636	2018-07-10 04:11:22 +00:00
Eric Fiselier	fcafd3e600	Update google-benchark to trunk llvm-svn: 336635	2018-07-10 04:02:00 +00:00
Erik Pilkington	1c5ae9bc1f	[Sema] Fix a structured binding typo correction bug BindingDecls have null type until their initializer is processed, so we can't assume that a correction candidate has non-null type. rdar://41559582 llvm-svn: 336634	2018-07-10 02:15:07 +00:00
Vlad Tsyrklevich	25291f15e7	Add lowercase OS name feature Summary: Some tests already make use of OS feature names, e.g. 'linux' and 'freebsd', but they are not actually currently set by lit. Reviewers: pcc, eugenis Reviewed By: eugenis Subscribers: emaste, krytarowski, delcypher, llvm-commits, #sanitizers Differential Revision: https://reviews.llvm.org/D49115 llvm-svn: 336633	2018-07-10 02:02:21 +00:00
Richard Trieu	27c1b1a638	[ODRHash] Merge the two function hashes into one. Functions that are a sub-Decl of a record were hashed differently than other functions. This change keeps the AddFunctionDecl function and the hash of records now calls this function. In addition, AddFunctionDecl has an option to perform a hash as if the body was absent, which is required for some checks after loading modules. Additional logic prevents multiple error message from being printed. llvm-svn: 336632	2018-07-10 01:40:50 +00:00
Rui Ueyama	2a3036fb1d	Report an error for an extremely large .gdb_index section. I believe the only way to test this functionality is to create extremely large object files and attempt to create a .gdb_index that is greater than 4 GiB. But I think that's too much for most environments and buildbots, so I'm commiting this without a test that actually triggers the new error condition. llvm-svn: 336631	2018-07-10 01:22:25 +00:00
Eric Christopher	6e912c24d1	Update crash diagnostics test to avoid attempting to write into various directories if possible and to not require %t to have "Output" in the name. llvm-svn: 336630	2018-07-10 01:01:38 +00:00
Akira Hatanaka	189359d1ff	Fix parsing of privacy annotations in os_log format strings. Privacy annotations shouldn't have to appear in the first comma-delimited string in order to be recognized. Also, they should be ignored if they are preceded or followed by non-whitespace characters. rdar://problem/40706280 llvm-svn: 336629	2018-07-10 00:50:25 +00:00
Craig Topper	1faf953d75	[X86] Remove custom handling for __builtin_ia32_divss_round_mask and __builtin_ia32_divsd_round_mask. llvm-svn: 336628	2018-07-10 00:50:03 +00:00
Craig Topper	3e7406b45b	[X86] Add back GCCBuiltin on mask_div_ss/sd_round. We no longer need custom handling in clang. llvm-svn: 336627	2018-07-10 00:49:50 +00:00
Craig Topper	866a377e91	[X86] Correct vfixupimm load patterns to look for an integer load, not a floating point load bitcasted to integer. DAG combine wouldn't let a floating point load bitcasted to integer exist. It would just be an integer load. llvm-svn: 336626	2018-07-10 00:49:49 +00:00
Craig Topper	59fd2f4c52	[X86] Add test cases that show failure to fold load into vfixupimm instructions due to bad isel pattern. llvm-svn: 336625	2018-07-10 00:49:47 +00:00
Craig Topper	e4f46e4f31	[X86] Remove FloatVT from X86VectorVTInfo in X86InstrAVX512.td The only places it was used where places where VT was the same as FloatVT. So switch those uses to VT and drop it. llvm-svn: 336624	2018-07-10 00:49:45 +00:00
Vlad Tsyrklevich	688e752207	Revert "AMDGPU: Force inlining if LDS global address is used" This reverts commit r336587, it was causing test failures on the sanitizer bots. llvm-svn: 336623	2018-07-10 00:46:07 +00:00
Craig Topper	638426fc36	[X86] Add __builtin_ia32_selectss_128 and __builtin_ia32_selectsd_128 that is suitable for use in scalar mask intrinsics. This will convert the i8 mask argument to <8 x i1> and extract an i1 and then emit a select instruction. This replaces the '(__U & 1)" and ternary operator used in some of intrinsics. The old sequence was lowered to a scalar and and compare. The new sequence uses an i1 vector that will interoperate better with other mask intrinsics. This removes the need to handle div_ss/sd specially in CGBuiltin.cpp. A follow up patch will add the GCCBuiltin name back in llvm and remove the custom handling. I made some adjustments to legacy move_ss/sd intrinsics which we reused here to do a simpler extract and insert instead of 2 extracts and two inserts or a shuffle. llvm-svn: 336622	2018-07-10 00:37:25 +00:00
Wolfgang Pieb	e194f73e9f	[DWARF][NFC] Refactor range list emission to use a static helper This is prep for DWARF v5 range list emission. Emission of a single range list is moved to a static helper function. Reviewer: jdevlieghere Differential Revision: https://reviews.llvm.org/D49098 llvm-svn: 336621	2018-07-10 00:10:11 +00:00
Rui Ueyama	a9e169edff	Fix a bug for packed relocations. Previously, we didn't create multiple consecutive bitmaps. Added a test to catch this bug too. Differential Revision: https://reviews.llvm.org/D49107 llvm-svn: 336620	2018-07-09 23:54:24 +00:00
Matt Morehouse	a34c65e845	[libFuzzer] Make -fsanitize=memory,fuzzer work. This patch allows libFuzzer to fuzz applications instrumented with MSan without recompiling libFuzzer with MSan instrumentation. Fixes https://github.com/google/sanitizers/issues/958. Differential Revision: https://reviews.llvm.org/D48891 llvm-svn: 336619	2018-07-09 23:51:08 +00:00

... 3 4 5 6 7 ...

293923 Commits All Branches Search

293923 Commits

All Branches