For targets where i32 is not a legal type (e.g. 64-bit RISC-V),
LegalizeIntegerTypes must promote the integer operand of ISD::FPOWI. As this
is a signed value, it should be sign-extended.
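A minimal sketch of that promotion, assuming a DAGTypeLegalizer hook along the lines of PromoteIntOp_FPOWI (the exact hook name and body may differ from the committed change):
```cpp
// Sketch: widen the (signed) FPOWI exponent with a sign extension rather
// than an any- or zero-extension, then update the node in place.
SDValue DAGTypeLegalizer::PromoteIntOp_FPOWI(SDNode *N) {
  SDValue Exp = SExtPromotedInteger(N->getOperand(1));
  return SDValue(DAG.UpdateNodeOperands(N, N->getOperand(0), Exp), 0);
}
```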
This patch enables all tests in test/CodeGen/RISCV/float-intrinsics.ll for
RV64, as prior to this patch that file couldn't be compiled for RV64 due to an
assertion when performing codegen for fpowi.
Differential Revision: https://reviews.llvm.org/D54574
llvm-svn: 352832
Recommit r352791 after tweaking DerivedTypes.h slightly, so that gcc
doesn't choke on it, hopefully.
Original Message:
The FunctionCallee type is effectively a {FunctionType*,Value*} pair,
and is a useful convenience to enable code to continue passing the
result of getOrInsertFunction() through to EmitCall, even once pointer
types lose their pointee-type.
Then:
- update the CallInst/InvokeInst instruction creation functions to
take a Callee,
- modify getOrInsertFunction to return FunctionCallee, and
- update all callers appropriately.
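Purely as an illustration of what a caller looks like after this change (the hook name and signature below are invented for the example):
```cpp
#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/Module.h"
using namespace llvm;

// Hypothetical runtime hook, used only to show the new API shape.
static void emitHookCall(Module &M, IRBuilder<> &IRB) {
  LLVMContext &Ctx = M.getContext();
  FunctionCallee Hook = M.getOrInsertFunction(
      "example_runtime_hook", Type::getVoidTy(Ctx), Type::getInt64Ty(Ctx));
  // FunctionCallee carries the {FunctionType*, Value*} pair, so CreateCall
  // no longer needs to recover the function type from the callee pointer.
  IRB.CreateCall(Hook, {IRB.getInt64(42)});
}
```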
One area of particular note is the change to the sanitizer
code. Previously, they had been casting the result of
`getOrInsertFunction` to a `Function*` via
`checkSanitizerInterfaceFunction`, and storing that. That would report
an error if someone had already inserted a function declaration with
a mismatching signature.
However, in general, LLVM allows for such mismatches, as
`getOrInsertFunction` will automatically insert a bitcast if
needed. As part of this cleanup, cause the sanitizer code to do the
same. (It will call its functions using the expected signature,
however they may have been declared.)
Finally, in a small number of locations, callers of
`getOrInsertFunction` actually were expecting/requiring that a brand
new function was being created. In such cases, I've switched them to
Function::Create instead.
Differential Revision: https://reviews.llvm.org/D57315
llvm-svn: 352827
A handful of bots are still breaking, either because I missed them in my audit,
they were offline, or something else. I'm contacting their authors, but I'll
revert for now and re-commit later.
llvm-svn: 352814
Summary:
The custom lowering introduced in rL352592 creates build_vector nodes
with negative i32 operands, but these operands did not meet the value
range constraints necessary to match build_vector nodes. This CL fixes
the issue by removing the unnecessary constraints.
Reviewers: aheejin
Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish
Differential Revision: https://reviews.llvm.org/D57481
llvm-svn: 352813
Summary:
The RFC on moving past C++11 got good traction:
http://lists.llvm.org/pipermail/llvm-dev/2019-January/129452.html
This patch therefore bumps the toolchain versions according to our policy:
llvm.org/docs/DeveloperPolicy.html#toolchain
Subscribers: mgorny, jkorous, dexonsmith, llvm-commits, mehdi_amini, jyknight, rsmith, chandlerc, smeenai, hans, reames, lattner, lhames, erichkeane
Differential Revision: https://reviews.llvm.org/D57264
llvm-svn: 352811
This requires a little extra work due to the fact that i32 is not a legal type. When
call lowering happens post-legalisation (e.g. when an intrinsic was inserted
during legalisation), a bitcast from f32 to i32 can't be introduced. This is
similar to the challenges with RV32D. To handle this, we introduce
target-specific DAG nodes that perform bitcast+anyext for f32->i64 and
trunc+bitcast for i64->f32.
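A hedged sketch of how such nodes can be used during custom lowering (the enumerator names are meant to be illustrative of the nodes this patch adds and may not match the exact spelling):
```cpp
// Sketch: fuse bitcast+anyext (f32->i64) and trunc+bitcast (i64->f32) into
// single target nodes so no illegal i32 value ever appears in the DAG.
static SDValue moveF32ToGPR(SDValue Val, SelectionDAG &DAG, const SDLoc &DL) {
  return DAG.getNode(RISCVISD::FMV_X_ANYEXTW_RV64, DL, MVT::i64, Val);
}
static SDValue moveGPRToF32(SDValue Val, SelectionDAG &DAG, const SDLoc &DL) {
  return DAG.getNode(RISCVISD::FMV_W_X_RV64, DL, MVT::f32, Val);
}
```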
Differential Revision: https://reviews.llvm.org/D53235
llvm-svn: 352807
This reverts commit f47d6b38c7 (r352791).
Seems to run into compilation failures with GCC (but not clang, where
I tested it). Reverting while I investigate.
llvm-svn: 352800
This patch fixes pr39098.
For the attached test case, CombineZExtLogicopShiftLoad can optimize it to
t25: i64 = Constant<1099511627775>
t35: i64 = Constant<0>
t0: ch = EntryToken
t57: i64,ch = load<(load 4 from `i40* undef`, align 8), zext from i32> t0, undef:i64, undef:i64
t58: i64 = srl t57, Constant:i8<1>
t60: i64 = and t58, Constant:i64<524287>
t29: ch = store<(store 5 into `i40* undef`, align 8), trunc to i40> t57:1, t60, undef:i64, undef:i64
But later visitANDLike transforms it to
t25: i64 = Constant<1099511627775>
t35: i64 = Constant<0>
t0: ch = EntryToken
t57: i64,ch = load<(load 4 from `i40* undef`, align 8), zext from i32> t0, undef:i64, undef:i64
t61: i32 = truncate t57
t63: i32 = srl t61, Constant:i8<1>
t64: i32 = and t63, Constant:i32<524287>
t65: i64 = zero_extend t64
t58: i64 = srl t57, Constant:i8<1>
t60: i64 = and t58, Constant:i64<524287>
t29: ch = store<(store 5 into `i40* undef`, align 8), trunc to i40> t57:1, t60, undef:i64, undef:i64
And it triggers CombineZExtLogicopShiftLoad again, causes a dead loop.
Both forms should generate the same instructions, and the IR generated by CombineZExtLogicopShiftLoad looks cleaner. But it looks more difficult to prevent visitANDLike from doing the transform, so instead I prevent CombineZExtLogicopShiftLoad from doing the transform when the ZExt is free.
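A hedged sketch of the intent of that guard (helper name and the exact types queried are illustrative, not the committed diff):
```cpp
// Sketch: CombineZExtLogicopShiftLoad should be skipped when the target
// reports the zext from the narrow logic-op type to the wide load type as
// free, since visitANDLike then narrows the op again and the two combines
// ping-pong forever.
static bool shouldSkipZExtLogicopShiftLoad(const TargetLowering &TLI,
                                           EVT NarrowVT, EVT WideVT) {
  return TLI.isZExtFree(NarrowVT, WideVT);
}
```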
Differential Revision: https://reviews.llvm.org/D57491
llvm-svn: 352792
The FunctionCallee type is effectively a {FunctionType*,Value*} pair,
and is a useful convenience to enable code to continue passing the
result of getOrInsertFunction() through to EmitCall, even once pointer
types lose their pointee-type.
Then:
- update the CallInst/InvokeInst instruction creation functions to
take a Callee,
- modify getOrInsertFunction to return FunctionCallee, and
- update all callers appropriately.
One area of particular note is the change to the sanitizer
code. Previously, they had been casting the result of
`getOrInsertFunction` to a `Function*` via
`checkSanitizerInterfaceFunction`, and storing that. That would report
an error if someone had already inserted a function declaration with
a mismatching signature.
However, in general, LLVM allows for such mismatches, as
`getOrInsertFunction` will automatically insert a bitcast if
needed. As part of this cleanup, cause the sanitizer code to do the
same. (It will call its functions using the expected signature,
however they may have been declared.)
Finally, in a small number of locations, callers of
`getOrInsertFunction` actually were expecting/requiring that a brand
new function was being created. In such cases, I've switched them to
Function::Create instead.
Differential Revision: https://reviews.llvm.org/D57315
llvm-svn: 352791
CMake 3.6 introduced CMAKE_TRY_COMPILE_PLATFORM_VARIABLES, which solves
precisely the problem that necessitated init_user_prop, so we can switch
over whenever we bump our minimum CMake requirement.
llvm-svn: 352790
Summary:
EarlyCSE needs to optimize MemoryPhis after an access is removed and has
special handling for it. This should be handled by MemorySSA instead.
The default remains that MemoryPhis are *not* optimized after an access
is removed.
Reviewers: george.burgess.iv
Subscribers: sanjoy, jlebar, llvm-commits, Prazek
Differential Revision: https://reviews.llvm.org/D57199
llvm-svn: 352787
Summary:
Like with X86, this allows better DAG-level alias analysis and
alignment inference for wrapped addresses.
Reviewers: jonpa, uweigand
Reviewed By: uweigand
Subscribers: hiraditya, llvm-commits
Differential Revision: https://reviews.llvm.org/D57407
llvm-svn: 352786
Summary:
Previously, llvm-nm would report symbols for .debug and .note sections as '?' with an empty section name:
```
00000000 ?
00000000 ?
...
```
With this patch the output more closely resembles GNU nm:
```
00000000 N .debug_abbrev
00000000 n .note.GNU-stack
...
```
For symbols of type `ELF::STT_SECTION`, this patch calls `getSectionName`, which returns the name of the section from the section string table.
Reviewers: Bigcheese, davide, jhenderson
Reviewed By: davide, jhenderson
Subscribers: rupprecht, jhenderson, llvm-commits
Differential Revision: https://reviews.llvm.org/D57105
llvm-svn: 352785
While dangling nodes will eventually be pruned when they are
considered, leaving them disables combines requiring single-use.
Reviewers: Carrot, spatel, craig.topper, RKSimon, efriedma
Subscribers: hiraditya, llvm-commits
Differential Revision: https://reviews.llvm.org/D57520
llvm-svn: 352784
For zero scale SMULFIX, expand into MUL, which produces better code for X86.
For vector arguments, expand into MUL if SMULFIX is provided with a zero scale.
Otherwise, expand into MULH[US] or [US]MUL_LOHI.
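A hedged sketch of that expansion choice (not the committed TargetLowering code; per ISD::SMULFIX, operand 2 holds the scale):
```cpp
// Sketch: a fixed-point multiply with scale 0 is just an ordinary multiply,
// which targets such as X86 lower much better than MULH[US]/[US]MUL_LOHI.
static SDValue expandZeroScaleMULFIX(SDNode *N, SelectionDAG &DAG) {
  SDLoc DL(N);
  EVT VT = N->getValueType(0);
  if (N->getConstantOperandVal(2) == 0)
    return DAG.getNode(ISD::MUL, DL, VT, N->getOperand(0), N->getOperand(1));
  return SDValue(); // non-zero scale: fall back to the MULH/MUL_LOHI path
}
```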
Differential Revision: https://reviews.llvm.org/D56987
llvm-svn: 352783
This ensures that if we make it to the backend w/o lowering widenable_conditions first, we generate correct code. Doing it in CGP, instead of isel, lets us fold control flow before hitting block-local instruction selection.
Differential Revision: https://reviews.llvm.org/D57473
llvm-svn: 352779
The original commit of this function (r129800 in 2011) had a typo where
part of the "Micro" version check was actually comparing against the "Minor"
version number.
llvm-svn: 352776
Similar to what we already do in DAGCombiner, but this version also handles bitcasts from types with different scalar sizes, which x86 is better at handling.
Differential Revision: https://reviews.llvm.org/D57514
llvm-svn: 352773
Summary:
Add "the symbol is defined" to the list of requirements for --localize-symbol to localize a symbol.
This is used, for example, when someone depends on two different projects that each define the same (or close enough) function, and passes "-L sym" for the conflicting symbol in one of the libraries so that the definition from the other one is used. However, the library may also have internal references to the symbol, which cause program crashes when those references are localized as well, i.e.:
```
$ cat foo.c
int foo() { return 5; }
$ cat bar.c
int foo();
int bar() { return 2 * foo(); }
$ cat foo2.c
int foo() { /* Safer implementation */ return 42; }
$ cat main.c
int bar();
int main() {
__builtin_printf("bar = %d\n", bar());
return 0;
}
$ ar rcs libfoo.a foo.o bar.o
$ ar rcs libfoo2.a foo2.o
# Picks the wrong foo() impl
$ clang main.o -lfoo -lfoo2 -L. -o main
# Picks the right foo() impl
$ objcopy -L foo libfoo.a && clang main.o -lfoo -lfoo2 -L. -o main
# Links somehow, but crashes at runtime
$ llvm-objcopy -L foo libfoo.a && clang main.o -lfoo -lfoo2 -L. -o main
```
Reviewers: jhenderson, alexshap, jakehehrlich, espindola
Subscribers: emaste, arichardson
Differential Revision: https://reviews.llvm.org/D57417
llvm-svn: 352767
This is the most important uaddo problem mentioned in PR31754:
https://bugs.llvm.org/show_bug.cgi?id=31754
We were failing to match the canonicalized pattern when it's an 'add 1' operation.
Pattern matching, however, shouldn't assume that we have canonicalized IR, so we
match 4 commuted variants of uaddo.
There's also a test with a crazy type to show that the existing CGP transform
based on this matcher is not limited by target legality checks, but that's a
different problem.
Differential Revision: https://reviews.llvm.org/D57516
llvm-svn: 352766
Summary:
COFF requires that COMDAT name match that of the leader. When we promote
and rename an internal leader in ThinLTO due to an import, ensure we
subsequently rename the associated COMDAT. Similar to D31963 which did
this during ThinLTO module splitting.
Fixes PR40414.
Reviewers: pcc, inglorion
Subscribers: mehdi_amini, dexonsmith, dmajor, llvm-commits
Differential Revision: https://reviews.llvm.org/D57395
llvm-svn: 352763
This is the fourth (and final for now) of a series of patches
simplifying llvm-symbolizer tests. See r352752, r352753 and r352754 for
the previous ones. This patch splits out several more distinct test
cases from llvm-symbolizer.test into separate tests, and simplifies them
in various ways including:
1) Building a test case for spaces in path from source, rather than
using a pre-canned binary. This allows deleting said binary and the
source it was built from.
2) Switching to specifying addresses and objects directly on the
command-line rather than via stdin.
This also adds an explicit test for the ability to specify a file and
address as a line in stdin, since the majority of the tests have been
migrated away from this approach, leaving this largely untested.
Reviewed by: dblaikie
Differential Revision: https://reviews.llvm.org/D57446
llvm-svn: 352756
This is the third of a series of patches simplifying llvm-symbolizer
tests. See r352752 and r352753 for the previous two. This patch splits
out a number of distinct test cases from llvm-symbolizer.test into
separate tests, and simplifies them in various ways including:
1) using --obj/positional arguments for the input file and addresses
instead of stdin,
2) using runtime-generated inputs rather than a pre-canned binary, and
3) testing more specifically (i.e. checking only what is interesting to
the behaviour changed in the original commit for that test case).
This patch also removes the test case for using --obj. The
tools/llvm-symbolizer/basic.s test already tests this case. Finally,
this patch adds a simple case to the demangle switch test to
show that demangling happens by default.
See https://bugs.llvm.org/show_bug.cgi?id=40070#c1 for the motivation.
Reviewed by: dblaikie
Differential Revision: https://reviews.llvm.org/D57446
llvm-svn: 352754
This is the second of a series of patches simplifying llvm-symbolizer
tests. See r352752 for the first. This one splits out 5 distinct test
cases from llvm-symbolizer.test into separate tests, and simplifies them
slightly by using --obj/positional arguments for the input file and
addresses instead of stdin.
See https://bugs.llvm.org/show_bug.cgi?id=40070#c1 for the motivation.
Reviewed by: dblaikie
Differential Revision: https://reviews.llvm.org/D57443
llvm-svn: 352753
This change migrates most llvm-symbolizer tests away from reading input
via stdin and instead using --obj + positional arguments for the file
and addresses respectively, which makes the tests easier to read.
One exception is the test test/tools/llvm-symbolizer/pdb/pdb.test, which
was doing some manipulation on the input addresses. This patch
simplifies this somewhat, but it still reads from stdin.
More changes to follow to simplify/break-up other tests.
Reviewed by: dblaikie
Differential Revision: https://reviews.llvm.org/D57441
llvm-svn: 352752
In order to make an option value truly optional, both the ValueOptional
and an empty-named value are required. This empty-named value appears in
the command-line help text, which is not ideal.
This change improves the help text for these sort of options in a number
of ways:
1) ValueOptional options with an empty-named value now print their help
text twice: both without and then with '=<value>' after the name. The
latter version then lists the allowed values after it.
2) Empty-named values with no help text in ValueOptional options are not
listed in the permitted values.
3) Otherwise empty-named options are printed as =<empty> rather than
simply '='.
4) Option values without help text do not have the '-' separator
printed.
It also tweaks the llvm-symbolizer -functions help text to not print a
trailing ':' as that looks bad combined with 1) above.
Reviewed by: thopre, ruiu
Differential Revision: https://reviews.llvm.org/D57030
llvm-svn: 352750
Enables 32/64-bit scalar load broadcasts on AVX1 targets
The extractelement-load.ll regression will be fixed shortly in a followup commit.
llvm-svn: 352743
If we're not inserting the broadcast into the lowest subvector then we can avoid the insertion by just performing a larger broadcast.
Avoids a regression when we enable AVX1 broadcasts in shuffle combining
llvm-svn: 352742
Introduces a pass that provides a default lowering strategy for the
`experimental.widenable.condition` intrinsic, replacing all its uses with
`i1 true`.
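A hedged, module-level sketch of what that default lowering amounts to (the real pass's structure and name may differ):
```cpp
#include "llvm/ADT/STLExtras.h"
#include "llvm/IR/Constants.h"
#include "llvm/IR/Instructions.h"
#include "llvm/IR/Module.h"
using namespace llvm;

// Replace every call to llvm.experimental.widenable.condition with `i1 true`
// and erase the call.
static bool lowerWidenableConditions(Module &M) {
  Function *WC = M.getFunction("llvm.experimental.widenable.condition");
  if (!WC)
    return false;
  bool Changed = false;
  for (User *U : make_early_inc_range(WC->users())) {
    auto *CI = cast<CallInst>(U);
    CI->replaceAllUsesWith(ConstantInt::getTrue(CI->getContext()));
    CI->eraseFromParent();
    Changed = true;
  }
  return Changed;
}
```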
Differential Revision: https://reviews.llvm.org/D56096
Reviewed By: reames
llvm-svn: 352739
Constants can also be materialised using the negated value and an MVN, and this
case seems to have been missed for Thumb2. To check the constant materialisation
costs, we now call getT2SOImmVal twice, once for the original constant and once
for its negated value; this function checks whether the constant can be encoded
as a splatted or rotated immediate.
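A hedged sketch of the intended check (helper name invented; the real change lives in the ARM cost-model code):
```cpp
// Sketch: a constant is cheap to materialise for Thumb2 if either it or its
// bitwise negation (usable via MVN) is a valid T2 modified immediate.
static bool isCheapThumb2Immediate(uint32_t Imm) {
  return ARM_AM::getT2SOImmVal(Imm) != -1 ||
         ARM_AM::getT2SOImmVal(~Imm) != -1;
}
```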
This was revealed by a test that optimises for minsize: instead of an LDR
literal-pool load plus a literal-pool entry, just an MVN with an immediate
is smaller (and also faster).
Differential Revision: https://reviews.llvm.org/D57327
llvm-svn: 352737
And instead just generate a libcall. My motivating example on ARM was a simple:
shl i64 %A, %B
for which the code bloat is quite significant. For other targets that also
accept __int128/i128 such as AArch64 and X86, it is also beneficial for these
cases to generate a libcall when optimising for minsize. On these 64-bit targets,
the 64-bit shifts are of course unaffected because the SHIFT/SHIFT_PARTS
lowering operation action is not set to custom/expand.
Differential Revision: https://reviews.llvm.org/D57386
llvm-svn: 352736
On Unix/Mac OS X, normpath() returns the path unchanged (FileCheck), but
on case-insensitive filesystems (like NTFS on Windows), it converts the
path to lowercase (filecheck) which was causing the test to fail.
llvm-svn: 352735
Previously, there were two different scripts for generating VCS headers:
one used by LLVM and one used by Clang. They were both similar, but
different. They were both broken in their own ways, for example the one
used by Clang didn't properly handle monorepo resulting in an incorrect
version information reported by Clang.
This change unifies the two scripts by introducing a new script that's
used from both LLVM and Clang, ensures that the new script supports both
monorepo and standalone SVN and Git setups, and removes the old scripts.
Differential Revision: https://reviews.llvm.org/D57063
llvm-svn: 352729
Currently SCEV attempts to limit transformations so that they do not work with
big SCEVs (that may take almost infinite compile time). But for this, it uses heuristics
such as recursion depth and number of operands, which do not give us a guarantee
that we don't actually have big SCEVs. This situation is still possible, though it is not
likely to happen. However, the bug PR33494 showed a bunch of simple corner case
tests where we still produce huge SCEVs without even reaching a big recursion depth, etc.
This patch introduces a concept of 'huge' SCEVs. A SCEV is huge if its expression
size (introduced in D35989) exceeds some threshold value. We prohibit optimizing
transformations if any of SCEVs we are dealing with is huge. This gives us a reliable
check that we don't spend too much time working with them.
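A hedged sketch of the guard (helper and threshold names are illustrative; getExpressionSize is the size measure from D35989):
```cpp
// Sketch: refuse to run a simplifying transform if any participating SCEV
// is already "huge" by expression size.
static bool anyHugeSCEV(ArrayRef<const SCEV *> Ops, unsigned Threshold) {
  return llvm::any_of(Ops, [Threshold](const SCEV *S) {
    return S->getExpressionSize() > Threshold;
  });
}
```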
As the next step, we can possibly get rid of old limiting mechanisms, such as recursion
depth thresholds.
Differential Revision: https://reviews.llvm.org/D35990
Reviewed By: reames
llvm-svn: 352728
This change reverts r351626.
The changes in r351626 cause quadratic work in several cases. (See r351626 thread on llvm-commits for details.)
llvm-svn: 352722
Also fix an alignment bug in getMachineMemOperand. If the
tracked value is null, the offset isn't tracked so the
base alignment needs to be reduced.
llvm-svn: 352716
LLVMConfig.with_environment() uses os.path.normcase(os.path.normpath(x)) to
normalize temporary env vars. LLVMConfig.use_clang() uses with_environment() to
temporarily set PATH and then look for clang there. This means that on Windows,
clang will be run with a path like c:\foo\bin\clang.EXE (with a lower-case
"C:").
lit.util.which() used to not do this, which means the executables added in
clang/test/lit.cfg.py (e.g. c-index-test) were run with a path like
C:\foo\bin\c-index-test.EXE (because both CMake and GN happen to write
clang_tools_dir with an upper-case C to lit.site.cfg.py).
clang/test/Index/pch-from-libclang.c requires that both c-index-test and clang
use _exactly_ the same resource dir path (same case and everything), because a
hash of the resource directory is used as module cache path.
This patch is necessary but not sufficient to make pch-from-libclang.c pass on
Windows.
Differential Revision: https://reviews.llvm.org/D57343
llvm-svn: 352704
Summary:
Fixes PR40267, in which the removed assertion was triggering on
perfectly valid IR. As far as I can tell, constant out of bounds
indices should be allowed when splitting extract_vector_elt, since
they will simply be propagated as out of bounds indices in the
resulting split vector and handled appropriately elsewhere.
Reviewers: aheejin
Subscribers: dschuff, sbc100, jgravelle-google, hiraditya
Differential Revision: https://reviews.llvm.org/D57471
llvm-svn: 352702
These can be triggered by mistakenly using a 64-bit-mode-only intrinsic with -mtriple=i686. Using report_fatal_error gives a better experience for this mistake in release builds instead of probably crashing.
We already do this for some of the vector type legalization handles.
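A hedged sketch of the pattern (helper name and message text invented):
```cpp
// Sketch: in release builds, emit a clear fatal error instead of crashing
// when a 64-bit-mode-only intrinsic reaches legalization on a 32-bit target.
static void diagnoseWrongModeIntrinsic(bool Is64Bit) {
  if (!Is64Bit)
    report_fatal_error("64-bit-mode-only intrinsic used in 32-bit mode");
}
```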
llvm-svn: 352699
I believe this was there to handle avx512bw intrinsics that returned i64 type in 32-bit mode. But all those intrinsics have since been changed to v64i1 results or replaced with generic IR.
llvm-svn: 352698
Summary:
We planned to delete this intrinsic and do custom lowering from
`wasm.get.exception`, which has a token argument, to
`EXTRACT_EXCEPTION`, a wasm pseudo instruction that simulates popping a
value from the wasm stack.
To do that, we need to introduce a new `WebAssemblyISD` node for this,
which itself is not a problem, but also have to introduce the
`WebAssemblyISD` namespace in SelectionDAGBuilder.cpp. I don't think any
other targets are doing that in the file. And also putting a
target-specific intrinsic in the common file is a little weird too. (All
other intrinsic functions in this `visitIntrinsicCall` function are not
target-specific ones; other target-specific intrinsics are usually
handled in `lib/Target/[TargetName]/[TargetName]ISelLowering.cpp`. The
reason we can't do that here is that this one has a token argument.)
Anyway, I think I prefer the current code with one redundant
intrinsic more than adding one more `WebAssemblyISD` node and
also introducing the `WebAssemblyISD` namespace into
SelectionDAGBuilder.cpp. What do you think?
Reviewers: dschuff
Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits
Differential Revision: https://reviews.llvm.org/D57480
llvm-svn: 352695
ELF sections allow 0 for the alignment, which is specified to
be the same as 1. However many clients do not expect this and
will behave poorly in the presence of a 0-aligned section (for
example by trying to modulo something by the section alignment).
We can be more polite by making sure that we always pass a
non-zero value to clients.
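A hedged sketch of the normalisation (helper name invented):
```cpp
// Sketch: ELF allows sh_addralign == 0, which means the same as 1, so
// report at least 1 to clients that divide or take a modulo by it.
static uint64_t getReportedAlignment(uint64_t ShAddralign) {
  return ShAddralign == 0 ? 1 : ShAddralign;
}
```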
Differential Revision: https://reviews.llvm.org/D57482
llvm-svn: 352694
This teaches the legalizer to handle G_FEXP in AArch64. As a result, it also
allows us to select G_FEXP.
It...
- Updates the legalizer-info tests
- Adds a test for legalizing exp
- Updates the existing fp tests to show that we can now select G_FEXP
https://reviews.llvm.org/D57483
llvm-svn: 352692
This adds instruction selection support for G_FABS in AArch64. It also updates
the existing basic FP tests and adds a selection test for G_FABS.
https://reviews.llvm.org/D57418
llvm-svn: 352684
Summary:
After the stack is unwound due to a thrown exception,
`__stack_pointer` global can point to an invalid address. So
a `global.set` to restore `__stack_pointer` should be inserted right
after `catch` instruction.
But after r352598 the `global.set` instruction is inserted not right
after `catch` but after `block` - `br-on-exn` - `end_block` -
`extract_exception` sequence. This CL fixes it.
While doing that, we can actually move ReplacePhysRegs pass after
LateEHPrepare and merge EHRestoreStackPointer pass into LateEHPrepare,
and now placing `global.set` to `__stack_pointer` right after `catch` is
much easier. Otherwise it is hard to guarantee that `global.set` is
still right after `catch` and not touched by other transformations, in
which case we have to do something to hoist it.
Reviewers: dschuff
Subscribers: mgorny, sbc100, jgravelle-google, sunfish, llvm-commits
Differential Revision: https://reviews.llvm.org/D57421
llvm-svn: 352681
This extends the existing transform for:
add X, 0/1 --> sub X, 0/-1
...to allow the sibling subtraction fold.
This pattern could regress with the proposed change in D57401.
llvm-svn: 352680
This teaches GlobalISel to emit an RTLib call for @llvm.log2 when it encounters
it.
It updates the existing floating point tests to show that we don't fall back on
the intrinsic, and select the correct instructions. It also adds a legalizer
test for G_FLOG2.
https://reviews.llvm.org/D57357
llvm-svn: 352673
This teaches the legalizer about G_FSQRT in AArch64. Also adds a legalizer
test for G_FSQRT, a selection test for it, and updates existing floating point
tests.
https://reviews.llvm.org/D57361
llvm-svn: 352671
This introduces a generic instruction for computing the floating point
square root of a value.
Right now, we can't select @llvm.sqrt, so this is working towards fixing that.
llvm-svn: 352668