llvm-project

Commit Graph

Author	SHA1	Message	Date
Ilya Biryukov	4ea70ecda8	[Support] Add a GTest matcher for Optional<T> Reviewers: sammccall Reviewed By: sammccall Subscribers: mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61071 llvm-svn: 359174	2019-04-25 09:03:32 +00:00
Roman Lebedev	445c22b7eb	[NFC][LoopIdiomRecognize] Some basic baseline tests for bcmp loop idiom Doubt this is the final test coverage, but this appears to have good coverage already, so i figure i might as well precommit it. llvm-svn: 359173	2019-04-25 08:33:47 +00:00
Simon Atanasyan	a0291110da	[MIPS] Use custom bitcast lowering to avoid excessive instructions On Mips32r2 bitcast can be expanded to two sw instructions and an ldc1 when using bitcast i64 to double or an sdc1 and two lw instructions when using bitcast double to i64. By introducing custom lowering that uses mtc1/mthc1 we can avoid excessive instructions. Patch by Mirko Brkusanin. Differential Revision: https://reviews.llvm.org/D61069 llvm-svn: 359171	2019-04-25 07:47:28 +00:00
Craig Topper	013503c78d	[X86] Remove part of an if condition that should always be true. The IndexReg will always be non-null at this point. Earlier in the function, if IndexReg was null we set it to CurDAG->getRegister(0, VT) which made it non-null. llvm-svn: 359170	2019-04-25 06:08:02 +00:00
Lang Hames	64eb9a95be	[JITLink] Make the JITLink MachO/x86-64 eh-frame test work on Windows. This should fix the MachO/x86-64 eh-frame regression test by ensuring that the symbols __ZTIi and ___gxx_personality_v0 are defined on all platforms. llvm-svn: 359169	2019-04-25 05:24:40 +00:00
Lang Hames	cf49aa3908	[llvm-rtdyld] Add support for passing command line arguments to rtdyld-run code. The --args option can now be used to pass arguments to code linked with llvm-rtdyld. E.g. $ llvm-rtdyld file1.o file2.o --args a b c is equivalent to: $ ld -o program file1.o file2.o $ ./program a b c This is the rtdyld counterpart to the jitlink change in r359115, and makes benchmarking and comparison between the tools easier. llvm-svn: 359168	2019-04-25 05:02:10 +00:00
Alina Sbirlea	733c8c40c8	Enable LoopVectorization by default. Summary: When refactoring vectorization flags, vectorization was disabled by default in the new pass manager. This patch re-enables is for both managers, and changes the assumptions opt makes, based on the new defaults. Comments in opt.cpp should clarify the intended use of all flags to enable/disable vectorization. Reviewers: chandlerc, jgorbe Subscribers: jlebar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61091 llvm-svn: 359167	2019-04-25 04:49:48 +00:00
Fangrui Song	3458ff361a	[llvm-objdump] errorToErrorCode+message -> toString For test/Object/elf-invalid-phdr.test, the intended error message got lost due to errorToErrorCode(). llvm-svn: 359166	2019-04-25 04:31:26 +00:00
Philip Reames	88cd69b56f	Consolidate existing utilities for interpreting vector predicate maskes [NFC] llvm-svn: 359163	2019-04-25 02:30:17 +00:00
Kit Barton	8e64f0a649	Fix unused variable warning in LoopFusion pass. Do not wrap the contents of printFusionCandidates in the LLVM_DEBUG macro. This fixes an unused variable warning generated when compiling without asserts but with -DENABLE_LLVM_DUMP. Differential Revision: https://reviews.llvm.org/D61035 llvm-svn: 359161	2019-04-25 02:10:02 +00:00
Philip Reames	7c8647b26f	[InstCombine] Be consistent w/handling of masked intrinsics style wise [NFC] llvm-svn: 359160	2019-04-25 01:18:56 +00:00
Davide Italiano	4f88388c0b	[utils] Add a lldb data formatter for llvm::SmallString. Result: (lldb) p val (llvm::SmallString<32>) $31 = "patatino" llvm-svn: 359157	2019-04-25 00:03:02 +00:00
Austin Kerbow	83e52142d1	Fix spelling error. NFC Summary: Test commit. Reviewers: msearles, jkorous Reviewed By: jkorous Subscribers: dexonsmith, arsenm, jvesely, nhaehnle, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61093 llvm-svn: 359154	2019-04-24 23:32:21 +00:00
Nico Weber	23cb79ff93	llvm-cvtres: Make new dupe resource error a bit friendlier For well-known type IDs, include the name of the type. To not duplicate the ID->name map, make llvm-readobj call this new function as well. It has slightly different output, so this also requires updating a few tests. Differential Revision: https://reviews.llvm.org/D61086 llvm-svn: 359153	2019-04-24 23:26:30 +00:00
JF Bastien	fb742da34c	posix_spawn should retry upon EINTR Summary: We've seen cases of bots failing with: clang: error: unable to execute command: posix_spawn failed: Interrupted system call Add a small retry loop to posix_spawn in case this happens. Don't retry too much in case there's some systemic problem going on, but retry a few times. <rdar://problem/50181448> Reviewers: Bigcheese, arphaman Subscribers: jkorous, dexonsmith, kristina, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61096 llvm-svn: 359152	2019-04-24 23:24:53 +00:00
Reid Kleckner	54763e4453	Mark new jitlink test XFAIL for windows llvm-svn: 359151	2019-04-24 23:11:17 +00:00
Amy Huang	68c9199493	Recommitting r358783 and r358786 "[MS] Emit S_HEAPALLOCSITE debug info" with fixes for buildbot error (undefined assembler label). Summary: This emits labels around heapallocsite calls and S_HEAPALLOCSITE debug info in codeview. Currently only changes FastISel, so emitting labels still needs to be implemented in SelectionDAG. Reviewers: rnk Subscribers: aprantl, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D61083 llvm-svn: 359149	2019-04-24 23:02:48 +00:00
Sanjay Patel	6f41bf948b	[DAGCombiner] scale repeated FP divisor by splat factor If we have a vector FP division with a splatted divisor, use the existing transform that converts 'x/y' into 'x * (1.0/y)' to allow more conversions. This can then potentially be converted into a scalar FP division by existing combines (rL358984) as seen in the tests here. That can be a potentially big perf difference if scalar fdiv has better timing (including avoiding possible frequency throttling for vector ops). Differential Revision: https://reviews.llvm.org/D61028 llvm-svn: 359147	2019-04-24 22:28:58 +00:00
Joerg Sonnenberger	8372b467f1	[PowerPC] Allow using initial-exec TLS with PIC Using initial-exec TLS variables is a reasonable performance optimisation for system libraries. Use the correct PIC mechanism to get hold of the GOT to avoid text relocations. Differential Revision: https://reviews.llvm.org/D61026 llvm-svn: 359146	2019-04-24 22:12:22 +00:00
Sean Fertile	526633deea	Add period at end of comment. llvm-svn: 359144	2019-04-24 21:51:30 +00:00
Craig Topper	6932abee2c	[X86] Attempt to fix use-after-poison from r359121. llvm-svn: 359143	2019-04-24 21:48:24 +00:00
Stanislav Mekhanoshin	9d287358a8	[AMDGPU] gfx1010 SOP instructions Differential Revision: https://reviews.llvm.org/D61080 llvm-svn: 359139	2019-04-24 20:44:34 +00:00
Alexey Bataev	ef3c1884ec	[SLP] Fix crash after r358519, by V. Porpodas. Summary: The code did not check if operand was undef before casting it to Instruction. Reviewers: RKSimon, ABataev, dtemirbulatov Reviewed By: ABataev Subscribers: uabelho Tags: #llvm Differential Revision: https://reviews.llvm.org/D61024 llvm-svn: 359136	2019-04-24 20:21:32 +00:00
Reid Kleckner	c06a470fc8	Try once more to ensure constant initializaton of ManagedStatics First, use the old style of linker initialization for MSVC 2019 in addition to 2017. MSVC 2019 emits a dynamic initializer for ManagedStatic when compiled in debug mode, and according to zturner, also sometimes in release mode. I wasn't able to reproduce that, but it seems best to stick with the old code that works. When clang is using the MSVC STL, we have to give ManagedStatic a constexpr constructor that fully zero initializes all fields, otherwise it emits a dynamic initializer. The MSVC STL implementation of std::atomic has a non-trivial (but constexpr) default constructor that zero initializes the atomic value. Because one of the fields has a non-trivial constructor, ManagedStatic ends up with a non-trivial ctor. The ctor is not constexpr, so clang ends up emitting a dynamic initializer, even though it simply does zero initialization. To make it constexpr, we must initialize all fields of the ManagedStatic. However, while the constructor that takes a pointer is marked constexpr, clang says it does not evaluate to a constant because it contains a cast from a pointer to an integer. I filed this as: https://developercommunity.visualstudio.com/content/problem/545566/stdatomic-value-constructor-is-not-actually-conste.html Once we do that, we can add back the LLVM_REQUIRE_CONSTANT_INITIALIZATION marker, and so far as I'm aware it compiles successfully on all supported targets. llvm-svn: 359135	2019-04-24 20:13:23 +00:00
Xinliang David Li	499c80b890	Add optional arg to profile count getters to filter synthetic profile count. Differential Revision: http://reviews.llvm.org/D61025 llvm-svn: 359131	2019-04-24 19:51:16 +00:00
Craig Topper	af194e9380	[X86] Prevent folding a load into an AND if that AND is really a ZEXT_INREG that should use movzx. This can save a 32-bit immediate move. We would shrink the load and fold it if it was non-volatile, but that's trickier to check for. llvm-svn: 359129	2019-04-24 19:28:38 +00:00
Nico Weber	1591693c7c	llvm-cvtres: Remove a default argument. No behavior change. llvm-svn: 359128	2019-04-24 19:13:38 +00:00
Adrian Prantl	c90ff5e123	Revert using fcopyfile(3) to implement sys::fs::copy_file(Twine, int) on macOS It turns out that I mesread the man page and fcopyfile(3) does not actually support COPYFILE_CLONE for files. <rdar://problem/50148757> llvm-svn: 359127	2019-04-24 19:08:43 +00:00
David Blaikie	832c7d9f36	DebugInfo: Emit only declarations (not whole definitions) of non-unit user defined types into type units While this doesn't come up in reasonable cases currently (the only user defined types not in type units are ones without linkage - which makes for near-ODR violations, because it'd be a type with linkage referencing a type without linkage - such a type can't be validly defined in more than one TU, so arguably it shouldn't be in a type unit to begin with - but it's a convenient way to demonstrate an issue that will become more revalent with homed modular debug info type definitions - which also don't need to be in type units but more legitimately so). Precursor to the Clang change to de-type-unit (by omitting the 'identifier') types homed due to strong linkage vtables. (making that change without this one would lead to major type duplication in type units) llvm-svn: 359122	2019-04-24 18:09:44 +00:00
Craig Topper	882ca6d484	[X86] Remove dead nodes left after ReplaceAllUsesWith calls during address matching ReplaceAllUsesWith doesn't remove the node that was replaced. So its left around in the graph messing up use counts on other nodes. One thing to note, is that this isn't valid if the node being deleted is the root node of an LEA match that gets rejected. In that case the node needs to stay alive because the isel table walking code would still have a reference to it that its going to try to match next. I don't think that's the case here though because the nodes being deleted here should be "and", "srl", and "zero_extend" none of which can be the root node of an LEA match. Differential Revision: https://reviews.llvm.org/D61048 llvm-svn: 359121	2019-04-24 18:02:07 +00:00
Stanislav Mekhanoshin	33d806a517	[AMDGPU] gfx1010 sgpr register changes Differential Revision: https://reviews.llvm.org/D61045 llvm-svn: 359117	2019-04-24 17:28:30 +00:00
Simon Pilgrim	10daecba1d	[X86][SSE] Add tests for bitcasting vXi1 bool vectors to non-simple types. llvm-svn: 359116	2019-04-24 17:25:45 +00:00
Lang Hames	d959a609a4	[JITLink] Add support for passing arguments to jit-linked code. The --args option can now be used to pass arguments to code linked with llvm-jitlink. E.g. $ llvm-jitlink file1.o file2.o --args a b c is equivalent to: $ ld -o program file1.o file2.o $ ./program a b c llvm-svn: 359115	2019-04-24 17:23:05 +00:00
Robert Widmann	09c5b883cb	[LLVM-C] Deprecate the LLVMValueRef-returning metadata creation functions Summary: There is still some value in using these functions while the remaining LLVMValueRef-based accessors are still around, but LLVMMDNodeInContext in particular has some wonky semantics that make it worth replacing outright. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60524 llvm-svn: 359114	2019-04-24 17:05:08 +00:00
Stanislav Mekhanoshin	cee607e414	[AMDGPU] Add gfx1010 target definitions Differential Revision: https://reviews.llvm.org/D61041 llvm-svn: 359113	2019-04-24 17:03:15 +00:00
Simon Pilgrim	55f14dac74	[InstCombine][X86] Use generic expansion of PACKSS/PACKUS for constant folding. NFCI. This patch rewrites the existing PACKSS/PACKUS constant folding code to expand as a generic expansion. This is a first NFCI step toward expanding PACKSS/PACKUS intrinsics which are acting as non-saturating truncations (although technically the expansion could be used in all cases - but we'll probably want to be conservative). llvm-svn: 359111	2019-04-24 16:53:17 +00:00
JF Bastien	46d67fa6c5	Revert "[llvm-objdump] errorToErrorCode+message -> toString" Revert r359100 It breaks llvm/test/Object/elf-invalid-phdr.test llvm-svn: 359110	2019-04-24 16:49:30 +00:00
Nico Weber	8d05eb8556	llvm-undname: Fix assert-on->4GiB-string-literal, found by oss-fuzz llvm-svn: 359109	2019-04-24 16:09:38 +00:00
Lang Hames	b1ba4d8a8a	[JITLink] Refer to FDE's CIE (not the most recent CIE) when parsing eh-frame. Frame Descriptor Entries (FDEs) have a pointer back to a Common Information Entry (CIE) that describes how the rest FDE should be parsed. JITLink had been assuming that FDEs always referred to the most recent CIE encountered, but the spec allows them to point back to any previously encountered CIE. This patch fixes JITLink to look up the correct CIE for the FDE. The testcase is a MachO binary with an FDE that refers to a CIE that is not the one immediately proceeding it (the layout can be viewed wit 'dwarfdump --eh-frame <testcase>'. This test case had to be a binary as llvm-mc now sorts FDEs (as of r356216) to ensure FDEs do point to the most recent CIE. llvm-svn: 359105	2019-04-24 15:15:55 +00:00
Fangrui Song	aaecb8f799	[llvm-objdump] Delete redundant check llvm-svn: 359102	2019-04-24 15:09:23 +00:00
George Rimar	93a47a6291	[obj2yamp] - Simplify and cleanup the code in ELFDumper<ELFT>::dumpGroup a bit. NFC. This makes the variables naming to match LLVM style, simplifies the code used to extract the group members, simplifies the loop and reorders the code around a bit. llvm-svn: 359101	2019-04-24 15:03:53 +00:00
Fangrui Song	a5f8dcb63f	[llvm-objdump] errorToErrorCode+message -> toString llvm-svn: 359100	2019-04-24 15:03:46 +00:00
Dmitry Preobrazhensky	47621d7c89	[AMDGPU][MC] Parser cleanup and refactoring Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D60767 llvm-svn: 359096	2019-04-24 14:06:15 +00:00
Sanjay Patel	b1b3368907	[x86] make sure horizontal op and broadcast types match to simplify (PR41414) If the types don't match, we can't just remove the shuffle. There may be some other opportunity for optimization here, but this should prevent the crashing seen in: https://bugs.llvm.org/show_bug.cgi?id=41414 llvm-svn: 359095	2019-04-24 14:05:08 +00:00
whitequark	50392a3b1b	[LLVM-C] Use dyn_cast instead of unwrap in LLVMGetDebugLoc functions Summary: The `unwrap<Type>` calls can assert with: ``` Assertion failed: (isa<X>(Val) && "cast<Ty>() argument of incompatible type!"), function cast ``` so replace them with `dyn_cast`. Reviewers: whitequark, abdulras, hiraditya, compnerd Reviewed By: whitequark Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60473 llvm-svn: 359093	2019-04-24 13:30:03 +00:00
Fangrui Song	de0462a500	[yaml2obj] Replace num_zeros with write_zeros llvm-svn: 359091	2019-04-24 13:23:15 +00:00
George Rimar	b49e192a37	[yaml2elf] - Replace a loop with write_zeros(). NFC. And apply clang-format to the method changed. llvm-svn: 359090	2019-04-24 13:02:15 +00:00
Simon Pilgrim	d30745b2a0	[X86] Add shouldFoldConstantShiftPairToMask override placeholder. NFCI. Prep work toward fixing PR40758 llvm-svn: 359088	2019-04-24 12:34:08 +00:00
Nico Weber	ccf096463a	Let llvm-cvtres (and lld-link) report duplicate resources If two .res files contain the same resource, cvtres.exe (and hence link.exe) reject the input with this message: CVTRES : fatal error CVT1100: duplicate resource. type:STRING, name:101, language:0x0409 LINK : fatal error LNK1123: failure during conversion to COFF: file invalid or corrupt llvm-cvtres (and lld-link) used to silently pick one of the duplicate resources instead. This patch makes them report an error as well. We slightly improve on cvtres by printing the name of two .res files containing duplicate entries as well. Differential Revision: https://reviews.llvm.org/D61049 llvm-svn: 359083	2019-04-24 11:42:59 +00:00
Simon Pilgrim	039a563e6a	[X86][SSE] Add masked bit test cases for PR26697 llvm-svn: 359082	2019-04-24 10:34:15 +00:00
Bjorn Pettersson	71e8c6f20f	Add "const" in GetUnderlyingObjects. NFC Summary: Both the input Value pointer and the returned Value pointers in GetUnderlyingObjects are now declared as const. It turned out that all current (in-tree) uses of GetUnderlyingObjects were trivial to update, being satisfied with have those Value pointers declared as const. Actually, in the past several of the users had to use const_cast, just because of ValueTracking not providing a version of GetUnderlyingObjects with "const" Value pointers. With this patch we get rid of those const casts. Reviewers: hfinkel, materi, jkorous Reviewed By: jkorous Subscribers: dexonsmith, jkorous, jholewinski, sdardis, eraman, hiraditya, jrtc27, atanasyan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61038 llvm-svn: 359072	2019-04-24 06:55:50 +00:00
Craig Topper	1e413ffa7b	[Mips][CodeGen] Remove MachineFunction::setSubtarget. Change Mips to just copy the subtarget from the MachineFunction instead of recalculating it. Summary: The MachineFunction should have been created with the correct subtarget. As long as there is no way to change it, MipsTargetMachine can just capture it directly from the MachineFunction without calling getSubtargetImpl again. While there, const correct the Subtarget pointer to avoid a const_cast. I believe the Mips16Subtarget and NoMips16Subtarget members are never used, but I'll leave there removal for a separate patch. Reviewers: echristo, atanasyan Reviewed By: atanasyan Subscribers: sdardis, arichardson, hiraditya, jrtc27, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60936 llvm-svn: 359071	2019-04-24 06:48:31 +00:00
Fangrui Song	b5f3984541	[CommandLine] Provide parser<unsigned long> instantiation to allow cl::opt<uint64_t> on LP64 platforms Summary: And migrate opt<unsigned long long> to opt<uint64_t> Fixes PR19665 Differential Revision: https://reviews.llvm.org/D60933 llvm-svn: 359068	2019-04-24 02:40:20 +00:00
Nico Weber	39a2d20a0f	llvm-cvtres: Accept /? as help flag, like cvtres.exe llvm-svn: 359064	2019-04-24 02:11:24 +00:00
Nico Weber	95c18c7bee	gn build: Merge r359050 more llvm-svn: 359058	2019-04-24 00:59:24 +00:00
Nico Weber	8b83fb590d	gn build: Merge r359050 llvm-svn: 359056	2019-04-24 00:44:14 +00:00
Alina Sbirlea	b341efce31	Revert [AliasAnalysis] AAResults preserves AAManager. Triggers use-after-free. llvm-svn: 359055	2019-04-24 00:28:29 +00:00
Francis Visoiu Mistrih	465415f1db	[Remarks] Fix documentation indentation Fix the documentation bot: http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/30450/steps/docs-llvm-html/logs/stdio llvm-svn: 359053	2019-04-24 00:27:59 +00:00
Francis Visoiu Mistrih	7fee2b89fd	[Remarks] Add string deduplication using a string table * Add support for uniquing strings in the remark streamer and emitting the string table in the remarks section. * Add parsing support for the string table in the RemarkParser. From this remark: ``` --- !Missed Pass: inline Name: NoDefinition DebugLoc: { File: 'test-suite/SingleSource/UnitTests/2002-04-17-PrintfChar.c', Line: 7, Column: 3 } Function: printArgsNoRet Args: - Callee: printf - String: ' will not be inlined into ' - Caller: printArgsNoRet DebugLoc: { File: 'test-suite/SingleSource/UnitTests/2002-04-17-PrintfChar.c', Line: 6, Column: 0 } - String: ' because its definition is unavailable' ... ``` to: ``` --- !Missed Pass: 0 Name: 1 DebugLoc: { File: 3, Line: 7, Column: 3 } Function: 2 Args: - Callee: 4 - String: 5 - Caller: 2 DebugLoc: { File: 3, Line: 6, Column: 0 } - String: 6 ... ``` And the string table in the .remarks/__remarks section containing: ``` inline\0NoDefinition\0printArgsNoRet\0 test-suite/SingleSource/UnitTests/2002-04-17-PrintfChar.c\0printf\0 will not be inlined into \0 because its definition is unavailable\0 ``` This is mostly supposed to be used for testing purposes, but it gives us a 2x reduction in the remark size, and is an incremental change for the updates to the remarks file format. Differential Revision: https://reviews.llvm.org/D60227 llvm-svn: 359050	2019-04-24 00:06:24 +00:00
Josh Stone	27924c3a3c	[Lint] Permit aliasing noalias readonly arguments Summary: If two arguments are both readonly, then they have no memory dependency that would violate noalias, even if they do actually overlap. Reviewers: hfinkel, efriedma Reviewed By: efriedma Subscribers: efriedma, hiraditya, llvm-commits, tstellar Tags: #llvm Differential Revision: https://reviews.llvm.org/D60239 llvm-svn: 359047	2019-04-23 23:43:47 +00:00
Jessica Paquette	4fe7574d5d	[AArch64][GlobalISel] Select G_INTRINSIC_ROUND Add selection support for G_INTRINSIC_ROUND, add a selection test, and add check lines to arm64-vfloatintrinsics.ll and f16-instructions.ll. llvm-svn: 359046	2019-04-23 23:03:03 +00:00
Jessica Paquette	9766bf1854	[AArch64][GlobalISel] Mark G_INTRINSIC_ROUND as a pre-isel floating point opcode Add G_INTRINSIC_ROUND to isPreISelGenericFloatingPointOpcode to ensure that its input and output are assigned the correct register bank. Add a regbankselect test to verify that we get what we expect here. llvm-svn: 359044	2019-04-23 22:47:00 +00:00
Dmitry Mikulin	312b5f86b7	The error message for mismatched value sites is very cryptic. Make it more readable for an average user. Differential Revision: https://reviews.llvm.org/D60896 llvm-svn: 359043	2019-04-23 22:26:55 +00:00
Alex Langford	bfd248d2a6	[CMake] Use add_dependencies in add_llvm_install_targets Summary: The CMake documentation says that the `DEPENDS` field of add_custom_target is for files and output of custom commands. Adding a dependency on a target should be done with `add_dependency`. Differential Revision: https://reviews.llvm.org/D60879 llvm-svn: 359042	2019-04-23 21:59:07 +00:00
Francis Visoiu Mistrih	1646851b87	[CGP] Look through bitcasts when duplicating returns for tail calls The simple case of: ``` int callee(); void caller(void *a) { if (a == NULL) return callee(); return a; } ``` would generate a regular call instead of a tail call because we don't look through the bitcast of the call to `callee` when duplicating the return blocks. Differential Revision: https://reviews.llvm.org/D60837 llvm-svn: 359041	2019-04-23 21:57:46 +00:00
Francis Visoiu Mistrih	eea9da5921	[X86] Add codegen prepare test exercising a bitcast + tail call In preparation of https://reviews.llvm.org/D60837, add this test where we don't perform a tail call because we don't look through a bitcast. llvm-svn: 359040	2019-04-23 21:57:43 +00:00
Heejin Ahn	b9f282d384	[WebAssembly] Emit br_table for most switch instructions Summary: Always convert switches to br_tables unless there is only one case, which is equivalent to a simple branch. This reduces code size for wasm, and we defer possible jump table optimizations to the VM. Addresses PR41502. Reviewers: kripken, sunfish Subscribers: dschuff, sbc100, jgravelle-google, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60966 llvm-svn: 359038	2019-04-23 21:30:30 +00:00
Heejin Ahn	ace7a086ca	[WebAssembly] Make LBB markers not affected by test order Summary: This way we can change the order of tests or delete some of them without affecting tests for other functions. Reviewers: tlively Subscribers: sunfish, dschuff, sbc100, jgravelle-google, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60929 llvm-svn: 359036	2019-04-23 21:17:03 +00:00
Amy Huang	fc79ab9857	Revert "[MS] Emit S_HEAPALLOCSITE debug info" because of ToTWin64(db) buildbot failure. This reverts commit `d07d6d6177` and `c774f687b6`. llvm-svn: 359034	2019-04-23 21:12:58 +00:00
Jessica Paquette	3cc6d1f542	[AArch64][GlobalISel] Legalize G_INTRINSIC_ROUND Add it to the same rule as G_FCEIL etc. Add a legalizer test, and add a missing switch case to AArch64LegalizerInfo.cpp. llvm-svn: 359033	2019-04-23 21:11:57 +00:00
Alina Sbirlea	4fd1f266b1	[MemorySSA] LCSSA preserves MemorySSA. Summary: Enabling MemorySSA in the old pass manager leads to MemorySSA being run twice due to the fact that LCSSA and LoopSimplify do not preserve MemorySSA. This is the first step to address that: target LCSSA. LCSSA does not make any changes that invalidate MemorySSA, so it preserves it by design. It must preserve AA as well, for this to hold. After this patch, MemorySSA is still run twice in the old pass manager. Step two follows: target LoopSimplify. Subscribers: mehdi_amini, jlebar, Prazek, llvm-commits, george.burgess.iv, chandlerc Tags: #llvm Differential Revision: https://reviews.llvm.org/D60832 llvm-svn: 359032	2019-04-23 20:59:44 +00:00
Craig Topper	26518466ef	[X86] Autogenerate complete checks. NFC Prep for D60993 llvm-svn: 359031	2019-04-23 20:52:00 +00:00
Jessica Paquette	991cb39242	[AArch64][GlobalISel] Actually select G_INTRINSIC_TRUNC Apparently FileCheck wasn't actually matching the fallback check lines in arm64-vfloatintrinsics.ll properly. So, there were selection fallbacks for G_INTRINSIC_TRUNC there. Actually hook it up into AArch64InstructionSelector.cpp and write a proper selection test. I guess I'll figure out the FileCheck magic to make the fallback checks work properly in arm64-vfloatintrinsics.ll. llvm-svn: 359030	2019-04-23 20:46:19 +00:00
Akira Hatanaka	5c3117b0a9	[ObjC][ARC] Check the basic block size before calling DominatorTree::dominate. ARC contract pass has an optimization that replaces the uses of the argument of an ObjC runtime function call with the call result. For example: ; Before optimization %1 = tail call i8* @foo1() %2 = tail call i8* @llvm.objc.retainAutoreleasedReturnValue(i8* %1) store i8* %1, i8** @g0, align 8 ; After optimization %1 = tail call i8* @foo1() %2 = tail call i8* @llvm.objc.retainAutoreleasedReturnValue(i8* %1) store i8* %2, i8** @g0, align 8 // %1 is replaced with %2 Before replacing the argument use, DominatorTree::dominate is called to determine whether the user instruction is dominated by the ObjC runtime function call instruction. The call to DominatorTree::dominate can be expensive if the two instructions belong to the same basic block and the size of the basic block is large. This patch checks the basic block size and just bails out if the size exceeds the limit set by command line option "arc-contract-max-bb-size". rdar://problem/49477063 Differential Revision: https://reviews.llvm.org/D60900 llvm-svn: 359027	2019-04-23 19:49:03 +00:00
David Blaikie	2f51176223	Reapply: "DebugInfo: Emit only one kind of accelerated access/name table"" Originally committed in r358931 Reverted in r358997 Seems this change made Apple accelerator tables miss names (because names started respecting the CU NameTableKind GNU & assuming that shouldn't produce accelerated names too), which is never correct (apple accelerator tables don't have separators or CU lists - if present, they must describe all names in all CUs). Original Description: Currently to opt in to debug_names in DWARFv5, the IR must contain 'nameTableKind: Default' which also enables debug_pubnames. Instead, only allow one of {debug_names, apple_names, debug_pubnames, debug_gnu_pubnames}. nameTableKind: Default gives debug_names in DWARFv5 and greater, debug_pubnames in v4 and earlier - and apple_names when tuning for lldb on MachO. nameTableKind: GNU always gives gnu_pubnames llvm-svn: 359026	2019-04-23 19:00:45 +00:00
Teresa Johnson	867bc3951b	[ThinLTO] Pass down opt level to LTO backend and handle -O0 LTO in new PM Summary: The opt level was not being passed down to the ThinLTO backend when invoked via clang (for distributed ThinLTO). This exposed an issue where the new PM was asserting if the Thin or regular LTO backend pipelines were invoked with -O0 (not a new issue, could be provoked by invoking in-process *LTO backends via linker using new PM and -O0). Fix this similar to the old PM where -O0 only does the necessary lowering of type metadata (WPD and LowerTypeTest passes) and then quits, rather than asserting. Reviewers: xur Subscribers: mehdi_amini, inglorion, eraman, hiraditya, steven_wu, dexonsmith, cfe-commits, llvm-commits, pcc Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D61022 llvm-svn: 359025	2019-04-23 18:56:19 +00:00
Nico Weber	6967da8ffa	llvm-cvtres: Split addChild(ID) into two functions Before, there was an IsData parameter. Now, there are two different functions for data nodes and ID nodes. No behavior change, needed for a follow-up change to make two data nodes (but not two ID nodes) with the same ID an error. For consistency, rename another addChild() overload to addNameChild(). llvm-svn: 359024	2019-04-23 18:46:53 +00:00
Jessica Paquette	ede0b2e695	[AArch64][GlobalISel] Teach regbankselect about G_INTRINSIC_TRUNC Add it to isPreISelGenericFloatingPointOpcode, and add a regbankselect test. Update arm64-vfloatintrinsics.ll now that we can select it. llvm-svn: 359022	2019-04-23 18:20:47 +00:00
Jessica Paquette	56342642a0	[AArch64][GlobalISel] Legalize G_INTRINSIC_TRUNC Same patch as G_FCEIL etc. Add the missing switch case in widenScalar, add G_INTRINSIC_TRUNC to the correct rule in AArch64LegalizerInfo.cpp, and add a test. llvm-svn: 359021	2019-04-23 18:20:44 +00:00
Nikita Popov	f945429fed	[ConstantRange] Add urem support Add urem support to ConstantRange, so we can handle in in LVI. This is an approximate implementation that tries to capture the most useful conditions: If the LHS is always strictly smaller than the RHS, then the urem is a no-op and the result is the same as the LHS range. Otherwise the lower bound is zero and the upper bound is min(LHSMax, RHSMax - 1). Differential Revision: https://reviews.llvm.org/D60952 llvm-svn: 359019	2019-04-23 18:00:17 +00:00
Nikita Popov	4a52397965	[ConstantRangeTest] Move helper methods; NFC Move Test(Unsigned\|Signed)BinOpExhaustive() towards the top of the file, so they're easier to reuse. llvm-svn: 359018	2019-04-23 18:00:02 +00:00
Stanislav Mekhanoshin	c464dddccb	[AMDGPU] Fixed addReg() in SIOptimizeExecMaskingPreRA.cpp The second argument is flags, not subreg. Differential Revision: https://reviews.llvm.org/D61031 llvm-svn: 359017	2019-04-23 17:59:26 +00:00
Jessica Paquette	df5ce782ad	[AArch64][GlobalISel] Legalize G_FMA for more vector types Same as G_FCEIL, G_FABS, etc. Just move it into that rule. Add a legalizer test for G_FMA, which we didn't have before and update arm64-vfloatintrinsics.ll. llvm-svn: 359015	2019-04-23 17:37:56 +00:00
Alina Sbirlea	a809e8e5e7	[AliasAnalysis] AAResults preserves AAManager. Summary: AAResults should not invalidate AAManager. Update tests. Reviewers: chandlerc Subscribers: mehdi_amini, jlebar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60914 llvm-svn: 359014	2019-04-23 17:21:18 +00:00
Jessica Paquette	e50e6d2563	[AArch64][GlobalISel] Add G_FMA to isPreISelGenericFloatingPointOpcode Noticed an unnecessary fallback in arm64-vmul caused by this. Also add a regbankselect test for G_FMA. llvm-svn: 359013	2019-04-23 17:17:06 +00:00
Joel E. Denny	3234887fe2	[APSInt][OpenMP] Fix isNegative, etc. for unsigned types Without this patch, APSInt inherits APInt::isNegative, which merely checks the sign bit without regard to whether the type is actually signed. isNonNegative and isStrictlyPositive call isNegative and so are also affected. This patch adjusts APSInt to override isNegative, isNonNegative, and isStrictlyPositive with implementations that consider whether the type is signed. A large set of Clang OpenMP tests are affected. Without this patch, these tests assume that `true` is not a valid argument for clauses like `collapse`. Indeed, `true` fails APInt::isStrictlyPositive but not APSInt::isStrictlyPositive. This patch adjusts those tests to assume `true` should be accepted. This patch also adds tests revealing various other similar fixes due to APSInt::isNegative calls in Clang's ExprConstant.cpp and SemaExpr.cpp: `++` and `--` overflow in `constexpr`, evaluated object size based on `alloc_size`, `<<` and `>>` shift count validation, and OpenMP array section validation. Reviewed By: lebedev.ri, ABataev, hfinkel Differential Revision: https://reviews.llvm.org/D59712 llvm-svn: 359012	2019-04-23 17:04:15 +00:00
Adrian Prantl	2351d6102f	[dsymutil] Put Swift interface files into a per-arch subdirectory. This was meant to be part of the original commit r358921, but somehow got lost. <rdar://problem/49751748> llvm-svn: 359010	2019-04-23 16:42:35 +00:00
Sanjay Patel	7c0bd5a27c	[x86] fix test checks for fdiv combine; NFC Must have picked up some transient code changes when originally generating this. llvm-svn: 359008	2019-04-23 16:31:30 +00:00
Nico Weber	e8f21b1a6b	llvm-undname: Support demangling the spaceship operator Also add a test for demanling the co_await operator. llvm-svn: 359007	2019-04-23 16:20:27 +00:00
Sanjay Patel	171b74e31c	[x86] add tests for vector fdiv with splat divisor; NFC llvm-svn: 359006	2019-04-23 16:16:16 +00:00
Adrian Prantl	53bd7ce42e	[dsymutil] Fix use-after-free when sys::path::append grows the buffer. <rdar://problem/50117620> llvm-svn: 359003	2019-04-23 15:44:22 +00:00
Adrian Prantl	c7bde29cfe	Revert "[dsymutil] Fix use-after-free when sys::path::append grows the buffer." llvm-svn: 359002	2019-04-23 15:44:19 +00:00
Adrian Prantl	03e906d9d5	[dsymutil] Fix use-after-free when sys::path::append grows the buffer. <rdar://problem/50117620> llvm-svn: 359001	2019-04-23 15:39:13 +00:00
Philip Reames	2ce017026a	[InstCombine] Convert a masked.load of a dereferenceable address to an unconditional load If we have a masked.load from a location we know to be dereferenceable, we can simply issue a speculative unconditional load against that address. The key advantage is that it produces IR which is well understood by the optimizer. The select (cnd, load, passthrough) form produced should be pattern matchable back to hardware predication if profitable. Differential Revision: https://reviews.llvm.org/D59703 llvm-svn: 359000	2019-04-23 15:25:14 +00:00
Sanjay Patel	12a561fa1b	[x86] use psubus for more vsetcc lowering (PR39859) Circling back to a leftover bit from PR39859: https://bugs.llvm.org/show_bug.cgi?id=39859#c1 ...we have this counter-intuitive (based on the test diffs) opportunity to use 'psubus'. This appears to be the better perf option for both Haswell and Jaguar based on llvm-mca. We already do this transform for the SETULT predicate, so this makes the code more symmetrical too. If we have pminub/pminuw, we prefer those, so this should not affect anything but pre-SSE4.1 subtargets. $ cat before.s movdqa -16(%rip), %xmm2 ## xmm2 = [32768,32768,32768,32768,32768,32768,32768,32768] pxor %xmm0, %xmm2 pcmpgtw -32(%rip), %xmm2 ## xmm2 = [255,255,255,255,255,255,255,255] pand %xmm2, %xmm0 pandn %xmm1, %xmm2 por %xmm2, %xmm0 $ cat after.s movdqa -16(%rip), %xmm2 ## xmm2 = [256,256,256,256,256,256,256,256] psubusw %xmm0, %xmm2 pxor %xmm3, %xmm3 pcmpeqw %xmm2, %xmm3 pand %xmm3, %xmm0 pandn %xmm1, %xmm3 por %xmm3, %xmm0 $ llvm-mca before.s -mcpu=haswell Iterations: 100 Instructions: 600 Total Cycles: 909 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 0.77 IPC: 0.66 Block RThroughput: 1.8 $ llvm-mca after.s -mcpu=haswell Iterations: 100 Instructions: 700 Total Cycles: 409 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 1.71 IPC: 1.71 Block RThroughput: 1.8 Differential Revision: https://reviews.llvm.org/D60838 llvm-svn: 358999	2019-04-23 15:20:17 +00:00
Joerg Sonnenberger	6e7cc49d5c	[SPARC] Use the correct register set for the "r" asm constraint. 64bit mode must use 64bit registers, otherwise assumptions about the top half of the registers are made. Problem found by Takeshi Nakayama in NetBSD. llvm-svn: 358998	2019-04-23 15:15:33 +00:00
David Blaikie	a2470a4653	Revert "DebugInfo: Emit only one kind of accelerated access/name table" Regresses some apple_names situations - still investigating. This reverts commit r358931. llvm-svn: 358997	2019-04-23 15:03:24 +00:00
Fangrui Song	efd94c56ba	Use llvm::stable_sort While touching the code, simplify if feasible. llvm-svn: 358996	2019-04-23 14:51:27 +00:00
Lewis Revill	df3cb477a3	[RISCV] Support assembling %tls_{ie,gd}_pcrel_hi modifiers This patch adds support for parsing and assembling the %tls_ie_pcrel_hi and %tls_gd_pcrel_hi modifiers. Differential Revision: https://reviews.llvm.org/D55342 llvm-svn: 358994	2019-04-23 14:46:13 +00:00
Nico Weber	9fc422830a	gn build: Merge r358944 llvm-svn: 358993	2019-04-23 14:32:18 +00:00
Scott Linder	3eed961973	[AMDGPU] Fix hidden argument metadata duplication for V3 Essentially complete a proper rebase of the V3 metadata change over https://reviews.llvm.org/D49096. Minimize the diff between the V2 and V3 variants of the relevant lit tests, and clean up some trailing whitespace. llvm-svn: 358992	2019-04-23 14:31:17 +00:00
Nico Weber	d7a748a71b	gn build: Merge r358949 llvm-svn: 358991	2019-04-23 14:31:15 +00:00
Simon Pilgrim	0e4992ce27	[X86] Pull out collectConcatOps helper. NFCI. Create collectConcatOps helper that returns all the subvector ops for CONCAT_VECTORS or a INSERT_SUBVECTOR series. llvm-svn: 358989	2019-04-23 14:07:49 +00:00
Tim Northover	6af366be8a	ARM: disallow add/sub to sp unless Rn is also sp. The manual says that Thumb2 add/sub instructions are only allowed to modify sp if the first source is also sp. This is slightly different from the usual rGPR restriction since it's context-sensitive, so implement it in C++. llvm-svn: 358987	2019-04-23 13:50:13 +00:00
Roman Lebedev	a6be919c92	[Docs] ReleaseNotes: fixup markup in memcmp()->bcmp() entry llvm-svn: 358986	2019-04-23 13:46:18 +00:00
Sanjay Patel	06ff5eae5b	[DAGCombiner] generalize binop-of-splats scalarization If we only match build vectors, we can miss some patterns that use shuffles as seen in the affected tests. Note that the underlying calls within getSplatSourceVector() have the potential for compile-time explosion because of exponential recursion looking through binop opcodes, but currently the list of supported opcodes is very limited. Both of those problems should be addressed in follow-up patches. llvm-svn: 358984	2019-04-23 13:16:41 +00:00
Nicolai Haehnle	7edae4c403	AMDGPU: Fix LCSSA phi lowering in SILowerI1Copies Summary: When an LCSSA phi survives through instruction selection, the pass ends up removing that phi entirely because it is dominated by the logic that does the lanemask merging. This then used to trigger an assertion when processing a dependent phi instruction. Change-Id: Id4949719f8298062fe476a25718acccc109113b6 Reviewers: llvm-commits Subscribers: kzhuravl, jvesely, wdng, yaxunl, t-tye, tpr, dstuttard, rtaylor, arsenm Tags: #llvm Differential Revision: https://reviews.llvm.org/D60999 llvm-svn: 358983	2019-04-23 13:12:52 +00:00
Fedor Sergeev	652168a99b	[CallSite removal] move InlineCost to CallBase usage Converting InlineCost interface and its internals into CallBase usage. Inliners themselves are still not converted. Reviewed By: reames Tags: #llvm Differential Revision: https://reviews.llvm.org/D60636 llvm-svn: 358982	2019-04-23 12:43:27 +00:00
Aaron Ballman	61ef9193aa	Removing the explicit specifier from some default constructors; NFC. llvm-svn: 358978	2019-04-23 12:16:28 +00:00
David Green	c519d3c403	[ARM] Update check for CBZ in Ifcvt The check for creating CBZ in constant island pass recently obtained the ability to search backwards to find a Cmp instruction. The code in IfCvt should mirror this to allow more conversions to the smaller form. The common code has been pulled out into a separate function to be shared between the two places. Differential Revision: https://reviews.llvm.org/D60090 llvm-svn: 358977	2019-04-23 12:11:26 +00:00
David Green	2f9eed6265	[ARM] Don't replicate instructions in Ifcvt at minsize Ifcvt can replicate instructions as it converts them to be predicated. This stops that from happening on thumb2 targets at minsize where an extra IT instruction is likely needed. Differential Revision: https://reviews.llvm.org/D60089 llvm-svn: 358974	2019-04-23 11:46:58 +00:00
Simon Pilgrim	ddd225d1a9	Fix MSVC "32-bit shift implicitly converted to 64 bits" warning. NFCI. llvm-svn: 358970	2019-04-23 11:16:16 +00:00
Simon Pilgrim	e7a68fd93e	Fix MSVC "32-bit shift implicitly converted to 64 bits" warning. NFCI. llvm-svn: 358969	2019-04-23 11:11:34 +00:00
Bjorn Pettersson	f97b29be88	[DAGCombiner] Combine OR as ADD when no common bits are set Summary: The DAGCombiner is rewriting (canonicalizing) an ISD::ADD with no common bits set in the operands as an ISD::OR node. This could sometimes result in "missing out" on some combines that normally are performed for ADD. To be more specific this could happen if we already have rewritten an ADD into OR, and later (after legalizations or combines) we expose patterns that could have been optimized if we had seen the OR as an ADD (e.g. reassociations based on ADD). To make the DAG combiner less sensitive to if ADD or OR is used for these "no common bits set" ADD/OR operations we now apply most of the ADD combines also to an OR operation, when value tracking indicates that the operands have no common bits set. Reviewers: spatel, RKSimon, craig.topper, kparzysz Reviewed By: spatel Subscribers: arsenm, rampitec, lebedev.ri, jvesely, nhaehnle, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59758 llvm-svn: 358965	2019-04-23 10:01:08 +00:00
Javed Absar	1cdc3dbc58	[AArch64] Add support for MTE intrinsics This patch provides intrinsics support for Memory Tagging Extension (MTE), which was introduced with the Armv8.5-a architecture. The intrinsics are described in detail in the latest ACLE Q1 2019 documentation: https://developer.arm.com/docs/101028/latest Reviewed by: David Spickett Differential Revision: https://reviews.llvm.org/D60486 llvm-svn: 358963	2019-04-23 09:39:58 +00:00
Diogo N. Sampaio	2619f399f9	[ARM][FIX] Add missing f16.lane.vldN/vstN lowering Summary: Add missing D and Q lane VLDSTLane lowering for fp16 elements. Reviewers: efriedma, kosarev, SjoerdMeijer, ostannard Reviewed By: efriedma Subscribers: javed.absar, kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60874 llvm-svn: 358962	2019-04-23 09:36:39 +00:00
George Rimar	b9ed9cb5d7	[llvm-mc] - Properly set the the address align field of the compressed sections. About the compressed sections spec says: (https://docs.oracle.com/cd/E37838_01/html/E36783/section_compression.html) sh_addralign fields of the section header for a compressed section reflect the requirements of the compressed section. Currently, llvm-mc always puts uncompressed section alignment to sh_addralign. It is not correct. zlib styled section contains an Elfxx_Chdr header, so we should either use 4 or 8 values depending on the target (Uncompressed section alignment is stored in ch_addralign field of the compression header). GNU assembler version 2.31.1 also has this issue, but in 2.32.51 it was already fixed. This is how it was found during debugging of the https://bugs.llvm.org/show_bug.cgi?id=40482 actually. Differential revision: https://reviews.llvm.org/D60965 llvm-svn: 358960	2019-04-23 09:16:53 +00:00
David Green	63a2aa715a	[LSR] Limit the recursion for setup cost In some circumstances we can end up with setup costs that are very complex to compute, even though the scevs are not very complex to create. This can also lead to setupcosts that are calculated to be exactly -1, which LSR treats as an invalid cost. This patch puts a limit on the recursion depth for setup cost to prevent them taking too long. Thanks to @reames for the report and test case. Differential Revision: https://reviews.llvm.org/D60944 llvm-svn: 358958	2019-04-23 08:52:21 +00:00
Sam Clegg	9da81421b8	[WebAssembly] Bail out of fastisel earlier when computing PIC addresses This change partially reverts https://reviews.llvm.org/D54647 in favor of bailing out during computeAddress instead. This catches the condition earlier and handles more cases. Differential Revision: https://reviews.llvm.org/D60986 llvm-svn: 358948	2019-04-23 03:43:26 +00:00
Qiu Chaofan	ab66e34c63	add Qiu Chaofan (qiucf@cn.ibm.com) to the CREDITS.txt llvm-svn: 358942	2019-04-23 02:37:48 +00:00
Chandler Carruth	bbddf21f90	Revert "Use const DebugLoc&" This reverts r358910 (git commit `2b74466530`) While this patch seems trivial and safe and correct, it is not. The copies are actually load bearing copies. You can observe this with MSan or other ways of checking for use-after-destroy, but otherwise this may result in ... difficult to debug inexplicable behavior. I suspect the issue is that the debug location is used after the original reference to it is removed. The metadata backing it gets destroyed as its last references goes away, and then we reference it later through these const references. llvm-svn: 358940	2019-04-23 01:42:07 +00:00
Petr Hosek	fbcce9fe9d	[CMake] Replace the sanitizer support in runtimes build with multilib This is a more generic solution; while the sanitizer support can be used only for sanitizer instrumented builds, the multilib support can be used to build other variants such as noexcept which is what we would like to use in Fuchsia. The name CMake target name uses the target name, same as for the regular runtimes build and the name of the multilib, concatenated with '+'. The libraries are installed in a subdirectory named after the multilib. Differential Revision: https://reviews.llvm.org/D60926 llvm-svn: 358935	2019-04-22 23:31:39 +00:00
Adrian Prantl	05aad1567b	Fully qualify llvm::Optional, some compilers complain otherwise. llvm-svn: 358933	2019-04-22 22:51:34 +00:00
David Blaikie	68602ab2f3	DebugInfo: Emit only one kind of accelerated access/name table Currently to opt in to debug_names in DWARFv5, the IR must contain 'nameTableKind: Default' which also enables debug_pubnames. Instead, only allow one of {debug_names, apple_names, debug_pubnames, debug_gnu_pubnames}. nameTableKind: Default gives debug_names in DWARFv5 and greater, debug_pubnames in v4 and earlier - and apple_names when tuning for lldb on MachO. nameTableKind: GNU always gives gnu_pubnames llvm-svn: 358931	2019-04-22 22:45:11 +00:00
Sanjay Patel	bf8aacb715	[SelectionDAG] move splat util functions up from x86 lowering This was supposed to be NFC, but the change in SDLoc definitions causes instruction scheduling changes. There's nothing x86-specific in this code, and it can likely be used from DAGCombiner's simplifyVBinOp(). llvm-svn: 358930	2019-04-22 22:43:36 +00:00
Adrian Prantl	e495ec23fe	Try to work around compile errors with older versions of GCC. llvm-svn: 358927	2019-04-22 22:40:37 +00:00
Douglas Yung	16df7e086b	Relax test to check for a valid number instead of a specific number. llvm-svn: 358926	2019-04-22 22:31:57 +00:00
Michael Liao	389d5a3474	[AMDGPU] Fix an issue in `op_sel_hi` skipping. Summary: - Only apply packed literal `op_sel_hi` skipping on operands requiring packed literals. Even an instruction is `packed`, it may have operand requiring non-packed literal, such as `v_dot2_f32_f16`. Reviewers: rampitec, arsenm, kzhuravl Subscribers: jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60978 llvm-svn: 358922	2019-04-22 22:05:49 +00:00
Adrian Prantl	0d809aa218	[dsymutil] Collect parseable Swift interfaces in the .dSYM bundle. When a Swift module built with debug info imports a library without debug info from a textual interface, the textual interface is necessary to reconstruct types defined in the library's interface. By recording the Swift interface files in DWARF dsymutil can collect them and LLDB can find them. This patch teaches dsymutil to look for DW_TAG_imported_modules and records all references to parseable Swift ingterfrace files and copies them to a.out.dSYM/Contents/Resources/<Arch>/<ModuleName>.swiftinterface <rdar://problem/49751748> llvm-svn: 358921	2019-04-22 21:33:22 +00:00
Philip Reames	d748689c7f	[InstCombine] Eliminate stores to constant memory If we have a store to a piece of memory which is known constant, then we know the store must be storing back the same value. As a result, the store (or memset, or memmove) must either be down a dead path, or a noop. In either case, it is valid to simply remove the store. The motivating case for this involves a memmove to a buffer which is constant down a path which is dynamically dead. Note that I'm choosing to implement the less aggressive of two possible semantics here. We could simply say that the store is undefined, and prune the path. Consensus in the review was that the more aggressive form might be a good follow on change at a later date. Differential Revision: https://reviews.llvm.org/D60659 llvm-svn: 358919	2019-04-22 20:28:19 +00:00
Bob Haarman	2eea99a4b9	[Support] unflake TempFileCollisions test Summary: This test was added to verify that createUniqueEntity() does not enter an infinite loop when all possible names are taken. However, it also checked that all possible names are generated, which is flaky (because the names are generated randomly). This change increases the number of attempts we make to make flakes exceedingly unlikely (3.88e-62). Reviewers: fedor.sergeev, rsmith Reviewed By: fedor.sergeev Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D56336 llvm-svn: 358914	2019-04-22 19:46:25 +00:00
Philip Reames	d8d9b7b20e	[InstSimplify] Move masked.gather w/no active lanes handling to InstSimplify from InstCombine In the process, use the existing masked.load combine which is slightly stronger, and handles a mix of zero and undef elements in the mask. llvm-svn: 358913	2019-04-22 19:30:01 +00:00
Nico Weber	5828421c7c	gn build: Merge r358869 llvm-svn: 358912	2019-04-22 19:25:40 +00:00
Matt Arsenault	2b74466530	Use const DebugLoc& llvm-svn: 358910	2019-04-22 19:14:27 +00:00
Matt Arsenault	f84ce75cd1	AMDGPU: Skip debug instructions in assert These are inserted after branch relaxation, and for some reason it's decided to put them in the long branch expansion block. It's probably not great to rely on the source block address, so this should probably be switched to being PC relative instead of relying on the block address llvm-svn: 358909	2019-04-22 19:14:26 +00:00
Philip Reames	f01583d097	[Tests] Revise a test as requested by reviewer in D59703 llvm-svn: 358907	2019-04-22 18:51:58 +00:00
Philip Reames	8f47089034	[Tests] Add a negative test for masked.gather part of D59703 llvm-svn: 358906	2019-04-22 18:28:44 +00:00
Justin Bogner	e90d5c8db0	[IPSCCP] Add missing `AssumptionCacheTracker` dependency Back in August, r340525 introduced a dependency on the assumption cache tracker in the ipsccp pass, but that commit missed a call to INITIALIZE_PASS_DEPENDENCY, which leaves the assumption cache improperly registered if SCCP is the only thing that pulls it in. llvm-svn: 358903	2019-04-22 17:38:29 +00:00
Philip Reames	37104d7189	[LPM/BPI] Preserve BPI through trivial loop pass pipeline (e.g. LCSSA, LoopSimplify) Currently, we do not expose BPI to loop passes at all. In the old pass manager, we appear to have been ignoring the fact that LCSSA and/or LoopSimplify didn't preserve BPI, and making it available to the following loop passes anyways. In the new one, it's invalidated before running any loop pass if either LCSSA or LoopSimplify actually make changes. If they don't make changes, then BPI is valid and available. So, we go ahead and teach LCSSA and LoopSimplify how to preserve BPI for consistency between old and new pass managers. This patch avoids an invalidation between the two requires in the following trivial pass pipeline: opt -passes="requires<branch-prob>,loop(no-op-loop),requires<branch-prob>" (when the input file is one which requires either LCSSA or LoopSimplify to canonicalize the loops) Differential Revision: https://reviews.llvm.org/D60790 llvm-svn: 358901	2019-04-22 17:13:43 +00:00
Wei Mi	01f8d556aa	[PGO/SamplePGO][NFC] Move the function updateProfWeight from Instruction to CallInst. The issue was raised here: https://reviews.llvm.org/D60903#1472783 The function Instruction::updateProfWeight is only used for CallInst in profile update. From the current interface, it is very easy to think that the function can also be used for branch instruction. However, Branch instruction does't need the scaling the function provides for branch_weights and VP (value profile), in addition, scaling may introduce inaccuracy for branch probablity. The patch moves the function updateProfWeight from Instruction class to CallInst to remove the confusion. The patch also changes the scaling of branch_weights from a loop to a block because we know that ProfileData for branch_weights of CallInst will only have two operands at most. Differential Revision: https://reviews.llvm.org/D60911 llvm-svn: 358900	2019-04-22 17:04:51 +00:00
Fangrui Song	a5355a5ed1	Use llvm::stable_sort. NFC llvm-svn: 358897	2019-04-22 15:53:43 +00:00
Aaron Ballman	f033617974	Remove spurious semicolons; NFC. llvm-svn: 358895	2019-04-22 15:31:09 +00:00
Matt Arsenault	2b6f76f05f	AMDGPU/GlobalISel: Fix non-power-of-2 G_EXTRACT sources llvm-svn: 358894	2019-04-22 15:22:46 +00:00
Fangrui Song	75fbd1c604	STLExtras: add stable_sort wrappers llvm-svn: 358893	2019-04-22 15:19:13 +00:00
Matt Arsenault	8f624abc1d	GlobalISel: Legalize scalar G_EXTRACT sources llvm-svn: 358892	2019-04-22 15:10:42 +00:00
Nico Weber	f5c7f3ad33	llvm-undname: Fix an assert-on-invalid, found by oss-fuzz llvm-svn: 358891	2019-04-22 15:05:18 +00:00
Matt Arsenault	70346d127b	AMDGPU: Fix not checking for copy when looking at copy src Effectively reverts r356956. The check for isFullCopy was excessive, but there still needs to be a check that this is a copy. llvm-svn: 358890	2019-04-22 14:54:39 +00:00
Dmitry Preobrazhensky	e2707f5aac	[AMDGPU][MC] Corrected parsing of SP3 'neg' modifier See bug 41156: https://bugs.llvm.org/show_bug.cgi?id=41156 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D60624 llvm-svn: 358888	2019-04-22 14:35:47 +00:00
Simon Pilgrim	6276ce0142	[TargetLowering][AMDGPU][X86] Improve SimplifyDemandedBits bitcast handling This patch adds support for BigBitWidth -> SmallBitWidth bitcasts, splitting the DemandedBits/Elts accordingly. The AMDGPU backend needed an extra (srl (and x, c1 << c2), c2) -> (and (srl(x, c2), c1) combine to encourage BFE creation, I investigated putting this in DAGCombine but it caused a lot of noise on other targets - some improvements, some regressions. The X86 changes are all definite wins. Differential Revision: https://reviews.llvm.org/D60462 llvm-svn: 358887	2019-04-22 14:04:35 +00:00
Sanjay Patel	9bc6c77220	[DAGCombiner] make variable name less ambiguous; NFC llvm-svn: 358886	2019-04-22 13:42:50 +00:00
Sanjay Patel	d6989daae9	[DAGCombiner] prepare shuffle-of-splat to handle more patterns; NFC llvm-svn: 358884	2019-04-22 13:36:07 +00:00
Robert Widmann	ff8febcb6d	[LLVM-C] Add accessors to the default floating-point metadata node Summary: Add a getter and setter pair for floating-point accuracy metadata. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60527 llvm-svn: 358883	2019-04-22 13:13:22 +00:00
Serguei Katkov	40a3b96196	[NewPM] Add Option handling for SimpleLoopUnswitch This patch enables passing options to SimpleLoopUnswitch via the passes pipeline. Reviewers: chandlerc, fedor.sergeev, leonardchan, philip.pfaffe Reviewed By: fedor.sergeev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D60676 llvm-svn: 358880	2019-04-22 10:35:07 +00:00
Simon Pilgrim	ffd67233d4	[AMDGPU] Regenerate uitofp i8 to float conversion tests. Prep work for D60462 llvm-svn: 358879	2019-04-22 10:19:09 +00:00
Serguei Katkov	5614f4a3a5	[NewPM] Add dummy Test for LoopVectorize option parsing. llvm-svn: 358878	2019-04-22 09:53:26 +00:00
Nikita Popov	5aacc7a573	Revert "[ConstantRange] Rename make{Guaranteed -> Exact}NoWrapRegion() NFC" This reverts commit 7bf4d7c07f2fac862ef34c82ad0fef6513452445. After thinking about this more, this isn't right, the range is not exact in the same sense as makeExactICmpRegion(). This needs a separate function. llvm-svn: 358876	2019-04-22 09:01:38 +00:00
Nikita Popov	5299e25f50	[ConstantRange] Rename make{Guaranteed -> Exact}NoWrapRegion() NFC Following D60632 makeGuaranteedNoWrapRegion() always returns an exact nowrap region. Rename the function accordingly. This is in line with the naming of makeExactICmpRegion(). llvm-svn: 358875	2019-04-22 08:36:05 +00:00
Craig Topper	5c43ab337f	[X86] Reject 512-bit types in getRegForInlineAsmConstraint when AVX512 is not enabled. Same for 256 bit and AVX. llvm-svn: 358872	2019-04-22 06:12:02 +00:00
Lang Hames	1233c15be5	[JITLink] Remove a lot of reduntant 'JITLink_' prefixes. NFC. llvm-svn: 358869	2019-04-22 03:03:09 +00:00
Fangrui Song	4ad27e65cd	[cmake] Add llvm-jit to LLVM_TEST_DEPENDS Otherwise llvm-jit would say "utils/lit/lit/llvm/subst.py:127: note: Did not find llvm-jitlink in ..." llvm-svn: 358867	2019-04-22 02:23:09 +00:00
Lang Hames	d3dac47aa2	[JITLink] Fix section start address calculation in eh-frame recorder. Section atoms are not sorted, so we need to scan the whole section to find the start address. No test case: Found by inspection, and any reproduction would depend on pointer ordering. llvm-svn: 358865	2019-04-22 01:35:16 +00:00
Nico Weber	405e62b805	Attemp get llvm-jitlink building on Windows By removing an include of dlfcn.h that looks unused. And clang-format a too-long line while here. llvm-svn: 358864	2019-04-21 23:50:24 +00:00
Lang Hames	bc76bbcaa0	[JITLink] Add an option to dump relocated section content. The -dump-relocated-section-content option will dump the contents of each section after relocations are applied, and before any checks are run or code executed. llvm-svn: 358863	2019-04-21 20:34:19 +00:00
Nico Weber	584e7486ab	gn build: Re-run `git ls-files '.gn' '.gni' \| xargs llvm/utils/gn/gn.py format` llvm-svn: 358862	2019-04-21 20:14:21 +00:00
Nico Weber	01efcc61ea	gn build: Merge r358749 Since the symlinks list for llvm-symbolizer is now never empty, the :symlinks target no longer needs an explicit dep on :llvm-symbolizer -- there will be at least one dep on a symlink, and each symlink depends on :llvm-symbolizer already. Since llvm-symbolizer:symlinks now produces symlinks that check-llvm uses, make llvm/test depend on the symlink target. llvm-svn: 358861	2019-04-21 20:08:45 +00:00
Nico Weber	684fe014df	gn build: Merge r358818 (JITLink) llvm-svn: 358860	2019-04-21 19:45:37 +00:00
Don Hinton	75a43a28f0	[cmake] Fix bug in r358779 - [CMake] Pass monorepo build settings in cross compile Escape semicolons in the targets list so that cmake doesn't expand them to spaces. llvm-svn: 358859	2019-04-21 19:20:13 +00:00
Nico Weber	ce67a41741	llvm-undname: Fix hex escapes in wchar_t, char16_t, char32_t strings llvm-undname used to put '\x' in front of every pair of nibbles, but u"\xD7\xFF" produces a string with 6 bytes: \xD7 \0 \xFF \0 (and \0\0). Correct for a single character (plus terminating \0) is u\xD7FF instead. Now, wchar_t, char16_t, and char32_t strings roundtrip from source to clang-cl (and cl.exe) and then llvm-undname. (...at least as long as it's not a string like L"\xD7FF" L"foo" which gets demangled as L"\xD7FFfoo", where the compiler then considers the "f" as part of the hex escape. That seems ok.) Also add a comment saying that the "almost-valid" char32_t string I added in my last commit is actually produced by compilers. llvm-svn: 358857	2019-04-21 17:19:27 +00:00
Nico Weber	8fc9902bbb	llvm-undname: Fix stack overflow on almost-valid If a unsigned with all 4 bytes non-0 was passed to outputHex(), there were two off-by-ones in it: - Both MaxPos and Pos left space for the final \0, which left the buffer one byte to small. Set MaxPos to 16 instead of 15 to fix. - The `assert(Pos >= 0);` was after a `Pos--`, move it up one line. Since valid Unicode codepoints are <= 0x10ffff, this could never really happen in practice. Found by oss-fuzz. llvm-svn: 358856	2019-04-21 16:58:25 +00:00
Nikita Popov	198ab60136	[ConstantRange] Add saturating add/sub methods Add support for uadd_sat and friends to ConstantRange, so we can handle uadd.sat and friends in LVI. The implementation is forwarding to the corresponding APInt methods with appropriate bounds. One thing worth pointing out here is that the handling of wrapping ranges is not maximally accurate. A simple example is that adding 0 to a wrapped range will return a full range, rather than the original wrapped range. The tests also only check that the non-wrapping envelope is correct and minimal. Differential Revision: https://reviews.llvm.org/D60946 llvm-svn: 358855	2019-04-21 15:23:05 +00:00
Nikita Popov	dbc3fbafe7	[ConstantRange] Add getNonEmpty() constructor ConstantRanges have an annoying special case: If upper and lower are the same, it can be either an empty or a full set. When constructing constant ranges nearly always a full set is intended, but this still requires an explicit check in many places. This revision adds a getNonEmpty() constructor that disambiguates this case: If upper and lower are the same, a full set is created. Differential Revision: https://reviews.llvm.org/D60947 llvm-svn: 358854	2019-04-21 15:22:54 +00:00
Sanjay Patel	7fa3a0eec9	[AArch64] add tests with multiple binop+splat vals; NFC See D60890 for context. llvm-svn: 358853	2019-04-21 15:01:19 +00:00
Nico Weber	aa162682ca	llvm-undname: Fix stack overflow on invalid found by oss-fuzz llvm-svn: 358852	2019-04-21 14:25:07 +00:00
Nico Weber	f985e31254	gn build: Fix build after r358837 llvm-svn: 358851	2019-04-21 14:07:13 +00:00
David Green	0d741507f7	[ARM] Rewrite isLegalT2AddressImmediate This does two main things, firstly adding some at least basic addressing modes for i64 types, and secondly treats floats and doubles sensibly when there is no fpu. The floating point change can help codesize in some cases, especially with D60294. Most backends seems to not consider the exact VT in isLegalAddressingMode, instead switching on type size. That is now what this does when the target does not have an fpu (as the float data will be loaded using LDR's). i64's currently use the address range of an LDRD (even though they may be legalised and loaded with an LDR). This is at least better than marking them all as illegal addressing modes. I have not attempted to do much with vectors yet. That will need changing once MVE is added. Differential Revision: https://reviews.llvm.org/D60677 llvm-svn: 358845	2019-04-21 09:54:29 +00:00
Craig Topper	df02beb416	[X86] Add the rounding control operand to the printing for some scalar FMA instructions. llvm-svn: 358844	2019-04-21 07:12:56 +00:00
Fangrui Song	a0f9c4f72c	[CachePruning] Simplify comparator llvm-svn: 358843	2019-04-21 06:17:40 +00:00
Fangrui Song	70961f17ef	[JITLink] Add dependency on MCParser to unit test after rL358818 This is required by -DBUILD_SHARED_LIBS=on builds for createMCAsmParser. llvm-svn: 358842	2019-04-21 06:12:00 +00:00
Craig Topper	63db7e347b	[X86] Don't form masked vfpclass instruction from and+vfpclass unless the fpclass only has a single use. llvm-svn: 358841	2019-04-21 05:18:04 +00:00
Lang Hames	a97032e947	[JITLink] Remove an overly strict error check in JITLink's eh-frame parser. The error check required FDEs to refer to the most recent CIE, but the eh-frame spec allows them to refer to any previously seen CIE. This patch removes the offending check. llvm-svn: 358840	2019-04-21 04:48:32 +00:00
Lang Hames	3ccd677bf8	[BinaryFormat] Fix bitfield-ordering of MachO::relocation_info on big-endian. Hopefully this will fix the JITLink regression test failures on big-endian testers (e.g. http://lab.llvm.org:8011/builders/clang-s390x-linux-lnt/builds/12702) llvm-svn: 358839	2019-04-21 03:14:43 +00:00
Lang Hames	0191531a76	[JITLink] Factor basic common GOT and stub creation code into its own class. llvm-svn: 358838	2019-04-21 03:14:42 +00:00
Petr Hosek	33b996408f	[gn] Move Features.inc to clangd, create a config for it ClangdLSPServer and clangd unittests now include Features.inc so we need to append the target_gen_dir that contains it to their include_dirs. To do so, we use a public config that's applied to any target that depends on the features one. Differential Revision: https://reviews.llvm.org/D60919 llvm-svn: 358837	2019-04-21 01:09:15 +00:00
Lang Hames	7fc7b368bd	[JITLink] Add dependencies on MCDissassembler and Target to unit test. llvm-svn: 358836	2019-04-21 00:35:58 +00:00
Nico Weber	8eeaf5178d	llvm-undname: Improve string literal demangling with embedded \0 chars - Don't assert when a string looks like a u32 string to the heuristic but doesn't have a length that's 0 mod 4. Instead, classify those as u16 with embedded \0 chars. Found by oss-fuzz. - Print embedded nul bytes as \0 instead of \x00. llvm-svn: 358835	2019-04-20 23:59:06 +00:00
Nico Weber	f2654b638d	ftime-trace: Trace the name of the currently active pass as well. Differential Revision: https://reviews.llvm.org/D60782 llvm-svn: 358834	2019-04-20 23:22:45 +00:00
Lang Hames	65e1ddd713	[JITLink] Add yet more detail to MachO/x86-64 unsupported relocation errors. Knowing the address/symbolnum field values makes it easier to identify the unsupported relocation, and provides enough information for the full bit pattern of the relocation to be reconstructed. llvm-svn: 358833	2019-04-20 22:59:43 +00:00
Lang Hames	5004abcd86	[JITLink][ORC] Add JITLink to the list of dependencies for ORC. The new ObjectLinkingLayer in ORC depends on JITLink. This should fix the build error at http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/9621 llvm-svn: 358832	2019-04-20 22:15:57 +00:00
Lang Hames	7f77a231fa	[JITLink] Fix a bad formatv format string. llvm-svn: 358831	2019-04-20 22:06:12 +00:00
Lang Hames	3474ba4f22	[JITLink] Disable MachO/x86-64 regression test if the X86 target is not built. llvm-svn: 358830	2019-04-20 21:32:49 +00:00
Amara Emerson	4286652556	Revert r358800. Breaks Obsequi from the test suite. The last attempt fixed gcc and consumer-typeset, but Obsequi seems to fail with a different issue. llvm-svn: 358829	2019-04-20 21:25:00 +00:00
Lang Hames	f0ffb2e4e8	[JITLink] Add llvm-jitlink to the list of available tools in lit. Should fix the 'llvm-jitlink command not found' errors that are appearing on some builders. llvm-svn: 358828	2019-04-20 20:05:30 +00:00
Lang Hames	daed9b10f1	[JITLink] Add BinaryFormat to JITLink's dependencies. Hopefully this will fix the missing dependence on llvm::identify_magic that is showing up on some PPC bots. E.g. http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/9617 llvm-svn: 358827	2019-04-20 19:48:45 +00:00
Lang Hames	c283fc5ebb	[JITLink] Add more detail to MachO/x86-64 "unsupported relocation" errors. The extra information here will be helpful in diagnosing errors, like the ones currently occuring on the PPC big-endian bots. :) llvm-svn: 358826	2019-04-20 18:50:13 +00:00
Lang Hames	bcdce5cd41	[JITLink] Add check to JITLink unit test to bail out for unsupported targets. This should prevent spurious JITLink unit test failures for builds that do not support the target(s) required by the tests. llvm-svn: 358825	2019-04-20 18:30:17 +00:00
Lang Hames	dfc3a4f6ff	[JITLink] Silence some MSVC implicit cast warnings. llvm-svn: 358824	2019-04-20 18:30:16 +00:00
Lang Hames	d9a7a7d3d0	[JITLink] Add llvm-jitlink subdirectory to tools/LLVMBuild.txt llvm-svn: 358823	2019-04-20 17:58:29 +00:00
Lang Hames	b39109585a	[JITLink] Use memset instead of bzero. llvm-svn: 358822	2019-04-20 17:49:58 +00:00
Lang Hames	3211b44751	[JITLink] Silence a narrowing conversion warning. llvm-svn: 358821	2019-04-20 17:37:09 +00:00
Lang Hames	f7c5e6c0ad	[JITLink] Update BuildingAJIT tutorials to account for API changes in r358818. DynamicLibrarySearchGenerator::GetForCurrentProcess now takes a char (the global prefix) rather than a DataLayout reference. llvm-svn: 358820	2019-04-20 17:35:28 +00:00
Lang Hames	68b0b8c192	[JITLink] Fix a missing header and bad prototype. llvm-svn: 358819	2019-04-20 17:29:57 +00:00
Lang Hames	11c8dfa583	Initial implementation of JITLink - A replacement for RuntimeDyld. Summary: JITLink is a jit-linker that performs the same high-level task as RuntimeDyld: it parses relocatable object files and makes their contents runnable in a target process. JITLink aims to improve on RuntimeDyld in several ways: (1) A clear design intended to maximize code-sharing while minimizing coupling. RuntimeDyld has been developed in an ad-hoc fashion for a number of years and this had led to intermingling of code for multiple architectures (e.g. in RuntimeDyldELF::processRelocationRef) in a way that makes the code more difficult to read, reason about, extend. JITLink is designed to isolate format and architecture specific code, while still sharing generic code. (2) Support for native code models. RuntimeDyld required the use of large code models (where calls to external functions are made indirectly via registers) for many of platforms due to its restrictive model for stub generation (one "stub" per symbol). JITLink allows arbitrary mutation of the atom graph, allowing both GOT and PLT atoms to be added naturally. (3) Native support for asynchronous linking. JITLink uses asynchronous calls for symbol resolution and finalization: these callbacks are passed a continuation function that they must call to complete the linker's work. This allows for cleaner interoperation with the new concurrent ORC JIT APIs, while still being easily implementable in synchronous style if asynchrony is not needed. To maximise sharing, the design has a hierarchy of common code: (1) Generic atom-graph data structure and algorithms (e.g. dead stripping and \| memory allocation) that are intended to be shared by all architectures. \| + -- (2) Shared per-format code that utilizes (1), e.g. Generic MachO to \| atom-graph parsing. \| + -- (3) Architecture specific code that uses (1) and (2). E.g. JITLinkerMachO_x86_64, which adds x86-64 specific relocation support to (2) to build and patch up the atom graph. To support asynchronous symbol resolution and finalization, the callbacks for these operations take continuations as arguments: using JITLinkAsyncLookupContinuation = std::function<void(Expected<AsyncLookupResult> LR)>; using JITLinkAsyncLookupFunction = std::function<void(const DenseSet<StringRef> &Symbols, JITLinkAsyncLookupContinuation LookupContinuation)>; using FinalizeContinuation = std::function<void(Error)>; virtual void finalizeAsync(FinalizeContinuation OnFinalize); In addition to its headline features, JITLink also makes other improvements: - Dead stripping support: symbols that are not used (e.g. redundant ODR definitions) are discarded, and take up no memory in the target process (In contrast, RuntimeDyld supported pointer equality for weak definitions, but the redundant definitions stayed resident in memory). - Improved exception handling support. JITLink provides a much more extensive eh-frame parser than RuntimeDyld, and is able to correctly fix up many eh-frame sections that RuntimeDyld currently (silently) fails on. - More extensive validation and error handling throughout. This initial patch supports linking MachO/x86-64 only. Work on support for other architectures and formats will happen in-tree. Differential Revision: https://reviews.llvm.org/D58704 llvm-svn: 358818	2019-04-20 17:10:34 +00:00
Craig Topper	3980d1ca6b	[X86] Disable argument copy elision for arguments passed via pointers Summary: If you pass two 1024 bit vectors in IR with AVX2 on Windows 64. Both vectors will be split in four 256 bit pieces. The four pieces of the first argument will be passed indirectly using 4 gprs. The second argument will get passed via pointers in memory. The PartOffsets stored for the second argument are all in terms of its original 1024 bit size. So the PartOffsets for each piece are 32 bytes apart. So if we consider it for copy elision we'll only load an 8 byte pointer, but we'll move the address 32 bytes. The stack object size we create for the first part is probably wrong too. This issue was encountered by ISPC. I'm working on getting a reduce test case, but wanted to go ahead and get feedback on the fix. Reviewers: rnk Reviewed By: rnk Subscribers: dbabokin, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D60801 llvm-svn: 358817	2019-04-20 15:26:44 +00:00
Luqman Aden	2993661cc0	[CorrelatedValuePropagation] Mark subs that we know not to wrap with nuw/nsw. Summary: Teach CorrelatedValuePropagation to also handle sub instructions in addition to add. Relatively simple since makeGuaranteedNoWrapRegion already understood sub instructions. Only subtle change is which range is passed as "Other" to that function, since sub isn't commutative. Note that CorrelatedValuePropagation::processAddSub is still hidden behind a default-off flag as IndVarSimplify hasn't yet been fixed to strip the added nsw/nuw flags and causes a miscompile. (PR31181) Reviewers: sanjoy, apilipenko, nikic Reviewed By: nikic Subscribers: hiraditya, jfb, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60036 llvm-svn: 358816	2019-04-20 13:14:18 +00:00
Fangrui Song	d3b2682351	[ExecutionDomainFix] Optimize a binary search insertion llvm-svn: 358815	2019-04-20 13:00:50 +00:00
Fangrui Song	dd0e833555	[llvm-symbolizer] Fix section index at the end of a section This is very minor issue. The returned section index is only used by DWARFDebugLine as an llvm::upper_bound input and the use case shouldn't cause any behavioral change. llvm-svn: 358814	2019-04-20 13:00:09 +00:00
Nikita Popov	d89de3f7f4	[IndVarSimplify] Generate full checks for some LFTR tests; NFC llvm-svn: 358813	2019-04-20 12:05:53 +00:00
Nikita Popov	aa0c5a022f	[IndVarSimplify] Add tests for PR31181; NFC llvm-svn: 358812	2019-04-20 12:05:43 +00:00
Sam McCall	51389aad98	[ADT] Avoid warning in bsearch testcase llvm-svn: 358811	2019-04-20 11:48:11 +00:00
Fangrui Song	b48e41be96	[llvm-objdump] Fix End in disassemblyObject after rL358806 llvm-svn: 358809	2019-04-20 07:48:41 +00:00
Nikita Popov	2e33f8de57	[CVP] Add tests for sub nowrap inference; NFC These are baseline tests for D60036. Patch by Luqman Aden. llvm-svn: 358808	2019-04-20 07:43:15 +00:00
Nikita Popov	b75c8fc6fb	[X86] Fix stack probing on x32 (PR41477) Fix for https://bugs.llvm.org/show_bug.cgi?id=41477. On the x32 ABI with stack probing a dynamic alloca will result in a WIN_ALLOCA_32 with a 32-bit size. The current implementation tries to copy it into RAX, resulting in a physreg copy error. Fix this by copying to EAX instead. Also fix incorrect opcodes or registers used in subs. llvm-svn: 358807	2019-04-20 07:25:46 +00:00
Fangrui Song	ce12ea8dfc	[llvm-objdump] Don't disassemble symbols before SectionAddr This was caught by UBSAN tools/llvm-objdump/X86/macho-disassembly-g-dsym.test tools/llvm-objdump/X86/hex-displacement.test llvm-svn: 358806	2019-04-20 07:19:24 +00:00
Craig Topper	4d4b5d952e	[X86] Don't turn (and (shl X, C1), C2) into (shl (and X, (C1 >> C2), C2) if the original AND can represented by MOVZX. The MOVZX doesn't require an immediate to be encoded at all. Though it does use a 2 byte opcode so its the same size as a 1 byte immediate. But it has a separate source and dest register so can help avoid copies. llvm-svn: 358805	2019-04-20 04:38:53 +00:00
Craig Topper	8b8264828c	[X86] Turn (and (anyextend (shl X, C1), C2)) into (shl (and (anyextend X), (C1 >> C2), C2) if the AND could match a movzx. There's one slight regression in here because we don't check that the immediate already allowed movzx before the shift. I'll fix that next. llvm-svn: 358804	2019-04-20 04:38:49 +00:00
Fangrui Song	8f28f7a488	[llvm-objdump] Simplify --{start,stop}-address llvm-svn: 358803	2019-04-20 02:10:48 +00:00
Sam Clegg	fe8aabf9d9	[WebAssembly] Object: Improve error messages on invalid section Also add a test. Differential Revision: https://reviews.llvm.org/D60836 llvm-svn: 358801	2019-04-20 00:11:46 +00:00
Amara Emerson	eac69e9377	Revert "Revert "[GlobalISel] Add legalization support for non-power-2 loads and stores"" We were shifting the wrong component of a split load when trying to combine them back into a single value. llvm-svn: 358800	2019-04-19 23:54:44 +00:00
Jessica Paquette	d5c69e0836	[GlobalISel][AArch64] Legalize + select G_FRINT Exactly the same as G_FCEIL, G_FABS, etc. Add tests for the fp16/nofp16 behaviour, update arm64-vfloatintrinsics, etc. Differential Revision: https://reviews.llvm.org/D60895 llvm-svn: 358799	2019-04-19 23:41:52 +00:00
Sam Clegg	a27252794e	[WebAssembly] FastISel: Don't fallback to SelectionDAG after BuildMI in selectCall My understanding is that once BuildMI has been called we can't fallback to SelectionDAG. This change moves the fallback for when getRegForValue() fails for that target of an indirect call. This was failing in -fPIC mode when the callee is GlobalValue. Add a test case that tickles this. Differential Revision: https://reviews.llvm.org/D60908 llvm-svn: 358793	2019-04-19 22:43:32 +00:00
Vedant Kumar	282b26ec4d	[GVN+LICM] Use line 0 locations for better crash attribution This is a follow-up to r291037+r291258, which used null debug locations to prevent jumpy line tables. Using line 0 locations achieves the same effect, but works better for crash attribution because it preserves the right inline scope. Differential Revision: https://reviews.llvm.org/D60913 llvm-svn: 358791	2019-04-19 22:36:40 +00:00
Vitaly Buka	a9c2ba3fff	Update GN files to build with r358103 llvm-svn: 358790	2019-04-19 22:27:50 +00:00
Eric Christopher	dfebd84eb3	Remove the EnableEarlyCSEMemSSA set of options from the legacy and new pass managers. They were default to true and not being used. Differential Revision: https://reviews.llvm.org/D60747 llvm-svn: 358789	2019-04-19 22:18:53 +00:00
Eli Friedman	1810339bc3	[AArch64] Fix checks for AArch64MCExpr::VK_SABS flag. VK_SABS is part of the SymLoc bitfield in the variant kind which should be compared for equality, not by checking the VK_SABS bit. As far as I know, the existing code happened to produce the correct results in all cases, so this is just a cleanup. Patch by Stephen Crane. Differential Revision: https://reviews.llvm.org/D60596 llvm-svn: 358788	2019-04-19 21:58:10 +00:00
Jessica Paquette	ad69af3e95	[GlobalISel] Add IRTranslator support for G_FRINT Add it as a simple intrinsic, update arm64-irtranslator.ll. Differential Revision: https://reviews.llvm.org/D60893 llvm-svn: 358787	2019-04-19 21:46:12 +00:00
Amy Huang	d07d6d6177	Attempt to fix buildbot failure in commit 1bb57bac959ac163fd7d8a76d734ca3e0ecee6ab. llvm-svn: 358786	2019-04-19 21:44:30 +00:00
Jessica Paquette	627e8f8cb3	[GlobalISel] Add a G_FRINT opcode Equivalent to SelectionDAG's frint node. Differential Revision: https://reviews.llvm.org/D60891 llvm-svn: 358785	2019-04-19 21:44:16 +00:00
Craig Topper	f919a2ece4	[X86] Add test case for D60801. NFC llvm-svn: 358784	2019-04-19 21:39:16 +00:00
Amy Huang	c774f687b6	[MS] Emit S_HEAPALLOCSITE debug info Summary: This emits labels around heapallocsite calls and S_HEAPALLOCSITE debug info in codeview. Currently only changes FastISel, so emitting labels still needs to be implemented in SelectionDAG. Reviewers: hans, rnk Subscribers: aprantl, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D60800 llvm-svn: 358783	2019-04-19 21:09:11 +00:00
Chris Bieneman	36228cb63f	[CMake] Pass monorepo build settings in cross compile This allows the cross compiled build targets to configure the LLVM tools and sub-projects that are part of the main build. This is needed for generating native non llvm *-tablegen tools when cross compiling clang in the monorepo build environment. llvm-svn: 358779	2019-04-19 20:08:55 +00:00
Petr Hosek	45fc90326a	[gn] Support dots in CMake paths in the sync script Some file paths use dots to pick up sources from parent directories. Differential Revision: https://reviews.llvm.org/D60734 llvm-svn: 358774	2019-04-19 18:29:17 +00:00
Alina Sbirlea	43709f7233	[LICM & MemorySSA] Make limit flags pass tuning options. Summary: Make the flags in LICM + MemorySSA tuning options in the old and new pass managers. Subscribers: mehdi_amini, jlebar, Prazek, george.burgess.iv, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60490 llvm-svn: 358772	2019-04-19 17:46:50 +00:00
Amara Emerson	36c5baef49	Revert "[GlobalISel] Add legalization support for non-power-2 loads and stores" This introduces some runtime failures which I'll need to investigate further. llvm-svn: 358771	2019-04-19 17:42:13 +00:00
Jessica Paquette	dfd87f6fa1	[GlobalISel][AArch64] Legalize vector G_FPOW This instruction is legalized in the same way as G_FSIN, G_FCOS, G_FLOG10, etc. Update legalize-pow.mir and arm64-vfloatintrinsics.ll to reflect the change. Differential Revision: https://reviews.llvm.org/D60218 llvm-svn: 358764	2019-04-19 16:28:08 +00:00
Alina Sbirlea	0499a2f961	[NewPassManager] Adding pass tuning options: loop vectorize. Summary: Trying to add the plumbing necessary to add tuning options to the new pass manager. Testing with the flags for loop vectorize. Reviewers: chandlerc Subscribers: sanjoy, mehdi_amini, jlebar, steven_wu, dexonsmith, dang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59723 llvm-svn: 358763	2019-04-19 16:11:59 +00:00
Fangrui Song	51873d3150	[dsymutil] DwarfLinker: delete unused parameter llvm-svn: 358762	2019-04-19 15:45:25 +00:00
Sanjay Patel	e197c617a6	[SelectionDAG] soften splat mask assert/unreachable (PR41535) These are general queries, so they should not die when given a degenerate input like an all undef mask. Callers should be able to deal with an op that will eventually be simplified away. llvm-svn: 358761	2019-04-19 15:31:11 +00:00
Nico Weber	e145a540cc	llvm-undname: Attempt to fix leak-on-invalid found by oss-fuzz llvm-svn: 358760	2019-04-19 14:13:11 +00:00
Nico Weber	e579800b84	gn build: Merge r358722 llvm-svn: 358755	2019-04-19 13:18:41 +00:00
Nico Weber	ea622ef43e	gn build: Merge r358691 llvm-svn: 358754	2019-04-19 13:16:26 +00:00
Florian Hahn	b340497f76	[LTO] Add plumbing to save stats during LTO on Darwin. Gold and ld on Linux already support saving stats, but the infrastructure is missing on Darwin. Unfortunately it seems like the configuration from lib/LTO/LTO.cpp is not used. This patch adds a new LTOStatsFile option and adds plumbing in Clang to use it on Darwin, similar to the way remarks are handled. Currnetly the handling of LTO flags seems quite spread out, with a bunch of duplication. But I am not sure if there is an easy way to improve that? Reviewers: anemet, tejohnson, thegameg, steven_wu Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D60516 llvm-svn: 358753	2019-04-19 12:36:41 +00:00
Fangrui Song	a64fcb6da7	Change \r\n -> \n for llvm-symbolizer/help.test after rL358749 llvm-svn: 358751	2019-04-19 12:28:36 +00:00
Igor Kudrin	99f641ccad	[llvm-symbolizer] Add llvm-addr2line This adds an alias for llvm-symbolizer with different defaults so that it can be used as a drop-in replacement for GNU's addr2line. If a substring "addr2line" is found in the tool's name: * it defaults "-i", "-f" and "-C" to OFF; * it uses "--output-style=GNU" by default. Differential Revision: https://reviews.llvm.org/D60067 llvm-svn: 358749	2019-04-19 10:17:52 +00:00
Igor Kudrin	1b71b7f3b8	[llvm-symbolizer] Unhide and document the "-output-style" option With the latest changes, the option gets useful for users of llvm-symbolizer, not only for the upcoming llvm-addr2line. Differential Revision: https://reviews.llvm.org/D60816 llvm-svn: 358748	2019-04-19 10:14:18 +00:00
Igor Kudrin	4bc29cbf6b	[llvm-symbolizer] Make the output with -output-style=GNU closer to addr2line's This patch addresses two differences in the output of llvm-symbolizer and GNU's addr2line: * llvm-symbolizer prints an empty line after the report for an address. * With "-f -i=0", llvm-symbolizer replaces the name of an inlined function with the name from the symbol table, i. e., the top caller function in the inlining chain. addr2line preserves the name of the inlined function. Differential Revision: https://reviews.llvm.org/D60770 llvm-svn: 358747	2019-04-19 10:12:56 +00:00
Simon Pilgrim	4c09b7d921	[AMDGPU] Regenerate extractelt->truncate test. Prep work for D60462 llvm-svn: 358746	2019-04-19 09:49:04 +00:00
Bjorn Pettersson	238c9d6308	[CodeGen] Add "const" to MachineInstr::mayAlias Summary: The basic idea here is to make it possible to use MachineInstr::mayAlias also when the MachineInstr is const (or the "Other" MachineInstr is const). The addition of const in MachineInstr::mayAlias then rippled down to the need for adding const in several other places, such as TargetTransformInfo::getMemOperandWithOffset. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, MatzeB, arsenm, jvesely, nhaehnle, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60856 llvm-svn: 358744	2019-04-19 09:08:38 +00:00
James Molloy	9ad4cb3de4	[PATCH] [MachineScheduler] Check pending instructions when an instruction is scheduled Pending instructions that may have been blocked from being available by the HazardRecognizer may no longer may not be blocked any more when an instruction is scheduled; pending instructions should be re-checked in this case. This is primarily aimed at VLIW targets with large parallelism and esoteric constraints. No testcase as no in-tree targets have this behavior. Differential revision: https://reviews.llvm.org/D60861 llvm-svn: 358743	2019-04-19 09:00:55 +00:00
Fangrui Song	7137b54a03	[MergeFunc] Delete unused FunctionNode::release() llvm-svn: 358742	2019-04-19 08:03:20 +00:00
Fangrui Song	884f557bb2	[MergeFunc] removeUsers: call remove() only on direct users removeUsers uses a work list to collect indirect users and call remove() on those functions. However it has a bug (`if (!Visited.insert(UU).second)`). Actually, we don't have to collect indirect users. After the merge of F and G, G's callers will be considered (added to Deferred). If G's callers can be merged, G's callers' callers will be considered. Update the test unnamed-addr-reprocessing.ll to make it clear we can still merge indirect callers. llvm-svn: 358741	2019-04-19 07:57:51 +00:00

... 3 4 5 6 7 ...

178141 Commits