llvm-project

Commit Graph

Author	SHA1	Message	Date
Diana Picus	116bbab4e4	[CodeGen] Rename MachineInstrBuilder::addOperand. NFC Rename from addOperand to just add, to match the other method that has been added to MachineInstrBuilder for adding more than just 1 operand. See https://reviews.llvm.org/D28057 for the whole discussion. Differential Revision: https://reviews.llvm.org/D28556 llvm-svn: 291891	2017-01-13 09:58:52 +00:00
Diana Picus	4f8c3e1882	[ARM] CodeGen: Remove AddDefaultPred. NFC. Replace all uses of AddDefaultPred with MachineInstrBuilder::add(predOps()). This makes the code building MachineInstrs more readable, because it allows us to write code like: MIB.addSomeOperand(blah) .add(predOps()) .addAnotherOperand(blahblah) instead of AddDefaultPred(MIB.addSomeOperand(blah)) .addAnotherOperand(blahblah) This commit also adds the predOps helper in the ARM backend, as well as the add method taking a variable number of operands to the MachineInstrBuilder. The transformation has been done mostly automatically with a custom tool based on Clang AST Matchers + RefactoringTool. Differential Revision: https://reviews.llvm.org/D28555 llvm-svn: 291890	2017-01-13 09:37:56 +00:00
Michael Zuckerman	558a4d8419	[X86][AVX512] Adding missing shuffle lowering to blend mask instructions Some shuffles can be lowered to blend mask instruction (VPBLENDMB/VPBLENDMW/VPBLENDMD/VPBLENDMQ) . In this patch, I added new pattern match for this case. Reviewers: 1. craig.topper 2. guyblank 3. RKSimon 4. igorb Differential Revision: https://reviews.llvm.org/D28483 llvm-svn: 291888	2017-01-13 09:06:00 +00:00
Tobias Grosser	190d4e5fa2	RegionPass: Set isExecuted flag correctly This was forgotten in r291882. Without this fix, the Polly build bots are broken. llvm-svn: 291887	2017-01-13 09:00:17 +00:00
Craig Topper	1ec84c2a18	[AVX-512] Remove unmasked BLENDM instructions from the wrong load folding table. The unmasked versions read memory from operand 2, but were in the operand 3 table. These aren't the most interesting set of blendm instructions as the unmasked version isn't useful. We were also missing the B and W forms. I'll add the masked versions of all sizes in a future patch. llvm-svn: 291885	2017-01-13 07:28:56 +00:00
Craig Topper	46b6ecf41e	[X86] Move some entries in the load folding tables to more appropriate grouping. NFC llvm-svn: 291884	2017-01-13 07:28:53 +00:00
Craig Topper	eec4890346	[IR] Don't call assertModuleIsMaterialized in release builds Summary: To fix a release vs debug build linking error, r259695 made the body of assertModuleIsMaterialized empty if Value.cpp gets compiled in a release build. This way any code compiled as a debug build can still link against a release version of the function. This patch takes this a step further and removes all calls to it from Value.h in any code that includes it in a release build. This shrinks the opt binary on my macbook build by 17240 bytes. Reviewers: rafael Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28191 llvm-svn: 291883	2017-01-13 06:26:18 +00:00
Serge Pavlov	d409411ef1	Track validity of pass results Running tests with expensive checks enabled exhibits some problems with verification of pass results. First, the pass verification may require results of analysis that are not available. For instance, verification of loop info requires results of dominator tree analysis. A pass may be marked as conserving loop info but does not need to be dependent on DominatorTreePass. When a pass manager tries to verify that loop info is valid, it needs dominator tree, but corresponding analysis may be already destroyed as no user of it remained. Another case is a pass that is skipped. For instance, entities with linkage available_externally do not need code generation and such passes are skipped for them. In this case result verification must also be skipped. To solve these problems this change introduces a special flag to the Pass structure to mark passes that have valid results. If this flag is reset, verifications dependent on the pass result are skipped. Differential Revision: https://reviews.llvm.org/D27190 llvm-svn: 291882	2017-01-13 06:09:54 +00:00
Easwaran Raman	b035f914e4	ProfileSummaryInfo improvements. * Add is{Hot|Cold}CallSite methods * Fix a bug in isHotBB where it was looking for MD_prof on a return instruction * Use MD_prof data only if sample profiling was used to collect profiles. * Add a unit test for ProfileSummaryInfo Differential Revision: https://reviews.llvm.org/D28584 llvm-svn: 291878	2017-01-13 01:34:00 +00:00
Eugene Zelenko	8187c192c6	[PowerPC] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 291872	2017-01-13 00:58:58 +00:00
Greg Clayton	0e62ee7d60	Add the ability to iterate across all attributes in a DIE. Differential Revision: https://reviews.llvm.org/D28386 llvm-svn: 291861	2017-01-13 00:13:42 +00:00
Evgeniy Stepanov	f01c70fec0	[asan] Don't overalign global metadata. Other than on COFF with incremental linking, global metadata should not need any extra alignment. Differential Revision: https://reviews.llvm.org/D28628 llvm-svn: 291859	2017-01-12 23:26:20 +00:00
Evgeniy Stepanov	5d31d08a21	[asan] Refactor instrumentation of globals. llvm-svn: 291858	2017-01-12 23:03:03 +00:00
Teresa Johnson	83aaf358fd	[ThinLTO] Import static functions from the same module as caller Summary: We can sometimes end up with multiple copies of a local function that have the same GUID in the index. This happens when there are local functions with the same name that are in different source files with the same name (but in different directories), and they were compiled in their own directory so had the same path at compile time. In this case make sure we import the copy in the caller's module. While it isn't a correctness problem (the renamed reference which is based on the module IR hash will be unique since the module must have had an externally visible function that was imported), importing the wrong copy will result in lost performance opportunity since it won't be referenced and inlined. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28440 llvm-svn: 291841	2017-01-12 22:04:45 +00:00
Chris Bieneman	07088c1060	[ObjectYAML] Pull yaml2dwarf out of yaml2obj for reuse This patch pulls the yaml2dwarf code out of yaml2obj into a new set of DWARF emitter functions in the DWARFYAML namespace. This will enable the YAML->DWARF code to be used inside DWARF tests by populating the DWARFYAML structs and calling the Emitter functions. llvm-svn: 291828	2017-01-12 21:35:21 +00:00
Robert Lougher	b0124c1eb8	[DebugInfo] Remove redundant check in SimplifyCFG; NFC. llvm-svn: 291813	2017-01-12 21:11:09 +00:00
Eli Friedman	b5c3a0d1c3	[SCEV] Simplify SolveLinEquationWithOverflow a bit. Cleanup in preparation for generalizing it. llvm-svn: 291808	2017-01-12 20:21:00 +00:00
Nikolai Bozhenov	f02ac0eeb2	[X86] Replace AND+IMM64 with SRL/SHL Emit SHRQ/SHLQ instead of ANDQ with a 64 bit constant mask if the result is unused and the mask has only higher/lower bits set. For example, with this patch LLVM emits shrq $41, %rdi je instead of movabsq $0xFFFFFE0000000000, %rcx testq %rcx, %rdi je This reduces number of instructions, code size and register pressure. The transformation is applied only for cases where the mask cannot be encoded as an immediate value within TESTQ instruction. Differential Revision: https://reviews.llvm.org/D28198 llvm-svn: 291806	2017-01-12 19:54:27 +00:00
Nikolai Bozhenov	6bdf92cec7	[X86] Tune bypassing of slow division for Intel CPUs 64-bit integer division in Intel CPUs is extremely slow, much slower than 32-bit division. On the other hand, 8-bit and 16-bit divisions aren't any faster. The only important exception is Atom where DIV8 is fastest. Because of that, the patch 1) Enables bypassing of 64-bit division for Atom, Silvermont and all big cores. 2) Modifies 64-bit bypassing to use 32-bit division instead of 16-bit one. This doesn't make the shorter division slower but increases chances of taking it. Moreover, it's much more likely to prove at compile-time that a value fits 32 bits and doesn't require a run-time check (e.g. zext i32 to i64). Differential Revision: https://reviews.llvm.org/D28196 llvm-svn: 291800	2017-01-12 19:34:15 +00:00
Matt Arsenault	45337df08f	AMDGPU: Skip fneg/select combine if it can fold into other llvm-svn: 291792	2017-01-12 18:58:15 +00:00
Matt Arsenault	31c039ef2e	AMDGPU: Fold free fneg into sin llvm-svn: 291790	2017-01-12 18:48:09 +00:00
Saleem Abdulrasool	555e5980a5	ARM: slightly more table driven libcall setup Switch some additional library call setup to be table driven. This makes it more immediately obvious what the library call looks like. This is important for ARM since the calling conventions for the builtins change based on the target/libcall name. NFC llvm-svn: 291789	2017-01-12 18:46:11 +00:00
Robert Lougher	6717a6fe54	[DebugInfo] DILocation variable declaration should be const; NFC. llvm-svn: 291787	2017-01-12 18:33:49 +00:00
Hans Wennborg	84da661509	Avoid std::errc::protocol_* to appease mingw Like r291636 and r285261. llvm-svn: 291786	2017-01-12 18:33:14 +00:00
Robert Lougher	f5df7a18dd	[DebugInfo] Add const to DILocation variable declaration; NFC. llvm-svn: 291785	2017-01-12 18:29:28 +00:00
Matt Arsenault	a8c325e2f5	AMDGPU: Fold fneg into fmul_legacy llvm-svn: 291784	2017-01-12 18:26:30 +00:00
Matt Arsenault	ff7e5aadf5	AMDGPU: Fold fneg into rcp llvm-svn: 291779	2017-01-12 17:46:35 +00:00
Matt Arsenault	4242d48c36	AMDGPU: Fold fneg into fp_round llvm-svn: 291778	2017-01-12 17:46:33 +00:00
Matt Arsenault	98d2bf1024	AMDGPU: Fold fneg into fp_extend llvm-svn: 291777	2017-01-12 17:46:28 +00:00
Daniel Sanders	b7391dd3b4	[globalisel] Move as much RegisterBank initialization to the constructor as possible Summary: The register bank is now entirely initialized in the constructor. However, we still have the hardcoded number of register classes which will be dealt with in the TableGen patch (D27338) since we do not have access to this information to resolve this at this stage. The number of register classes is known to the TRI and to TableGen but the RegisterBank constructor is too early for the former and too late for the latter. This will be fixed when the data is tablegen-erated. Reviewers: t.p.northover, ab, rovka, qcolombet Subscribers: aditya_nandakumar, kristof.beyls, vkalintiris, llvm-commits, dberris Differential Revision: https://reviews.llvm.org/D27809 llvm-svn: 291770	2017-01-12 16:11:23 +00:00
Amjad Aboud	9607571861	[DebugInfo] Added DI macro creation API to DIBuilder. Differential Revision: https://reviews.llvm.org/D16077 llvm-svn: 291769	2017-01-12 15:49:46 +00:00
Daniel Sanders	ae03595bfb	[globalisel] Initialize RegisterBanks with static data. Summary: Refactor the RegisterBank initialization to use static data. This requires GlobalISel implementations to rewrite calls to createRegisterBank() and addRegBankCoverage() into a call to setRegBankData(). Out of tree targets can use diff 4 of D27807 (https://reviews.llvm.org/D27807?id=84117) to have addRegBankCoverage() dump the register classes and other data that needs to be provided to setRegBankData(). This is the method that was used to generate the static data in this patch. Tablegen-eration of this static data will follow after some refactoring. Reviewers: t.p.northover, ab, rovka, qcolombet Subscribers: aditya_nandakumar, kristof.beyls, vkalintiris, llvm-commits, dberris Differential Revision: https://reviews.llvm.org/D27807 Differential Revision: https://reviews.llvm.org/D27808 llvm-svn: 291768	2017-01-12 15:32:10 +00:00
Piotr Padlewski	9530883e8c	[Devirtualization] MemDep returns non-local !invariant.group dependencies Summary: Memory Dependence Analysis was limited to returning only local dependencies for invariant.group handling. Now it returns NonLocal when it finds such a dependency, and a subsequent getNonLocalPointerDependency query returns the dependency that was found. Thanks to this we are able to devirtualize loops! void indirect(A &a, int n) { for (int i = 0; i < n; i++) a.foo(); } void test(int n) { A a; indirect(a, n); } After inlining, a.foo() will be changed to a direct call, even if foo and A::A() are external (but only if the vtable definition is available). Reviewers: nlewycky, dberlin, chandlerc, rsmith Subscribers: mehdi_amini, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D28137 llvm-svn: 291762	2017-01-12 11:33:58 +00:00
Matt Arsenault	f003198b28	AMDGPU: Fix sub_oneuse being marked commutative llvm-svn: 291748	2017-01-12 07:17:28 +00:00
Craig Topper	24c3a2395f	[AVX-512] Improve lowering of zero_extend of v4i1 to v4i32 and v2i1 to v2i64 with VLX, but no DQ or BW support. llvm-svn: 291747	2017-01-12 06:49:12 +00:00
Craig Topper	69ab67b279	[AVX-512] Improve lowering of sign_extend of v4i1 to v4i32 and v2i1 to v2i64 when avx512vl is available, but not avx512dq. llvm-svn: 291746	2017-01-12 06:49:08 +00:00
Elad Cohen	c5ba925ef2	[X86][AVX512] Fix PR31515 - Do not flip vselect condition if it's not a vXi1 mask r289653 added a case where `vselect <cond> <vector1> <all-zeros>` is transformed to: `vselect xor(cond, DAG.getConstant(1, DL, CondVT)) <all-zeros> <vector1>` This was not meant to catch cases where Cond is not a vXi1 mask, but it does. Moreover, when the Cond type is vXiN (N > 1), xor(cond, DAG.getConstant(1, DL, CondVT)) != NOT(cond). This patch changes the above to an xor with all-ones and avoids entering the case for non-mask Conds. llvm-svn: 291745	2017-01-12 06:49:03 +00:00
Matt Arsenault	63f953795e	AMDGPU: Fold fneg into fma or fmad Patch mostly by Fiona Glaser llvm-svn: 291733	2017-01-12 00:32:16 +00:00
Matt Arsenault	4103a81d6d	AMDGPU: Fold fneg into fmul Patch mostly by Fiona Glaser llvm-svn: 291732	2017-01-12 00:23:20 +00:00
Matt Arsenault	2529fba989	AMDGPU: Fold fneg into fadd Patch mostly by Fiona Glaser llvm-svn: 291731	2017-01-12 00:09:34 +00:00
Matt Arsenault	2a04ff97ad	AMDGPU: Pull fneg/fabs out of a select Allows better source modifier usage. llvm-svn: 291729	2017-01-11 23:57:38 +00:00
Davide Italiano	eac05f6b88	[NewGVN] Fixup store count for the `initial` congruency class. It was always zero. When we move a store from `initial` to its own congruency class, we end up with a negative store count, which is obviously wrong. Also, while here, change StoreCount to be signed so that the assertions actually fire. Ack'ed by Daniel Berlin. llvm-svn: 291725	2017-01-11 23:41:24 +00:00
Zachary Turner	629cb7d8cc	[CodeView] Finish decoupling TypeDatabase from TypeDumper. Previously the type dumper itself was passed around to a lot of different places and manipulated in ways that were more appropriate on the type database. For example, the entire TypeDumper was passed into the symbol dumper, when all the symbol dumper wanted to do was lookup the name of a TypeIndex so it could print it. That's what the TypeDatabase is for -- mapping type indices to names. Another example is how if the user runs llvm-pdbdump with the option to dump symbols but not types, we still have to visit all types so that we can print minimal information about the type of a symbol, but just without dumping full symbol records. The way we did this before is by hacking it up so that we run everything through the type dumper with a null printer, so that the output goes to /dev/null. But really, we don't need to dump anything, all we want to do is build the type database. Since TypeDatabaseVisitor now exists independently of TypeDumper, we can do this. We just build a custom visitor callback pipeline that includes a database visitor but not a dumper. All the hackery around printers etc goes away. After this patch, we could probably even delete the entire CVTypeDumper class since really all it is at this point is a thin wrapper that hides the details of how to build a useful visitation pipeline. It's not a priority though, so CVTypeDumper remains for now. After this patch we will be able to easily plug in a different style of type dumper by only implementing the proper visitation methods to dump one-line output and then sticking it on the pipeline. Differential Revision: https://reviews.llvm.org/D28524 llvm-svn: 291724	2017-01-11 23:24:22 +00:00
Peter Collingbourne	1b5f1cfdb4	X86: Remove dead code. NFC. llvm-svn: 291721	2017-01-11 23:00:28 +00:00
Matt Arsenault	24a1273ae1	AMDGPU: Fix shrinking of addc/subb. To shrink to VOP2 the input carry must also be VCC. llvm-svn: 291720	2017-01-11 22:58:12 +00:00
Matt Arsenault	682eb4396a	AMDGPU: Fix sext_inreg for i1 in i16 This produces worse code when i16 is legal, mostly due to combines getting confused by conversions inserted for uniform 16-bit operations. llvm-svn: 291717	2017-01-11 22:35:22 +00:00
Matt Arsenault	28bd4cbeaf	AMDGPU: Fix breaking VOP3 v_add_i32s This was shrinking the instruction even though the carry output register was a virtual register, not known VCC. llvm-svn: 291716	2017-01-11 22:35:17 +00:00
Kuba Mracek	503162b4a1	[asan] Set alignment of __asan_global_* globals to sizeof(GlobalStruct) When using profiling and ASan together (-fprofile-instr-generate -fcoverage-mapping -fsanitize=address), at least on Darwin, the section of globals that ASan emits (__asan_globals) is misaligned and starts at an odd offset. This really doesn't have anything to do with profiling, but it triggers the issue because profiling emits a string section, which can have arbitrary size. This patch changes the alignment to sizeof(GlobalStruct). Differential Revision: https://reviews.llvm.org/D28573 llvm-svn: 291715	2017-01-11 22:26:10 +00:00
Davide Italiano	0dc68bfa87	Revert "[NewGVN] Strengthen a couple of assertions." It's breaking some bots. Will investigate and recommit. llvm-svn: 291712	2017-01-11 22:00:29 +00:00
Matt Arsenault	69e3001b84	AMDGPU: Fix folding immediates into mac src2 Whether it is legal or not needs to check for the instruction it will be replaced with. llvm-svn: 291711	2017-01-11 22:00:02 +00:00
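
The [X86] "Replace AND+IMM64 with SRL/SHL" entry above (llvm-svn: 291806) relies on a mask test and a shift test being equivalent when the mask has only its high bits set. The following is a minimal sketch of that equivalence for the mask quoted in the commit message, not code from the patch itself; the helper names `highBitsClearViaMask`, `highBitsClearViaShift`, and `checkSamples` are hypothetical and exist only for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Mask from the commit message: the upper 23 bits set, the lower 41 bits clear.
constexpr std::uint64_t kHighMask = 0xFFFFFE0000000000ULL;

// Hypothetical helper: the zero test written with the 64-bit AND mask
// (corresponds to the movabsq + testq form).
constexpr bool highBitsClearViaMask(std::uint64_t x) {
  return (x & kHighMask) == 0;
}

// Hypothetical helper: the same zero test written as a logical right shift
// by 41 (corresponds to the shrq $41 form).
constexpr bool highBitsClearViaShift(std::uint64_t x) {
  return (x >> 41) == 0;
}

// Spot-check that both forms agree on a few boundary values.
inline void checkSamples() {
  const std::uint64_t samples[] = {0, 1, (1ULL << 41) - 1, 1ULL << 41, ~0ULL};
  for (std::uint64_t x : samples)
    assert(highBitsClearViaMask(x) == highBitsClearViaShift(x));
}
```

Both helpers only compare the result against zero, which mirrors the commit's condition that the AND result itself is unused.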

98398 Commits