llvm-project

Commit Graph

Author	SHA1	Message	Date
Manman Ren	4a9b0ebe83	Add a parameter for getLazyBitcodeModule to lazily load Metadata. We only defer loading metadata inside ParseModule when ShouldLazyLoadMetadata is true and we have not loaded any Metadata block yet. This commit implements all-or-nothing loading of Metadata. If there is a request to load any metadata block, we will load all deferred metadata blocks. We make sure the deferred metadata blocks are loaded before we materialize any function or a module. The default value of the added parameter ShouldLazyLoadMetadata for getLazyBitcodeModule is false, so the default behavior stays the same. We only set the parameter to true when creating LTOModule in local contexts. These can only really be used for parsing symbols, so it's unnecessary to ever load the metadata blocks. If we are going to enable lazy-loading of Metadata for other usages of getLazyBitcodeModule, where deferred metadata blocks need to be loaded, we can expose BitcodeReader::materializeMetadata to Module, similar to Module::materialize. rdar://19804575 llvm-svn: 232198	2015-03-13 19:24:30 +00:00
Duncan P. N. Exon Smith	c6820ec1c2	instcombine: alloca: Split out simplifyAllocaArraySize(), NFC Follow-up commits will change some of the logic here. Splitting into a separate function simplifies the logic by allowing early returns instead of deeper nesting. llvm-svn: 232197	2015-03-13 19:22:03 +00:00
Robert Lougher	5e0ea66d59	Revert: "[Reassociate] Add initial support for vector instructions." This reverts revision 232190 due to buildbot failure reported on clang-hexagon-elf for test arm64_vtst.c. To be investigated. llvm-svn: 232196	2015-03-13 19:20:46 +00:00
Robert Lougher	1bad505c3c	[Reassociate] Add initial support for vector instructions. This patch adds initial support for vector instructions to the reassociation pass. It enables most parts of the pass to work with vectors but to keep the size of the patch small, optimization of Xor trees, canonicalization of negative constants and converting shifts to muls, etc., have been left out. This will be handled in later patches. The patch is based on an initial patch by Chad Rosier. Differential Revision: http://reviews.llvm.org/D7566 llvm-svn: 232190	2015-03-13 18:33:27 +00:00
Sanjoy Das	f1e9e1df25	[SCEV] Fix PR22856. Summary: ScalarEvolutionExpander assumes that the header block of a loop is a legal place to have a use for a phi node. This is true only for phis that are either in the header or dominate the header block, but it is not true for phi nodes that are strictly internal to the loop body. This change teaches ScalarEvolutionExpander to place uses of PHI nodes in the basic block the PHI nodes belong to. This is always legal, and `hoistIVInc` ensures that the said position dominates `IsomorphicInc`. Reviewers: atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8311 llvm-svn: 232189	2015-03-13 18:31:19 +00:00
David Blaikie	f72d05bc7b	[opaque pointer type] Add textual IR support for explicit type parameter to gep operator Similar to gep (r230786) and load (r230794) changes. Similar migration script can be used to update test cases, which successfully migrated all of LLVM and Polly, but about 4 test cases needed manually changes in Clang. (this script will read the contents of stdin and massage it into stdout - wrap it in the 'apply.sh' script shown in previous commits + xargs to apply it over a large set of test cases) import fileinput import sys import re rep = re.compile(r"(getelementptr(?:\s+inbounds)?\s$)((<\d\s+x\s+)?([^@]?)(\|\saddrspace\(\d+$)\s\(?(3)>)\s*)(?=$\|%\|@\|null\|undef\|blockaddress\|getelementptr\|addrspacecast\|bitcast\|inttoptr\|zeroinitializer\|<\|\[\[[a-zA-Z]\|\{\{)", re.MULTILINE \| re.DOTALL) def conv(match): line = match.group(1) line += match.group(4) line += ", " line += match.group(2) return line line = sys.stdin.read() off = 0 for match in re.finditer(rep, line): sys.stdout.write(line[off:match.start()]) sys.stdout.write(conv(match)) off = match.end() sys.stdout.write(line[off:]) llvm-svn: 232184	2015-03-13 18:20:45 +00:00
Jan Vesely	7a9cca9e7d	r600: Clear visited structure before running. Fixes random crashes in for-loop piglit. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> llvm-svn: 232181	2015-03-13 17:32:46 +00:00
Jan Vesely	18b289f590	r600: Use deque and simplify loops in AMDGPUCFGStructurizer Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> llvm-svn: 232180	2015-03-13 17:32:43 +00:00
Andrea Di Biagio	510feca1b8	[X86][AVX] Fix wrong lowering of v4x64 shuffles into concat_vector plus extract_subvector nodes. This patch fixes a bug in the shuffle lowering logic implemented by function 'lowerV2X128VectorShuffle'. The are few cases where function 'lowerV2X128VectorShuffle' wrongly expands a shuffle of two v4X64 vectors into a CONCAT_VECTORS of two EXTRACT_SUBVECTOR nodes. The problematic expansion only occurs when the shuffle mask M has an 'undef' element at position 2, and M is equivalent to mask <0,1,4,5>. In that case, the algorithm propagates the wrong vector to one of the two new EXTRACT_SUBVECTOR nodes. Example: ;; define <4 x double> @test(<4 x double> %A, <4 x double> %B) { entry: %0 = shufflevector <4 x double> %A, <4 x double> %B, <4 x i32><i32 undef, i32 1, i32 undef, i32 5> ret <4 x double> %0 } ;; Before this patch, llc (-mattr=+avx) generated: vinsertf128 $1, %xmm0, %ymm0, %ymm0 With this patch, llc correctly generates: vinsertf128 $1, %xmm1, %ymm0, %ymm0 Added test lower-vec-shuffle-bug.ll Differential Revision: http://reviews.llvm.org/D8259 llvm-svn: 232179	2015-03-13 17:29:49 +00:00
Benjamin Kramer	76e37aa334	unique_ptrs are unique already, no need to unique them any further. llvm-svn: 232178	2015-03-13 16:59:29 +00:00
David Majnemer	e2a4b856d8	ConstantFold: Fix big shift constant folding Constant folding for shift IR instructions ignores all bits above 32 of second argument (shift amount). Because of that, some undef results are not recognized and APInt can raise an assert failure if second argument has more than 64 bits. Patch by Paweł Bylica! Differential Revision: http://reviews.llvm.org/D7701 llvm-svn: 232176	2015-03-13 16:39:46 +00:00
Daniel Sanders	60f1db0525	Recommit r232027 with PR22883 fixed: Add infrastructure for support of multiple memory constraints. The operand flag word for ISD::INLINEASM nodes now contains a 15-bit memory constraint ID when the operand kind is Kind_Mem. This constraint ID is a numeric equivalent to the constraint code string and is converted with a target specific hook in TargetLowering. This patch maps all memory constraints to InlineAsm::Constraint_m so there is no functional change at this point. It just proves that using these previously unused bits in the encoding of the flag word doesn't break anything. The next patch will make each target preserve the current mapping of everything to Constraint_m for itself while changing the target independent implementation of the hook to return Constraint_Unknown appropriately. Each target will then be adapted in separate patches to use appropriate Constraint_* values. PR22883 was caused the matching operands copying the whole of the operand flags for the matched operand. This included the constraint id which needed to be replaced with the operand number. This has been fixed with a conversion function. Following on from this, matching operands also used the operand number as the constraint id. This has been fixed by looking up the matched operand and taking it from there. llvm-svn: 232165	2015-03-13 12:45:09 +00:00
Toma Tabacu	e95a49118c	[mips] [IAS] Refactor MipsTargetStreamer::emitMipsAbiFlags(). NFC. Summary: Make emitMipsAbiFlags a direct member of MipsTargetELFStreamer, as that's the only place where it's used, and remove the empty implementations from MipsTargetStreamer and MipsTargetAsmStreamer. Reviewers: dsanders, rafael Reviewed By: rafael Subscribers: rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D8199 llvm-svn: 232161	2015-03-13 11:40:01 +00:00
Owen Anderson	41a185c521	Teach TBAA analysis to report errors on cyclic TBAA metadata rather than hanging. llvm-svn: 232144	2015-03-13 07:09:33 +00:00
Owen Anderson	08f46e1de6	Fix an infinite recursion in the verifier caused by calling isSized on a recursive type. llvm-svn: 232143	2015-03-13 06:41:26 +00:00
Hao Liu	04183242b3	[MachineCopyPropagation] Fix a bug causing incorrect removal for the instruction sequences as follows %Q5_Q6<def> = COPY %Q2_Q3 %D5<def> = %D3<def> = %D3<def> = COPY %D6 // Incorrectly removed in MachineCopyPropagation Using of %D3 results in incorrect result ... Reviewed in http://reviews.llvm.org/D8242 llvm-svn: 232142	2015-03-13 05:15:23 +00:00
Nick Lewycky	b6ef9a14de	When forming an addrec out of a phi don't just look at the last computation and steal its flags for our own, there may be other computations in the middle. Check whether the LHS of the computation is the phi itself and then we know it's safe to steal the flags. Fixes PR22795. There's a missed optimization opportunity where we could look at the full chain of computation and take the intersection of the flags instead of only looking one instruction deep. llvm-svn: 232134	2015-03-13 01:37:52 +00:00
Eric Christopher	ef9e01eada	Use the cached subtarget off of the machine function. llvm-svn: 232129	2015-03-13 00:49:50 +00:00
Eric Christopher	5ab3b79ba8	Use the cached subtarget off of the machine function. llvm-svn: 232128	2015-03-13 00:38:19 +00:00
Sanjay Patel	4339abe66f	[X86, AVX2] Replace inserti128 and extracti128 intrinsics with generic shuffles This should complete the job started in r231794 and continued in r232045: We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. AVX2 introduced proper integer variants of the hacked integer insert/extract C intrinsics that were created for this same functionality with AVX1. This should complete the removal of insert/extract128 intrinsics. The Clang precursor patch for this change was checked in at r232109. llvm-svn: 232120	2015-03-12 23:16:18 +00:00
Eric Christopher	7fde301d5b	Move a variable into the assert where it's used - fixes a -Asserts build warning/error. llvm-svn: 232119	2015-03-12 23:13:03 +00:00
Eric Christopher	ae32649ff2	In preparation for moving ARM's TargetRegisterInfo to the TargetMachine merge Thumb1RegisterInfo and Thumb2RegisterInfo. This will enable us to match the TargetMachine for our TargetRegisterInfo classes. llvm-svn: 232117	2015-03-12 22:48:50 +00:00
Tom Stellard	e2f5b41055	R600/SI: Don't print scc reg in sopc assembly string This is how the proprietary driver prints sopc instructions. llvm-svn: 232106	2015-03-12 21:34:28 +00:00
Tom Stellard	c0503926f5	R600/SI: Remove _e32 and _e64 suffixes from mnemonics Instead print them as part of the $dst operand. The AsmMatcher requires the 32-bit and 64-bit encodings have the same mnemonic in order to parse them correctly. llvm-svn: 232105	2015-03-12 21:34:22 +00:00
Eric Christopher	1b585aeb8a	Migrate the AArch64 TargetRegisterInfo to its TargetMachine implementation. This requires a bit of scaffolding and a few fixups that'll go away once all of the ports have been migrated. llvm-svn: 232103	2015-03-12 21:04:46 +00:00
Eric Christopher	1f0a635116	Remove unused headers. llvm-svn: 232102	2015-03-12 21:04:42 +00:00
Hal Finkel	e78e52ba9b	Revert "r232027 - Add infrastructure for support of multiple memory constraints" This (r232027) has caused PR22883; so it seems those bits might be used by something else after all. Reverting until we can figure out what else to do. Original commit message: The operand flag word for ISD::INLINEASM nodes now contains a 15-bit memory constraint ID when the operand kind is Kind_Mem. This constraint ID is a numeric equivalent to the constraint code string and is converted with a target specific hook in TargetLowering. This patch maps all memory constraints to InlineAsm::Constraint_m so there is no functional change at this point. It just proves that using these previously unused bits in the encoding of the flag word doesn't break anything. The next patch will make each target preserve the current mapping of everything to Constraint_m for itself while changing the target independent implementation of the hook to return Constraint_Unknown appropriately. Each target will then be adapted in separate patches to use appropriate Constraint_* values. llvm-svn: 232093	2015-03-12 20:09:39 +00:00
Quentin Colombet	f59b2d034c	[X86] Fix a regression introduced by r223641. The permps and permd instructions have their operands swapped compared to the intrinsic definition. Therefore, they do not fall into the INTR_TYPE_2OP category. I did not create a new category for those two, as they are the only one AFAICT in that case. <rdar://problem/20108262> llvm-svn: 232085	2015-03-12 19:34:12 +00:00
Eric Christopher	63ea0402c2	Fix comment formatting. llvm-svn: 232076	2015-03-12 18:23:01 +00:00
Eric Christopher	ed6a446403	Remove the need to cache the subtarget in the X86 TargetRegisterInfo classes. Use a Triple instead and simplify a lot of the querying logic to use lookups on the Triple. llvm-svn: 232071	2015-03-12 17:54:19 +00:00
Krzysztof Parzyszek	a29622a8c5	Remove unused complex patterns for addressing modes on Hexagon. llvm-svn: 232057	2015-03-12 16:44:50 +00:00
Sanjay Patel	7b079fb219	make an array of constants explicitly const Suggested by Craig Topper in D8184. This goes with r232047. llvm-svn: 232056	2015-03-12 16:29:58 +00:00
Sanjay Patel	2db6d3899b	IRBuilder: add a CreateShuffleVector function that takes an ArrayRef of int This is a convenience function to ease mask creation of ShuffleVectors in AutoUpgrade and other places. Differential Revision: http://reviews.llvm.org/D8184 llvm-svn: 232047	2015-03-12 15:27:07 +00:00
Andrea Di Biagio	de2fb00a16	[X86] Fix wrong target specific combine on SETCC nodes. Part of the folding logic implemented by function 'PerformISDSETCCCombine' only worked under the assumption that the condition code in input could have been either SETNE or SETEQ. Unfortunately that assumption was incorrect, and in some cases the algorithm ended up incorrectly folding SETCC nodes. The incorrect folding only affected SETCC dag nodes where: - one of the operands was a build_vector of all zeroes; - the other operand was a SIGN_EXTEND from a vector of MVT:i1 elements; - the condition code was neither SETNE nor SETEQ. Example: (setcc (v4i32 (sign_extend v4i1:%A)), (v4i32 VectorOfAllZeroes), setge) Before this patch, the entire dag node sequence from the example was incorrectly folded to node %A. With this patch, the dag node sequence is folded to a (xor %A, (v4i1 VectorOfAllOnes)). Added test setcc-combine.ll. Thanks to Greg Bedwell for spotting this issue. llvm-svn: 232046	2015-03-12 15:16:58 +00:00
Sanjay Patel	af1846c097	[X86, AVX] replace vextractf128 intrinsics with generic shuffles Now that we've replaced the vinsertf128 intrinsics, do the same for their extract twins. This is very much like D8086 (checked in at r231794): We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is also the LLVM sibling to the cfe D8275 patch. Differential Revision: http://reviews.llvm.org/D8276 llvm-svn: 232045	2015-03-12 15:15:19 +00:00
Aaron Ballman	c579d66b9a	Silencing an "enumeral and non-enumeral type in conditional expression" warning; NFC. llvm-svn: 232035	2015-03-12 13:24:06 +00:00
Daniel Sanders	41c072e63b	Add infrastructure for support of multiple memory constraints. Summary: The operand flag word for ISD::INLINEASM nodes now contains a 15-bit memory constraint ID when the operand kind is Kind_Mem. This constraint ID is a numeric equivalent to the constraint code string and is converted with a target specific hook in TargetLowering. This patch maps all memory constraints to InlineAsm::Constraint_m so there is no functional change at this point. It just proves that using these previously unused bits in the encoding of the flag word doesn't break anything. The next patch will make each target preserve the current mapping of everything to Constraint_m for itself while changing the target independent implementation of the hook to return Constraint_Unknown appropriately. Each target will then be adapted in separate patches to use appropriate Constraint_* values. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D8171 llvm-svn: 232027	2015-03-12 11:00:48 +00:00
Elena Demikhovsky	5d06b4c80c	AVX-512: Added encoding tests for VPROR, VPROL instructions, fixed opcode. llvm-svn: 232018	2015-03-12 07:28:41 +00:00
Eric Christopher	234a1ec404	Remove some unnecessary forward declarations and put a couple more where they're supposed to reside. llvm-svn: 232014	2015-03-12 06:07:16 +00:00
Eric Christopher	8bb838ac07	Remove the need to cache the subtarget in the Sparc TargetRegisterInfo classes. llvm-svn: 232013	2015-03-12 05:55:26 +00:00
Eric Christopher	a20c3cf85d	Remove the need to cache the subtarget in the Mips TargetRegisterInfo classes. llvm-svn: 232012	2015-03-12 05:43:57 +00:00
Kevin Qin	49bc764310	Reapply 'Run LICM pass after loop unrolling pass.' It's firstly committed at r231630, and reverted at r231635. Function pass InstructionSimplifier is inserted as barrier to make sure loop unroll pass won't affect on LICM pass. llvm-svn: 232011	2015-03-12 05:36:01 +00:00
Eric Christopher	34085832f8	Remove the need to cache the subtarget in the ARM TargetRegisterInfo classes. Replace the frame pointer initialization with a static function that'll look it up via the subtarget on the MachineFunction. llvm-svn: 232010	2015-03-12 05:12:31 +00:00
Eric Christopher	09696d3fea	Remove the need to cache the subtarget in the AArch64 TargetRegisterInfo classes. Replace it with a cache to the Triple and use that where applicable at the moment. llvm-svn: 232005	2015-03-12 02:04:46 +00:00
Jingyue Wu	e8290f21b5	[NVPTXAsmPrinter] do not print .align on function headers Summary: PTX does not allow .align directives on function headers. Fixes PR21551. Test Plan: test/Codegen/NVPTX/function-align.ll Reviewers: eliben, jholewinski Reviewed By: eliben, jholewinski Subscribers: llvm-commits, eliben, jpienaar, jholewinski Differential Revision: http://reviews.llvm.org/D8274 llvm-svn: 232004	2015-03-12 01:50:30 +00:00
Reid Kleckner	52b07790ff	Make llvm.eh.actions an intrinsic and add docs for it These docs don't match the way WinEHPrepare uses them yet, and verifier support isn't implemented either. The implementation will come after the documentation text is reviewed and agreed upon. llvm-svn: 232003	2015-03-12 01:45:37 +00:00
Eric Christopher	ea178cf48f	Remove the need to cache the subtarget in the PowerPC TargetRegisterInfo classes. Replace it with a cache to the TargetMachine and use that where applicable at the moment. llvm-svn: 232002	2015-03-12 01:42:51 +00:00
Krzysztof Parzyszek	325297c101	Fix build break introduced in r231992 llvm-svn: 231996	2015-03-12 00:49:13 +00:00
Reid Kleckner	47c8e7a0e7	Stop calling DwarfEHPrepare from WinEHPrepare Instead, run both EH preparation passes, and have them both ignore functions with unrecognized EH personalities. Pass delegation involved some hacky code for creating an AnalysisResolver that we don't need now. llvm-svn: 231995	2015-03-12 00:36:20 +00:00
Krzysztof Parzyszek	6d5a4b5dcd	Eliminate constant-extender profitability checks from Hexagon isel llvm-svn: 231992	2015-03-12 00:19:59 +00:00

1 2 3 4 5 ...

77859 Commits