llvm-project

Commit Graph

Author	SHA1	Message	Date
Alina Sbirlea	e76dcd2b12	[cpu-detection] Return amdfam10 for all subtypes. Address Bug 28067. Summary: Remove architecture subtype from the string returned by getHostCPUName(). String matching done on type. Reviewers: llvm-commits, echristo Subscribers: mehdi_amini Differential Revision: http://reviews.llvm.org/D21193 llvm-svn: 272328	2016-06-09 22:47:12 +00:00
Easwaran Raman	71069cf67d	Use ProfileSummaryInfo in inline cost analysis. Instead of directly using MaxFunctionCount and function entry count to determine callee hotness, use the isHotFunction/isColdFunction methods provided by ProfileSummaryInfo. Differential revision: http://reviews.llvm.org/D21045 llvm-svn: 272321	2016-06-09 22:23:21 +00:00
Simon Pilgrim	643734c565	[X86][AVX512] Added avx512 VPSLLDQ/VPSRLDQ instruction comments llvm-svn: 272319	2016-06-09 22:03:15 +00:00
Quentin Colombet	d307909a50	[LiveRangeEdit] Fix a crash in eliminateDeadDef. When we delete a live-range, we check if that live-range is the origin of others to keep it around for rematerialization. For that we check that the instruction we are about to remove is the same as the definition of the VNI of the original live-range. If this is the case, we just shrink the live-range to an empty one. Now, when we try to delete one of the children of such live-range (product of splitting), we do the same check. However, now the original live-range is empty and there is no way we can access the VNI to check its definition, and we crash. When we cannot get the VNI for the original live-range, that means we are not in the presence of the original definition. Thus, this check does not need to happen in that case and the crash is solved! This bug was introduced in r266162 | wmi | 2016-04-12 20:08:27. It affects every target that uses the greedy register allocator. To happen, we need to delete both the original instruction and its split products, in that order. This is likely to happen when rematerialization comes into play. Trying to produce a more robust test case. Will follow in a coming commit. This fixes llvm.org/PR27983. rdar://problem/26651519 llvm-svn: 272314	2016-06-09 21:34:31 +00:00
Simon Pilgrim	f718682eb9	[X86][AVX512] Dropped avx512 VPSLLDQ/VPSRLDQ intrinsics Auto-upgrade to generic shuffles like sse/avx2 implementations now that we can lower to VPSLLDQ/VPSRLDQ llvm-svn: 272308	2016-06-09 21:09:03 +00:00
Simon Pilgrim	47c76e201a	[X86][AVX512] Fixed issue with v16i32 shuffles lowering to VPALIGNR llvm-svn: 272307	2016-06-09 20:53:12 +00:00
Duncan P. N. Exon Smith	c3f8997386	BitcodeReader: Use std::piecewise_construct when upgrading type refs r267296 used std::piecewise_construct without using std::forward_as_tuple, and r267298 hacked it out (using an emplace_back followed by a couple of reset() calls) because of a problem on a bot. I'm finally circling back to call forward_as_tuple as I should have to begin with (thanks to David Blaikie for pointing out the missing piece). Note that this code uses emplace_back() instead of push_back(make_pair()) because the move constructor for TrackingMDRef is expensive (cheaper than a copy, but still expensive). llvm-svn: 272306	2016-06-09 20:46:33 +00:00
Simon Pilgrim	0ab9d3026a	[X86][AVX512] Added support for lowering 512-bit vector shuffles to bit/byte shifts 512-bit VPSLLDQ/VPSRLDQ can only be used for avx512bw targets so lowerVectorShuffleAsShift had to be adjusted to include the subtarget llvm-svn: 272300	2016-06-09 20:13:58 +00:00
Justin Lebar	ed2c282d4b	[NVPTX] Add intrinsics for shfl instructions. Summary: Currently clang emits these instructions via inline (volatile) asm in the CUDA headers. Switching to intrinsics will let the optimizer reason across calls to these intrinsics. Reviewers: tra Subscribers: llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D21160 llvm-svn: 272298	2016-06-09 20:04:08 +00:00
Easwaran Raman	e12c487b8c	[PM] Port LCSSA to the new PM. Differential Revision: http://reviews.llvm.org/D21090 llvm-svn: 272294	2016-06-09 19:44:46 +00:00
Wei Ding	ed0f97fad2	AMDGPU/SI: Fix 32-bit fdiv lowering We were using the fast fdiv lowering for all division, implementation of IEEE754 fdiv is added. http://reviews.llvm.org/D20557 llvm-svn: 272292	2016-06-09 19:17:15 +00:00
Michael Kuperstein	c5edcdeb0e	[LV] Use vector phis for some secondary induction variables Previously, we materialized secondary vector IVs from the primary scalar IV, by offsetting the primary to match the correct start value, and then broadcasting it - inside the loop body. Instead, we can use a real vector IV, like we do for the primary. This enables using vector IVs for secondary integer IVs whose type matches the type of the primary. Differential Revision: http://reviews.llvm.org/D20932 llvm-svn: 272283	2016-06-09 18:03:15 +00:00
Jan Vesely	2da0cba5fb	SelectionDAG: Implement expansion of {S,U}MIN/MAX in integer legalization Fixes {u,}long_{min,max,clamp} opencl piglit regressions on EG. Reviewers: arsenm Differential Revision: http://reviews.llvm.org/D17898 llvm-svn: 272272	2016-06-09 16:04:00 +00:00
Haicheng Wu	5b458cc1f6	Reapply "[MBP] Reduce code size by running tail merging in MBP." This reapplies commit r271930, r271915, r271923. They hit a bug in Thumb which is fixed in r272258 now. The original message: The code layout that TailMerging (inside BranchFolding) works on is not the final layout optimized based on the branch probability. Generally, after BlockPlacement, many new merging opportunities emerge. This patch calls Tail Merging after MBP and calls MBP again if Tail Merging merges anything. llvm-svn: 272267	2016-06-09 15:24:29 +00:00
Ulrich Weigand	79564611d9	[SystemZ] Enable long displacement constraints for inline ASM operands This enables use of the 'S' constraint for inline ASM operands on SystemZ, which allows for a memory reference with a signed 20-bit immediate displacement. This patch includes corresponding documentation and test case updates. I've changed the 'T' constraint to match the new behavior for 'S', as 'T' also uses a long displacement (though index constraints are still not implemented). I also changed 'm' to match the behavior for 'S' as this will allow for a wider range of displacements for 'm', though correct me if that's not the right decision. Author: colpell Differential Revision: http://reviews.llvm.org/D21097 llvm-svn: 272266	2016-06-09 15:19:16 +00:00
Davide Italiano	bd4243c519	[CodeGen] Change getSDagStackGuard to get an internal sym. Fixes a crash in the backend during an LTO build of rtld(1) in FreeBSD. llvm-svn: 272262	2016-06-09 14:23:38 +00:00
Hrvoje Varga	c962c4936e	[mips][microMIPS] Implement BOVC, BNVC, EXT, INS and JALRC instructions Differential Revision: http://reviews.llvm.org/D11798 llvm-svn: 272259	2016-06-09 12:57:23 +00:00
James Molloy	a7dbf987b5	[Thumb] A branch is not part of an IT block ReplaceTailWithBranchTo assumed that if an instruction is predicated, it must be part of an IT block. This is not correct for conditional branches. No testcase as this was triggered by the reverted patch r272017 - test coverage will occur when that patch is re-reverted and there is no known way to trigger this in the meantime. llvm-svn: 272258	2016-06-09 11:51:29 +00:00
Igor Breger	f635367e2b	[AVX512] Remove masked_move/blendm intrinsic from back-end. This is complement patch to D21060. Differential Revision: http://reviews.llvm.org/D21174 llvm-svn: 272257	2016-06-09 11:46:55 +00:00
Zlatko Buljan	cd242c1655	[mips][microMIPS] Add CodeGen support for SEL.*, SELEQZ, SELNEZ, SELEQZ.*, SELNEZ.* and CMP.condn.fmt instructions Differential Revision: http://reviews.llvm.org/D20862 llvm-svn: 272256	2016-06-09 11:15:53 +00:00
Sam Kolton	c9bdcb75c4	[AMDGPU] Disassembler: Support for sdwa instructions Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D21129 llvm-svn: 272255	2016-06-09 11:04:45 +00:00
Craig Topper	6f7288dc44	[AVX512] Fix shuffle decode printing for several instructions with write masks. There are still more bugs here with UNPCK and PALIGN for sure. But these were the easiest ones to fix. llvm-svn: 272252	2016-06-09 07:49:08 +00:00
James Molloy	feb9f4243b	[Thumb] Select a BIC instead of AND if the immediate can be encoded more optimally negated If an immediate is only used in an AND node, it is possible that the immediate can be more optimally materialized when negated. If this is the case, we can negate the immediate and use a BIC instead; int i(int a) { return a & 0xfffffeec; } Used to produce: ldr r1, [CONSTPOOL] ands r0, r1 CONSTPOOL: 0xfffffeec And now produces: movs r1, #255 adds r1, #20 ; Less costly immediate generation bics r0, r1 llvm-svn: 272251	2016-06-09 07:39:08 +00:00
Craig Topper	7a2993093e	[X86] Bring consistent naming to the SSE/AVX and AVX512 PALIGNR instructions. Then add shuffle decode printing for the EVEX forms which is made easier by having the naming structure more similar to other instructions. llvm-svn: 272249	2016-06-09 07:06:38 +00:00
Craig Topper	565a5b5451	[X86] Fix bad comment in assert. NFC llvm-svn: 272248	2016-06-09 07:06:33 +00:00
Xinliang David Li	ecde1c7f3d	Revert r272194 No need for it if loop Analysis Manager is used llvm-svn: 272243	2016-06-09 03:22:39 +00:00
Saleem Abdulrasool	6c19ffc8bc	AArch64: support the `.arch` directive in the IAS Add support to the AArch64 IAS for the `.arch` directive. This allows the assembly input to use architectural functionality in part of a file. This is used in existing code like BoringSSL. Resolves PR26016! llvm-svn: 272241	2016-06-09 02:56:40 +00:00
Kostya Serebryany	f7798526b9	[libFuzzer] add one more OOM test, which we currently don't handle very well llvm-svn: 272240	2016-06-09 01:20:35 +00:00
Teresa Johnson	7ab1f69272	[ThinLTO/gold] Enable summary-based internalization Summary: Enable existing summary-based importing support in the gold-plugin. Reviewers: mehdi_amini Subscribers: llvm-commits, mehdi_amini Differential Revision: http://reviews.llvm.org/D21080 llvm-svn: 272239	2016-06-09 01:14:13 +00:00
Sanjoy Das	1eade91513	Minor clean up in loopHasNoAbnormalExits; NFC llvm-svn: 272238	2016-06-09 01:14:03 +00:00
Sanjoy Das	c7f69b921f	Be wary of abnormal exits from loop when exploiting UB We can safely rely on a NoWrap add recurrence causing UB down the road only if we know the loop does not have an exit expressed in a way that is opaque to ScalarEvolution (e.g. by a function call that conditionally calls exit(0)). I believe with this change PR28012 is fixed. Note: I had to change some llvm-lit tests in LoopReroll, since it looks like they were depending on this incorrect behavior. llvm-svn: 272237	2016-06-09 01:13:59 +00:00
Sanjoy Das	97cd7d5d44	Factor out a loopHasNoAbnormalExits; NFC llvm-svn: 272236	2016-06-09 01:13:54 +00:00
Richard Smith	2ad6d48b0c	Search for llvm-symbolizer binary in the same directory as argv[0], before looking for it along $PATH. This allows installs of LLVM tools outside of $PATH to find the symbolizer and produce pretty backtraces if they crash. llvm-svn: 272232	2016-06-09 00:53:21 +00:00
Reid Kleckner	6d1d27542f	[codeview] Skip DIGlobalVariables with no variable They have probably been discarded during optimization. llvm-svn: 272231	2016-06-09 00:29:00 +00:00
Rui Ueyama	c41cd6dcf7	[pdbdump] Verify part of TPI hash streams. TPI hash table contains a parallel array for the type records. For each type record R, a hash value is calculated by `H(R) % NumBuckets` where H is a hash function, and the result is stored to a bucket element. H is TPI1::hashPrec function in microsoft-pdb repository. Our hash function does not support all type record types yet. Currently it supports only records for line number. I'll extend it in a follow up patch. The aim of verifying the hash table is not only to detect corrupted files. It also ensures that our understanding of how the hash values are calculated is correct. llvm-svn: 272229	2016-06-09 00:10:19 +00:00
Alina Sbirlea	080241b75d	[cpu-detection] Add missing break statements in outer switches Summary: Break on all switch cases for outer and inner switches. No functionality changed. Reviewers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D21158 llvm-svn: 272228	2016-06-09 00:08:15 +00:00
Quentin Colombet	2c6469687d	[MIR] Check that generic virtual registers get a size. Without that check it was possible to write test cases where the size was not specified and we ended up with weird asserts down the road, because the default value (1) would not make sense. llvm-svn: 272226	2016-06-08 23:27:46 +00:00
Rui Ueyama	f05f360deb	Function names should start with lowercase letters. llvm-svn: 272225	2016-06-08 23:15:09 +00:00
Michael Zolotukhin	8e7e76729d	[LoopSimplify] Preserve LCSSA when merging exit blocks. Summary: This fixes PR26682. Also add LCSSA as a preserved pass to LoopSimplify, which looks correct to me and allows writing a test for the issue. Reviewers: chandlerc, bogner, sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21112 llvm-svn: 272224	2016-06-08 23:13:21 +00:00
Rui Ueyama	170988f21f	[PDB] Move PDB functions to a separate file. We are going to use the hash functions from TPI streams. Differential Revision: http://reviews.llvm.org/D21142 llvm-svn: 272223	2016-06-08 23:11:14 +00:00
Michael Zolotukhin	aa547616d2	[LoopUnroll] Check that DT is available before trying to verify it. llvm-svn: 272221	2016-06-08 22:49:59 +00:00
Quentin Colombet	3340645771	[RegBankSelect] Print out the actual mapping of the operands. This improves the debuggability of the pass. llvm-svn: 272210	2016-06-08 21:55:30 +00:00
Quentin Colombet	9400bfbf42	[RegBankSelect] Remove a debug print of a potentially dead instruction. For complex rewritings, which do not occur currently, the related machine instruction may have been deleted in the process. Therefore, do not try to print it after the mapping is applied. llvm-svn: 272209	2016-06-08 21:55:29 +00:00
Quentin Colombet	9f8e209c60	[RegisterBankInfo] Avoid code duplication in OperandsMapper for the computation of the end of range. Refactor the code so that we do not compute in two different places the end iterator for the range of new virtual registers for a given operand. Although this refactoring was intended as NFC, this is not the case because it actually fixes a bug where we were returning a range off by 1 (too long). Right now, this could not result in an actual bug because we were accessing this range via the BreakDown size of the related operand. llvm-svn: 272208	2016-06-08 21:55:26 +00:00
Quentin Colombet	9d26805f42	[RegisterBankInfo] Add dump/print methods for OperandsMapper. Improve debuggability of the OperandsMapper helper class. llvm-svn: 272207	2016-06-08 21:55:23 +00:00
Michael Zolotukhin	987ab631fa	[SLPVectorizer] Handle GEP with differing constant index types Summary: This fixes PR27617. Bug description: The SLPVectorizer asserts on encountering GEPs with different index types, such as i8 and i64. The patch includes a simple relaxation of the assert to allow constants being of different types, along with a regression test that will provoke the unrelaxed assert. Reviewers: nadav, mzolotukhin Subscribers: JesperAntonsson, llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D20685 Patch by Jesper Antonsson! llvm-svn: 272206	2016-06-08 21:55:16 +00:00
Davide Italiano	02861d8695	[PM] Add missing caching of GlobalsAA to EarlyCSE. llvm-svn: 272204	2016-06-08 21:31:55 +00:00
Dehao Chen	769219b11a	Revive http://reviews.llvm.org/D12778 to handle forward-hot-prob and backward-hot-prob consistently. Summary: Consider the following diamond CFG: A / \ B C \/ D Suppose A->B and A->C have probabilities 81% and 19%. In block-placement, A->B is called a hot edge and the final placement should be ABDC. However, the current implementation outputs ABCD. This is because when choosing the next block of B, it checks if Freq(C->D) > Freq(B->D) * 20%, which is true (if Freq(A) = 100, then Freq(B->D) = 81, Freq(C->D) = 19, and 19 > 81 * 20% = 16.2). Actually, we should use 25% instead of 20% as the probability here, so that we have 19 < 81 * 25% = 20.25, and the desired ABDC layout will be generated. Reviewers: djasper, davidxl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20989 llvm-svn: 272203	2016-06-08 21:30:12 +00:00
Sanjay Patel	3929313811	[InstCombine] move fold of select of add/sub to helper function; NFCI llvm-svn: 272199	2016-06-08 21:10:01 +00:00
Reid Kleckner	de3d8b500f	[DebugInfo] Add calling convention support for DWARF and CodeView Summary: Now DISubroutineType has a 'cc' field which should be a DW_CC_ enum. If it is present and non-zero, the backend will emit it as a DW_AT_calling_convention attribute. On the CodeView side, we translate it to the appropriate enum for the LF_PROCEDURE record. I added a new LLVM vendor specific enum to the list of DWARF calling conventions. DWARF does not appear to attempt to standardize these, so I assume it's OK to do this until we coordinate with GCC on how to emit vectorcall convention functions. Reviewers: dexonsmith, majnemer, aaboud, amccarth Subscribers: mehdi_amini, llvm-commits Differential Revision: http://reviews.llvm.org/D21114 llvm-svn: 272197	2016-06-08 20:34:29 +00:00
Sanjay Patel	384d0f219d	[InstCombine] fix outdated comment, simplify logic; NFCI llvm-svn: 272196	2016-06-08 20:31:52 +00:00
Evgeny Stupachenko	3e2f389a7e	Set the unroll-disable pragma when unrolling with a user-specified count has been applied. Summary: Previously SetLoopAlreadyUnrolled() set the disable pragma only if there was some loop metadata. Now it sets the pragma in all cases. This helps to prevent repeated unrolling when -unroll-count=N is given. Reviewers: mzolotukhin Differential Revision: http://reviews.llvm.org/D20765 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 272195	2016-06-08 20:21:24 +00:00
Xinliang David Li	572135f717	[PM] Refactor LoopAccessInfo analysis code This is the preparation patch to port the analysis to new PM Differential Revision: http://reviews.llvm.org/D20560 llvm-svn: 272194	2016-06-08 20:15:37 +00:00
Sanjay Patel	10a2c38d83	[InstCombine] reduce indent; NFC llvm-svn: 272193	2016-06-08 20:09:04 +00:00
Tim Shen	7aa0ad65ce	[MemCpyOpt] Do not exchange llvm.lifetime.start and llvm.memcpy Reviewers: iteratee Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21087 llvm-svn: 272192	2016-06-08 19:42:32 +00:00
Sanjay Patel	916f8a0cdb	[InstCombine] use copyIRFlags() ; NFCI llvm-svn: 272191	2016-06-08 19:33:52 +00:00
Benjamin Kramer	c321e53402	Apply most suggestions of clang-tidy's performance-unnecessary-value-param Avoids unnecessary copies. All changes audited & pass tests with asan. No functional change intended. llvm-svn: 272190	2016-06-08 19:09:22 +00:00
Adrian McCarthy	f3c3c13206	Generate codeview for array type metadata. Differential Revision: http://reviews.llvm.org/D21107 llvm-svn: 272187	2016-06-08 18:22:59 +00:00
George Burgess IV	fd4e2f7cb2	Attempt #2 to appease the buildbots. MSVC calls the copy ctor on StratifiedSets for some reason. So, undelete it. llvm-svn: 272184	2016-06-08 17:56:35 +00:00
Reid Kleckner	ee641c20ca	[codeview] Avoid emitting an empty file checksum table Again, the Microsoft linker does not like empty substreams. We still emit an empty string table if CodeView is enabled, but that doesn't cause problems because it always contains at least one null byte. llvm-svn: 272183	2016-06-08 17:50:29 +00:00
Sanjoy Das	2401c98475	[SCEV] Break out of loop if there is no more work to do This is NFC as far as externally visible behavior is concerned, but will keep us from spinning in the worklist traversal algorithm unnecessarily. llvm-svn: 272182	2016-06-08 17:48:46 +00:00
Sanjoy Das	8598412e24	[SCEV] Track no-abnormal-exits instead of no-throw calls Absence of may-unwind calls is not enough to guarantee that a UB-generating use of an add-rec poison in the loop latch will actually cause UB. We also need to guard against calls that terminate the thread or infinite loop themselves. This partially addresses PR28012. llvm-svn: 272181	2016-06-08 17:48:42 +00:00
Sanjoy Das	9a65cd214d	Teach isGuarantdToTransferExecToSuccessor about debug info intrinsics Calls to `@llvm.dbg.*` can be assumed to terminate. llvm-svn: 272180	2016-06-08 17:48:36 +00:00
Sanjoy Das	a19edc4d15	Fix a bug in SCEV's poison value propagation The worklist algorithm introduced in rL271151 didn't check to see if the direct users of the post-inc add recurrence propagates poison. This change fixes the problem and makes the code structure more obvious. Note for release managers: correctness wise, this bug wasn't a regression introduced by rL271151 -- the behavior of SCEV around post-inc add recurrences was strictly improved (in terms of correctness) in rL271151. llvm-svn: 272179	2016-06-08 17:48:31 +00:00
Quentin Colombet	86be3748a6	[RegBankSelect] Silence an unused variable warning in release mode. llvm-svn: 272177	2016-06-08 17:39:47 +00:00
Quentin Colombet	d6886bd22c	[RegBankSelect] Comment on how we could improve repairing with copies. When repairing with a copy, instead of accounting for the cost of that copy and actually inserting it, we may be able to use an alternative source for the register to repair and just use it. Make sure this is documented, so that we consider that opportunity at some point. llvm-svn: 272176	2016-06-08 17:39:43 +00:00
George Burgess IV	785f391131	Try to appease buildbots. r272064 apparently made them angry. This undoes some changes made in r272064 (defaulting move ctors) to make them happy again. llvm-svn: 272173	2016-06-08 17:27:14 +00:00
Zachary Turner	a1657a9e64	[pdb] Handle stream index errors better. Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21128 llvm-svn: 272172	2016-06-08 17:26:39 +00:00
Rui Ueyama	ced0853b46	Remove a patch .rej file. llvm-svn: 272171	2016-06-08 16:54:31 +00:00
Quentin Colombet	d1cd30b218	[AArch64][RegisterBankInfo] G_OR are fine on either GPR or FPR. Teach AArch64RegisterBankInfo that G_OR can be mapped on either GPR or FPR for 64-bit or 32-bit values. Add test cases demonstrating how this information is used to coalesce a computation on a single register bank. llvm-svn: 272170	2016-06-08 16:53:32 +00:00
Quentin Colombet	ec5c93d3a0	[RegBankSelect] Use RegisterBankInfo applyMapping method. The RegBankSelect pass can now rely on the target to do the remapping of the instructions. llvm-svn: 272169	2016-06-08 16:45:04 +00:00
Quentin Colombet	574a329962	[RegisterBankInfo] Implement the method to apply a mapping. Now, the target will be able to provide its own implementation to remap an instruction. This opens the way to crazier optimizations, but to begin with, we will be able to handle something other than the default mapping. llvm-svn: 272165	2016-06-08 16:39:21 +00:00
Quentin Colombet	f33e36545b	[RegBankSelect] Use the OperandMapper class to hold remap information. Now that we have an entity that holds the remap information, the rewriting should be easier to do. No functional changes. llvm-svn: 272164	2016-06-08 16:30:55 +00:00
Quentin Colombet	06ef4e209d	[RegBankSelect] Use const_iterator instead of iterator for repairReg. The repairing code has no reason to change the source or destination of the registers. llvm-svn: 272163	2016-06-08 16:24:55 +00:00
Quentin Colombet	7a03de5210	[RegisterBankInfo] Introduce OperandsMapper class. This helper class is used to encapsulate the necessary information to remap an instruction. llvm-svn: 272161	2016-06-08 16:18:13 +00:00
Quentin Colombet	a41272fb48	[RegBankSelect] Introduce a command line option to override the running mode. When the command line option is set, it overrides anything that the target may have set. The rationale is that we get what we asked for. Options are respectively regbankselect-fast and regbankselect-greedy for fast and greedy mode. llvm-svn: 272158	2016-06-08 15:49:23 +00:00
Quentin Colombet	6feaf82088	[RegBankSelect] Explain what it would take to support non-copy repairing. Copies are easy because we repair only when there is a mismatch. For non-copy repairing, i.e., cases that involve breaking down or gathering up the value, one of the operands may not have a register bank yet. Thus, deriving a cost from that requires more work. llvm-svn: 272157	2016-06-08 15:40:32 +00:00
Oliver Stannard	b3378e2f3c	[ARM] MSR instructions implicitly set CPSR The MSR instructions can write to the CPSR, but we did not model this fact, so we could emit them in the middle of IT blocks, changing the condition flags for later instructions in the block. The tests use two calls to llvm.write_register.i32 because it is valid to use these instructions at the end of an IT block, which if conversion does do in some cases. With two calls, the first clobbers the flags, so a branch has to be used to make the second one conditional. Differential Revision: http://reviews.llvm.org/D21139 llvm-svn: 272154	2016-06-08 15:26:34 +00:00
Saleem Abdulrasool	1ef925f0bd	Support: correct AArch64 TargetParser implementation The architecture enumeration is shared across ARM and AArch64. However, the data is not. The code incorrectly would index into the array using the architecture index which was offset by the ARMv7 architecture enumeration. We do not have a marker for indicating the architectural family to which the enumeration belongs so we cannot be clever about offsetting the index (at least it is not immediately apparent to me). Instead, fall back to the tried-and-true method of slowly iterating the array (it's not a large array, so the impact of this is not too high). Because of the incorrect indexing, if we were lucky, we would crash, but usually we would return an invalid StringRef. We did not have any tests for the AArch64 target parser previously. Extend the previous tests I had added for ARM to cover AArch64 for ensuring that we return expected StringRefs. Take the opportunity to change some iterator types to references. This work is needed to support parsing `.arch name` directives in the AArch64 target asm parser. llvm-svn: 272145	2016-06-08 14:30:00 +00:00
Davide Italiano	2d5ab0a56a	[PM] LoopSimplify. Remove unneeded pass dependencies. NFCI. llvm-svn: 272140	2016-06-08 13:56:59 +00:00
Davide Italiano	d8d83f4773	[PM/SimplifyCFG] Preserve GlobalsAA even if the IR is mutated. llvm-svn: 272139	2016-06-08 13:32:23 +00:00
Vasileios Kalintiris	a9e5154dc5	[mips] Add a proper file header in MipsFastISel.cpp llvm-svn: 272138	2016-06-08 13:13:15 +00:00
Krzysztof Parzyszek	b16882ddf1	[Hexagon] Modify HexagonExpandCondsets to handle subregisters Also, switch to using functions from LiveIntervalAnalysis to update live intervals, instead of performing the updates manually. Re-committing r272045. llvm-svn: 272135	2016-06-08 12:31:16 +00:00
Diana Picus	0781d10ac4	[ARM] Remove redundant check. NFC isSwift is tested earlier and known to be false when we reach this code. llvm-svn: 272127	2016-06-08 10:29:02 +00:00
Benjamin Kramer	46e38f3678	Avoid copies of std::strings and APInt/APFloats where we only read from it As suggested by clang-tidy's performance-unnecessary-copy-initialization. This can easily hit lifetime issues, so I audited every change and ran the tests under asan, which came back clean. llvm-svn: 272126	2016-06-08 10:01:20 +00:00
Igor Breger	982e4003a6	[AVX512] Fix cvtusi2sd instruction Opcode, it should be 0x7B instead of 0x2A. llvm-svn: 272122	2016-06-08 07:48:23 +00:00
Matt Arsenault	b1630a1487	Make LiveDebugValues preserve CFG llvm-svn: 272117	2016-06-08 05:18:01 +00:00
Kostya Serebryany	53b7b3ca5f	[libFuzzer] add 'weak' back to __sanitizer_malloc_hook and __sanitizer_free_hook llvm-svn: 272116	2016-06-08 04:49:29 +00:00
Kostya Serebryany	76f425211e	[libFuzzer] add a test that is built w/o coverage instrumentation but has the coverage rt (it should now fail with a descriptive message) llvm-svn: 272090	2016-06-08 01:46:13 +00:00
Quentin Colombet	a4ac7cdac2	[AArch64][RegisterBankInfo] Use the generic implementation of copyCost. Long term we may want to give high cost at FPR to/from GPR copies. llvm-svn: 272086	2016-06-08 01:24:00 +00:00
Quentin Colombet	cfbdee2312	[RegisterBankInfo] Add a size argument for the cost of copy. The cost of a copy may be different based on how many bits we have to copy around. E.g., an 8-bit copy may cost differently than a 32-bit copy. llvm-svn: 272084	2016-06-08 01:11:03 +00:00
Quentin Colombet	123a7a55e7	[RegisterBankInfo] Move a hidden function into a static method. NFC. This will allow code reuse in the coming commits. llvm-svn: 272083	2016-06-08 01:04:32 +00:00
Matthias Braun	3ef7df9cdf	MIR: Fix parsing of stack object references in MachineMemOperands The MachineMemOperand parser lacked the code to handle %stack.X references (%fixed-stack.X was working). llvm-svn: 272082	2016-06-08 00:47:07 +00:00
Zachary Turner	d2b2bfed94	[pdb] Try to fix use after free. llvm-svn: 272078	2016-06-08 00:25:08 +00:00
Rui Ueyama	f14a74c102	[pdbdump] Print out # of hash buckets. In the reference code, the field name is `cHashBuckets`. llvm-svn: 272075	2016-06-07 23:53:43 +00:00
Rui Ueyama	d833917f98	[pdbdump] Print out TPI hash key size. llvm-svn: 272073	2016-06-07 23:44:27 +00:00
Dan Liew	1873a496e2	[LibFuzzer] Declare and use sanitizer functions in ``fuzzer::ExternalFunctions`` This fixes linking problems on OSX. Unfortunately it turns out we need to use an instance of the ``fuzzer::ExternalFunctions`` object in several places so this commit also replaces all instances with a single global instance. It also turns out initializing a global ``fuzzer::ExternalFunctions`` before main is entered (i.e. letting the object be initialised by the global initializers) is not safe (on OSX the call to ``Printf()`` in the CTOR crashes if it is called from a global initializer) so we instead have a global ``fuzzer::ExternalFunctions`` and initialize it inside ``FuzzerDriver()``. Multiple unit tests also depend on the ``fuzzer::ExternalFunctions`` global so a ``main()`` function has been added that initializes it before running any tests. Differential Revision: http://reviews.llvm.org/D20943 llvm-svn: 272072	2016-06-07 23:32:50 +00:00
George Burgess IV	60af226b86	[CFLAA] Kill dead code/fix comments in StratifiedSets. Also use default/delete instead of hand-written ctors. Thanks to Jia Chen for bringing this stuff up. llvm-svn: 272064	2016-06-07 21:41:18 +00:00
Nicolai Haehnle	c00e03b8f5	AMDGPU: Add amdgpu-ps-wqm-outputs function attributes Summary: The presence of this attribute indicates that VGPR outputs should be computed in whole quad mode. This will be used by Mesa for prolog pixel shaders, so that derivatives can be taken of shader inputs computed by the prolog, fixing a bug. The generated code could certainly be improved: if a prolog pixel shader is used (which isn't common in modern OpenGL - they're used for gl_Color, polygon stipples, and forcing per-sample interpolation), Mesa will use this attribute unconditionally, because it has to be conservative. So WQM may be used in the prolog when it isn't really needed, and furthermore a silly back-and-forth switch is likely to happen at the boundary between prolog and main shader parts. Fixing this is a bit involved: we'd first have to add a mechanism by which LLVM writes the WQM-related input requirements to the main shader part binary, and then Mesa specializes the prolog part accordingly. At that point, we may as well just compile a monolithic shader... Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95130 Reviewers: arsenm, tstellarAMD, mareko Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D20839 llvm-svn: 272063	2016-06-07 21:37:17 +00:00
Dan Liew	1d0a9fd089	[LibFuzzer] Split the fuzzer-oom.test into two tests. This is necessary because the existing fuzzer-oom.test was Linux specific due to its use of __sanitizer_print_memory_profile() which is only available on Linux right now and so the test would fail on OSX. Differential Revision: http://reviews.llvm.org/D20977 llvm-svn: 272061	2016-06-07 21:23:30 +00:00
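
The threshold arithmetic described in commit 769219b11a (llvm-svn: 272203) can be checked with a short sketch. The helper name `blocksFallthrough` and its integer-percent parameter are hypothetical, chosen only for illustration; they are not the names or the representation used in LLVM's block placement pass.

```cpp
// Hypothetical helper (not LLVM's API): returns true when the competing
// edge (C->D) is hot enough relative to the candidate edge (B->D) that D
// is NOT laid out directly after B.
constexpr bool blocksFallthrough(unsigned FreqOther, unsigned FreqCandidate,
                                 unsigned ThresholdPercent) {
  // Tests FreqOther > FreqCandidate * ThresholdPercent / 100 without
  // leaving integer arithmetic.
  return FreqOther * 100 > FreqCandidate * ThresholdPercent;
}

// Freq(A) = 100 with A->B = 81% and A->C = 19% gives
// Freq(B->D) = 81 and Freq(C->D) = 19.
static_assert(blocksFallthrough(19, 81, 20),
              "19 > 81 * 20% = 16.2, so the layout is ABCD");
static_assert(!blocksFallthrough(19, 81, 25),
              "19 < 81 * 25% = 20.25, so the layout is ABDC");
```

With the 20% threshold the check fires and D is kept away from B; with 25% it does not, which matches the ABDC layout the commit message asks for.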


91383 Commits
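
The AND-to-BIC rewrite described in commit feb9f4243b (llvm-svn: 272251) relies on `a & M` being equal to `a & ~N` when `N` is the bitwise complement of `M`. A minimal sketch of that identity for the mask in the commit message follows; the function names `andWithMask` and `bicWithComplement` are hypothetical and only model the arithmetic, not the instruction selection code.

```cpp
#include <cstdint>

// Original form: a & 0xfffffeec, which needs a constant-pool load on Thumb
// according to the commit message.
constexpr std::uint32_t andWithMask(std::uint32_t A) {
  return A & UINT32_C(0xfffffeec);
}

// BIC form: clear the bits of the complemented mask. The complement is
// 0x113 = 275, built as 255 + 20 (movs r1, #255; adds r1, #20).
constexpr std::uint32_t bicWithComplement(std::uint32_t A) {
  return A & ~static_cast<std::uint32_t>(255u + 20u);
}

static_assert(static_cast<std::uint32_t>(~UINT32_C(0xfffffeec)) == 275u,
              "the complement of the mask is 255 + 20");
static_assert(andWithMask(UINT32_C(0xffffffff)) ==
                  bicWithComplement(UINT32_C(0xffffffff)),
              "both forms agree on an all-ones input");
```

Both functions compute the same value for every input; the second form only changes which immediate has to be materialized.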