llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	8159cf11bc	[X86] Add AVX512 vector zext tests llvm-svn: 259786	2016-02-04 14:06:19 +00:00
Jonas Paulsson	2293685731	[ScheduleDagInstrs] Improved comments llvm-svn: 259783	2016-02-04 13:08:48 +00:00
Simon Pilgrim	1d2d6c5a57	[X86] Moved SEXT -> SIGN_EXTEND_VECTOR_INREG combine into helper. NFC. llvm-svn: 259771	2016-02-04 09:27:19 +00:00
Andrey Turetskiy	bca0f99224	[X86] Use hash table in LEA optimization pass. Use hash table (key is a memory operand) to store found LEA instructions to reduce compile time. Differential Revision: http://reviews.llvm.org/D16404 llvm-svn: 259770	2016-02-04 08:57:03 +00:00
Justin Bogner	e12385bd6a	cmake: Add a flag to enable LTO This adds -DLLVM_ENABLE_LTO, rather than forcing people to manually add -flto to the various _FLAGS variables. llvm-svn: 259766	2016-02-04 07:28:30 +00:00
Craig Topper	775fb73de7	[Support] Use range-based for loop. NFC llvm-svn: 259763	2016-02-04 06:51:41 +00:00
Craig Topper	d08f32f66a	[Support] Use hexdigit instead of manually coding the same thing. NFC llvm-svn: 259762	2016-02-04 06:51:38 +00:00
Xinliang David Li	1e4c809c6c	[PGO] Profile interface cleanup - Remove unused valuemapper parameter - add totalcount optional parameter llvm-svn: 259756	2016-02-04 05:29:51 +00:00
Jingyue Wu	f650441b04	[NVPTX] Disable performance optimizations when OptLevel==None Reviewers: jholewinski, tra, eliben Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D16874 llvm-svn: 259749	2016-02-04 04:15:36 +00:00
Nemanja Ivanovic	155402c9c2	Test case for PR 26381 llvm-svn: 259740	2016-02-04 01:58:20 +00:00
Wei Mi	a49559befb	[SCEV] Try to reuse existing value during SCEV expansion Current SCEV expansion will expand SCEV as a sequence of operations and doesn't utilize the value already existed. This will introduce redundent computation which may not be cleaned up throughly by following optimizations. This patch introduces an ExprValueMap which is a map from SCEV to the set of equal values with the same SCEV. When a SCEV is expanded, the set of values is checked and reused whenever possible before generating a sequence of operations. The original commit triggered regressions in Polly tests. The regressions exposed two problems which have been fixed in current version. 1. Polly will generate a new function based on the old one. To generate an instruction for the new function, it builds SCEV for the old instruction, applies some tranformation on the SCEV generated, then expands the transformed SCEV and insert the expanded value into new function. Because SCEV expansion may reuse value cached in ExprValueMap, the value in old function may be inserted into new function, which is wrong. In SCEVExpander::expand, there is a logic to check the cached value to be used should dominate the insertion point. However, for the above case, the check always passes. That is because the insertion point is in a new function, which is unreachable from the old function. However for unreachable node, DominatorTreeBase::dominates thinks it will be dominated by any other node. The fix is to simply add a check that the cached value to be used in expansion should be in the same function as the insertion point instruction. 2. When the SCEV is of scConstant type, expanding it directly is cheaper than reusing a normal value cached. Although in the cached value set in ExprValueMap, there is a Constant type value, but it is not easy to find it out -- the cached Value set is not sorted according to the potential cost. Existing reuse logic in SCEVExpander::expand simply chooses the first legal element from the cached value set. The fix is that when the SCEV is of scConstant type, don't try the reuse logic. simply expand it. Differential Revision: http://reviews.llvm.org/D12090 llvm-svn: 259736	2016-02-04 01:27:38 +00:00
Richard Smith	69cb000974	Fix undefined behavior when compiling in C++14 mode (with sized deletion enabled): ensure that we do not invoke the sized deallocator for MemoryBuffer subclasses that have tail-allocated data. llvm-svn: 259735	2016-02-04 01:21:16 +00:00
Reid Kleckner	cb91e7d395	[codeview] Don't attempt a cross-section label diff This only comes up when we're trying to find the next .cv_loc label. Fixes PR26467 llvm-svn: 259733	2016-02-04 00:21:42 +00:00
Kostya Serebryany	ce925c580e	[libFuzzer] hot fix a test llvm-svn: 259732	2016-02-04 00:12:28 +00:00
Kostya Serebryany	b92602ada0	[libFuzzer] don't write the test unit when a leak is detected (since we don't know which unit causes the leak) llvm-svn: 259731	2016-02-04 00:02:17 +00:00
Gerolf Hoflehner	2432bd0ddd	[SimplifyCFG] Fix for "endless" loop after dead code removal (Alternative to D16251) Summary: This is a simpler fix to the problem than the dominator approach in http://reviews.llvm.org/D16251. It adds only values into the gather() while loop that have been seen before. The actual endless loop is in the constant compare gather() routine in Utils/SimplifyCFG.cpp. The same value ret.0.off0.i is pushed back into the queue: %.ret.0.off0.i = or i1 %.ret.0.off0.i, %cmp10.i Here is what happens at the IR level: for.cond.i: ; preds = %if.end6.i, %if.end.i54 %ix.0.i = phi i32 [ 0, %if.end.i54 ], [ %inc.i55, %if.end6.i ] %ret.0.off0.i = phi i1 [false, %if.end.i54], [%.ret.0.off0.i, %if.end6.i] <<< %cmp2.i = icmp ult i32 %ix.0.i, %11 br i1 %cmp2.i, label %for.body.i, label %LBJ_TmpSimpleNeedExt.exit if.end6.i: ; preds = %for.body.i %cmp10.i = icmp ugt i32 %conv.i, %add9.i %.ret.0.off0.i = or i1 %ret.0.off0.i, %cmp10.i <<< When if.end.i54 gets eliminated which removes the definition of ret.0.off0.i. The result is the expression %.ret.0.off0.i = or i1 %.ret.0.off0.i, %cmp10.i (Note the first ‘or’ operand is now %.ret.0.off0.i, and NOT %ret.0.off0.i). And now there is use of .ret.0.off0.i before a definition which triggers the “endless” loop in gather(): while(!DFT.empty()) { V = DFT.pop_back_val(); // V is .ret.0.off0.i if (Instruction *I = dyn_cast<Instruction>(V)) { // If it is a \|\| (or && depending on isEQ), process the operands. if (I->getOpcode() == (isEQ ? Instruction::Or : Instruction::And)) { DFT.push_back(I->getOperand(1)); // This is now .ret.0.off0.i also DFT.push_back(I->getOperand(0)); continue; // “endless loop” for .ret.0.off0.i } Reviewers: reames, ahatanak Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16839 llvm-svn: 259730	2016-02-03 23:54:25 +00:00
Vedant Kumar	2d5b5d3d3a	[InstrProfiling] Fix a comment (NFC) llvm-svn: 259727	2016-02-03 23:22:43 +00:00
David L Kreitzer	f24d409dce	Unify the target opcode enum in TargetOpcodes.h and the FixedInstrs array in CodeGenTarget.cpp to avoid the ordering dependence. NFCI. Differential Revision: http://reviews.llvm.org/D16826 llvm-svn: 259726	2016-02-03 23:17:32 +00:00
Junmo Park	e90057a5f3	Minor code cleanups. NFC. llvm-svn: 259725	2016-02-03 23:16:39 +00:00
David Majnemer	d74490f2cc	Print the OffsetStart field's relocation llvm-svn: 259723	2016-02-03 22:45:21 +00:00
Sanjay Patel	e9fa3363b4	rangify; NFCI llvm-svn: 259722	2016-02-03 22:44:14 +00:00
Sanjay Patel	460ce9cd9b	clean up; NFC llvm-svn: 259720	2016-02-03 22:37:37 +00:00
David Majnemer	ac10cfde97	[llvm-readobj] Add support for dumping S_DEFRANGE symbols llvm-svn: 259719	2016-02-03 22:36:46 +00:00
Reid Kleckner	0d4ecb6ff5	Replace static const int with enum to fix obnoxious linker errors about a missing definition llvm-svn: 259712	2016-02-03 21:45:39 +00:00
Reid Kleckner	17495274fd	[unittests] Move TargetRegistry test from Support to MC This removes the dependency from SupportTests to all of the LLVM backends, and makes it link faster. llvm-svn: 259705	2016-02-03 21:41:24 +00:00
Reid Kleckner	c2e2311627	Silence -Wsign-conversion issue in ProgramTest.cpp Unfortunately, ProgramInfo::ProcessId is signed on Unix and unsigned on Windows, breaking the standard fix of using '0U' in the gtest expectation. llvm-svn: 259704	2016-02-03 21:41:12 +00:00
Ana Pazos	b3596028cf	Fix pointers to go on the right hand side. NFC. Summary: Fixed pointers to go on the right hand side following coding guidelines. NFC. Patch by Mandeep Singh Grang. Reviewers: majnemer, arsenm, sanjoy Differential Revision: http://reviews.llvm.org/D16866 llvm-svn: 259703	2016-02-03 21:34:39 +00:00
David Majnemer	a53b5bbb18	[LoopStrengthReduce] Don't rewrite PHIs with incoming values from CatchSwitches Bail out if we have a PHI on an EHPad that gets a value from a CatchSwitchInst. Because the CatchSwitchInst cannot be split, there is no good place to stick any instructions. This fixes PR26373. llvm-svn: 259702	2016-02-03 21:30:34 +00:00
David Majnemer	fa8681e452	[ScalarEvolutionExpander] Simplify findInsertPointAfter No functional change is intended. The loop could only execute, at most, once. llvm-svn: 259701	2016-02-03 21:30:31 +00:00
Reid Kleckner	eb3bcdd28b	[codeview] Remove EmitLabelDiff in favor emitAbsoluteSymbolDiff llvm-svn: 259700	2016-02-03 21:24:42 +00:00
Reid Kleckner	dac21b43d5	[codeview] Use the MCStreamer interface directly instead of AsmPrinter This is mostly about having shorter lines and standardizing on one interface, but it also avoids some needless indirection. No functional change. llvm-svn: 259697	2016-02-03 21:15:48 +00:00
Keno Fischer	6c1e47a66b	[DWARFDebug] Fix another case of overlapping ranges Summary: In r257979, I added code to ensure that we wouldn't merge DebugLocEntries if the pieces they describe overlap. Unfortunately, I failed to cover the case, where there may have multiple active Expressions in the entry, in which case we need to make sure that no two values overlap before we can perform the merge. This fixed PR26148. Reviewers: aprantl Differential Revision: http://reviews.llvm.org/D16742 llvm-svn: 259696	2016-02-03 21:13:33 +00:00
Todd Fiala	675bdcedff	Address NDEBUG-related linkage issues for Value::assertModuleIsMaterialized() The IR/Value class had a linkage issue present when LLVM was built as a library, and the LLVM library build time had different settings for NDEBUG than the client of the LLVM library. Clients could get into a state where the LLVM lib expected Value::assertModuleIsMaterialized() to be inline-defined in the header but clients expected that method to be defined in the LLVM library. See this llvm-commits thread for more details: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20160201/329667.html llvm-svn: 259695	2016-02-03 21:13:23 +00:00
Tim Shen	f99f0d5a7e	[SelectionDAG] Fix CombineToPreIndexedLoadStore O(n^2) behavior This patch consists of two parts: a performance fix in DAGCombiner.cpp and a correctness fix in SelectionDAG.cpp. The test case tests the bug that's uncovered by the performance fix, and fixed by the correctness fix. The performance fix keeps the containers required by the hasPredecessorHelper (which is a lazy DFS) and reuse them. Since hasPredecessorHelper is called in a loop, the overall efficiency reduced from O(n^2) to O(n), where n is the number of SDNodes. The correctness fix keeps iterating the neighbor list even if it's time to early return. It will return after finishing adding all neighbors to Worklist, so that no neighbors are discarded due to the original early return. llvm-svn: 259691	2016-02-03 20:58:55 +00:00
Reid Kleckner	45b6159ed3	Minor performance tweaks to llvm-tblgen (and a few that might be a good idea) Summary: This patch adds a reserve call to an expensive function (`llvm::LoadIntrinsics`), and may fix a few other low hanging performance fruit (I've put them in comments for now, so we can discuss). Motivation: As I'm sure other developers do, when I build LLVM, I build the entire project with the same config (`Debug`, `MinSizeRel`, `Release`, or `RelWithDebInfo`). However, the `Debug` config also builds llvm-tblgen in `Debug` mode. Later build steps that run llvm-tblgen then can actually be the slowest steps in the entire build. Nobody likes slow builds. Reviewers: rnk, dblaikie Differential Revision: http://reviews.llvm.org/D16832 Patch by Alexander G. Riccio llvm-svn: 259683	2016-02-03 19:34:28 +00:00
Saleem Abdulrasool	f36005a358	ARM: support TLS for WoA Add support for TLS access for Windows on ARM. This generates a similar access to MSVC for ARM. The changes to the tablegen data is needed to support loading an external symbol global that is not for a call. The adjustments to the DAG to DAG transforms are needed to preserve the 32-bit move. llvm-svn: 259676	2016-02-03 18:21:59 +00:00
Wei Mi	97de385868	Revert r259662, which caused regressions on polly tests. llvm-svn: 259675	2016-02-03 18:05:57 +00:00
Quentin Colombet	7ec03dc7f8	[InstCombine] Revert r238452: Fold IntToPtr and PtrToInt into preceding loads. According to git bisect, this is the root cause of a miscompile for Regex in libLLVMSupport. I am still working on reducing a test case. The actual bug may be elsewhere and this commit just exposed it. Anyway, at the moment, to reproduce, follow these steps: 1. Build clang and libLTO in release mode. 2. Create a new build directory <stage2> and cd into it. 3. Use clang and libLTO from #1 to build llvm-extract in Release mode + asserts using -O2 -flto 4. Run llvm-extract -ralias '.bar' -S test/Other/extract-alias.ll Result: program doesn't contain global named '.bar'! Expected result: @a0a0bar = alias void ()* @bar @a0bar = alias void ()* @bar declare void @bar() Note: In step #3, if you don't use lto or asserts, the miscompile disappears. llvm-svn: 259674	2016-02-03 18:04:13 +00:00
Jonas Paulsson	ac29f01788	[ScheduleDAGInstrs::buildSchedGraph()] Handling of memory dependecies rewritten. Recommited, after some fixing with test cases. Updated test cases: test/CodeGen/AArch64/arm64-misched-memdep-bug.ll test/CodeGen/AArch64/tailcall_misched_graph.ll Temporarily disabled test cases: test/CodeGen/AMDGPU/split-vector-memoperand-offsets.ll test/CodeGen/PowerPC/ppc64-fastcc.ll (partially updated) test/CodeGen/PowerPC/vsx-fma-m.ll test/CodeGen/PowerPC/vsx-fma-sp.ll http://reviews.llvm.org/D8705 Reviewers: Hal Finkel, Andy Trick. llvm-svn: 259673	2016-02-03 17:52:29 +00:00
Xinliang David Li	ae2556f7ee	Fix comments /NFC llvm-svn: 259672	2016-02-03 17:51:16 +00:00
Joseph Tremoulet	e1014a398e	[Unittest] Clean up formatting, NFC Summary: Use an early return to reduce indentation. Remove unused local. Reviewers: dblaikie, lhames Subscribers: lhames, llvm-commits Differential Revision: http://reviews.llvm.org/D16513 llvm-svn: 259663	2016-02-03 17:11:24 +00:00
Wei Mi	ed133978a0	[SCEV] Try to reuse existing value during SCEV expansion Current SCEV expansion will expand SCEV as a sequence of operations and doesn't utilize the value already existed. This will introduce redundent computation which may not be cleaned up throughly by following optimizations. This patch introduces an ExprValueMap which is a map from SCEV to the set of equal values with the same SCEV. When a SCEV is expanded, the set of values is checked and reused whenever possible before generating a sequence of operations. Differential Revision: http://reviews.llvm.org/D12090 llvm-svn: 259662	2016-02-03 17:05:12 +00:00
Renato Golin	6027dd38ef	[ARM] Move GNUEABI divmod to __aeabi_divmod* The GNU toolchain emits __aeabi_divmod for soft-divide on ARM cores which happens to be a lot faster than __divsi3/__modsi3 when the core has hardware divide instructions. Do the same here. Fixes PR26450. llvm-svn: 259657	2016-02-03 16:10:54 +00:00
Jun Bum Lim	59df5e89c2	[MachineCopyPropagation] Fix comment. NFC Reviewers: MatzeB, qcolombet, jmolloy, mcrosier Subscribers: llvm-commits, mcrosier Differential Revision: http://reviews.llvm.org/D16806 llvm-svn: 259656	2016-02-03 15:56:27 +00:00
Daniel Sanders	3b1a2dbffa	[mips] Remove redundant inclusions of MipsAnalyzeImmediate.h llvm-svn: 259655	2016-02-03 15:54:12 +00:00
James Molloy	6e518a3b50	[DemandedBits] Revert r249687 due to PR26071 This regresses a test in LoopVectorize, so I'll need to go away and think about how to solve this in a way that isn't broken. From the writeup in PR26071: What's happening is that ComputeKnownZeroes is telling us that all bits except the LSB are zero. We're then deciding that only the LSB needs to be demanded from the icmp's inputs. This is where we're wrong - we're assuming that after simplification the bits that were known zero will continue to be known zero. But they're not - during trivialization the upper bits get changed (because an XOR isn't shrunk), so the icmp fails. The fault is in demandedbits - its contract does clearly state that a non-demanded bit may either be zero or one. llvm-svn: 259649	2016-02-03 15:05:06 +00:00
Nemanja Ivanovic	82e1168989	Fix for PR 26381 Simple fix - Constant values were not being sign extended in FastIsel. llvm-svn: 259645	2016-02-03 12:53:38 +00:00
Simon Atanasyan	e774126c96	[mips] Add SHF_MIPS_GPREL flag to the MIPS .sbss and .sdata sections MIPS ABI states that .sbss and .sdata sections must have SHF_MIPS_GPREL flag. See Figure 4–7 on page 69 in the following document: ftp://www.linux-mips.org/pub/linux/mips/doc/ABI/mipsabi.pdf. Differential Revision: http://reviews.llvm.org/D15740 llvm-svn: 259641	2016-02-03 11:50:22 +00:00
Dylan McKay	bff960a926	[TableGen] Add 'register alternative name matching' support Summary: This adds a new attribute which targets can set in TableGen which causes a function to be generated which matches register alternative names. This is very similar to `ShouldEmitMatchRegisterName`, except it works on alt names. This patch is currently used by the out of tree part of the AVR backend. It reduces code duplication greatly, and has the effect that you do not need to hardcode altname to register mappings in C++. It will not work on targets which have registers which share the same aliases. Reviewers: stoklund, arsenm, dsanders, hfinkel, vkalintiris Subscribers: hfinkel, dylanmckay, llvm-commits Differential Revision: http://reviews.llvm.org/D16312 llvm-svn: 259636	2016-02-03 10:30:16 +00:00
Simon Pilgrim	18bcf93efb	[X86][AVX] Add support for 64-bit VZEXT_LOAD of 256/512-bit vectors to EltsFromConsecutiveLoads Follow up to D16217 and D16729 This change uncovered an odd pattern where VZEXT_LOAD v4i64 was being lowered to a load of the lower v2i64 (so the 2nd i64 destination element wasn't being zeroed), I can't find any use/reason for this and have removed the pattern and replaced it so only the 1st i64 element is loaded and the upper bits all zeroed. This matches the description for X86ISD::VZEXT_LOAD Differential Revision: http://reviews.llvm.org/D16768 llvm-svn: 259635	2016-02-03 09:41:59 +00:00
Xinliang David Li	876ed52c8a	Add a compatibility test llvm-svn: 259632	2016-02-03 06:27:38 +00:00
Xinliang David Li	3c88288927	Fix a typo in comment llvm-svn: 259631	2016-02-03 06:24:11 +00:00
Xinliang David Li	a398d2d94a	Fix uninitiazed variable use problem llvm-svn: 259630	2016-02-03 06:23:16 +00:00
Xinliang David Li	6c93ee8d36	[PGO] Profile summary reader/writer support With this patch, the profile summary data will be available in indexed profile data file so that profiler reader/compiler optimizer can start to make use of. Differential Revision: http://reviews.llvm.org/D16258 llvm-svn: 259626	2016-02-03 04:08:18 +00:00
Peter Collingbourne	0c0d7e2d0f	LowerBitSets: Don't bother to do any work if the llvm.bitset.test intrinsic is unused. llvm-svn: 259625	2016-02-03 03:48:46 +00:00
Peter Collingbourne	83cc981c49	Add #include "llvm/Support/raw_ostream.h" to fix Windows build. llvm-svn: 259623	2016-02-03 03:16:37 +00:00
Peter Collingbourne	9f7ec14009	Transforms: Move GlobalOpt's Evaluator to Utils where it can be reused. llvm-svn: 259621	2016-02-03 02:51:00 +00:00
Nick Lewycky	a093ab4ad6	Fix typo in comment. NFC llvm-svn: 259620	2016-02-03 02:15:49 +00:00
Peter Collingbourne	4e3605a2af	docs: Document how bitsets may be used to encode type information. llvm-svn: 259619	2016-02-03 02:01:08 +00:00
Kyle Butt	d62d8b771d	Codegen: [PPC] Fix PPCVSXFMAMutate to handle duplicates. The purpose of PPCVSXFMAMutate is to elide copies by changing FMA forms on PPC. %vreg6<def> = COPY %vreg96 %vreg6<def,tied1> = XSMADDASP %vreg6<tied0>, %vreg5<kill>, %vreg7 ;v6 = v6 + v5 * v7 is replaced by %vreg5<def,tied1> = XSMADDMSP %vreg5<tied0>, %vreg7, %vreg96 ;v5 = v5 * v7 + v96 This was broken in the case where the target register was also used as a multiplicand. Fix this case by checking for it and replacing both uses with the copied register. %vreg6<def> = COPY %vreg96 %vreg6<def,tied1> = XSMADDASP %vreg6<tied0>, %vreg5<kill>, %vreg6 ;v6 = v6 + v5 * v6 is replaced by %vreg5<def,tied1> = XSMADDMSP %vreg5<tied0>, %vreg96, %vreg96 ;v5 = v5 * v96 + v96 llvm-svn: 259617	2016-02-03 01:41:09 +00:00
Yunzhong Gao	eb959722a7	Revert r259576: Disable the vzeroupper insertion pass on PS4. Will re-implement based on review feedback. llvm-svn: 259615	2016-02-03 01:25:12 +00:00
Marcello Maggioni	bfe87568aa	RegCoalescer: Making sure re-materialization defines all subranges The register coalescer can rematerialize constants that define more of a register than the copy it is going to replace was going to do. This is valid in the case the register was undef before the copy happened. This patch makes sure that all the subranges defined by the new rematerialization instructions have at least a dead def. Review: http://reviews.llvm.org/D16693 llvm-svn: 259614	2016-02-03 00:22:32 +00:00
NAKAMURA Takumi	a8d480d9d5	DiagnosticInfoWithDebugLocBase: Appease Twine for now. FIXME: We should get rid of Twine in the record. llvm-svn: 259612	2016-02-03 00:09:22 +00:00
Adam Nemet	d52ed84160	[LoopVersioning] Expose loop versioning as a pass too Summary: LoopVersioning is a transform utility that transform passes can use to run-time disambiguate may-aliasing accesses. I'd like to also expose as pass to allow it to be unit-tested. I am planning to add support for non-aliasing annotation in LoopVersioning and I'd like to be able to write tests directly using this pass. (After that feature is done, the pass could also be used to look for optimization opportunities that are hidden behind incomplete alias information at compile time.) The pass drives LoopVersioning in its default way which is to fully disambiguate may-aliasing accesses no matter how many checks are required. Reviewers: hfinkel, ashutosh.nema, sbaranga Subscribers: zzheng, mssimpso, llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D16612 llvm-svn: 259610	2016-02-03 00:06:10 +00:00
George Burgess IV	60adac46f2	Attempt #2 to unbreak r259595. llvm-svn: 259602	2016-02-02 23:26:01 +00:00
David Majnemer	30579ec851	[codeview] Improve readability of codeview assembly output Strictly speaking, this is not an improvement in functionality per se but a usability improvement to those debugging codeview. llvm-svn: 259601	2016-02-02 23:18:23 +00:00
Kostya Serebryany	d88d1305c4	[libFuzzer] don't create too many trace-based mutations as it may be too slow llvm-svn: 259600	2016-02-02 23:17:45 +00:00
George Burgess IV	b5a229f779	Attempt to fix builds broken by r259595. llvm-svn: 259599	2016-02-02 23:15:26 +00:00
George Burgess IV	e1100f533f	This patch adds MemorySSA to LLVM. Please see include/llvm/Transforms/Utils/MemorySSA.h for a description of MemorySSA, and what it does. Differential Revision: http://reviews.llvm.org/D7864 llvm-svn: 259595	2016-02-02 22:46:49 +00:00
Philip Reames	b7571043f2	[LVI] Fix debug output Due to staleness in a patch I committed yesterday, the debug output was reporting overdefined cases as being undefined. Confusing to say the least. The mistake appears to have only effected the debug output thankfully. llvm-svn: 259594	2016-02-02 22:43:08 +00:00
Anna Zaks	3b50e70bbe	[asan] Add iOS support to AddressSanitzier Differential Revision: http://reviews.llvm.org/D15625 llvm-svn: 259586	2016-02-02 22:05:07 +00:00
Philip Reames	ed8cd0d36e	[LVI] Code motion only [NFC] I introduced a declaration in 259583 to keep the diff readable. This change just moves the definition up to remove the declaration again. llvm-svn: 259585	2016-02-02 22:03:19 +00:00
Philip Reames	d1f829d374	[LVI] Refactor to use newly introduced intersect utility This patch uses the newly introduced 'intersect' utility (from 259461: [LVI] Introduce an intersect operation on lattice values) to simplify existing code in LVI. While not introducing any new concepts, this change is probably not NFC. The common 'intersect' function is more powerful that the ad-hoc implementations we'd had in a couple of places. Given that, we may see optimizations triggering a bit more often. llvm-svn: 259583	2016-02-02 21:57:37 +00:00
Justin Bogner	246345a834	Remove utils/buildit The autoconf build system was removed - this doesn't even work and doesn't need to be here. llvm-svn: 259582	2016-02-02 21:56:16 +00:00
Hemant Kulkarni	782edae7d6	Correct size calculations for ELF files llvm-svn: 259578	2016-02-02 21:41:49 +00:00
Yunzhong Gao	b76ccacfb1	Disable the vzeroupper insertion pass on PS4. See comments in test/CodeGen/X86/avx-vzeroupper.ll for more explanation. Original patch by: Sean Silva llvm-svn: 259576	2016-02-02 21:39:23 +00:00
Lang Hames	3923698b3f	[Orc] Stub addresses should be based on stub size, not pointer size. This didn't affect X86_64, which is the only client of this code at the moment, as stubs and pointers are both 8-bytes there. It will affect other platforms though. llvm-svn: 259575	2016-02-02 21:38:30 +00:00
Matt Arsenault	de4208122b	AMDGPU: Do not promote allocas with non-inbounds GEPs If we can't assume the pointer value isn't within the bounds of the object, it seems risky to try to replace the pointer calculations. llvm-svn: 259573	2016-02-02 21:16:12 +00:00
Matt Arsenault	7e747f1a38	AMDGPU: Handle promoting memmove Also add missing tests for the others. llvm-svn: 259558	2016-02-02 20:28:10 +00:00
Quentin Colombet	b8fb2ba1bb	[X86] Fix the merging of SP updates in prologue/epilogue insertions. When the merging was involving LEAs, we were taking the wrong immediate from the list of operands. rdar://problem/24446069 llvm-svn: 259553	2016-02-02 20:11:17 +00:00
Matthias Braun	1377fd6781	MachineVerifier: Check that defs/uses are live in subregisters as well. llvm-svn: 259552	2016-02-02 20:04:51 +00:00
Matt Arsenault	8b175672cb	AMDGPU: Skip promote alloca with no optimizations llvm-svn: 259551	2016-02-02 19:32:42 +00:00
Matt Arsenault	fb8cdbae0c	AMDGPU: Minor cleanups for AMDGPUPromoteAlloca Mostly convert to use range loops. llvm-svn: 259550	2016-02-02 19:32:35 +00:00
Lang Hames	e28b118be0	[Orc] Turn OrcX86_64::IndirectStubsInfo into a template helper class: GenericIndirectStubsInfo. This will allow architecture support classes for other architectures to re-use this code. llvm-svn: 259549	2016-02-02 19:31:15 +00:00
David Majnemer	c9911f28e5	[codeview] Correctly handle inlining functions post-dominated by unreachable CodeView requires us to accurately describe the extent of the inlined code. We did this by grabbing the next debug location in source order and using that to denote where we stopped inlining. However, this is not sufficient or correct in instances where there is no next debug location or the next debug location belongs to the start of another function. To get this correct, use the end symbol of the function to denote the last possible place the inlining could have stopped at. llvm-svn: 259548	2016-02-02 19:22:34 +00:00
Matt Arsenault	e5737f7cac	AMDGPU: Report AMDGPUPromoteAlloca changed the function llvm-svn: 259547	2016-02-02 19:18:57 +00:00
Matt Arsenault	ad1348459f	AMDGPU: Whitelist handled intrinsics We shouldn't crash on unhandled intrinsics. Also simplify failure handling in loop. llvm-svn: 259546	2016-02-02 19:18:53 +00:00
Matt Arsenault	853a1fc6d9	AMDGPU: Use inbounds when calculating workitem offset When promoting allocas to LDS, we know we are indexing into a specific area just created, and the calculation will also never overflow. Also emit some of the muls as nsw nuw, because instcombine infers this already from the range metadata. I think putting this on the other adds and muls might be OK too, but I'm not 100% sure. llvm-svn: 259545	2016-02-02 19:18:48 +00:00
Eugene Zelenko	ecefe5a81f	Fix Clang-tidy readability-redundant-control-flow warnings; other minor fixes. Differential revision: http://reviews.llvm.org/D16793 llvm-svn: 259539	2016-02-02 18:20:45 +00:00
Reid Kleckner	1fcd610c94	[codeview] Wire up the .cv_inline_linetable directive This directive emits the binary annotations that describe line and code deltas in inlined call sites. Single-stepping through inlined frames in windbg now works. llvm-svn: 259535	2016-02-02 17:41:18 +00:00
Derek Schuff	c6d8fd3f54	[MC] Enable eip-relative addressing on x86-64 for X32 ABI Summary: Enables eip-based addressing, e.g., lea constant(%eip), %rax lea constant(%eip), %eax in MC, (used for the x32 ABI). EIP-base addressing is also valid in x86_64, it is left enabled for that architecture as well. Patch by João Porto Differential Revision: http://reviews.llvm.org/D16581 llvm-svn: 259528	2016-02-02 17:20:04 +00:00
Chad Rosier	1142f3cf90	[AArch64] Add a FIXME comment. llvm-svn: 259515	2016-02-02 15:22:55 +00:00
Chad Rosier	bba881ef3d	[AArch64] Allocate the modified and used regs only once per function. llvm-svn: 259510	2016-02-02 15:02:30 +00:00
JF Bastien	926b189a81	WebAssembly: update expected GCC torture test failures The 3 programs used __attribute__((mode(?))) on enum, which clang r259497 fixed. llvm-svn: 259508	2016-02-02 14:27:34 +00:00
Oliver Stannard	7e7d983a87	Refactor backend diagnostics for unsupported features Re-commit of r258951 after fixing layering violation. The BPF and WebAssembly backends had identical code for emitting errors for unsupported features, and AMDGPU had very similar code. This merges them all into one DiagnosticInfo subclass, that can be used by any backend. There should be minimal functional changes here, but some AMDGPU tests have been updated for the new format of errors (it used a slightly different format to BPF and WebAssembly). The AMDGPU error messages will now benefit from having precise source locations when debug info is available. llvm-svn: 259498	2016-02-02 13:52:43 +00:00
Simon Pilgrim	96fe4ef5f7	[X86][AVX512] Add support for AVX512 VMOVQ (load) shuffle decoding llvm-svn: 259496	2016-02-02 13:32:56 +00:00
JF Bastien	dc1255f02f	WebAssembly: add option to disable register coloring Having this hidden option makes it easier to debug other issues. llvm-svn: 259482	2016-02-02 09:30:01 +00:00
Sjoerd Meijer	ffe19f5245	Removed FeatureVFPOnlySP from the Cortex-R7 processor model description and changed the regression test accordingly. The default configuration of a Cortex-R7 is to implement the VFPv3-D16 architecture and the feature line as it was is too restrictive. llvm-svn: 259480	2016-02-02 09:28:20 +00:00
David Majnemer	ccc809e2e6	[RegisterCoalescer] Better DebugLoc for reMaterializeTrivialDef When rematerializing a computation by replacing the copy, use the copy's location. The location of the copy is more representative of the original program. This partially fixes PR10003. llvm-svn: 259469	2016-02-02 06:41:55 +00:00
Chandler Carruth	a4499e9f73	[LCG] Build an edge abstraction for the LazyCallGraph and use it to differentiate between indirect references to functions an direct calls. This doesn't do a whole lot yet other than change the print out produced by the analysis, but it lays the groundwork for a very major change I'm working on next: teaching the call graph to actually be a call graph, modeling both the indirect reference graph and the call graph simultaneously. More details on that in the next patch though. The rest of this is essentially a bunch of over-engineering that won't be interesting until the next patch. But this also isolates essentially all of the churn necessary to introduce the edge abstraction from the very important behavior change necessary in order to separately model the two graphs. So it should make review of the subsequent patch a bit easier at the cost of making this patch seem poorly motivated. ;] Differential Revision: http://reviews.llvm.org/D16038 llvm-svn: 259463	2016-02-02 03:57:13 +00:00
Philip Reames	44456b8963	[LVI] Introduce an intersect operation on lattice values LVI has several separate sources of facts - edge local conditions, recursive queries, assumes, and control independent value facts - which all apply to the same value at the same location. The existing implementation was very conservative about exploiting all of these facts at once. This change introduces an "intersect" function specifically to abstract the action of picking a good set of facts from all of the separate facts given. At the moment, this function is relatively simple (i.e. mostly just reuses the bits which were already there), but even the minor additions reveal the inherent power. For example, JumpThreading is now capable of doing an inductive proof that a particular value is always positive and removing a half range check. I'm currently only using the new intersect function in one place. If folks are happy with the direction of the work, I plan on making a series of small changes without review to replace mergeIn with intersect at all the appropriate places. Differential Revision: http://reviews.llvm.org/D14476 llvm-svn: 259461	2016-02-02 03:15:40 +00:00
Kostya Serebryany	bfbe7fc404	[libFuzzer] allow passing 1 or more files as individual inputs llvm-svn: 259459	2016-02-02 03:03:47 +00:00
Matthias Braun	579c9cda13	MachineVerifier: Use report_context() instead of ad-hoc messages. llvm-svn: 259457	2016-02-02 02:44:25 +00:00
Sanjoy Das	881de4d12a	[X86] Fix a bug in getMemOpBaseRegImmOfs Fix a crash in `getMemOpBaseRegImmOfs` that happens if the base of `MemOp` is a frame index memory operand. The fix is to have `getMemOpBaseRegImmOfs` bail out in such cases. We can possibly be more clever here, if needed. llvm-svn: 259456	2016-02-02 02:32:43 +00:00
Kostya Serebryany	078e984d8d	[libFuzzer] fail if the corpus dir does not exist llvm-svn: 259454	2016-02-02 02:07:26 +00:00
Ahmed Bougacha	68a8efa374	[X86][FastISel] Don't force Nearest-Even rounding for VCVTPS2PH, use MXCSR. FastISel counterpart to r259448. llvm-svn: 259449	2016-02-02 01:44:03 +00:00
Ahmed Bougacha	55c6682ae2	[X86] Don't force Nearest-Even rounding for VCVTPS2PH, use MXCSR. Officially, we don't acknowledge non-default configurations of MXCSR, as getting there would require usage of the FENV_ACCESS pragma (at least insofar as rounding mode is concerned). We don't support the pragma, so we can assume that the default rounding mode - round to nearest, ties to even - is always used. However, it's inconsistent with the rest of the instruction set, where MXCSR is always effective (unless otherwise specified). Also, it's an unnecessary obstacle to the few brave souls that use fenv.h with LLVM. Avoid the hard-coded rounding mode for fp_to_f16; use MXCSR instead. llvm-svn: 259448	2016-02-02 01:32:50 +00:00
Anna Zaks	cad7994c3b	[safestack] Make sure the unsafe stack pointer is popped in all cases The unsafe stack pointer is only popped in moveStaticAllocasToUnsafeStack so it won't happen if there are no static allocas. Fixes https://llvm.org/bugs/show_bug.cgi?id=26122 Differential Revision: http://reviews.llvm.org/D16339 llvm-svn: 259447	2016-02-02 01:03:11 +00:00
Philip Reames	2c275cc686	[LVI] Fix a latent bug in getValueAt This routine was returning Undefined for most queries. This was utterly wrong. Amusingly, we do not appear to have any callers of this which are actually trying to exploit unreachable code or this would have broken the world. A better approach would be to explicit describe the intersection of facts. That's blocked behind http://reviews.llvm.org/D14476 and I wanted to fix the current bug. llvm-svn: 259446	2016-02-02 00:45:30 +00:00
Sanjay Patel	c54600dbb1	fix typos; NFC llvm-svn: 259438	2016-02-01 23:53:35 +00:00
Philip Reames	f3b94694c0	[LVI] Missing test case from 259432 llvm-svn: 259437	2016-02-01 23:44:38 +00:00
Teresa Johnson	a9a630759f	Add test for PR26419 (stable function summary ordering) Enhance an existing test to also check that the ordering of the function summary entries is stable. llvm-svn: 259434	2016-02-01 23:26:30 +00:00
Philip Reames	13f7324b86	[LVI] Remove overly tight assert from 259429 I'll submit a test case shortly which covers this, but it's causing clang self host problems in the builders so I wanted to get it removed. llvm-svn: 259432	2016-02-01 23:21:11 +00:00
Simon Pilgrim	5be17b6e3e	[X86][AVX512] Add support for AVX512 VMOVD (load) shuffle decoding llvm-svn: 259430	2016-02-01 23:04:05 +00:00
Philip Reames	c0bdb0c1e5	[LVI] Add select handling Teach LVI to handle select instructions in the exact same way it handles PHI nodes. This is useful since various parts of the optimizer convert PHI nodes into selects and we don't want these transformations to cause inferior optimization. Note that this patch does nothing to exploit the implied constraint on the inputs represented by the select condition itself. That will be a later patch and is blocked on http://reviews.llvm.org/D14476 llvm-svn: 259429	2016-02-01 22:57:53 +00:00
Simon Pilgrim	f5c23ad3d7	[X86][AVX512] Add support for AVX512 VMOVSD/VMOVSS shuffle decoding llvm-svn: 259427	2016-02-01 22:26:28 +00:00
Sanjay Patel	4b198802b3	function names start with a lowercase letter; NFC llvm-svn: 259425	2016-02-01 22:23:39 +00:00
Sanjay Patel	103ab7d571	[InstCombine] simplify masked scatter/gather intrinsics with zero masks A masked scatter with a zero mask means there's no store. A masked gather with a zero mask means the passthru arg is returned. This is a continuation of: http://reviews.llvm.org/rL259369 http://reviews.llvm.org/rL259392 llvm-svn: 259421	2016-02-01 22:10:26 +00:00
Simon Pilgrim	025a3d857a	[X86][AVX512] Add support for AVX512 VINSERTPS shuffle decoding llvm-svn: 259420	2016-02-01 22:05:50 +00:00
Matthias Braun	3f88eabe93	SmallSet/SmallPtrSet: Refuse huge Small numbers These sets do linear searching in small mode; It is not a good idea to use huge numbers as the small value here, save people from themselves by adding a static_assert. Differential Revision: http://reviews.llvm.org/D16706 llvm-svn: 259419	2016-02-01 22:05:16 +00:00
Simon Pilgrim	e9848d4a88	[X86][SSE] Regenerated load vector + element extraction tests. llvm-svn: 259416	2016-02-01 21:46:12 +00:00
Chad Rosier	dbdb1d6eaf	Move comments a bit closer to associated code. NFC. llvm-svn: 259411	2016-02-01 21:38:31 +00:00
Simon Pilgrim	068e38f7f4	[X86][SSE] Add AVX512 merge consecutive load tests Add AVX512F/AVX512BW 512-bit tests. Add AVX512F tests to existing 128/256-bit tests. llvm-svn: 259410	2016-02-01 21:30:50 +00:00
Simon Pilgrim	f3c37cc87e	Regenerate vector blend tests. llvm-svn: 259406	2016-02-01 21:06:32 +00:00
Simon Pilgrim	32b25549fa	Regenerate vector sext/zext constant folding tests. llvm-svn: 259405	2016-02-01 21:01:29 +00:00
Jun Bum Lim	53907161cc	Avoid inlining call sites in unreachable-terminated block Summary: If the normal destination of the invoke or the parent block of the call site is unreachable-terminated, there is little point in inlining the call site unless there is literally zero cost. Unlike my previous change (D15289), this change specifically handle the call sites followed by unreachable in the same basic block for call or in the normal destination for the invoke. This change could be a reasonable first step to conservatively inline call sites leading to an unreachable-terminated block while BFI / BPI is not yet available in inliner. Reviewers: manmanren, majnemer, hfinkel, davidxl, mcrosier, dblaikie, eraman Subscribers: dblaikie, davidxl, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16616 llvm-svn: 259403	2016-02-01 20:55:11 +00:00
Chad Rosier	064261da16	Remove extra semicolon. NFC. llvm-svn: 259402	2016-02-01 20:54:36 +00:00
Sanjoy Das	4c7b6d79c0	[SCEV] Clean up isKnownPredicateViaConstantRanges; NFCI - ScalarEvolution::isKnownPredicateViaConstantRanges duplicates some logic already present in ConstantRange, use ConstantRange for those bits. - In some cases ScalarEvolution::isKnownPredicateViaConstantRanges returns `false` to mean "definitely false" (e.g. see the `LHSRange.getSignedMin().sge(RHSRange.getSignedMax())` case for `ICmpInst::ICMP_SLT`), but for `isKnownPredicateViaConstantRanges`, `false` actually means "don't know". Get rid of this extra bit of code to avoid confusion. llvm-svn: 259401	2016-02-01 20:48:14 +00:00
Sanjoy Das	401e631c4b	[SCEV] Rename isKnownPredicateWithRanges; NFC Make it obvious that it uses constant ranges, and use `Via` instead of `With`, like other similar functions in SCEV. llvm-svn: 259400	2016-02-01 20:48:10 +00:00
Rafael Espindola	52570ea2a2	Fix infinite recursion in MCAsmStreamer::EmitValueImpl. If a target can only emit 8-bits data, we would loop in EmitValueImpl since it will try to split a 32-bits data in 1 chunk of 32-bits. No test since all current targets can emit 32bits at a time. Patch by Alexandru Guduleasa! llvm-svn: 259399	2016-02-01 20:36:49 +00:00
Teresa Johnson	2d9da4dc50	[ThinLTO] Ensure function summary output order is stable Iterate over the function list instead of a DenseMap of Function pointers when emitting the function summary into the module. This fixes PR26419. llvm-svn: 259398	2016-02-01 20:16:35 +00:00
Rafael Espindola	ac60e5f028	Add a test for r258362. Thanks to Mehdi for finding it. llvm-svn: 259394	2016-02-01 19:56:12 +00:00
Sanjay Patel	04f792bdc9	[InstCombine] simplify masked store intrinsics with all ones or zeros masks A masked store with a zero mask means there's no store. A masked store with an allOnes mask means it's a normal vector store. This is a continuation of: http://reviews.llvm.org/rL259369 llvm-svn: 259392	2016-02-01 19:39:52 +00:00
Davide Italiano	d4a48532e0	[llvm-nm] Simplify the code a bit. NFCI. Fix a style violation while I'm here. llvm-svn: 259391	2016-02-01 19:22:16 +00:00
Balaram Makam	92431703d7	AArch64: Implement missed conditional compare sequences. Summary: This is an extension to the existing implementation of r242436 which restricts to only select inputs. This version fixes missed opportunities in pr26084 by attempting to lower conditional compare sequences of and/or trees with setcc leafs. This will additionaly handle the case when a tree with select input is not a conjunction-disjunction tree but some of the sub trees are conjunction-disjunction trees. Reviewers: jmolloy, t.p.northover, mcrosier, MatzeB Subscribers: mcrosier, llvm-commits, junbuml, haicheng, mssimpso, gberry Differential Revision: http://reviews.llvm.org/D16291 llvm-svn: 259387	2016-02-01 19:13:07 +00:00
Matthew Simpson	45dee06177	Add test case missing from r259357 (NFC) llvm-svn: 259385	2016-02-01 19:09:24 +00:00
Geoff Berry	29d4a695f4	[AArch64] Simplify prolog/epilog callee save/restore. NFC. Summary: Factor out common code for callee-save register pair calculation. This is intended to simplify follow-on changes that reduce the number of registers saved/restored. Depends on D16732 Reviewers: mcrosier, jmolloy, t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16734 llvm-svn: 259384	2016-02-01 19:07:06 +00:00
Ulrich Weigand	4a4d4ab7a4	[SystemZ] Fix wrong-code generation for certain always-false conditions We've found another bug in the code generation logic conditions for a certain class of always-false conditions, those of the form if ((a & 1) < 0) These only reach the back end when compiling without optimization. The bug was introduced by the choice of using TEST UNDER MASK to implement a check for if ((a & MASK) < VAL) as if ((a & MASK) == 0) where VAL is less than the the lowest bit of MASK. This is correct in all cases except for VAL == 0, in which case the original condition is always false, but the replacement isn't. Fixed by excluding that particular case. llvm-svn: 259381	2016-02-01 18:31:19 +00:00
Colin LeMahieu	6fdfa3dc32	[NFC] Referencing manual for reason why subregbit is checked llvm-svn: 259380	2016-02-01 18:15:39 +00:00
Sanjay Patel	cf57c5a4ee	fix broken check lines Without the colon, it doesn't mean anything! llvm-svn: 259377	2016-02-01 17:46:18 +00:00
David Majnemer	f8853ae7b3	[InstCombine] Don't transform (X+INT_MAX)>=(Y+INT_MAX) -> (X<=Y) This miscompile came about because we tried to use a transform which was only appropriate for xor operators when addition was present. This fixes PR26407. llvm-svn: 259375	2016-02-01 17:37:56 +00:00
Jun Bum Lim	ca832660ae	[ValueTracking] Improve isKnownNonZero for PHI of non-zero constants It is clear that a PHI is a non-zero if all incoming values are non-zero constants. llvm-svn: 259370	2016-02-01 17:03:07 +00:00
Sanjay Patel	b695c5557c	[InstCombine] simplify masked load intrinsics with all ones or zeros masks A masked load with a zero mask means there's no load. A masked load with an allOnes mask means it's a normal vector load. Differential Revision: http://reviews.llvm.org/D16691 llvm-svn: 259369	2016-02-01 17:00:10 +00:00
Geoff Berry	b13b2eed11	[PrologEpilogInserter] Add some debug output for callee-save frame object allocation Reviewers: mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16733 llvm-svn: 259367	2016-02-01 16:47:51 +00:00
Geoff Berry	04bf91a8c1	[AArch64] Simplify callee-save register save/restore. NFC. Summary: Simplify callee-save register save/restore code generation by remembering the size of the callee-save area when it is computed so we don't have to scan the prologue/epilogue instructions again later to reconstruct it. This is intended to simplify follow-on changes that reduce the number of registers saved/restored. Reviewers: mcrosier, jmolloy, t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16732 llvm-svn: 259365	2016-02-01 16:29:19 +00:00
Matthew Simpson	73dad62174	[LV] Rename RdxPHIsToFix to PHIsToFix (NFC) In the future, we will vectorize recurrences other than reductions. This patch renames a few variables and updates their associated comments to enable them to be reused for non-reduction PHI nodes. This change was requested in the review for D16197. llvm-svn: 259364	2016-02-01 16:07:01 +00:00
Asaf Badouh	5a3a0231f4	[X86][AVX512VBMI] add encoding and intrinsics for Multishift Differential Revision: http://reviews.llvm.org/D16399 llvm-svn: 259363	2016-02-01 15:48:21 +00:00
Vasileios Kalintiris	a052037034	[mips] Split large test file into 3 smaller ones. Remove the old select.ll file and use select-int.ll, select-flt.ll, select-dbl.ll for testing selects on integers, floats & doubles respectivelly. llvm-svn: 259361	2016-02-01 15:19:35 +00:00
Daniel Sanders	f8bb23e509	[mips] Range check uimm16 and fix several bugs this revealed. Summary: The bugs were: * teq and similar take 4-bit unsigned immediates on microMIPS. * teqi and similar have side-effects like teq do. * shll_s.w and shra_r.w take 5-bit unsigned immediates. * The various DSP ext* instructions take a 5-bit immediate. * repl.qh takes an 8-bit unsigned immediate. * repl.ph takes a 10-bit unsigned immediate. * rddsp/wrdsp take a 10-bit unsigned immediate. * teqi and similar take signed 16-bit immediates (10-bit for microMIPS). * Out-of-range immediate macros for or/xor take a simm32/simm64 depending on architecture. I'll fix the simm64 case properly when I reach simm32. lui is a bit more lenient than GAS and accepts signed immediates in addition to unsigned. This is because MipsMCExpr can produce signed values when constant folding and it currently lacks a way of knowing it should fold to an unsigned value. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D15446 llvm-svn: 259360	2016-02-01 15:13:31 +00:00
Amjad Aboud	8bbce8ad8e	Improved macro emission in dwarf. Changed emitting offset of macinfo entry into compiler unit DIE to use "addSectionLabel" method rather than explicitly calculating size/offset of macro entry. Differential Revision: http://reviews.llvm.org/D16292 llvm-svn: 259358	2016-02-01 14:09:41 +00:00

1 2 3 4 5 ...

127118 Commits