llvm-project

Commit Graph

Author	SHA1	Message	Date
Scott Douglass	039f768c42	[ARM] Small refactor of tryConvertingToTwoOperandForm (nfc) Also, add more Thumb2 ADD tests requested during review of http://reviews.llvm.org/D11053. Differential Revision: http://reviews.llvm.org/D11130 llvm-svn: 242034	2015-07-13 15:31:33 +00:00
Silviu Baranga	a647c30f88	Cleanup after r241809 - remove unnecessary call to std::sort Summary: The iteration order within a member of DepCands is deterministic and therefore we don't have to sort the accesses within a member. We also don't have to copy the indices of the pointers into a vector, since we can iterate over the members of the class. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11145 llvm-svn: 242033	2015-07-13 14:48:24 +00:00
Rafael Espindola	c1d63f7499	Remove unused variable. Sorry I missed it in the previous commit. llvm-svn: 242032	2015-07-13 14:43:33 +00:00
Rafael Espindola	5895d6845e	Aliases don't have available_externally linkage. Allowing that is probably a good idea, but currently we don't, so this is dead code. llvm-svn: 242031	2015-07-13 14:39:02 +00:00
Rafael Espindola	237c3a6def	Don't change the visibility when converting a definition to a declaration. llvm-svn: 242030	2015-07-13 14:18:22 +00:00
Aaron Ballman	6d8f785073	Removing several -Wunused-but-set-variable warnings; NFC intended. llvm-svn: 242028	2015-07-13 14:04:30 +00:00
Rafael Espindola	7068cbbc1a	Print the visibility of available_externally functions. We were already printing it for declarations, but not available_externally. llvm-svn: 242027	2015-07-13 13:55:18 +00:00
Manuel Klimek	779cf85a4f	Revert r241981 "Revert "Revert r236894 "[BasicAA] Fix zext & sext handling""" The repros from PR23626 still fail. llvm-svn: 242025	2015-07-13 13:50:55 +00:00
Elena Demikhovsky	0f370936a0	AVX-512: Added all AVX-512 forms of Vector Convert for Float/Double/Int/Long types. This patch covers only the encoding. Intrinsics and DAG lowering will be in the next patch. I temporarily removed the old intrinsics test (just to split this patch). Half types are not covered here. Differential Revision: http://reviews.llvm.org/D11134 llvm-svn: 242023	2015-07-13 13:26:20 +00:00
Jingyue Wu	9a92d4fb04	[LSR] don't attempt to promote ephemeral values to indvars Summary: This at least saves compile time. I also encountered a case where ephemeral values affect whether other variables are promoted, causing performance issues. It may be a bug in LSR, but I didn't manage to reduce it yet. Anyhow, I believe it's in general not worth considering ephemeral values in LSR. Reviewers: atrick, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11115 llvm-svn: 242011	2015-07-13 03:28:53 +00:00
David Majnemer	599ca4426c	[InstSimplify] Teach InstSimplify how to simplify extractelement llvm-svn: 242008	2015-07-13 01:15:53 +00:00
David Majnemer	25a796e148	[InstSimplify] Teach InstSimplify how to simplify extractvalue llvm-svn: 242007	2015-07-13 01:15:46 +00:00
Renato Golin	1ef7a0f7c0	[ARM] Add support for nest attribute using r12 Register r12 ('ip') is used by GCC for this purpose and hence is used here. As discussed on the GCC mailing list, the register choice is an ABI issue and so choosing the same register as GCC means __builtin_call_with_static_chain is compatible. A similar patch has just gone in the AArch64 backend, so this is just the ARM counterpart, following the same discussion. Patch by Stephen Cross. llvm-svn: 241996	2015-07-12 18:16:40 +00:00
Simon Pilgrim	4f500525ef	[X86][SSE] (V)PMINSB is commutable. (V)PMINSB is no different to the other (V)PMIN/(V)PMAX B/D/W instructions - it is fully commutable. llvm-svn: 241994	2015-07-12 16:44:11 +00:00
Simon Pilgrim	ae5cd2773d	Trim trailing whitespaces. NFC. llvm-svn: 241990	2015-07-12 11:17:33 +00:00
Simon Pilgrim	64cc4ad0a2	[X86][SSE] Vectorized v4i32 non-uniform shifts. While the v4i32 shl operation is already vectorized using a cvttps2dq/pmulld pattern, the lshr/ashr operations are still scalarized. This patch adds vectorization support for non-uniform v4i32 shift operations - it splats constant shift amounts to allow them to use the immediate sse shift instructions, or extracts/zero-extends non-constant shift amounts. The individual results are then blended together. Differential Revision: http://reviews.llvm.org/D11063 llvm-svn: 241989	2015-07-12 11:15:19 +00:00
David Majnemer	6bc83e0f43	[LICM] Don't try to sink values out of loops without any exits There is no suitable basic block to sink instructions in loops without exits. The only way an instruction in a loop without exits can be used is as an incoming value to a PHI. In such cases, the incoming block for the corresponding value is unreachable. This fixes PR24013. Differential Revision: http://reviews.llvm.org/D10903 llvm-svn: 241987	2015-07-12 03:53:05 +00:00
Hal Finkel	cbf08925ef	[PowerPC] Make use of the TargetRecip system r238842 added the TargetRecip system for controlling use of reciprocal estimates for sqrt and division using a set of parameters that can be set by the frontend. Clang now supports a sophisticated -mrecip option, and this will allow that option to effectively control the relevant code-generation functionality of the PPC backend. llvm-svn: 241985	2015-07-12 02:33:57 +00:00
Hal Finkel	965cea5670	[PowerPC] Support the nest parameter attribute This adds support for the 'nest' attribute, which allows the static chain register to be set for function calls under non-Darwin PPC/PPC64 targets. r11 is the chain register (which the PPC64 ELF ABI calls the "environment pointer"). For indirect calls under PPC64 ELFv1, this would normally be loaded from the function descriptor, but providing an explicit 'nest' parameter will override that process and use the value provided. This allows __builtin_call_with_static_chain to work as expected on PowerPC. llvm-svn: 241984	2015-07-12 00:37:44 +00:00
Hal Finkel	ef28aad9f4	Revert "Revert r236894 "[BasicAA] Fix zext & sext handling"" r236894 caused PR23626 (Clang miscompiles webkit's base64 decoder), and was reverted in r237984. This reapplies the patch with an additional test case for PR23626 and the associated fix (both scales and offsets in the BasicAliasAnalysis::constantOffsetHeuristic should initially be zero). Patch by Nick White, thanks! llvm-svn: 241981	2015-07-11 11:04:54 +00:00
Hal Finkel	9cf58c4095	Move getStrideFromPointer and friends from LoopVectorize to VectorUtils The following functions are moved from the LoopVectorizer to VectorUtils: - getGEPInductionOperand - stripGetElementPtr - getUniqueCastUse - getStrideFromPointer These used to be static functions in LoopVectorize, but will also be used by the upcoming loop versioning LICM transformation. Patch by Ashutosh Nema! llvm-svn: 241980	2015-07-11 10:52:42 +00:00
Igor Laevsky	39d662f7ba	Add argmemonly attribute. This change adds a new attribute called "argmemonly". A function marked with this attribute can only access memory through its argument pointers. This attribute directly corresponds to the "OnlyAccessesArgumentPointees" ModRef behaviour in alias analysis. Differential Revision: http://reviews.llvm.org/D10398 llvm-svn: 241979	2015-07-11 10:30:36 +00:00
Chandler Carruth	00ebdbcc47	[PM/AA] Completely remove the AliasAnalysis::copyValue interface. No in-tree alias analysis used this facility, and it was not called in any particularly rigorous way, so it seems unlikely to be correct. Note that one of the only stateful AA implementations in-tree, GlobalsModRef is completely broken currently (and any AA passes like it are equally broken) because Module AA passes are not effectively invalidated when a function pass that fails to update the AA stack runs. Ultimately, it doesn't seem like we know how we want to build stateful AA, and until then trying to support and maintain correctness for an untested API is essentially impossible. To that end, I'm planning to rip out all of the update API. It can return if and when we need it and know how to build it on top of the new pass manager and as part of tested stateful AA implementations in the tree. Differential Revision: http://reviews.llvm.org/D10889 llvm-svn: 241975	2015-07-11 04:39:00 +00:00
Tyler Nowicki	3960d85262	Renamed some uses of unroll to interleave in the vectorizer. llvm-svn: 241971	2015-07-11 00:31:11 +00:00
Adrian Prantl	12d528493e	Cleanup a couple of comments in DIBuilder.cpp llvm-svn: 241966	2015-07-10 23:26:02 +00:00
Duncan P. N. Exon Smith	e463e470f8	MC: Only allow changing feature bits in MCSubtargetInfo Disallow all mutation of `MCSubtargetInfo` except the feature bits. Besides deleting the assignment operators -- which were dead "code" -- this restricts `InitMCProcessorInfo()` to subclass initialization sequences, and exposes a new more limited function called `setDefaultFeatures()` for use by the ARMAsmParser `.cpu` directive. There's a small functional change here: ARMAsmParser used to adjust `MCSubtargetInfo::CPUSchedModel` as a side effect of calling `InitMCProcessorInfo()`, but I've removed that suspicious behaviour. Since the AsmParser shouldn't be doing any scheduling, there shouldn't be any observable change... llvm-svn: 241961	2015-07-10 22:52:15 +00:00
Matt Arsenault	cf13d18730	AMDGPU: Fix chains for memory ops dependent on argument loads Most loads and stores are derived from pointers derived from a kernel argument load inserted during argument lowering. This was just using the EntryToken chain for the argument loads, and any users of these loads were also on the EntryToken chain. Return the chain of the lowered argument load so that dependent loads end up on the correct chain. No test since I'm not aware of any case where this actually broke. llvm-svn: 241960	2015-07-10 22:51:36 +00:00
Alex Lorenz	53464510cc	MIR Serialization: Serialize the virtual register operands. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D11005 llvm-svn: 241959	2015-07-10 22:51:20 +00:00
David Majnemer	a5c7051a60	[IR] Switch static const to an enum to silence MSVC linker warnings Integral class statics are handled oddly in MSVC, we don't need them in this case, use an enum instead. llvm-svn: 241958	2015-07-10 22:46:02 +00:00
Duncan P. N. Exon Smith	754e21f244	MC: Remove MCSubtargetInfo() default constructor Force all creators of `MCSubtargetInfo` to immediately initialize it, merging the default constructor and the initializer into an initializing constructor. Besides cleaning up the code a little, this makes it clear that the initializer is never called again later. Out-of-tree backends need a trivial change: instead of calling: auto *X = new MCSubtargetInfo(); InitXYZMCSubtargetInfo(X, ...); return X; they should call: return createXYZMCSubtargetInfoImpl(...); There's no real functionality change here. llvm-svn: 241957	2015-07-10 22:43:42 +00:00
Duncan P. N. Exon Smith	bb57d73805	MC: Remove MCSubtargetInfo::InitCPUSched() Remove all calls to `MCSubtargetInfo::InitCPUSched()` and merge its body into the only relevant caller, `MCSubtargetInfo::InitMCProcessorInfo()`. We were only calling the former after explicitly calling the latter with the same CPU; it's confusing to have both methods exposed. Besides a minor (surely unmeasurable) speedup in ARM and X86 from avoiding running the logic twice, no functionality change. llvm-svn: 241956	2015-07-10 22:33:01 +00:00
Bjorn Steinbrink	a6b929dfe2	[InstCombine] Actually combine AA metadata when replacing one load with another Fixes PR24083 llvm-svn: 241955	2015-07-10 22:30:17 +00:00
Matt Arsenault	0d5197380c	AMDGPU: Use requested chain when lowering arguments No test since I'm not aware of any case where this will end up being a different chain. llvm-svn: 241954	2015-07-10 22:28:41 +00:00
Matthias Braun	e5a112f5e1	ARM: Use SpecificBumpPtrAllocator to fix leak introduced in r241920 llvm-svn: 241951	2015-07-10 22:23:57 +00:00
Reid Kleckner	7ea7708d92	[SEH] Push reloads of the SEH code past phi nodes This in turn would sometimes introduce new cleanupblocks that didn't previously exist. The uses were being introduced by SSA value demotion. We actually want to promote uses of EH pointers and selectors, so I added some special casing to avoid demoting such instructions. This is getting overly complicated, but hopefully we'll come along and delete it in the new representation. llvm-svn: 241950	2015-07-10 22:21:54 +00:00
Duncan P. N. Exon Smith	f787ed0b35	Add <type_traits> for is_pod, fixing r241947 llvm-svn: 241949	2015-07-10 22:17:49 +00:00
Matt Arsenault	f54dc2384d	DAGCombiner: Assume invariant load cannot alias a store The motivation is to allow GatherAllAliases / FindBetterChain to not give up on dependent loads of a pointer from constant memory. This is important for AMDGPU, because most loads are pointers derived from a load of a kernel argument from constant memory. llvm-svn: 241948	2015-07-10 22:17:40 +00:00
Duncan P. N. Exon Smith	f862f87ff2	MC: Remove the copy of MCSchedModel in MCSubtargetInfo `MCSchedModel` is large. Make `MCSchedModel::GetDefaultSchedModel()` return by-reference instead of by-value, so we can store a pointer in `MCSubtargetInfo::CPUSchedModel` instead of a copy. Note: since `MCSchedModel` is POD, this doesn't create a static constructor. llvm-svn: 241947	2015-07-10 22:13:43 +00:00
Quentin Colombet	8b984d19f2	[ShrinkWrap][PEI] Do not insert epilogue for unreachable blocks. Although it is not incorrect to insert such code, it is useless and it hurts the binary size. llvm-svn: 241946	2015-07-10 22:09:55 +00:00
David Majnemer	3f0a0e4a28	[MC] Switch static const to an enum to silence MSVC linker warnings Integral class statics are handled oddly in MSVC, we don't need them in this case, use an enum instead. llvm-svn: 241945	2015-07-10 21:50:04 +00:00
Evgeniy Stepanov	00b3020453	Fix AArch64 prologue for empty frame with dynamic allocas. Fixes PR23804: assertion failure in emitPrologue in the case of a function with an empty frame and a dynamic alloca that needs stack realignment. This is a typical case for AddressSanitizer. llvm-svn: 241943	2015-07-10 21:24:07 +00:00
Jingyue Wu	a277561922	[TTI] BasicTTIImpl assumes no vector registers Summary: Following the discussion on r241884, it's more reasonable to assume that a target has no vector registers by default instead of letting every such target override getNumberOfRegisters. Therefore, this patch modifies BasicTTIImpl::getNumberOfRegisters to return 0 when Vector is true, and partially reverts r241884 which modifies NVPTXTTIImpl::getNumberOfRegisters. It also fixes a performance bug in LoopVectorizer. Even if a target has no vector registers, vectorization may still help ILP. So, we need both checks to be false before disabling loop vectorization altogether. Reviewers: hfinkel Subscribers: llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D11108 llvm-svn: 241942	2015-07-10 21:14:54 +00:00
Adam Nemet	215746b45a	[LoopDist/LoopVer] Move LoopVersioning to a new module, NFC Summary: The class will obviously need improvement down the road. For one, there is no reason that addPHINodes would have to be exposed like that. I will make this and other improvements in follow-up patches. The main goal is to be able to share this functionality. The LoopLoadElimination pass I am working on needs it too. Later we can move other clients as well (LV and Ashutosh's LICMVer). Reviewers: hfinkel, ashutosh.nema Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10577 llvm-svn: 241932	2015-07-10 18:55:13 +00:00
Adam Nemet	1a689188c4	[LoopDist] Move loop-versioning helper functions to Cloning, NFC Summary: This makes them available to the LoopVersioning class as that is moved to its own module in the next patch. Reviewers: ashutosh.nema, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10576 llvm-svn: 241931	2015-07-10 18:55:09 +00:00
Matthias Braun	d9bd22b2c4	ARMLoadStoreOpt: Merge subs/adds into LDRD/STRD; Factor out common code This commit factors out common code from MergeBaseUpdateLoadStore() and MergeBaseUpdateLSMultiple() and introduces a new function MergeBaseUpdateLSDouble() which merges adds/subs preceding/following a strd/ldrd instruction into an strd/ldrd instruction with writeback where possible. Differential Revision: http://reviews.llvm.org/D10676 llvm-svn: 241928	2015-07-10 18:37:33 +00:00
Fiona Glaser	b08ae7affb	ComputeKnownBits: be a bit smarter about ADDs If our two inputs have known top-zero bit counts M and N, we trivially know that the output cannot have any bits set in the top (min(M, N)-1) bits, since nothing could carry past that point. llvm-svn: 241927	2015-07-10 18:29:02 +00:00
Matthias Braun	e4ba6b8c24	ARMLoadStoreOptimizer: Create LDRD/STRD on thumb2 Differential Revision: http://reviews.llvm.org/D10623 llvm-svn: 241926	2015-07-10 18:28:49 +00:00
JF Bastien	5ca0baca4a	WebAssembly: basic instructions todo, and basic register info. Summary: This code is based on AArch64 for modern backend good practice, and NVPTX for virtual ISA concerns. Reviewers: sunfish Subscribers: aemerson, llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D11070 llvm-svn: 241923	2015-07-10 18:23:10 +00:00
Alex Lorenz	f6bc8667cd	MIR Serialization: Initial serialization of stack objects. This commit implements the initial serialization of stack objects from the MachineFrameInfo class. It can only serialize the ordinary stack objects (including ordinary spill slots), but it doesn't serialize variable sized or fixed stack objects yet. The stack objects are serialized using a YAML sequence of YAML inline mappings. Each mapping has the object's ID, type, size, offset and alignment. The stack objects are a part of machine function's YAML mapping. Reviewers: Duncan P. N. Exon Smith llvm-svn: 241922	2015-07-10 18:13:57 +00:00
JF Bastien	b73a2ed20e	Target RegisterInfo: devirtualize TargetFrameLowering Summary: The target frame lowering's concrete type is always known in RegisterInfo, yet it's only sometimes devirtualized through a static_cast. This change adds an auto-generated static function <Target>GenRegisterInfo::getFrameLowering(const MachineFunction &MF) which does this devirtualization, and uses this function in all targets which can. This change was suggested by sunfish in D11070 for WebAssembly, I figure that I may as well improve the other targets while I'm here. Subscribers: sunfish, ted, llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D11093 llvm-svn: 241921	2015-07-10 18:13:17 +00:00
Matthias Braun	a4a3182ded	ARMLoadStoreOptimizer: Rewrite LDM/STM matching logic. This improves the logic in several ways and is a preparation for followup patches: - First perform an analysis and create a list of merge candidates, then transform. This simplifies the code in that you don't have to care too much anymore that you may be holding iterators to MachineInstrs that get removed. - Analyze/Transform basic blocks in reverse order. This allows using LivePhysRegs to find free registers instead of the RegisterScavenger. The RegisterScavenger will become less precise in the future as it relies on the deprecated kill-flags. - Return the newly created node in MergeOps so there's no need to look around in the schedule to find it. - Rename some MBBI iterators to InsertBefore to make their role clear. - General code cleanup. Differential Revision: http://reviews.llvm.org/D10140 llvm-svn: 241920	2015-07-10 18:08:49 +00:00
Eli Bendersky	5c0039a014	Actually support volatile memcpys in NVPTX lowering Differential Revision: http://reviews.llvm.org/D11091 llvm-svn: 241914	2015-07-10 15:40:33 +00:00
Nemanja Ivanovic	d9e4b4ff36	NFC. Added a blank line for consistency. llvm-svn: 241913	2015-07-10 14:25:17 +00:00
Benjamin Kramer	f4ebfa3ae1	[InstSimplify] Fold away ord/uno fcmps when nnan is present. This is important to fold away the slow case of complex multiplies emitted by clang. llvm-svn: 241911	2015-07-10 14:02:02 +00:00
James Molloy	88eb535b2d	Add support for fast-math flags to the FCmp instruction. FCmp behaves a lot like a floating-point binary operator in many ways, and can benefit from fast-math information. Flags such as nsz and nnan can affect if this fcmp (in combination with a select) can be treated as a fminnum/fmaxnum operation. This adds backwards-compatible bitcode support, IR parsing and writing, LangRef changes and IRBuilder changes. I'll need to audit InstSimplify and InstCombine in a followup to find places where flags should be copied. llvm-svn: 241901	2015-07-10 12:52:00 +00:00
Nemanja Ivanovic	5655fb320c	Add missing builtins to the PPC back end for ABI compliance (vol. 3) This patch corresponds to review: http://reviews.llvm.org/D10973 Back end portion of the third round of additions to altivec.h. llvm-svn: 241900	2015-07-10 12:38:08 +00:00
Alexey Bataev	da33d80e9a	Disable loop re-rotation for -Oz (patch by Andrey Turetsky) After changes in rL231820 loop re-rotation is performed even in -Oz mode. Since loop rotation is disabled for -Oz, it seems loop re-rotation should be disabled too. Differential Revision: http://reviews.llvm.org/D10961 llvm-svn: 241897	2015-07-10 10:37:09 +00:00
David Majnemer	db82d2f338	Revert the new EH instructions This reverts commits r241888-r241891, I didn't mean to commit them. llvm-svn: 241893	2015-07-10 07:15:17 +00:00
David Majnemer	82771b1ad6	Tighten the verifier check for catchblock. llvm-svn: 241891	2015-07-10 07:01:07 +00:00
David Majnemer	11aeb90aaa	Address Joseph's review comments. llvm-svn: 241890	2015-07-10 07:01:03 +00:00
David Majnemer	1d3fe98d57	Address Reid's review feedback. llvm-svn: 241889	2015-07-10 07:00:58 +00:00
David Majnemer	ae2ffc8a8c	New EH representation for MSVC compatibility Summary: This introduces new instructions necessary to implement MSVC-compatible exception handling support. Most of the middle-end and none of the back-end haven't been audited or updated to take them into account. Reviewers: rnk, JosephTremoulet, reames, nlewycky, rjmccall Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11041 llvm-svn: 241888	2015-07-10 07:00:44 +00:00
Bjorn Steinbrink	8350534772	[InstCombine] Employ AliasAnalysis in FindAvailableLoadedValue llvm-svn: 241887	2015-07-10 06:55:49 +00:00
Bjorn Steinbrink	a91fd0998f	[InstCombine] Properly combine metadata when replacing a load with another Not doing this can lead to misoptimizations down the line, e.g. because of range metadata on the replacing load excluding values that are valid for the load that is being replaced. llvm-svn: 241886	2015-07-10 06:55:44 +00:00
Jingyue Wu	ad85c8c204	[NVPTX] declare no vector registers Summary: Without this patch, LoopVectorizer in certain cases (see loop-vectorize.ll) produces code with complex control flow which hurts later optimizations. Since NVPTX doesn't have vector registers in LLVM's sense (NVPTXTTI::getRegisterBitWidth(true) == 32), we for now declare no vector registers to effectively disable loop vectorization. Reviewers: jholewinski Subscribers: jingyue, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D11089 llvm-svn: 241884	2015-07-10 04:31:56 +00:00
Reid Kleckner	85a2450d56	[WinEH] Make sure LSDA tables are 4 byte aligned Apparently this is important, otherwise _except_handler3 assumes that the registration node is corrupted and ignores it. Also fix a bug in WinEHPrepare where we would insert code after a terminator instruction. llvm-svn: 241877	2015-07-10 00:08:49 +00:00
Eli Bendersky	d880520bc2	Replace index-loops by range-based loops NFC llvm-svn: 241875	2015-07-09 23:06:03 +00:00
Sanjay Patel	81beefc541	[x86] enable machine combiner reassociations for scalar double-precision multiplies llvm-svn: 241873	2015-07-09 22:58:39 +00:00
Sanjay Patel	ea81edf351	[x86] enable machine combiner reassociations for scalar double-precision adds llvm-svn: 241871	2015-07-09 22:48:54 +00:00
Alex Lorenz	28148ba82d	MIR Serialization: Serialize the virtual register definitions. The virtual registers are serialized using a YAML sequence of YAML inline mappings. Each mapping has the id of the virtual register and the register class. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10981 llvm-svn: 241868	2015-07-09 22:23:13 +00:00
Adam Nemet	0f67c6c1d5	[LAA] Fix grammar in debug output llvm-svn: 241867	2015-07-09 22:17:41 +00:00
Adam Nemet	ee61474a61	[LAA] Hide NeedRTCheck logic completely inside canCheckPtrAtRT, NFC Currently canCheckPtrAtRT returns two flags NeedRTCheck and CanDoRT. NeedRTCheck says whether we need checks and CanDoRT whether we can generate the checks. The idea is to encode three states with these: Need/Can: (1) false/dont-care: no checks are needed (2) true/false: we need checks but can't generate them (3) true/true: we need checks and we can generate them This is pretty unnecessary since the caller (analyzeLoop) is only interested in whether we can generate the checks if we actually need them (i.e. 1 or 3). So this change cleans up to return just that (CanDoRTIfNeeded) and pulls all the underlying logic into canCheckPtrAtRT. By doing all this, we simplify analyzeLoop which is the complex function in LAA. There is further room for improvement here by using RtCheck.Need directly rather than a new local variable NeedRTCheck but that's for a later patch. llvm-svn: 241866	2015-07-09 22:17:38 +00:00
Reid Kleckner	8eecb3c160	[WinEH] Give up on using CSRs across 32-bit invokes for now The runtime does not restore CSRs when transferring control back to the function handling the exception. According to the experts on IRC, LLVM's register allocator has no way to model register clobbers that only happen on one edge of the CFG. For now, don't worry about trying to use the meager three CSRs available on 32-bit X86 and just say that such invokes preserve nothing. llvm-svn: 241865	2015-07-09 22:09:41 +00:00
Reid Kleckner	c16b1078df	Expose sjlj preparation through opt for my own debugging purposes llvm-svn: 241864	2015-07-09 21:48:40 +00:00
Alex Lorenz	c8704b02df	MIR Parser: Report an error when parsing machine function with an empty body. This commit adds a new error which is reported when the MIR Parser encounters a machine function without any machine basic blocks. The machine verifier expects that the machine functions have at least one MBB, and this error will prevent machine functions without MBBs from reaching the machine verifier and crashing with an assertion. llvm-svn: 241862	2015-07-09 21:21:33 +00:00
Tom Stellard	dcb9f0907f	AMDGPU: Add helper function for implicit parameter offsets. Patch by: Zoltan Gilian llvm-svn: 241861	2015-07-09 21:20:37 +00:00
JF Bastien	b379643f7c	Unbreak WebAssembly build Summary: D11021 and D11045 didn't update the WebAssembly target's code. It's still experimental so all tests passed. Reviewers: sunfish, joker.eph, echristo Subscribers: llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D11084 llvm-svn: 241859	2015-07-09 21:00:09 +00:00
Sanjoy Das	c3a8e398a2	[ImplicitNullChecks] Fix a memory leak. llvm-svn: 241851	2015-07-09 20:13:31 +00:00
Sanjoy Das	b771845461	[ImplicitNullChecks] Be smarter in picking the memory op. Summary: Before this change ImplicitNullChecks would only pick loads of the form: ``` test Reg, Reg jz elsewhere fallthrough: movl 32(Reg), Reg2 ``` but not (say) ``` test Reg, Reg jz elsewhere fallthrough: inc Reg3 movl 32(Reg), Reg2 ``` This change teaches ImplicitNullChecks to look through "unrelated" instructions like `inc Reg3` when searching for a load instruction to convert to a trapping load. Reviewers: atrick, JosephTremoulet, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11044 llvm-svn: 241850	2015-07-09 20:13:25 +00:00
Alex Lorenz	60541c1d44	MIR Serialization: Serialize the simple MachineFrameInfo attributes. This commit serializes the 13 scalar boolean and integer attributes from the MachineFrameInfo class: IsFrameAddressTaken, IsReturnAddressTaken, HasStackMap, HasPatchPoint, StackSize, OffsetAdjustment, MaxAlignment, AdjustsStack, HasCalls, MaxCallFrameSize, HasOpaqueSPAdjustment, HasVAStart, and HasMustTailInVarArgFunc. These attributes are serialized as part of the frameInfo YAML mapping, which itself is a part of the machine function's YAML mapping. llvm-svn: 241844	2015-07-09 19:55:27 +00:00
Rafael Espindola	594e676cbe	llvm-ar: Pad the symbol table to 4 bytes. It looks like ld64 requires it. With this we seem to be able to bootstrap using llvm-ar+/usr/bin/true instead of ar+ranlib (currently on stage2). llvm-svn: 241842	2015-07-09 19:48:06 +00:00
Matt Arsenault	8b03e6c164	AMDGPU/R600: Return correct chain when lowering loads The other LowerLOAD should be returning the correct chain. llvm-svn: 241839	2015-07-09 18:47:03 +00:00
Sanjoy Das	6f062c8c2a	[IndVars] Try to use existing values in RewriteLoopExitValues. Summary: In RewriteLoopExitValues, before expanding out an SCEV expression using SCEVExpander, try to see if an existing LLVM IR expression already computes the value we're interested in. If so use that existing expression. Apart from reducing IndVars' reliance on the rest of the compilation pipeline, this also prevents IndVars from concluding some expressions as "high cost" when they're not. For instance, `InductiveRangeCheckElimination` often emits code of the following form: ``` len = umin(len_A, len_B) loop: ... if (i++ < len) goto loop outside_loop: use(i) ``` `SCEVExpander` refuses to rewrite the use of `i` in `outside_loop`, since it thinks the value of `i` on loop exit, `len`, is a high cost expansion since it contains an `umax` in it. With this change, `IndVars` can see that it can re-use `len` instead of creating a new expression to compute `umin(len_A, len_B)`. I considered putting this cleverness in `SCEVExpander`, but I was worried that it may then have a detrimental effect on other passes that use it. So I decided it was better to just do this in the one place where it seems like an obviously good idea, with the intent of generalizing later if needed. Reviewers: atrick, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10782 llvm-svn: 241838	2015-07-09 18:46:12 +00:00
Reid Kleckner	0f7f8d41f7	Remove dead code from old 64-bit SEH lowering llvm-svn: 241829	2015-07-09 17:46:39 +00:00
Pat Gavlin	a717f255b6	Allow {e,r}bp as the target of {read,write}_register. This patch allows the read_register and write_register intrinsics to read/write the RBP/EBP registers on X86 iff the targeted register is the frame pointer for the containing function. Differential Revision: http://reviews.llvm.org/D10977 llvm-svn: 241827	2015-07-09 17:40:29 +00:00
Sanjay Patel	e2361d4a18	fix an invisible bug when combining repeated FP divisors This patch fixes bugs that were exposed by the addition of fast-math-flags in the DAG: r237046 ( http://reviews.llvm.org/rL237046 ): 1. When replacing a division node, it's not enough to RAUW. We should call CombineTo() to delete dead nodes and combine again. 2. Because we are changing the DAG, we can't return an empty SDValue after the transform. As the code comments say: Visitation implementation - Implement dag node combining for different node types. The semantics are as follows: Return Value: SDValue.getNode() == 0 - No change was made SDValue.getNode() == N - N was replaced, is dead and has been handled. otherwise - N should be replaced by the returned Operand. The new test case shows no difference with or without this patch, but it will crash if we re-apply r237046 or enable FMF via the current -enable-fmf-dag cl::opt. Differential Revision: http://reviews.llvm.org/D9893 llvm-svn: 241826	2015-07-09 17:28:37 +00:00
Juergen Ributzka	216ed03ebb	[StackMap] Use lambdas to specify the sort and erase conditions. NFC. llvm-svn: 241823	2015-07-09 17:11:15 +00:00
Juergen Ributzka	aef76cafa0	[StackMap] Rename variables to be more consistent. NFC. Rename a few variables and use auto for long iterator names. llvm-svn: 241822	2015-07-09 17:11:11 +00:00
Juergen Ributzka	e4685a1c0d	[StackMaps] Use emplace_back when possible. NFC. llvm-svn: 241821	2015-07-09 17:11:08 +00:00
Tom Stellard	ab6e9c0f94	AMDGPU/SI: The SIShrinkInstructions pass should only fold immediates with one use This is covered by existing testcases and will be exposed by a future commit. llvm-svn: 241817	2015-07-09 16:30:36 +00:00
Tom Stellard	9ebf7ca2f0	AMDGPU/SI: Fix crash on physical registers in SIInstrInfo::isOperandLegal() No test case for this. I ran into it while working on some improvements to SIShrinkInstructions.cpp. llvm-svn: 241816	2015-07-09 16:30:27 +00:00
Rafael Espindola	c79bff6bb1	Basic support for BSD symbol tables in archives. This could be optimized and for now we only produce __.SYMDEF and not "__.SYMDEF SORTED". llvm-svn: 241814	2015-07-09 15:56:23 +00:00
Krzysztof Parzyszek	8b26fbf758	[Hexagon] Add missing preamble to a source file llvm-svn: 241813	2015-07-09 15:40:25 +00:00
Rafael Espindola	2ba806c702	Remove redundant variable. NFC. llvm-svn: 241810	2015-07-09 15:24:39 +00:00
Silviu Baranga	ce3877fc8c	Don't rely on the DepCands iteration order when constructing checking pointer groups Summary: The checking pointer group construction algorithm relied on the iteration on DepCands. We would need the same leaders across runs and the same iteration order over the underlying std::set for determinism. This changes the algorithm to process the pointers in the order in which they were added to the runtime check, which is deterministic. We need to update the tests, since the order in which pointers appear has changed. No new tests were added, since it is impossible to test for non-determinism. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11064 llvm-svn: 241809	2015-07-09 15:18:25 +00:00
Rafael Espindola	b870e9ca93	Add a helper for printing BE or LE depending on the format. The gnu ar format uses BE numbers. The BSD one uses LE. Add a helper for one or the other. NFC for now, just removes some noise from the following patch. llvm-svn: 241808	2015-07-09 15:13:41 +00:00
Mehdi Amini	eaabc51e78	Re-instate the EVT parameter to getScalarShiftAmountTy() for OOT user A documentation for this function would be nice by the way. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241807	2015-07-09 15:12:23 +00:00
Pawel Bylica	d1b818bcf4	Reapply fixed r241790: Fix shift legalization and lowering for big constants. Summary: If shift amount is a constant value > 64 bit it is handled incorrectly during type legalization and X86 lowering. This patch changes the type of the shift amount argument in function DAGTypeLegalizer::ExpandShiftByConstant from unsigned to APInt. Reviewers: nadav, majnemer, sanjoy, RKSimon Subscribers: RKSimon, llvm-commits Differential Revision: http://reviews.llvm.org/D10767 llvm-svn: 241806	2015-07-09 14:58:04 +00:00
Rafael Espindola	8cde5c01d8	Extract printBSDMemberHeader. It will get another use in the following patch. Also rename the other helper to printGNUSmallMemberHeader for consistency. llvm-svn: 241805	2015-07-09 14:54:12 +00:00
Krzysztof Parzyszek	feaf7b8d35	[Hexagon] Add support for atomic RMW operations llvm-svn: 241804	2015-07-09 14:51:21 +00:00
Arnaud A. de Grandmaison	f40f99e3a4	[AArch64] Select SBFIZ or UBFIZ instead of left + right shifts And rename LSB to Immr / MSB to Imms to match the ARM ARM terminology. llvm-svn: 241803	2015-07-09 14:33:38 +00:00
Scott Douglass	8143bc25ee	[ARM] Thumb1 3 to 2 operand conversion for commutative operations Differential Revision: http://reviews.llvm.org/D11057 llvm-svn: 241802	2015-07-09 14:13:55 +00:00
Scott Douglass	2740a63725	[ARM] Don't be overzealous converting Thumb1 3 to 2 operands Differential Revision: http://reviews.llvm.org/D11056 llvm-svn: 241801	2015-07-09 14:13:48 +00:00
Scott Douglass	47a3fce461	[ARM] Add Thumb2 ADD with PC narrowing from 3 operand to 2 Differential Revision: http://reviews.llvm.org/D11055 llvm-svn: 241800	2015-07-09 14:13:41 +00:00
Scott Douglass	8c7803f4c1	[ARM] Refactor converting Thumb1 from 3 to 2 operand (nfc) Also adds some test cases. Differential Revision: http://reviews.llvm.org/D11054 llvm-svn: 241799	2015-07-09 14:13:34 +00:00
Renato Golin	17d4efe7c1	Add support for nest attribute to AArch64 backend The nest attribute is currently supported on the x86 (32-bit) and x86-64 backends, but not on ARM (32-bit) or AArch64. This patch adds support for nest to the AArch64 backend. Register x18 is used by GCC for this purpose and hence is used here. As discussed on the GCC mailing list the register choice is an ABI issue and so choosing the same register as GCC means __builtin_call_with_static_chain is compatible. Patch by Stephen Cross. llvm-svn: 241794	2015-07-09 10:18:02 +00:00
Tamas Berghammer	b6b0ddfc95	Add getSizeInBits function to the APFloat class The newly added function returns the size of the specified floating point semantics in bits. Differential revision: http://reviews.llvm.org/D8413 llvm-svn: 241793	2015-07-09 10:13:39 +00:00
Pawel Bylica	627762fda5	Revert r241790: Fix shift legalization and lowering for big constants. llvm-svn: 241792	2015-07-09 09:50:54 +00:00
Pawel Bylica	eb122f2baf	Fix shift legalization and lowering for big constants. Summary: If shift amount is a constant value > 64 bit it is handled incorrectly during type legalization and X86 lowering. This patch changes the type of the shift amount argument in function DAGTypeLegalizer::ExpandShiftByConstant from unsigned to APInt. Reviewers: nadav, majnemer, sanjoy, RKSimon Subscribers: RKSimon, llvm-commits Differential Revision: http://reviews.llvm.org/D10767 llvm-svn: 241790	2015-07-09 08:01:36 +00:00
Elena Demikhovsky	37a4da825f	Extended syntax of vector version of getelementptr instruction. The justification of this change is here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-March/082989.html According to the current GEP syntax, vector GEP requires that each index must be a vector with the same number of elements. %A = getelementptr i8, <4 x i8*> %ptrs, <4 x i64> %offsets In this implementation I let each index be either a vector or a scalar. All vector indices must have the same number of elements. The scalar value will mean the splat vector value. (1) %A = getelementptr i8, i8* %ptr, <4 x i64> %offsets or (2) %A = getelementptr i8, <4 x i8*> %ptrs, i64 %offset In all cases the %A type is <4 x i8*> In the case (2) we add the same offset to all pointers. The case (1) covers C[B[i]] case, when we have the same base C and different offsets B[i]. The documentation is updated. http://reviews.llvm.org/D10496 llvm-svn: 241788	2015-07-09 07:42:48 +00:00
Adam Nemet	b41d2d3fa3	[LAA] Fix line break in comment llvm-svn: 241785	2015-07-09 06:47:21 +00:00
Adam Nemet	5dc3b2cf53	[LAA] Rename IsRTNeeded to IsRTCheckAnalysisNeeded The original name was too close to NeedRTCheck which is what the actual memcheck analysis returns. This flag, as the new name suggests, is only used to decide whether to initiate that analysis. Also a comment is added to answer one question I had about this code for a long time. Namely, how does this flag differ from isDependencyCheckNeeded since they are seemingly set at the same time. llvm-svn: 241784	2015-07-09 06:47:18 +00:00
Mehdi Amini	157e5a6d10	Remove getDataLayout() from TargetSelectionDAGInfo (had no users) Summary: Remove empty subclass in the process. This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, llvm-commits, rafael, yaron.keren, ted Differential Revision: http://reviews.llvm.org/D11045 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241780	2015-07-09 02:10:08 +00:00
Mehdi Amini	a749f2ad47	Remove getDataLayout() from TargetLowering Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: yaron.keren, rafael, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D11042 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241779	2015-07-09 02:09:52 +00:00
Mehdi Amini	0cdec1e2ab	Make isLegalAddressingMode() taking DataLayout as an argument Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, llvm-commits, rafael, yaron.keren Differential Revision: http://reviews.llvm.org/D11040 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241778	2015-07-09 02:09:40 +00:00
Mehdi Amini	5c183d5239	Make getByValTypeAlignment() taking DataLayout as an argument Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: yaron.keren, rafael, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D11038 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241777	2015-07-09 02:09:28 +00:00
Mehdi Amini	9639d650bb	Make TargetLowering::getShiftAmountTy() taking DataLayout as an argument Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, llvm-commits, rafael, yaron.keren Differential Revision: http://reviews.llvm.org/D11037 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241776	2015-07-09 02:09:20 +00:00
Mehdi Amini	44ede33a69	Make TargetLowering::getPointerTy() taking DataLayout as an argument Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, ted, yaron.keren, rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D11028 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241775	2015-07-09 02:09:04 +00:00
Mehdi Amini	5010ebf181	Make TargetTransformInfo keeping a reference to the Module DataLayout DataLayout is no longer optional. It was initialized with or without a DataLayout, and the DataLayout when supplied could have been the one from the TargetMachine. Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, llvm-commits, rafael, yaron.keren Differential Revision: http://reviews.llvm.org/D11021 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241774	2015-07-09 02:08:42 +00:00
Mehdi Amini	56228dabfa	Redirect DataLayout from TargetMachine to Module in ComputeValueVTs() Summary: Avoid using the TargetMachine owned DataLayout and use the Module owned one instead. This requires passing the DataLayout up the stack to ComputeValueVTs(). This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, yaron.keren, rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D11019 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241773	2015-07-09 01:57:34 +00:00
David Majnemer	3f49e662c8	[CodeView] Add support for emitting column information Column information is present in CodeView when the line table subsection has bit 0 set to 1 in its flags field. The column information is represented as a pair of 16-bit quantities: a starting and ending column. This information is present at the end of the chunk, after all the line-PC pairs. llvm-svn: 241764	2015-07-09 00:19:51 +00:00
Adam Nemet	943befedf1	[LAA] Fix misleading use of word 'consecutive' Fix some places where the word consecutive is used but the code really means constant-stride (i.e. not just unit stride). llvm-svn: 241763	2015-07-09 00:03:22 +00:00
Alex Lorenz	4d026b89da	MIR Serialization: Serialize the 'undef' register machine operand flag. llvm-svn: 241762	2015-07-08 23:58:31 +00:00
Sanjay Patel	1319446195	[SLPVectorizer] Try different vectorization factors for store chains ...and set max vector register size based on target This patch is based on discussion on the llvmdev mailing list: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-July/087405.html and also solves: https://llvm.org/bugs/show_bug.cgi?id=17170 Several FIXME/TODO items are noted in comments as potential improvements. Differential Revision: http://reviews.llvm.org/D10950 llvm-svn: 241760	2015-07-08 23:40:55 +00:00
Matthias Braun	91e85d4327	RegisterPressure: Add PressureDiff::dump() Also display the pressure diff in the case of a getMaxUpwardPressureDelta() verify failure. llvm-svn: 241759	2015-07-08 23:40:27 +00:00
Adam Nemet	424edc6c80	[LAA] Revert a small part of r239295 This commit ([LAA] Fix estimation of number of memchecks) regressed the logic a bit. We shouldn't quit the analysis if we encounter a pointer without known bounds unless we actually need to emit a memcheck for it. The original code was using NumComparisons which is now computed differently. Instead I compute NeedRTCheck from NumReadPtrChecks and NumWritePtrChecks. As side note, I find the separation of NeedRTCheck and CanDoRT confusing, so I will try to merge them in a follow-up patch. llvm-svn: 241756	2015-07-08 22:58:48 +00:00
Juergen Ributzka	d25407e972	Run clang-format before making changes to StackMaps. NFC. llvm-svn: 241754	2015-07-08 22:42:09 +00:00
Sanjay Patel	093fb170a6	[x86] enable machine combiner reassociations for scalar single-precision multiplies llvm-svn: 241752	2015-07-08 22:35:20 +00:00
Rafael Espindola	4104fe8ae9	Don't reject an archive with just a symbol table. It is pretty unambiguous how to interpret it and gnu ar accepts it too. llvm-svn: 241750	2015-07-08 22:27:54 +00:00
Rafael Espindola	c91177e410	Disallow Archive::child_iterator that don't point to an archive. NFC, just less error prone. llvm-svn: 241747	2015-07-08 22:15:07 +00:00
Michael Zolotukhin	97295ea7dd	[LoopVectorizer] Rename BypassBlock to VectorPH, and CheckBlock to NewVectorPH. NFCI. llvm-svn: 241742	2015-07-08 21:48:03 +00:00
Michael Zolotukhin	8c874bb2f1	[LoopVectorizer] Restructurize code for emitting RT checks. NFCI. Place all code corresponding to a run-time check in one place. Previously we generated some code, then proceeded to a next check, then finished the code for the first check (like splitting blocks and generating branches). Now the code for generating a check is self-contained. llvm-svn: 241741	2015-07-08 21:47:59 +00:00
Michael Zolotukhin	66f5591f9b	[LoopVectorizer] Remove redundant variables PastOverflowCheck and OverflowCheckAnchor. NFCI. llvm-svn: 241740	2015-07-08 21:47:56 +00:00
Michael Zolotukhin	00345cadd5	[LoopVectorizer] Move some code around to ease further refactoring. NFCI. llvm-svn: 241739	2015-07-08 21:47:53 +00:00
Michael Zolotukhin	7db3063f87	[LoopVectorizer] Remove redundant variable LastBypassBlock. NFC. llvm-svn: 241738	2015-07-08 21:47:47 +00:00
Alex Lorenz	df08179d1b	MIR Parser: Remove redundant TODO comment. NFC. This TODO comment has been redundant since r240474. llvm-svn: 241737	2015-07-08 21:30:21 +00:00
Alex Lorenz	495ad87919	MIR Serialization: Serialize the 'killed' register machine operand flag. llvm-svn: 241734	2015-07-08 21:23:34 +00:00
Diego Novillo	13e20f1bbf	Add missing dependency to Hexagon target. A recent patch added calls to isInstructionTriviallyDead without the corresponding dependency on TransformUtils. llvm-svn: 241731	2015-07-08 21:13:37 +00:00
Rafael Espindola	80c662d243	Use a raw_svector_ostream and simplify a loop. NFC. llvm-svn: 241727	2015-07-08 21:07:18 +00:00
Reid Kleckner	4f21df2b96	[Win64] Only treat some functions as having the Win64 convention All the usual X86 target-specific conventions are collapsed to the normal Win64 convention, but the custom conventions like GHC and webkit should not be. Previously we would assume that the caller allocated 32 bytes of shadow space for us, which is not how webkit_jscc or other custom conventions are supposed to work. Based on a patch by peavo@outlook.com. Fixes PR24051. llvm-svn: 241725	2015-07-08 21:03:47 +00:00
Rafael Espindola	a2ed0b0bab	Start adding support for writing archives in BSD format. No support for the symbol table yet (but will hopefully add it today). We always use the long filename format so that we can align the member, which is an advantage of the BSD format. llvm-svn: 241721	2015-07-08 20:47:32 +00:00
Alex Lorenz	b1f9ce8fc9	MIR Parser: Use source locations for MBB naming errors. This commit changes the type of the field 'Name' in the struct 'yaml::MachineBasicBlock' from 'std::string' to 'yaml::StringValue'. This change allows the MIR parser to report errors related to the MBB name with the proper source locations. llvm-svn: 241718	2015-07-08 20:22:20 +00:00
Sanjay Patel	c1afa95a51	early exits -> less indenting; NFCI llvm-svn: 241716	2015-07-08 19:32:39 +00:00
Krzysztof Parzyszek	79b2433e7c	[Hexagon] Implement commoning of GetElementPtr instructions llvm-svn: 241714	2015-07-08 19:22:28 +00:00
Peter Collingbourne	7a544f7327	LibDriver: Fix output path inference. The inferred output file name is based on the first input file, not the first one with extension .obj. The output file was also being written to the wrong directory; it needs to be written to whichever directory on the libpath it was found in. This change fixes both issues. llvm-svn: 241710	2015-07-08 19:00:46 +00:00
Adam Nemet	0131a5693a	[LAA] Add missing debug output after r239285 r239285 ([LoopAccessAnalysis] Teach LAA to check the memory dependence between strided accesses.) introduced a new case under MemoryDepChecker::isDependent. We normally have debug output for each case. llvm-svn: 241707	2015-07-08 18:47:38 +00:00
Reid Kleckner	ed012dbf2a	[SEH] Ensure that empty __except blocks have their own BB The 32-bit lowering assumed that WinEHPrepare had this invariant. WinEHPrepare did it for C++, but not SEH. The result was that we would insert calls to llvm.x86.seh.restoreframe in normal basic blocks, which corrupted the frame pointer. llvm-svn: 241699	2015-07-08 18:08:52 +00:00
Duncan P. N. Exon Smith	ad98745561	MC: Constify MCSubtargetInfo in getDeprecationInfo(), NFC There's no reason to be able to mutate `MCSubtargetInfo` in `getDeprecationInfo()`. Constify the reference. llvm-svn: 241693	2015-07-08 17:30:55 +00:00
Rafael Espindola	65a9953d69	Inline function into only use. llvm-svn: 241692	2015-07-08 17:26:24 +00:00
Rafael Espindola	c291a4b212	Add a helper function to reduce a bit of code duplication. llvm-svn: 241691	2015-07-08 17:08:26 +00:00
Eli Bendersky	8e131f8cbc	Cosmetic cleanups - NFC Remove commented lines, trailing whitespace, etc. llvm-svn: 241687	2015-07-08 16:33:21 +00:00
James Y Knight	f238d176eb	[SPARC] Cleanup handling of the Y/ASR registers. - Implement copying ASR to/from GPR regs. - Mark ASRs as non-allocatable, so it won't try to arbitrarily use them inappropriately. - Instead of inserting explicit WRASR/RDASR nodes in the MUL/DIV routines, just do normal register copies. - Also...mark div as using Y, not just writing it. Added a test case with some code which previously died with an assertion failure (with -O0), or produced wrong code (otherwise). (Third time's the charm?) Differential Revision: http://reviews.llvm.org/D10401 llvm-svn: 241686	2015-07-08 16:25:12 +00:00
Rafael Espindola	51271bdc4f	Use a range loop. NFC. llvm-svn: 241685	2015-07-08 16:16:15 +00:00
Krzysztof Parzyszek	21b53a5120	[Hexagon] Generate "insert" instructions more aggressively llvm-svn: 241683	2015-07-08 14:47:34 +00:00
Krzysztof Parzyszek	d19b4767ff	Revert 241681: causes Windows builds to fail llvm-svn: 241682	2015-07-08 14:34:13 +00:00
Krzysztof Parzyszek	712b15b45e	[Hexagon] Generate "insert" instructions more aggressively llvm-svn: 241681	2015-07-08 14:22:27 +00:00
Silviu Baranga	1b6b50a921	[LAA] Merge memchecks for accesses separated by a constant offset Summary: Often filter-like loops will do memory accesses that are separated by constant offsets. In these cases it is common that we will exceed the threshold for the allowable number of checks. However, it should be possible to merge such checks, since a check of any interval against two other intervals separated by a constant offset (a,b), (a+c, b+c) will be equivalent with a check against (a, b+c), as long as (a,b) and (a+c, b+c) overlap. Assuming the loop will be executed for a sufficient number of iterations, this will be true. If not true, checking against (a, b+c) is still safe (although not equivalent). As long as there are no dependencies between two accesses, we can merge their checks into a single one. We use this technique to construct groups of accesses, and then check the intervals associated with the groups instead of checking the accesses directly. Reviewers: anemet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10386 llvm-svn: 241673	2015-07-08 09:16:33 +00:00
Simon Pilgrim	752de5dff2	[X86][SSE] Added (V)ROUNDSD + (V)ROUNDSS stack folding support llvm-svn: 241671	2015-07-08 08:07:57 +00:00
Karthik Bhat	d2bc0d8423	Allow constfolding of llvm.sin.* and llvm.cos.* intrinsics This patch const folds llvm.sin.* and llvm.cos.* intrinsics whenever feasible. Differential Revision: http://reviews.llvm.org/D10836 llvm-svn: 241665	2015-07-08 03:55:47 +00:00
Mehdi Amini	ffc1402fad	Remove IsLittleEndian from TargetLowering and redirect to DataLayout Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: llvm-commits, rafael, yaron.keren Differential Revision: http://reviews.llvm.org/D11017 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241655	2015-07-08 01:00:38 +00:00
Mehdi Amini	f50daedfc7	Redirect DataLayout from TargetMachine to Module in SjLjEHPrepare Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: yaron.keren, rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D11009 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241654	2015-07-08 01:00:31 +00:00
Reid Kleckner	e69bdb8619	[WinEH] Make llvm.x86.seh.restoreframe work for stack realignment prologues The incoming EBP value points to the end of a local stack allocation, so we can use that to restore ESI, the base pointer. Once we do that, we can use local stack allocations. If we know we need stack realignment, spill the original frame pointer in the prologue and reload it after restoring ESI. llvm-svn: 241648	2015-07-07 23:45:58 +00:00
Mehdi Amini	ed6edbf17a	Redirect DataLayout from TargetMachine to Module in StackProtector Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: llvm-commits, rafael, yaron.keren Differential Revision: http://reviews.llvm.org/D11010 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241646	2015-07-07 23:38:49 +00:00
Alex Lorenz	900b5cb2ab	MIR Printer: Use a module slot tracker to print global address operands. NFC. This commit adopts the 'ModuleSlotTracker' class, which was surfaced in r240842, to print the global address operands. This change ensures that the slot tracker won't have to be recreated every time a global address operand is printed, making the MIR printing more efficient. llvm-svn: 241645	2015-07-07 23:27:53 +00:00
Reid Kleckner	d5afc62ff6	[WinEH] Add localaddress intrinsic instead of using frameaddress Clang uses this for SEH finally. The new intrinsic will produce the right value when stack realignment is required. llvm-svn: 241643	2015-07-07 23:23:03 +00:00
Arnold Schwaighofer	3d43f66c91	Add more nvcasts Tim Northover has told me that they can occur when the compiler cleverly constructs constants - as demonstrated in the test case. rdar://21703486 llvm-svn: 241641	2015-07-07 23:13:18 +00:00
Dan Gohman	489abd7046	[WebAssembly] Set the scheduling preference. llvm-svn: 241637	2015-07-07 22:38:06 +00:00
Reid Kleckner	60381791b5	Rename llvm.frameescape and llvm.framerecover to localescape and localrecover Summary: Initially, these intrinsics seemed like part of a family of "frame" related intrinsics, but now I think that's more confusing than helpful. Initially, the LangRef specified that this would create a new kind of allocation that would be allocated at a fixed offset from the frame pointer (EBP/RBP). We ended up dropping that design, and leaving the stack frame layout alone. These intrinsics are really about sharing local stack allocations, not frame pointers. I intend to go further and add an `llvm.localaddress()` intrinsic that returns whatever register (EBP, ESI, ESP, RBX) is being used to address locals, which should not be confused with the frame pointer. Naming suggestions at this point are welcome, I'm happy to re-run sed. Reviewers: majnemer, nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11011 llvm-svn: 241633	2015-07-07 22:25:32 +00:00
Sanjay Patel	d4e1bb89e3	fix typo; NFC llvm-svn: 241629	2015-07-07 21:31:54 +00:00
Alex Lorenz	cbbfd0b194	MIR Serialization: Serialize the 'dead' register machine operand flag. llvm-svn: 241624	2015-07-07 20:34:53 +00:00
Mehdi Amini	8ac7a9d57a	Redirect DataLayout from TargetMachine to Module in SelectionDAG Summary: SelectionDAG itself is not invoking directly the DataLayout in the TargetMachine, but the "TargetLowering" class is still using it. I'll address it in a following commit. This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11000 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241618	2015-07-07 19:07:19 +00:00
David Majnemer	6cc21f909c	Revert "Revert r241570, it caused PR24053" This reverts commit r241602. We had a latent bug in SCCP where we would make a basic block empty and then proceed to ask questions about its terminator. llvm-svn: 241616	2015-07-07 18:49:41 +00:00
Mehdi Amini	f6727b0da1	Redirect DataLayout from TargetMachine to Module in GlobalMerge Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10987 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241615	2015-07-07 18:49:25 +00:00
Mehdi Amini	4fe3798dca	Redirect DataLayout from TargetMachine to Module in CodeGen Prepare Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10986 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241614	2015-07-07 18:45:17 +00:00
Mehdi Amini	7da8b536f4	Redirect DataLayout from TargetMachine to Module in FastISel Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10985 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241613	2015-07-07 18:39:02 +00:00
Arnold Schwaighofer	4bc34b1515	Add a pattern for a nvcast from v2f64 -> v4f32 Since the NvCast is generated by the selection process the concerns about endianness and bit reversal don't apply. rdar://21703486 llvm-svn: 241611	2015-07-07 18:31:55 +00:00
Mehdi Amini	42e9f96712	Redirect DataLayout from TargetMachine to Module in MachineFunction Summary: This change is part of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10984 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 241610	2015-07-07 18:20:57 +00:00
Reid Kleckner	af04c2a972	Use default member initializers to deduplicate code in X86MachineFunctionInfo, NFC llvm-svn: 241609	2015-07-07 18:12:06 +00:00
Rafael Espindola	99d294ee9a	Fix the -DBUILD_SHARED_LIBS=ON build. llvm-svn: 241608	2015-07-07 17:48:00 +00:00
Alex Lorenz	7a503facdf	MIR Parser: wrap 'MBBSlots' from the MI parsing functions in a struct. NFC. This commit modifies the interface for the machine instruction parsing functions by wrapping the parameter 'MBBSlots' in a new structure called 'PerFunctionMIParsingState'. This change is useful as in the future I will be able to pass new parameters to the machine instruction parser just by modifying the 'PerFunctionMIParsingState' structure instead of adding a new parameter to each function. llvm-svn: 241607	2015-07-07 17:46:43 +00:00
Rafael Espindola	be8b0ea854	Delete UnknownAddress. It is a perfectly valid symbol value. getSymbolValue now returns a value that is convenient for most callers: * 0 for undefined * symbol size for common symbols * offset/address for the rest of the symbols Code that needs something more specific can check getSymbolFlags. llvm-svn: 241605	2015-07-07 17:12:59 +00:00
Rafael Espindola	ef888a4db6	Simplify by passing in the section of the symbol. NFC. llvm-svn: 241603	2015-07-07 16:45:55 +00:00
Nico Weber	feb13e9e09	Revert r241570, it caused PR24053 llvm-svn: 241602	2015-07-07 16:42:50 +00:00
Krzysztof Parzyszek	a45971ac94	[Hexagon] Fix unused variable warnings in NDEBUG build caused by r241595 llvm-svn: 241600	2015-07-07 16:02:11 +00:00
Reid Kleckner	9200b2f93b	[WinEH] Add a report_fatal_error for 32-bit stack realignment This type of prologue isn't supported yet. Implementing it should be a matter of copying the adjusted incoming EBP into ESI (the base pointer) instead of EBP. The original EBP can be saved and restored from other memory afterwards. llvm-svn: 241597	2015-07-07 15:47:29 +00:00
Krzysztof Parzyszek	e53b31a593	[Hexagon] Implement bit-tracking facility with specifics for Hexagon This includes code that is intended to be target-independent as well as the Hexagon-specific details. This is just the framework without any users. llvm-svn: 241595	2015-07-07 15:16:42 +00:00
Rafael Espindola	7e7be92c7f	Common symbols don't have a value. At least not in the interface exposed by ObjectFile. This matches what ELF and COFF implement. Adjust existing code that was expecting them to have values. No overall functionality change intended. Another option would be to change the interface and the ELF and COFF implementations to say that the value of a common symbol is its size. llvm-svn: 241593	2015-07-07 15:05:09 +00:00
Sanjay Patel	cf0a80728c	use range-based for loops; NFCI llvm-svn: 241592	2015-07-07 15:03:53 +00:00
Rafael Espindola	d82477278b	Common symbols are not undefined, at least for ObjectFile. They are implemented like that in some object formats, but for the interface provided by lib/Object, SF_Undefined and SF_Common are different things. This matches the ELF and COFF implementation and fixes llvm-nm for MachO. llvm-svn: 241587	2015-07-07 14:26:39 +00:00
Rafael Espindola	05cbccc649	Simplify, NFC. In these two contexts we really just want the raw n_value. No need to use getSymbolValue which checks for special cases where, semantically, the symbol has no value. llvm-svn: 241584	2015-07-07 13:58:32 +00:00
David Majnemer	381326d771	[IR] Make getFirstNonPHI return null if the BB is empty getFirstNonPHI's documentation states that it returns null if there is no non-PHI instruction. However, it instead returns a pointer to the end iterator. The implementation of getFirstNonPHI claims that dereferencing the iterator will result in an assertion failure but this doesn't occur. Instead, machinery like getFirstInsertionPt will attempt to isa<> this invalid memory which results in unpredictable behavior. Instead, make getFirst* return null if no such instruction exists. llvm-svn: 241570	2015-07-07 09:15:29 +00:00
Denis Protivensky	b612902faa	Fix gcc warnings of different enum and non-enum types in ternaries llvm-svn: 241567	2015-07-07 07:48:48 +00:00
Akira Hatanaka	1bc8af78f4	[ARM] Define a subtarget feature and use it to decide whether long calls should be emitted. This is needed to enable ARM long calls for LTO and enable and disable it on a per-function basis. Out-of-tree projects currently using EnableARMLongCalls to emit long calls should start passing "+long-calls" to the feature string (see the changes made to clang in r241565). rdar://problem/21529937 Differential Revision: http://reviews.llvm.org/D9364 llvm-svn: 241566	2015-07-07 06:54:42 +00:00
Alex Lorenz	36962cd925	MIR Parser: Verify the implicit machine register operands. This commit verifies that the parsed machine instructions contain the implicit register operands as specified by the MCInstrDesc. Variadic and call instructions aren't verified. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10781 llvm-svn: 241537	2015-07-07 02:08:46 +00:00
Juergen Ributzka	9622cdf4b9	[StackMap Liveness] Calling the base class' getAnalysisUsage method. NFCI. Calling into the base class' getAnalysisUsage method after we did our pass specific modifications. This shouldn't really matter since this is the last pass in the pipeline anyways. llvm-svn: 241536	2015-07-07 02:05:18 +00:00
Juergen Ributzka	c111fcc0a0	[StackMap Liveness] No need to cache the MachineFunction. NFC. Don't cache the MachineFunction in the pass and range'ify some loops. llvm-svn: 241535	2015-07-07 02:05:15 +00:00
Benjamin Kramer	4ea14a671d	[Triple] Add a helper to switch between big/little endian variants This will be used from clang's driver. llvm-svn: 241527	2015-07-06 23:58:14 +00:00
Sanjoy Das	8ee6a30b8d	[FaultMaps] Add statistic to count the # of implicit null checks. llvm-svn: 241521	2015-07-06 23:32:10 +00:00
Alex Lorenz	cb268d46f0	MIR Serialization: Serialize the implicit register flag. This commit serializes the implicit flag for the register machine operands. It introduces two new keywords into the machine instruction syntax: 'implicit' and 'implicit-def'. The 'implicit' keyword is used for the implicit register operands, and the 'implicit-def' keyword is used for the register operands that have both the implicit and the define flags set. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10709 llvm-svn: 241519	2015-07-06 23:07:26 +00:00
Eric Christopher	96353b3281	Remove JumpInstrTableInfo.h as it is no longer used. llvm-svn: 241517	2015-07-06 22:55:20 +00:00
Simon Pilgrim	40343e6b3a	[X86][AVX] Add support for shuffle decoding of vperm2f128/vperm2i128 with zero'd lanes The vperm2f128/vperm2i128 shuffle mask decoding was not attempting to deal with shuffles that give zero lanes. This patch fixes this so that the assembly printer can provide shuffle comments. As this decoder is also used in X86ISelLowering for shuffle combining, I've added an early-out to match existing behaviour. The hope is that we can add zero support in the future, this would allow other ops' decodes (e.g. insertps) to be combined as well. Differential Revision: http://reviews.llvm.org/D10593 llvm-svn: 241516	2015-07-06 22:46:46 +00:00
Sanjay Patel	681a56ac58	[x86] extend machine combiner reassociation optimization to SSE scalar adds Extend the reassociation optimization of http://reviews.llvm.org/rL240361 (D10460) to SSE scalar FP SP adds in addition to AVX scalar FP SP adds. With the 'switch' in place, we can trivially add other opcodes and test cases in future patches. Differential Revision: http://reviews.llvm.org/D10975 llvm-svn: 241515	2015-07-06 22:35:29 +00:00
Simon Pilgrim	8fbf1c1f4a	[X86][SSE] Vectorized i64 uniform constant SRA shifts This patch adds vectorization support for uniform constant i64 arithmetic shift right operators. Differential Revision: http://reviews.llvm.org/D9645 llvm-svn: 241514	2015-07-06 22:35:19 +00:00
JF Bastien	86bc91508d	WebAssembly: add some TODO Reviewers: sunfish Subscribers: llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D10971 llvm-svn: 241513	2015-07-06 21:41:59 +00:00
Reid Kleckner	da76bd444f	[WinEH] Insert the EH code load before the block terminator The previous code put the load after the terminator, leading to invalid IR and downstream crashes. This caused http://crbug.com/506446. llvm-svn: 241509	2015-07-06 21:13:43 +00:00
Simon Pilgrim	d85cae3d52	[X86][SSE4A] Shuffle lowering using SSE4A EXTRQ/INSERTQ instructions This patch adds support for v8i16 and v16i8 shuffle lowering using the immediate versions of the SSE4A EXTRQ and INSERTQ instructions. Although rather limited (they can only act on the lower 64-bits of the source vectors, leave the upper 64-bits of the result vector undefined and don't have VEX encoded variants), the instructions are still useful for the zero extension of any lane (EXTRQ) or inserting a lane into another vector (INSERTQ). Testing demonstrated that it wasn't typically worth it to use these instructions for v2i64 or v4i32 vector shuffles although they are capable of it. As well as adding specific pattern matching for the shuffles, the patch uses EXTRQ for zero extension cases where SSE41 isn't available and it's more efficient than the SSE2 'unpack' default approach. It also adds shuffle decode support for the EXTRQ / INSERTQ cases when the instructions are handling full byte-sized extractions / insertions. From this foundation, future patches will be able to make use of the instructions for situations that use their ability to extract/insert at the bit level. Differential Revision: http://reviews.llvm.org/D10146 llvm-svn: 241508	2015-07-06 20:46:41 +00:00
Simon Pilgrim	8b756596fc	[X86][SSE] Use the general SMAX/SMIN/UMAX/UMIN opcodes and remove the X86 implementation With the completion of D9746 there is now a common implementation of integer signed/unsigned min/max nodes, removing the need for the equivalent X86 specific implementations. This patch removes the old X86ISD nodes, legalizes the relevant SSE2/SSE41/AVX2/AVX512 instructions for the ISD versions and converts the small amount of existing X86 code. Differential Revision: http://reviews.llvm.org/D10947 llvm-svn: 241506	2015-07-06 20:30:47 +00:00
Quentin Colombet	40dd510a73	[TwoAddressInstructionPass] Rename a variable to match the coding style. Spotted by Bruno. llvm-svn: 241505	2015-07-06 20:12:54 +00:00
Reid Kleckner	fc0f93832b	[llvm-extract] Drop comdats from declarations The verifier rejects comdats on declarations. llvm-svn: 241483	2015-07-06 18:48:02 +00:00
Alex Lorenz	e2d75239d1	llc: Add a 'run-pass' option. This commit adds a 'run-pass' option to llc, which instructs the compiler to run one specific code generation pass only. Llc already has the 'start-after' and the 'stop-after' options, and this new option complements the other two by making it easier to write tests that want to invoke a single pass only. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10776 llvm-svn: 241476	2015-07-06 17:44:26 +00:00
Matt Arsenault	db7781c6e9	AMDGPU: Run SIInsertWaits as pre-emit pass Running this after the scheduler enables scheduling waits later so other ALU instructions can run while this would be waiting. When combined with enabling the post-RA scheduler, this gives about a 20% improvement on sgemm. llvm-svn: 241473	2015-07-06 17:02:20 +00:00
Daniel Sanders	f423f5627c	Change the last few internal StringRef triples into Triple objects. Summary: This concludes the patch series to eliminate StringRef forms of GNU triples from the internals of LLVM that began in r239036. At this point, the StringRef-form of GNU Triples should only be used in the public API (including IR serialization) and a couple objects that directly interact with the API (most notably the Module class). The next step is to replace these Triple objects with the TargetTuple object that will represent our authoritative/unambiguous internal equivalent to GNU Triples. Reviewers: rengolin Subscribers: llvm-commits, jholewinski, ted, rengolin Differential Revision: http://reviews.llvm.org/D10962 llvm-svn: 241472	2015-07-06 16:56:07 +00:00
Adrian Prantl	4276d4a8d0	DIBuilder: Don't rauw null pointers with empty arrays in finalize(). This makes the IR a little easier to read. llvm-svn: 241470	2015-07-06 16:36:02 +00:00
Daniel Sanders	fbdab437f0	Where Triple has a suitable predicate, use it rather than the enum values. NFC. Reviewers: mcrosier Subscribers: llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10960 llvm-svn: 241469	2015-07-06 16:33:18 +00:00
Sanjay Patel	d2b7144c4a	use range-based for loops; NFCI llvm-svn: 241468	2015-07-06 16:27:35 +00:00
Teresa Johnson	d3a33a16bb	Resubmit "Add new EliminateAvailableExternally module pass" (r239480) This change includes a fix for https://code.google.com/p/chromium/issues/detail?id=499508#c3, which required updating the visibility for symbols with eliminated definitions. --Original Commit Message-- Add new EliminateAvailableExternally module pass, which is performed in O2 compiles just before GlobalDCE, unless we are preparing for LTO. This pass eliminates available externally globals (turning them into declarations), regardless of whether they are dead/unreferenced, since we are guaranteed to have a copy available elsewhere at link time. This enables additional opportunities for GlobalDCE. If we are preparing for LTO (e.g. a -flto -c compile), the pass is not included as we want to preserve available externally functions for possible link time inlining. The FE indicates whether we are doing an -flto compile via the new PrepareForLTO flag on the PassManagerBuilder. llvm-svn: 241466	2015-07-06 16:22:42 +00:00
Adrian Prantl	0e224a646c	Use an early exit in DIBuilder::finalize() to improve readability. llvm-svn: 241465	2015-07-06 16:22:12 +00:00
Sanjay Patel	6d4c3e3ded	use range-based for loops; NFCI llvm-svn: 241463	2015-07-06 16:19:14 +00:00
Matt Arsenault	706f930b72	AMDGPU/SI: Add debugging subtarget feature for DS offsets We don't have a good way to detect most situations where DS offsets are usable on SI, so add an option to force using them even if unsafe for debugging performance problems. llvm-svn: 241462	2015-07-06 16:01:58 +00:00
James Y Knight	89ac11de32	[Sparc] Add more instruction aliases. These are mostly from the chart in the SparcV8 spec, section "A.3 Synthetic Instructions". Differential Revision: http://reviews.llvm.org/D9834 llvm-svn: 241461	2015-07-06 16:01:07 +00:00
James Y Knight	7208a12eef	[Sparc] Add support for flush instruction. Differential Revision: http://reviews.llvm.org/D9833 llvm-svn: 241460	2015-07-06 16:01:04 +00:00
Rafael Espindola	76ad232179	Remove getRelocationAddress. Originally added in r139314. Back then it didn't actually get the address, it got whatever value the relocation used: address or offset. The values in different object formats are: * MachO: Always an offset. * COFF: Always an address, but when talking about the virtual address of sections it says: "for simplicity, compilers should set this to zero". * ELF: An offset for .o files and an address for .so files. In the case of the .so, the relocation is not linked to any section (sh_info is 0). We can't really compute an offset. Some API mappings would be: * Use getAddress for everything. It would be quite cumbersome. To compute the address elf has to follow sh_info, which can be corrupted and therefore the method has to return an ErrorOr. The address of the section is also the same for every relocation in a section, so we shouldn't have to check the error and fetch the value for every relocation. * Use a getValue and make it up to the user to know what it is getting. * Use a getOffset and: * Assert for dynamic ELF objects. That is a very peculiar case and it is probably fair to ask any tool that wants to support it to use ELF.h. The only tool we have that reads those (llvm-readobj) already does that. The only other use case I can think of is a dynamic linker. * Check that COFF .obj files have sections with zero virtual address spaces. If it turns out that some assembler/compiler produces these, we can change COFFObjectFile::getRelocationOffset to subtract it. Given COFF format, this can be done without the need for ErrorOr. The getRelocationAddress method was never implemented for COFF. It also had exactly one use in a very peculiar case: a shortcut for adding the section value to a pcrel reloc on MachO. Given that, I don't expect that there is any use out there of the C API. If that is not the case, let me know and I will add it back with the implementation inlined and do a proper deprecation. llvm-svn: 241450	2015-07-06 14:55:37 +00:00
Chad Rosier	85a346395e	Fix a bug in the A57FPLoadBalancing register tracking/scavenger. The code in AArch64A57FPLoadBalancing::scavengeRegister() to handle dead defs was not correctly handling aliased registers. E.g. if the dead def was of D2, then S2 was not being marked as unavailable, so it could potentially be used across a live-range in which it would be clobbered. Patch by Geoff Berry <gberry@codeaurora.org>! Phabricator: http://reviews.llvm.org/D10900 llvm-svn: 241449	2015-07-06 14:46:34 +00:00
Rafael Espindola	76d650e8d7	Check that COFF .obj files have sections with zero virtual address spaces. When talking about the virtual address of sections the coff spec says: ... for simplicity, compilers should set this to zero. Otherwise, it is an arbitrary value that is subtracted from offsets during relocation. We don't currently subtract it, so check that it is zero. If some producer does create such files, we can change getRelocationOffset instead. llvm-svn: 241447	2015-07-06 14:26:07 +00:00
Asaf Badouh	c6f3c82ffc	[X86][AVX512] Multiply Packed Unsigned Integers with Round and Scale pmulhrsw review: http://reviews.llvm.org/D10948 llvm-svn: 241443	2015-07-06 14:03:40 +00:00
Petar Jovanovic	0326a06c15	[Mips] Add support for MCJIT for MIPS32r6 Add support for resolving MIPS32r6 relocations in MCJIT. Patch by Vladimir Radosavljevic. Differential Revision: http://reviews.llvm.org/D10687 llvm-svn: 241442	2015-07-06 12:50:55 +00:00
Craig Topper	d26d2d9a50	[TableGen] Change a couple methods to return an ArrayRef instead of a const std::vector reference. NFC llvm-svn: 241430	2015-07-06 06:23:01 +00:00
Sanjay Patel	cc9fad0bf7	remove unnecessary temp variable; NFCI llvm-svn: 241415	2015-07-05 21:21:47 +00:00
Peter Collingbourne	46eb0f539c	Verifier: Forbid comdats on linker declarations. Differential Revision: http://reviews.llvm.org/D10945 llvm-svn: 241414	2015-07-05 20:52:40 +00:00
Peter Collingbourne	6a9d1774d0	IR: Do not consider available_externally linkage to be linker-weak. From the linker's perspective, an available_externally global is equivalent to an external declaration (per isDeclarationForLinker()), so it is incorrect to consider it to be a weak definition. Also clean up some logic in the dead argument elimination pass and clarify its comments to better explain how its behavior depends on linkage, introduce GlobalValue::isStrongDefinitionForLinker() and start using it throughout the optimizers and backend. Differential Revision: http://reviews.llvm.org/D10941 llvm-svn: 241413	2015-07-05 20:52:35 +00:00
Sanjay Patel	a4860f3af2	use range-based for loops; NFCI llvm-svn: 241412	2015-07-05 20:15:21 +00:00
Benjamin Kramer	9bfb627a0e	[TargetLowering] StringRefize asm constraint getters. There is some functional change here because it changes target code from atoi(3) to StringRef::getAsInteger which has error checking. For valid constraints there should be no difference. llvm-svn: 241411	2015-07-05 19:29:18 +00:00
Asaf Badouh	73f26f8ffc	[x86][AVX512] add Multiply High Op, including encoding and intrinsics tests. review http://reviews.llvm.org/D10896 llvm-svn: 241406	2015-07-05 12:23:20 +00:00
Michael Kuperstein	5f05153fbb	[X86] Fix incorrect/inefficient pushw encodings for x86-64 targets Correctly support assembling "pushw $imm8" on x86-64 targets. Also some cleanup of the PUSH instructions (PUSH64i16 and PUSHi16 actually represent the same instruction) This fixes PR23996 Patch by: david.l.kreitzer@intel.com Differential Revision: http://reviews.llvm.org/D10878 llvm-svn: 241404	2015-07-05 10:25:41 +00:00
Nemanja Ivanovic	d358b8f80d	Add missing builtins to the PPC back end for ABI compliance (vol. 2) This patch corresponds to review: http://reviews.llvm.org/D10874 Back end portion of the second round of additions to altivec.h. llvm-svn: 241398	2015-07-05 06:03:51 +00:00
Sanjay Patel	f73f8919ed	use range-based for loops; NFCI llvm-svn: 241395	2015-07-04 19:38:52 +00:00
Simon Pilgrim	ea1b6ee366	[X86][SSE] Improved i8/i16 to f64 uint2fp vector conversions Followup to D10433 and D10589 that fixes i8/i16 uint2fp vector conversions by zero extending to i32 and using the sint2fp path (unless the target does actually support uint2fp). llvm-svn: 241394	2015-07-04 15:33:34 +00:00
Sanjay Patel	82db3b7d5e	use valid bits to avoid unnecessary machine trace metric recomputations Although this does cut the number of traces recomputed by ~10% for the test case mentioned in http://reviews.llvm.org/D10460, it doesn't make a dent in the overall performance. That example needs to be more selective when invalidating traces. llvm-svn: 241393	2015-07-04 15:00:28 +00:00
Yaron Keren	fffc068d68	Fix spelling, NFC. llvm-svn: 241392	2015-07-04 05:48:52 +00:00
Peter Collingbourne	17eff10f68	LTO: expose LTO_SYMBOL_ALIAS, which indicates that the symbol is an alias. This is needed for COFF linkers to distinguish between weak external aliases and regular symbols with LLVM weak linkage, which are represented as strong symbols in COFF. llvm-svn: 241389	2015-07-04 03:42:35 +00:00
Rui Ueyama	d5297ee724	Object/COFF: Do not rely on VirtualSize being 0 in object files. llvm-svn: 241387	2015-07-04 03:25:51 +00:00
Lang Hames	78937c2ae5	[RuntimeDyld] Skip relocations for external symbols with 64-bit address ~0ULL. Requested by Eugene Rozenfeld of the LLILC team, this feature allows JIT clients to skip relocations for selected external symbols by returning ~0ULL from their symbol resolver. If this value is returned for a given symbol, RuntimeDyld will skip all relocations for that symbol. The client will be responsible for applying the skipped relocations manually before the code is executed. llvm-svn: 241383	2015-07-04 01:35:26 +00:00
Craig Topper	de8395229a	[X86] Add proper 64-bit mode checks to jrcxz and jcxz. llvm-svn: 241381	2015-07-04 00:01:07 +00:00
Matt Arsenault	24e33d10a0	AMDGPU: Fix indentation of switch llvm-svn: 241380	2015-07-03 23:33:38 +00:00
Simon Atanasyan	5db0276925	[ELFYAML] Fix handling SHT_NOBITS sections by obj2yaml/yaml2obj tools SHT_NOBITS sections do not have content in an object file. Now the yaml2obj tool does not accept `Content` field for such sections, and the obj2yaml tool does not attempt to read the section content from a file. Restore r241350 and r241352. llvm-svn: 241377	2015-07-03 23:00:54 +00:00
Rafael Espindola	d74b4f0a32	Use a continue to reduce indentation. llvm-svn: 241375	2015-07-03 22:02:28 +00:00
Rafael Espindola	74d079b272	Use a continue to reduce indentation. llvm-svn: 241374	2015-07-03 21:57:41 +00:00
Rafael Espindola	a88a196f04	Context is allocated just a few lines above. Don't check if it is null. llvm-svn: 241373	2015-07-03 21:54:41 +00:00
Rafael Espindola	a4b2733c86	Fix build with -DLLVM_USE_INTEL_JITEVENTS=ON -DLLVM_USE_OPROFILE=ON. Is anyone using those? llvm-svn: 241372	2015-07-03 21:47:00 +00:00
Filipe Cabecinhas	0011c58444	Remove always-true comparison, NFC. Summary: Looking at r241279, I noticed that UpgradedIntrinsics only gets written to in the following code: if (UpgradeIntrinsicFunction(&F, NewFn)) UpgradedIntrinsics[&F] = NewFn; Looking through UpgradeIntrinsicFunction, we always return false OR NewFn will be set to a different function from our source. This patch pulls the F != NewFn into UpgradeIntrinsicFunction as an assert, and removes the check from callers of UpgradeIntrinsicFunction. Reviewers: rafael, chandlerc Subscribers: llvm-commits-list Differential Revision: http://reviews.llvm.org/D10915 llvm-svn: 241369	2015-07-03 20:12:01 +00:00

81412 Commits