llvm-project

Commit Graph

Author	SHA1	Message	Date
Hal Finkel	7529c55c02	Add a CFL Alias Analysis implementation This provides an implementation of CFL alias analysis (including some supporting data structures). Currently, we don't have any extremely fancy features, sans some interprocedural analysis (i.e. no field sensitivity, etc.), and we do best sitting behind BasicAA + TBAA. In such a configuration, we take ~0.6-0.8% of total compile time, and give ~7-8% NoAlias responses to queries TBAA and BasicAA couldn't answer when bootstrapping LLVM. In testing this on other projects, we've seen up to 10.5% of queries dropped by BasicAA+TBAA answered with NoAlias by this algorithm. Patch by George Burgess IV (with minor modifications by me -- mostly adapting some BasicAA tests), thanks! llvm-svn: 216970	2014-09-02 21:43:13 +00:00
Juergen Ributzka	dbe9e174b6	[FastISel][AArch64] Move over to target-dependent instruction selection only. This change moves FastISel for AArch64 to target-dependent instruction selection only. This change replicates the existing target-independent behavior, therefore there are no changes to the unit tests or new tests. Future changes will take advantage of this change and update functionality and unit tests. llvm-svn: 216955	2014-09-02 21:32:54 +00:00
Juergen Ributzka	7e998fb5e6	[FastISel] Provide the option to skip target-independent instruction selection. NFC. This allows the target to disable target-independent instruction selection and jump directly into the target-dependent instruction selection code. This can be beneficial for targets, such as AArch64, which could emit much better code, but never got a chance to do so, because the target-independent instruction selector was able to find an instruction sequence. llvm-svn: 216947	2014-09-02 21:07:44 +00:00
Yi Jiang	77a609b556	Generate extract for in-tree uses if the use is scalar operand in vectorized instruction. radar://18144665 llvm-svn: 216946	2014-09-02 21:00:39 +00:00
Matt Arsenault	b78875e979	R600/SI: Fix hardcoded register numbers in test llvm-svn: 216944	2014-09-02 20:43:07 +00:00
Sanjay Patel	3f7a24e400	Refactor LowerFABS and LowerFNEG into one function (x86) (NFC) We duplicate ~30 lines of code to lower FABS and FNEG for x86, so this patch combines them into one function. No functional change intended, so no additional test cases. Test-suite behavior is unchanged. Differential Revision: http://reviews.llvm.org/D5064 llvm-svn: 216942	2014-09-02 20:24:47 +00:00
Matt Arsenault	a3fc923818	cmake: Don't reject unknown cpp files that start with . Some editors create hidden file backups in the same directory as the file, and it's annoying when cmake errors on them. llvm-svn: 216941	2014-09-02 20:20:43 +00:00
Robin Morisset	4f6b93b1a8	Fix MemoryDependenceAnalysis in cases where QueryInstr is a CmpXchg or a AtomicRMW Summary: MemoryDependenceAnalysis is currently cautious when the QueryInstr is an atomic load or store, but I forgot to check for atomic cmpxchg/atomicrmw. This patch is a way of fixing that, and making it less brittle (i.e. no risk that I forget another possible kind of atomic, even if the IR ends up changing in the future), by adding a fallback checking mayReadOrWriteFromMemory. Thanks to Philip Reames for finding this bug and suggesting this solution in http://reviews.llvm.org/D4845 Sadly, I don't see how to add a test for this, since the passes depending on MemoryDependenceAnalysis won't trigger for an atomic rmw anyway. Does anyone see a way for testing it? Test Plan: none possible at first sight Reviewers: jfb, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5019 llvm-svn: 216940	2014-09-02 20:17:52 +00:00
Sanjay Patel	b2325b9ab3	Fix a logic bug when copying fast-math flags. "Setting" does not equal "copying". This bug has sat dormant for 2 reasons: 1. The unit test was not adequate. 2. Every current user of the "copyFastMathFlags" API is operating on a new instruction. (ie, all existing fast-math flags are off). If you copy flags to an existing instruction that has some flags on already, you will not necessarily turn them off as expected. I uncovered this bug while trying to implement a fix for PR20802. llvm-svn: 216939	2014-09-02 20:03:00 +00:00
Rafael Espindola	0e893f53dd	Add a note about AuroraUX to the release notes. llvm-svn: 216938	2014-09-02 19:49:39 +00:00
Matt Arsenault	907e64b436	Add note to documentation about machine node chains. I've been assuming chain operands were always the first operand, since the documentation says this. I was confused about why they were missing after instruction selection. Apparently the convention changes to using the last operand for MachineSDNodes and I've never noticed before. llvm-svn: 216934	2014-09-02 19:18:52 +00:00
Matt Arsenault	d1649db2fc	R600/SI: Add failing testcase. This is broken when 64-bit add is only partially moved to the VALU. llvm-svn: 216933	2014-09-02 19:12:31 +00:00
Matt Arsenault	c1a71217b3	Fix interference caused by fmul 2, x -> fadd x, x If an fmul was introduced by lowering, it wouldn't be folded into a multiply by a constant since the earlier combine would have replaced the fmul with the fadd. llvm-svn: 216932	2014-09-02 19:02:53 +00:00
Nick Kledzik	ac7cbdc9b1	Code review tweaks llvm-svn: 216931	2014-09-02 18:50:24 +00:00
Matt Arsenault	9d412ed41e	Fix crash when looking up the addrspace of GEPs with vector types Patch by Björn Steinbrink llvm-svn: 216930	2014-09-02 18:47:54 +00:00
Reid Kleckner	0b2bccc3cd	CodeGen: Handle va_start in the entry block Also fix a small copy-paste bug in X86ISelLowering where Chain should have been used in place of DAG.getEntryToken(). Fixes PR20828. llvm-svn: 216929	2014-09-02 18:42:44 +00:00
Matt Arsenault	965de3050f	Fix comment and unnecessary check for FP build_vectors. This was copy-paste from the integer version, but FP build_vectors don't truncate. llvm-svn: 216928	2014-09-02 18:33:51 +00:00
David Blaikie	78fdec5898	unique_ptrify LTOCodeGenerator::NativeObjectFile llvm-svn: 216927	2014-09-02 18:21:06 +00:00
David Blaikie	15913f46b2	unique_ptrify the result of SpecialCaseList::create llvm-svn: 216925	2014-09-02 18:13:54 +00:00
Hans Wennborg	e565d049ab	MCSchedule.h: fix VS2012 build after r216919 llvm-svn: 216924	2014-09-02 18:00:00 +00:00
David Blaikie	f55e31a986	unique_ptrify FileOutputBuffer::FileOutputBuffer llvm-svn: 216921	2014-09-02 17:49:23 +00:00
Alexey Samsonov	d37bab6197	Fix left shifts of negative values in MipsDisassembler. This bug was reported by UBSan. llvm-svn: 216920	2014-09-02 17:49:16 +00:00
Pete Cooper	1175945710	Change MCSchedModel to be a struct of statically initialized data. This removes static initializers from the backends which generate this data, and also makes this struct match the other Tablegen generated structs in behaviour Reviewed by Andy Trick and Chandler C llvm-svn: 216919	2014-09-02 17:43:54 +00:00
David Blaikie	505e1b829f	unique_ptrify PBQPBuilder::build llvm-svn: 216918	2014-09-02 17:42:01 +00:00
Alexey Samsonov	9ca4870b49	Fix signed integer overflow in PPCInstPrinter. This bug was reported by UBSan. llvm-svn: 216917	2014-09-02 17:38:34 +00:00
David Blaikie	14f7301392	Correct unique_ptr passing in MCObjectDisassembler::setFallbackRegion Rather than passing by lvalue reference, pass by value to ensure that the caller provides an rvalue (and use move assignment, rather than release+reset, to assign to the member variable) llvm-svn: 216916	2014-09-02 17:29:51 +00:00
Alexey Samsonov	1b0713ce09	Fix left shifts by too large exponents in MCParser (which happened only on error recovery path). This bug was reported by UBSan. llvm-svn: 216915	2014-09-02 17:25:29 +00:00
Andrea Di Biagio	b9de900788	Revert: [APFloat] Fixed a bug in method 'fusedMultiplyAdd'. This reverts revision 216913; the new test added at revision 216913 caused regression failures on a couple of buildbots. llvm-svn: 216914	2014-09-02 17:22:49 +00:00
Andrea Di Biagio	7676fe1878	[APFloat] Fixed a bug in method 'fusedMultiplyAdd'. When folding a fused multiply-add builtin call, make sure that we propagate the correct result in the case where the addend is zero, and the two other operands are finite non-zero. Example: define double @test() { %1 = call double @llvm.fma.f64(double 7.0, double 8.0, double 0.0) ret double %1 } Before this patch, the instruction simplifier wrongly folded the builtin call in function @test to constant 'double 7.0'. With this patch, method 'fusedMultiplyAdd' correctly evaluates the multiply and propagates the expected result (i.e. 56.0). Added test fold-builtin-fma.ll with the reproducible from PR20832 plus extra test cases to verify the behavior of method 'fusedMultiplyAdd' in the presence of NaN/Inf operands. This fixes PR20832. Differential Revision: http://reviews.llvm.org/D5152 llvm-svn: 216913	2014-09-02 16:44:56 +00:00
JF Bastien	12cc99eb13	Add missing override on ARMAsmBackend's dtor. Test Plan: ninja check && ninja clang-test Subscribers: aemerson Differential Revision: http://reviews.llvm.org/D5075 llvm-svn: 216912	2014-09-02 16:26:55 +00:00
David Majnemer	49428105aa	LICM: Don't crash when an instruction is used by an unreachable BB Summary: BBs might contain non-LCSSA'd values after the LCSSA pass is run if they are unreachable from the entry block. Normally, the users of the instruction would be PHIs but the unreachable BBs have normal users; rewrite their uses to be undef values. An alternative fix could involve fixing this at LCSSA but that would require this invariant to hold after subsequent transforms. If a BB created an unreachable block, they would be in violation of this. This fixes PR19798. Differential Revision: http://reviews.llvm.org/D5146 llvm-svn: 216911	2014-09-02 16:22:00 +00:00
Alexey Samsonov	729b12ede3	Fix left shifts of negative integers in AArch64 InstPrinter/Disassembler Summary: Left shift of negative integer is an undefined behavior, and is reported by UBSan. It's ok for imm values to be negative, so we can just replace left shifts with multiplications. Test Plan: check-llvm test suite Reviewers: t.p.northover Reviewed By: t.p.northover Subscribers: aemerson, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D5132 llvm-svn: 216910	2014-09-02 16:19:41 +00:00
Hal Finkel	e19006ea22	Enable splitting indexing from loads with TargetConstants When I recommitted r208640 (in r216898) I added an exclusion for TargetConstant offsets, as there is no guarantee that a backend can handle them on generic ADDs (even if it generates them during address-mode matching) -- and, specifically, applying this transformation directly with TargetConstants caused a self-hosting failure on PPC64. Ignoring all TargetConstants, however, is less than ideal. Instead, for non-opaque constants, we can convert them into regular constants for use with the generated ADD (or SUB). llvm-svn: 216908	2014-09-02 16:05:23 +00:00
Rafael Espindola	4dd3677b5f	Replace -use-init-array with -use-ctors. We have been using .init-array for most systems for quiet some time, but tools like llc are still defaulting to .ctors because the old option was never changed. This patch makes llc default to .init-array and changes the option to be -use-ctors. Clang is not affected by this. It has its own fancier logic. llvm-svn: 216905	2014-09-02 13:54:53 +00:00
Aaron Ballman	8ca53885fa	Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)). NFC. llvm-svn: 216902	2014-09-02 12:19:02 +00:00
David Xu	052b9d9282	Merge Extend and Shift into a UBFX llvm-svn: 216899	2014-09-02 09:33:56 +00:00
Hal Finkel	51e6fa2201	Revert "Revert '[DAGCombiner] Split up an indexed load if only the base pointer value is live'" I reverted r208640 in r209747 because r208640 broke self-hosting on PPC64. The underlying cause of the failure is that pre-inc loads with increments represented by ISD::TargetConstants were being transformed into ISD:::ADDs with ISD::TargetConstant operands. PPC doesn't have a pattern for those, and so they were selected as invalid r+r adds. This recommits r208640, rebased and with an exclusion for ISD::TargetConstant increments. This behavior seems correct, although in the future we might want to ask the target to split out the indexing that uses ISD::TargetConstants. Unfortunately, I don't yet have small test case where the relevant invalid 'add' instruction is not itself dead (and thus eliminated by DeadMachineInstructionElim -- sometimes bugpoint is too good at removing things) Original commit message (by Adam Nemet): Right now the load may not get DCE'd because of the side-effect of updating the base pointer. This can happen if we lower a read-modify-write of an illegal larger type (e.g. i48) such that the modification only affects one of the subparts (the lower i32 part but not the higher i16 part). See the testcase. In order to spot the dead load we need to revisit it when SimplifyDemandedBits decided that the value of the load is masked off. This is the CommitTargetLoweringOpt piece. I checked compile time with ARM64 by sending SPEC bitcode files through llc. No measurable change. Fixes <rdar://problem/16031651> llvm-svn: 216898	2014-09-02 06:24:04 +00:00
Hal Finkel	51b3fd1e28	[PowerPC] Guard against illegal selection of add for TargetConstant operands r208640 was reverted because it caused a self-hosting failure on ppc64. The underlying cause was the formation of ISD::ADD nodes with ISD::TargetConstant operands. Because we have no patterns for 'add' taking 'timm' nodes, these are selected as r+r add instructions (which is a miscompile). Guard against this kind of behavior in the future by making the backend crash should this occur (instead of silently generating invalid output). llvm-svn: 216897	2014-09-02 06:23:54 +00:00
Saleem Abdulrasool	d1a4ed6a7c	CodeGen: indicate Windows unwind data format The structures for Windows unwinding are shared across multiple platforms. Indicate the encoding to be used for the particular target. Use this to switch the unwind emitter instantiated by the AsmPrinter. llvm-svn: 216895	2014-09-01 23:48:39 +00:00
Saleem Abdulrasool	0fba7b5856	CodeGen: split out the Win64Exception emitter Move the Windows unwind information emitter into a separate header. This is not related to DWARF based emission. NFC. llvm-svn: 216894	2014-09-01 23:48:34 +00:00
Saleem Abdulrasool	d458091907	MC: remove unnecessary enumeration prefix This is an enum class, and will be appropriately prefixed, making the encoding type prefix redundant. No change to any uses as the use of this was not yet introduced. llvm-svn: 216893	2014-09-01 23:48:29 +00:00
David Majnemer	d4cffcf073	SROA: Don't insert instructions before a PHI SROA may decide that it needs to insert a bitcast and would set it's insertion point before a PHI. This will create an invalid module right quick. Instead, choose the first insertion point in the basic block that holds our PHI. This fixes PR20822. Differential Revision: http://reviews.llvm.org/D5141 llvm-svn: 216891	2014-09-01 21:20:14 +00:00
David Majnemer	d2df50196f	Revert "Revert two GEP-related InstCombine commits" This reverts commit r216698 which reverted r216523 and r216598. We would attempt to perform the transformation even if the match() failed because, as a side effect, it would set V. This would trick us into believing that we correctly found a place to correctly apply the transform. An additional test case was added to getelementptr.ll so that we might not regress in the future. llvm-svn: 216890	2014-09-01 21:10:02 +00:00
Sanjay Patel	601492a3e3	Use an integer constant for FABS / FNEG (x86). This change will ease refactoring LowerFABS() and LowerFNEG() since they have a lot of overlap. Remove the creation of a floating point constant from an integer because it's going to be used for a bitwise integer op anyway. No change to codegen expected, but the verbose comment string for asm output may change from float values to hex (integer), depending on whether the constant already exists or not. Differential Revision: http://reviews.llvm.org/D5052 llvm-svn: 216889	2014-09-01 19:01:47 +00:00
Sanjay Patel	5ad239e15a	Add a convenience method to copy wrapping, exact, and fast-math flags (NFC). The loop vectorizer preserves wrapping, exact, and fast-math properties of scalar instructions. This patch adds a convenience method to make that operation easier because we need to do this in the loop vectorizer, SLP vectorizer, and possibly other places. Although this is a 'no functional change' patch, I've added a testcase to verify that the exact flag is preserved by the loop vectorizer. The wrapping and fast-math flags are already checked in existing testcases. Differential Revision: http://reviews.llvm.org/D5138 llvm-svn: 216886	2014-09-01 18:44:57 +00:00
Jingyue Wu	263bab01f8	Fix a typo in comments in r216862, NFC PR20766 -> PR20776. Thanks Roman Divacky for the catch! llvm-svn: 216883	2014-09-01 14:55:04 +00:00
Tilmann Scheller	659dfb4dc8	[ARM] Add Thumb-2 code size optimization regression test for EOR. llvm-svn: 216881	2014-09-01 12:59:34 +00:00
Tilmann Scheller	2fc1adcaff	ARM] Add Thumb-2 code size optimization regression test for BIC. llvm-svn: 216880	2014-09-01 12:53:29 +00:00
Yuri Gorshenin	c107d147dc	[asan-assembly-instrumentation] Prologue and epilogue are moved out from InstrumentMemOperand(). Reviewers: eugenis Subscribers: llvm-commits Differential revision: http://reviews.llvm.org/D4923 llvm-svn: 216879	2014-09-01 12:51:00 +00:00
Renato Golin	92c816c68f	Thumb2 M-class MSR instruction support changes This patch implements a few changes related to the Thumb2 M-class MSR instruction: * better handling of unpredictable encodings, * recognition of the _g and _nzcvqg variants by the asm parser only if the DSP extension is available, preferred output of MSR APSR moves with the _<bits> suffix for v7-M. Patch by Petr Pavlu. llvm-svn: 216874	2014-09-01 11:25:07 +00:00
Patrik Hagglund	296acbfe4f	Fix in InlineSpiller to make the rematerilization loop also consider implicit uses of the whole register when a sub register is defined. Now the same iterator is used in the rematerilization loop as in the spill loop later. Patch provided by Mikael Holmen. This fix was proposed and reviewed by Quentin Colombet, http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-August/076135.html. Unfortunately, this error in the rematerilization code has only been seen in a large test case for an out-of-tree target, and is probably hard to reproduce on an in-tree target. Therefore, no testcase is provided. llvm-svn: 216873	2014-09-01 11:04:07 +00:00
Yuri Gorshenin	e2f01eb730	Revert "[asan-assembly-instrumentation] Prologue and epilogue are moved out from InstrumentMemOperand()." This reverts commit 895aa397038b8de86d83ac0997a70949a486e112. llvm-svn: 216872	2014-09-01 10:24:04 +00:00
Chandler Carruth	18cee1defc	Fix a really bad miscompile introduced in r216865 - the else-if logic chain became completely broken here as all intrinsic users ended up being skipped, and the ones that seemed to be singled out were actually the exact wrong set. This is a great example of why long else-if chains can be easily confusing. Switch the entire code to use early exits and early continues to have simpler (and more importantly, correct) logic here, as well as fixing the reversed logic for detecting and continuing on lifetime intrinsics. I've also significantly cleaned up the test case and added another test case demonstrating an example where the optimization is not (trivially) safe to perform. llvm-svn: 216871	2014-09-01 10:09:18 +00:00
Renato Golin	86a6c3f269	Small refactor on VectorizerHint for deduplication Previously, the hint mechanism relied on clean up passes to remove redundant metadata, which still showed up if running opt at low levels of optimization. That also has shown that multiple nodes of the same type, but with different values could still coexist, even if temporary, and cause confusion if the next pass got the wrong value. This patch makes sure that, if metadata already exists in a loop, the hint mechanism will never append a new node, but always replace the existing one. It also enhances the algorithm to cope with more metadata types in the future by just adding a new type, not a lot of code. Re-applying again due to MSVC 2013 being minimum requirement, and this patch having C++11 that MSVC 2012 didn't support. Fixes PR20655. llvm-svn: 216870	2014-09-01 10:00:17 +00:00
Yuri Gorshenin	506a170d63	[asan-assembly-instrumentation] Prologue and epilogue are moved out from InstrumentMemOperand(). llvm-svn: 216869	2014-09-01 09:56:45 +00:00
Hal Finkel	0c083024f0	Feed AA to the inliner and use AA->getModRefBehavior in AddAliasScopeMetadata This feeds AA through the IFI structure into the inliner so that AddAliasScopeMetadata can use AA->getModRefBehavior to figure out which functions only access their arguments (instead of just hard-coding some knowledge of memory intrinsics). Most of the information is only available from BasicAA; this is important for preserving alias scoping information for target-specific intrinsics when doing the noalias parameter attribute to metadata conversion. llvm-svn: 216866	2014-09-01 09:01:39 +00:00
Nick Lewycky	fc243d54d2	Ignore lifetime intrinsics in use list for MemCpyOptimizer. Patch by Luqman Aden, review by Hal Finkel. llvm-svn: 216865	2014-09-01 06:03:11 +00:00
Nick Lewycky	97756409ea	Remove an errant outer loop that contains nothing but an inner loop over exactly the same elements. While no functionality is change intended (and hence there are no changes to tests), you don't want to skip this revision if bisecting for errors. llvm-svn: 216864	2014-09-01 05:17:15 +00:00
Hal Finkel	cbb85f249e	Fix AddAliasScopeMetadata again - alias.scope must be a complete description I thought that I had fixed this problem in r216818, but I did not do a very good job. The underlying issue is that when we add alias.scope metadata we are asserting that this metadata completely describes the aliasing relationships within the current aliasing scope domain, and so in the context of translating noalias argument attributes, the pointers must all be based on noalias arguments (as underlying objects) and have no other kind of underlying object. In r216818 excluding appropriate accesses from getting alias.scope metadata is done by looking for underlying objects that are not identified function-local objects -- but that's wrong because allocas, etc. are also function-local objects and we need to explicitly check that all underlying objects are the noalias arguments for which we're adding metadata aliasing scopes. This fixes the underlying-object check for adding alias.scope metadata, and does some refactoring of the related capture-checking eligibility logic (and adds more comments; hopefully making everything a bit clearer). Fixes self-hosting on x86_64 with -mllvm -enable-noalias-to-md-conversion (the feature is still disabled by default). llvm-svn: 216863	2014-09-01 04:26:40 +00:00
Jingyue Wu	5208cc5dbe	[MachineSink] Use the real post dominator tree Summary: Fixes a FIXME in MachineSinking. Instead of using the simple heuristics in isPostDominatedBy, use the real MachinePostDominatorTree. The old heuristics caused instructions to sink unnecessarily, and might create register pressure. Test Plan: Added a NVPTX codegen test to verify that our change is in effect. It also shows the unnecessary register pressure caused by over-sinking. Updated affected tests in AArch64 and X86. Reviewers: eliben, meheff, Jiangning Reviewed By: Jiangning Subscribers: jholewinski, aemerson, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D4814 llvm-svn: 216862	2014-09-01 03:47:25 +00:00
David Blaikie	6a150a87ef	DebugInfo: Elide lexical scopes which only contain other (inline or lexical) scopes. DW_TAG_lexical_scopes inform debuggers about the instruction range for which a given variable (or imported declaration/module/etc) is valid. If the scope doesn't itself contain any such entities, it's a waste of space and should be omitted. We were correctly doing this for entirely empty leaves, but not for intermediate nodes. Reduces total (not just debug sections) .o file size for a bootstrap -gmlt LLVM by 22% and bootstrap -gmlt clang executable by 13%. The wins for a full -g build will be less as a % (and in absolute terms), but should still be substantial - with some of that win being fewer relocations, thus more substantiall reducing link times than fewer bytes alone would have. llvm-svn: 216861	2014-08-31 21:26:22 +00:00
Matt Arsenault	a8efaf96d4	Consider addrspaces in canLosslesslyBitCastTo() Make this conservatively correct and report false for different address spaces, which might require a nontrivial translation. Based on the few uses of this, I don't think this currently breaks anything. llvm-svn: 216846	2014-08-31 19:19:57 +00:00
David Blaikie	3fbf3b8546	DebugInfo: Move argument creation up into the caller that's unambiguously handling the subprogram scope (replacing a conditional with an assertion in the process) llvm-svn: 216845	2014-08-31 18:04:28 +00:00
David Blaikie	2812746238	Delay adding imported entity DIEs to the lexical scope, streamlining the check for "this scope has nothing in it" This makes the emptiness of the scope with regards to variables and nested scopes is the same as with regards to imported entities. Just check if we had nothing at all before we build the node. llvm-svn: 216840	2014-08-31 05:46:17 +00:00
David Blaikie	8912df12ff	Modify DwarfDebug::constructImportedEntityDIE to return rather than insert into the scope Another step towards improving lexical_scope handling llvm-svn: 216839	2014-08-31 05:41:15 +00:00
David Blaikie	e0e8a3baa0	Refactor constructImportedEntityDIE(DwarfUnit, DIImportedEntity) to return a DIE rather than inserting it into a specified context. First of many steps to improve lexical scope construction (to omit trivial lexical scopes - those without any direct variables). To that end it's easier not to create imported entities directly into the lexical scope node, but to build them, then add them if necessary. llvm-svn: 216838	2014-08-31 05:32:06 +00:00
David Blaikie	cd4b8a2560	Simplify expression using container's front() rather than begin()-> llvm-svn: 216833	2014-08-31 02:14:26 +00:00
David Blaikie	ee03eadc3f	Add some negative (and positive) static_assert checks for ArrayRef-of-pointer conversions introduced in r216709 llvm-svn: 216830	2014-08-31 01:33:41 +00:00
Josh Klontz	e1900dc920	[PATCH][Interpreter] Add missing FP intrinsic lowering. Summary: This extends the work done in [1], adding missing intrinsic lowering for floor, trunc, round and copysign. [1] http://comments.gmane.org/gmane.comp.compilers.llvm.cvs/199372 Test Plan: Extended `test/ExecutionEngine/Interpreter/intrinsics.ll` to test the additional missing intrinsics. All tests pass. Reviewers: dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5120 llvm-svn: 216827	2014-08-30 18:33:35 +00:00
Jordan Rose	88eb534517	Teach llvm-bcanalyzer to use one stream's BLOCKINFO to read another stream. This allows streams that only use BLOCKINFO for debugging purposes to omit the block entirely. As long as another stream is available with the correct BLOCKINFO, the first stream can still be analyzed and dumped. As part of this commit, BitstreamReader gets a move constructor and move assignment operator, as well as a takeBlockInfo method. llvm-svn: 216826	2014-08-30 17:07:55 +00:00
Craig Topper	fd38cbebda	Remove 'virtual' keyword from methods markedwith 'override' keyword. llvm-svn: 216823	2014-08-30 16:48:34 +00:00
Craig Topper	e3c88e1605	Use StringRef to avoid copies and simplify code. llvm-svn: 216822	2014-08-30 16:48:22 +00:00
Craig Topper	c9439cc8fc	Add a test for converting ArrayRef<T > to ArrayRef<const T >. llvm-svn: 216821	2014-08-30 16:48:19 +00:00
Craig Topper	6dc4a8bc2c	Fix some cases where StringRef was being passed by const reference. Remove const from some other StringRefs since its implicitly const already. llvm-svn: 216820	2014-08-30 16:48:02 +00:00
Brad Smith	e98cdf9b77	JIT support has been added awhile ago. llvm-svn: 216819	2014-08-30 14:52:34 +00:00
Hal Finkel	a3708df41a	Fix AddAliasScopeMetadata to not add scopes when deriving from unknown pointers The previous implementation of AddAliasScopeMetadata, which adds noalias metadata to preserve noalias parameter attribute information when inlining had a flaw: it would add alias.scope metadata to accesses which might have been derived from pointers other than noalias function parameters. This was incorrect because even some access known not to alias with all noalias function parameters could easily alias with an access derived from some other pointer. Instead, when deriving from some unknown pointer, we cannot add alias.scope metadata at all. This fixes a miscompile of the test-suite's tramp3d-v4. Furthermore, we cannot add alias.scope to functions unless we know they access only argument-derived pointers (currently, we know this only for memory intrinsics). Also, we fix a theoretical problem with using the NoCapture attribute to skip the capture check. This is incorrect (as explained in the comment added), but would not matter in any code generated by Clang because we get only inferred nocapture attributes in Clang-generated IR. This functionality is not yet enabled by default. llvm-svn: 216818	2014-08-30 12:48:33 +00:00
David Majnemer	492e612e01	InstCombine: Respect recursion depth in visitUDivOperand llvm-svn: 216817	2014-08-30 09:19:05 +00:00
David Majnemer	5e96f1b4c8	InstCombine: Try harder to combine icmp instructions consider: (and (icmp X, Y), (and Z, (icmp A, B))) It may be possible to combine (icmp X, Y) with (icmp A, B). If we successfully combine, create an 'and' instruction with Z. This fixes PR20814. N.B. There is room for improvement after this change but I'm not convinced it's worth chasing yet. llvm-svn: 216814	2014-08-30 06:18:20 +00:00
Juergen Ributzka	25816b0fdd	Revert r216805 "[MachineCombiner][AArch64] Use the correct register class for MADD, SUB, and OR." I think this broke the build bot. Reverting it for now until I have time to take a closer look. llvm-svn: 216813	2014-08-30 06:16:26 +00:00
Sean Callanan	8c51173d83	Fixed a build problem when there were headers for a different LLVM present in the system header lookup path. llvm-svn: 216812	2014-08-30 02:30:02 +00:00
Nick Kledzik	38637fb5b7	Add missing const to StringRef.copy() llvm-svn: 216811	2014-08-30 02:29:49 +00:00
Nick Kledzik	1b591bd289	Fix typo and formatting llvm-svn: 216809	2014-08-30 01:57:34 +00:00
Nick Kledzik	d04bc35852	Object/llvm-objdump: allow dumping of mach-o exports trie MachOObjectFile in lib/Object currently has no support for parsing the rebase, binding, and export information from the LC_DYLD_INFO load command in final linked mach-o images. This patch adds support for parsing the exports trie data structure. It also adds an option to llvm-objdump to dump that export info. I did the exports parsing first because it is the hardest. The information is encoded in a trie structure, but the standard ObjectFile way to inspect content is through iterators. So I needed to make an iterator that would do a non-recursive walk through the trie and maintain the concatenation of edges needed for the current string prefix. I plan to add similar support in MachOObjectFile and llvm-objdump to parse/display the rebasing and binding info too. llvm-svn: 216808	2014-08-30 00:20:14 +00:00
Juergen Ributzka	3e7f88c169	[MachineCombiner][AArch64] Use the correct register class for MADD, SUB, and OR. Select the correct register class for the various instructions that are generated when combining instructions and constrain the registers to the appropriate register class. This fixes rdar://problem/18183707. llvm-svn: 216805	2014-08-29 23:48:09 +00:00
Juergen Ributzka	c5c1c6090f	[FastISel][AArch64] Use the correct register class for branches. Also constrain the register class for branches. This fixes rdar://problem/18181496. llvm-svn: 216804	2014-08-29 23:48:06 +00:00
Juergen Ributzka	00d78221ab	[MachineSinking] Clear kill flag of all operands at all their uses. When sinking an instruction it might be moved past the original last use of one of its operands. This last use has the kill flag set and the verifier will obviously complain about this. Before Machine Sinking (AArch64): %vreg3<def> = ASRVXr %vreg1, %vreg2<kill> %XZR<def> = SUBSXrs %vreg4, %vreg1<kill>, 160, %NZCV<imp-def> ... After Machine Sinking: %XZR<def> = SUBSXrs %vreg4, %vreg1<kill>, 160, %NZCV<imp-def> ... %vreg3<def> = ASRVXr %vreg1, %vreg2<kill> This fix clears all the kill flags in all instruction that use the same operands as the instruction that is being sunk. This fixes rdar://problem/18180996. llvm-svn: 216803	2014-08-29 23:48:03 +00:00
Lang Hames	e1287c01be	[MCJIT] Move endian-aware read/writes from RuntimeDyldMachO into RuntimeDyldImpl. These are platform independent, and moving them to the base class allows RuntimeDyldChecker to use them too. llvm-svn: 216801	2014-08-29 23:17:47 +00:00
Adrian Prantl	daedfda892	Debug info: Add a new explicit DIDescriptor flag for the "public" access specifier and change the default behavior to only emit the DW_AT_accessibility(public) attribute when the isPublic() is explicitly set. rdar://problem/18154959 llvm-svn: 216799	2014-08-29 22:44:07 +00:00
Jean-Luc Duprat	97bfbb84eb	Comment only: Annotate loop as per mailing list discussion llvm-svn: 216798	2014-08-29 22:43:30 +00:00
Alexey Samsonov	700964ea00	Make isValidMCLOHType take unsigned instead of enum to avoid loading invalid enum values llvm-svn: 216797	2014-08-29 22:34:28 +00:00
Kevin Enderby	956366c6f1	Next bit of support for llvm-objdump’s -private-headers for Mach-O files. This adds the printing of the LC_SEGMENT load command and sections, LC_SYMTAB and LC_DYSYMTAB load commands. llvm-svn: 216795	2014-08-29 22:30:52 +00:00
Reid Kleckner	39ad7c9812	AArch64: Silence -Wabsolute-value warning with std::abs llvm-svn: 216794	2014-08-29 22:14:26 +00:00
Reid Kleckner	d70ab41a4f	Speculative build fix for const, gcc, and ArrayRef overloads llvm-svn: 216793	2014-08-29 22:12:08 +00:00
David Blaikie	6ba88e0f08	Revert accidentally committed patches r216787-r216789 Rushed when I realized I hadn't committed the FreeDeleter for a clang change I'd committed, and didn't check that I had things lying around in my client. Apologies for the noise. llvm-svn: 216792	2014-08-29 22:10:52 +00:00
David Blaikie	cdbe51b543	Add a trivial functor for use with unique_ptrs managing memory that needs to be freed rather than deleted. llvm-svn: 216790	2014-08-29 22:05:31 +00:00
David Blaikie	a19833d8f7	Omit DW_AT_artificial, DW_AT_external, and similar attributes under -gmlt llvm-svn: 216789	2014-08-29 22:05:29 +00:00
David Blaikie	85d8bcd000	Omit dwarf::DW_AT_frame_base under -gmlt llvm-svn: 216788	2014-08-29 22:05:27 +00:00
David Blaikie	71fffa950a	Stuff llvm-svn: 216787	2014-08-29 22:05:26 +00:00
Robin Morisset	039781ef26	Fix typos in comments, NFC Summary: Just fixing comments, no functional change. Test Plan: N/A Reviewers: jfb Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D5130 llvm-svn: 216784	2014-08-29 21:53:01 +00:00
Reid Kleckner	dccd0cbec3	Add a const and munge some comments llvm-svn: 216781	2014-08-29 21:42:21 +00:00
Reid Kleckner	16e5541211	musttail: Forward regparms of variadic functions on x86_64 Summary: If a variadic function body contains a musttail call, then we copy all of the remaining register parameters into virtual registers in the function prologue. We track the virtual registers through the function body, and add them as additional registers to pass to the call. Because this is all done in virtual registers, the register allocator usually gives us good code. If the function does a call, however, it will have to spill and reload all argument registers (ew). Forwarding regparms on x86_32 is not implemented because most compilers don't support varargs in 32-bit with regparms. Reviewers: majnemer Subscribers: aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D5060 llvm-svn: 216780	2014-08-29 21:42:08 +00:00
Reid Kleckner	329d4a2b29	Verifier: Don't reject varargs callee cleanup functions We've rejected these kinds of functions since r28405 in 2006 because it's impossible to lower the return of a callee cleanup varargs function. However there are lots of legal ways to leave such a function without returning, such as aborting. Today we can leave a function with a musttail call to another function with the correct prototype, and everything works out. I'm removing the verifier check declaring that a normal return from such a function is UB. Reviewed By: nlewycky Differential Revision: http://reviews.llvm.org/D5059 llvm-svn: 216779	2014-08-29 21:25:28 +00:00
Louis Gerbarg	03c627e8a7	Remove spurious mask operations from AArch64 add->compares on 16 and 8 bit values This patch checks for DAG patterns that are an add or a sub followed by a compare on 16 and 8 bit inputs. Since AArch64 does not support those types natively they are legalized into 32 bit values, which means that mask operations are inserted into the DAG to emulate overflow behaviour. In many cases those masks do not change the result of the processing and just introduce a dependent operation, often in the middle of a hot loop. This patch detects the relevent DAG patterns and then tests to see if the transforms are equivalent with and without the mask, removing the mask if possible. The exact mechanism of this patch was discusses in http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-July/074444.html There is a reasonably good chance there are missed oppurtunities due to similiar (but not identical) DAG patterns that could be funneled into this test, adding them should be simple if we see test cases. Tests included. rdar://13754426 llvm-svn: 216776	2014-08-29 21:00:22 +00:00
Reid Kleckner	ab99e24e94	X86: Fix conflict over ESI between base register and rep;movsl The new solution is to not use this lowering if there are any dynamic allocas in the current function. We know up front if there are dynamic allocas, but we don't know if we'll need to create stack temporaries with large alignment during lowering. Conservatively assume that we will need such temporaries. Reviewed By: hans Differential Revision: http://reviews.llvm.org/D5128 llvm-svn: 216775	2014-08-29 20:50:31 +00:00
Sanjay Patel	26583e009d	another typo llvm-svn: 216774	2014-08-29 20:35:00 +00:00
Sanjay Patel	85671b9a6e	typo llvm-svn: 216773	2014-08-29 20:34:17 +00:00
Robin Morisset	163ef0402a	Relax the constraint more in MemoryDependencyAnalysis.cpp Even loads/stores that have a stronger ordering than monotonic can be safe. The rule is no release-acquire pair on the path from the QueryInst, assuming that the QueryInst is not atomic itself. llvm-svn: 216771	2014-08-29 20:32:58 +00:00
Robin Morisset	5ce0ce4430	[X86] Refactor X86ISelDAGToDAG::SelectAtomicLoadArith - NFC Summary: Mostly renaming the (not very explicit) variables Tmp0, .. Tmp4, and grouping related statements together, along with a few lines of comments for the surprising parts. No functional change intended. Test Plan: make check-all Reviewers: jfb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5088 llvm-svn: 216768	2014-08-29 20:19:23 +00:00
Nick Kledzik	ffe5106fe6	Add missing mach-o EXPORT_SYMBOL_FLAG_KIND_ABSOLUTE llvm-svn: 216759	2014-08-29 19:55:55 +00:00
Jean-Luc Duprat	6d7b456184	Tablegen fixes for new syntax when initializing bits from variables. Followup to r215086. llvm-svn: 216757	2014-08-29 19:41:04 +00:00
Juergen Ributzka	f6ee7a7cdd	[FastISel][AArch64] Fix an incorrect kill flag due to a bug in SelectTrunc. When we select a trunc instruction we don't emit any code if the type is already i32 or smaller. This is because the instruction that uses the truncated value will deal with it. This behavior can incorrectly transfer a kill flag, which was meant for the result of the truncate, onto the source register. %2 = trunc i32 %1 to i16 ... = ... %2 -> ... = ... vreg1 <kill> ... = ... %1 ... = ... vreg1 This commit fixes this by emitting a COPY instruction, so that the result and source register are distinct virtual registers. This fixes rdar://problem/18178188. llvm-svn: 216750	2014-08-29 17:58:16 +00:00
Tilmann Scheller	09366de7a3	[ARM] Add Thumb-2 code size optimization test for ASR (register). llvm-svn: 216746	2014-08-29 17:19:00 +00:00
Tilmann Scheller	b3854015db	[ARM] Add Thumb-2 code size optimization test for ASR (immediate). llvm-svn: 216744	2014-08-29 17:02:28 +00:00
Hal Finkel	2d3d6da44b	Fix a typo in AddAliasScopeMetadata llvm-svn: 216741	2014-08-29 16:33:41 +00:00
Matt Arsenault	85cbc7e371	Make fabs safe to speculatively execute llvm-svn: 216736	2014-08-29 16:01:17 +00:00
Matt Arsenault	8675db15da	R600/SI: Use mad for fsub + fmul We can use a negate source modifier to match this for fsub. llvm-svn: 216735	2014-08-29 16:01:14 +00:00
Tim Northover	3c0915e858	AArch64: only try to get operand of a known node. A bug in r216725 meant we tried to discover the type of a SETCC before confirming the node actually was a SETCC. llvm-svn: 216734	2014-08-29 15:34:58 +00:00
Frederic Riss	aaae87e5ab	Remove unnecessary regex in test pattern per dblaikie suggestion. llvm-svn: 216733	2014-08-29 15:32:15 +00:00
Sanjay Patel	a065eb44aa	typo llvm-svn: 216732	2014-08-29 15:32:09 +00:00
Jingyue Wu	cb83a155c1	[NVPTX] Make the alignment an explicit argument to ldu/ldg Summary: Instead of specifying the alignment as metadata which may be destroyed by transformation passes, make the alignment the second argument to ldu/ldg intrinsic calls. Test Plan: ldu-ldg.ll ldu-i8.ll ldu-reg-plus-offset.ll Reviewers: eliben, meheff, jholewinski Reviewed By: meheff, jholewinski Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D5093 llvm-svn: 216731	2014-08-29 15:30:20 +00:00
Tilmann Scheller	f28e7e7d34	[ARM] Make Thumb-2 code size optimization test more strict. llvm-svn: 216729	2014-08-29 15:13:35 +00:00
Tilmann Scheller	e8e577833a	[ARM] Add a first test for the Thumb-2 code size optimization pass. While working on a Thumb-2 code size optimization I just realized that we don't have any regression tests for it. So here's a first test case, I plan to increase the coverage over time. llvm-svn: 216728	2014-08-29 15:04:40 +00:00
Tim Northover	c1c05aeb5d	AArch64: skip select/setcc combine in complex case. In an llvm-stress generated test, we were trying to create a v0iN type and asserting when that failed. This case could probably be handled by the function, but not without added complexity and the situation it arises in is sufficiently odd that there's probably no benefit anyway. Should fix PR20775. llvm-svn: 216725	2014-08-29 13:05:18 +00:00
Arnaud A. de Grandmaison	6afbf2aa5e	[AArch64] FPLoadBalancing: move ownership of the chain to its current accumulator register and forget about the previously used accumulator. Coming up with a simple testcase is not easy, as this highly depends on what the register allocator is doing: this issue showed up while working with the PBQP allocator, which produced a different allocation scheme. A testcase would need to come up with chain starting in D[0-7], then moving to D[8-15], followed by a call to a function whose regmask clobbers the starting accumulator in D[0-7], then another use of the chain. Fixed some formatting, added some invariant checks while there. llvm-svn: 216721	2014-08-29 09:54:11 +00:00
Frederic Riss	5732301afb	Use DwarfDebug::attachLowHighPC for the compilation unit DIE. llvm-svn: 216719	2014-08-29 09:00:26 +00:00
Robert Khasanov	a651a62340	[SKX] Enable lowering of integer CMP operations. Added new types to Legalizer. Fixed getSetCCResultType function Added lowering tests. Reviewed by Elena Demikhovsky. llvm-svn: 216717	2014-08-29 08:46:04 +00:00
Job Noorman	9b31bd6bb0	Do not assume the value passed to memset is an i32. The code in SelectionDAG::getMemset for some reason assumes the value passed to memset is an i32. This breaks the generated code for targets that only have registers smaller than 32 bits because the value might get split into multiple registers by the calling convention. See the test for the MSP430 target included in the patch for an example. This patch ensures that nothing is assumed about the type of the value. Instead, the type is taken from the selected overload of the llvm.memset intrinsic. llvm-svn: 216716	2014-08-29 08:23:53 +00:00
Craig Topper	7e24c0a89c	Add conversion constructor to convert ArrayRef<T> to ArrayRef<const T>. Reviewed with Chandler and David Blaikie. llvm-svn: 216709	2014-08-29 06:01:43 +00:00
Jiangning Liu	08f4cda2ec	[AArch64] Fix some failures exposed by value type v4f16 and v8f16. 1) Add some missing bitcast patterns for v8f16. 2) Add type promotion for operand of ld/st operations. llvm-svn: 216706	2014-08-29 01:31:42 +00:00
Chris Bieneman	b1cd51e33c	Cleaning up static initializers in Signals.inc Reviewed by: Chandlerc llvm-svn: 216704	2014-08-29 01:05:16 +00:00
Chris Bieneman	5e7f44c25e	Cleaning up static initializers in TimeValue. Code reviewed by Chandlerc llvm-svn: 216703	2014-08-29 01:05:12 +00:00
Alexey Samsonov	4ee2675dfe	Introduce -DLLVM_USE_SANITIZER=Undefined CMake option to build UBSan-ified version of LLVM/Clang. I've fixed most of the simple bugs and currently "check-llvm" test suite has 26 failures, and "check-clang" suite has 5 failures. llvm-svn: 216701	2014-08-29 00:50:36 +00:00
Juergen Ributzka	77bc09f5ab	[FastISel][AArch64] Don't fold instructions that are not in the same basic block. This fix checks first if the instruction to be folded (e.g. sign-/zero-extend, or shift) is in the same machine basic block as the instruction we are folding into. Not doing so can result in incorrect code, because the value might not be live-out of the basic block, where the value is defined. This fixes rdar://problem/18169495. llvm-svn: 216700	2014-08-29 00:19:21 +00:00
David Majnemer	400e725bde	Revert two GEP-related InstCombine commits This reverts commit r216523 and r216598; people have reported regressions. llvm-svn: 216698	2014-08-29 00:06:43 +00:00
Reid Kleckner	febb279c9c	Don't promote byval pointer arguments when padding matters Don't promote byval pointer arguments when when their size in bits is not equal to their alloc size in bits. This can happen for x86_fp80, where the size in bits is 80 but the alloca size in bits in 128. Promoting these types can break passing unions of x86_fp80s and other types. Patch by Thomas Jablin! Reviewed By: rnk Differential Revision: http://reviews.llvm.org/D5057 llvm-svn: 216693	2014-08-28 22:42:00 +00:00
Jim Grosbach	ec2b0d0b11	AArch64: More correctly constrain target vector extend lowering. The AArch64 target lowering for [zs]ext of vectors is set up to handle input simple types and expects the generic SDag path to do something reasonable with anything that's not a simple type. The code, however, was only checking that the result type was a simple type and assuming that implied that the source type would also be a simple type. That's not a valid assumption, as operations like "zext <1 x i1> %0 to <1 x i32>" demonstrate. The fix is to simply explicitly validate the source type as well as the result type. PR20791 llvm-svn: 216689	2014-08-28 22:08:28 +00:00
Sanjay Patel	ccd267683b	Move FNEG next to FABS and make them more similar, so it's easier that they can be refactored. NFC. llvm-svn: 216688	2014-08-28 21:51:37 +00:00
Rafael Espindola	b43d51de95	On MachO, don't put non-private constants in mergeable sections. On MachO, putting a symbol that doesn't start with a 'L' or 'l' in one of the __TEXT,__literal* sections prevents the linker from merging the context of the section. Since private GVs are the ones the get mangled to start with 'L' or 'l', we now only put those on the __TEXT,__literal* sections. llvm-svn: 216682	2014-08-28 20:13:31 +00:00
Frederic Riss	9e32475f18	Constify MCSymbol* parameters to DwarfDebug::attachLowHighPC. llvm-svn: 216681	2014-08-28 19:09:29 +00:00
Sanjay Patel	81ecbb0737	Fix a logic bug in x86 vector codegen: sext (zext (x) ) != sext (x) (PR20472). Remove a block of code from LowerSIGN_EXTEND_INREG() that was added with: http://llvm.org/viewvc/llvm-project?view=revision&revision=177421 And caused: http://llvm.org/bugs/show_bug.cgi?id=20472 (more analysis here) http://llvm.org/bugs/show_bug.cgi?id=18054 The testcases confirm that we (1) don't remove a zext op that is necessary and (2) generate a pmovz instead of punpck if SSE4.1 is available. Although pmovz is 1 byte longer, it allows folding of the load, and so saves 3 bytes overall. Differential Revision: http://reviews.llvm.org/D4909 llvm-svn: 216679	2014-08-28 18:59:22 +00:00
Owen Anderson	3eb910b404	Do not introduce new shuffle patterns after operation legalization if SHUFFLE_VECTOR was marked custom. The target independent DAG combine has no way to know if the shuffles it is introducing are ones that the target could support or not. llvm-svn: 216678	2014-08-28 17:49:58 +00:00
Sanjay Patel	50cbfc5138	Janitorial services: "Don’t duplicate function or class name at the beginning of the comment." llvm-svn: 216674	2014-08-28 16:29:51 +00:00
Sanjay Patel	e28d57d9d8	Remove local TLI vars that are just duplicates of the class var. No functional change. llvm-svn: 216673	2014-08-28 16:01:50 +00:00
Sanjay Patel	78614bf0e8	Use local vars to improve readability. No functional change. Completes what was started in r216611 and r216623. Used const refs instead of pointers; not sure if one is preferable to the other. llvm-svn: 216672	2014-08-28 15:53:16 +00:00
Sid Manning	67a8936a84	Minor spelling correction. Reviewers: adasgupt, jverma, sidneym Differential Revision: http://reviews.llvm.org/D5025 llvm-svn: 216667	2014-08-28 14:16:32 +00:00
Aaron Ballman	a4aa0d7cc0	Silence a -Wsign-compare warning. NFC. llvm-svn: 216666	2014-08-28 13:23:26 +00:00
Arnaud A. de Grandmaison	efd4363172	[PBQP] Only output debug information when requested llvm-svn: 216660	2014-08-28 10:15:47 +00:00
David Majnemer	074052b623	InstCombine: Remove redundant combines InstSimplify already handles icmp (X+Y), X (and things like it) appropriately. The first thing that InstCombine does is run InstSimplify on the instruction. llvm-svn: 216659	2014-08-28 10:08:37 +00:00
Erik Eckstein	8354cfaf95	Fix: SLPVectorizer tried to move an instruction which was replaced by a vector instruction. For a detailed description of the problem see the comment in the test file. The problematic moveBefore() calls are not required anymore because the new scheduling algorithm ensures a correct ordering anyway. llvm-svn: 216656	2014-08-28 07:04:02 +00:00
David Xu	ee978203e6	Generate CMN when comparing a short int with minus llvm-svn: 216651	2014-08-28 04:59:53 +00:00
Justin Hibbits	3476db4220	Test commit. Fix whitespace from a previous patch of mine. llvm-svn: 216650	2014-08-28 04:40:55 +00:00
Lang Hames	c5cafbb074	[MCJIT] Fix format specifiers for debug output in RuntimeDyld. More work on http://llvm.org/PR20640 llvm-svn: 216648	2014-08-28 04:25:17 +00:00
David Majnemer	9ab5ff1c5b	MC: Don't crash when the COFF section limit is reached I've decided not to commit a test, it takes 2.5 seconds to run on my an incredibly strong machine. llvm-svn: 216647	2014-08-28 04:02:50 +00:00
Chandler Carruth	c01ce6bc01	[x86] Fix whitespace and formatting around this function with clang-format, no functionality changed. llvm-svn: 216646	2014-08-28 04:00:24 +00:00
Chandler Carruth	cb07a4adf3	[x86] Hoist conditions from every single if in this routine to a single early exit. And factor the subsequent cast<> from all but one block into a single variable. No functionality changed. llvm-svn: 216645	2014-08-28 03:57:13 +00:00
Chandler Carruth	974aa336b1	[x86] Inline an SSE4 helper function for INSERT_VECTOR_ELT lowering, no functionality changed. Separating this into two functions wasn't helping. There was a decent amount of boilerplate duplicated, and some subsequent refactorings here will pull even more common code out. llvm-svn: 216644	2014-08-28 03:52:45 +00:00
Chandler Carruth	5260c0894c	[x86] Clean up some tests to use FileCheck and combine two into a single file. Changing code that is covered by these tests is just too hard to debug currently, and now it will be clear the nature of the changes. llvm-svn: 216643	2014-08-28 03:41:28 +00:00
David Majnemer	76d06bc613	InstSimplify: Move a transform from InstCombine to InstSimplify Several combines involving icmp (shl C2, %X) C1 can be simplified without introducing any new instructions. Move them to InstSimplify; while we are at it, make them more powerful. llvm-svn: 216642	2014-08-28 03:34:28 +00:00
Juergen Ributzka	31328168bb	[FastISel] Undo phi node updates when falling-back to SelectionDAG. The included test case would fail, because the MI PHI node would have two operands from the same predecessor. This problem occurs when a switch instruction couldn't be selected. This happens always, because there is no default switch support for FastISel to begin with. The problem was that FastISel would first add the operand to the PHI nodes and then fall-back to SelectionDAG, which would then in turn add the same operands to the PHI nodes again. This fix removes these duplicate PHI node operands by reseting the PHINodesToUpdate to its original state before FastISel tried to select the instruction. This fixes <rdar://problem/18155224>. llvm-svn: 216640	2014-08-28 02:06:55 +00:00
Juergen Ributzka	4f1a54a41a	[FastISel] Currently instructions are folded very aggressively for AArch64 into the memory operation, which can lead to the use of killed operands: %vreg1<def> = ADDXri %vreg0<kill>, 2 %vreg2<def> = LDRBBui %vreg0, 2 ... = ... %vreg1 ... This usually happens when the result is also used by another non-memory instruction in the same basic block, or any instruction in another basic block. This fix teaches hasTrivialKill to not only check the LLVM IR that the value has a single use, but also to check if the register that represents that value has already been used. This can happen when the instruction with the use was folded into another instruction (in this particular case a load instruction). This fixes rdar://problem/18142857. llvm-svn: 216634	2014-08-28 00:09:46 +00:00
Juergen Ributzka	843f14f411	Revert "[FastISel][AArch64] Don't fold instructions too aggressively into the memory operation." Quentin pointed out that this is not the correct approach and there is a better and easier solution. llvm-svn: 216632	2014-08-27 23:09:40 +00:00
Alexey Samsonov	a8d2f819ad	Fix unaligned reads/writes in X86JIT and RuntimeDyldELF. Summary: Introduce support::ulittleX_t::ref type to Support/Endian.h and use it in x86 JIT to enforce correct endianness and fix unaligned accesses. Test Plan: regression test suite Reviewers: lhames Subscribers: ributzka, llvm-commits Differential Revision: http://reviews.llvm.org/D5011 llvm-svn: 216631	2014-08-27 23:06:08 +00:00
Juergen Ributzka	ad8beabe38	[FastISel][AArch64] Don't fold instructions too aggressively into the memory operation. Currently instructions are folded very aggressively into the memory operation, which can lead to the use of killed operands: %vreg1<def> = ADDXri %vreg0<kill>, 2 %vreg2<def> = LDRBBui %vreg0, 2 ... = ... %vreg1 ... This usually happens when the result is also used by another non-memory instruction in the same basic block, or any instruction in another basic block. If the computed address is used by only memory operations in the same basic block, then it is safe to fold them. This is because all memory operations will fold the address computation and the original computation will never be emitted. This fixes rdar://problem/18142857. llvm-svn: 216629	2014-08-27 22:52:33 +00:00
Renato Golin	d352137553	Avoid zero length memset error Adding a check on buffer lenght to avoid a __warn_memset_zero_len warning on GCC 4.8.2. llvm-svn: 216624	2014-08-27 21:58:56 +00:00
Sanjay Patel	159f127f63	Use local variable in visitFADD. No functional change. llvm-svn: 216623	2014-08-27 21:42:42 +00:00
Juergen Ributzka	56b4b33190	[FastISel][AArch64] Fix a comment in my previous commit (r216617). llvm-svn: 216622	2014-08-27 21:40:50 +00:00
Juergen Ributzka	3c1b286152	[FastISel][AArch64] Fix simplify address when the address comes from a shift. When the address comes directly from a shift instruction then the address computation cannot be folded into the memory instruction, because the zero register is not available as a base register. Simplify addess needs to emit the shift instruction and use the result as base register. llvm-svn: 216621	2014-08-27 21:38:33 +00:00
Rafael Espindola	cdb871d734	Fix a double free in llvm::getBitcodeTargetTriple. Unfortunately this is only used by ld64, so no testcase, but should fix the darwin LTO bootstrap. llvm-svn: 216618	2014-08-27 21:11:13 +00:00
Juergen Ributzka	100a9b7fda	[FastISel][AArch64] Use the zero register for stores. Use the zero register directly when possible to avoid an unnecessary register copy and a wasted register at -O0. This also uses integer stores to store a positive floating-point zero. This saves us from materializing the positive zero in a register and then storing it. llvm-svn: 216617	2014-08-27 21:04:52 +00:00
Sanjay Patel	ae402a35a0	Group unsafe-math optimizations for fsub into one block. No functional change. llvm-svn: 216616	2014-08-27 20:57:52 +00:00
Juergen Ributzka	833bc681e3	[FastISel] Fix a potential bug in FastEmitInst_ri FastEmitInst_ri was constraining the first operand without checking if it is a virtual register. Use constrainOperandRegClass as all the other FastEmitInst_* functions. llvm-svn: 216613	2014-08-27 20:47:33 +00:00
Sanjay Patel	a828f2ba46	Use local variable to improve readability. No functional change intended. llvm-svn: 216611	2014-08-27 20:40:31 +00:00
Sanjay Patel	1d23bac843	typo in comment llvm-svn: 216609	2014-08-27 20:27:05 +00:00
Rafael Espindola	eeec8e63c0	Don't create a MemoryBuffer just to get the MemoryBufferRef. NFC. llvm-svn: 216608	2014-08-27 20:25:55 +00:00
David Blaikie	dfbe3d6b17	Convert a few more cases of direct intialization of unique_ptrs from MemoryBuffer::getMemBuffer to move initialization now that it returns by unique_ptr instead of raw pointer. Cleanup/improvements following r216583. llvm-svn: 216605	2014-08-27 20:14:18 +00:00
Reid Kleckner	7b7a599ac5	X86 MC: Handle instructions like fxsave that match multiple operand sizes Instructions like 'fxsave' and control flow instructions like 'jne' match any operand size. The loop I added to the Intel syntax matcher assumed that using a different size would give a different instruction. Now it handles the case where we get the same instruction for different memory operand sizes. This also allows us to remove the hack we had for unsized absolute memory operands, because we can successfully match things like 'jnz' without reporting ambiguity. Removing this hack uncovered test case involving 'fadd' that was ambiguous. The memory operand could have been single or double precision. llvm-svn: 216604	2014-08-27 20:10:38 +00:00
David Majnemer	22ccfc4484	InstCombine: Combine gep X, (Y-X) to Y We try to perform this transform in InstSimplify but we aren't always able to. Sometimes, we need to insert a bitcast if X and Y don't have the same time. llvm-svn: 216598	2014-08-27 20:08:37 +00:00
David Majnemer	11ca2971e8	InstSimplify: Don't simplify gep X, (Y-X) to Y if types differ It's incorrect to perform this simplification if the types differ. A bitcast would need to be inserted for this to work. This fixes PR20771. llvm-svn: 216597	2014-08-27 20:08:34 +00:00
Nico Weber	48c82400ed	Reland r216439 215441, majnemer has a real fix for PR20771. llvm-svn: 216586	2014-08-27 20:06:19 +00:00
Rafael Espindola	3560ff2c1f	Return a std::unique_ptr when creating a new MemoryBuffer. llvm-svn: 216583	2014-08-27 20:03:13 +00:00
Nico Weber	7b343e3cc6	Revert r216439 (and r216441, else the former doesn't revert cleanly). It caused PR 20771. I'll land a test on the clang side. llvm-svn: 216582	2014-08-27 20:00:13 +00:00
Rafael Espindola	9eef18c58c	Remove unused argument. llvm-svn: 216580	2014-08-27 19:49:03 +00:00
Alexey Samsonov	a253bf9678	Use BitVector instead of int in R600 SIISelLowering. int may not have enough bits in it, which was detected by UBSan bootstrap (it reported left shift by a too large constant). llvm-svn: 216579	2014-08-27 19:36:53 +00:00
Rafael Espindola	68669e3a7b	yaml::Stream doesn't need to take ownership of the buffer. In fact, most users were already using the StringRef version. llvm-svn: 216575	2014-08-27 19:03:22 +00:00
Zachary Turner	671355e099	Fix some semantic usability issues with DynamicLibrary. This patch allows invalid DynamicLibrary instances to be constructed, and fixes the const-correctness of the isValid() method. No functional change. llvm-svn: 216571	2014-08-27 18:13:25 +00:00
David Majnemer	d6d1671c1e	InstSimplify: Compute comparison ranges for left shift instructions 'shl nuw CI, x' produces [CI, CI << CLZ(CI)] 'shl nsw CI, x' produces [CI << CLO(CI)-1, CI] if CI is negative 'shl nsw CI, x' produces [CI, CI << CLZ(CI)-1] if CI is non-negative llvm-svn: 216570	2014-08-27 18:03:46 +00:00
Zachary Turner	74a46c24f3	Revert "Limit the symbol search in DynamicLibrary to the module that was opened." This reverts commit r216563, which breaks lli's dynamic symbol resolution. llvm-svn: 216569	2014-08-27 17:51:43 +00:00
Lang Hames	0717c3de02	[MCJIT] Replace a C-style cast in RuntimeDyldImpl.h. llvm-svn: 216568	2014-08-27 17:48:07 +00:00
Lang Hames	dc77feb57d	[MCJIT] More endianness fixes for RuntimeDyldMachO. http://llvm.org/PR20640 llvm-svn: 216567	2014-08-27 17:41:06 +00:00
Zachary Turner	0611d01419	Limit the symbol search in DynamicLibrary to the module that was opened. Differential Revision: http://reviews.llvm.org/D5030 Reviewed By: Reid Kleckner, Rafael Espindola llvm-svn: 216563	2014-08-27 17:06:22 +00:00
Oliver Stannard	89d1542840	Teach the AArch64 backend about v4f16 and v8f16 This teaches the AArch64 backend to deal with the operations required to deal with the operations on v4f16 and v8f16 which are exposed by NEON intrinsics, plus the add, sub, mul and div operations. llvm-svn: 216555	2014-08-27 16:16:04 +00:00
Michael Zolotukhin	5dc466b863	[SLP] Re-enable vectorization of GEP expressions (re-apply r210342 with a fix). llvm-svn: 216549	2014-08-27 15:01:18 +00:00
Evgeniy Stepanov	5050553ab8	Clang-format over X86AsmInstrumentation.* with LLVM style. r216536 mistakenly used -style=Google instead of LLVM. llvm-svn: 216543	2014-08-27 13:11:55 +00:00
Benjamin Kramer	870d951bda	Add an explicit cast to pacify implicit boolean conversion warnings. llvm-svn: 216539	2014-08-27 11:47:52 +00:00
Chandler Carruth	a5a8a9adc8	[x86] Fix a regression introduced with r213897 for 32-bit targets where we stopped efficiently lowering sextload using the SSE41 instructions for that operation. This is a consequence of a bad predicate I used thinking of the memory access needs. The code actually handles the cases where the predicate doesn't apply, and handles them much better. =] Simple fix and a test case added. Fixes PR20767. llvm-svn: 216538	2014-08-27 11:39:47 +00:00
Chandler Carruth	74ec9e19ee	[SDAG] Re-instate r215611 with a fix to a pesky X86 DAG combine. This combine is essentially combining target-specific nodes back into target independent nodes that it "knows" will be combined yet again by a target independent DAG combine into a different set of target-independent nodes that are legal (not custom though!) and thus "ok". This seems... deeply flawed. The crux of the problem is that we don't combine un-legalized shuffles that are introduced by legalizing other operations, and thus we don't see a very profitable combine opportunity. So the backend just forces the input to that combine to re-appear. However, for this to work, the conditions detected to re-form the unlegalized nodes must be exactly right. Previously, failing this would have caused poor code (if you're lucky) or a crasher when we failed to select instructions. After r215611 we would fall back into the legalizer. In some cases, this just "fixed" the crasher by produces bad code. But in the test case added it caused the legalizer and the dag combiner to iterate forever. The fix is to make the alignment checking in the x86 side of things match the alignment checking in the generic DAG combine exactly. This isn't really a satisfying or principled fix, but it at least make the code work as intended. It also highlights that it would be nice to detect the availability of under aligned loads for a given type rather than bailing on this optimization. I've left a FIXME to document this. Original commit message for r215611 which covers the rest of the chang: [SDAG] Fix a case where we would iteratively legalize a node during combining by replacing it with something else but not re-process the node afterward to remove it. In a truly remarkable stroke of bad luck, this would (in the test case attached) end up getting some other node combined into it without ever getting re-processed. By adding it back on to the worklist, in addition to deleting the dead nodes more quickly we also ensure that if it stops being dead for any reason it makes it back through the legalizer. Without this, the test case will end up failing during instruction selection due to an and node with a type we don't have an instruction pattern for. It took many million runs of the shuffle fuzz tester to find this. llvm-svn: 216537	2014-08-27 11:22:16 +00:00
Evgeniy Stepanov	4d04f66627	Clang-format over X86AsmInstrumentation.*. llvm-svn: 216536	2014-08-27 11:10:54 +00:00
Robert Khasanov	29e3b96734	[SKX] Added new versions of cmp instructions in avx512_icmp_cc multiclass, added VL multiclass. Added encoding tests llvm-svn: 216532	2014-08-27 09:34:37 +00:00
Elena Demikhovsky	ff620edd3c	AVX-512: Added intrinsic for VMOVSS store form with mask. llvm-svn: 216530	2014-08-27 07:38:43 +00:00
Craig Topper	e1d1294853	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created. llvm-svn: 216525	2014-08-27 05:25:25 +00:00
Craig Topper	3af9722529	Fix some cases were ArrayRefs were being passed by reference. Also remove 'const' from some other ArrayRef uses since its implicitly const already. llvm-svn: 216524	2014-08-27 05:25:00 +00:00
David Majnemer	54e97d5dc0	InstCombine: Optimize GEP's involving ptrtoint better We supported transforming: (gep i8* X, -(ptrtoint Y)) to: (inttoptr (sub (ptrtoint X), (ptrtoint Y))) However, this only fired if 'X' had type i8*. Generalize this to support various types of different sizes. This results in much better CodeGen, especially for pointers to packed structs. llvm-svn: 216523	2014-08-27 05:16:04 +00:00
David Blaikie	c13bc97e58	Remove type unit skeletons. GDB no longer needs them & this saves a heap of space. llvm-svn: 216521	2014-08-27 05:04:14 +00:00
Juergen Ributzka	fb506a417d	[FastISel][AArch64] Fix address simplification. When a shift with extension or an add with shift and extension cannot be folded into the memory operation, then the address calculation has to be materialized separately. While doing so the code forgot to consider a possible sign-/zero- extension. This fix folds now also the sign-/zero-extension into the add or shift instruction which is used to materialize the address. This fixes rdar://problem/18141718. llvm-svn: 216511	2014-08-27 00:58:30 +00:00
Juergen Ributzka	99dd30f338	[FastISel][AArch64] Fold Sign-/Zero-Extend into the shift immediate instruction. llvm-svn: 216510	2014-08-27 00:58:26 +00:00
David Blaikie	b3833ef0c1	Fix a couple of debug info test cases to match the metadata schema change in r216239 Found these while testing something else. llvm-svn: 216505	2014-08-27 00:04:16 +00:00
Rafael Espindola	e2c1d77fb4	Pass a std::unique_ptr<MemoryBuffer>& to getLazyBitcodeModule. By taking a reference we can do the ownership transfer in one place instead of expecting every caller to do it. llvm-svn: 216492	2014-08-26 22:00:09 +00:00
Rafael Espindola	d96d553d76	Pass a MemoryBufferRef when we can avoid taking ownership. The attached patch simplifies a few interfaces that don't need to take ownership of a buffer. For example, both parseAssembly and parseBitcodeFile will parse the entire buffer before returning. There is no need to take ownership. Using a MemoryBufferRef makes it obvious in the type signature that there is no ownership transfer. llvm-svn: 216488	2014-08-26 21:49:01 +00:00
Rafael Espindola	7271c19420	Give ExecutionEngine of top level buffers. Long term the idea if for the engine to not own the buffers, but for now this is consistent with the rest of the API. llvm-svn: 216484	2014-08-26 21:04:04 +00:00
Reid Kleckner	f6fb780890	MC: Split the x86 asm matcher implementations by dialect The existing matcher has lots of AT&T assembly dialect assumptions baked into it. In particular, the hack for resolving the size of a memory operand by appending the four most common suffixes doesn't work at all. The Intel assembly dialect mnemonic table has ambiguous entries, so we need to try matching multiple times with different operand sizes, since that's the only way to choose different instruction variants. This makes us more compatible with gas's implementation of Intel assembly syntax. MSVC assumes you want byte-sized operations for the instructions that we reject as ambiguous. Reviewed By: grosbach Differential Revision: http://reviews.llvm.org/D4747 llvm-svn: 216481	2014-08-26 20:32:34 +00:00
Joerg Sonnenberger	cb5674b9c2	Revert r210342 and r210343, add test case for the crasher. PR 20642. llvm-svn: 216475	2014-08-26 19:06:41 +00:00
Joerg Sonnenberger	2981591f7f	Convert MC command line option for fatal assembler warnings into a proper flag. llvm-svn: 216471	2014-08-26 18:39:50 +00:00
Rafael Espindola	5c4f4a6c33	Invert the condition to have a single return. Thanks to David Blaikie for the suggestion. llvm-svn: 216468	2014-08-26 18:03:35 +00:00
Rafael Espindola	d233b06afc	Return a std::unique_ptr from the IRReader.h functions. NFC. llvm-svn: 216466	2014-08-26 17:29:46 +00:00
Rafael Espindola	28b351a56d	Return a std::unique_ptr from parseInputFile and propagate. NFC. The memory management in BugPoint is fairly convoluted, so this just unwraps one layer by changing the return type of functions that always return owned Modules. llvm-svn: 216464	2014-08-26 17:19:03 +00:00
Rafael Espindola	2ce3882eaf	Simplify LTOModule::makeLTOModule a bit. NFC. Just call parseBitcodeFile instead of getLazyBitcodeModule followed by materializeAllPermanently. llvm-svn: 216461	2014-08-26 15:09:32 +00:00
Rafael Espindola	016a6d5192	Merge TempDir and system_temp_directory. We had two functions for finding the temp or cache directory. Each had a different set of smarts about OS specific APIs. With this patch system_temp_directory becomes the only way to do it. llvm-svn: 216460	2014-08-26 14:47:52 +00:00
Benjamin Kramer	01e1789d38	Silence unused function warning in Release builds. llvm-svn: 216458	2014-08-26 14:22:05 +00:00
James Molloy	36b8a88188	Change the return value of "getEnd()" from a MachineInstr* to a MachineBasicBlock::iterator. It seems on Darwin the illegal round-trip ::iterator -> MachineInstr* -> ::iterator breaks execution horribly when the iterator is not a real MachineInstr, like ::end(). llvm-svn: 216455	2014-08-26 13:41:31 +00:00
Yi Kong	ebaa150e23	ARM: Add patterns for dbg llvm-svn: 216451	2014-08-26 12:47:26 +00:00
Dinesh Dwivedi	4919bbe29d	This patch enables SimplifyUsingDistributiveLaws() to handle following pattens. (X >> Z) & (Y >> Z) -> (X&Y) >> Z for all shifts. (X >> Z) \| (Y >> Z) -> (X\|Y) >> Z for all shifts. (X >> Z) ^ (Y >> Z) -> (X^Y) >> Z for all shifts. These patterns were previously handled separately in visitAnd()/visitOr()/visitXor(). Differential Revision: http://reviews.llvm.org/D4951 llvm-svn: 216443	2014-08-26 08:53:32 +00:00
Bill Wendling	24c6f5763a	Use 'xz' compression instead of 'gz'. llvm-svn: 216442	2014-08-26 08:11:22 +00:00
David Majnemer	788d0ab8c8	InstSimplify: Fold gep X, (sub 0, ptrtoint(X)) to null Save InstCombine some work if we can perform this fold during InstSimplify. llvm-svn: 216441	2014-08-26 07:08:03 +00:00
David Majnemer	bc4981323f	InstSimplify: Simplify trivial pointer expressions like b + (e - b) consider: long long f(long long b, long long e) { return b + (e - b); } we would lower this to something like: define i64 @f(i64* %b, i64* %e) { %1 = ptrtoint i64* %e to i64 %2 = ptrtoint i64* %b to i64 %3 = sub i64 %1, %2 %4 = ashr exact i64 %3, 3 %5 = getelementptr inbounds i64* %b, i64 %4 ret i64* %5 } This should fold away to just 'e'. N.B. This adds m_SpecificInt as a convenient way to match against a particular 64-bit integer when using LLVM's match interface. llvm-svn: 216439	2014-08-26 05:55:16 +00:00
Dylan Noblesmith	4af4d2c111	AArch64: use std::fill instead of memset Followup based on review. llvm-svn: 216436	2014-08-26 03:33:26 +00:00
Dylan Noblesmith	b06f77b608	Revert "AArch64: use std::vector for temp array" This reverts commit r216365. llvm-svn: 216433	2014-08-26 02:03:43 +00:00
Dylan Noblesmith	43f49cad78	Analysis: cleanup Address review comments. llvm-svn: 216432	2014-08-26 02:03:40 +00:00
Dylan Noblesmith	4ffafefdaa	Revert "Analysis: unique_ptr-ify DependenceAnalysis::collectCoeffInfo" This reverts commit r216358. llvm-svn: 216431	2014-08-26 02:03:38 +00:00
Dylan Noblesmith	c9e2a2709e	Revert "NVPTX: remove another raw delete call" This reverts commit r216364. llvm-svn: 216430	2014-08-26 02:03:35 +00:00
Dylan Noblesmith	4e69e29a72	Revert "Support/APFloat: unique_ptr-ify temp arrays" This reverts commit rr216359. llvm-svn: 216429	2014-08-26 02:03:33 +00:00
Dylan Noblesmith	42836d95e0	Revert "Support/Path: remove raw delete" This reverts commit r216360. llvm-svn: 216428	2014-08-26 02:03:30 +00:00
Dylan Noblesmith	4b535d1930	ExecutionEngine: address review comments llvm-svn: 216427	2014-08-26 02:03:28 +00:00
Dylan Noblesmith	17f05a3fc6	CodeGen/LiveVariables: use vector::assign() Address review comments. llvm-svn: 216426	2014-08-26 02:03:25 +00:00
Reid Kleckner	3715461b48	musttail: Don't eliminate varargs packs if there is a forwarding call Also clean up and beef up this grep test for the feature. llvm-svn: 216425	2014-08-26 00:59:51 +00:00
Sanjay Patel	4e31cdabd1	fix typos in comments llvm-svn: 216424	2014-08-26 00:59:15 +00:00
Reid Kleckner	8349864dbd	Declare that musttail calls in variadic functions forward the ellipsis Summary: There is no functionality change here except in the way we assemble and dump musttail calls in variadic functions. There's really no need to separate out the bits for musttail and "is forwarding varargs" on call instructions. A musttail call by definition has to forward the ellipsis or it would fail verification. Reviewers: chandlerc, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4892 llvm-svn: 216423	2014-08-26 00:33:28 +00:00
Reid Kleckner	d83c63b704	Fix Path unittests on Windows after raw_fd_ostream changes llvm-svn: 216422	2014-08-26 00:24:23 +00:00
Reid Kleckner	e6e88f99b3	ArgPromotion: Don't touch variadic functions Adding, removing, or changing non-pack parameters can change the ABI classification of pack parameters. Clang and other frontends encode the classification in the IR of the call site, but the callee side determines it dynamically based on the number of registers consumed so far. Changing the prototype affects the number of registers consumed would break such code. Dead argument elimination performs a similar task and already has a similar check to avoid this problem. Patch by Thomas Jablin! llvm-svn: 216421	2014-08-25 23:58:48 +00:00
Lang Hames	40e200eb69	[MCJIT][SystemZ] Use a simpler expression for indirect relocation offsets. The expressions 'Reloc.Addend - Addend' and 'Reloc.Offset' should always be equal in this context. The latter is prefered - we want to remove the RelocationValueRef::Addend field in the future. llvm-svn: 216418	2014-08-25 23:33:48 +00:00
Rafael Espindola	42036ae034	Fix bug in llvm::sys::argumentsFitWithinSystemLimits(). This patch fixes a subtle bug in the UNIX implementation of llvm::sys::argumentsFitWithinSystemLimits() regarding the misuse of a static variable. This bug causes our cached number that stores the system command line maximum length to be halved after each call to the function. With a sufficient number of calls to this function, it will eventually report any given command line string to be over system limits. Patch by Rafael Auler. llvm-svn: 216415	2014-08-25 22:53:21 +00:00
Lang Hames	f4b3b67f57	[MCJIT] Dump section memory both before and after relocations are applied. Also switch section memory dump format from 8 to 16 columns. llvm-svn: 216413	2014-08-25 22:19:14 +00:00
Rafael Espindola	f7c3a1d256	Refactor argument serialization logic when executing process. NFC. This patch refactors the argument serialization logic used in the Execute function, used to launch new Windows processes. There is a critical step that joins char** arguments into a single string, building the command line used to launch the new process, and the readability of this code is improved if this part is refactored in its own helper function. Patch by Rafael Auler! llvm-svn: 216411	2014-08-25 22:15:06 +00:00
Juergen Ributzka	1912e24898	[FastISel][AArch64] Refactor float zero materialization. NFCI. llvm-svn: 216403	2014-08-25 19:58:05 +00:00
Lang Hames	86b08f02c0	[MCJIT] Make RuntimeDyld dump section contents in -debug mode. llvm-svn: 216400	2014-08-25 18:37:38 +00:00
Rafael Espindola	3fd1e9933f	Modernize raw_fd_ostream's constructor a bit. Take a StringRef instead of a "const char *". Take a "std::error_code &" instead of a "std::string &" for error. A create static method would be even better, but this patch is already a bit too big. llvm-svn: 216393	2014-08-25 18:16:47 +00:00
Chandler Carruth	70f81a98ca	[x86] Fix a bug in r216319 where I was missing a 'break'. This actually was caught by existing tests but those tests were disabled with an XFAIL because of PR20736. While working on fixing that, I noticed the test failure, and tracked it down to this. We even have a really nice Clang warning that would have caught this but it isn't enabled in LLVM! =[ I may look at enabling it. llvm-svn: 216391	2014-08-25 18:06:11 +00:00
Bruno Cardoso Lopes	e2a1fa35df	Remove dangling initializers in GlobalDCE GlobalDCE deletes global vars and updates their initializers to nullptr while leaving underlying constants to be cleaned up later by its uses. The clean up may never happen, fix this by forcing it every time it's safe to destroy constants. Final patch by Rafael Espindola http://reviews.llvm.org/D4931 <rdar://problem/17523868> llvm-svn: 216390	2014-08-25 17:51:14 +00:00
Bruno Cardoso Lopes	356c4ac88b	Rise from the dead and update personal info llvm-svn: 216389	2014-08-25 17:51:04 +00:00
Chad Rosier	e62f365458	[AArch32] Add patterns for VCVT{A,N,P,M}. Patterns for lowering libm calls to VCVT{A,N,P,M} are also included. Phabricator Revision: http://reviews.llvm.org/D5033 llvm-svn: 216388	2014-08-25 16:56:33 +00:00
Robert Khasanov	2ea081d4d1	[SKX] avx512_icmp_packed multiclass extension Extended avx512_icmp_packed multiclass by masking versions. Added avx512_icmp_packed_rmb multiclass for embedded broadcast versions. Added corresponding _vl multiclasses. Added encoding tests for CPCMP{EQ\|GT}* instructions. Add more fields for X86VectorVTInfo. Added AVX512VLVectorVTInfo that include X86VectorVTInfo for 512/256/128-bit versions Differential Revision: http://reviews.llvm.org/D5024 llvm-svn: 216383	2014-08-25 14:49:34 +00:00
Stepan Dyatkovskiy	c90308bf83	MergeFunctions, tiny refactoring: cmpAPFloat has been renamed to cmpAPFloats (multiple form). llvm-svn: 216376	2014-08-25 08:22:46 +00:00
Stepan Dyatkovskiy	7f895c1184	MergeFunctions, tiny refactoring: cmpAPInt has been renamed to cmpAPInts (multiple form). llvm-svn: 216375	2014-08-25 08:19:50 +00:00
Stepan Dyatkovskiy	0b765dee6e	MergeFunctions, tiny refactoring: cmpType has been renamed to cmpTypes (multiple form). llvm-svn: 216374	2014-08-25 08:16:39 +00:00
Stepan Dyatkovskiy	016daddc52	MergeFunctions, tiny refactoring: cmpGEP has been renamed to cmpGEPs (multiple form). llvm-svn: 216373	2014-08-25 08:12:45 +00:00
Karthik Bhat	7f33ff7dea	Allow vectorization of division by uniform power of 2. This patch adds support to recognize division by uniform power of 2 and modifies the cost table to vectorize division by uniform power of 2 whenever possible. Updates Cost model for Loop and SLP Vectorizer.The cost table is currently only updated for X86 backend. Thanks to Hal, Andrea, Sanjay for the review. (http://reviews.llvm.org/D4971) llvm-svn: 216371	2014-08-25 04:56:54 +00:00
Dylan Noblesmith	6e69927d03	CodeGen/LiveVariables: hoist out code in nested loops This makes runOnMachineFunction vastly more readable. llvm-svn: 216368	2014-08-25 01:59:49 +00:00
Dylan Noblesmith	46a922c191	CodeGen/LiveVariables: switch to std::vector No functionality change. llvm-svn: 216367	2014-08-25 01:59:42 +00:00
Dylan Noblesmith	b899464f5b	AArch64: unique_ptr-ify map structures llvm-svn: 216366	2014-08-25 01:59:38 +00:00
Dylan Noblesmith	6076debd98	AArch64: use std::vector for temp array llvm-svn: 216365	2014-08-25 01:59:36 +00:00
Dylan Noblesmith	130589f804	NVPTX: remove another raw delete call llvm-svn: 216364	2014-08-25 01:59:32 +00:00
Dylan Noblesmith	802b6ce8de	NVPTX: remove raw delete call Also make members that are never accessed outside the class private. llvm-svn: 216363	2014-08-25 01:59:29 +00:00
Dylan Noblesmith	c4a9942a68	ExecutionEngine: unique_ptr-ify NFC. llvm-svn: 216362	2014-08-25 00:58:18 +00:00
Dylan Noblesmith	2b9b93e6f1	EE/JIT: unique_ptr-ify llvm-svn: 216361	2014-08-25 00:58:15 +00:00
Dylan Noblesmith	0b59924d60	Support/Path: remove raw delete llvm-svn: 216360	2014-08-25 00:58:13 +00:00
Dylan Noblesmith	49c758b769	Support/APFloat: unique_ptr-ify temp arrays llvm-svn: 216359	2014-08-25 00:58:10 +00:00
Dylan Noblesmith	3ecd22fcf5	Analysis: unique_ptr-ify DependenceAnalysis::collectCoeffInfo llvm-svn: 216358	2014-08-25 00:28:43 +00:00
Dylan Noblesmith	2cae60e730	Analysis: unique_ptr-ify DependenceAnalysis::depends llvm-svn: 216357	2014-08-25 00:28:39 +00:00
Dylan Noblesmith	d96ce66cb1	Analysis: take a reference instead of pointer This parameter is never null. llvm-svn: 216356	2014-08-25 00:28:35 +00:00
Dylan Noblesmith	688fa5e15b	CodeGen: switch raw array to std::vector llvm-svn: 216355	2014-08-25 00:28:31 +00:00
Dylan Noblesmith	06adf32814	IR: remove dead code This was added in r134994, to fix a memory leak; three days later, r135248 switched ContainedTys from being new-allocated to being allocated via BumpPtrAllocator, and the earlier fix was never reverted. The destructor doesn't seem to ever actually be called on Types anyway, so it's harmless, but if it were, this'd be an invalid pointer. This reverts r134994. llvm-svn: 216354	2014-08-25 00:28:27 +00:00
Craig Topper	4627679cec	Use range based for loops to avoid needing to re-mention SmallPtrSet size. llvm-svn: 216351	2014-08-24 23:23:06 +00:00
Dylan Noblesmith	085fc4d6c6	TableGen: unique_ptr-ify RecordKeeper llvm-svn: 216350	2014-08-24 19:10:57 +00:00
Dylan Noblesmith	aa9b74c544	TableGen: delete no-op code This does nothing but remove the Record from the map, and then re-add it, without actually changing it in between. The Record's Name used to be changed before re-adding it when the code was first committed in r137232, but the name-changing lines were removed in r142510, and since then this code seems to do nothing. This was also the only caller of removeClass or removeDef, so now RecordKeeper owns its Records unconditionally, and could be unique_ptr-ified. llvm-svn: 216349	2014-08-24 19:10:53 +00:00
Dylan Noblesmith	80f0e432ee	TableGen: use auto and for-range llvm-svn: 216348	2014-08-24 19:10:49 +00:00
Aaron Ballman	9d515ff695	This code is from r216285, which did not go out to the mailing list for some reason. The switch statement would never fire due to the preceding break statement. Also, the switch statement has a default label with no case labels. Simplified the code, and allow it to execute. llvm-svn: 216346	2014-08-24 13:25:16 +00:00
Elena Demikhovsky	22e735d725	X86 intrinsics table - simplifies intrinsics lowering. The tables are initialized when X86TargetLowering object is created. llvm-svn: 216345	2014-08-24 09:19:56 +00:00
Patrik Hagglund	3e2a9d6125	Silence gcc -Wpedantic. llvm-svn: 216344	2014-08-24 09:12:33 +00:00
David Majnemer	0ffccf7fb5	InstCombine: Properly optimize or'ing bittests together CFE, with -03, would turn: bool f(unsigned x) { bool a = x & 1; bool b = x & 2; return a \| b; } into: %1 = lshr i32 %x, 1 %2 = or i32 %1, %x %3 = and i32 %2, 1 %4 = icmp ne i32 %3, 0 This sort of thing exposes a nasty pathology in GCC, ICC and LLVM. Instead, we would rather want: %1 = and i32 %x, 3 %2 = icmp ne i32 %1, 0 Things get a bit more interesting in the following case: %1 = lshr i32 %x, %y %2 = or i32 %1, %x %3 = and i32 %2, 1 %4 = icmp ne i32 %3, 0 Replacing it with the following sequence is better: %1 = shl nuw i32 1, %y %2 = or i32 %1, 1 %3 = and i32 %2, %x %4 = icmp ne i32 %3, 0 This sequence is preferable because %1 doesn't involve %x and could potentially be hoisted out of loops if it is invariant; only perform this transform in the non-constant case if we know we won't increase register pressure. llvm-svn: 216343	2014-08-24 09:10:57 +00:00
Hal Finkel	584a70c820	[PowerPC] Add support for dcbtst and icbt (prefetch) Adds code generation support for dcbtst (data cache prefetch for write) and icbt (instruction cache prefetch for read - Book E cores only). We still end up with a 'cannot select' error for the non-supported prefetch intrinsic forms. This will be fixed in a later commit. Fixes PR20692. llvm-svn: 216339	2014-08-23 23:21:04 +00:00
Dylan Noblesmith	c4c5180fb4	Support: add llvm::unique_lock Based on the STL class of the same name, it guards a mutex while also allowing it to be unlocked conditionally before destruction. This eliminates the last naked usages of mutexes in LLVM and clang. It also uncovered and fixed a bug in callExternalFunction() when compiled without USE_LIBFFI, where the mutex would never be unlocked if the end of the function was reached. llvm-svn: 216338	2014-08-23 23:07:14 +00:00
Dylan Noblesmith	13044d1cc5	Support: make LLVM Mutexes STL-compatible Use lock/unlock() convention instead of acquire/release(). llvm-svn: 216336	2014-08-23 22:49:22 +00:00
Dylan Noblesmith	4704ffe164	Support/Unix: use ScopedLock wherever possible Only one function remains a bit too complicated for a simple mutex guard. No functionality change. llvm-svn: 216335	2014-08-23 22:49:17 +00:00
Dylan Noblesmith	63f9b57147	cmake: actually test -Wcomment This test was testing nothing, as only -Werror was ever being added to the compiler flags. You can see the final nitty-gritty compiler invocation in CMakeFiles/CMakeOutput.log (for successful tests) and CMakeFiles/CMakeError.log (for failed tests). Before: Building C object CMakeFiles/cmTryCompileExec3385359576.dir/src.c.o /usr/bin/clang -fPIC -Wall -W -Wno-unused-parameter -Wwrite-strings -Wmissing-field-initializers -pedantic -Wno-long-long -Wcovered-switch-default -DC_WCOMMENT_ALLOWS_LINE_WRAP -Werror -o CMakeFiles/cmTryCompileExec3385359576.dir/src.c.o -c /home/nobled/code/llvm-b9/CMakeFiles/CMakeTmp/src.c After: Building C object CMakeFiles/cmTryCompileExec3385359576.dir/src.c.o /usr/bin/clang -fPIC -Wall -W -Wno-unused-parameter -Wwrite-strings -Wmissing-field-initializers -pedantic -Wno-long-long -Wcovered-switch-default -DC_WCOMMENT_ALLOWS_LINE_WRAP -Werror -Wcomment -o CMakeFiles/cmTryCompileExec3385359576.dir/src.c.o -c /home/nobled/code/llvm-b9/CMakeFiles/CMakeTmp/src.c llvm-svn: 216328	2014-08-23 21:10:58 +00:00
Dylan Noblesmith	367cef6362	cmake: disable -Wnon-virtual-dtor when it gives false positives clang has only been smart enough not to trigger -Wnon-virtual-dtor warnings on final classes since r208449 (in clang 3.5). Building with older versions is extremely noisy, so disable the warning on those compilers. llvm-svn: 216327	2014-08-23 21:10:56 +00:00
Chad Rosier	ad7c910ecf	Revert "ARM: improve RTABI 4.2 conformance on Linux" This reverts commit r215862 due to nightly failures. Will work on getting a reduced test case, but I wanted to get our bots green in the meantime. llvm-svn: 216325	2014-08-23 18:29:43 +00:00
Chad Rosier	d2959362fb	Revert "ARM: mark missing functions from RTABI" This reverts commit r215863. llvm-svn: 216324	2014-08-23 18:29:40 +00:00
Chandler Carruth	a15258b4e6	[x86] Start fixing a really subtle and terrible form of miscompile in these DAG combines. The DAG auto-CSE thing is truly terrible. Due to it, when RAUW-ing a node with its operand, you can cause its uses to CSE to itself, which then causes their uses to become your uses which causes them to be picked up by the RAUW. For nodes that are determined to be "no-ops", this is "fine". But if the RAUW is one of several steps to enact a transformation, this causes the DAG to really silently eat an discard nodes that you would never expect. It took days for me to actually pinpoint a test case triggering this and a really frustrating amount of time to even comprehend the bug because I never even thought about the ability of RAUW to iteratively consume nodes due to CSE-ing them into itself. To fix this, we have to build up a brand-new chain of operations any time we are combining across (potentially) intervening nodes. But once the logic is added to do this, another issue surfaces: CombineTo eagerly deletes the one node combined, but no others. This is... really frustrating. If deleting it makes its operands become dead, those operand nodes often won't go onto the worklist in the order you would want -- they're already on it and not near the top. That means things higher on the worklist will get combined prior to these dead nodes being GCed out of the worklist, and if the chain is long, the immediate users won't be enough to re-detect where the root of the chain is that became single-use again after deleting the dead nodes. The better way to do this is to never immediately delete nodes, and instead to just enqueue them so we can recursively delete them. The combined-from node is typically not on the worklist anyways by virtue of having been popped off.... But that in turn breaks other tests that require CombineTo to delete unused nodes. :: sigh :: Fortunately, there is a better way. This whole routine should have been returning the replacement rather than using CombineTo which is quite hacky. Switch to that, and all the pieces fall together. I suspect the same kind of miscompile is possible in the half-shuffle folding code, and potentially the recursive folding code. I'll be switching those over to a pattern more like this one for safety's sake even though I don't immediately have any test cases for them. Note that the only way I got a test case for this instance was with heavily DAG combined 256-bit shuffle sequences generated by my fuzzer. ;] llvm-svn: 216319	2014-08-23 10:25:15 +00:00
Hans Wennborg	9a01309b3b	ProgrammersManual: the flag is called -debug-only llvm-svn: 216316	2014-08-23 04:34:58 +00:00
Alex Lorenz	7949a8b8ea	llvm-cov: test: add xfail for the big-endian buildbots llvm-svn: 216310	2014-08-23 00:47:24 +00:00
Nick Lewycky	a4967c2740	Revert r215611 because it caused the infinite loop in bug 20736. There is a reduced testcase in that bug. llvm-svn: 216307	2014-08-23 00:45:03 +00:00
Yunzhong Gao	300bdb35d4	Add a test case for SROA where the store size is bigger than slice size. The test case was fixed in r216248. llvm-svn: 216303	2014-08-22 23:27:04 +00:00
Rafael Espindola	f7ecb11572	Add support for comdats to the gold plugin. There are two parts to this. First, the plugin needs to tell gold the comdat by setting comdat_key. What gets things a bit more complicated is that gold only seems symbols. In particular, if A is an alias to B, it only sees the symbols A and B. It can then ask us to keep symbol A but drop symbol B. What we have to do instead is to create an internal version of B and make A an alias to that. At some point some of this logic should be moved to lib/Linker so that we don't map a Constant to an internal version just to have lib/Linker map that again to the destination module. The reason for implementing this in tools/gold for now is simplicity. With it in place it should be possible to update clang to use comdats for constructors and destructors on ELF without breaking the LTO bootstrap. Once that is done I intend to come back and improve the interface lib/Linker exposes. llvm-svn: 216302	2014-08-22 23:26:10 +00:00
Alex Lorenz	e82d89cc37	llvm-cov: add code coverage tool that's based on coverage mapping format and clang's pgo. This commit expands llvm-cov's functionality by adding support for a new code coverage tool that uses LLVM's coverage mapping format and clang's instrumentation based profiling. The gcov compatible tool can be invoked by supplying the 'gcov' command as the first argument, or by modifying the tool's name to end with 'gcov'. Differential Revision: http://reviews.llvm.org/D4445 llvm-svn: 216300	2014-08-22 22:56:03 +00:00
Jingyue Wu	ec33fa9aca	[SROA] Fold a PHI node if all its incoming values are the same Summary: Fixes PR20425. During slice building, if all of the incoming values of a PHI node are the same, replace the PHI node with the common value. This simplification makes alloca's used by PHI nodes easier to promote. Test Plan: Added three more tests in phi-and-select.ll Reviewers: nlewycky, eliben, meheff, chandlerc Reviewed By: chandlerc Subscribers: zinovy.nis, hfinkel, baldrick, llvm-commits Differential Revision: http://reviews.llvm.org/D4659 llvm-svn: 216299	2014-08-22 22:45:57 +00:00
Reid Kleckner	2d9bb65b3d	ARM / x86_64 varargs: Don't save regparms in prologue without va_start There's no need to do this if the user doesn't call va_start. In the future, we're going to have thunks that forward these register parameters with musttail calls, and they won't need these spills for handling va_start. Most of the test suite changes are adding va_start calls to existing tests to keep things working. llvm-svn: 216294	2014-08-22 21:59:26 +00:00
Rafael Espindola	bd334e2f32	Clear the llvm release notes to make room for 3.6. llvm-svn: 216292	2014-08-22 21:57:38 +00:00
Kevin Enderby	b76d386d7c	Add the start of the support for llvm-objdump’s -private-headers for Mach-O files. This adds the printing of the mach header. Load command printing will be next. llvm-svn: 216285	2014-08-22 20:35:18 +00:00
Kevin Enderby	7d7eeab365	Add a few missing mach header flags. llvm-svn: 216284	2014-08-22 20:34:31 +00:00
Reid Kleckner	e3f146d941	Fix PR17239 by changing the semantics of the RemainingArgsClass Option kind This patch contains the LLVM side of the fix of PR17239. This bug that happens because the /link (clang-cl.exe argument) is marked as "consume all remaining arguments". However, when inside a response file, /link should only consume all remaining arguments inside the response file where it is located, not the entire command line after expansion. My patch will change the semantics of the RemainingArgsClass kind to always consume only until the end of the response file when the option originally came from a response file. There are only two options in this class: dash dash (--) and /link. Reviewed By: rnk Differential Revision: http://reviews.llvm.org/D4899 Patch by Rafael Auler! llvm-svn: 216280	2014-08-22 19:29:17 +00:00
Tom Stellard	f3fc555e3b	R600/SI: Use READ2/WRITE2 instructions for 64-bit mem ops with 32-bit alignment llvm-svn: 216279	2014-08-22 18:49:35 +00:00

... 4 5 6 7 8 ...

107534 Commits