llvm-project

Commit Graph

Author	SHA1	Message	Date
David Blaikie	2ed678c6af	[llvm-dwp] clang-format this to catch anything I've missed along the way llvm-svn: 254828	2015-12-05 03:06:30 +00:00
David Blaikie	24c8ac93f3	[llvm-dwp] Support debug_tu_index llvm-svn: 254827	2015-12-05 03:05:45 +00:00
Dan Gohman	f0b165a7f8	[WebAssembly] Implement ReverseBranchCondition, and re-enable MachineBlockPlacement This patch introduces a codegen-only instruction currently named br_unless, which makes it convenient to implement ReverseBranchCondition and re-enable the MachineBlockPlacement pass. Then in a late pass, it lowers br_unless back into br_if. Differential Revision: http://reviews.llvm.org/D14995 llvm-svn: 254826	2015-12-05 03:03:35 +00:00
Kostya Serebryany	064a672f65	[libFuzzer] one more trophie llvm-svn: 254825	2015-12-05 02:23:49 +00:00
Kostya Serebryany	8617aaaac2	[libFuzzer] don't reload the corpus more than once every second llvm-svn: 254824	2015-12-05 02:09:22 +00:00
Lang Hames	da7ffc25dd	Whitespace. llvm-svn: 254821	2015-12-05 01:44:20 +00:00
Keno Fischer	e54f58c7c5	[opt] Fix run-twice option for non-idempotent passes Cloning the module was supposed to guard against the possibility that the passes may be non-idempotent. However, for some reason I decided to put that AFTER the passes had already run on the module, defeating the point entirely. Fix that by moving up the CloneModule as is done in llc. llvm-svn: 254819	2015-12-05 01:38:12 +00:00
Keno Fischer	8656d567d9	[MC] Add a test for state reset in MCMachOStreamer This was fixed in r254751, but untestable until r254774, which added the necessary command line flag to llc. Add a test now to make sure this doesn't regress again. llvm-svn: 254814	2015-12-05 01:02:53 +00:00
Cong Hou	a465312e9c	Fix a typo in LoopVectorize.cpp. NFC. llvm-svn: 254813	2015-12-05 01:00:22 +00:00
Dan Gohman	4da4abd87f	[WebAssembly] Fix scheduling dependencies in register-stackified code Add physical register defs to instructions used from stackified instructions to prevent them from being scheduled into the middle of a stack sequence. This is a conservative measure which may be loosened in the future. Differential Revision: http://reviews.llvm.org/D15252 llvm-svn: 254811	2015-12-05 00:51:40 +00:00
Justin Bogner	d9a8ac6cc7	CodeGen: Let the BumpPtrAllocator free the elements of indexList The indexList's nodes are all allocated on a BumpPtrAllocator, so it's more efficient to let them be freed when it goes away, rather than deleting them directly. This is a follow up to r254794. llvm-svn: 254808	2015-12-05 00:39:14 +00:00
Derek Schuff	9d77952332	[WebAssembly] Support constant offsets on loads and stores This is just prototype for load/store for i32 types. I'll add them to the rest of the types if we like this direction. Differential Revision: http://reviews.llvm.org/D15197 llvm-svn: 254807	2015-12-05 00:26:39 +00:00
Philip Reames	7c6692de16	[EarlyCSE] IsSimple vs IsVolatile naming clarification (NFC) When the notion of target specific memory intrinsics was introduced to EarlyCSE, the commit confused the notions of volatile and simple memory access. Since I'm about to start working on this area, cleanup the naming so that patches aren't horribly confusing. Note that the actual implementation was always bailing if the load or store wasn't simple. Reminder: - "volatile" - C++ volatile, can't remove any memory operations, but in principal unordered - "ordered" - imposes ordering constraints on other nearby memory operations - "atomic" - can't be split or sheared. In LLVM terms, all "ordered" operations are also atomic so the predicate "isAtomic" is often used. - "simple" - a load which is none of the above. These are normal loads and what most of the optimizer works with. llvm-svn: 254805	2015-12-05 00:18:33 +00:00
Keno Fischer	38707c45be	[opt] Fix sanitizer complaints about r254774 `Out` can be null if no output is requested, so move any access to it inside the conditional. Thanks to Justin Bogner for finding this. llvm-svn: 254804	2015-12-05 00:06:37 +00:00
Philip Reames	000f77d728	[PassManager] Ensure destructors of cached AnalysisUsage objects are run In 254760, I introduced the usage of a BumpPtrAllocator for the AnalysisUsage instances held by the PassManger. This turns out to have been incorrect since a BumpPtrAllocator does not run the destructors of objects when deallocating memory. Since a few of our SmallVector's had grown beyond their small size, we end up with some leaked memory. We need to use a SpecificBumpPtrAllocator instead. llvm-svn: 254803	2015-12-04 23:48:19 +00:00
Teresa Johnson	bae7e75959	[ThinLTO] Helper for performing renaming/promotion on a module Creates a module and performs necessary renaming/promotion of locals that may be exported to another module. Split out of D15024. llvm-svn: 254802	2015-12-04 23:40:22 +00:00
Hans Wennborg	fbf2822e6d	Add FeatureLAHFSAHF to amdfam10 as well. llvm-svn: 254801	2015-12-04 23:32:19 +00:00
Dan Gohman	35bfb24c28	[WebAssembly] Initial varargs support. Full varargs support will depend on prologue/epilogue support, but this patch gets us started with most of the basic infrastructure. Differential Revision: http://reviews.llvm.org/D15231 llvm-svn: 254799	2015-12-04 23:22:35 +00:00
Philip Reames	b6306da405	Address a memory leak in 254760 The issue appears to have been that the copy constructor of the SmallVector was being invoked and this was somehow leading to leaked memory. This patch avoids the symptom, but likely doesn't address the underlying problem. I'm still investigating the root cause, but wanted to avoid the memory leak in the mean time. Even with the underlying fix, avoiding the redundant allocation is worthwhile. llvm-svn: 254795	2015-12-04 23:06:33 +00:00
Justin Bogner	a0a9d75e3c	CodeGen: Move the SlotIndexes BumpPtrAllocator before the list it allocates When a `SlotIndexes` is destroyed, `ileAllocator` will currently be destructed before `IndexList`, but all of `IndexList`'s storage has been allocated by `ileAllocator`. This means we'll call destructors on garbage data, which is very bad. This can be avoided by putting the BumpPtrAllocator earlier in the class than anything it allocates. Unfortunately, I don't know how to test this. It depends very much on memory layout, and the only evidence I have that this is actually happening in practice are backtraces that might be explained by this. By inspection though, the code is obviously dangerous/wrong, and this is the right thing to do. I'll follow up later with a patch that calls clearAndLeakNodesUnsafely on the list, since there isn't much point in destructing them when they're allocated in a BPA anyway, but I figured it makes sense to commit the correctness fix separately from that optimization. llvm-svn: 254794	2015-12-04 23:00:54 +00:00
Hans Wennborg	5000ce8a63	X86: Don't emit SAHF/LAHF for 64-bit targets unless explicitly supported These instructions are not supported by all CPUs in 64-bit mode. Emitting them causes Chromium to crash on start-up for users with such chips. (GCC puts these instructions behind -msahf on 64-bit for the same reason.) This patch adds FeatureLAHFSAHF, enables it by default for 32-bit targets and modern CPUs, and changes X86InstrInfo::copyPhysReg back to the lowering from before r244503 when the instructions are not available. Differential Revision: http://reviews.llvm.org/D15240 llvm-svn: 254793	2015-12-04 23:00:33 +00:00
Derek Schuff	68b309a306	Add TransformUtils to list of required libraries for llc This dependency was added in r254774 llvm-svn: 254786	2015-12-04 22:47:58 +00:00
Kostya Serebryany	9e48cda9bc	[libFuzzer] compute base64 in-process instead of using an external lib. Since libFuzzer should not depend on anything, just re-implement base64 encoder. PR25746 llvm-svn: 254784	2015-12-04 22:29:39 +00:00
Rafael Espindola	f85e9729e9	MSVC complains about this being ambiguous. llvm-svn: 254782	2015-12-04 22:26:21 +00:00
Lang Hames	e69b751155	[Orc] Move some code up into the JITCompileCallbackManager base class. NFC. llvm-svn: 254778	2015-12-04 22:09:19 +00:00
Rafael Espindola	f49a38fc08	Always pass a diagnostic handler to the linker. Before this patch the diagnostic handler was optional. If it was not passed, the one in the LLVMContext was used. That is probably not a pattern we want to follow. If each area has an optional callback, there is a sea of callbacks and it is hard to follow which one is called. Doing this also found cases where the callback is a nice addition, like testing that no errors or warnings are reported. The other option is to always use the diagnostic handler in the LLVMContext. That has a few problems * To implement the C API we would have to set the diag handler and then set it back to the original value. * Code that creates the context might be far away from code that wants the diagnostics. I do have a patch that implements the second option and will send that as an RFC. llvm-svn: 254777	2015-12-04 22:08:53 +00:00
Weiming Zhao	8213072a45	[SimplifyLibCalls] Optimization for pow(x, n) where n is some constant Summary: In order to avoid calling pow function we generate repeated fmul when n is a positive or negative whole number. For each exponent we pre-compute Addition Chains in order to minimize the no. of fmuls. Refer: http://wwwhomes.uni-bielefeld.de/achim/addition_chain.html We pre-compute addition chains for exponents upto 32 (which results in a max of 7 fmuls). For eg: 4 = 2+2 5 = 2+3 6 = 3+3 and so on Hence, pow(x, 4.0) ==> y = fmul x, x x = fmul y, y ret x For negative exponents, we simply compute the reciprocal of the final result. Note: This transformation is only enabled under fast-math. Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: weimingz, majnemer, escha, davide, scanon, joerg Subscribers: probinson, escha, llvm-commits Differential Revision: http://reviews.llvm.org/D13994 llvm-svn: 254776	2015-12-04 22:00:47 +00:00
Pete Cooper	b51aafd28e	Fix incorrect quote. NFC llvm-svn: 254775	2015-12-04 21:59:04 +00:00
Keno Fischer	04464cf731	[llc/opt] Add an option to run all passes twice Summary: Lately, I have submitted a number of patches to fix bugs that only occurred when using the same pass manager to compile multiple modules (generally these bugs are failure to reset some persistent state). Unfortunately I don't think there is currently a way to test that from the command line. This adds a very simple flag to both llc and opt, under which the tools will simply re-run their respective pass pipelines using the same pass manager on (a clone of the same module). Additionally, we verify that both outputs are bitwise the same. Reviewers: yaron.keren Subscribers: loladiro, yaron.keren, kcc, llvm-commits Differential Revision: http://reviews.llvm.org/D14965 llvm-svn: 254774	2015-12-04 21:56:46 +00:00
Chad Rosier	f3491496dc	[AArch64] Expand vector SDIVREM/UDIVREM operations. http://reviews.llvm.org/D15214 Patch by Ana Pazos <apazos@codeaurora.org>! llvm-svn: 254773	2015-12-04 21:38:44 +00:00
David Blaikie	efadacfb14	[llvm-dwp] Remove some out of date comments llvm-svn: 254772	2015-12-04 21:38:39 +00:00
David Blaikie	7c4ffe018a	[llvm-dwp] Implement the required on-disk probed hash table llvm-svn: 254770	2015-12-04 21:30:23 +00:00
Reid Kleckner	9f23516415	Fix llvm-readobj build on Windows, match noreturn attribute on reportError in headers llvm-svn: 254769	2015-12-04 21:29:53 +00:00
David Blaikie	b7020255e5	[llvm-dwp] Include the debug_line.dwo section This probably shouldn't be generated in the .dwo file for CUs, only for TUs, but it's in the sample .dwos (generated by clang) so dwp should reflect that. Arguably the DWP tool could be smart enough to know that the CUs shouldn't need a debug_line.dwo section and skip that even when it's legitimately generated for TUs, but that's a bit more off-book. llvm-svn: 254767	2015-12-04 21:16:42 +00:00
Sanjoy Das	18ceafeb2d	[OperandBundles] Allow operand-specific attributes in operand bundles Currently `OperandBundleUse::operandsHaveAttr` computes its result without being given a specific operand. This is problematic because it forces us to say that, e.g., even non-pointer operands in `"deopt"` operand bundles are `readonly`, which doesn't make sense. This commit changes `operandsHaveAttr` to work in the context of a specific operand, so that we can give the operand attributes that make sense for the operands's `llvm::Type`. llvm-svn: 254764	2015-12-04 20:34:37 +00:00
Philip Reames	e8aeaeb712	[LegacyPassManager] Reduce memory usage for AnalysisUsage The LegacyPassManager was storing an instance of AnalysisUsage for each instance of each pass. In practice, most instances of a single pass class share the same dependencies. We can't rely on this because passes can (and some do) have dynamic dependencies based on instance options. We can exploit the likely commonality by uniqueing the usage information after querying the pass, but before storing it into the pass manager. This greatly reduces memory consumption by the AnalysisUsage objects. For a long pass pipeline, I measured a decrease in memory consumption for this storage of about 50%. I have not measured on the default O3 pipeline, but I suspect it will see some benefit as well since many passes are repeated (e.g. InstCombine). Differential Revision: http://reviews.llvm.org/D14677 llvm-svn: 254760	2015-12-04 20:05:04 +00:00
Matthias Braun	b17e8b1c1d	ScheduleDAGInstrs: Move LiveIntervals field to ScheduleDAGMI Now that ScheduleDAGInstrs doesn't need it anymore we can move the field down the class hierarcy to ScheduleDAGMI. llvm-svn: 254759	2015-12-04 19:54:24 +00:00
Davide Italiano	1eb9234fd3	[llvm-readobj] reportError() never returns. Mark with the correct attribute. llvm-svn: 254752	2015-12-04 19:29:49 +00:00
Davide Italiano	20fe428859	[llvm-readobj/ELF] Simplify Verdef handling. llvm-svn: 254751	2015-12-04 19:27:58 +00:00
Mike Aizatsky	fdc4b313d7	fixing Makefile llvm-svn: 254749	2015-12-04 19:11:54 +00:00
Mike Aizatsky	8dff7ca375	adding MC dependencies in hopes to pacify the hexagon build. llvm-svn: 254745	2015-12-04 18:50:18 +00:00
Mike Aizatsky	0650e9b2b7	sancov -not-covered-functions. Summary: The command prints out list of functions that were not entered. To do this, addresses are first converted to function locations. Set operations are used for function locations. Differential Revision: http://reviews.llvm.org/D14889 review llvm-svn: 254742	2015-12-04 18:35:37 +00:00
Dan Gohman	1ce2b1afd6	[WebAssembly] Add several more calling conventions to the supported list. llvm-svn: 254741	2015-12-04 18:27:03 +00:00
Sanjay Patel	8e7facbd4e	don't repeat function names in comments; NFC llvm-svn: 254740	2015-12-04 17:54:31 +00:00
Sanjay Patel	1640c54593	fix formatting; NFC llvm-svn: 254739	2015-12-04 17:51:55 +00:00
Manman Ren	19c7bbe3b7	[CXX TLS calling convention] Add CXX TLS calling convention. This commit adds a new target-independent calling convention for C++ TLS access functions. It aims to minimize overhead in the caller by perserving as many registers as possible. The target-specific implementation for X86-64 is defined as following: Arguments are passed as for the default C calling convention The same applies for the return value(s) The callee preserves all GPRs - except RAX and RDI The access function makes C-style TLS function calls in the entry and exit block, C-style TLS functions save a lot more registers than normal calls. The added calling convention ties into the existing implementation of the C-style TLS functions, so we can't simply use existing calling conventions such as preserve_mostcc. rdar://9001553 llvm-svn: 254737	2015-12-04 17:40:13 +00:00
David Blaikie	ad07b5d65e	[llvm-dwp] Retrieve the DWOID from the CU for the cu_index entry llvm-svn: 254731	2015-12-04 17:20:04 +00:00
Dan Gohman	541841e365	[WebAssembly] Give names to the callseq begin and end instructions. llvm-svn: 254730	2015-12-04 17:19:44 +00:00
Dan Gohman	a3f5ce5f1b	[WebAssembly] clang-format CallingConvSupported. NFC. llvm-svn: 254729	2015-12-04 17:18:32 +00:00
Dan Gohman	85dbdda1ed	[WebAssembly] Factor out the list of supported calling conventions. llvm-svn: 254728	2015-12-04 17:16:07 +00:00
Dan Gohman	2d822e73fa	[WebAssembly] Check for more unsupported ABI flags. llvm-svn: 254727	2015-12-04 17:12:52 +00:00
Dan Gohman	cb7940f9f5	[WebAssembly] Use SelectionDAG::getUNDEF. NFC. llvm-svn: 254726	2015-12-04 17:09:42 +00:00
Krzysztof Parzyszek	f1b3e5e52e	[Hexagon] Simplify LowerCONCAT_VECTORS, handle different types better llvm-svn: 254724	2015-12-04 16:18:15 +00:00
Rafael Espindola	a7612b4fac	Modernize the C++ APIs for creating LTO modules. This is a continuation of r253367. These functions return is owned by the caller, so they return std::unique_ptr now. The call can fail, so the return is wrapped in ErrorOr. They have a context where to report diagnostics, so they don't need to take a string out parameter. With this there are no call to getGlobalContext in lib/LTO. llvm-svn: 254721	2015-12-04 16:14:31 +00:00
Tim Northover	bebd2f4028	ARM/AArch64: update reference documentation. There's a more comprehensive ACLE and a real v8 ARM ARM now. llvm-svn: 254720	2015-12-04 16:10:48 +00:00
Colin LeMahieu	4c606e66a7	[Hexagon] Using multiply instead of shift on signed number which can be UB llvm-svn: 254719	2015-12-04 15:48:45 +00:00
Jonas Paulsson	7fa69cd5dd	[SystemZ] Bugfix: Don't add CC twice to new three-address instruction. Since BuildMI() automatically adds the implicit operands for a new instruction, adding the old instructions CC operand resulted in that there were two CC imp-def operands, where only one was marked as dead. This caused buildSchedGraph() to miss dependencies on the CC reg. Review by Ulrich Weigand llvm-svn: 254714	2015-12-04 12:48:51 +00:00
Alexey Bataev	7cf324772f	LEA code size optimization pass (Part 1): Remove redundant address recalculations, by Andrey Turetsky Add new x86 pass which replaces address calculations in load or store instructions with def register of existing LEA (must be in the same basic block), if the LEA calculates address that differs only by a displacement. Works only with -Os or -Oz. Differential Revision: http://reviews.llvm.org/D13294 llvm-svn: 254712	2015-12-04 10:53:15 +00:00
Oliver Stannard	3760cf3686	[AArch64] Clean up statistical profiling test This check has nothing to do with the statistical profiling extension, so shouldn't be in this test. llvm-svn: 254709	2015-12-04 09:45:18 +00:00
Yury Gribov	6ff0a66b09	[asan] Fix dynamic allocas unpoisoning on PowerPC64. For PowerPC64 we cannot just pass SP extracted from @llvm.stackrestore to _asan_allocas_unpoison due to specific ABI requirements (http://refspecs.linuxfoundation.org/ELF/ppc64/PPC-elf64abi.html#DYNAM-STACK). This patch adds the value returned by @llvm.get.dynamic.area.offset to extracted from @llvm.stackrestore stack pointer, so dynamic allocas unpoisoning stuff would work correctly on PowerPC64. Patch by Max Ostapenko. Differential Revision: http://reviews.llvm.org/D15108 llvm-svn: 254707	2015-12-04 09:19:14 +00:00
Rafael Espindola	71bd70cc30	Revert "[BranchFolding] Merge MMOs during tail merge" This reverts commit r254694. It broke bootstrap. llvm-svn: 254700	2015-12-04 04:15:05 +00:00
Rafael Espindola	7b8a24e5bb	Move a call to getGlobalContext out of lib/LTO. llvm-svn: 254696	2015-12-04 02:42:28 +00:00
Lang Hames	e52502c45e	[Orc] Fix Kaleidoscope example for change in r254693. llvm-svn: 254695	2015-12-04 02:32:32 +00:00
Junmo Park	c0731ca183	[BranchFolding] Merge MMOs during tail merge Summary: If we remove the MMOs from Load/Store instructions, they are treated as volatile. This makes other optimization passes unhappy. eg. Load/Store Optimization So, it looks better to merge, not remove. Reviewers: gberry, mcrosier Subscribers: gberry, llvm-commits Differential Revision: http://reviews.llvm.org/D14797 llvm-svn: 254694	2015-12-04 02:29:25 +00:00
Lang Hames	f0f4b4c882	[Orc] Rename JITCompileCallbackManagerBase to JITCompileCallbackManager. This class is turning into a useful interface, rather than an implementation detail, so I'm dropping the 'Base' suffix. No functional change. llvm-svn: 254693	2015-12-04 02:15:39 +00:00
Justin Bogner	9328957c8f	IR: Use format_hex instead of handrolling the conversion. NFC Cleans up some very old code in AsmWriter's WriteConstantInternal. llvm-svn: 254688	2015-12-04 02:14:34 +00:00
Nathan Slingerland	cb921a1d88	Revert "[llvm-profdata] Add support for weighted merge of profile data" This reverts commit b7250858d96b8ce567681214273ac0e62713c661. Reverting in order to investigate Windows test failure. llvm-svn: 254687	2015-12-04 02:13:58 +00:00
Junmo Park	7cc13f2e58	(no commit message) llvm-svn: 254686	2015-12-04 02:06:59 +00:00
NAKAMURA Takumi	a3561b388c	Move llvm/test/CodeGen/Generic/function-alias.ll to X86. It is incompatible to PECOFF. FIXME: It may be ELF-generic. llvm-svn: 254685	2015-12-04 02:00:12 +00:00
Quentin Colombet	901f036353	[ARM] When a bitcast is about to be turned into a VMOVDRR, try to combine it with its source instead of forcing the values on GPRs. This improves the lowering of vector code when such bitcasts happen in the middle of vector computations. rdar://problem/23691584 llvm-svn: 254684	2015-12-04 01:53:14 +00:00
Matthias Braun	97d0ffbe06	ScheduleDAGInstrs: Rework schedule graph builder. Re-comitting with a change that avoids undefined uses getting put into the VRegUses list. The new algorithm remembers the uses encountered while walking backwards until a matching def is found. Contrary to the previous version this: - Works without LiveIntervals being available - Allows to increase the precision to subregisters/lanemasks (not used for now) The changes in the AMDGPU tests are necessary because the R600 scheduler is not stable with respect to the order of nodes in the ready queues. Differential Revision: http://reviews.llvm.org/D9068 llvm-svn: 254683	2015-12-04 01:51:19 +00:00
Matthias Braun	c07cbc8d3c	raw_ostream: << operator for callables with raw_ostream argument This is a revised version of r254655 which uses a Printable wrapper class to avoid ambiguous overload problems. Differential Revision: http://reviews.llvm.org/D14348 llvm-svn: 254681	2015-12-04 01:31:59 +00:00
JF Bastien	580b6572b5	X86InstrInfo::copyPhysReg: workaround reg liveness Summary: computeRegisterLiveness and analyzePhysReg are currently getting confused about liveness in some cases, breaking copyPhysReg's calculation of whether AX is dead in some cases. Work around this issue temporarily by assuming that AX is always live. See detail in: https://llvm.org/bugs/show_bug.cgi?id=25033#c7 And associated bugs PR24535 PR25033 PR24991 PR24992 PR25201. This workaround makes the code correct but slightly inefficient, but it seems to confuse the machine instr verifier which now things EAX was undefined in some cases where it's being conservatively saved / restored. Reviewers: majnemer, sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15198 llvm-svn: 254680	2015-12-04 01:18:17 +00:00
Justin Bogner	b609b6be74	IR: Update a comment and a bool that've been out of date since 2012 It became impossible to get here with a half in r157393, over 3 years ago. llvm-svn: 254679	2015-12-04 01:14:24 +00:00
Xinliang David Li	01cb9bd7b3	[PGO] Unify VP data format between raw and indexed profile (Reader) With the latest refactoring and code sharing patches landed, it is possible to unify the value profile implementation between raw and indexed profile. This is the patch in raw profile reader that uses the common interface. Differential Revision: http://reviews.llvm.org/D15056 llvm-svn: 254677	2015-12-04 01:02:10 +00:00
Evgeniy Stepanov	7fc3cb5919	Fix function-alias.ll test on non-X86 targets. llvm-svn: 254676	2015-12-04 00:57:25 +00:00
Rafael Espindola	5e128dbcbf	Simplify the error handling in llvm-lto a bit. llvm-svn: 254675	2015-12-04 00:45:57 +00:00
Evgeniy Stepanov	2bb9c5ca22	Emit function alias to data as a function symbol. CFI emits jump slots for indirect functions as a byte array constant, and declares function-typed aliases to these constants. This change fixes AsmPrinter to emit these aliases as function symbols and not data symbols. llvm-svn: 254674	2015-12-04 00:45:43 +00:00
Cong Hou	94620278a4	Don't punish vectorized arithmetic instruction whose type will be split to multiple registers Currently in LLVM's cost model, a vectorized arithmetic instruction will have high cost if its type is split into multiple registers. However, this punishment is too heavy and unnecessary. The overhead of the split should not be on arithmetic instructions but instructions that implement the split. Note that during vectorization we have calculated the register pressure, and we only choose proper interleaving factor (and also vectorization factor) so that we don't use more registers than the maximum number. Here is a very simple example: if a vadd has the cost 1, and if we double VF so that we need two registers to perform it, then its cost will become 4 with the current implementation, which will prevent us to use larger VF. Differential revision: http://reviews.llvm.org/D15159 llvm-svn: 254671	2015-12-04 00:36:58 +00:00
Nathan Slingerland	2a3dbe8be2	[llvm-profdata] Add support for weighted merge of profile data This change adds support for an optional weight when merging profile data with the llvm-profdata tool. Weights are specified by adding an option ':<weight>' suffix to the input file names. Adding support for arbitrary weighting of input profile data allows for relative importance to be placed on the input data from multiple training runs. Both sampled and instrumented profiles are supported. Reviewers: dnovillo, bogner, davidxl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14547 llvm-svn: 254669	2015-12-04 00:00:20 +00:00
Kevin B. Smith	09330577ff	[CodeGen] Minor correction to comment on PhysRegInfo. Differential revision: http://reviews.llvm.org/D15216 llvm-svn: 254668	2015-12-04 00:00:10 +00:00
Rafael Espindola	a06bb16f95	Simplify since this function never fails. llvm-svn: 254667	2015-12-03 23:56:42 +00:00
JF Bastien	1ac69947b6	CodeGen peephole: fold redundant phys reg copies Code generation often exposes redundant physical register copies through virtual registers such as: %vreg = COPY %PHYSREG ... %PHYSREG = COPY %vreg There are cases where no intervening clobber of %PHYSREG occurs, and the later copy could therefore be removed. In some cases this further allows us to remove the initial copy. This patch contains a motivating example which comes from the x86 build of Chrome, specifically cc::ResourceProvider::UnlockForRead uses libstdc++'s implementation of hash_map. That example has two tests live at the same time, and after machine sinking LLVM has confused itself enough and things spilling EFLAGS is a great idea even though it's never restored and the comparison results are both live. Before this patch we have: DEC32m %RIP, 1, %noreg, <ga:@L>, %noreg, %EFLAGS<imp-def> %vreg1<def> = COPY %EFLAGS; GR64:%vreg1 %EFLAGS<def> = COPY %vreg1; GR64:%vreg1 JNE_1 <BB#1>, %EFLAGS<imp-use> Both copies are useless. This patch tries to eliminate the later copy in a generic manner. dec is especially confusing to LLVM when compared with sub. I wrote this patch to treat all physical registers generically, but only remove redundant copies of non-allocatable physical registers because the allocatable ones caused issues (e.g. when calling conventions weren't properly modeled) and should be handled later by the register allocator anyways. The following tests used to failed when the patch also replaced allocatable registers: CodeGen/X86/StackColoring.ll CodeGen/X86/avx512-calling-conv.ll CodeGen/X86/copy-propagation.ll CodeGen/X86/inline-asm-fpstack.ll CodeGen/X86/musttail-varargs.ll CodeGen/X86/pop-stack-cleanup.ll CodeGen/X86/preserve_mostcc64.ll CodeGen/X86/tailcallstack64.ll CodeGen/X86/this-return-64.ll This happens because COPY has other special meaning for e.g. dependency breakage and x87 FP stack. Note that all other backends' tests pass. Reviewers: qcolombet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15157 llvm-svn: 254665	2015-12-03 23:43:56 +00:00
Justin Bogner	8d6f4e36e8	AsmPrinter: Simplify emitting FP elements in sequential data. NFC Use APFloat APIs here Rather than manually type-punning through unions. llvm-svn: 254664	2015-12-03 23:28:35 +00:00
Dan Gohman	391a98afd5	[WebAssembly] Fix dominance check for PHIs in the StoreResult pass When a block has no terminator instructions, getFirstTerminator() returns end(), which can't be used in dominance checks. Check dominance for phi operands separately. Also, remove some bits from WebAssemblyRegStackify.cpp that were causing trouble on the same testcase; they were left behind from an earlier experiment. Differential Revision: http://reviews.llvm.org/D15210 llvm-svn: 254662	2015-12-03 23:07:03 +00:00
Matthias Braun	149b859c55	Revert "raw_ostream: << operator for callables with raw_stream argument" This commit provoked "error C2593: 'operator <<' is ambiguous" on MSVC. This reverts commit r254655. llvm-svn: 254661	2015-12-03 23:00:28 +00:00
Chris Bieneman	ac6677ab8c	[CMake] Fixing bots CMake calls to set_property with APPEND string need to have a leading space. llvm-svn: 254659	2015-12-03 22:55:36 +00:00
Chris Bieneman	ccabd0e396	[CMake] set_target_properties doesn't append link flags This fixes a bug introduced in r254627, and another occurance of the same bug in this file. llvm-svn: 254657	2015-12-03 22:51:08 +00:00
David Majnemer	f6665f65b7	[Analysis] Become aware of MSVC's new/delete functions The compiler can take advantage of the allocation/deallocation function's properties. We knew how to do this for Itanium but had no support for MSVC-style functions. llvm-svn: 254656	2015-12-03 22:45:19 +00:00
Matthias Braun	e957a9bb1b	raw_ostream: << operator for callables with raw_stream argument This allows easier construction of print helpers. Example: Printable PrintLaneMask(unsigned LaneMask) { return Printable([LaneMask](raw_ostream &OS) { OS << format("%08X", LaneMask); }); } // Usage: OS << PrintLaneMask(Mask); Differential Revision: http://reviews.llvm.org/D14348 llvm-svn: 254655	2015-12-03 22:17:26 +00:00
Davide Italiano	bb599e3a4d	[llvm-objdump] Use report_fatal_error() if we can't find a target. llvm-svn: 254654	2015-12-03 22:13:40 +00:00
Chih-Hung Hsieh	ed7d81e5d4	[X86] Part 1 to fix x86-64 fp128 calling convention. Almost all these changes are conditioned and only apply to the new x86-64 f128 type configuration, which will be enabled in a follow up patch. They are required together to make new f128 work. If there is any error, we should fix or revert them as a whole. These changes should have no impact to current configurations. * Relax type legalization checks to accept new f128 type configuration, whose TypeAction is TypeSoftenFloat, not TypeLegal, but also has TLI.isTypeLegal true. * Relax GetSoftenedFloat to return in some cases f128 type SDValue, which is TLI.isTypeLegal but not "softened" to i128 node. * Allow customized FABS, FNEG, FCOPYSIGN on new f128 type configuration, to generate optimized bitwise operators for libm functions. * Enhance related Lower* functions to handle f128 type. * Enhance DAGTypeLegalizer::run, SoftenFloatResult, and related functions to keep new f128 type in register, and convert f128 operators to library calls. * Fix Combiner, Emitter, Legalizer routines that did not handle f128 type. * Add ExpandConstant to handle i128 constants, ExpandNode to handle ISD::Constant node. * Add one more parameter to getCommonSubClass and firstCommonClass, to guarantee that returned common sub class will contain the specified simple value type. This extra parameter is used by EmitCopyFromReg in InstrEmitter.cpp. * Fix infinite loop in getTypeLegalizationCost when f128 is the value type. * Fix printOperand to handle null operand. * Enhance ISD::BITCAST node to handle f128 constant. * Expand new f128 type for BR_CC, SELECT_CC, SELECT, SETCC nodes. * Enhance X86AsmPrinter to emit f128 values in comments. Differential Revision: http://reviews.llvm.org/D15134 llvm-svn: 254653	2015-12-03 22:02:40 +00:00
Colin LeMahieu	15ca65c253	[Hexagon] Adding shuffling resources for HVX instructions and tests for instruction encodings. llvm-svn: 254652	2015-12-03 21:44:28 +00:00
Keno Fischer	eb59d468d9	[RuntimeDyld] DenseMap -> std::unordered_map DenseMap is most applicable when both keys and values are small. In this case, the value violates that assumption, causing quite significant memory overhead. A std::unordered_map is more appropriate in this case (or at least fixed the memory problems I was seeing). Differential Revision: http://reviews.llvm.org/D14910 llvm-svn: 254651	2015-12-03 21:27:59 +00:00
Easwaran Raman	ecb05e5124	Interface to attach maximum function count from PGO to module as module flags. This provides interface to get and set maximum function counts to Module. This would allow things like determination of function hotness. The actual setting of this max function count will have to be done in the frontend. Differential Revision: http://reviews.llvm.org/D15003 llvm-svn: 254647	2015-12-03 20:57:37 +00:00
Reid Kleckner	93fc520339	[X86] Put no-op ADJCALLSTACK markers around all dynamic lowerings Summary: These ADJCALLSTACK markers don't generate code, but they keep dynamic alloca code that calls chkstk out of the prologue. This slightly pessimizes inalloca calls by preventing some register copy coalescing, but I can live with that. Reviewers: qcolombet Subscribers: hans, llvm-commits Differential Revision: http://reviews.llvm.org/D15200 llvm-svn: 254645	2015-12-03 20:46:59 +00:00
Chris Bieneman	4b44a76341	[CMake] Removing an unnecessary layer of variable indirection This prevents passthrough variables from having values. llvm-svn: 254641	2015-12-03 19:47:04 +00:00
Andrew Kaylor	92b3b16ba3	Move branch folding test to a better location. llvm-svn: 254640	2015-12-03 19:41:25 +00:00
Andrew Kaylor	412eabdeb2	Fix buildbot failures llvm-svn: 254636	2015-12-03 19:30:38 +00:00
Rafael Espindola	c0ccdc388c	Simplify test. NFC. llvm-svn: 254631	2015-12-03 19:10:55 +00:00
Easwaran Raman	3676da4b4a	Test commit. Remove blank spaces at the end of comments llvm-svn: 254630	2015-12-03 19:03:20 +00:00
Andrew Kaylor	9efb2332e2	[WinEH] Avoid infinite loop in BranchFolding for multiple single block funclets Differential Revision: http://reviews.llvm.org/D14996 llvm-svn: 254629	2015-12-03 18:55:28 +00:00
Chris Bieneman	bf2c4126ff	[CMake] Add option LLVM_EXTERNALIZE_DEBUGINFO Summary: This adds support for generating dSYM files and stripping debug info from executables and dylibs. It also supports passing -object_path_lto to the linker to generate dSYMs for LTO builds. Reviewers: bogner, friss Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15133 llvm-svn: 254627	2015-12-03 18:45:39 +00:00
David Blaikie	725c4f71d1	dwarfdump: Correctly indentify the indicies for DWP records The indicies are one-based, not zero-based, per the spec. llvm-svn: 254626	2015-12-03 18:41:59 +00:00
Teresa Johnson	1e20a652ee	[ThinLTO] Appending linkage fixes Summary: Fix import from module with appending var, which cannot be imported. The first fix is to remove an overly-aggressive error check. The second fix is to deal with restructuring introduced to the module linker yesterday in r254418 (actually, this fix was included already in r254559, just added some additional cleanup). Test by Mehdi Amini. Reviewers: joker.eph, rafael Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D15156 llvm-svn: 254624	2015-12-03 18:20:05 +00:00
Krzysztof Parzyszek	7709aa0e07	[Hexagon] Remove variable unused in NDEBUG build llvm-svn: 254623	2015-12-03 17:53:34 +00:00
Matthias Braun	0d4505c067	AArch64FastISel: Use cbz/cbnz to branch on i1 In the case of a conditional branch without a preceding cmp we used to emit a "and; cmp; b.eq/b.ne" sequence, use tbz/tbnz instead. Differential Revision: http://reviews.llvm.org/D15122 llvm-svn: 254621	2015-12-03 17:19:58 +00:00
Krzysztof Parzyszek	0881914723	Friendly takeover of the Hexagon backend llvm-svn: 254620	2015-12-03 17:07:12 +00:00
Krzysztof Parzyszek	c168c0165c	[Hexagon] Implement CONCAT_VECTORS for HVX using V6_vcombine llvm-svn: 254617	2015-12-03 16:47:20 +00:00
Colin LeMahieu	7c572b2125	[Hexagon] NFC Using canonicalizePacket to compound/duplex/pad packets rather than doing it separately. This also ensures the integrated assembler path matches the assembly parser path. llvm-svn: 254616	2015-12-03 16:37:21 +00:00
Rafael Espindola	562908bbd0	Simplify ValueMap handling. We now just return values and let ValueMap handle the map. llvm-svn: 254615	2015-12-03 16:36:16 +00:00
Krzysztof Parzyszek	25ddd2c9e8	[Hexagon] Fix instruction descriptor flags for memory access size llvm-svn: 254613	2015-12-03 15:41:33 +00:00
Rafael Espindola	792b7958ff	Don't pass member variables to member functions. NFC. llvm-svn: 254610	2015-12-03 14:48:20 +00:00
Rafael Espindola	56f9368d1d	Delete dead code. llvm-svn: 254609	2015-12-03 14:35:15 +00:00
Marina Yatsina	4b1aea0802	[X86] MS inline asm: produce error when encountering "<type> ptr <reg name>" Currently "<type> ptr <reg name>" treated as <reg name> in MS inline asm, ignoring the "<type> ptr" completely and possibly ignoring the intention of the user. Fixed llvm to produce an error when encountering "<type> ptr <reg name>" operands. For example: andpd xmm1,xmmword ptr xmm1 --> andpd xmm1, xmm1 though andpd has 2 possible matching formats - andpd xmm, xmm/m128 Patch by: ziv.izhar@intel.com Differential Revision: http://reviews.llvm.org/D14607 llvm-svn: 254607	2015-12-03 12:17:03 +00:00
Zlatko Buljan	0f1223053c	[mips][DSP] Add DSPr1 and DSPr2 tests for the standard encodings Differential Revision: http://reviews.llvm.org/D15141 llvm-svn: 254598	2015-12-03 09:56:39 +00:00
Marina Yatsina	90d9ffa7d6	[X86] Add support for fcomip, fucomip for Intel syntax According to x86 spec, fcomip and fucomip should be supported for Intel syntax. Differential Revision: http://reviews.llvm.org/D15104 llvm-svn: 254595	2015-12-03 08:55:33 +00:00
Andy Gibbs	81b1a27e53	Fix class SCEVPredicate has virtual functions and accessible non-virtual destructor. It is not enough to simply make the destructor virtual since there is a g++ 4.7 issue (see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53613) that throws the error "looser throw specifier for ... overridding ~SCEVPredicate() noexcept". llvm-svn: 254592	2015-12-03 08:20:20 +00:00
Craig Topper	1282df50c4	[TableGen] Remove an assumption about the order of encodings in the MVT::SimpleValueType enum. Instead of assuming the types are sorted by size, scan the typeset arrays to find the smallest/largest type. NFC llvm-svn: 254589	2015-12-03 05:57:37 +00:00
Tom Stellard	9760f03757	AMDGPU/SI: Emit constant arrays in the .hsrodata_readonly_agent section Summary: This is done only when targeting HSA. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D13807 llvm-svn: 254587	2015-12-03 03:34:32 +00:00
Matthias Braun	2fd672a221	Revert "ScheduleDAGInstrs: Rework schedule graph builder." This works mostly fine but breaks some stage 1 builders when compiling compiler-rt on i386. Revert for further investigation as I can't see an obvious cause/fix. This reverts commit r254577. llvm-svn: 254586	2015-12-03 03:01:10 +00:00
Mehdi Amini	311fef6ea5	clang-format FunctionImport after refactoring (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254585	2015-12-03 02:58:14 +00:00
Mehdi Amini	7d11004c03	Rename Set variable to be plural Thanks Sean Silva for catching this. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254584	2015-12-03 02:40:39 +00:00
Mehdi Amini	c8c551701e	Refactor FunctionImporter::importFunctions with a helper function to process the Worklist (NFC) This precludes some more functional changes to perform bulk imports. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254583	2015-12-03 02:37:33 +00:00
Mehdi Amini	7471cf81b0	Adapt comment and rename variable in ModuleLinker to describe more accurately the actual use. Thanks Sean Silva for the suggestion. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254582	2015-12-03 02:37:30 +00:00
Mehdi Amini	9abe1089c7	Remove "ExportingModule" from ThinLTO Index (NFC) There is no real reason the index has to have the concept of an exporting Module. We should be able to have one single unique instance of the Index, and it should be read-only after creation for the whole ThinLTO processing. The linker plugin should be able to process multiple modules (in parallel or in sequence) with the same index. The only reason the ExportingModule was present seems to be to implement hasExportedFunctions() that is used by the Module linker to decide what to do with the current Module. For now I replaced it with a query to the map of Modules path to see if this module was declared in the Index and consider that if it is the case then it is probably exporting function. On the long term the Linker interface needs to evolve and this call should not be needed anymore. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254581	2015-12-03 02:37:23 +00:00
Joerg Sonnenberger	48eb197434	Add a TODO item that the nop handling before FP conditional branches is not enough for SPARCv7. llvm-svn: 254580	2015-12-03 02:35:24 +00:00
Matthias Braun	d35fe3d984	ScheduleDAGInstrs: Rework schedule graph builder. The new algorithm remembers the uses encountered while walking backwards until a matching def is found. Contrary to the previous version this: - Works without LiveIntervals being available - Allows to increase the precision to subregisters/lanemasks (not used for now) The changes in the AMDGPU tests are necessary because the R600 scheduler is not stable with respect to the order of nodes in the ready queues. Differential Revision: http://reviews.llvm.org/D9068 llvm-svn: 254577	2015-12-03 02:05:27 +00:00
Matthias Braun	b0083608b4	RegisterPressure: Use range based for, fix else style; NFC llvm-svn: 254575	2015-12-03 01:44:45 +00:00
Xinliang David Li	0f87463676	[PGO] Add v2 format compatibility test llvm-svn: 254572	2015-12-03 01:05:31 +00:00
Justin Bogner	72e81895da	MC: Make sure to clear all of MCMachOStreamer's state The CreatedADWARFSection flag was added in r232842, but isn't cleared properly when resetting the streamer's state. Fix that. llvm-svn: 254571	2015-12-03 00:52:20 +00:00
Derek Schuff	5268aaf7b6	[WebAssembly] Add a test for wasm-store-results pass Differential Revision: http://reviews.llvm.org/D15167 llvm-svn: 254570	2015-12-03 00:50:30 +00:00
Dan Gohman	ac132e9305	[WebAssembly] Assert that byval and nest are not used for return types. llvm-svn: 254567	2015-12-02 23:40:03 +00:00
David Majnemer	632e0ce8ce	Rename a header guard to be more appropriate llvm-svn: 254566	2015-12-02 23:28:27 +00:00
David Majnemer	6f4583c511	Forgot to add this file with r254562. llvm-svn: 254565	2015-12-02 23:09:05 +00:00
Krzysztof Parzyszek	8d8b229de9	[Hexagon] Improve lowering of instructions to the MC layer - Add extenders when necessary. - Handle some basic relocations. This should fix the failure in tools/clang/test/CodeGenCXX/crash.cpp llvm-svn: 254564	2015-12-02 23:08:29 +00:00
David Majnemer	70497c696a	Move EH-specific helper functions to a more appropriate place No functionality change is intended. llvm-svn: 254562	2015-12-02 23:06:39 +00:00
Alexey Samsonov	44ff204fad	Fixup for r254547: use format_hex() to simplify code. llvm-svn: 254560	2015-12-02 22:59:22 +00:00
Rafael Espindola	4b5ec26373	Switch the linker to having a whitelist of GVs. This replaces DoNotLinkFromSource with ValuesToLink. It also moves the computation of ValuesToLink earlier. It is a bit simpler and an important step in slitting the linker into an ir mover and a linker proper. The test change is because we now avoid creating dead declarations. llvm-svn: 254559	2015-12-02 22:59:04 +00:00
Mike Aizatsky	71552ce64b	Libfuzzer: do not pass null into user function Differential Revision: http://reviews.llvm.org/D15098 llvm-svn: 254558	2015-12-02 22:43:53 +00:00
Reid Kleckner	1f11b4e3a7	Use std::string instead of strdup() and free() in WinCodeViewLineTables llvm-svn: 254557	2015-12-02 22:34:30 +00:00
Rafael Espindola	8c04472edf	Delete what is now duplicated code. Having to import an alias as declaration is not thinlto specific. The test difference are because when we already have a decl and we are not importing it, we just leave the decl alone. llvm-svn: 254556	2015-12-02 22:22:24 +00:00
David Blaikie	b3757c008b	[llvm-dwp] Include only the non-empty columns in the cu_index llvm-svn: 254555	2015-12-02 22:01:56 +00:00
Xinliang David Li	f7861b7a09	[PGO] Allow input value node list to be null This is to handle the case when vp node linked list array is laziliy initialized at runtime llvm-svn: 254551	2015-12-02 21:47:43 +00:00
Cong Hou	1a6b5a9e4f	Fix a typo in LoopVectorize.cpp. NFC. llvm-svn: 254549	2015-12-02 21:33:47 +00:00
Alexey Samsonov	39b7d65d82	[PowerPC] Remove wild call to RegScavenger::initRegState(). This call should in fact be made by RegScavenger::enterBasicBlock() called below. The first call does nothing except for triggering UB, indicated by UBSan (passing nullptr to memset()). llvm-svn: 254548	2015-12-02 21:25:28 +00:00
Alexey Samsonov	bcfabaa05b	[Hexagon] Remove std::hex in favor of format(). std::hex is not used anywhere in LLVM code base except for this place, and it has a known undefined behavior (at least in libstdc++ 4.9.3): https://llvm.org/bugs/show_bug.cgi?id=18156, which fires in UBSan bootstrap of LLVM. llvm-svn: 254547	2015-12-02 21:13:43 +00:00
Kyle Butt	2f713eb438	Tests: PPC: remove unnecessary metadata. NFC Remove unnecessary metadata from a test case. llvm-svn: 254544	2015-12-02 21:08:03 +00:00
Rafael Espindola	0a80da0bec	Also copy private linkage globals when needed. This was an omission when handling COFF style comdats with local keys. Should fix the sanitizer-windows bot. llvm-svn: 254543	2015-12-02 20:57:33 +00:00
Alexey Samsonov	c895e34e0d	Re-enable UBSan tests for SystemZ: PR20980 was fixed. llvm-svn: 254542	2015-12-02 20:46:51 +00:00
Rafael Espindola	769efe621a	Don't copy information from aliasee to alias. They are independent. llvm-svn: 254541	2015-12-02 20:03:17 +00:00
Tom Stellard	00f2f91af4	AMDGPU/SI: Correctly emit agent global segment variables when targeting HSA Differential Revision: http://reviews.llvm.org/D14508 llvm-svn: 254540	2015-12-02 19:47:57 +00:00
Krzysztof Parzyszek	de25ecfa62	[Hexagon] Remove TFRI_V4 instruction, use existing A2_tfrsi instead llvm-svn: 254539	2015-12-02 19:44:35 +00:00
Rafael Espindola	f3518c955b	Fix linking when we copy over only a decl. We were failing to copy the fact that the GV is weak and in the case of an alias, producing invalid IR. llvm-svn: 254538	2015-12-02 19:30:52 +00:00
Kyle Butt	cf6a8bfe51	[CodeGen]: Fix bad interaction with AntiDep breaking and inline asm. AggressiveAntiDepBreaker was renaming registers specified by the user for inline assembly. While this will work for compiler-specified registers, it won't work for user-specified registers, and at the time this runs, I don't currently see a way to distinguish them. llvm-svn: 254532	2015-12-02 18:58:51 +00:00
Kyle Butt	015f4fc854	Test Commit: iteratee Remove whitespace from blank lines. NFC llvm-svn: 254531	2015-12-02 18:53:33 +00:00
Fiona Glaser	1075f6323f	Fix accidental off by one change Didn't break any tests, but did unnecessary extra work. llvm-svn: 254529	2015-12-02 18:46:23 +00:00
Tom Stellard	e928533dae	AMDGPU: Fix msan test failure llvm-svn: 254527	2015-12-02 18:35:23 +00:00
Fiona Glaser	e25b06fa23	Scheduler / Regalloc: use unique_ptr[] instead of std::vector vector.resize() is significantly slower than memset in many STLs and the cost of initializing these vectors is significant on targets with many registers. Since we don't need the overhead of a vector, use a simple unique_ptr instead. llvm-svn: 254526	2015-12-02 18:32:59 +00:00
Nathan Slingerland	aa5702d92b	[llvm-profdata] Change instr prof counter overflow to saturate rather than discard Summary: This changes overflow handling during instrumentation profile merge. Rathar than throwing away records that would result in counter overflow, merged counts are instead clamped to the maximum representable value. A warning about counter overflow is still surfaced to the user as before. Reviewers: dnovillo, davidxl, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14893 llvm-svn: 254525	2015-12-02 18:19:24 +00:00
Tim Northover	f520eff782	AArch64: use ldxp/stxp pair to implement 128-bit atomic loads. The ARM ARM is clear that 128-bit loads are only guaranteed to have been atomic if there has been a corresponding successful stxp. It's less clear for AArch32, so I'm leaving that alone for now. llvm-svn: 254524	2015-12-02 18:12:57 +00:00
Dan Gohman	53d1399792	[WebAssembly] Fix comments to say "LIFO" instead of "FIFO" when describing a stack. llvm-svn: 254523	2015-12-02 18:08:49 +00:00
Tom Stellard	e3b5aeaf83	AMDGPU/SI: Don't emit group segment global variables Summary: Only global or readonly segment variables should appear in object files. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15111 llvm-svn: 254519	2015-12-02 17:00:42 +00:00
David Majnemer	942003acc6	Do (A == C1 \|\| A == C2) -> (A & ~(C1 ^ C2)) == C1 rather than (A == C1 \|\| A == C2) -> (A \| (C1 ^ C2)) == C2 when C1 ^ C2 is a power of 2. Differential Revision: http://reviews.llvm.org/D14223 Patch by Amaury SECHET! llvm-svn: 254518	2015-12-02 16:15:07 +00:00
Rafael Espindola	9b04181d81	Add an interesting case we already get right. llvm-svn: 254514	2015-12-02 15:02:43 +00:00
Andy Gibbs	5538eed8ed	Rollback r254508 and r254511 to fix buildbots llvm-svn: 254513	2015-12-02 14:36:48 +00:00
Michael Zuckerman	15152a5c41	By intel spec \|9B DD /7\| FSTSW m2byte\| Valid Valid Store FPU status word at m2byteafter checking for pending unmasked floating-point exceptions.\| \|9B DF E0\| FSTSW AX\| Valid Valid Store FPU status word in AX register after checking for pending unmasked floating-point exceptions.\| \|DD /7 \|FNSTSW m2byte\| Valid Valid Store FPU status word at m2bytewithout checking for pending unmasked floating-point exceptions.\| \|DF E0 \|FNSTSW AX\| Valid Valid Store FPU status word in AX register without checking for pending unmasked floating-point exceptions\| m2byte is word register, and therefor instruction operand need to be change from f32mem to i16mem. Differential Revision: http://reviews.llvm.org/D14953 llvm-svn: 254512	2015-12-02 14:34:34 +00:00
Andy Gibbs	57a23151ca	Fix buildbots broken by r254508 g++ 4.7 does not allow an inline defaulted virtual destructor to be overridden, giving the error "looser throw specifier for ... overridding ~SCEVPredicate() noexcept (true)" (see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53613). The work-around given in the bug report above has been utilised here. llvm-svn: 254511	2015-12-02 14:22:18 +00:00
Andy Gibbs	f47be098c6	Fix class SCEVPredicate has virtual functions and accessible non-virtual destructor llvm-svn: 254508	2015-12-02 13:41:24 +00:00
Christof Douma	8b5dc2c94e	[AArch64]: Add support for Cortex-A35 Adds support for the new Cortex-A35 ARMv8-A core. llvm-svn: 254503	2015-12-02 11:53:44 +00:00
Nemanja Ivanovic	74e31bc929	Patch to fix a crash in the PowerPC back end due to ISD::ROTL and ISD::ROTR not being expanded. Test case included. llvm-svn: 254501	2015-12-02 10:36:24 +00:00
Hrvoje Varga	672b0f5582	[mips][microMIPS] Implement PREPEND, RADDU.W.QB, RDDSP, REPL.PH, REPL.QB, REPLV.PH, REPLV.QB and MTHLIP instructions Differential Revision: http://reviews.llvm.org/D14527 llvm-svn: 254496	2015-12-02 09:31:24 +00:00
Simon Pilgrim	3fc3454a0c	[X86][FMA] Optimize FNEG(FMUL) Patterns On FMA targets, we can avoid having to load a constant to negate a float/double multiply by instead using a FNMSUB (-(X*Y)-0) Fix for PR24366 Differential Revision: http://reviews.llvm.org/D14909 llvm-svn: 254495	2015-12-02 09:07:55 +00:00
Elena Demikhovsky	a1a40cce9f	AVX-512: Updated cost of FP/SINT/UINT conversion operations I checked and updated the cost of AVX-512 conversion operations. Added cost of conversion operations in DQ mode. Conversion of illegal types that requires vector split is not calculated right now (like for other X86 targets). Differential Revision: http://reviews.llvm.org/D15074 llvm-svn: 254494	2015-12-02 08:59:47 +00:00
Asaf Badouh	2489f350c0	[X86][AVX512] add comi with Sae add builtin_ia32_vcomisd and builtin_ia32_vcomisd Differential Revision: http://reviews.llvm.org/D14331 llvm-svn: 254493	2015-12-02 08:17:51 +00:00
David Blaikie	20f52662d4	[llvm-dwp] Don't rely on implicit move assignment operator (MSVC won't synthesize one) llvm-svn: 254492	2015-12-02 07:09:26 +00:00
Akira Hatanaka	237916b537	[AttributeSet] Overload AttributeSet::addAttribute to reduce compile time. The new overloaded function is used when an attribute is added to a large number of slots of an AttributeSet (for example, to function parameters). This is much faster than calling AttributeSet::addAttribute once per slot, because AttributeSet::getImpl (which calls FoldingSet::FIndNodeOrInsertPos) is called only once per function instead of once per slot. With this commit, clang compiles a file which used to take over 22 minutes in just 13 seconds. rdar://problem/23581000 Differential Revision: http://reviews.llvm.org/D15085 llvm-svn: 254491	2015-12-02 06:58:49 +00:00
Craig Topper	f419a1f69a	[X86] Change getZeroVector to take an MVT instead of EVT. One minor change needed to only try to perform 256-it shuffle combines on legal vector types. llvm-svn: 254490	2015-12-02 06:39:19 +00:00
David Blaikie	b073cb9be2	[llvm-dwp] Emit a rather fictional debug_cu_index This is very rudimentary support for debug_cu_index, but it is enough to allow llvm-dwarfdump to find the offsets for contributions and correctly dump debug_info. It will need to actually find the real signature of the unit and build the real hash table with the right number of buckets, as per the DWP specification. It will also need to be expanded to cover the tu_index as well. llvm-svn: 254489	2015-12-02 06:21:34 +00:00
David Blaikie	12e7b99ed0	DebugInfo\DWARF: Privatize some accidentally public members llvm-svn: 254488	2015-12-02 06:21:28 +00:00
Craig Topper	6164297f46	[X86] Fix weird identation. NFC llvm-svn: 254487	2015-12-02 05:24:38 +00:00
Mehdi Amini	ffe2e4aae0	Change ModuleLinker to take a set of GlobalValues to import instead of a single one For efficiency reason, when importing multiple functions for the same Module, we can avoid reparsing it every time. Differential Revision: http://reviews.llvm.org/D15102 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254486	2015-12-02 04:34:28 +00:00
Kostya Serebryany	fba04273b7	[libFuzzer] add a test that is built with -fsanitize-coverage=trace-bb llvm-svn: 254484	2015-12-02 02:49:37 +00:00
Kostya Serebryany	a3c5347764	[sanitizer coverage] when adding a bb trace instrumentation, do it instead, not in addition to, regular coverage. Do the regular coverage in the run-time instead llvm-svn: 254482	2015-12-02 02:37:13 +00:00
Quentin Colombet	bbdebefff6	[X86] Fix a think-o when checking if the eflags needs to be preserved. llvm-svn: 254480	2015-12-02 02:07:00 +00:00
Mehdi Amini	a11bdc8ef7	Modify FunctionImport to take a callback to load modules When linking static archive, there is no individual module files to load. Instead they can be mmap'ed and could be initialized from a buffer directly. The callback provide flexibility to override the scheme for loading module from the summary. Differential Revision: http://reviews.llvm.org/D15101 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254479	2015-12-02 02:00:29 +00:00
Quentin Colombet	f1e91c8bf1	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This is a superset of the fix done in r254448. This fixes PR25607. llvm-svn: 254478	2015-12-02 01:22:54 +00:00
Tim Northover	f3be9d5c0b	AArch64: fix 128-bit shifts We mustn't introduce a shift of exactly 64-bits for any inputs, since that's an UNDEF value (and worse, it's not what you want with the natural Arch64 implementation). The generated code is pretty horrific, but I couldn't come up with an obviously better alternative (if the amount is constant EXTR could help). Turns out 128-bit shifts are just nasty. rdar://22491037 llvm-svn: 254475	2015-12-02 00:33:54 +00:00
Rafael Espindola	af714765e6	Use default member initializers. llvm-svn: 254473	2015-12-01 23:06:26 +00:00
Xinliang David Li	d8828fcff0	Define member operator delete For the struct with trailing objects, define a member operator delete. Without this, the program will fail when -fsized-deallocation option is used where the wrong size will be passed to the global delete operator. llvm-svn: 254471	2015-12-01 23:05:27 +00:00
Matt Arsenault	592d068198	AMDGPU: Error on addrspacecasts that aren't actually implemented llvm-svn: 254469	2015-12-01 23:04:05 +00:00
Matt Arsenault	f9bfeafd00	AMDGPU: Implement isNoopAddrSpaceCast llvm-svn: 254468	2015-12-01 23:04:00 +00:00
Rafael Espindola	6d2c313b46	Remove unnecessary getter. llvm-svn: 254466	2015-12-01 23:01:51 +00:00
Rafael Espindola	e39cd5b144	Pass down the dst GV to linkGlobalValueBody. NFC. llvm-svn: 254465	2015-12-01 22:40:40 +00:00
Cong Hou	cb07d7016a	Fix a bug in IfConversion.cpp. The bug is introduced in r254377 which failed some tests on ARM, where a new probability is assigned to a successor but the provided BB may not be a successor. llvm-svn: 254463	2015-12-01 21:50:20 +00:00
Matthias Braun	b258d794dd	ARM: Change ArchCheck field to uint64_t The values in this field are compared against getAvailableFeatures() which returns an uint64_t. This was causing problems in an internal branch. llvm-svn: 254462	2015-12-01 21:48:52 +00:00
Matt Arsenault	3b15967008	AMDGPU: Disallow flat_scr in SI assembler llvm-svn: 254459	2015-12-01 20:31:08 +00:00
Xinliang David Li	a28306db0c	[PGO] Add support for reading multiple versions of indexed profile format profile data Profile readers using incompatible on-disk hash table format can now share the same implementation and interfaces. Differential Revision: http://reviews.llvm.org/D15100 llvm-svn: 254458	2015-12-01 20:26:26 +00:00
Rafael Espindola	edf811d68f	Delete unused includes. llvm-svn: 254457	2015-12-01 20:23:19 +00:00
Justin Bogner	909e1c0135	IR: Clean up some duplicated code in ConstantDataSequential creation. NFC ConstantDataArray::getImpl and ConstantDataVector::getImpl had a lot of copy pasta in how they handled sequences of constants. Break that out into a couple of simple functions. llvm-svn: 254456	2015-12-01 20:20:49 +00:00
Rafael Espindola	e3a933af31	clang-format LinkModules.cpp. Most of the file has been changed recently and was already clang-format clean. llvm-svn: 254454	2015-12-01 20:11:43 +00:00
Sanjay Patel	0b2a94916d	use range-based for loops; NFCI llvm-svn: 254453	2015-12-01 19:57:43 +00:00
Matt Arsenault	856d1928a8	AMDGPU: Optimize VOP2 operand legalization Don't use commuteInstruction, and don't commute if doing so will not improve legality. Skip the more complex checks for literal operands and constant bus restrictions, which are not a concern for VOP2 instructions because src1 does not accept SGPRs or constants and few implicitly read vcc. This gets called quite a few times and the attempts at commuting are a significant fraction of the time spent in SIFixSGPRCopies, so it's somewhat worthwhile to optimize. With this patch and others leading up to it, this reduces the compile time of SIFixSGPRCopies on some of the LuxMark 2 kernels from ~8ms to ~5ms on my system. llvm-svn: 254452	2015-12-01 19:57:17 +00:00
Rafael Espindola	0e309fe860	Use references now that it is natural to do so. The linker never takes ownership of a module or changes which module it is refering to, making it natural to use references. llvm-svn: 254449	2015-12-01 19:50:54 +00:00
Quentin Colombet	9cb01aa30a	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This fixes PR25629. llvm-svn: 254448	2015-12-01 19:49:31 +00:00
Xinliang David Li	0e6a36e17e	Use nullptr (NFC) llvm-svn: 254447	2015-12-01 19:47:32 +00:00
Sanjay Patel	b53791e5a7	don't repeat function/variable names in comments; NFC llvm-svn: 254445	2015-12-01 19:32:35 +00:00
Artyom Skrobov	5d1f2524a0	Fix Thumb1 epilogue generation Summary: This had been broken for a very long time, but nobody noticed until D14357 enabled shrink-wrapping by default. Reviewers: jroelofs, qcolombet Subscribers: tyomitch, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14986 llvm-svn: 254444	2015-12-01 19:25:11 +00:00
Sanjay Patel	96824deebc	fix typo; NFC llvm-svn: 254442	2015-12-01 19:19:18 +00:00
David Blaikie	bb94e440d5	[llvm-dwp] Deduplicate strings in the debug_str.dwo section Also, ensure that references to those strings in debug_str_offsets.dwo correctly refer to the deduplicated strings. llvm-svn: 254441	2015-12-01 19:17:58 +00:00
Weiming Zhao	56ab51870c	[AArch64] Fix a corner case in BitFeild select Summary: When not useful bits, BitWidth becomes 0 and APInt will not be happy. See https://llvm.org/bugs/show_bug.cgi?id=25571 We can just mark the operand as IMPLICIT_DEF is none bits of it is used. Reviewers: t.p.northover, jmolloy Subscribers: gberry, jmolloy, mgrang, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14803 llvm-svn: 254440	2015-12-01 19:17:49 +00:00
Matt Arsenault	e830f5427b	AMDGPU: Report extractelement as free in cost model The cost for scalarized operations is computed as N * (scalar operation cost + 1 extractelement + 1 insertelement). This partially fixes inflating the cost of scalarized operations since every operation is scalarized and free. I don't think we want any cost asociated with scalarization, but for now insertelement is still counted. I'm not sure if we should pretend that insertelement is also free, or add a way to compute a custom scalarization cost. llvm-svn: 254438	2015-12-01 19:08:39 +00:00
Keno Fischer	a6c4ce43df	[Verifier] Improve error for cross-module refs By including the module name in the error message. This makes the error message much more useful and saves a trip to the debugger. Reviewers: dexonsmith Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14473 llvm-svn: 254437	2015-12-01 19:06:36 +00:00
Rafael Espindola	3b80b8854c	Delete dead code. llvm-svn: 254436	2015-12-01 18:50:35 +00:00
Rafael Espindola	4dbdceb6fc	Use a forwarding constructor instead of an init method. llvm-svn: 254435	2015-12-01 18:46:19 +00:00
Rafael Espindola	4808c6d064	Delete the setModule method from the Linker. It was only used from LTO for a debug feature, and LTO can just create another linker. It is pretty odd to have a method to reset the module in the middle of a link. It would make IdentifiedStructTypes inconsistent with the Module for example. llvm-svn: 254434	2015-12-01 18:41:30 +00:00
David Blaikie	98ad82a6a1	[llvm-dwp] Correctly update debug_str_offsets.dwo when linking dwo files This doesn't deduplicate strings in the debug_str section, nor does it properly wire up the index so that debug_info can /find/ these strings, but it does correct the str_offsets specifically. Follow up patches to address those related/next issues. llvm-svn: 254431	2015-12-01 18:07:07 +00:00
Tom Stellard	38b7cbe3e0	AMDGPU/SI: Remove REGISTER_STORE/REGISTER_LOAD code which is now dead Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15050 llvm-svn: 254427	2015-12-01 17:45:22 +00:00
Tom Stellard	ff63c25753	AMDGPU: Use the default strings for data emission directives Summary: This makes the assembly output look nicer and there is no reason to have custom strings for these. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14671 llvm-svn: 254426	2015-12-01 17:45:17 +00:00
Sanjay Patel	60216f6943	[x86] add a convenience method to check for FMA capability; NFCI llvm-svn: 254425	2015-12-01 17:27:55 +00:00
Rafael Espindola	6e8ab928d5	Make appending var linking less of a special case. It has to be a bit special because: * materializeInitFor is not really supposed to call replaceAllUsesWith. The caller has a plain variable with Dst and expects just the initializer to be set, not for it to be removed. * Calling mutateType as we used to do before gets some type inconsistency which breaks the bitcode writer. * If linkAppendingVarProto create a dest decl with the correct type to avoid the above problems, it needs to put the original dst init in some side table for materializeInitFor to use. In the end the simplest solution seems to be to just have linkAppendingVarProto do all the work and set ValueMap[SrcGV to avoid recursion. llvm-svn: 254424	2015-12-01 17:17:04 +00:00
Teresa Johnson	430110cc0b	[ThinLTO] Wrap dbgs() output in DEBUG macro Missed in a couple places. llvm-svn: 254422	2015-12-01 17:12:10 +00:00
Teresa Johnson	d582f5b3f8	[ThinLTO] Remove stale comment (NFC) Stale as of r254036 which added basic profitability check. llvm-svn: 254421	2015-12-01 16:45:23 +00:00
Rafael Espindola	b318fcbd8b	Simplify test. NFC. llvm-svn: 254419	2015-12-01 15:46:46 +00:00
Rafael Espindola	baa3bf8f76	Bring r254336 back: The difference is that now we don't error on out-of-comdat access to internal global values. We copy them instead. This seems to match the expectation of COFF linkers (see pr25686). Original message: Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct call to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via WapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254418	2015-12-01 15:19:48 +00:00
Chad Rosier	869962f962	[LIR] Push check into helper function. NFC. llvm-svn: 254416	2015-12-01 14:26:35 +00:00
Yury Gribov	81f3f15b0d	Fix "WARNING: Title underline too short." introduced by r254404. Patch by Max Ostapenko. llvm-svn: 254413	2015-12-01 13:24:48 +00:00
Elena Demikhovsky	0d0692d854	AVX-512: fixed asm string of vsqrtss (vvsqrtss was generated before) llvm-svn: 254411	2015-12-01 12:43:46 +00:00
Elena Demikhovsky	aa1f17ea95	AVX-512: regenerated test for avx512 arithmetics, NFC llvm-svn: 254410	2015-12-01 12:35:03 +00:00
Elena Demikhovsky	47fa271a9b	Fixed a failure in getSpaltValue() llvm-svn: 254409	2015-12-01 12:30:40 +00:00
Elena Demikhovsky	0781d7b2b4	Fixed a failure in cost calculation for vector GEP Cost calculation for vector GEP failed with due to invalid cast to GEP index operand. The bug is fixed, added a test. http://reviews.llvm.org/D14976 llvm-svn: 254408	2015-12-01 12:08:36 +00:00
Hrvoje Varga	e51b0e13f3	[mips][microMIPS] Implement RECIP.fmt, RINT.fmt, ROUND.L.fmt, ROUND.W.fmt, SEL.fmt, SELEQZ.fmt, SELNEQZ.fmt and CLASS.fmt Differential Revision: http://reviews.llvm.org/D13885 llvm-svn: 254405	2015-12-01 11:59:21 +00:00
Yury Gribov	d7dbb66eb8	Introduce new @llvm.get.dynamic.area.offset.i{32, 64} intrinsics. The @llvm.get.dynamic.area.offset.* intrinsic family is used to get the offset from native stack pointer to the address of the most recent dynamic alloca on the caller's stack. These intrinsics are intendend for use in combination with @llvm.stacksave and @llvm.restore to get a pointer to the most recent dynamic alloca. This is useful, for example, for AddressSanitizer's stack unpoisoning routines. Patch by Max Ostapenko. Differential Revision: http://reviews.llvm.org/D14983 llvm-svn: 254404	2015-12-01 11:40:55 +00:00
Cong Hou	4aef7ef881	Allow known and unknown probabilities coexist in MBB's successor list. Previously it is not allowed for each MBB to have successors with both known and unknown probabilities. However, this may be too strict as at this stage we could not always guarantee that. It is better to remove this restriction now, and I will work on validating MBB's successors' probabilities first (for example, check if the sum is approximate one). llvm-svn: 254402	2015-12-01 11:05:39 +00:00
Oliver Stannard	a34e47066e	[AArch64] Add ARMv8.2-A Statistical Profiling Extension The Statistical Profiling Extension is an optional extension to ARMv8.2-A. Since it is an optional extension, I have added the FeatureSPE subtarget feature to control it. The assembler-visible parts of this extension are the new "psb csync" instruction, which is equivalent to "hint #17", and a number of system registers. Differential Revision: http://reviews.llvm.org/D15021 llvm-svn: 254401	2015-12-01 10:48:51 +00:00
Oliver Stannard	4667071574	[ARM] Add ARMv8.2-A to TargetParser Add ARMv8.2-A to TargetParser, so that it can be used by the clang command-line options and the .arch directive. Most testing of this will be done in clang, checking that the command-line options that this enables work. Differential Revision: http://reviews.llvm.org/D15037 llvm-svn: 254400	2015-12-01 10:33:56 +00:00
Oliver Stannard	8addbf4350	[ARM] Add subtarget features for ARMv8.2-A This adds subtarget features for ARMv8.2-A, which builds on (and requires the features from) ARMv8.1-A. Most assembler-visible features of ARMv8.2-A are system instructions, and are all required parts of the architecture, so just depend on the HasV8_2aOps subtarget feature. There is also one large, optional feature, which adds 16-bit floating point versions of all existing floating-point instructions (VFP and SIMD), this is represented by the FeatureFullFP16 subtarget feature. Differential Revision: http://reviews.llvm.org/D15036 llvm-svn: 254399	2015-12-01 10:23:06 +00:00
NAKAMURA Takumi	54d90f46c5	llvm/test/DebugInfo/X86/safestack-byval.ll: Give an explicit triple for now. It crashes for targeting *-win32. Also revert r254375 and r254361. llvm-svn: 254397	2015-12-01 10:07:41 +00:00
NAKAMURA Takumi	8bd0f0b141	Move llvm/test/DebugInfo/Generic/safestack-byval.ll to X86. It depends on x86-64. llvm-svn: 254396	2015-12-01 10:07:37 +00:00
Sanjoy Das	347d272c5c	Introduce a range version of std::find, and use in SCEV Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15064 llvm-svn: 254391	2015-12-01 07:49:27 +00:00
Sanjoy Das	ff3b8b4c33	Introduce a range version of std::any_of, and use it in SCEV Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15063 llvm-svn: 254390	2015-12-01 07:49:23 +00:00
Craig Topper	c458c7c6c9	[X86] Fix patterns for memory forms of FP FSUBR and FDIVR. They need to have memory on the left hand side of the fsub/fdiv operations in their patterns. Not sure how to test this. I noticed by inspection in the isel tables where the same pattern tried to produce DIV and DIVR or SUB and SUBR. llvm-svn: 254388	2015-12-01 06:13:16 +00:00
Craig Topper	271f9ded44	[X86] Use range-based for loops. NFC llvm-svn: 254387	2015-12-01 06:13:15 +00:00
Craig Topper	ba894c3c0d	[X86] Use array_lengthof instead of calculating manually. Also change index types to size_t to match. llvm-svn: 254386	2015-12-01 06:13:13 +00:00
Craig Topper	ddc76f2bed	[Hexagon] Use std::begin() and std::end() instead of doing the same manually. NFC llvm-svn: 254385	2015-12-01 06:13:10 +00:00
Craig Topper	d824f5f0d9	[Hexagon] Use array_lengthof and const correct and type correct the array and array size. NFC llvm-svn: 254384	2015-12-01 06:13:08 +00:00
Craig Topper	6261e1b94d	Use array_lengthof instead of manually calculating it. NFC llvm-svn: 254383	2015-12-01 06:13:06 +00:00
Craig Topper	3da000c07f	[Hexagon] Use ArrayRef to avoid needing to calculate an array size. Interestingly the original code may have had a bug because it was passing the byte size of a uint16_t array instead of the number of entries. llvm-svn: 254382	2015-12-01 06:13:04 +00:00
Craig Topper	8072081b63	[ARM] Use range-based for loops to avoid the need for calculating an array size that I would have otherwise cconverted to array_lengthof. NFC llvm-svn: 254381	2015-12-01 06:13:01 +00:00
Craig Topper	fac9057ef8	Use array_lengthof instead of manually calculating it. NFC llvm-svn: 254380	2015-12-01 06:12:59 +00:00
Davide Italiano	05402671b8	[Windows] Partially revert r254363 until I can test the right fix. Reported by: David Blaikie llvm-svn: 254378	2015-12-01 05:33:24 +00:00
Cong Hou	d97c100dc4	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. (This is the second attempt to submit this patch. The first caused two assertion failures and was reverted. See https://llvm.org/bugs/show_bug.cgi?id=25687) The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254377	2015-12-01 05:29:22 +00:00
Colin LeMahieu	309fb1877e	[Hexagon] Disabling failing safestack test llvm-svn: 254375	2015-12-01 04:56:25 +00:00
Matthias Braun	50f7f585ed	RegisterPressure: If we do not collect dead defs the list must be empty llvm-svn: 254372	2015-12-01 04:20:06 +00:00
Matthias Braun	ba6b225bf9	RegisterPressure: Remove support for recede()/advance() at MBB boundaries Nobody was checking the returnvalue of recede()/advance() so we can simply replace this code with asserts. llvm-svn: 254371	2015-12-01 04:20:04 +00:00
Matthias Braun	34e706e0a1	RegisterPressure: There is no need to make getCurSlot() public llvm-svn: 254370	2015-12-01 04:20:01 +00:00
Matthias Braun	7699ed7814	RegisterPressure: There is no need to make discoverLive{In\|Out} public llvm-svn: 254369	2015-12-01 04:19:58 +00:00
Matthias Braun	f9f8b92d93	RegisterPressure: Split RegisterOperands analysis code from result object; NFC This is in preparation to expose the RegisterOperands class as RegisterPressure API. llvm-svn: 254368	2015-12-01 04:19:56 +00:00
Hans Wennborg	1dbaf67537	Revert r254348: "Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces." and the follow-up r254356: "Fix a bug in MachineBlockPlacement that may cause assertion failure during BranchProbability construction." Asserts were firing in Chromium builds. See PR25687. llvm-svn: 254366	2015-12-01 03:49:42 +00:00
Davide Italiano	38518e9f53	[Windows] Follow-up r254363, remove return. llvm-svn: 254364	2015-12-01 02:38:42 +00:00
Davide Italiano	b37d6bd7ae	[Windows] Simplify assertion code. NFC. llvm-svn: 254363	2015-12-01 02:35:04 +00:00
Matt Arsenault	456fdfcdc2	Squelch unused variable warning in SIRegisterInfo.cpp. Patch by Justin Lebar llvm-svn: 254362	2015-12-01 02:14:33 +00:00
NAKAMURA Takumi	09eff05c0b	llvm/test/DebugInfo/Generic/safestack-byval.ll is using tls. llvm-svn: 254361	2015-12-01 01:15:03 +00:00
NAKAMURA Takumi	23183f3bba	check-llvm: Introduce the new feature "tls". llvm-svn: 254360	2015-12-01 01:14:58 +00:00
David Blaikie	21ed3b13bd	[llvm-dwp] Add missing Makefile for the old configure+make build llvm-svn: 254358	2015-12-01 01:07:20 +00:00
David Blaikie	32aa0495e8	[llvm-dwp] Add missing dependency from llvm tests on the llvm-dwp tool llvm-svn: 254357	2015-12-01 00:57:05 +00:00
Cong Hou	1ccca9e673	Fix a bug in MachineBlockPlacement that may cause assertion failure during BranchProbability construction. The root cause is the rounding behavior in BranchProbability construction. We may consider to use truncation instead in the future. llvm-svn: 254356	2015-12-01 00:55:42 +00:00
David Blaikie	242b948817	[llvm-dwp] Initial partial prototype This just concatenates the common DWP sections without doing any of the fancy DWP things like: 1) update str_offsets 2) deduplicating strings 3) merging/creating cu/tu_index Patches for these will follow shortly. (also not sure about target triple/object file type for this tool - do I really need a whole triple just to write an object file that contains purely static/hardcoded bytes in each section? & I guess I should just pick it based on the first input, maybe, rather than hardcoding for now - but we only produce .dwo on ELF platforms with objcopy for now anyway) llvm-svn: 254355	2015-12-01 00:48:39 +00:00
David Blaikie	df05525d86	llvm-dwp: Initial layout llvm-svn: 254354	2015-12-01 00:48:34 +00:00
Evgeniy Stepanov	42f3b12274	[safestack] Protect byval function arguments. Detect unsafe byval function arguments and move them to the unsafe stack. llvm-svn: 254353	2015-12-01 00:40:05 +00:00
Evgeniy Stepanov	fd07995363	Extend debug info for function parameters in SDAG. SDAG currently can emit debug location for function parameters when an llvm.dbg.declare points to either a function argument SSA temp, or to an AllocaInst. This change extends this logic by adding a fallback case when neither of the above is true. This is required for SafeStack, which may copy the contents of a byval function argument into something that is not an alloca, and then describe the target as the new location of the said argument. llvm-svn: 254352	2015-12-01 00:34:30 +00:00
Evgeniy Stepanov	a4ac3f4bdf	[safestack] Fix handling of array allocas. The current code does not take alloca array size into account and, as a result, considers any access past the first array element to be unsafe. llvm-svn: 254350	2015-12-01 00:06:13 +00:00
Cong Hou	fa1917c673	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254348	2015-12-01 00:02:51 +00:00
Rafael Espindola	e9841a6bb5	This reverts commit r254336 and r254344. They broke a bot and I am debugging why. llvm-svn: 254347	2015-11-30 23:54:19 +00:00
Rafael Espindola	a891957002	Disable a consistency check. Trying to figure out why it fails on a bot but passes locally. llvm-svn: 254344	2015-11-30 23:05:25 +00:00
Sanjay Patel	8b1fb3daba	[InstCombine] add tests to show potential vector IR shuffle transforms llvm-svn: 254342	2015-11-30 22:39:36 +00:00
Simon Pilgrim	db26b3ddfa	[X86][FMA4] Prefer FMA4 to FMA We currently output FMA instructions on targets which support both FMA4 + FMA (i.e. later Bulldozer CPUS bdver2/bdver3/bdver4). This patch flips this so FMA4 is preferred; this is for several reasons: 1 - FMA4 is non-destructive reducing the need for mov instructions. 2 - Its more straighforward to commute and fold inputs (although the recent work on FMA has reduced this difference). 3 - All supported targets have FMA4 performance equal or better to FMA - Piledriver (bdver2) in particular has half the throughput when executing FMA instructions. Its looks like no future AMD processor lines will support FMA4 after the Bulldozer series so we're not causing problems for later CPUs. Differential Revision: http://reviews.llvm.org/D14997 llvm-svn: 254339	2015-11-30 22:22:06 +00:00
Rafael Espindola	c109200c53	Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct call to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via WapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254336	2015-11-30 22:01:43 +00:00
Paul Robinson	a2550a6da3	Have 'optnone' respect the -fast-isel=false option. This is primarily useful for debugging optnone v. ISel issues. Differential Revision: http://reviews.llvm.org/D14792 llvm-svn: 254335	2015-11-30 21:56:16 +00:00
Cong Hou	eb9c7056f0	[X86] Update test/CodeGen/X86/avg.ll with the help of update_llc_test_checks.py. NFC. llvm-svn: 254334	2015-11-30 21:46:08 +00:00
Matt Arsenault	ada6cf1b22	AMDGPU: Fix unused function llvm-svn: 254333	2015-11-30 21:32:10 +00:00
Matt Arsenault	41003af292	AMDGPU: Error if too many user SGPRs used llvm-svn: 254332	2015-11-30 21:16:07 +00:00
Matt Arsenault	26f8f3db39	AMDGPU: Rework how private buffer passed for HSA If we know we have stack objects, we reserve the registers that the private buffer resource and wave offset are passed and use them directly. If not, reserve the last 5 SGPRs just in case we need to spill. After register allocation, try to pick the next available registers instead of the last SGPRs, and then insert copies from the inputs to the reserved registers in the progloue. This also only selectively enables all of the input registers which are really required instead of always enabling them. llvm-svn: 254331	2015-11-30 21:16:03 +00:00
Matt Arsenault	ac234b604d	AMDGPU: Rename enums to be consistent with HSA code object terminology llvm-svn: 254330	2015-11-30 21:15:57 +00:00
Matt Arsenault	0e3d38937e	AMDGPU: Remove SIPrepareScratchRegs It does not work because of emergency stack slots. This pass was supposed to eliminate dummy registers for the spill instructions, but the register scavenger can introduce more during PrologEpilogInserter, so some would end up left behind if they were needed. The potential for spilling the scratch resource descriptor and offset register makes doing something like this overly complicated. Reserve registers to use for the resource descriptor and use them directly in eliminateFrameIndex. Also removes creating another scratch resource descriptor when directly selecting scratch MUBUF instructions. The choice of which registers are reserved is temporary. For now it attempts to pick the next available registers after the user and system SGPRs. llvm-svn: 254329	2015-11-30 21:15:53 +00:00
Matt Arsenault	ff6da2fe89	AMDGPU: Use assert zext for workgroup sizes llvm-svn: 254328	2015-11-30 21:15:45 +00:00
Quentin Colombet	cdad10f333	[ARM] For old thumb ISA like v4t, we cannot use PC directly in pop. Fix the epilogue emission to account for that. llvm-svn: 254325	2015-11-30 20:37:58 +00:00
Reid Kleckner	8a71273d89	Avoid writing to source directory of tests llvm-svn: 254324	2015-11-30 20:36:23 +00:00
Davide Italiano	9c26161b2e	[SimplifyLibCalls] Remove useless bits of this tests. llvm-svn: 254318	2015-11-30 19:38:35 +00:00
Davide Italiano	1aeed6a955	[SimplifyLibCalls] Transform log(exp2(y)) to y*log(2) under fast-math. llvm-svn: 254317	2015-11-30 19:36:35 +00:00
David Majnemer	bf4119faf6	[X86] Add RIP to GR64_TCW64 The MachineVerifier wants to check that the register operands of an instruction belong to the instruction's register class. RIP-relative control flow instructions violated this by referencing RIP. While this was fixed for SysV, it was never fixed for Win64. llvm-svn: 254315	2015-11-30 19:04:19 +00:00
Kit Barton	f4ce2f3a9e	Enable shrink wrapping for PPC64 Re-enable shrink wrapping for PPC64 Little Endian. One minor modification to PPCFrameLowering::findScratchRegister was necessary to handle fall-thru blocks (blocks with no terminator) correctly. Tested with all LLVM test, clang tests, and the self-hosting build, with no problems found. PHabricator: http://reviews.llvm.org/D14778 llvm-svn: 254314	2015-11-30 18:59:41 +00:00
Rafael Espindola	c98b20b0d6	Fix another llvm.ctors merging bug. We were not looking past casts to see if an element should be included or not. llvm-svn: 254313	2015-11-30 18:54:24 +00:00
Dan Gohman	96029f7880	[WebAssembly] Fix a few minor compiler warnings. NFC. llvm-svn: 254311	2015-11-30 18:42:08 +00:00
Sanjay Patel	239be1fb0d	fix formatting; NFC llvm-svn: 254310	2015-11-30 17:52:02 +00:00
Colin LeMahieu	e6241798c9	[Hexagon] NFC Reordering headers. llvm-svn: 254307	2015-11-30 17:32:34 +00:00
Matt Arsenault	ea03cf2fa1	AMDGPU: Don't reserve SCRATCH_PTR input register This hasn't been doing anything since using relocations was added. llvm-svn: 254304	2015-11-30 15:46:47 +00:00
Aaron Ballman	33c95f08b0	Silencing a 32-bit to 64-bit implicit conversion warning; NFC. llvm-svn: 254302	2015-11-30 14:52:33 +00:00
Hrvoje Varga	c03957f049	[mips][microMIPS] Implement LBUX, LHX, LWX, MAQ_S[A].W.PHL, MAQ_S[A].W.PHR, MFHI, MFLO, MTHI and MTLO instructions Differential Revision: http://reviews.llvm.org/D14436 llvm-svn: 254297	2015-11-30 12:58:39 +00:00
Zoran Jovanovic	a887b36167	[mips][microMIPS] Fix issue with offset operand of BALC and BC instructions Value of offset operand for microMIPS BALC and BC instructions is currently shifted 2 bits, but it should be 1 bit. Differential Revision: http://reviews.llvm.org/D14770 llvm-svn: 254296	2015-11-30 12:56:18 +00:00

... 4 5 6 7 8 ...

124883 Commits