llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	66f38db3ba	[Modules] Followup to r206822 to add a DEBUG_TYPE which is used on ARM and PPC, but not x86. llvm-svn: 206830	2014-04-21 23:58:10 +00:00
Chandler Carruth	e96dd8975f	[Modules] Make Support/Debug.h modular. This requires it to not change behavior based on other files defining DEBUG_TYPE, which means it cannot define DEBUG_TYPE at all. This is actually better IMO as it forces folks to define relevant DEBUG_TYPEs for their files. However, it requires all files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't already. I've updated all such files in LLVM and will do the same for other upstream projects. This still leaves one important change in how LLVM uses the DEBUG_TYPE macro going forward: we need to only define the macro after header files have been #include-ed. Previously, this wasn't possible because Debug.h required the macro to be pre-defined. This commit removes that. By defining DEBUG_TYPE after the includes two things are fixed: - Header files that need to provide a DEBUG_TYPE for some inline code can do so by defining the macro before their inline code and undef-ing it afterward so the macro does not escape. - We no longer have rampant ODR violations due to including headers with different DEBUG_TYPE definitions. This may be mostly an academic violation today, but with modules these types of violations are easy to check for and potentially very relevant. Where necessary to suppor headers with DEBUG_TYPE, I have moved the definitions below the includes in this commit. I plan to move the rest of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big enough. The comments in Debug.h, which were hilariously out of date already, have been updated to reflect the recommended practice going forward. llvm-svn: 206822	2014-04-21 22:55:11 +00:00
Yi Jiang	b23edebdd2	Set default value of HasExtractBitsInsn to false llvm-svn: 206803	2014-04-21 22:22:44 +00:00
Jim Grosbach	9446534025	ARM64: Refactor away a few redundant helpers. The comment claimed that the register class information wasn't available in the assembly parser, but that's not really true. It's just annoying to get to. Replace the helper functions with references to the auto-generated information. llvm-svn: 206802	2014-04-21 22:13:57 +00:00
Rafael Espindola	bad3f77703	Simplify a vpermil* with constant mask. With a constant mask a vpermil* is just a shufflevector. This patch implements that simplification. This allows us to produce denser code. It should also allow more folding down the line. llvm-svn: 206801	2014-04-21 22:06:04 +00:00
Jim Grosbach	9515c52294	ARM64: Improve diagnostics for malformed reg+reg addressing mode. Make sure only general purpose registers are valid for offset regs and that 32-bit regs are only valid for sxtw and uxtw extends. llvm-svn: 206799	2014-04-21 21:45:57 +00:00
Jim Grosbach	ac901086e5	Move helper functions earlier in the file. No functional change. llvm-svn: 206798	2014-04-21 21:45:53 +00:00
Jim Grosbach	9d205d42f3	ARM64: Extended addressing mode source reg is 64-bit. The canonical form for the extended addressing mode (e.g., "[x1, w2, uxtw #3]" is for the MCInst to have the second register be the full 64-bit GPR64 register class. The instruction printer cleans up the output for display to show the 32-bit register instead, per the specification. This simplifies 205893 now that the aliasing is handled in the printer in 206495 so that the codegen path and the disassembler path give the same MCInst form. llvm-svn: 206797	2014-04-21 21:45:44 +00:00
David Blaikie	09757491d6	Use unique_ptr to manage ownership of GCOVFunctions, Blocks, and Edges. llvm-svn: 206796	2014-04-21 21:40:16 +00:00
Hal Finkel	bae796f0dc	Remove seemingly-unneeded artificial dependency The rationale for this artificial dependency seems to have been lost to the ravages of time, it is covered by no regression tests, and has no impact on test-suite performance numbers on either x86 or PPC. For the test suite, on both x86 and PPC, I ran the test suite 10 times (both as a baseline and with this change), and found no statistically-significant changes. For PPC, I used a P7 box. For x86, I used an Intel Xeon E5430. Both with -O3 -mcpu=native. This was discussed on-list back in January, but I've not had a chance to run the performance tests until today. llvm-svn: 206795	2014-04-21 21:30:25 +00:00
David Blaikie	4c82a809b3	Simplify destruction of Modules in LLVContextImpl. This avoids copying the container by simply deleting until empty. While I'd rather move to a stricter ownership semantic (unique_ptr), SmallPtrSet can't cope with unique_ptr and the ownership semantics here are a bit incestuous (Module sort of owns itself, but sort of doesn't (if the LLVMContext is destroyed before the Module, then it deregisters itself from the context... )). Ideally Modules would be given to the context, or possibly an emplace-like function to construct them there. Modules then shouldn't be destroyed by LLVM API clients, but by interacting with the owner (LLVMContext) directly (but even then, passing a Module* to LLVMContext doesn't provide an easy way to destroy the Module, since the set would be over unique_ptrs and you'd need a heterogenous lookup function which SmallPtrSet doesn't have either). llvm-svn: 206794	2014-04-21 21:27:19 +00:00
Rafael Espindola	6c76d1d7df	Handle _GLOBAL_OFFSET_TABLE_ in 64 bit mode. With this MC is able to handle _GLOBAL_OFFSET_TABLE_ in 64 bit mode, which is needed for medium and large code models. This fixes pr19470. llvm-svn: 206793	2014-04-21 21:15:45 +00:00
Rafael Espindola	83752535ea	clang-format this function. No functionality change, it will just make the next patch easier to read. llvm-svn: 206792	2014-04-21 21:00:58 +00:00
David Blaikie	bc44220eb8	Use unique_ptr to handle GlobalOpt's Evaluator members llvm-svn: 206790	2014-04-21 20:49:36 +00:00
Reid Kleckner	9b2cc647eb	Fix PR7272 in -tailcallelim instead of the inliner The -tailcallelim pass should be checking if byval or inalloca args can be captured before marking calls as tail calls. This was the real root cause of PR7272. With a better fix in place, revert the inliner change from r105255. The test case it introduced still passes and has been moved to test/Transforms/Inline/byval-tail-call.ll. Reviewers: chandlerc Differential Revision: http://reviews.llvm.org/D3403 llvm-svn: 206789	2014-04-21 20:48:47 +00:00
David Blaikie	eb038915ab	Simplify expression that was explicitly naming an operator overload in a call. llvm-svn: 206788	2014-04-21 20:43:51 +00:00
David Blaikie	229de508bb	Use unique_ptr to handle ownership of GCOVFunctions in GCOVProfiler. llvm-svn: 206786	2014-04-21 20:41:55 +00:00
David Blaikie	2b1dfa7244	Use unique_ptr to handle ownership of UserValues in LiveDebugVariablesImpl llvm-svn: 206785	2014-04-21 20:37:07 +00:00
David Blaikie	422b93dcf1	Use unique_ptr to manage objects owned by the ScheduleDAGMI. llvm-svn: 206784	2014-04-21 20:32:32 +00:00
David Blaikie	b0b7b18e8c	Use value semantics to manage DbgVariables rather than dynamic allocation/pointers. Requires switching some vectors to lists to maintain pointer validity. These could be changed to forward_lists (singly linked) with a bit more work - I've left comments to that effect. llvm-svn: 206780	2014-04-21 20:13:09 +00:00
Filipe Cabecinhas	20352216fb	Rename X86insrtps to the proper instruction name. Summary: The INSERTPS pattern fragment was called insrtps (mising 'e'), which would make it harder to grep for the patterns related to this instruction. Renaming it to use the proper instruction name. Reviewers: nadav CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3443 llvm-svn: 206779	2014-04-21 20:07:29 +00:00
Chandler Carruth	a4a2066482	[Modules] Consolidate the DEBUG_TYPE defines in NVPTX to the top of the cpp file rather than in the header and then again in the cpp file. llvm-svn: 206778	2014-04-21 19:53:55 +00:00
Chandler Carruth	5f1f26e891	[Modules] Sink all the DEBUG_TYPE defines for InstCombine out of the header files and into the cpp files. These files will require more touches as the header files actually use DEBUG(). Eventually, I'll have to introduce a matched #define and #undef of DEBUG_TYPE for the header files, but that comes as step N of many to clean all of this up. llvm-svn: 206777	2014-04-21 19:51:41 +00:00
Chandler Carruth	6d23a7b600	[Modules] Sink the DEBUG_TYPE macro out of LegalizeTypes.h and into the various .cpp files. This macro is inherently non-modular, and it wasn't even needed in this header file. llvm-svn: 206775	2014-04-21 19:43:07 +00:00
Yi Jiang	d069f6393a	ARM64: Combine shifts and uses from different basic block to bit-extract instruction llvm-svn: 206774	2014-04-21 19:34:27 +00:00
Jim Grosbach	36f025e697	Revert "[rtdyld,c++11] Range'ify symbol table walking." Tentative revert for http://lab.llvm.org:8011/builders/llvm-mips-linux/builds/8305. This reverts commit c2a58efff07294fca724f89500538f2ddbcd12ff. llvm-svn: 206773	2014-04-21 19:23:59 +00:00
Matt Arsenault	443252c011	Fix unnecessary line break llvm-svn: 206772	2014-04-21 18:39:13 +00:00
Duncan P. N. Exon Smith	254689fcf9	blockfreq: Some cleanup of UnsignedFloat Change `PositiveFloat` to `UnsignedFloat`, and fix some of the comments to indicate that it's disappearing eventually. llvm-svn: 206771	2014-04-21 18:31:58 +00:00
Jim Grosbach	de577e3d68	[rtdyld,c++11] Range'ify symbol table walking. llvm-svn: 206769	2014-04-21 18:10:31 +00:00
Duncan P. N. Exon Smith	10be9a8868	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206707, reapplying r206704. The preceding commit to CalcSpillWeights should have sorted out the failing buildbots. <rdar://problem/14292693> llvm-svn: 206766	2014-04-21 17:57:07 +00:00
Duncan P. N. Exon Smith	7af3432e22	CalcSpillWeights: Hack to prevent x87 nonsense This gross hack forces `hweight` into memory, preventing hidden precision from making `1 > 1` occasionally equal `true`. <rdar://problem/14292693> llvm-svn: 206765	2014-04-21 17:57:01 +00:00
Evgeniy Stepanov	3939f54820	[msan] Enable out-of-line instrumentation for large functions by default. llvm-svn: 206759	2014-04-21 15:04:05 +00:00
NAKAMURA Takumi	62774f3524	Appease autoconf build since X86Disassembler.c has been disappeared in r206717. It can be reverted a few days later, after X86Disassembler.d is updated not to contain "X86Disassembler.c". llvm-svn: 206758	2014-04-21 14:59:11 +00:00
Kostya Serebryany	35e538312a	[asan] add a run-time flag detect_container_overflow=true/false llvm-svn: 206756	2014-04-21 14:35:00 +00:00
Rafael Espindola	6956b1a517	Convert getFileOffset to getOffset and move it to its only user. We normally don't drop functions from the C API's, but in this case I think we can: * The old implementation of getFileOffset was fairly broken * The introduction of LLVMGetSymbolFileOffset was itself a C api breaking change as it removed LLVMGetSymbolOffset. * It is an incredibly specialized use case. The only reason MCJIT needs it is because of its odd position of being a dynamic linker of .o files. llvm-svn: 206750	2014-04-21 13:45:32 +00:00
Michael Zolotukhin	f2ba994bf6	Reapply r206732. This time without optimization of branches. llvm-svn: 206749	2014-04-21 12:01:33 +00:00
Kostya Serebryany	94c81ca8e1	[asan] instead of inserting inline instrumentation around memset/memcpy/memmove, replace the intrinsic with __asan_memset/etc. This makes the memset/etc handling more complete and consistent with what we do in msan. It may slowdown some cases (when the intrinsic was actually inlined) and speedup other cases (when it was not inlined) llvm-svn: 206746	2014-04-21 11:50:42 +00:00
Chandler Carruth	572e3407c3	[PM] Add a new-PM-style CGSCC pass manager using the newly added LazyCallGraph analysis framework. Wire it up all the way through the opt driver and add some very basic testing that we can build pass pipelines including these components. Still a lot more to do in terms of testing that all of this works, but the basic pieces are here. There is a lot of boiler plate here. It's something I'm going to actively look at reducing, but I don't have any immediate ideas that don't end up making the code terribly complex in order to fold away the boilerplate. Until I figure out something to minimize the boilerplate, almost all of this is based on the code for the existing pass managers, copied and heavily adjusted to suit the needs of the CGSCC pass management layer. The actual CG management still has a bunch of FIXMEs in it. Notably, we don't do any updating of the CG as it is potentially invalidated. I wanted to get this in place to motivate the new analysis, and add update APIs to the analysis and the pass management layers in concert to make sure that the right APIs are present. llvm-svn: 206745	2014-04-21 11:12:00 +00:00
Chandler Carruth	de37c46780	[PM] Fix a bug where we didn't properly clear the list map when the list became empty. This would manifest later as an assert failure due to a non-empty list map but an empty result map. This doesn't easily manifest with just the module pass manager and the function pass manager, but the next commit will add the CGSCC pass manager that hits this assert immediately. llvm-svn: 206744	2014-04-21 11:11:54 +00:00
Kostya Serebryany	94f57d199a	[asan] temporary disable generating __asan_loadN/__asan_storeN llvm-svn: 206741	2014-04-21 10:28:13 +00:00
Benjamin Kramer	d2da720ead	[C++11] Replace OwningPtr with std::unique_ptr in places where it doesn't break the API. No functionality change. llvm-svn: 206740	2014-04-21 09:34:48 +00:00
Lang Hames	5aa6ee80b6	[X86] ISEL (and X, <constant mask>) to BZHI when BMI2 is available. Generating BZHI in the variable mask case, i.e. (and X, (sub (shl 1, N), 1)), was already supported, but we were missing the constant-mask case. This patch fixes that. <rdar://problem/15480077> llvm-svn: 206738	2014-04-21 08:18:53 +00:00
Chandler Carruth	a2533a7bef	Revert r206732 which is causing llc to crash on most of the build bots. Original commit message: Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i61, i32, or i64). llvm-svn: 206735	2014-04-21 07:11:15 +00:00
Kostya Serebryany	86332c010f	[asan] insert __asan_loadN/__asan_storeN as out-lined asan checks, llvm part llvm-svn: 206734	2014-04-21 07:10:43 +00:00
Michael Zolotukhin	137a84616c	Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i16, i32, or i64). llvm-svn: 206732	2014-04-21 05:33:09 +00:00
Chandler Carruth	99b756db04	[LCG] Add some basic debug output to the LCG pass. llvm-svn: 206730	2014-04-21 05:04:24 +00:00
David Blaikie	e9907ba16e	Protect the ArgList dtor It could even be made non-virtual if it weren't for bad compiler warnings. This demonstrates that ArgList objects aren't destroyed polymorphically and possibly that they aren't even used polymorphically. If that's the case, it might be possible to refactor the two ArgList types more separately and simplify the Arg ownership model. continues experimenting llvm-svn: 206727	2014-04-20 23:59:00 +00:00
David Blaikie	f70b21a4b8	Use unique_ptr to handle ownership of synthesized args in DerivedArgList This might be able to be simplified further by using Arg as a value type in a linked list (to maintain pointer validity), but here's something simple to start with. llvm-svn: 206724	2014-04-20 22:37:46 +00:00
Richard Smith	5d50610306	C++ has a bool type! (And C's had one too, for 15 years...) llvm-svn: 206723	2014-04-20 22:15:37 +00:00
Richard Smith	6a6967eeaf	More C++ification. llvm-svn: 206722	2014-04-20 22:10:16 +00:00
Richard Smith	3c3410f139	Remove some more C junk from these files. llvm-svn: 206721	2014-04-20 21:56:02 +00:00
Richard Smith	ac15f1cda3	Don't provide two different definitions of ModRMDecision, OpcodeDecision, and ContextDecision in different source files (depending on #define magic). llvm-svn: 206720	2014-04-20 21:52:16 +00:00
Richard Smith	82b47d5660	Don't define llvm::X86Disassembler::InstructionSpecifier in different ways in different source files. llvm-svn: 206719	2014-04-20 21:35:26 +00:00
Richard Smith	555134215b	Maybe if I touch this file the buildbots will actually rerun configure like they need to... llvm-svn: 206718	2014-04-20 21:28:33 +00:00
Richard Smith	89ee75d786	What year is it! This file has no reason to be written in C, and has doubly no reason to expose a global symbol 'decodeInstruction' nor to pollute the global scope with a bunch of external linkage entities (some of which conflict with others elsewhere in LLVM). This is just the initial transition to C++; more cleanups to follow. llvm-svn: 206717	2014-04-20 21:07:34 +00:00
Alp Toker	9844434151	Remove some empty statements Cleanup only. llvm-svn: 206710	2014-04-19 23:56:35 +00:00
Justin Bogner	a2bfd66e0e	ProfileData: Remove an extra semicolon Spotted by Nick Lewycky in review, thanks! llvm-svn: 206708	2014-04-19 23:42:50 +00:00
Duncan P. N. Exon Smith	e63327e967	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206704, as expected. llvm-svn: 206707	2014-04-19 22:46:00 +00:00
Duncan P. N. Exon Smith	875ddfac75	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206677, reapplying my BlockFrequencyInfo rewrite. I've done a careful audit, added some asserts, and fixed a couple of bugs (unfortunately, they were in unlikely code paths). There's a small chance that this will appease the failing bots [1][2]. (If so, great!) If not, I have a follow-up commit ready that will temporarily add -debug-only=block-freq to the two failing tests, allowing me to compare the code path between what the failing bots and what my machines (and the rest of the bots) are doing. Once I've triggered those builds, I'll revert both commits so the bots go green again. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 <rdar://problem/14292693> llvm-svn: 206704	2014-04-19 22:34:26 +00:00
Yaron Keren	d7ba46b287	Patch by Vadim Chugunov Win64 stack unwinder gets confused when execution flow "falls through" after a call to 'noreturn' function. This fixes the "missing epilogue" problem by emitting a trap instruction for IR 'unreachable' on x86_x64-pc-windows. A secondary use for it would be for anyone wanting to make double-sure that 'noreturn' functions, indeed, do not return. llvm-svn: 206684	2014-04-19 13:47:43 +00:00
Yaron Keren	421304d18c	Patch by Ray Donnelly to print register names instead of numbers. http://reviews.llvm.org/D3422 llvm-svn: 206683	2014-04-19 05:40:09 +00:00
David Blaikie	b5956d232f	Add parens to appease GCC warning. llvm-svn: 206678	2014-04-19 00:50:15 +00:00
Duncan P. N. Exon Smith	76b813619a	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206666, as planned. Still stumped on why the bots are failing. Sanitizer bots haven't turned anything up. If anyone can help me debug either of the failures (referenced in r206666) I'll owe them a beer. (In the meantime, I'll be auditing my patch for undefined behaviour.) llvm-svn: 206677	2014-04-19 00:42:46 +00:00
Justin Bogner	e808171628	OnDiskHashTable: Audit types and use offset_type consistently llvm-svn: 206675	2014-04-19 00:33:15 +00:00
Nick Lewycky	05e0f1ccff	Check whether functions have any lines associated before emitting coverage info for them. This isn't just a size/time saving, gcov may crash on these. llvm-svn: 206671	2014-04-18 23:32:28 +00:00
Kevin Enderby	b7e51f6af5	Change the ARM assembler to require a :lower16: or :upper16 on non-constant expressions for mov instructions instead of silently truncating by default. For the ARM assembler, we want to avoid misleadingly allowing something like "mov r0, <symbol>" especially when we turn it into a movw and the expression <symbol> does not have a :lower16: or :upper16" as part of the expression. We don't want the behavior of silently truncating, which can be unexpected and lead to bugs that are difficult to find since this is an easy mistake to make. This does change the previous behavior of llvm but actually matches an older gnu assembler that would not allow this but print less useful errors of like “invalid constant (0x927c0) after fixup” and “unsupported relocation on symbol foo”. The error for llvm is "immediate expression for mov requires :lower16: or :upper16" with correct location information on the operand as shown in the added test cases. rdar://12342160 llvm-svn: 206669	2014-04-18 23:06:39 +00:00
Duncan P. N. Exon Smith	b3caf3646f	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206628, reapplying r206622 (and r206626). Two tests are failing only on buildbots [1][2]: i.e., I can't reproduce on Darwin, and Chandler can't reproduce on Linux. Asan and valgrind don't tell us anything, but we're hoping the msan bot will catch it. So, I'm applying this again to get more feedback from the bots. I'll leave it in long enough to trigger builds in at least the sanitizer buildbots (it was failing for reasons unrelated to my commit last time it was in), and hopefully a few others.... and then I expect to revert a third time. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 llvm-svn: 206666	2014-04-18 22:30:03 +00:00
Alexey Samsonov	5c39fdfb7b	[llvm-symbolizer] Print file/line for a PC even if there is no DIE describing it. This is important for symbolizing executables with debug info in unavailable .dwo files. Even if all DIE entries are missing, we can still symbolize an address: function name can be fetched from symbol table, and file/line info can be fetched from line table. llvm-svn: 206665	2014-04-18 22:22:44 +00:00
Justin Bogner	b5d368e838	ProfileData: Don't forward declare ComputeHash and make it static inline llvm-svn: 206663	2014-04-18 22:00:22 +00:00
David Blaikie	76d3a3cd35	Compress debug sections only when beneficial. Both ZLIB and the debug info compressed section header ("ZLIB" + the size of the uncompressed data) take some constant overhead so in some cases the compressed data is actually larger than the uncompressed data. In these cases, just don't compress or rename the section at all. llvm-svn: 206659	2014-04-18 21:52:26 +00:00
Justin Bogner	b7aa26303b	ProfileData: Add support for the indexed instrprof format This adds support for an indexed instrumentation based profiling format, which is just a small header and an on disk hash table. This format will be used by clang's -fprofile-instr-use= for PGO. llvm-svn: 206656	2014-04-18 21:48:40 +00:00
Alexey Samsonov	d010999abe	[DWARF parser] Turn DILineInfo into a struct. Immutable DILineInfo doesn't bring any benefits and complicates code. Also, use std::string instead of SmallString<16> for file and function names - their length can vary significantly. No functionality change. llvm-svn: 206654	2014-04-18 21:36:39 +00:00
David Blaikie	c029ab430c	Update the fragments of symbols in compressed sections. While unnamed relocations are already cached in side tables in ELFObjectWriter::RecordRelocation, symbols still need their fragments updated to refer to the newly compressed fragment (even if that fragment isn't big enough to fit the offset). Even though we only create temporary symbols in debug info sections this comes up in 32 bit builds where even temporary symbols in mergeable sections (such as debug_str) have to be emitted as named symbols. I tried a few other ways to do this but they all didn't work for various reasons: 1) Canonicalize the MCSymbolData in RecordRelocation, nulling out the Fragment (so it didn't have to be updated by CompressDebugSection). This doesn't work because some code relies on symbols having fragments to indicate that they're defined, I think. 2) Canonicalize the MCSymbolData in RecordRelocation to be "first fragment + absolute offset" so it would be cheaper to just test and update the fragment in CompressDebugSections. This doesn't work because the offset computed in RecordRelocation isn't that of the symbol's fragment, it's the passed in fragment (I haven't figured out what that fragment is - perhaps it's the location where the relocation is to be written). And if the fragment offset has to be computed only for this use we might as well just do it when we need to, in CompressDebugSection. I also added an assert to help catch this a bit more clearly, even though it is UB. The test case improvements would either assert fail and/or valgrind vail without the fix, even if they wouldn't necessarily fail the FileCheck output. llvm-svn: 206653	2014-04-18 21:24:12 +00:00
Chad Rosier	9149acb053	[ARM64] Ports the Cortex-A53 Machine Model description from AArch64. Summary: This port includes the rudimentary latencies that were provided for the Cortex-A53 Machine Model in the AArch64 backend. It also changes the SchedAlias for COPY in the Cyclone model to an explicit WriteRes mapping to avoid conflicts in other subtargets. Differential Revision: http://reviews.llvm.org/D3427 Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 206652	2014-04-18 21:22:04 +00:00
Chandler Carruth	2174f44f61	[LCG] Fix the bugs that Ben pointed out in code review (and the MSan bot caught). Sad that we don't have warnings for these things, but bleh, no idea how to fix that. llvm-svn: 206646	2014-04-18 20:44:16 +00:00
Alexey Samsonov	84e2423d34	[DWARF parser] Respect address ranges specified in compile unit DIE. When address ranges for compile unit are specified in compile unit DIE itself, there is no need to collect ranges from children subprogram DIEs. This change speeds up llvm-symbolizer on Clang-produced binaries with full debug info. For instance, symbolizing a first address in a 1Gb binary is now 2x faster (1s vs. 2s). llvm-svn: 206641	2014-04-18 20:30:27 +00:00
Benjamin Kramer	147644d400	Remove a couple of redundant copies of SmallVector::operator==. No functionality change. llvm-svn: 206635	2014-04-18 19:48:03 +00:00
Adam Nemet	ee7a3e38c9	[X86] Improve buildFromShuffleMostly for AVX For a 256-bit BUILD_VECTOR consisting mostly of shuffles of 256-bit vectors, both the BUILD_VECTOR and its operands may need to be legalized in multiple steps. Consider: (v8f32 (BUILD_VECTOR (extract_vector_elt (v8f32 %vreg0,) Constant<1>), (extract_vector_elt %vreg0, Constant<2>), (extract_vector_elt %vreg0, Constant<3>), (extract_vector_elt %vreg0, Constant<4>), (extract_vector_elt %vreg0, Constant<5>), (extract_vector_elt %vreg0, Constant<6>), (extract_vector_elt %vreg0, Constant<7>), %vreg1)) a. We can't build a 256-bit vector efficiently so, we need to split it into two 128-bit vecs and combine them with VINSERTX128. b. Operands like (extract_vector_elt (v8f32 %vreg0), Constant<7>) needs to be split into a VEXTRACTX128 and a further extract_vector_elt from the resulting 128-bit vector. c. The extract_vector_elt from b. is lowered into a shuffle to the first element and a movss. Depending on the order in which we legalize the BUILD_VECTOR and its operands[1], buildFromShuffleMostly may be faced with: (v4f32 (BUILD_VECTOR (extract_vector_elt (vector_shuffle<1,u,u,u> (extract_subvector %vreg0, Constant<4>), undef), Constant<0>), (extract_vector_elt (vector_shuffle<2,u,u,u> (extract_subvector %vreg0, Constant<4>), undef), Constant<0>), (extract_vector_elt (vector_shuffle<3,u,u,u> (extract_subvector %vreg0, Constant<4>), undef), Constant<0>), %vreg1)) In order to figure out the underlying vector and their identity we need to see through the shuffles. [1] Note that the order in which operations and their operands are legalized is only guaranteed in the first iteration of LegalizeDAG. Fixes <rdar://problem/16296956> llvm-svn: 206634	2014-04-18 19:44:16 +00:00
Benjamin Kramer	ee26b621f0	DebugInfo: Remove some initializer lists to make MSVC happy again. llvm-svn: 206632	2014-04-18 19:01:53 +00:00
David Blaikie	583a31c976	Add range access to MCAssembler's symbol collection. llvm-svn: 206631	2014-04-18 18:24:25 +00:00
Duncan P. N. Exon Smith	0842ff36a6	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206622 and the MSVC fixup in r206626. Apparently the remotely failing tests are still failing, despite my attempt to fix the nondeterminism in r206621. llvm-svn: 206628	2014-04-18 17:56:08 +00:00
Duncan P. N. Exon Smith	38fe464df0	Fixing MSVC after r206622? llvm-svn: 206626	2014-04-18 17:38:01 +00:00
Andrew Trick	1766f93b35	Better comments to explain buffered/unbuffered processor resources. llvm-svn: 206625	2014-04-18 17:35:08 +00:00
Alexey Samsonov	762343da6c	[DWARF parser] Refactor fetching DIE address ranges. Add a helper method to get address ranges specified in a DIE (either by DW_AT_low_pc/DW_AT_high_pc, or by DW_AT_ranges). Use it to untangle and simplify the code. No functionality change. llvm-svn: 206624	2014-04-18 17:25:46 +00:00
Duncan P. N. Exon Smith	f8361d127a	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206556, effectively reapplying commit r206548 and its fixups in r206549 and r206550. In an intervening commit I've added target triples to the tests that were failing remotely [1] (but passing locally). I'm hoping the mystery is solved? I'll revert this again if the tests are still failing remotely. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206622	2014-04-18 17:22:25 +00:00
Tim Northover	37d9a9cebf	ARM64: disable generation of .loh directives outside MachO. Part of PR19455. llvm-svn: 206611	2014-04-18 14:54:46 +00:00
Tim Northover	be1d1b6681	ARM64: don't emit .subsections_via_symbols on ELF. Part of PR19455. llvm-svn: 206610	2014-04-18 14:54:41 +00:00
Tim Northover	be3941cc79	ARM64: add extra NEG pattern. llvm-svn: 206609	2014-04-18 14:54:35 +00:00
Tim Northover	e3028832d1	AArch64/ARM64: add non-scalar lowering for more FCVT operations. llvm-svn: 206591	2014-04-18 13:16:42 +00:00
Tim Northover	01f315a556	AArch64/ARM64: improve spotting of EXT instructions from VECTOR_SHUFFLE. We couldn't cope if the first mask element was UNDEF before, which isn't ideal. llvm-svn: 206588	2014-04-18 12:50:58 +00:00
Evgeniy Stepanov	65120ec8c6	[msan] Add -msan-instrumentation-with-call-threshold. This flag replaces inline instrumentation for checks and origin stores with calls into MSan runtime library. This is a workaround for PR17409. Disabled by default. llvm-svn: 206585	2014-04-18 12:17:20 +00:00
Chandler Carruth	d8d865e266	[LCG] Remove all of the complexity stemming from supporting copying. Reality is that we're never going to copy one of these. Supporting this was becoming a nightmare because nothing even causes it to compile most of the time. Lots of subtle errors built up that wouldn't have been caught by any "normal" testing. Also, make the move assignment actually work rather than the bogus swap implementation that would just infloop if used. As part of that, factor out the graph pointer updates into a helper to share between move construction and move assignment. llvm-svn: 206583	2014-04-18 11:02:33 +00:00
Chandler Carruth	18eadd9260	[LCG] Add support for building persistent and connected SCCs to the LazyCallGraph. This is the start of the whole point of this different abstraction, but it is just the initial bits. Here is a run-down of what's going on here. I'm planning to incorporate some (or all) of this into comments going forward, hopefully with better editing and wording. =] The crux of the problem with the traditional way of building SCCs is that they are ephemeral. The new pass manager however really needs the ability to associate analysis passes and results of analysis passes with SCCs in order to expose these analysis passes to the SCC passes. Making this work is kind-of the whole point of the new pass manager. =] So, when we're building SCCs for the call graph, we actually want to build persistent nodes that stick around and can be reasoned about later. We'd also like the ability to walk the SCC graph in more complex ways than just the traditional postorder traversal of the current CGSCC walk. That means that in addition to being persistent, the SCCs need to be connected into a useful graph structure. However, we still want the SCCs to be formed lazily where possible. These constraints are quite hard to satisfy with the SCC iterator. Also, using that would bypass our ability to actually add data to the nodes of the call graph to facilite implementing the Tarjan walk. So I've re-implemented things in a more direct and embedded way. This immediately makes it easy to get the persistence and connectivity correct, and it also allows leveraging the existing nodes to simplify the algorithm. I've worked somewhat to make this implementation more closely follow the traditional paper's nomenclature and strategy, although it is still a bit obtuse because it isn't recursive, using an explicit stack and a tail call instead, and it is interruptable, resuming each time we need another SCC. The other tricky bit here, and what actually took almost all the time and trials and errors I spent building this, is exactly what graph structure to build for the SCCs. The naive thing to build is the call graph in its newly acyclic form. I wrote about 4 versions of this which did precisely this. Inevitably, when I experimented with them across various use cases, they became incredibly awkward. It was all implementable, but it felt like a complete wrong fit. Square peg, round hole. There were two overriding aspects that pushed me in a different direction: 1) We want to discover the SCC graph in a postorder fashion. That means the root node will be the last node we find. Using the call-SCC DAG as the graph structure of the SCCs results in an orphaned graph until we discover a root. 2) We will eventually want to walk the SCC graph in parallel, exploring distinct sub-graphs independently, and synchronizing at merge points. This again is not helped by the call-SCC DAG structure. The structure which, quite surprisingly, ended up being completely natural to use is the inverse of the call-SCC DAG. We add the leaf SCCs to the graph as "roots", and have edges to the caller SCCs. Once I switched to building this structure, everything just fell into place elegantly. Aside from general cleanups (there are FIXMEs and too few comments overall) that are still needed, the other missing piece of this is support for iterating across levels of the SCC graph. These will become useful for implementing #2, but they aren't an immediate priority. Once SCCs are in good shape, I'll be working on adding mutation support for incremental updates and adding the pass manager that this analysis enables. llvm-svn: 206581	2014-04-18 10:50:32 +00:00
Benjamin Kramer	e6c821ef4c	X86: Pattern match scalar loads + vcvtph2ps into just vcvtph2ps. vcvtph2ps only reads the lower 64 bits of the address passed to the intrinsic. llvm-svn: 206579	2014-04-18 10:45:33 +00:00
Chandler Carruth	1911882569	Revert r206565 (and r206566 which updated tests). This commit was attributed to a different person from the person who posted the patch to the list, and the person who posted it the list claimed when they did that they were not the author, but that the author was yet a third person. I don't know what is going on here, but reverting until the attribution is clear and the author has explicitly contributed the patch. Also, the review hasn't really involved any of the MC maintainers and that seems questionable too. llvm-svn: 206576	2014-04-18 09:35:51 +00:00
Tim Northover	a2c4c71c12	AArch64/ARM64: spot a greater variety of concat_vector operations. Code mostly copied from AArch64, just tidied up a trifle and plumbed into the ARM64 way of doing things. This also enables the AArch64 tests which inspired the previous untested commits. llvm-svn: 206574	2014-04-18 09:31:27 +00:00
Tim Northover	848bb3ced5	ARM64: implement cunning optimisation from AArch64 A vector extract followed by a dup can become a single instruction even if the types don't match. AArch64 handled this in ISelLowering, but a few reasonably simple patterns can take care of it in TableGen, so that's where I've put it. llvm-svn: 206573	2014-04-18 09:31:20 +00:00
Tim Northover	5ec51a8981	ARM64: spot a vector_shuffle that maps to INS and expand. Tests will be coming very shortly when all the optimisations needed to support AArch64's neon-copy.ll file are committed. llvm-svn: 206572	2014-04-18 09:31:15 +00:00
Tim Northover	46d98ea8de	ARM64: nick some AArch64 patterns for extract/insert -> INS. Tests will be committed shortly when all optimisations needed to support AArch64's neon-copy.ll file are supported. llvm-svn: 206571	2014-04-18 09:31:11 +00:00
Tim Northover	8b2fa3dfef	AArch64/ARM64: emit all vector FP comparisons as such. ARM64 was scalarizing some vector comparisons which don't quite map to AArch64's compare and mask instructions. AArch64's approach of sacrificing a little efficiency to emulate them with the limited set available was better, so I ported it across. More "inspired by" than copy/paste since the backend's internal expectations were a bit different, but the tests were invaluable. llvm-svn: 206570	2014-04-18 09:31:07 +00:00
Tim Northover	0a44e66bb8	AArch64/ARM64: port BSL logic from AArch64 & enable test. I enhanced it a little in the process. The decision shouldn't really be beased on whether a BUILD_VECTOR is a splat: any set of constants will do the job provided they're related in the correct way. Also, the BUILD_VECTOR could be any operand of the incoming AND nodes, so it's best to check for all 4 possibilities rather than assuming it'll be the RHS. llvm-svn: 206569	2014-04-18 09:31:01 +00:00
Tim Northover	547a4ae6fa	AArch64/ARM64: copy byval implementation from AArch64. It's not actually used to handle C or C++ ABI rules on ARM64, but could well be emitted by other language front-ends, so it's as well to have a sensible implementation. llvm-svn: 206568	2014-04-18 09:30:52 +00:00
Yaron Keren	8ca45e0c05	Patch by Ray Donnelly. Emit WIN64 SEH registers by name instead of just number. llvm-svn: 206565	2014-04-18 08:03:38 +00:00
Kostya Serebryany	22e8810838	[asan] one more workaround for PR17409: don't do BB-level coverage instrumentation if there are more than N (=1500) basic blocks. This makes ASanCoverage work on libjpeg_turbo/jchuff.c used by Chrome, which has 1824 BBs llvm-svn: 206564	2014-04-18 08:02:42 +00:00
Jiangning Liu	ad874fca28	This commit allows vectorized loops to be unrolled by a factor of 2 for AArch64. A new test case is also added for ARM64. Patched by Z.Zheng llvm-svn: 206563	2014-04-18 07:57:54 +00:00
Matt Arsenault	209a7b92b5	R600: Minor cleanups. Fix indentation, better line wrapping, unused includes. llvm-svn: 206562	2014-04-18 07:40:20 +00:00
Lang Hames	bc876017c2	[ExecutionEngine] Allow JIT clients to enable/disable module verification. Previously module verification was always enabled, with no way to turn it off. As of this commit, module verification is on by default in Debug builds, and off by default in release builds. The default behaviour can be overridden by calling setVerifyModules(bool) on the JIT instance (this works for both the old JIT, and MCJIT). <rdar://problem/16150008> llvm-svn: 206561	2014-04-18 06:48:23 +00:00
Jiangning Liu	40d81e10c5	This is one of the optimizations ported from ARM64 to AArch64 to address the performance gap between these two back ends. The test case newly added for AArch64 already exists in ARM64. Patched by Z.Zheng llvm-svn: 206559	2014-04-18 05:58:09 +00:00
Matt Arsenault	78b8670aac	R600/SI: Try to use scalar BFE. Use scalar BFE with constant shift and offset when possible. This is complicated by the fact that the scalar version packs the two operands of the vector version into one. llvm-svn: 206558	2014-04-18 05:19:26 +00:00
Jiangning Liu	e56c30614f	This commit enables unaligned memory accesses of vector types on AArch64 back end. This should boost vectorized code performance. Patched by Z. Zheng llvm-svn: 206557	2014-04-18 03:58:38 +00:00
Duncan P. N. Exon Smith	e576167df8	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commits r206548, r206549 and r206549. There are some unit tests failing that aren't failing locally [1], so reverting until I have time to investigate. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206556	2014-04-18 02:17:43 +00:00
Duncan P. N. Exon Smith	878cf2b804	blockfreq: Really fix r206548 (and r206549) Turns out this code is dead. llvm-svn: 206554	2014-04-18 02:10:09 +00:00
Duncan P. N. Exon Smith	c7abca54cf	blockfreq: Fixing MSVC after r206548? llvm-svn: 206549	2014-04-18 02:06:24 +00:00
Duncan P. N. Exon Smith	12e68e1733	blockfreq: Rewrite BlockFrequencyInfoImpl Rewrite the shared implementation of BlockFrequencyInfo and MachineBlockFrequencyInfo entirely. The old implementation had a fundamental flaw: precision losses from nested loops (or very wide branches) compounded past loop exits (and convergence points). The @nested_loops testcase at the end of test/Analysis/BlockFrequencyAnalysis/basic.ll is motivating. This function has three nested loops, with branch weights in the loop headers of 1:4000 (exit:continue). The old analysis gives non-sensical results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': ---- Block Freqs ---- entry = 1.0 for.cond1.preheader = 1.00103 for.cond4.preheader = 5.5222 for.body6 = 18095.19995 for.inc8 = 4.52264 for.inc11 = 0.00109 for.end13 = 0.0 The new analysis gives correct results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': block-frequency-info: nested_loops - entry: float = 1.0, int = 8 - for.cond1.preheader: float = 4001.0, int = 32007 - for.cond4.preheader: float = 16008001.0, int = 128064007 - for.body6: float = 64048012001.0, int = 512384096007 - for.inc8: float = 16008001.0, int = 128064007 - for.inc11: float = 4001.0, int = 32007 - for.end13: float = 1.0, int = 8 Most importantly, the frequency leaving each loop matches the frequency entering it. The new algorithm leverages BlockMass and PositiveFloat to maintain precision, separates "probability mass distribution" from "loop scaling", and uses dithering to eliminate probability mass loss. I have unit tests for these types out of tree, but it was decided in the review to make the classes private to BlockFrequencyInfoImpl, and try to shrink them (or remove them entirely) in follow-up commits. The new algorithm should generally have a complexity advantage over the old. The previous algorithm was quadratic in the worst case. The new algorithm is still worst-case quadratic in the presence of irreducible control flow, but it's linear without it. The key difference between the old algorithm and the new is that control flow within a loop is evaluated separately from control flow outside, limiting propagation of precision problems and allowing loop scale to be calculated independently of mass distribution. Loops are visited bottom-up, their loop scales are calculated, and they are replaced by pseudo-nodes. Mass is then distributed through the function, which is now a DAG. Finally, loops are revisited top-down to multiply through the loop scales and the masses distributed to pseudo nodes. There are some remaining flaws. - Irreducible control flow isn't modelled correctly. LoopInfo and MachineLoopInfo ignore irreducible edges, so this algorithm will fail to scale accordingly. There's a note in the class documentation about how to get closer. See also the comments in test/Analysis/BlockFrequencyInfo/irreducible.ll. - Loop scale is limited to 4096 per loop (2^12) to avoid exhausting the 64-bit integer precision used downstream. - The "bias" calculation proposed on llvmdev is not incorporated here. This will be added in a follow-up commit, once comments from this review have been handled. llvm-svn: 206548	2014-04-18 01:57:45 +00:00
Matt Arsenault	27cc958dff	R600/SI: Match sign_extend_inreg to s_sext_i32_i8 and s_sext_i32_i16 llvm-svn: 206547	2014-04-18 01:53:18 +00:00
Duncan P. N. Exon Smith	49f3ec80c2	PMBuilder: Expose an option to disable tail calls Adds API to allow frontends to disable tail calls in PassManagerBuilder. <rdar://problem/16050591> llvm-svn: 206542	2014-04-18 01:05:15 +00:00
Tom Stellard	1aa6cb4d88	R600/SI: Use SReg_64 instead of VSrc_64 when selecting BUILD_PAIR llvm-svn: 206541	2014-04-18 00:36:21 +00:00
Jim Grosbach	6bfe18a365	[ARM64,C++11] Range'ify another loop. llvm-svn: 206539	2014-04-17 23:41:57 +00:00
Diego Novillo	0915c047c2	Fix bug 19437 - Only add discriminators for DWARF 4 and above. Summary: This prevents the discriminator generation pass from triggering if the DWARF version being used in the module is prior to 4. Reviewers: echristo, dblaikie CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3413 llvm-svn: 206507	2014-04-17 22:33:50 +00:00
Nuno Lopes	9ced19abe8	remove some dead code lib/Analysis/IPA/InlineCost.cpp \| 18 ------------------ lib/Analysis/RegionPass.cpp \| 1 - lib/Analysis/TypeBasedAliasAnalysis.cpp \| 1 - lib/Transforms/Scalar/LoopUnswitch.cpp \| 21 --------------------- lib/Transforms/Utils/LCSSA.cpp \| 2 -- lib/Transforms/Utils/LoopSimplify.cpp \| 6 ------ utils/TableGen/AsmWriterEmitter.cpp \| 13 ------------- utils/TableGen/DFAPacketizerEmitter.cpp \| 7 ------- utils/TableGen/IntrinsicEmitter.cpp \| 2 -- 9 files changed, 71 deletions(-) llvm-svn: 206506	2014-04-17 22:26:44 +00:00
Reed Kotler	720c5ca4ea	Start pushing changes for Mips Fast-Isel llvm-svn: 206505	2014-04-17 22:15:34 +00:00
Tom Stellard	aeeea8a864	R600: Add comment clariying use of sext for result of MUL_U24 llvm-svn: 206501	2014-04-17 21:00:13 +00:00
Tom Stellard	868fd92e54	R600/SI: Stop using i128 as the resource descriptor type Having i128 as a legal type complicates the legalization phase. v4i32 is already a legal type, so we will use that instead. This fixes several piglit tests. llvm-svn: 206500	2014-04-17 21:00:11 +00:00
Tom Stellard	334b29c7f6	R600/SI: Change default register class for i32 to SReg_32 SIFixSGPRCopies is smart enough to handle this now. llvm-svn: 206499	2014-04-17 21:00:09 +00:00
Tom Stellard	4f3b04de21	R600/SI: Teach SIInstrInfo::moveToVALU() how to handle PHI instructions llvm-svn: 206498	2014-04-17 21:00:07 +00:00
Tom Stellard	e1a244502c	R600/SI: Legalize operands after changing dst reg in FixSGPRCopies Otherwise we may not legalize some illegal REG_SEQUENCE instructions. llvm-svn: 206497	2014-04-17 21:00:01 +00:00
Louis Gerbarg	153e695ee2	Improve ARM64 vector creation This patch improves the performance of vector creation in caseiswhere where several of the lanes in the vector are a constant floating point value. It also includes new patterns to fold together some of the instructions when the value is 0.0f. Test cases included. rdar://16349427 llvm-svn: 206496	2014-04-17 20:51:50 +00:00
Jim Grosbach	0fba6d98fc	ARM64: [su]xtw use W regs as inputs, not X regs. Update the SXT[BHW]/UXTW instruction aliases and the shifted reg addressing mode handling. PR19455 and rdar://16650642 llvm-svn: 206495	2014-04-17 20:47:31 +00:00
David Blaikie	5b01593de4	ManagedStatic is never built with a null constructor, remove support for it. llvm-svn: 206492	2014-04-17 20:30:35 +00:00
Tim Northover	11a6082e33	ARM64: switch to IR-based atomic operations. Goodbye code! (Game: spot the bug fixed by the change). llvm-svn: 206490	2014-04-17 20:00:33 +00:00
Tim Northover	0129f298c4	ARM64: add acquire/release versions of the existing atomic intrinsics. These will be needed to support IR-level lowering of atomic operations. llvm-svn: 206489	2014-04-17 20:00:24 +00:00
Gerolf Hoflehner	ecebc3730e	Reverse 206485. After some discussions the preferred semantics of the always_inline attribute is inline always when the compiler can determine that it it safe to do so. llvm-svn: 206487	2014-04-17 19:14:06 +00:00
Josh Magee	adfde5fef6	[stack protector] Make the StackProtector pass respect ssp-buffer-size. Previously, SSPBufferSize was assigned the value of the "stack-protector-buffer-size" attribute after all uses of SSPBufferSize. The effect was that the default SSPBufferSize was always used during analysis. I moved the check for the attribute before the analysis; now --param ssp-buffer-size= works correctly again. Differential Revision: http://reviews.llvm.org/D3349 llvm-svn: 206486	2014-04-17 19:08:36 +00:00
Tim Northover	037f26f212	Atomics: promote ARM's IR-based atomics pass to CodeGen. Still only 32-bit ARM using it at this stage, but the promotion allows direct testing via opt and is a reasonably self-contained patch on the way to switching ARM64. At this point, other targets should be able to make use of it without too much difficulty if they want. (See ARM64 commit coming soon for an example). llvm-svn: 206485	2014-04-17 18:22:47 +00:00
Matt Arsenault	a90d22fad5	R600/SI: f64 frint is legal on CI llvm-svn: 206475	2014-04-17 17:06:37 +00:00
Chad Rosier	c4eb4f8827	[AArch64] Implement the getCSRFirstUseCost API, mirroring that in ARM64. llvm-svn: 206473	2014-04-17 16:19:54 +00:00
NAKAMURA Takumi	cd1fc4bc1b	Inliner::OptimizationRemark: Fix crash in clang/test/Frontend/optimization-remark.c on some hosts, including --vg. DebugLoc in Callsite would not live after Inliner. It should be copied before Inliner. llvm-svn: 206459	2014-04-17 12:22:14 +00:00
Chandler Carruth	b60cb315bc	[LCG] Just move the allocator (now that we can) when moving a call graph. This simplifies the custom move constructor operation to one of walking the graph and updating the 'up' pointers to point to the new location of the graph. Switch the nodes from a reference to a pointer for the 'up' edge to facilitate this. llvm-svn: 206450	2014-04-17 07:25:59 +00:00
Chandler Carruth	81f497d176	[LCG] Remove the Module reference member which we weren't using for anything and doesn't make sense if assigning. llvm-svn: 206449	2014-04-17 07:22:19 +00:00
Craig Topper	0a9bf4c0c5	[X86] Add disassembler support for the 0x0f 0x7f form of movq %mm, %mm. llvm-svn: 206447	2014-04-17 06:33:45 +00:00
Saleem Abdulrasool	1614b26886	MC: rework static_assert to be MSVC compatible Visual Studio does not permit referencing a structure member as a static field for sizeof calculations. Resort to a pointer cast which is compatible across Visual Studio and other compilers. llvm-svn: 206445	2014-04-17 06:17:20 +00:00
Matt Arsenault	51df0c1965	R600/SI: Fix zext from i1 to i64 llvm-svn: 206437	2014-04-17 02:03:08 +00:00
Adam Nemet	287f989dde	[ARM64] Fix "Cannot select" for vector ctpop The commit of r205855: Author: Arnold Schwaighofer <aschwaighofer@apple.com> Date: Wed Apr 9 14:20:47 2014 +0000 SLPVectorizer: Only vectorize intrinsics whose operands are widened equally The vectorizer only knows how to vectorize intrinics by widening all operands by the same factor. Patch by Tyler Nowicki! exposed a backend bug causing a regression (Cannot select ctpop). The commit msg is a bit confusing because the patch actually changes the behavior for the loop-vectorizer as well. As things got refactored into a helper ctpop got snuck in to the trivially-vectorizable helper which is now used by both vectorizers. In other words, we started seeing vector-ctpops in the backend. This change makes ctpop LegalizeAction::Expand for the types not supported by the byte-only CNT instruction. We may be able to custom-lower these later to a single CNT but this is to fix the compiler crash first. Fixes <rdar://problem/16578951> llvm-svn: 206433	2014-04-17 01:01:37 +00:00
Gerolf Hoflehner	5f6268a40e	Inline a function when the always_inline attribute is set even when it contains a indirect branch. The attribute overrules correctness concerns like the escape of a local block address. This is for rdar://16501761 llvm-svn: 206429	2014-04-17 00:21:52 +00:00
Jim Grosbach	6623e7f94a	[c++11] Tidy up AsmPrinter.cpp. Range'ify loops and tidy up some by-reference handling. No functional change. llvm-svn: 206422	2014-04-16 22:38:02 +00:00
Tom Stellard	1580dc78ae	Added new functionality to LLVM C API to use DiagnosticInfo to handle errors Patch by: Darren Powell llvm-svn: 206407	2014-04-16 17:45:04 +00:00
Aaron Ballman	5f1378c2a4	Replacing a non-ASCII character in a comment with an ASCII character. Fixes a C4819 warning in MSVC. llvm-svn: 206403	2014-04-16 17:09:20 +00:00
Diego Novillo	df655013a9	Allow diagnostic handlers to check for optimization remarks. Summary: When optimization remarks are enabled via the driver flag -Rpass, we should allow the FE diagnostic handler to check if the given pass name needs a diagnostic. We were unconditionally checking the pattern defined in opt's -pass-remarks flag. This was causing the FE to not emit any diagnostics. Reviewers: qcolombet CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3362 llvm-svn: 206400	2014-04-16 16:53:41 +00:00
Matheus Almeida	483d7e9349	[mips] Use TwoOperandAliasConstraint for shift instructions. This enables TableGen to generate an additional two operand matcher for our shift_rotate_imm and shift_rotate_reg class of instructions. The tests were also updated so that they include now encoding information for all affected instructions. llvm-svn: 206398	2014-04-16 16:28:59 +00:00
Matheus Almeida	0051f2dc78	[mips] Add initial support for NaN2008 in the back-end. This is so that EF_MIPS_NAN2008 is set if we are using IEEE 754-2008 NaN encoding (-mnan=2008). This patch also adds support for parsing '.nan legacy' and '.nan 2008' assembly directives. The handling of these directives should match GAS' behaviour i.e., the last directive in use sets the ELF header bit (EF_MIPS_NAN2008). Differential Revision: http://reviews.llvm.org/D3346 llvm-svn: 206396	2014-04-16 15:48:55 +00:00
Tim Northover	ef7b34d403	ARM64: silence sign-comparison warning. llvm-svn: 206393	2014-04-16 15:28:06 +00:00
Tim Northover	3e69958b6b	AArch64/ARM64: produce correct relocation for conditional branches. llvm-svn: 206391	2014-04-16 15:27:52 +00:00
Daniel Sanders	82cd99a126	[mips] Indentation llvm-svn: 206389	2014-04-16 14:38:27 +00:00
Daniel Sanders	16fa1db637	[mips] Fix emission of '.option pic0' for MIPS-IV. Summary: This was a case of incorrect usage of hasMips64() vs isABI_N64() Reviewers: matheusalmeida, dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3398 llvm-svn: 206388	2014-04-16 13:58:57 +00:00
Daniel Sanders	a024fb0e04	[mips] Correct r206370 to account for non-Linux targets using the small data section. This should fix the ninja-x64-msvc-RA-centos6 builder. I suspect the check in MipsSubtarget.cpp is incorrect and is really trying to check for a bare-metal target rather and anything other than linux. I'll investigate this. llvm-svn: 206385	2014-04-16 12:29:08 +00:00
Kostya Serebryany	0c02d26d6b	[asan] add two new hidden compile-time flags for asan: asan-instrumentation-with-call-threshold and asan-memory-access-callback-prefix. This is part of the workaround for PR17409 (instrument huge functions with callbacks instead of inlined code). These flags will also help us experiment with kasan (kernel-asan) and clang llvm-svn: 206383	2014-04-16 12:12:19 +00:00
Tim Northover	3ec1de7767	AArch64/ARM64: port across stub handling for ELF C++ exceptions. The most important part here is that we should actuall emit the stubs we refer to in the exception table, but as a side issue this uses more sensible & GCC compatible representations for some of the bits of information. llvm-svn: 206380	2014-04-16 11:52:55 +00:00
Tim Northover	18f68f6d1a	ARM64: use 32-bit moves for constants where possible. If we know that a particular 64-bit constant has all high bits zero, then we can rely on the fact that 32-bit ARM64 instructions automatically zero out the high bits of an x-register. This gives the expansion logic less constraints to satisfy and so sometimes allows it to pick better sequences. Came up while porting test/CodeGen/AArch64/movw-consts.ll: this will allow a 32-bit MOVN to be used in @test8 soon. llvm-svn: 206379	2014-04-16 11:52:51 +00:00
Tim Northover	9cfb57dafa	ARM64: use the integrated assembler on ELF. llvm-svn: 206378	2014-04-16 11:52:40 +00:00
Matheus Almeida	dc7e48e084	[mips] Emit '.set nomicromips' before a function's entry label if not in micromips mode. The test (elf_st_other.ll) was renamed as the name and description didn't make sense as the test wasn't checking any symbol table entry. Differential Revision: http://reviews.llvm.org/D3346 llvm-svn: 206377	2014-04-16 11:46:59 +00:00
Aaron Ballman	58ce7f24cd	Fixing a compile error in debug versions of MSVC. It seems that the range-based for loop is confused by the DEBUG macro expansion unless a compound statement is used. llvm-svn: 206376	2014-04-16 11:15:57 +00:00
Daniel Sanders	11c0c067c2	[mips] Correct callee saved list for the N32 ABI and enable test Summary: Depends on D3339 Reviewers: matheusalmeida, vmedic Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3340 llvm-svn: 206371	2014-04-16 10:23:37 +00:00
Tim Northover	97c5b6fe4f	ARM64: mark x7 as used when an i128 gets shunted onto the stack. The second half of a split i128 was ending up in x7, which is not a good thing. This is another part of PR19432. llvm-svn: 206366	2014-04-16 09:03:25 +00:00
Tim Northover	863a789a99	DAGCombiner: don't optimise non-existant litpool load This particular DAG combine is designed to kick in when both ConstantFPs will end up being loaded via a litpool, however those nodes have a semi-legal status, dictated by isFPImmLegal so in some cases there wouldn't have been a litpool in the first place. Don't try to be clever in those circumstances. Picked up while merging some AArch64 tests. llvm-svn: 206365	2014-04-16 09:03:09 +00:00
Timur Iskhodzhanov	42b1b517ce	Simplify a static_assert so VS2013 can build it llvm-svn: 206363	2014-04-16 08:30:32 +00:00
Saleem Abdulrasool	057094c6f6	COFF: fix an off by one error Adjust the tests to validate the number of auxiliary entries used to store the filename. Thanks to majnemer's sharp eye for catching the missing - 1 in the round up calculation. llvm-svn: 206359	2014-04-16 06:22:53 +00:00
Craig Topper	abb4ac7f87	Convert SelectionDAG::getVTList to use ArrayRef llvm-svn: 206357	2014-04-16 06:10:51 +00:00
Craig Topper	ada0857679	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206356	2014-04-16 04:21:27 +00:00
Saleem Abdulrasool	a2bf05aa2f	COFF: add support for .file symbols Add support for emitting .file records. This is mostly a quality of implementation change (more complete support for COFF file emission) that was noticed while working on COFF file emission for Windows on ARM. A .file record is emitted as a symbol with storage class FILE (103) and the name ".file". A series of auxiliary format 4 records follow which contain the file name. The filename is stored as an ANSI string and is padded with NULL if the length is not a multiple of COFF::SymbolSize (18). llvm-svn: 206355	2014-04-16 04:15:32 +00:00
Saleem Abdulrasool	0d3d6c45ef	Target: whitespace llvm-svn: 206353	2014-04-16 04:15:25 +00:00
Matt Arsenault	4e46665a80	R600: Expand sign extension of vectors. Setting vector types to expand will result in scalarization on pre SI hw, as those gpus don't have vector shifts either. Expand also i32 vectors, this helps llvm make the correct decision about scalarizing the vector ops. v2: move setOperation() calls to R600ISelLowering.cpp. cleanup the SI code to make it obvious that this patch does is nop for SI Patch by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 206348	2014-04-16 01:41:30 +00:00
Jim Grosbach	36c6a50512	[ARM64,C++11] Tidy up branch relaxation a bit w/ c++11. No functional change. llvm-svn: 206344	2014-04-16 00:42:46 +00:00
Jim Grosbach	01fc5887ad	ARM64: Nuke some dead code. Missed in previous commit. llvm-svn: 206343	2014-04-16 00:42:43 +00:00
Jim Grosbach	80633094f8	[ARM64,C++11] Clean up the ARM64 LOH collection pass. Range'ify a bunch of loops, mainly. As a result, we have a variety of objects via reference rather than by pointer, so propogate that through the various helper functions where it makes sense. llvm-svn: 206337	2014-04-15 22:57:02 +00:00
Matt Arsenault	e500e32939	R600/SI: Print code size along with used registers llvm-svn: 206336	2014-04-15 22:40:47 +00:00
Matt Arsenault	4d7d38333b	R600/SI: Print more immediates in hex format Print in decimal for inline immediates, and hex otherwise. Use hex always for offsets in addressing offsets. This approximately matches what the shader compiler does. llvm-svn: 206335	2014-04-15 22:32:49 +00:00
Matt Arsenault	fcf86c5417	R600/SI: Cleanup parsing of register names. Try to figure out the class and number of subregisters. llvm-svn: 206334	2014-04-15 22:32:42 +00:00
Matt Arsenault	470acd81a8	R600/SI: Fix loads of i1 llvm-svn: 206330	2014-04-15 22:28:39 +00:00
Tobias Grosser	8d941ef30d	RegionInfo: Do not access a value that was just moved away This fixes a regression introduced in r206310. llvm-svn: 206328	2014-04-15 22:09:36 +00:00
Akira Hatanaka	3d90f99d1a	Make FastISel::SelectInstruction return before target specific fast-isel code handles Intrinsic::trap if TargetOptions::TrapFuncName is set. This fixes a bug in which the trap function was not taken into consideration when a program was compiled without optimization (at -O0). <rdar://problem/16291933> llvm-svn: 206323	2014-04-15 21:30:06 +00:00
Andrea Di Biagio	aac2eac4c2	[X86] Improve the lowering of packed shifts by constant build_vector. This patch teaches the backend how to efficiently lower logical and arithmetic packed shifts on both SSE and AVX/AVX2 machines. When possible, instead of scalarizing a vector shift, the backend should try to expand the shift into a sequence of two packed shifts by immedate count followed by a MOVSS/MOVSD. Example (v4i32 (srl A, (build_vector < X, Y, Y, Y>))) Can be rewritten as: (v4i32 (MOVSS (srl A, <Y,Y,Y,Y>), (srl A, <X,X,X,X>))) [with X and Y ConstantInt] The advantage is that the two new shifts from the example would be lowered into X86ISD::VSRLI nodes. This is always cheaper than scalarizing the vector into four scalar shifts plus four pairs of vector insert/extract. llvm-svn: 206316	2014-04-15 19:30:48 +00:00
Quentin Colombet	72dad56c53	[ARM64] Set default CPU to generic instead of cyclone. llvm-svn: 206313	2014-04-15 19:08:46 +00:00
Robert Lougher	a9bf2463b9	Revert r191049/r191059 as it can produce wrong code (see PR17975). It has already been reverted on the 3.4 branch in r196521. llvm-svn: 206311	2014-04-15 18:34:24 +00:00
David Blaikie	ec649acb82	Use unique_ptr to manage ownership of child Regions within llvm::Region llvm-svn: 206310	2014-04-15 18:32:43 +00:00
Julien Lerouge	be4fe32eb8	Add lifetime markers for allocas created to hold byval arguments, make them appear in the InlineFunctionInfo. llvm-svn: 206308	2014-04-15 18:06:46 +00:00
Julien Lerouge	957e91c4d8	Split byval argument initialization so the memcpy(s) are injected at the beginning of the first new block after inlining. llvm-svn: 206307	2014-04-15 18:01:54 +00:00
Duncan P. N. Exon Smith	2b69189c9c	LTO: Add more loop simplification passes to LTO Similar to r202051, add missing loop simplification passes to the LTO optimization pipeline. Patch by Rafael Espindola. llvm-svn: 206306	2014-04-15 17:48:15 +00:00
Duncan P. N. Exon Smith	0d640014ff	verify-di: Add back braces for MSVC compatability Fixup after r206300. <rdar://problem/15500563> llvm-svn: 206305	2014-04-15 17:28:26 +00:00
Duncan P. N. Exon Smith	6ef5f284d6	verify-di: Implement DebugInfoVerifier Implement DebugInfoVerifier, which steals verification relying on DebugInfoFinder from Verifier. - Adds LegacyDebugInfoVerifierPassPass, a ModulePass which wraps DebugInfoVerifier. Uses -verify-di command-line flag. - Change verifyModule() to invoke DebugInfoVerifier as well as Verifier. - Add a call to createDebugInfoVerifierPass() wherever there was a call to createVerifierPass(). This implementation as a module pass should sidestep efficiency issues, allowing us to turn debug info verification back on. <rdar://problem/15500563> llvm-svn: 206300	2014-04-15 16:27:38 +00:00
Duncan P. N. Exon Smith	67b44da0dd	verify-di: split out VerifierSupport Split out assertion and output helpers from Verifier in preparation for writing the DebugInfoVerifier. <rdar://problem/15500563> llvm-svn: 206299	2014-04-15 16:27:32 +00:00
David Blaikie	0afad5e8bc	Use unique_ptr to manage PassInfo instances in the PassRegistry llvm-svn: 206297	2014-04-15 15:17:14 +00:00
NAKAMURA Takumi	e1f3583b96	MipsAsmParser.cpp: Fix vg_leak in MipsOperand::CreateMem(). Mem.Base is managed by k_Memory itself. llvm-svn: 206293	2014-04-15 14:13:21 +00:00
NAKAMURA Takumi	bd524ef129	MipsAsmParser::ParseRegister(): Be responsible to delete an Operand on a temporary Operands. llvm-svn: 206292	2014-04-15 14:06:27 +00:00
Tim Northover	ebb3123a5f	AArch64/ARM64: add missing pattern for extending load. llvm-svn: 206290	2014-04-15 14:00:19 +00:00
Tim Northover	cbcb7a37f7	AArch64/ARM64: only mangle MOVZ/MOVN during encoding when needed Sometimes we need emit the bits that would actually be a MOVN when producing a relocated MOVZ instruction (don't ask). But not always, a check which ARM64 got wrong until now. llvm-svn: 206289	2014-04-15 14:00:15 +00:00
Tim Northover	6e27b8ded5	AArch64/ARM64: add support for large code-model jump tables. I've left the MachO CodeGen as it is, there's a reasonable chance it should use the GOT like ConstPools, but I'm not certain. llvm-svn: 206288	2014-04-15 14:00:11 +00:00
Tim Northover	221b583951	AArch64/ARM64: add patterns for various commutations of FNMADD. llvm-svn: 206287	2014-04-15 14:00:06 +00:00
Tim Northover	b37cff1ae2	AArch64/ARM64: add half as a storage type on ARM64. This brings it into line with the AArch64 behaviour and should open the way for certain OpenCL features. llvm-svn: 206286	2014-04-15 14:00:03 +00:00
Tim Northover	80a70a265a	AArch64/ARM64: copy patterns for fixed-point conversions Code is mostly copied directly across, with a slight extension of the ISelDAGToDAG function so that it can cope with the floating-point constants being behind a litpool. llvm-svn: 206285	2014-04-15 13:59:57 +00:00
Tim Northover	f70577b1cd	ARM64: add constraints to various FastISel operations llvm-svn: 206284	2014-04-15 13:59:53 +00:00
Tim Northover	2f553f326a	FastISel: constrain the RegClass of operands when emitting instructions. ARM64 suffered multiple -verify-machineinstr failures (principally over the xsp/xzr issue) because FastISel was completely ignoring which subset of the general-purpose registers each instruction required. More fixes are coming in ARM64 specific FastISel, but this should cover the generic problems. llvm-svn: 206283	2014-04-15 13:59:49 +00:00
Tim Northover	20603726ce	AArch64/ARM64: add dp tests from AArch64 llvm-svn: 206281	2014-04-15 13:59:40 +00:00
NAKAMURA Takumi	6091e1aed5	ARM64AsmParser.cpp: Fix vg_leak in MC/ARM64/fp-encoding.s. llvm-svn: 206279	2014-04-15 13:22:11 +00:00
Stepan Dyatkovskiy	95cdac43af	Optional hash symbol feature support for ARM64 http://reviews.llvm.org/D3328 llvm-svn: 206276	2014-04-15 11:43:09 +00:00
Vladimir Medic	16d671a413	Current definition of subtract with immediate instruction aliases uses CodeGenOnly defined instructions and post matcher expansion methods to emit real instructions add with immediate. However, they can directly alias add with immediate instruction and remove unnecessary definitions and code in MipsAsmParser.cpp. This patch makes no change in functionality, just removes unnecessary definitions and code. llvm-svn: 206272	2014-04-15 10:14:49 +00:00
Chandler Carruth	785a9228b6	[Allocator] Finally, finish nuking the redundant code that led me here by removing the MallocSlabAllocator entirely and just using MallocAllocator directly. This makes all off these allocators expose and utilize the same core interface. The only ugly part of this is that it exposes the fact that the JIT allocator has no real handling of alignment, any more than the malloc allocator does. =/ It would be nice to fix both of these to support alignments, and then to leverage that in the BumpPtrAllocator to do less over allocation in order to manually align pointers. But, that's another patch for another day. This patch has no functional impact, it just removes the somewhat meaningless wrapper around MallocAllocator. llvm-svn: 206267	2014-04-15 09:44:09 +00:00
Alexey Bataev	b97f9e8698	D3348 - [BUG] "Rotate Loop" pass kills "llvm.vectorizer.enable" metadata llvm-svn: 206266	2014-04-15 09:37:30 +00:00
NAKAMURA Takumi	df72764599	X86JITInfo: [x86] Rework r206240, X86CompilationCallback_SSE() should be called for SSE-enabled code generator, even if LLVM is not built with -msse. llvm-svn: 206261	2014-04-15 08:28:23 +00:00
Nick Lewycky	aad475b324	Break PseudoSourceValue out of the Value hierarchy. It is now the root of its own tree containing FixedStackPseudoSourceValue (which you can use isa/dyn_cast on) and MipsCallEntry (which you can't). Anything that needs to use either a PseudoSourceValue* and Value* is strongly encouraged to use a MachinePointerInfo instead. llvm-svn: 206255	2014-04-15 07:22:52 +00:00
Craig Topper	2617dccea2	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206252	2014-04-15 06:32:26 +00:00
David Blaikie	7620b31568	Use unique_ptr to manage TypePromotionActions owned by TypePromotionTransaction. llvm-svn: 206250	2014-04-15 06:17:44 +00:00
David Blaikie	dc72f9774d	Use unique_ptr to manage ownership of GCFunctionInfos in GCStrategy llvm-svn: 206249	2014-04-15 06:07:26 +00:00
David Blaikie	ec528ee93f	Use unique_ptr for the result of Registry entries. llvm-svn: 206248	2014-04-15 05:53:26 +00:00
David Blaikie	88368bae4c	Use unique_ptr to manage ownership of GCStrategy objects in GCMetadata llvm-svn: 206246	2014-04-15 05:34:49 +00:00
David Blaikie	f564ab6267	Use unique_ptr for section/segment ownership in WinCOFFObjectWriter llvm-svn: 206245	2014-04-15 05:25:03 +00:00
David Blaikie	bb97e1b52e	Use unique_ptr to own MCFunctions within MCModule. MCModule's ctor had to be moved out of line so the definition of MCFunction was available. (ctor requires the dtor of members (in case the ctor throws) which required access to the dtor of MCFunction) llvm-svn: 206244	2014-04-15 05:15:19 +00:00
Craig Topper	9f008867c0	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206243	2014-04-15 04:59:12 +00:00
David Blaikie	4a7a050910	Use std::unique_ptr to manage MCBasicBlocks in MCFunction. llvm-svn: 206242	2014-04-15 04:56:29 +00:00
Lang Hames	a1bc0f5662	[MC] Require an MCContext when constructing an MCDisassembler. This patch re-introduces the MCContext member that was removed from MCDisassembler in r206063, and requires that an MCContext be passed in at MCDisassembler construction time. (Previously the MCContext member had been initialized in an ad-hoc fashion after construction). The MCCContext member can be used by MCDisassembler sub-classes to construct constant or target-specific MCExprs. This patch updates disassemblers for in-tree targets, and provides the MCRegisterInfo instance that some disassemblers were using through the MCContext (previously those backends were constructing their own MCRegisterInfo instances). llvm-svn: 206241	2014-04-15 04:40:56 +00:00
NAKAMURA Takumi	33ec29ace9	X86JITInfo: [x86] Use X86CompilationCallback_SSE() along; not Subtarget->hasSSE1() but __SSE__, the flag that LLVM libraries are compiled The callback calls internal LLVM JIT libraries. It may be built with -msse (or above). FIXME: JIT may use "host" instead of "generic" by default. llvm-svn: 206240	2014-04-15 04:12:21 +00:00
Jim Grosbach	2c6ff0cbb4	[ARM64,C++11]: Range'ify the dead-register-definition pass. Range-based for loops. No functional change intended. llvm-svn: 206239	2014-04-15 02:14:09 +00:00
Quentin Colombet	97c05b52b4	[MC] Emit an error if cfi_startproc is used before a symbol is defined. Currently, we bind those directives with the last symbol, so if none has been defined, this would lead to a crash of the compiler. <rdar://problem/15939159> llvm-svn: 206236	2014-04-15 01:17:45 +00:00
Quentin Colombet	f9b61e6afd	[ARM64][MC] Set the default CPU string to generic. llvm-svn: 206228	2014-04-15 00:28:39 +00:00
David Blaikie	b818418689	Use std::unique_ptr for DIE children Got bored, removed some manual memory management. Pushed references (rather than pointers) through a few APIs rather than replacing *x with x.get(). llvm-svn: 206222	2014-04-14 22:45:02 +00:00
Jim Grosbach	a344b6c314	X86: Nuke one more CPU autodetect blurb. Missed one in r206094. This brings MC and TargetMachine back into sync. llvm-svn: 206220	2014-04-14 22:23:30 +00:00
David Blaikie	9027abae53	Change argument order and add explanatory comment to r206130 Changes requested in code review by Eric Christopher of r206130. llvm-svn: 206219	2014-04-14 22:23:06 +00:00
Eric Christopher	b45b4814f6	Use FrameSetup on frame instructions for the Mips port. I can't seem to get a testcase to show a difference here, but it's part of the unconditional-br.ll line table weirdness. llvm-svn: 206218	2014-04-14 22:21:22 +00:00
Matt Arsenault	fed3dc8dc6	Revert "Revert r206045, "Fix shift by constants for vector."" Fix cases where the Value itself is used, and not the constant value. llvm-svn: 206214	2014-04-14 21:50:37 +00:00
Quentin Colombet	4097c8959c	[ARM64][MC] Set the default CPU to cyclone when initilizating the MC layer. This matches that ARM64Subtarget does for now. This is related to <rdar://problem/16573920> llvm-svn: 206211	2014-04-14 21:25:53 +00:00
Adrian Prantl	8714aaf0a5	Re-apply r206096 after investigating the gdb buildbot failure. Thanks to dblaikie for updating the testcase! Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions, therefore, their declaration cannot have one DW_AT_linkage_name. The specific instances however can and should have that attribute. This patch reorders the code in DwarfUnit::getOrCreateSubprogramDIE() to emit linkage names for C/Dtors. rdar://problem/16362674. llvm-svn: 206210	2014-04-14 21:16:04 +00:00
Louis Gerbarg	cfc05450e5	Fix for codegen bug that could cause illegal cmn instruction generation In rare cases the dead definition elimination pass code can cause illegal cmn instructions when it replaces dead registers on instructions that use unmaterialized frame indexes. This patch disables the dead definition optimization for instructions which include frame index operands. rdar://16438284 llvm-svn: 206208	2014-04-14 21:05:05 +00:00
Louis Gerbarg	6d2e3c638f	Add a flag to disable the ARM64DeadRegisterDefinitionsPass This patch adds a -arm64-dead-def-elimination flag so that it is possible to disable dead definition elimination. Includes test case. llvm-svn: 206207	2014-04-14 21:05:02 +00:00
James Molloy	d60571bad7	[ARM64] Port over missing subtarget features, and CPU definitions from AArch64. llvm-svn: 206198	2014-04-14 17:38:00 +00:00
James Molloy	09a53b960b	[ARM64] Add big endian target arm64_be. llvm-svn: 206197	2014-04-14 17:37:53 +00:00
Kaelyn Takata	226794ad97	Replace two calls to object::symbol_iterator::increment(), which had been removed in r200442. llvm-svn: 206196	2014-04-14 17:26:50 +00:00
Kaelyn Takata	55a3cffabe	Remove a variable from r206192 that is only used in an assert. llvm-svn: 206195	2014-04-14 17:21:50 +00:00
Akira Hatanaka	5638b89944	Fix a bug in which BranchProbabilityInfo wasn't setting branch weights of basic blocks inside loops correctly. Previously, BranchProbabilityInfo::calcLoopBranchHeuristics would determine the weights of basic blocks inside loops even when it didn't have enough information to estimate the branch probabilities correctly. This patch fixes the function to exit early if it doesn't see any exit edges or back edges and let the later heuristics determine the weights. This fixes PR18705 and <rdar://problem/15991090>. Differential Revision: http://reviews.llvm.org/D3363 llvm-svn: 206194	2014-04-14 16:56:19 +00:00
Kaelyn Takata	f9d483128c	Fix up MCFixup::getAccessVariant to handle unary expressions. This allows correct relocations to be generated for a symbolic address that is being adjusted by a negative constant. Since r204294, such expressions have triggered undefined behavior when LLVM was built without assertions. Credit goes to Rafael for this patch; I'm submitting it on his behalf as he is on vacation this week. llvm-svn: 206192	2014-04-14 16:50:22 +00:00
Daniel Sanders	863c35a358	[mips] Fix fcopysign for MIPS-IV and add the test. Summary: This was another incorrect use of hasMips64() vs isGP64bit(). Depends on D3344 Reviewers: matheusalmeida, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3347 llvm-svn: 206187	2014-04-14 16:24:12 +00:00
Daniel Sanders	3d84935d28	[mips] Fix more incorrect uses of HasMips64 and isMips64() Summary: - Conditional moves acting on 64-bit GPR's should require MIPS-IV rather than MIPS64 - ISD::MUL, and ISD::MULH[US] should be lowered on all 64-bit ISA's Patch by David Chisnall His work was sponsored by: DARPA, AFRL I've added additional testcases to cover as much of the codegen changes affecting MIPS-IV as I can. Where I've been unable to find an existing MIPS64 testcase that can be re-used for MIPS-IV (mainly tests covering ISD::GlobalAddress and similar), I at least agree that MIPS-IV should behave like MIPS64. Further testcases that are fixed by this patch will follow in my next commit. The testcases from that commit that fail for MIPS-IV without this patch are: LLVM :: CodeGen/Mips/2010-07-20-Switch.ll LLVM :: CodeGen/Mips/cmov.ll LLVM :: CodeGen/Mips/eh-dwarf-cfa.ll LLVM :: CodeGen/Mips/largeimmprinting.ll LLVM :: CodeGen/Mips/longbranch.ll LLVM :: CodeGen/Mips/mips64-f128.ll LLVM :: CodeGen/Mips/mips64directive.ll LLVM :: CodeGen/Mips/mips64ext.ll LLVM :: CodeGen/Mips/mips64fpldst.ll LLVM :: CodeGen/Mips/mips64intldst.ll LLVM :: CodeGen/Mips/mips64load-store-left-right.ll LLVM :: CodeGen/Mips/sint-fp-store_pattern.ll Reviewers: dsanders Reviewed By: dsanders CC: matheusalmeida Differential Revision: http://reviews.llvm.org/D3343 llvm-svn: 206183	2014-04-14 15:44:42 +00:00
James Molloy	951e529f66	Teach llvm-lto to respect the given RelocModel. Patch by Nick Tomlinson! llvm-svn: 206177	2014-04-14 13:54:16 +00:00
Tim Northover	cb9c3cfb58	ARM64: remove buggy REV16 pattern. The 32-bit pattern is still valid: 0123 -> 3210 -> 1032. llvm-svn: 206172	2014-04-14 12:59:52 +00:00
Tim Northover	b6abe806c7	AArch64/ARM64: enable directcond.ll test on ARM64. Code change is because optimizeCompareInstr didn't know how to pull the condition code out of FCSEL instructions. llvm-svn: 206171	2014-04-14 12:51:06 +00:00
Tim Northover	0d7bd4f444	ARM64: add patterns for csXYZ with reversed operands. AArch64 tests for this, and it's obviously a good idea. Have to invert the condition code, of course. llvm-svn: 206170	2014-04-14 12:51:02 +00:00
Tim Northover	2f48303436	ARM64: add support for AArch64's addsub_ext.ll There was one definite issue in ARM64 (the off-by-1 check for whether a shift could be folded in) and one difference that is probably correct: ARM64 didn't fold nodes with multiple uses into the arithmetic operations unless optimising for code size. llvm-svn: 206168	2014-04-14 12:50:50 +00:00
Tim Northover	23b1f08282	ARM64: optimise (cmp x, (sub 0, y)) to (cmn x, y). This transformation is only valid when being used for an EQ or NE comparison since the flags change otherwise. llvm-svn: 206167	2014-04-14 12:50:47 +00:00
Richard Osborne	da16ff47cd	[XCore] Don't create invalid MKMSK instructions inside loadImmediate(). Summary: Previously loadImmediate() would produce MKMSK instructions with invalid immediate values such as mkmsk r0, 9. Fix this by checking the mask size is valid. Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3289 llvm-svn: 206163	2014-04-14 12:30:35 +00:00
NAKAMURA Takumi	58ad0c87f8	Whitespace. llvm-svn: 206154	2014-04-14 07:03:13 +00:00
NAKAMURA Takumi	26afa982ec	Revert r206045, "Fix shift by constants for vector." It broke some builders, at least, i686. llvm-svn: 206153	2014-04-14 07:02:57 +00:00
Chandler Carruth	1cc9023e7f	[Allocator] Hoist the external helper function into a namespace scope declaration. GCC 4.7 appears to get hopelessly confused by declaring this function within a member function of a class template. Go figure. llvm-svn: 206152	2014-04-14 06:42:56 +00:00
Hal Finkel	56bf297e3a	Don't assert in BasicTTI::getMemoryOpCost for non-simple types BasicTTI::getMemoryOpCost must explicitly check for non-simple types; setting AllowUnknown=true with TLI->getSimpleValueType is not sufficient because, for example, non-power-of-two vector types return non-simple EVTs (not MVT::Other). llvm-svn: 206150	2014-04-14 05:59:09 +00:00
Chandler Carruth	eed3466a42	[Allocator] Make the underlying allocator a template instead of an abstract interface. The only user of this functionality is the JIT memory manager and it is quite happy to have a custom type here. This removes a virtual function call and a lot of unnecessary abstraction from the common case where this is just a very thin vaneer around a call to malloc. Hopefully still no functionality changed here. =] llvm-svn: 206149	2014-04-14 05:11:27 +00:00
Chandler Carruth	f5babf97ff	[Allocator] Switch the BumpPtrAllocator to use a vector of pointers to slabs rather than embedding a singly linked list in the slabs themselves. This has a few advantages: - Better utilization of the slab's memory by not wasting 16-bytes at the front. - Simpler allocation strategy by not having a struct packed at the front. - Avoids paging every allocated slab in just to traverse them for deallocating or dumping stats. The latter is the really nice part. Folks have complained from time to time bitterly that tearing down a BumpPtrAllocator, even if it doesn't run any destructors, pages in all of the memory allocated. Now it won't. =] Also resolves a FIXME with the scaling of the slab sizes. The scaling now disregards specially sized slabs for allocations larger than the threshold. llvm-svn: 206147	2014-04-14 03:55:11 +00:00
Serge Pavlov	b5f3ddc7a1	Use APInt arithmetic, fixed typo. Thanks to Benjamin Kramer for noticing that. llvm-svn: 206144	2014-04-14 02:20:19 +00:00
Craig Topper	c0196b1b40	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206142	2014-04-14 00:51:57 +00:00
Hal Finkel	0192cbac66	[PowerPC] [Constant Hoisting] Enable constant hoisting on PPC Implements the various TTI functions to enable constant hoisting on PPC. The only significant test-suite change is this: MultiSource/Benchmarks/VersaBench/bmm/bmm - 20% speedup (which essentially reverses the slowdown from r206120). llvm-svn: 206141	2014-04-13 23:02:40 +00:00
Saleem Abdulrasool	2c08051cc5	MC: check machine magic when applying offset adjustments The values for the relocation type can (and do) overlap across various architectures. When performing an adjustment of the emitted relocation in the final object file, check that the file magic matches the target for which the relocation type is valid (e.g. a I386 relocation is only applied to an X86 object file, and an AMD64 relocation is only applied to an X86_64 object file). This was noticed while adding support for ARM WinCOFF object file emission. A test case for this is not really possible as the values for REL32 do not overlap on I386 and AMD64, which is why this was never noticed in practice. The ARM WinCOFF emission is not yet ready to merge into the tree. llvm-svn: 206138	2014-04-13 20:47:55 +00:00
Serge Pavlov	4bb54d51c8	Recognize test for overflow in integer multiplication. If multiplication involves zero-extended arguments and the result is compared as in the patterns: %mul32 = trunc i64 %mul64 to i32 %zext = zext i32 %mul32 to i64 %overflow = icmp ne i64 %mul64, %zext or %overflow = icmp ugt i64 %mul64 , 0xffffffff then the multiplication may be replaced by call to umul.with.overflow. This change fixes PR4917 and PR4918. Differential Revision: http://llvm-reviews.chandlerc.com/D2814 llvm-svn: 206137	2014-04-13 18:23:41 +00:00
Hal Finkel	d9963c75da	[PowerPC] Fix rlwimi isel when mask is not constant We had been using the known-zero values of the operand of the or to construct the mask for an rlwimi; this is not quite correct, but fine when the mask is constant. When the mask is constant, then the known zeros of the operand must be a superset of the zeros in the mask. However, when the mask is not a constant, then there might be bits in the operand that are not known to be zero that, at runtime, might be zero in the mask. Therefore, we check that any bits not known to be zero are known to be one in the mask. Otherwise, we can't fold the mask with the or and shift. This was revealed as a miscompile of MultiSource/Benchmarks/BitBench/drop3/drop3 when I started experimenting with constant hoisting. llvm-svn: 206136	2014-04-13 17:10:58 +00:00
David Blaikie	269e0fb2e4	Fix instruction debug info location during legalization I found this from a particular GDB test suite case of inlining (something similar is provided as a test case) but came across a few other related cases (other callers of the same functions, and one other instance of the same coding mistake in a separate function). I'm not sure what the best way to test this is (let alone to cover the other cases I discovered), so hopefully this sufficies - open to ideas. llvm-svn: 206130	2014-04-13 06:39:55 +00:00
Craig Topper	bb694de649	[C++11] More 'nullptr' conversion or in some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206129	2014-04-13 04:57:38 +00:00
Lang Hames	0563ca1be8	[X86] unique_ptr'ify one of X86GenericDisassembler's members. llvm-svn: 206127	2014-04-13 04:09:16 +00:00
Hal Finkel	34974ed503	[PowerPC] Implement some additional TLI callbacks Add implementations of: bool isLegalICmpImmediate(int64_t Imm) const bool isLegalAddImmediate(int64_t Imm) const bool isTruncateFree(Type Ty1, Type Ty2) const bool isTruncateFree(EVT VT1, EVT VT2) const bool shouldConvertConstantLoadToIntImm(const APInt &Imm, Type *Ty) const Unfortunately, this regresses counter-register-based loop formation because some of the loops now end up in forms were SE cannot compute loop counts. However, nevertheless, the test-suite results favor committing: SingleSource/Benchmarks/BenchmarkGame/puzzle: 26% speedup MultiSource/Benchmarks/FreeBench/analyzer/analyzer: 21% speedup MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan: 20% speedup SingleSource/Benchmarks/Polybench/linear-algebra/kernels/trisolv/trisolv: 19% speedup SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gesummv/gesummv: 15% speedup MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2: 2% speedup MultiSource/Benchmarks/VersaBench/bmm/bmm: 26% slowdown llvm-svn: 206120	2014-04-12 21:52:38 +00:00
Benjamin Kramer	44a53da346	Spell the specialization namespace correctly. Not sure why clang didn't diagnose this (GCC does). llvm-svn: 206117	2014-04-12 18:45:24 +00:00
Benjamin Kramer	30120c0626	Make helper static and place random global into the llvm namespace. llvm-svn: 206116	2014-04-12 18:39:57 +00:00
Benjamin Kramer	502b9e1d7f	Retire llvm::array_endof in favor of non-member std::end. While there make array_lengthof constexpr if we have support for it. llvm-svn: 206112	2014-04-12 16:15:53 +00:00
Benjamin Kramer	6b841f18d3	Move MDBuilder's methods out of line. Making them inline was a historical accident, they're neither hot nor templated. llvm-svn: 206109	2014-04-12 14:26:59 +00:00
David Blaikie	ee11f22640	PR13337: Omit DW_TAG_restrict_type when compiling for DWARF2 DWARF3 introduced DW_TAG_restrict_type, so avoid using it in prior versions. llvm-svn: 206105	2014-04-12 05:35:59 +00:00
Adrian Prantl	d3dd11d628	Revert "Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions," This reverts commit 206096 while I investigate why this broke the gdb buildbot. llvm-svn: 206103	2014-04-12 04:25:02 +00:00
Juergen Ributzka	cf03068d91	[ARM64] Never hoist the shift value of a shift instruction. There is no need to check if we want to hoist the immediate value of an shift instruction. Simply return TCC_Free right away. llvm-svn: 206101	2014-04-12 02:53:51 +00:00
Juergen Ributzka	6e17aa45a3	[ARM64] Fix the cost model for cheap large constants. Originally the cost model would give up for large constants and just return the maximum cost. This is not what we want for constant hoisting, because some of these constants are large in bitwidth, but are still cheap to materialize. This commit fixes the cost model to either return TCC_Free if the cost cannot be determined, or accurately calculate the cost even for large constants (bitwidth > 128). This fixes <rdar://problem/16591573>. llvm-svn: 206100	2014-04-12 02:36:28 +00:00
David Blaikie	efc403b91b	Use dwarf::Tag rather than unsigned for DIE::Tag to make debugging easier. Nice to be able to just print out the Tag and have the debugger print dwarf::DW_TAG_subprogram or whatever, rather than an int. It's a bit finicky (for example DIDescriptor::getTag still returns unsigned) because some places still handle real dwarf tags + our fake tags (one day we'll remove the fake tags, hopefully). llvm-svn: 206098	2014-04-12 02:24:04 +00:00
Adrian Prantl	1f2f3c3434	Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions, therefore, their declaration cannot have one DW_AT_linkage_name. The specific instances however can and should have that attribute. This patch reorders the code in DwarfUnit::getOrCreateSubprogramDIE() to emit linkage names for C/Dtors. rdar://problem/16362674. llvm-svn: 206096	2014-04-12 01:44:42 +00:00
Jim Grosbach	48551fbdba	X86: Remove TargetMachine CPU auto-detection. This logic is properly in the realm of whatever is creating the TargetMachine. This makes plain 'llc foo.ll' consistent across heterogenous machines. llvm-svn: 206094	2014-04-12 01:34:29 +00:00
Hal Finkel	3b48d08f54	Reenable use of TBAA during CodeGen We had disabled use of TBAA during CodeGen (even when otherwise using AA) because the ptrtoint/inttoptr used by CGP for address sinking caused BasicAA to miss basic type punning that it should catch (and, thus, we'd fail to override TBAA when we should). However, when AA is in use during CodeGen, CGP now uses normal GEPs and bitcasts, instead of ptrtoint/inttoptr, when doing address sinking. As a result, BasicAA should be able to make us do the right thing in the face of type-punning, and it seems safe to enable use of TBAA again. self-hosting seems fine on PPC64/Linux on the P7, with TBAA enabled and -misched=shuffle. Note: We still don't update TBAA when merging stack slots, although because BasicAA should now catch all such cases, this is no longer a blocking issue. Nevertheless, I plan to commit code to deal with this properly in the near future. llvm-svn: 206093	2014-04-12 01:26:00 +00:00
Hal Finkel	c3998306f4	Add the ability to use GEPs for address sinking in CGP The current memory-instruction optimization logic in CGP, which sinks parts of the address computation that can be adsorbed by the addressing mode, does this by explicitly converting the relevant part of the address computation into IR-level integer operations (making use of ptrtoint and inttoptr). For most targets this is currently not a problem, but for targets wishing to make use of IR-level aliasing analysis during CodeGen, the use of ptrtoint/inttoptr is a problem for two reasons: 1. BasicAA becomes less powerful in the face of the ptrtoint/inttoptr 2. In cases where type-punning was used, and BasicAA was used to override TBAA, BasicAA may no longer do so. (this had forced us to disable all use of TBAA in CodeGen; something which we can now enable again) This (use of GEPs instead of ptrtoint/inttoptr) is not currently enabled by default (except for those targets that use AA during CodeGen), and so aside from some PowerPC subtargets and SystemZ, there should be no change in behavior. We may be able to switch completely away from the ptrtoint/inttoptr sinking on all targets, but further testing is required. I've doubled-up on a number of existing tests that are sensitive to the address sinking behavior (including some store-merging tests that are sensitive to the order of the resulting ADD operations at the SDAG level). llvm-svn: 206092	2014-04-12 00:59:48 +00:00
Chad Rosier	4ec124bc3e	[AArch64] Implement the isLegalAddressingMode and getScalingFactorCost APIs. llvm-svn: 206089	2014-04-12 00:14:23 +00:00
Duncan P. N. Exon Smith	689a50736e	blockfreq: Rename BlockFrequencyImpl to BlockFrequencyInfoImpl This is a shared implementation class for BlockFrequencyInfo and MachineBlockFrequencyInfo, not for BlockFrequency, a related (but distinct) class. No functionality change. <rdar://problem/14292693> llvm-svn: 206083	2014-04-11 23:20:58 +00:00
Duncan P. N. Exon Smith	37bd529964	blockfreq: Use getSuccessorIndex() No functionality change. <rdar://problem/14292693> llvm-svn: 206082	2014-04-11 23:20:52 +00:00
David Blaikie	b60e61c15f	Pull out a named variable for the cached section names to aid readability. Based on a code review suggestion from Eric Christopher in r205990 llvm-svn: 206080	2014-04-11 22:49:14 +00:00
Louis Gerbarg	b9a0551862	Add ARM64 CLS patterns This patch adds patterns to generate the cls instruction ARM64. Includes tests for 64 bit and 32 bit operands. rdar://15611957 llvm-svn: 206079	2014-04-11 22:27:58 +00:00
David Blaikie	e06e801a67	Format fixes for r205990 llvm-svn: 206078	2014-04-11 22:11:50 +00:00
Quentin Colombet	4344da1c71	[RegAllocGreedy][Last Chance Recoloring] Change the name of the exhaustive search option. fexhaustive-register-search => exhaustive-register-search 'f' is a Clang thing! This is related to PR18747. llvm-svn: 206075	2014-04-11 21:51:09 +00:00
Quentin Colombet	567e30bc2b	[RegAllocGreedy][Last Chance Recoloring] Addition of -fexhaustive-register-search option to allow an exhaustive search during last chance recoloring. This is related to PR18747 Patch by MAYUR PANDEY <mayur.p@samsung.com>. llvm-svn: 206072	2014-04-11 21:39:44 +00:00
Matt Arsenault	e1f030ca66	R600: Check if a sextload should be used for parameter loads. Through some oddity where truncate (sextload x) isn't folded into an anyextload for vectors, the sextload remains if the vector isn't immediately scalarized. This keeps the expected zextload instructions in the kernel-args test when small type vectors aren't scalarized. llvm-svn: 206070	2014-04-11 20:59:54 +00:00
Lang Hames	95400e22f9	Remove redundant symbolization support from MCDisassembler interface. MCDisassembler has an MCSymbolizer member that is meant to take care of symbolizing during disassembly, but it also has several methods that enable the disassembler to do symbolization internally (i.e. without an attached symbolizer object). There is no need for this duplication, but ARM64 had been making use of it. This patch moves the ARM64 symbolization logic out of ARM64Disassembler and into an ARM64ExternalSymbolizer class, and removes the duplicated MCSymbolizer functionality from the MCDisassembler interface. Symbolization will now be done exclusively through MCSymbolizers. There should be no impact on disassembly for any platform, but this allows us to tidy up the MCDisassembler interface and simplify the process of (and invariants related to) disassembler setup. llvm-svn: 206063	2014-04-11 20:07:58 +00:00
Quentin Colombet	13c310e3ac	[Register Coalescer] Fix wrong live-range information with rematerialization. When rematerializing an instruction that defines a super register that would be used by a physical subregisters we use the related physical super register for the definition. To keep the live-range information accurate, all the defined subregisters must be marked as dead def, otherwise the register allocation may miss some interferences. Working on a reduced test-case! <rdar://problem/16582185> llvm-svn: 206060	2014-04-11 19:45:07 +00:00
Matt Arsenault	0cb92e133f	R600/SI: Refactor SOPC classes slightly. Better match what is done for VOPC to eventually prefer selecting these. llvm-svn: 206048	2014-04-11 19:25:18 +00:00
Rafael Espindola	9ef844165a	Don't lose the thumb bit by using relocations with sections. This fixes a regression from r205076. llvm-svn: 206047	2014-04-11 19:18:01 +00:00
Matt Arsenault	173a1e577c	Fix shift by constants for vector. ashr <N x iM>, <N x iM> M -> undef llvm-svn: 206045	2014-04-11 17:57:53 +00:00
Adrian Prantl	3bdcb52dd1	Debug info: Store the DIVariable in DebugLocEntry also for constants, so DwarfDebug::emitDebugLocEntry can emit them with the correct signedness. rdar://problem/15928306 llvm-svn: 206042	2014-04-11 17:49:47 +00:00
Matt Arsenault	9ec3cf2c8a	Move ExtractVectorElements to SelectionDAG. This seems generally useful, and makes sense to go along with SplitVector. llvm-svn: 206041	2014-04-11 17:47:30 +00:00
Tom Stellard	a1a5d9aa2e	SelectionDAG: Use helper function to improve legalization of ISD::MUL The TargetLowering::expandMUL() helper contains lowering code extracted from the DAGTypeLegalizer and allows the SelectionDAGLegalizer to expand more ISD::MUL patterns without having to use a library call. llvm-svn: 206037	2014-04-11 16:12:01 +00:00
Tom Stellard	b3a7fa2d17	SelectionDAG: Factor ISD::MUL lowering code out of DAGTypeLegalizer This code has been moved to a new function in the TargetLowering class called expandMUL(). The purpose of this is to be able to share lowering code between the SelectionDAGLegalize and DAGTypeLegalizer classes. No functionality changed intended. llvm-svn: 206036	2014-04-11 16:11:58 +00:00
Diego Novillo	199de39bf0	Fix use-after-free bug caught by address sanitizer: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap/builds/2959 The location string is returned as a std::string, not a StringRef. llvm-svn: 206032	2014-04-11 13:55:56 +00:00
Simon Atanasyan	42ac0dd3c3	[yaml2obj][ELF] ELF Relocations Support. The patch implements support for both relocation record formats: Elf_Rel and Elf_Rela. It is possible to define relocation against symbol only. Relocations against sections will be implemented later. Now yaml2obj recognizes X86_64, MIPS and Hexagon relocation types. Example of relocation section specification: Sections: - Name: .text Type: SHT_PROGBITS Content: "0000000000000000" AddressAlign: 16 Flags: [SHF_ALLOC] - Name: .rel.text Type: SHT_REL Info: .text AddressAlign: 4 Relocations: - Offset: 0x1 Symbol: glob1 Type: R_MIPS_32 - Offset: 0x2 Symbol: glob2 Type: R_MIPS_CALL16 The patch reviewed by Michael Spencer, Sean Silva, Shankar Easwaran. llvm-svn: 206017	2014-04-11 04:13:39 +00:00
David Blaikie	ceec2bdaa5	Implement depth_first and inverse_depth_first range factory functions. Also updated as many loops as I could find using df_begin/idf_begin - strangely I found no uses of idf_begin. Is that just used out of tree? Also a few places couldn't use df_begin because either they used the member functions of the depth first iterators or had specific ordering constraints (I added a comment in the latter case). Based on a patch by Jim Grosbach. (Jim - you just had iterator_range<T> where you needed iterator_range<idf_iterator<T>>) llvm-svn: 206016	2014-04-11 01:50:01 +00:00
Jim Grosbach	5d049b9732	[c++11] Range'ify use list loops in InstrEmitter. llvm-svn: 206015	2014-04-11 01:13:16 +00:00
Jim Grosbach	e816003d3f	[c++11] Range'ify use list loops in DAGCombiner. llvm-svn: 206014	2014-04-11 01:13:13 +00:00
Jim Grosbach	f77265bfee	[ARM64,C++11] Range'ify use-lists iterators in address type promotion. llvm-svn: 206013	2014-04-11 01:13:10 +00:00

... 4 5 6 7 8 ...

68991 Commits