llvm-project

Commit Graph

Author	SHA1	Message	Date
Elena Demikhovsky	8ac0bf96f0	X86Disassembler - fixed a bug in immediate print llvm-svn: 206953	2014-04-23 07:21:04 +00:00
Stepan Dyatkovskiy	afc364bd51	Integrated assbemler, macros: added 'vararg' argument qualifier support. Note, currently we have no 'vararg' support for darwin macros. llvm-svn: 206951	2014-04-23 06:56:28 +00:00
Kevin Qin	a4ee178762	[ARM64] Enable feature predicates for NEON / FP / CRYPTO. AArch64 has feature predicates for NEON, FP and CRYPTO instructions. This allows the compiler to generate code without using FP, NEON or CRYPTO instructions. llvm-svn: 206949	2014-04-23 06:22:48 +00:00
Chandler Carruth	3f9869a8e2	[LCG] Hoist the logic for forming a new SCC from the top of the DFSStack into a helper function. I plan to re-use it for doing incremental DFS-based updates to the SCCs when we mutate the call graph. llvm-svn: 206948	2014-04-23 06:09:03 +00:00
Chandler Carruth	0b623baeb3	[LCG] Switch the Callee sets to be DenseMaps pointing to the index into the Callee list. This is going to be quite important to prevent removal from going quadratic. No functionality changed at this point, this is one of the refactoring patches I've broken out of my initial work toward mutation updates of the call graph. llvm-svn: 206938	2014-04-23 04:00:17 +00:00
David Blaikie	637cac42ed	Requisite reformatting for previous commit. llvm-svn: 206927	2014-04-22 23:09:36 +00:00
David Blaikie	f9b6a558c8	Push memory ownership of DwarfUnits into clients of DwarfFile. This prompted me to push references through most of DwarfDebug. Sorry for the churn. Honestly it's a bit silly that we're passing around units all over the place like that anyway and I think it's mostly due to the DIE attribute adding utility functions being utilities in DwarfUnit. I should have another go at moving them out of DwarfUnit... llvm-svn: 206925	2014-04-22 22:39:41 +00:00
Kevin Enderby	7ee97cebfc	Change the prototype for MCContext::FatalError() so it can be called from places like MCCodeEmitter() in the MC backend when the MCContext is const. I was going to use this in my change for r206669 but Jim convinced me to use an assert there. But this still is a good tweak. llvm-svn: 206923	2014-04-22 21:42:18 +00:00
David Blaikie	c33b3cdb0c	Use std::unique_ptr to handle ownership of DwarfUnits in DwarfFile. So Chandler - how about those range algorithms? (would really love a dereferencing range adapter for this sort of stuff) llvm-svn: 206921	2014-04-22 21:27:37 +00:00
Rafael Espindola	3e993d0f42	Follow aliases when determining if a symbol is thumb. This fixes pr19484. llvm-svn: 206917	2014-04-22 19:11:07 +00:00
Juergen Ributzka	575bcb770a	[Constant Hoisting] Materialize the constant before the cloned cast instruction. In the case where the constant comes from a cloned cast instruction, the materialization code has to go before the cloned cast instruction. This commit fixes the method that finds the materialization insertion point by making it aware of this case. This fixes <rdar://problem/15532441> llvm-svn: 206913	2014-04-22 18:06:58 +00:00
Juergen Ributzka	a1444b39fb	[Constant Hoisting] Print the instructions in the correct order for debugging. No functional change. llvm-svn: 206912	2014-04-22 18:06:51 +00:00
Rafael Espindola	89992b0d6b	Fix DataLayout::operator==(). Patch by Maks Naumov! llvm-svn: 206911	2014-04-22 17:47:03 +00:00
Kevin Enderby	96918bc406	Fix the assembler to print a better relocatable expression error diagnostic that includes location information. Currently if one has this assembly: .quad (0x1234 + (4 * SOME_VALUE)) where SOME_VALUE is undefined ones gets the less than useful error message with no location information: % clang -c x.s clang -cc1as: fatal error: error in backend: expected relocatable expression With this fix one now gets a more useful error message with location information: % clang -c x.s x.s:5:8: error: expected relocatable expression .quad (0x1234 + (4 * SOME_VALUE)) ^ To do this I plumbed the SMLoc through the MCObjectStreamer EmitValue() and EmitValueImpl() interfaces so it could be used when creating the MCFixup. rdar://12391022 llvm-svn: 206906	2014-04-22 17:27:29 +00:00
David Blaikie	5f1a001071	Simplify address pool index assignment. llvm-svn: 206905	2014-04-22 17:21:40 +00:00
Matt Arsenault	16353871c3	R600: Emit error instead of unreachable on function call llvm-svn: 206904	2014-04-22 16:42:00 +00:00
Tom Stellard	8d6d449756	R600/SI: Reorganize SIInstructions.td llvm-svn: 206902	2014-04-22 16:33:57 +00:00
Elena Demikhovsky	acc5c9e83e	AVX-512: store and truncstore for i1 values llvm-svn: 206897	2014-04-22 14:13:10 +00:00
NAKAMURA Takumi	7a25ca63ba	Remove DOS CRLF. llvm-svn: 206894	2014-04-22 13:35:50 +00:00
Tim Northover	a962398a3f	AArch64/ARM64: make use of ANDS and BICS instructions for comparisons. llvm-svn: 206888	2014-04-22 12:45:42 +00:00
Lang Hames	64f6ebb8a9	[X86] Require HasBMI2 for the new BZHI tablegen patterns. Evidently tablegen doesn't infer this from the HasBMI2 predicate on the BZHI instructions. This should fix the recent bot failures. llvm-svn: 206885	2014-04-22 12:04:53 +00:00
Robert Khasanov	189e7fdcfb	[AVX512] Implemented integer conversions up/down with masking. Added encoding tests. llvm-svn: 206884	2014-04-22 11:36:19 +00:00
Kostya Serebryany	c9a2c17ad3	[asan] Support outline instrumentation for wide types and delete dead code, patch by Yuri Gribov llvm-svn: 206883	2014-04-22 11:19:45 +00:00
Lang Hames	70fa72d340	[X86] Remove Tablegen def of X86bzhi SDNode: It's not needed as of r206879. llvm-svn: 206880	2014-04-22 10:50:46 +00:00
Lang Hames	3067ab2344	[X86] Use tablegen instead of DAG combines to match BZHI instructions, as suggested by Ben Kramer in review of r206738. Thanks again Ben! llvm-svn: 206879	2014-04-22 10:41:56 +00:00
Matheus Almeida	2852af8a00	[mips] Clang-format MipsAsmParser. No functional changes. llvm-svn: 206878	2014-04-22 10:15:54 +00:00
Tim Northover	00b4ee848f	AArch64/ARM64: add patterns for scalar_to_vector/extract pairs llvm-svn: 206876	2014-04-22 10:10:18 +00:00
Tim Northover	978d25f391	ARM: disable emission of __XYZvfp in soft-float environment. The point of these calls is to allow Thumb-1 code to make use of the VFP unit to perform its operations. This is not desirable with -msoft-float, since most of the reasons you'd want that apply equally to the runtime library. rdar://problem/13766161 llvm-svn: 206874	2014-04-22 10:10:09 +00:00
Hao Liu	c636d15284	Fix an infinite loop bug in DAG Combine about keeping transfering between ANY_EXTEND and SIGN_EXTEND. llvm-svn: 206873	2014-04-22 09:57:06 +00:00
Lang Hames	f6f42cac3f	[X86] Don't use BZHI for short masks (>=32 bits). Thanks to Ben Kramer for the review. llvm-svn: 206869	2014-04-22 07:40:34 +00:00
David Blaikie	afd2c6be0e	Revert "Use value semantics to manage DbgVariables rather than dynamic allocation/pointers." This reverts commit r206780. This commit was regressing gdb.opt/inline-locals.exp in the GDB 7.5 test suite. Reverting until I can fix the issue. llvm-svn: 206867	2014-04-22 05:41:06 +00:00
Matt Arsenault	a3c8cde77b	R600: Change how vector truncating stores are packed. Don't introduce new operations on an illegal sub 32-bit type. Do the operations on a 32-bit value, and then use a truncating store. llvm-svn: 206864	2014-04-22 04:11:14 +00:00
Matt Arsenault	5dbd5db518	R600: Make sign_extend_inreg legal. Don't know why I didn't just do this in the first place. llvm-svn: 206862	2014-04-22 03:49:30 +00:00
Jiangning Liu	87486e0bac	[AArch64] Enable global merge pass. llvm-svn: 206861	2014-04-22 03:33:26 +00:00
Duncan P. N. Exon Smith	b3380ea60a	blockfreq: Skip irreducible backedges inside functions The branch that skips irreducible backedges was only active when propagating mass at the top-level. In particular, when propagating mass through a loop recognized by `LoopInfo` with irreducible control flow inside, irreducible backedges would not be skipped. Not sure where that idea came from, but the result was that mass was lost until after loop exit. Added a testcase that covers this case. llvm-svn: 206860	2014-04-22 03:31:53 +00:00
Duncan P. N. Exon Smith	d1aec79d7a	blockfreq: Rename PackagedLoops => Loops llvm-svn: 206859	2014-04-22 03:31:50 +00:00
Duncan P. N. Exon Smith	2984a64bae	blockfreq: Use a pointer for ContainingLoop too llvm-svn: 206858	2014-04-22 03:31:44 +00:00
Duncan P. N. Exon Smith	e1423639bb	blockfreq: Use pointers to loops instead of an index Store pointers directly to loops inside the nodes. This could have been done without changing the type stored in `std::vector<>`. However, rather than computing the number of loops before constructing them (which `LoopInfo` doesn't provide directly), I've switched to a `vector<unique_ptr<LoopData>>`. This adds some heap overhead, but the number of loops is typically small. llvm-svn: 206857	2014-04-22 03:31:37 +00:00
Duncan P. N. Exon Smith	dc2d66e7b3	blockfreq: Implement clear() explicitly This was implicitly with copy assignment before, which fails to actually clear `std::vector<>`'s heap storage. Move assignment would work, but since MSVC can't imply those anyway, explicitly `clear()`-ing members makes more sense. llvm-svn: 206856	2014-04-22 03:31:34 +00:00
Duncan P. N. Exon Smith	cc88ebfa5f	blockfreq: Rename PackagedLoopData => LoopData No functionality change. llvm-svn: 206855	2014-04-22 03:31:31 +00:00
Chandler Carruth	6464826597	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all of the header #include lines, lib/Support edition. llvm-svn: 206847	2014-04-22 03:07:47 +00:00
Chandler Carruth	f58e376d23	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all the header #include lines. This updates most of the miscellaneous other lib/... directories. A few left though. llvm-svn: 206845	2014-04-22 03:04:17 +00:00
Chandler Carruth	964daaaf19	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all of the header #include lines, lib/Transforms/... edition. This one is tricky for two reasons. We again have a couple of passes that define something else before the includes as well. I've sunk their name macros with the DEBUG_TYPE. Also, InstCombine contains headers that need DEBUG_TYPE, so now those headers #define and #undef DEBUG_TYPE around their code, leaving them well formed modular headers. Fixing these headers was a large motivation for all of these changes, as "leaky" macros of this form are hard on the modules implementation. llvm-svn: 206844	2014-04-22 02:55:47 +00:00
Chandler Carruth	f1221bd01b	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all the header #include lines, lib/Analysis/... edition. This one has a bit extra as there were other #define's before #include lines in addition to DEBUG_TYPE. I've sunk all of them as a block. llvm-svn: 206843	2014-04-22 02:48:03 +00:00
Chandler Carruth	84e68b2994	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all of the header #include lines, lib/Target/... edition. llvm-svn: 206842	2014-04-22 02:41:26 +00:00
Chandler Carruth	ff55593c40	[cleanup] Fix two headers where we included a standard library header after including the generated code from tablegen. llvm-svn: 206841	2014-04-22 02:28:45 +00:00
Chandler Carruth	b5e481ac91	[cleanup] Fix another place where we were including the tablegen'ed code of a '.inc' file before including actual headers. In this case we had both duplicated a header's include and were including a standard header. llvm-svn: 206840	2014-04-22 02:25:17 +00:00
Chandler Carruth	d174b72a28	[cleanup] Lift using directives, DEBUG_TYPE definitions, and even some system headers above the includes of generated '.inc' files that actually contain code. In a few targets this was already done pretty consistently, but it wasn't done really consistently anywhere. It is strictly cleaner IMO and necessary in a bunch of places where the DEBUG_TYPE is referenced from the generated code. Consistency with the necessary places trumps. Hopefully the build bots are OK with the movement of intrin.h... llvm-svn: 206838	2014-04-22 02:03:14 +00:00
Chandler Carruth	1b9dde087e	[Modules] Remove potential ODR violations by sinking the DEBUG_TYPE define below all header includes in the lib/CodeGen/... tree. While the current modules implementation doesn't check for this kind of ODR violation yet, it is likely to grow support for it in the future. It also removes one layer of macro pollution across all the included headers. Other sub-trees will follow. llvm-svn: 206837	2014-04-22 02:02:50 +00:00
Quentin Colombet	d4f44690ef	[CodeGenPrepare] Use APInt to check the value of the immediate in a and while checking candidate for bit field extract. Otherwise the value may not fit in uint64_t and this will trigger an assertion. This fixes PR19503. llvm-svn: 206834	2014-04-22 01:20:34 +00:00
Chandler Carruth	66f38db3ba	[Modules] Followup to r206822 to add a DEBUG_TYPE which is used on ARM and PPC, but not x86. llvm-svn: 206830	2014-04-21 23:58:10 +00:00
Chandler Carruth	e96dd8975f	[Modules] Make Support/Debug.h modular. This requires it to not change behavior based on other files defining DEBUG_TYPE, which means it cannot define DEBUG_TYPE at all. This is actually better IMO as it forces folks to define relevant DEBUG_TYPEs for their files. However, it requires all files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't already. I've updated all such files in LLVM and will do the same for other upstream projects. This still leaves one important change in how LLVM uses the DEBUG_TYPE macro going forward: we need to only define the macro after header files have been #include-ed. Previously, this wasn't possible because Debug.h required the macro to be pre-defined. This commit removes that. By defining DEBUG_TYPE after the includes two things are fixed: - Header files that need to provide a DEBUG_TYPE for some inline code can do so by defining the macro before their inline code and undef-ing it afterward so the macro does not escape. - We no longer have rampant ODR violations due to including headers with different DEBUG_TYPE definitions. This may be mostly an academic violation today, but with modules these types of violations are easy to check for and potentially very relevant. Where necessary to suppor headers with DEBUG_TYPE, I have moved the definitions below the includes in this commit. I plan to move the rest of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big enough. The comments in Debug.h, which were hilariously out of date already, have been updated to reflect the recommended practice going forward. llvm-svn: 206822	2014-04-21 22:55:11 +00:00
Yi Jiang	b23edebdd2	Set default value of HasExtractBitsInsn to false llvm-svn: 206803	2014-04-21 22:22:44 +00:00
Jim Grosbach	9446534025	ARM64: Refactor away a few redundant helpers. The comment claimed that the register class information wasn't available in the assembly parser, but that's not really true. It's just annoying to get to. Replace the helper functions with references to the auto-generated information. llvm-svn: 206802	2014-04-21 22:13:57 +00:00
Rafael Espindola	bad3f77703	Simplify a vpermil* with constant mask. With a constant mask a vpermil* is just a shufflevector. This patch implements that simplification. This allows us to produce denser code. It should also allow more folding down the line. llvm-svn: 206801	2014-04-21 22:06:04 +00:00
Jim Grosbach	9515c52294	ARM64: Improve diagnostics for malformed reg+reg addressing mode. Make sure only general purpose registers are valid for offset regs and that 32-bit regs are only valid for sxtw and uxtw extends. llvm-svn: 206799	2014-04-21 21:45:57 +00:00
Jim Grosbach	ac901086e5	Move helper functions earlier in the file. No functional change. llvm-svn: 206798	2014-04-21 21:45:53 +00:00
Jim Grosbach	9d205d42f3	ARM64: Extended addressing mode source reg is 64-bit. The canonical form for the extended addressing mode (e.g., "[x1, w2, uxtw #3]" is for the MCInst to have the second register be the full 64-bit GPR64 register class. The instruction printer cleans up the output for display to show the 32-bit register instead, per the specification. This simplifies 205893 now that the aliasing is handled in the printer in 206495 so that the codegen path and the disassembler path give the same MCInst form. llvm-svn: 206797	2014-04-21 21:45:44 +00:00
David Blaikie	09757491d6	Use unique_ptr to manage ownership of GCOVFunctions, Blocks, and Edges. llvm-svn: 206796	2014-04-21 21:40:16 +00:00
Hal Finkel	bae796f0dc	Remove seemingly-unneeded artificial dependency The rationale for this artificial dependency seems to have been lost to the ravages of time, it is covered by no regression tests, and has no impact on test-suite performance numbers on either x86 or PPC. For the test suite, on both x86 and PPC, I ran the test suite 10 times (both as a baseline and with this change), and found no statistically-significant changes. For PPC, I used a P7 box. For x86, I used an Intel Xeon E5430. Both with -O3 -mcpu=native. This was discussed on-list back in January, but I've not had a chance to run the performance tests until today. llvm-svn: 206795	2014-04-21 21:30:25 +00:00
David Blaikie	4c82a809b3	Simplify destruction of Modules in LLVContextImpl. This avoids copying the container by simply deleting until empty. While I'd rather move to a stricter ownership semantic (unique_ptr), SmallPtrSet can't cope with unique_ptr and the ownership semantics here are a bit incestuous (Module sort of owns itself, but sort of doesn't (if the LLVMContext is destroyed before the Module, then it deregisters itself from the context... )). Ideally Modules would be given to the context, or possibly an emplace-like function to construct them there. Modules then shouldn't be destroyed by LLVM API clients, but by interacting with the owner (LLVMContext) directly (but even then, passing a Module* to LLVMContext doesn't provide an easy way to destroy the Module, since the set would be over unique_ptrs and you'd need a heterogenous lookup function which SmallPtrSet doesn't have either). llvm-svn: 206794	2014-04-21 21:27:19 +00:00
Rafael Espindola	6c76d1d7df	Handle _GLOBAL_OFFSET_TABLE_ in 64 bit mode. With this MC is able to handle _GLOBAL_OFFSET_TABLE_ in 64 bit mode, which is needed for medium and large code models. This fixes pr19470. llvm-svn: 206793	2014-04-21 21:15:45 +00:00
Rafael Espindola	83752535ea	clang-format this function. No functionality change, it will just make the next patch easier to read. llvm-svn: 206792	2014-04-21 21:00:58 +00:00
David Blaikie	bc44220eb8	Use unique_ptr to handle GlobalOpt's Evaluator members llvm-svn: 206790	2014-04-21 20:49:36 +00:00
Reid Kleckner	9b2cc647eb	Fix PR7272 in -tailcallelim instead of the inliner The -tailcallelim pass should be checking if byval or inalloca args can be captured before marking calls as tail calls. This was the real root cause of PR7272. With a better fix in place, revert the inliner change from r105255. The test case it introduced still passes and has been moved to test/Transforms/Inline/byval-tail-call.ll. Reviewers: chandlerc Differential Revision: http://reviews.llvm.org/D3403 llvm-svn: 206789	2014-04-21 20:48:47 +00:00
David Blaikie	eb038915ab	Simplify expression that was explicitly naming an operator overload in a call. llvm-svn: 206788	2014-04-21 20:43:51 +00:00
David Blaikie	229de508bb	Use unique_ptr to handle ownership of GCOVFunctions in GCOVProfiler. llvm-svn: 206786	2014-04-21 20:41:55 +00:00
David Blaikie	2b1dfa7244	Use unique_ptr to handle ownership of UserValues in LiveDebugVariablesImpl llvm-svn: 206785	2014-04-21 20:37:07 +00:00
David Blaikie	422b93dcf1	Use unique_ptr to manage objects owned by the ScheduleDAGMI. llvm-svn: 206784	2014-04-21 20:32:32 +00:00
David Blaikie	b0b7b18e8c	Use value semantics to manage DbgVariables rather than dynamic allocation/pointers. Requires switching some vectors to lists to maintain pointer validity. These could be changed to forward_lists (singly linked) with a bit more work - I've left comments to that effect. llvm-svn: 206780	2014-04-21 20:13:09 +00:00
Filipe Cabecinhas	20352216fb	Rename X86insrtps to the proper instruction name. Summary: The INSERTPS pattern fragment was called insrtps (mising 'e'), which would make it harder to grep for the patterns related to this instruction. Renaming it to use the proper instruction name. Reviewers: nadav CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3443 llvm-svn: 206779	2014-04-21 20:07:29 +00:00
Chandler Carruth	a4a2066482	[Modules] Consolidate the DEBUG_TYPE defines in NVPTX to the top of the cpp file rather than in the header and then again in the cpp file. llvm-svn: 206778	2014-04-21 19:53:55 +00:00
Chandler Carruth	5f1f26e891	[Modules] Sink all the DEBUG_TYPE defines for InstCombine out of the header files and into the cpp files. These files will require more touches as the header files actually use DEBUG(). Eventually, I'll have to introduce a matched #define and #undef of DEBUG_TYPE for the header files, but that comes as step N of many to clean all of this up. llvm-svn: 206777	2014-04-21 19:51:41 +00:00
Chandler Carruth	6d23a7b600	[Modules] Sink the DEBUG_TYPE macro out of LegalizeTypes.h and into the various .cpp files. This macro is inherently non-modular, and it wasn't even needed in this header file. llvm-svn: 206775	2014-04-21 19:43:07 +00:00
Yi Jiang	d069f6393a	ARM64: Combine shifts and uses from different basic block to bit-extract instruction llvm-svn: 206774	2014-04-21 19:34:27 +00:00
Jim Grosbach	36f025e697	Revert "[rtdyld,c++11] Range'ify symbol table walking." Tentative revert for http://lab.llvm.org:8011/builders/llvm-mips-linux/builds/8305. This reverts commit c2a58efff07294fca724f89500538f2ddbcd12ff. llvm-svn: 206773	2014-04-21 19:23:59 +00:00
Matt Arsenault	443252c011	Fix unnecessary line break llvm-svn: 206772	2014-04-21 18:39:13 +00:00
Duncan P. N. Exon Smith	254689fcf9	blockfreq: Some cleanup of UnsignedFloat Change `PositiveFloat` to `UnsignedFloat`, and fix some of the comments to indicate that it's disappearing eventually. llvm-svn: 206771	2014-04-21 18:31:58 +00:00
Jim Grosbach	de577e3d68	[rtdyld,c++11] Range'ify symbol table walking. llvm-svn: 206769	2014-04-21 18:10:31 +00:00
Duncan P. N. Exon Smith	10be9a8868	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206707, reapplying r206704. The preceding commit to CalcSpillWeights should have sorted out the failing buildbots. <rdar://problem/14292693> llvm-svn: 206766	2014-04-21 17:57:07 +00:00
Duncan P. N. Exon Smith	7af3432e22	CalcSpillWeights: Hack to prevent x87 nonsense This gross hack forces `hweight` into memory, preventing hidden precision from making `1 > 1` occasionally equal `true`. <rdar://problem/14292693> llvm-svn: 206765	2014-04-21 17:57:01 +00:00
Evgeniy Stepanov	3939f54820	[msan] Enable out-of-line instrumentation for large functions by default. llvm-svn: 206759	2014-04-21 15:04:05 +00:00
NAKAMURA Takumi	62774f3524	Appease autoconf build since X86Disassembler.c has been disappeared in r206717. It can be reverted a few days later, after X86Disassembler.d is updated not to contain "X86Disassembler.c". llvm-svn: 206758	2014-04-21 14:59:11 +00:00
Kostya Serebryany	35e538312a	[asan] add a run-time flag detect_container_overflow=true/false llvm-svn: 206756	2014-04-21 14:35:00 +00:00
Rafael Espindola	6956b1a517	Convert getFileOffset to getOffset and move it to its only user. We normally don't drop functions from the C API's, but in this case I think we can: * The old implementation of getFileOffset was fairly broken * The introduction of LLVMGetSymbolFileOffset was itself a C api breaking change as it removed LLVMGetSymbolOffset. * It is an incredibly specialized use case. The only reason MCJIT needs it is because of its odd position of being a dynamic linker of .o files. llvm-svn: 206750	2014-04-21 13:45:32 +00:00
Michael Zolotukhin	f2ba994bf6	Reapply r206732. This time without optimization of branches. llvm-svn: 206749	2014-04-21 12:01:33 +00:00
Kostya Serebryany	94c81ca8e1	[asan] instead of inserting inline instrumentation around memset/memcpy/memmove, replace the intrinsic with __asan_memset/etc. This makes the memset/etc handling more complete and consistent with what we do in msan. It may slowdown some cases (when the intrinsic was actually inlined) and speedup other cases (when it was not inlined) llvm-svn: 206746	2014-04-21 11:50:42 +00:00
Chandler Carruth	572e3407c3	[PM] Add a new-PM-style CGSCC pass manager using the newly added LazyCallGraph analysis framework. Wire it up all the way through the opt driver and add some very basic testing that we can build pass pipelines including these components. Still a lot more to do in terms of testing that all of this works, but the basic pieces are here. There is a lot of boiler plate here. It's something I'm going to actively look at reducing, but I don't have any immediate ideas that don't end up making the code terribly complex in order to fold away the boilerplate. Until I figure out something to minimize the boilerplate, almost all of this is based on the code for the existing pass managers, copied and heavily adjusted to suit the needs of the CGSCC pass management layer. The actual CG management still has a bunch of FIXMEs in it. Notably, we don't do any updating of the CG as it is potentially invalidated. I wanted to get this in place to motivate the new analysis, and add update APIs to the analysis and the pass management layers in concert to make sure that the right APIs are present. llvm-svn: 206745	2014-04-21 11:12:00 +00:00
Chandler Carruth	de37c46780	[PM] Fix a bug where we didn't properly clear the list map when the list became empty. This would manifest later as an assert failure due to a non-empty list map but an empty result map. This doesn't easily manifest with just the module pass manager and the function pass manager, but the next commit will add the CGSCC pass manager that hits this assert immediately. llvm-svn: 206744	2014-04-21 11:11:54 +00:00
Kostya Serebryany	94f57d199a	[asan] temporary disable generating __asan_loadN/__asan_storeN llvm-svn: 206741	2014-04-21 10:28:13 +00:00
Benjamin Kramer	d2da720ead	[C++11] Replace OwningPtr with std::unique_ptr in places where it doesn't break the API. No functionality change. llvm-svn: 206740	2014-04-21 09:34:48 +00:00
Lang Hames	5aa6ee80b6	[X86] ISEL (and X, <constant mask>) to BZHI when BMI2 is available. Generating BZHI in the variable mask case, i.e. (and X, (sub (shl 1, N), 1)), was already supported, but we were missing the constant-mask case. This patch fixes that. <rdar://problem/15480077> llvm-svn: 206738	2014-04-21 08:18:53 +00:00
Chandler Carruth	a2533a7bef	Revert r206732 which is causing llc to crash on most of the build bots. Original commit message: Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i61, i32, or i64). llvm-svn: 206735	2014-04-21 07:11:15 +00:00
Kostya Serebryany	86332c010f	[asan] insert __asan_loadN/__asan_storeN as out-lined asan checks, llvm part llvm-svn: 206734	2014-04-21 07:10:43 +00:00
Michael Zolotukhin	137a84616c	Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i16, i32, or i64). llvm-svn: 206732	2014-04-21 05:33:09 +00:00
Chandler Carruth	99b756db04	[LCG] Add some basic debug output to the LCG pass. llvm-svn: 206730	2014-04-21 05:04:24 +00:00
David Blaikie	e9907ba16e	Protect the ArgList dtor It could even be made non-virtual if it weren't for bad compiler warnings. This demonstrates that ArgList objects aren't destroyed polymorphically and possibly that they aren't even used polymorphically. If that's the case, it might be possible to refactor the two ArgList types more separately and simplify the Arg ownership model. continues experimenting llvm-svn: 206727	2014-04-20 23:59:00 +00:00
David Blaikie	f70b21a4b8	Use unique_ptr to handle ownership of synthesized args in DerivedArgList This might be able to be simplified further by using Arg as a value type in a linked list (to maintain pointer validity), but here's something simple to start with. llvm-svn: 206724	2014-04-20 22:37:46 +00:00
Richard Smith	5d50610306	C++ has a bool type! (And C's had one too, for 15 years...) llvm-svn: 206723	2014-04-20 22:15:37 +00:00
Richard Smith	6a6967eeaf	More C++ification. llvm-svn: 206722	2014-04-20 22:10:16 +00:00
Richard Smith	3c3410f139	Remove some more C junk from these files. llvm-svn: 206721	2014-04-20 21:56:02 +00:00
Richard Smith	ac15f1cda3	Don't provide two different definitions of ModRMDecision, OpcodeDecision, and ContextDecision in different source files (depending on #define magic). llvm-svn: 206720	2014-04-20 21:52:16 +00:00
Richard Smith	82b47d5660	Don't define llvm::X86Disassembler::InstructionSpecifier in different ways in different source files. llvm-svn: 206719	2014-04-20 21:35:26 +00:00
Richard Smith	555134215b	Maybe if I touch this file the buildbots will actually rerun configure like they need to... llvm-svn: 206718	2014-04-20 21:28:33 +00:00
Richard Smith	89ee75d786	What year is it! This file has no reason to be written in C, and has doubly no reason to expose a global symbol 'decodeInstruction' nor to pollute the global scope with a bunch of external linkage entities (some of which conflict with others elsewhere in LLVM). This is just the initial transition to C++; more cleanups to follow. llvm-svn: 206717	2014-04-20 21:07:34 +00:00
Alp Toker	9844434151	Remove some empty statements Cleanup only. llvm-svn: 206710	2014-04-19 23:56:35 +00:00
Justin Bogner	a2bfd66e0e	ProfileData: Remove an extra semicolon Spotted by Nick Lewycky in review, thanks! llvm-svn: 206708	2014-04-19 23:42:50 +00:00
Duncan P. N. Exon Smith	e63327e967	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206704, as expected. llvm-svn: 206707	2014-04-19 22:46:00 +00:00
Duncan P. N. Exon Smith	875ddfac75	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206677, reapplying my BlockFrequencyInfo rewrite. I've done a careful audit, added some asserts, and fixed a couple of bugs (unfortunately, they were in unlikely code paths). There's a small chance that this will appease the failing bots [1][2]. (If so, great!) If not, I have a follow-up commit ready that will temporarily add -debug-only=block-freq to the two failing tests, allowing me to compare the code path between what the failing bots and what my machines (and the rest of the bots) are doing. Once I've triggered those builds, I'll revert both commits so the bots go green again. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 <rdar://problem/14292693> llvm-svn: 206704	2014-04-19 22:34:26 +00:00
Yaron Keren	d7ba46b287	Patch by Vadim Chugunov Win64 stack unwinder gets confused when execution flow "falls through" after a call to 'noreturn' function. This fixes the "missing epilogue" problem by emitting a trap instruction for IR 'unreachable' on x86_x64-pc-windows. A secondary use for it would be for anyone wanting to make double-sure that 'noreturn' functions, indeed, do not return. llvm-svn: 206684	2014-04-19 13:47:43 +00:00
Yaron Keren	421304d18c	Patch by Ray Donnelly to print register names instead of numbers. http://reviews.llvm.org/D3422 llvm-svn: 206683	2014-04-19 05:40:09 +00:00
David Blaikie	b5956d232f	Add parens to appease GCC warning. llvm-svn: 206678	2014-04-19 00:50:15 +00:00
Duncan P. N. Exon Smith	76b813619a	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206666, as planned. Still stumped on why the bots are failing. Sanitizer bots haven't turned anything up. If anyone can help me debug either of the failures (referenced in r206666) I'll owe them a beer. (In the meantime, I'll be auditing my patch for undefined behaviour.) llvm-svn: 206677	2014-04-19 00:42:46 +00:00
Justin Bogner	e808171628	OnDiskHashTable: Audit types and use offset_type consistently llvm-svn: 206675	2014-04-19 00:33:15 +00:00
Nick Lewycky	05e0f1ccff	Check whether functions have any lines associated before emitting coverage info for them. This isn't just a size/time saving, gcov may crash on these. llvm-svn: 206671	2014-04-18 23:32:28 +00:00
Kevin Enderby	b7e51f6af5	Change the ARM assembler to require a :lower16: or :upper16 on non-constant expressions for mov instructions instead of silently truncating by default. For the ARM assembler, we want to avoid misleadingly allowing something like "mov r0, <symbol>" especially when we turn it into a movw and the expression <symbol> does not have a :lower16: or :upper16" as part of the expression. We don't want the behavior of silently truncating, which can be unexpected and lead to bugs that are difficult to find since this is an easy mistake to make. This does change the previous behavior of llvm but actually matches an older gnu assembler that would not allow this but print less useful errors of like “invalid constant (0x927c0) after fixup” and “unsupported relocation on symbol foo”. The error for llvm is "immediate expression for mov requires :lower16: or :upper16" with correct location information on the operand as shown in the added test cases. rdar://12342160 llvm-svn: 206669	2014-04-18 23:06:39 +00:00
Duncan P. N. Exon Smith	b3caf3646f	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206628, reapplying r206622 (and r206626). Two tests are failing only on buildbots [1][2]: i.e., I can't reproduce on Darwin, and Chandler can't reproduce on Linux. Asan and valgrind don't tell us anything, but we're hoping the msan bot will catch it. So, I'm applying this again to get more feedback from the bots. I'll leave it in long enough to trigger builds in at least the sanitizer buildbots (it was failing for reasons unrelated to my commit last time it was in), and hopefully a few others.... and then I expect to revert a third time. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 llvm-svn: 206666	2014-04-18 22:30:03 +00:00
Alexey Samsonov	5c39fdfb7b	[llvm-symbolizer] Print file/line for a PC even if there is no DIE describing it. This is important for symbolizing executables with debug info in unavailable .dwo files. Even if all DIE entries are missing, we can still symbolize an address: function name can be fetched from symbol table, and file/line info can be fetched from line table. llvm-svn: 206665	2014-04-18 22:22:44 +00:00
Justin Bogner	b5d368e838	ProfileData: Don't forward declare ComputeHash and make it static inline llvm-svn: 206663	2014-04-18 22:00:22 +00:00
David Blaikie	76d3a3cd35	Compress debug sections only when beneficial. Both ZLIB and the debug info compressed section header ("ZLIB" + the size of the uncompressed data) take some constant overhead so in some cases the compressed data is actually larger than the uncompressed data. In these cases, just don't compress or rename the section at all. llvm-svn: 206659	2014-04-18 21:52:26 +00:00
Justin Bogner	b7aa26303b	ProfileData: Add support for the indexed instrprof format This adds support for an indexed instrumentation based profiling format, which is just a small header and an on disk hash table. This format will be used by clang's -fprofile-instr-use= for PGO. llvm-svn: 206656	2014-04-18 21:48:40 +00:00
Alexey Samsonov	d010999abe	[DWARF parser] Turn DILineInfo into a struct. Immutable DILineInfo doesn't bring any benefits and complicates code. Also, use std::string instead of SmallString<16> for file and function names - their length can vary significantly. No functionality change. llvm-svn: 206654	2014-04-18 21:36:39 +00:00
David Blaikie	c029ab430c	Update the fragments of symbols in compressed sections. While unnamed relocations are already cached in side tables in ELFObjectWriter::RecordRelocation, symbols still need their fragments updated to refer to the newly compressed fragment (even if that fragment isn't big enough to fit the offset). Even though we only create temporary symbols in debug info sections this comes up in 32 bit builds where even temporary symbols in mergeable sections (such as debug_str) have to be emitted as named symbols. I tried a few other ways to do this but they all didn't work for various reasons: 1) Canonicalize the MCSymbolData in RecordRelocation, nulling out the Fragment (so it didn't have to be updated by CompressDebugSection). This doesn't work because some code relies on symbols having fragments to indicate that they're defined, I think. 2) Canonicalize the MCSymbolData in RecordRelocation to be "first fragment + absolute offset" so it would be cheaper to just test and update the fragment in CompressDebugSections. This doesn't work because the offset computed in RecordRelocation isn't that of the symbol's fragment, it's the passed in fragment (I haven't figured out what that fragment is - perhaps it's the location where the relocation is to be written). And if the fragment offset has to be computed only for this use we might as well just do it when we need to, in CompressDebugSection. I also added an assert to help catch this a bit more clearly, even though it is UB. The test case improvements would either assert fail and/or valgrind vail without the fix, even if they wouldn't necessarily fail the FileCheck output. llvm-svn: 206653	2014-04-18 21:24:12 +00:00
Chad Rosier	9149acb053	[ARM64] Ports the Cortex-A53 Machine Model description from AArch64. Summary: This port includes the rudimentary latencies that were provided for the Cortex-A53 Machine Model in the AArch64 backend. It also changes the SchedAlias for COPY in the Cyclone model to an explicit WriteRes mapping to avoid conflicts in other subtargets. Differential Revision: http://reviews.llvm.org/D3427 Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 206652	2014-04-18 21:22:04 +00:00
Chandler Carruth	2174f44f61	[LCG] Fix the bugs that Ben pointed out in code review (and the MSan bot caught). Sad that we don't have warnings for these things, but bleh, no idea how to fix that. llvm-svn: 206646	2014-04-18 20:44:16 +00:00
Alexey Samsonov	84e2423d34	[DWARF parser] Respect address ranges specified in compile unit DIE. When address ranges for compile unit are specified in compile unit DIE itself, there is no need to collect ranges from children subprogram DIEs. This change speeds up llvm-symbolizer on Clang-produced binaries with full debug info. For instance, symbolizing a first address in a 1Gb binary is now 2x faster (1s vs. 2s). llvm-svn: 206641	2014-04-18 20:30:27 +00:00
Benjamin Kramer	147644d400	Remove a couple of redundant copies of SmallVector::operator==. No functionality change. llvm-svn: 206635	2014-04-18 19:48:03 +00:00
Adam Nemet	ee7a3e38c9	[X86] Improve buildFromShuffleMostly for AVX For a 256-bit BUILD_VECTOR consisting mostly of shuffles of 256-bit vectors, both the BUILD_VECTOR and its operands may need to be legalized in multiple steps. Consider: (v8f32 (BUILD_VECTOR (extract_vector_elt (v8f32 %vreg0,) Constant<1>), (extract_vector_elt %vreg0, Constant<2>), (extract_vector_elt %vreg0, Constant<3>), (extract_vector_elt %vreg0, Constant<4>), (extract_vector_elt %vreg0, Constant<5>), (extract_vector_elt %vreg0, Constant<6>), (extract_vector_elt %vreg0, Constant<7>), %vreg1)) a. We can't build a 256-bit vector efficiently so, we need to split it into two 128-bit vecs and combine them with VINSERTX128. b. Operands like (extract_vector_elt (v8f32 %vreg0), Constant<7>) needs to be split into a VEXTRACTX128 and a further extract_vector_elt from the resulting 128-bit vector. c. The extract_vector_elt from b. is lowered into a shuffle to the first element and a movss. Depending on the order in which we legalize the BUILD_VECTOR and its operands[1], buildFromShuffleMostly may be faced with: (v4f32 (BUILD_VECTOR (extract_vector_elt (vector_shuffle<1,u,u,u> (extract_subvector %vreg0, Constant<4>), undef), Constant<0>), (extract_vector_elt (vector_shuffle<2,u,u,u> (extract_subvector %vreg0, Constant<4>), undef), Constant<0>), (extract_vector_elt (vector_shuffle<3,u,u,u> (extract_subvector %vreg0, Constant<4>), undef), Constant<0>), %vreg1)) In order to figure out the underlying vector and their identity we need to see through the shuffles. [1] Note that the order in which operations and their operands are legalized is only guaranteed in the first iteration of LegalizeDAG. Fixes <rdar://problem/16296956> llvm-svn: 206634	2014-04-18 19:44:16 +00:00
Benjamin Kramer	ee26b621f0	DebugInfo: Remove some initializer lists to make MSVC happy again. llvm-svn: 206632	2014-04-18 19:01:53 +00:00
David Blaikie	583a31c976	Add range access to MCAssembler's symbol collection. llvm-svn: 206631	2014-04-18 18:24:25 +00:00
Duncan P. N. Exon Smith	0842ff36a6	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206622 and the MSVC fixup in r206626. Apparently the remotely failing tests are still failing, despite my attempt to fix the nondeterminism in r206621. llvm-svn: 206628	2014-04-18 17:56:08 +00:00
Duncan P. N. Exon Smith	38fe464df0	Fixing MSVC after r206622? llvm-svn: 206626	2014-04-18 17:38:01 +00:00
Andrew Trick	1766f93b35	Better comments to explain buffered/unbuffered processor resources. llvm-svn: 206625	2014-04-18 17:35:08 +00:00
Alexey Samsonov	762343da6c	[DWARF parser] Refactor fetching DIE address ranges. Add a helper method to get address ranges specified in a DIE (either by DW_AT_low_pc/DW_AT_high_pc, or by DW_AT_ranges). Use it to untangle and simplify the code. No functionality change. llvm-svn: 206624	2014-04-18 17:25:46 +00:00
Duncan P. N. Exon Smith	f8361d127a	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206556, effectively reapplying commit r206548 and its fixups in r206549 and r206550. In an intervening commit I've added target triples to the tests that were failing remotely [1] (but passing locally). I'm hoping the mystery is solved? I'll revert this again if the tests are still failing remotely. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206622	2014-04-18 17:22:25 +00:00
Tim Northover	37d9a9cebf	ARM64: disable generation of .loh directives outside MachO. Part of PR19455. llvm-svn: 206611	2014-04-18 14:54:46 +00:00
Tim Northover	be1d1b6681	ARM64: don't emit .subsections_via_symbols on ELF. Part of PR19455. llvm-svn: 206610	2014-04-18 14:54:41 +00:00
Tim Northover	be3941cc79	ARM64: add extra NEG pattern. llvm-svn: 206609	2014-04-18 14:54:35 +00:00
Tim Northover	e3028832d1	AArch64/ARM64: add non-scalar lowering for more FCVT operations. llvm-svn: 206591	2014-04-18 13:16:42 +00:00
Tim Northover	01f315a556	AArch64/ARM64: improve spotting of EXT instructions from VECTOR_SHUFFLE. We couldn't cope if the first mask element was UNDEF before, which isn't ideal. llvm-svn: 206588	2014-04-18 12:50:58 +00:00
Evgeniy Stepanov	65120ec8c6	[msan] Add -msan-instrumentation-with-call-threshold. This flag replaces inline instrumentation for checks and origin stores with calls into MSan runtime library. This is a workaround for PR17409. Disabled by default. llvm-svn: 206585	2014-04-18 12:17:20 +00:00
Chandler Carruth	d8d865e266	[LCG] Remove all of the complexity stemming from supporting copying. Reality is that we're never going to copy one of these. Supporting this was becoming a nightmare because nothing even causes it to compile most of the time. Lots of subtle errors built up that wouldn't have been caught by any "normal" testing. Also, make the move assignment actually work rather than the bogus swap implementation that would just infloop if used. As part of that, factor out the graph pointer updates into a helper to share between move construction and move assignment. llvm-svn: 206583	2014-04-18 11:02:33 +00:00
Chandler Carruth	18eadd9260	[LCG] Add support for building persistent and connected SCCs to the LazyCallGraph. This is the start of the whole point of this different abstraction, but it is just the initial bits. Here is a run-down of what's going on here. I'm planning to incorporate some (or all) of this into comments going forward, hopefully with better editing and wording. =] The crux of the problem with the traditional way of building SCCs is that they are ephemeral. The new pass manager however really needs the ability to associate analysis passes and results of analysis passes with SCCs in order to expose these analysis passes to the SCC passes. Making this work is kind-of the whole point of the new pass manager. =] So, when we're building SCCs for the call graph, we actually want to build persistent nodes that stick around and can be reasoned about later. We'd also like the ability to walk the SCC graph in more complex ways than just the traditional postorder traversal of the current CGSCC walk. That means that in addition to being persistent, the SCCs need to be connected into a useful graph structure. However, we still want the SCCs to be formed lazily where possible. These constraints are quite hard to satisfy with the SCC iterator. Also, using that would bypass our ability to actually add data to the nodes of the call graph to facilite implementing the Tarjan walk. So I've re-implemented things in a more direct and embedded way. This immediately makes it easy to get the persistence and connectivity correct, and it also allows leveraging the existing nodes to simplify the algorithm. I've worked somewhat to make this implementation more closely follow the traditional paper's nomenclature and strategy, although it is still a bit obtuse because it isn't recursive, using an explicit stack and a tail call instead, and it is interruptable, resuming each time we need another SCC. The other tricky bit here, and what actually took almost all the time and trials and errors I spent building this, is exactly what graph structure to build for the SCCs. The naive thing to build is the call graph in its newly acyclic form. I wrote about 4 versions of this which did precisely this. Inevitably, when I experimented with them across various use cases, they became incredibly awkward. It was all implementable, but it felt like a complete wrong fit. Square peg, round hole. There were two overriding aspects that pushed me in a different direction: 1) We want to discover the SCC graph in a postorder fashion. That means the root node will be the last node we find. Using the call-SCC DAG as the graph structure of the SCCs results in an orphaned graph until we discover a root. 2) We will eventually want to walk the SCC graph in parallel, exploring distinct sub-graphs independently, and synchronizing at merge points. This again is not helped by the call-SCC DAG structure. The structure which, quite surprisingly, ended up being completely natural to use is the inverse of the call-SCC DAG. We add the leaf SCCs to the graph as "roots", and have edges to the caller SCCs. Once I switched to building this structure, everything just fell into place elegantly. Aside from general cleanups (there are FIXMEs and too few comments overall) that are still needed, the other missing piece of this is support for iterating across levels of the SCC graph. These will become useful for implementing #2, but they aren't an immediate priority. Once SCCs are in good shape, I'll be working on adding mutation support for incremental updates and adding the pass manager that this analysis enables. llvm-svn: 206581	2014-04-18 10:50:32 +00:00
Benjamin Kramer	e6c821ef4c	X86: Pattern match scalar loads + vcvtph2ps into just vcvtph2ps. vcvtph2ps only reads the lower 64 bits of the address passed to the intrinsic. llvm-svn: 206579	2014-04-18 10:45:33 +00:00
Chandler Carruth	1911882569	Revert r206565 (and r206566 which updated tests). This commit was attributed to a different person from the person who posted the patch to the list, and the person who posted it the list claimed when they did that they were not the author, but that the author was yet a third person. I don't know what is going on here, but reverting until the attribution is clear and the author has explicitly contributed the patch. Also, the review hasn't really involved any of the MC maintainers and that seems questionable too. llvm-svn: 206576	2014-04-18 09:35:51 +00:00
Tim Northover	a2c4c71c12	AArch64/ARM64: spot a greater variety of concat_vector operations. Code mostly copied from AArch64, just tidied up a trifle and plumbed into the ARM64 way of doing things. This also enables the AArch64 tests which inspired the previous untested commits. llvm-svn: 206574	2014-04-18 09:31:27 +00:00
Tim Northover	848bb3ced5	ARM64: implement cunning optimisation from AArch64 A vector extract followed by a dup can become a single instruction even if the types don't match. AArch64 handled this in ISelLowering, but a few reasonably simple patterns can take care of it in TableGen, so that's where I've put it. llvm-svn: 206573	2014-04-18 09:31:20 +00:00
Tim Northover	5ec51a8981	ARM64: spot a vector_shuffle that maps to INS and expand. Tests will be coming very shortly when all the optimisations needed to support AArch64's neon-copy.ll file are committed. llvm-svn: 206572	2014-04-18 09:31:15 +00:00
Tim Northover	46d98ea8de	ARM64: nick some AArch64 patterns for extract/insert -> INS. Tests will be committed shortly when all optimisations needed to support AArch64's neon-copy.ll file are supported. llvm-svn: 206571	2014-04-18 09:31:11 +00:00
Tim Northover	8b2fa3dfef	AArch64/ARM64: emit all vector FP comparisons as such. ARM64 was scalarizing some vector comparisons which don't quite map to AArch64's compare and mask instructions. AArch64's approach of sacrificing a little efficiency to emulate them with the limited set available was better, so I ported it across. More "inspired by" than copy/paste since the backend's internal expectations were a bit different, but the tests were invaluable. llvm-svn: 206570	2014-04-18 09:31:07 +00:00
Tim Northover	0a44e66bb8	AArch64/ARM64: port BSL logic from AArch64 & enable test. I enhanced it a little in the process. The decision shouldn't really be beased on whether a BUILD_VECTOR is a splat: any set of constants will do the job provided they're related in the correct way. Also, the BUILD_VECTOR could be any operand of the incoming AND nodes, so it's best to check for all 4 possibilities rather than assuming it'll be the RHS. llvm-svn: 206569	2014-04-18 09:31:01 +00:00
Tim Northover	547a4ae6fa	AArch64/ARM64: copy byval implementation from AArch64. It's not actually used to handle C or C++ ABI rules on ARM64, but could well be emitted by other language front-ends, so it's as well to have a sensible implementation. llvm-svn: 206568	2014-04-18 09:30:52 +00:00
Yaron Keren	8ca45e0c05	Patch by Ray Donnelly. Emit WIN64 SEH registers by name instead of just number. llvm-svn: 206565	2014-04-18 08:03:38 +00:00
Kostya Serebryany	22e8810838	[asan] one more workaround for PR17409: don't do BB-level coverage instrumentation if there are more than N (=1500) basic blocks. This makes ASanCoverage work on libjpeg_turbo/jchuff.c used by Chrome, which has 1824 BBs llvm-svn: 206564	2014-04-18 08:02:42 +00:00
Jiangning Liu	ad874fca28	This commit allows vectorized loops to be unrolled by a factor of 2 for AArch64. A new test case is also added for ARM64. Patched by Z.Zheng llvm-svn: 206563	2014-04-18 07:57:54 +00:00
Matt Arsenault	209a7b92b5	R600: Minor cleanups. Fix indentation, better line wrapping, unused includes. llvm-svn: 206562	2014-04-18 07:40:20 +00:00
Lang Hames	bc876017c2	[ExecutionEngine] Allow JIT clients to enable/disable module verification. Previously module verification was always enabled, with no way to turn it off. As of this commit, module verification is on by default in Debug builds, and off by default in release builds. The default behaviour can be overridden by calling setVerifyModules(bool) on the JIT instance (this works for both the old JIT, and MCJIT). <rdar://problem/16150008> llvm-svn: 206561	2014-04-18 06:48:23 +00:00
Jiangning Liu	40d81e10c5	This is one of the optimizations ported from ARM64 to AArch64 to address the performance gap between these two back ends. The test case newly added for AArch64 already exists in ARM64. Patched by Z.Zheng llvm-svn: 206559	2014-04-18 05:58:09 +00:00
Matt Arsenault	78b8670aac	R600/SI: Try to use scalar BFE. Use scalar BFE with constant shift and offset when possible. This is complicated by the fact that the scalar version packs the two operands of the vector version into one. llvm-svn: 206558	2014-04-18 05:19:26 +00:00
Jiangning Liu	e56c30614f	This commit enables unaligned memory accesses of vector types on AArch64 back end. This should boost vectorized code performance. Patched by Z. Zheng llvm-svn: 206557	2014-04-18 03:58:38 +00:00
Duncan P. N. Exon Smith	e576167df8	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commits r206548, r206549 and r206549. There are some unit tests failing that aren't failing locally [1], so reverting until I have time to investigate. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206556	2014-04-18 02:17:43 +00:00
Duncan P. N. Exon Smith	878cf2b804	blockfreq: Really fix r206548 (and r206549) Turns out this code is dead. llvm-svn: 206554	2014-04-18 02:10:09 +00:00
Duncan P. N. Exon Smith	c7abca54cf	blockfreq: Fixing MSVC after r206548? llvm-svn: 206549	2014-04-18 02:06:24 +00:00
Duncan P. N. Exon Smith	12e68e1733	blockfreq: Rewrite BlockFrequencyInfoImpl Rewrite the shared implementation of BlockFrequencyInfo and MachineBlockFrequencyInfo entirely. The old implementation had a fundamental flaw: precision losses from nested loops (or very wide branches) compounded past loop exits (and convergence points). The @nested_loops testcase at the end of test/Analysis/BlockFrequencyAnalysis/basic.ll is motivating. This function has three nested loops, with branch weights in the loop headers of 1:4000 (exit:continue). The old analysis gives non-sensical results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': ---- Block Freqs ---- entry = 1.0 for.cond1.preheader = 1.00103 for.cond4.preheader = 5.5222 for.body6 = 18095.19995 for.inc8 = 4.52264 for.inc11 = 0.00109 for.end13 = 0.0 The new analysis gives correct results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': block-frequency-info: nested_loops - entry: float = 1.0, int = 8 - for.cond1.preheader: float = 4001.0, int = 32007 - for.cond4.preheader: float = 16008001.0, int = 128064007 - for.body6: float = 64048012001.0, int = 512384096007 - for.inc8: float = 16008001.0, int = 128064007 - for.inc11: float = 4001.0, int = 32007 - for.end13: float = 1.0, int = 8 Most importantly, the frequency leaving each loop matches the frequency entering it. The new algorithm leverages BlockMass and PositiveFloat to maintain precision, separates "probability mass distribution" from "loop scaling", and uses dithering to eliminate probability mass loss. I have unit tests for these types out of tree, but it was decided in the review to make the classes private to BlockFrequencyInfoImpl, and try to shrink them (or remove them entirely) in follow-up commits. The new algorithm should generally have a complexity advantage over the old. The previous algorithm was quadratic in the worst case. The new algorithm is still worst-case quadratic in the presence of irreducible control flow, but it's linear without it. The key difference between the old algorithm and the new is that control flow within a loop is evaluated separately from control flow outside, limiting propagation of precision problems and allowing loop scale to be calculated independently of mass distribution. Loops are visited bottom-up, their loop scales are calculated, and they are replaced by pseudo-nodes. Mass is then distributed through the function, which is now a DAG. Finally, loops are revisited top-down to multiply through the loop scales and the masses distributed to pseudo nodes. There are some remaining flaws. - Irreducible control flow isn't modelled correctly. LoopInfo and MachineLoopInfo ignore irreducible edges, so this algorithm will fail to scale accordingly. There's a note in the class documentation about how to get closer. See also the comments in test/Analysis/BlockFrequencyInfo/irreducible.ll. - Loop scale is limited to 4096 per loop (2^12) to avoid exhausting the 64-bit integer precision used downstream. - The "bias" calculation proposed on llvmdev is not incorporated here. This will be added in a follow-up commit, once comments from this review have been handled. llvm-svn: 206548	2014-04-18 01:57:45 +00:00
Matt Arsenault	27cc958dff	R600/SI: Match sign_extend_inreg to s_sext_i32_i8 and s_sext_i32_i16 llvm-svn: 206547	2014-04-18 01:53:18 +00:00
Duncan P. N. Exon Smith	49f3ec80c2	PMBuilder: Expose an option to disable tail calls Adds API to allow frontends to disable tail calls in PassManagerBuilder. <rdar://problem/16050591> llvm-svn: 206542	2014-04-18 01:05:15 +00:00
Tom Stellard	1aa6cb4d88	R600/SI: Use SReg_64 instead of VSrc_64 when selecting BUILD_PAIR llvm-svn: 206541	2014-04-18 00:36:21 +00:00
Jim Grosbach	6bfe18a365	[ARM64,C++11] Range'ify another loop. llvm-svn: 206539	2014-04-17 23:41:57 +00:00
Diego Novillo	0915c047c2	Fix bug 19437 - Only add discriminators for DWARF 4 and above. Summary: This prevents the discriminator generation pass from triggering if the DWARF version being used in the module is prior to 4. Reviewers: echristo, dblaikie CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3413 llvm-svn: 206507	2014-04-17 22:33:50 +00:00
Nuno Lopes	9ced19abe8	remove some dead code lib/Analysis/IPA/InlineCost.cpp \| 18 ------------------ lib/Analysis/RegionPass.cpp \| 1 - lib/Analysis/TypeBasedAliasAnalysis.cpp \| 1 - lib/Transforms/Scalar/LoopUnswitch.cpp \| 21 --------------------- lib/Transforms/Utils/LCSSA.cpp \| 2 -- lib/Transforms/Utils/LoopSimplify.cpp \| 6 ------ utils/TableGen/AsmWriterEmitter.cpp \| 13 ------------- utils/TableGen/DFAPacketizerEmitter.cpp \| 7 ------- utils/TableGen/IntrinsicEmitter.cpp \| 2 -- 9 files changed, 71 deletions(-) llvm-svn: 206506	2014-04-17 22:26:44 +00:00
Reed Kotler	720c5ca4ea	Start pushing changes for Mips Fast-Isel llvm-svn: 206505	2014-04-17 22:15:34 +00:00
Tom Stellard	aeeea8a864	R600: Add comment clariying use of sext for result of MUL_U24 llvm-svn: 206501	2014-04-17 21:00:13 +00:00
Tom Stellard	868fd92e54	R600/SI: Stop using i128 as the resource descriptor type Having i128 as a legal type complicates the legalization phase. v4i32 is already a legal type, so we will use that instead. This fixes several piglit tests. llvm-svn: 206500	2014-04-17 21:00:11 +00:00
Tom Stellard	334b29c7f6	R600/SI: Change default register class for i32 to SReg_32 SIFixSGPRCopies is smart enough to handle this now. llvm-svn: 206499	2014-04-17 21:00:09 +00:00
Tom Stellard	4f3b04de21	R600/SI: Teach SIInstrInfo::moveToVALU() how to handle PHI instructions llvm-svn: 206498	2014-04-17 21:00:07 +00:00
Tom Stellard	e1a244502c	R600/SI: Legalize operands after changing dst reg in FixSGPRCopies Otherwise we may not legalize some illegal REG_SEQUENCE instructions. llvm-svn: 206497	2014-04-17 21:00:01 +00:00
Louis Gerbarg	153e695ee2	Improve ARM64 vector creation This patch improves the performance of vector creation in caseiswhere where several of the lanes in the vector are a constant floating point value. It also includes new patterns to fold together some of the instructions when the value is 0.0f. Test cases included. rdar://16349427 llvm-svn: 206496	2014-04-17 20:51:50 +00:00
Jim Grosbach	0fba6d98fc	ARM64: [su]xtw use W regs as inputs, not X regs. Update the SXT[BHW]/UXTW instruction aliases and the shifted reg addressing mode handling. PR19455 and rdar://16650642 llvm-svn: 206495	2014-04-17 20:47:31 +00:00
David Blaikie	5b01593de4	ManagedStatic is never built with a null constructor, remove support for it. llvm-svn: 206492	2014-04-17 20:30:35 +00:00
Tim Northover	11a6082e33	ARM64: switch to IR-based atomic operations. Goodbye code! (Game: spot the bug fixed by the change). llvm-svn: 206490	2014-04-17 20:00:33 +00:00
Tim Northover	0129f298c4	ARM64: add acquire/release versions of the existing atomic intrinsics. These will be needed to support IR-level lowering of atomic operations. llvm-svn: 206489	2014-04-17 20:00:24 +00:00
Gerolf Hoflehner	ecebc3730e	Reverse 206485. After some discussions the preferred semantics of the always_inline attribute is inline always when the compiler can determine that it it safe to do so. llvm-svn: 206487	2014-04-17 19:14:06 +00:00
Josh Magee	adfde5fef6	[stack protector] Make the StackProtector pass respect ssp-buffer-size. Previously, SSPBufferSize was assigned the value of the "stack-protector-buffer-size" attribute after all uses of SSPBufferSize. The effect was that the default SSPBufferSize was always used during analysis. I moved the check for the attribute before the analysis; now --param ssp-buffer-size= works correctly again. Differential Revision: http://reviews.llvm.org/D3349 llvm-svn: 206486	2014-04-17 19:08:36 +00:00
Tim Northover	037f26f212	Atomics: promote ARM's IR-based atomics pass to CodeGen. Still only 32-bit ARM using it at this stage, but the promotion allows direct testing via opt and is a reasonably self-contained patch on the way to switching ARM64. At this point, other targets should be able to make use of it without too much difficulty if they want. (See ARM64 commit coming soon for an example). llvm-svn: 206485	2014-04-17 18:22:47 +00:00
Matt Arsenault	a90d22fad5	R600/SI: f64 frint is legal on CI llvm-svn: 206475	2014-04-17 17:06:37 +00:00
Chad Rosier	c4eb4f8827	[AArch64] Implement the getCSRFirstUseCost API, mirroring that in ARM64. llvm-svn: 206473	2014-04-17 16:19:54 +00:00
NAKAMURA Takumi	cd1fc4bc1b	Inliner::OptimizationRemark: Fix crash in clang/test/Frontend/optimization-remark.c on some hosts, including --vg. DebugLoc in Callsite would not live after Inliner. It should be copied before Inliner. llvm-svn: 206459	2014-04-17 12:22:14 +00:00
Chandler Carruth	b60cb315bc	[LCG] Just move the allocator (now that we can) when moving a call graph. This simplifies the custom move constructor operation to one of walking the graph and updating the 'up' pointers to point to the new location of the graph. Switch the nodes from a reference to a pointer for the 'up' edge to facilitate this. llvm-svn: 206450	2014-04-17 07:25:59 +00:00
Chandler Carruth	81f497d176	[LCG] Remove the Module reference member which we weren't using for anything and doesn't make sense if assigning. llvm-svn: 206449	2014-04-17 07:22:19 +00:00
Craig Topper	0a9bf4c0c5	[X86] Add disassembler support for the 0x0f 0x7f form of movq %mm, %mm. llvm-svn: 206447	2014-04-17 06:33:45 +00:00
Saleem Abdulrasool	1614b26886	MC: rework static_assert to be MSVC compatible Visual Studio does not permit referencing a structure member as a static field for sizeof calculations. Resort to a pointer cast which is compatible across Visual Studio and other compilers. llvm-svn: 206445	2014-04-17 06:17:20 +00:00
Matt Arsenault	51df0c1965	R600/SI: Fix zext from i1 to i64 llvm-svn: 206437	2014-04-17 02:03:08 +00:00
Adam Nemet	287f989dde	[ARM64] Fix "Cannot select" for vector ctpop The commit of r205855: Author: Arnold Schwaighofer <aschwaighofer@apple.com> Date: Wed Apr 9 14:20:47 2014 +0000 SLPVectorizer: Only vectorize intrinsics whose operands are widened equally The vectorizer only knows how to vectorize intrinics by widening all operands by the same factor. Patch by Tyler Nowicki! exposed a backend bug causing a regression (Cannot select ctpop). The commit msg is a bit confusing because the patch actually changes the behavior for the loop-vectorizer as well. As things got refactored into a helper ctpop got snuck in to the trivially-vectorizable helper which is now used by both vectorizers. In other words, we started seeing vector-ctpops in the backend. This change makes ctpop LegalizeAction::Expand for the types not supported by the byte-only CNT instruction. We may be able to custom-lower these later to a single CNT but this is to fix the compiler crash first. Fixes <rdar://problem/16578951> llvm-svn: 206433	2014-04-17 01:01:37 +00:00
Gerolf Hoflehner	5f6268a40e	Inline a function when the always_inline attribute is set even when it contains a indirect branch. The attribute overrules correctness concerns like the escape of a local block address. This is for rdar://16501761 llvm-svn: 206429	2014-04-17 00:21:52 +00:00
Jim Grosbach	6623e7f94a	[c++11] Tidy up AsmPrinter.cpp. Range'ify loops and tidy up some by-reference handling. No functional change. llvm-svn: 206422	2014-04-16 22:38:02 +00:00
Tom Stellard	1580dc78ae	Added new functionality to LLVM C API to use DiagnosticInfo to handle errors Patch by: Darren Powell llvm-svn: 206407	2014-04-16 17:45:04 +00:00
Aaron Ballman	5f1378c2a4	Replacing a non-ASCII character in a comment with an ASCII character. Fixes a C4819 warning in MSVC. llvm-svn: 206403	2014-04-16 17:09:20 +00:00
Diego Novillo	df655013a9	Allow diagnostic handlers to check for optimization remarks. Summary: When optimization remarks are enabled via the driver flag -Rpass, we should allow the FE diagnostic handler to check if the given pass name needs a diagnostic. We were unconditionally checking the pattern defined in opt's -pass-remarks flag. This was causing the FE to not emit any diagnostics. Reviewers: qcolombet CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3362 llvm-svn: 206400	2014-04-16 16:53:41 +00:00
Matheus Almeida	483d7e9349	[mips] Use TwoOperandAliasConstraint for shift instructions. This enables TableGen to generate an additional two operand matcher for our shift_rotate_imm and shift_rotate_reg class of instructions. The tests were also updated so that they include now encoding information for all affected instructions. llvm-svn: 206398	2014-04-16 16:28:59 +00:00
Matheus Almeida	0051f2dc78	[mips] Add initial support for NaN2008 in the back-end. This is so that EF_MIPS_NAN2008 is set if we are using IEEE 754-2008 NaN encoding (-mnan=2008). This patch also adds support for parsing '.nan legacy' and '.nan 2008' assembly directives. The handling of these directives should match GAS' behaviour i.e., the last directive in use sets the ELF header bit (EF_MIPS_NAN2008). Differential Revision: http://reviews.llvm.org/D3346 llvm-svn: 206396	2014-04-16 15:48:55 +00:00
Tim Northover	ef7b34d403	ARM64: silence sign-comparison warning. llvm-svn: 206393	2014-04-16 15:28:06 +00:00
Tim Northover	3e69958b6b	AArch64/ARM64: produce correct relocation for conditional branches. llvm-svn: 206391	2014-04-16 15:27:52 +00:00
Daniel Sanders	82cd99a126	[mips] Indentation llvm-svn: 206389	2014-04-16 14:38:27 +00:00
Daniel Sanders	16fa1db637	[mips] Fix emission of '.option pic0' for MIPS-IV. Summary: This was a case of incorrect usage of hasMips64() vs isABI_N64() Reviewers: matheusalmeida, dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3398 llvm-svn: 206388	2014-04-16 13:58:57 +00:00
Daniel Sanders	a024fb0e04	[mips] Correct r206370 to account for non-Linux targets using the small data section. This should fix the ninja-x64-msvc-RA-centos6 builder. I suspect the check in MipsSubtarget.cpp is incorrect and is really trying to check for a bare-metal target rather and anything other than linux. I'll investigate this. llvm-svn: 206385	2014-04-16 12:29:08 +00:00
Kostya Serebryany	0c02d26d6b	[asan] add two new hidden compile-time flags for asan: asan-instrumentation-with-call-threshold and asan-memory-access-callback-prefix. This is part of the workaround for PR17409 (instrument huge functions with callbacks instead of inlined code). These flags will also help us experiment with kasan (kernel-asan) and clang llvm-svn: 206383	2014-04-16 12:12:19 +00:00
Tim Northover	3ec1de7767	AArch64/ARM64: port across stub handling for ELF C++ exceptions. The most important part here is that we should actuall emit the stubs we refer to in the exception table, but as a side issue this uses more sensible & GCC compatible representations for some of the bits of information. llvm-svn: 206380	2014-04-16 11:52:55 +00:00
Tim Northover	18f68f6d1a	ARM64: use 32-bit moves for constants where possible. If we know that a particular 64-bit constant has all high bits zero, then we can rely on the fact that 32-bit ARM64 instructions automatically zero out the high bits of an x-register. This gives the expansion logic less constraints to satisfy and so sometimes allows it to pick better sequences. Came up while porting test/CodeGen/AArch64/movw-consts.ll: this will allow a 32-bit MOVN to be used in @test8 soon. llvm-svn: 206379	2014-04-16 11:52:51 +00:00
Tim Northover	9cfb57dafa	ARM64: use the integrated assembler on ELF. llvm-svn: 206378	2014-04-16 11:52:40 +00:00
Matheus Almeida	dc7e48e084	[mips] Emit '.set nomicromips' before a function's entry label if not in micromips mode. The test (elf_st_other.ll) was renamed as the name and description didn't make sense as the test wasn't checking any symbol table entry. Differential Revision: http://reviews.llvm.org/D3346 llvm-svn: 206377	2014-04-16 11:46:59 +00:00
Aaron Ballman	58ce7f24cd	Fixing a compile error in debug versions of MSVC. It seems that the range-based for loop is confused by the DEBUG macro expansion unless a compound statement is used. llvm-svn: 206376	2014-04-16 11:15:57 +00:00
Daniel Sanders	11c0c067c2	[mips] Correct callee saved list for the N32 ABI and enable test Summary: Depends on D3339 Reviewers: matheusalmeida, vmedic Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3340 llvm-svn: 206371	2014-04-16 10:23:37 +00:00
Tim Northover	97c5b6fe4f	ARM64: mark x7 as used when an i128 gets shunted onto the stack. The second half of a split i128 was ending up in x7, which is not a good thing. This is another part of PR19432. llvm-svn: 206366	2014-04-16 09:03:25 +00:00
Tim Northover	863a789a99	DAGCombiner: don't optimise non-existant litpool load This particular DAG combine is designed to kick in when both ConstantFPs will end up being loaded via a litpool, however those nodes have a semi-legal status, dictated by isFPImmLegal so in some cases there wouldn't have been a litpool in the first place. Don't try to be clever in those circumstances. Picked up while merging some AArch64 tests. llvm-svn: 206365	2014-04-16 09:03:09 +00:00
Timur Iskhodzhanov	42b1b517ce	Simplify a static_assert so VS2013 can build it llvm-svn: 206363	2014-04-16 08:30:32 +00:00
Saleem Abdulrasool	057094c6f6	COFF: fix an off by one error Adjust the tests to validate the number of auxiliary entries used to store the filename. Thanks to majnemer's sharp eye for catching the missing - 1 in the round up calculation. llvm-svn: 206359	2014-04-16 06:22:53 +00:00
Craig Topper	abb4ac7f87	Convert SelectionDAG::getVTList to use ArrayRef llvm-svn: 206357	2014-04-16 06:10:51 +00:00
Craig Topper	ada0857679	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206356	2014-04-16 04:21:27 +00:00
Saleem Abdulrasool	a2bf05aa2f	COFF: add support for .file symbols Add support for emitting .file records. This is mostly a quality of implementation change (more complete support for COFF file emission) that was noticed while working on COFF file emission for Windows on ARM. A .file record is emitted as a symbol with storage class FILE (103) and the name ".file". A series of auxiliary format 4 records follow which contain the file name. The filename is stored as an ANSI string and is padded with NULL if the length is not a multiple of COFF::SymbolSize (18). llvm-svn: 206355	2014-04-16 04:15:32 +00:00
Saleem Abdulrasool	0d3d6c45ef	Target: whitespace llvm-svn: 206353	2014-04-16 04:15:25 +00:00
Matt Arsenault	4e46665a80	R600: Expand sign extension of vectors. Setting vector types to expand will result in scalarization on pre SI hw, as those gpus don't have vector shifts either. Expand also i32 vectors, this helps llvm make the correct decision about scalarizing the vector ops. v2: move setOperation() calls to R600ISelLowering.cpp. cleanup the SI code to make it obvious that this patch does is nop for SI Patch by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 206348	2014-04-16 01:41:30 +00:00
Jim Grosbach	36c6a50512	[ARM64,C++11] Tidy up branch relaxation a bit w/ c++11. No functional change. llvm-svn: 206344	2014-04-16 00:42:46 +00:00
Jim Grosbach	01fc5887ad	ARM64: Nuke some dead code. Missed in previous commit. llvm-svn: 206343	2014-04-16 00:42:43 +00:00
Jim Grosbach	80633094f8	[ARM64,C++11] Clean up the ARM64 LOH collection pass. Range'ify a bunch of loops, mainly. As a result, we have a variety of objects via reference rather than by pointer, so propogate that through the various helper functions where it makes sense. llvm-svn: 206337	2014-04-15 22:57:02 +00:00
Matt Arsenault	e500e32939	R600/SI: Print code size along with used registers llvm-svn: 206336	2014-04-15 22:40:47 +00:00
Matt Arsenault	4d7d38333b	R600/SI: Print more immediates in hex format Print in decimal for inline immediates, and hex otherwise. Use hex always for offsets in addressing offsets. This approximately matches what the shader compiler does. llvm-svn: 206335	2014-04-15 22:32:49 +00:00
Matt Arsenault	fcf86c5417	R600/SI: Cleanup parsing of register names. Try to figure out the class and number of subregisters. llvm-svn: 206334	2014-04-15 22:32:42 +00:00
Matt Arsenault	470acd81a8	R600/SI: Fix loads of i1 llvm-svn: 206330	2014-04-15 22:28:39 +00:00
Tobias Grosser	8d941ef30d	RegionInfo: Do not access a value that was just moved away This fixes a regression introduced in r206310. llvm-svn: 206328	2014-04-15 22:09:36 +00:00
Akira Hatanaka	3d90f99d1a	Make FastISel::SelectInstruction return before target specific fast-isel code handles Intrinsic::trap if TargetOptions::TrapFuncName is set. This fixes a bug in which the trap function was not taken into consideration when a program was compiled without optimization (at -O0). <rdar://problem/16291933> llvm-svn: 206323	2014-04-15 21:30:06 +00:00
Andrea Di Biagio	aac2eac4c2	[X86] Improve the lowering of packed shifts by constant build_vector. This patch teaches the backend how to efficiently lower logical and arithmetic packed shifts on both SSE and AVX/AVX2 machines. When possible, instead of scalarizing a vector shift, the backend should try to expand the shift into a sequence of two packed shifts by immedate count followed by a MOVSS/MOVSD. Example (v4i32 (srl A, (build_vector < X, Y, Y, Y>))) Can be rewritten as: (v4i32 (MOVSS (srl A, <Y,Y,Y,Y>), (srl A, <X,X,X,X>))) [with X and Y ConstantInt] The advantage is that the two new shifts from the example would be lowered into X86ISD::VSRLI nodes. This is always cheaper than scalarizing the vector into four scalar shifts plus four pairs of vector insert/extract. llvm-svn: 206316	2014-04-15 19:30:48 +00:00
Quentin Colombet	72dad56c53	[ARM64] Set default CPU to generic instead of cyclone. llvm-svn: 206313	2014-04-15 19:08:46 +00:00
Robert Lougher	a9bf2463b9	Revert r191049/r191059 as it can produce wrong code (see PR17975). It has already been reverted on the 3.4 branch in r196521. llvm-svn: 206311	2014-04-15 18:34:24 +00:00
David Blaikie	ec649acb82	Use unique_ptr to manage ownership of child Regions within llvm::Region llvm-svn: 206310	2014-04-15 18:32:43 +00:00
Julien Lerouge	be4fe32eb8	Add lifetime markers for allocas created to hold byval arguments, make them appear in the InlineFunctionInfo. llvm-svn: 206308	2014-04-15 18:06:46 +00:00
Julien Lerouge	957e91c4d8	Split byval argument initialization so the memcpy(s) are injected at the beginning of the first new block after inlining. llvm-svn: 206307	2014-04-15 18:01:54 +00:00
Duncan P. N. Exon Smith	2b69189c9c	LTO: Add more loop simplification passes to LTO Similar to r202051, add missing loop simplification passes to the LTO optimization pipeline. Patch by Rafael Espindola. llvm-svn: 206306	2014-04-15 17:48:15 +00:00
Duncan P. N. Exon Smith	0d640014ff	verify-di: Add back braces for MSVC compatability Fixup after r206300. <rdar://problem/15500563> llvm-svn: 206305	2014-04-15 17:28:26 +00:00
Duncan P. N. Exon Smith	6ef5f284d6	verify-di: Implement DebugInfoVerifier Implement DebugInfoVerifier, which steals verification relying on DebugInfoFinder from Verifier. - Adds LegacyDebugInfoVerifierPassPass, a ModulePass which wraps DebugInfoVerifier. Uses -verify-di command-line flag. - Change verifyModule() to invoke DebugInfoVerifier as well as Verifier. - Add a call to createDebugInfoVerifierPass() wherever there was a call to createVerifierPass(). This implementation as a module pass should sidestep efficiency issues, allowing us to turn debug info verification back on. <rdar://problem/15500563> llvm-svn: 206300	2014-04-15 16:27:38 +00:00
Duncan P. N. Exon Smith	67b44da0dd	verify-di: split out VerifierSupport Split out assertion and output helpers from Verifier in preparation for writing the DebugInfoVerifier. <rdar://problem/15500563> llvm-svn: 206299	2014-04-15 16:27:32 +00:00
David Blaikie	0afad5e8bc	Use unique_ptr to manage PassInfo instances in the PassRegistry llvm-svn: 206297	2014-04-15 15:17:14 +00:00
NAKAMURA Takumi	e1f3583b96	MipsAsmParser.cpp: Fix vg_leak in MipsOperand::CreateMem(). Mem.Base is managed by k_Memory itself. llvm-svn: 206293	2014-04-15 14:13:21 +00:00
NAKAMURA Takumi	bd524ef129	MipsAsmParser::ParseRegister(): Be responsible to delete an Operand on a temporary Operands. llvm-svn: 206292	2014-04-15 14:06:27 +00:00
Tim Northover	ebb3123a5f	AArch64/ARM64: add missing pattern for extending load. llvm-svn: 206290	2014-04-15 14:00:19 +00:00
Tim Northover	cbcb7a37f7	AArch64/ARM64: only mangle MOVZ/MOVN during encoding when needed Sometimes we need emit the bits that would actually be a MOVN when producing a relocated MOVZ instruction (don't ask). But not always, a check which ARM64 got wrong until now. llvm-svn: 206289	2014-04-15 14:00:15 +00:00
Tim Northover	6e27b8ded5	AArch64/ARM64: add support for large code-model jump tables. I've left the MachO CodeGen as it is, there's a reasonable chance it should use the GOT like ConstPools, but I'm not certain. llvm-svn: 206288	2014-04-15 14:00:11 +00:00
Tim Northover	221b583951	AArch64/ARM64: add patterns for various commutations of FNMADD. llvm-svn: 206287	2014-04-15 14:00:06 +00:00
Tim Northover	b37cff1ae2	AArch64/ARM64: add half as a storage type on ARM64. This brings it into line with the AArch64 behaviour and should open the way for certain OpenCL features. llvm-svn: 206286	2014-04-15 14:00:03 +00:00
Tim Northover	80a70a265a	AArch64/ARM64: copy patterns for fixed-point conversions Code is mostly copied directly across, with a slight extension of the ISelDAGToDAG function so that it can cope with the floating-point constants being behind a litpool. llvm-svn: 206285	2014-04-15 13:59:57 +00:00
Tim Northover	f70577b1cd	ARM64: add constraints to various FastISel operations llvm-svn: 206284	2014-04-15 13:59:53 +00:00

... 3 4 5 6 7 ...

68991 Commits