llvm-project

Commit Graph

Author	SHA1	Message	Date
Rui Ueyama	a045b73a96	Don't use DataRefImpl to implement ImportDirectoryEntryRef. DataRefImpl (a union of two integers and a pointer) is not the ideal data type to represent a reference to an import directory entity. We should just use the pointer to the import table and an offset instead to simplify. No functionality change. llvm-svn: 199349	2014-01-16 03:13:19 +00:00
Manman Ren	2ebfb42fe9	Report a warning when dropping outdated debug info metadata. Use DiagnosticInfo to emit the warning. llvm-svn: 199346	2014-01-16 01:51:12 +00:00
Reed Kotler	43788a20c6	Adjust offsets for max load instruction offsets. This is more pessimistic than it needs to be by 1 bit but I need to finish some other things so that all the boundary cases will work in that situation. constpool.c in test-suite will fail to assemble under our new internal test-suite sync without this change. llvm-svn: 199343	2014-01-16 00:47:46 +00:00
David Peixotto	c0f92a2dc9	Fix parsing of .symver directive on ARM ARM assembly syntax uses @ for a comment, execpt for the second parameter of the .symver directive which requires @ as part of the symbol name. This commit fixes the parsing of this directive by adding a special case for ARM for this one argumnet. To make the change we had to move the AllowAtInIdentifier variable to the MCAsmLexer interface (from AsmLexer) and expose a setter for the value. The ELFAsmParser then toggles this value when parsing the second argument to the .symver directive for a target that uses @ as a comment symbol llvm-svn: 199339	2014-01-15 22:40:02 +00:00
Quentin Colombet	5fa1f6f57a	[LTO] Add a hook to map LLVM diagnostics into the clients of LTO. Add a hook in the C API of LTO so that clients of the code generator can set their own handler for the LLVM diagnostics. The handler is defined like this: typedef void (lto_diagnostic_handler_t)(lto_codegen_diagnostic_severity_t severity, const char diag, void *ctxt) - severity says how bad this is. - diag is a string that contains the diagnostic message. - ctxt is the registered context for this handler. This hook is more general than the lto_get_error_message, since this function keeps only the latest message and can only be queried when something went wrong (no warning for instance). <rdar://problem/15517596> llvm-svn: 199338	2014-01-15 22:04:35 +00:00
Bob Wilson	f8d5da6e0b	Remove support for armv7f slice. <rdar://problem/12478440> This was never used for anything so we should just get rid of it. llvm-svn: 199337	2014-01-15 21:44:14 +00:00
Andrea Di Biagio	d7c03ec348	[DAGCombiner] Fix a wrong check in method SimplifyVBinOp. This fixes a regression intruced by r199135. Revision 199135 tried to simplify part of the logic in method DAGCombiner::SimplifyVBinOp introducing calls to method BuildVectorSDNode::isConstant(). However, that revision wrongly changed the check performed by method SimplifyVBinOp to identify dag nodes that can be folded. Before revision 199135, that method only tried to simplify vector binary operations if both operands were build_vector of Constant/ConstantFP/Undef only. After revision 199135, method SimplifyVBinop tried to simplify also vector binary operations with only one constant operand. This fixes the problem restoring the old behavior of SimplifyVBinOp. llvm-svn: 199328	2014-01-15 19:51:32 +00:00
Rafael Espindola	63da295045	Return an ErrorOr<Binary *> from createBinary. I did write a version returning ErrorOr<OwningPtr<Binary> >, but it is too cumbersome to use without std::move. I will keep the patch locally and submit when we switch to c++11. llvm-svn: 199326	2014-01-15 19:37:43 +00:00
Kevin Enderby	2e13b1c7f1	Update the X86 assembler for .intel_syntax to accept the \| and & bitwise operators. rdar://15570412 llvm-svn: 199323	2014-01-15 19:05:24 +00:00
Zoran Jovanovic	7d63392da9	LL and SC decoder method fix. llvm-svn: 199316	2014-01-15 13:17:33 +00:00
Zoran Jovanovic	d4cb61cf0e	Added support for LWU microMIPS instruction. llvm-svn: 199315	2014-01-15 13:01:18 +00:00
David Majnemer	dee105772c	WinCOFF: Transform IR expressions featuring __ImageBase into image relative relocations MSVC on x64 requires that we create image relative symbol references to refer to RTTI data. Seeing as how there is no way to explicitly make reference to a given relocation type in LLVM IR, pattern match expressions of the form &foo - &__ImageBase. Differential Revision: http://llvm-reviews.chandlerc.com/D2523 llvm-svn: 199312	2014-01-15 09:16:42 +00:00
Elena Demikhovsky	79b75d9048	Fixed identation. llvm-svn: 199301	2014-01-15 07:18:11 +00:00
Andrew Trick	ee5aa7f71a	Fix PR18449: SCEV needs more precise max BECount for multi-exit loop. llvm-svn: 199299	2014-01-15 06:42:11 +00:00
Craig Topper	30a134b68d	Add OpSize16 to the two byte forms of INC/DEC that we only use in 64-bit mode and a 64-bit only LEA. Even though we'll not be in 16-bit mode when we use them it makes their tables consistent with their 32-bit counterparts. llvm-svn: 199297	2014-01-15 05:20:59 +00:00
Jiangning Liu	0a791c348b	For AArch64, lowering sext_inreg and generate optimized code by using SXTL. llvm-svn: 199296	2014-01-15 05:08:01 +00:00
Hans Wennborg	4744ac1733	Switch-to-lookup tables: set threshold to 3 cases There has been an old FIXME to find the right cut-off for when it's worth analyzing and potentially transforming a switch to a lookup table. The switches always have two or more cases. I could not measure any speed-up by transforming a switch with two cases. A switch with three cases gets a nice speed-up, and I couldn't measure any compile-time regression, so I think this is the right threshold. In a Clang self-host, this causes 480 new switches to be transformed, and reduces the final binary size with 8 KB. llvm-svn: 199294	2014-01-15 05:00:27 +00:00
Arnold Schwaighofer	dc4c9460a2	LoopVectorize: Only strip casts from integer types when replacing symbolic strides Fixes PR18480. llvm-svn: 199291	2014-01-15 03:35:46 +00:00
Rafael Espindola	9d795caea4	Fix uninitialized variable. llvm-svn: 199288	2014-01-15 03:27:26 +00:00
Rafael Espindola	26e917cde0	Only mark functions as micromips. The GNU as behavior is a bit different and very strange. It will mark any label that contains an instruction. We can implement that, but using the type looks more natural since gas will not mark a function if a .word is used to output the instructions! llvm-svn: 199287	2014-01-15 03:07:12 +00:00
Weiming Zhao	fe26fd27b4	PR 18466: Fix ARM Pseudo Expansion When expanding neon pseudo stores, it may miss the implicit uses of sub regs, which may cause post RA scheduler reorder instructions that breakes anti dependency. For example: VST1d64QPseudo %R0<kill>, 16, %Q9_Q10, pred:14, pred:%noreg will be expanded to VST1d64Q %R0<kill>, 16, %D18, pred:14, pred:%noreg; An instruction that defines %D20 may be scheduled before the store by mistake. This patches adds implicit uses for such case. For the example above, it emits: VST1d64Q %R0<kill>, 8, %D18, pred:14, pred:%noreg, %Q9_Q10<imp-use> llvm-svn: 199282	2014-01-15 01:32:12 +00:00
Rafael Espindola	8f31e213e4	Make parseBitcodeFile return an ErrorOr<Module *>. llvm-svn: 199279	2014-01-15 01:08:23 +00:00
Eric Christopher	1ad8457570	Make sure we emit a relocation to the debug_ranges section in the presence of CU ranges. llvm-svn: 199276	2014-01-15 00:04:29 +00:00
Rafael Espindola	e9fab9b077	Return an error_code from materializeAllPermanently. llvm-svn: 199275	2014-01-14 23:51:27 +00:00
Rafael Espindola	1d06f7208c	Use error_code in Module::materializeAll. llvm-svn: 199269	2014-01-14 23:02:01 +00:00
Tim Northover	463a5f24d1	ARM: correctly determine final tBX_LR in Thumb1 functions The changes caused by folding an sp-adjustment into a "pop" previously disrupted the forward search for the final real instruction in a terminating block. This switches to a backward search (skipping debug instrs). This fixes PR18399. Patch by Zhaoshi. llvm-svn: 199266	2014-01-14 22:53:28 +00:00
Tim Northover	6e219cd588	AArch64: don't try to handle [SU]MUL_LOHI nodes We should set them to expand for now since there are no patterns dealing with them. Actually, there are no instructions either so I doubt they'll ever be acceptable. llvm-svn: 199265	2014-01-14 22:53:22 +00:00
Eric Christopher	39cde8cc90	Enable use of ranges for translation units in the presence of -ffunction-sections and update comments and TODOs about other places that we should enable this. llvm-svn: 199263	2014-01-14 22:44:17 +00:00
Matt Arsenault	2d353d1a10	Do pointer cast simplifications on addrspacecast llvm-svn: 199254	2014-01-14 20:00:45 +00:00
Matt Arsenault	f08a44f903	Remove a check for an illegal condition. Bitcasts can't be between address spaces anymore. llvm-svn: 199253	2014-01-14 19:56:57 +00:00
Lang Hames	06234ec147	Add FPExt option to CCValAssign::LocInfo. When generating calling-convention promotion code, Tablegen will now select FPExt for floating point promotions (previously it had returned AExt, which is not valid for floating point types). Any out-of-tree targets that were relying on AExt being returned for FP promotions will need to update their code check for FPExt instead. llvm-svn: 199252	2014-01-14 19:56:36 +00:00
Rafael Espindola	08ff298d51	Revert "[AArch64] Added vselect patterns with float and double types" This reverts commit r199242. It is causing CodeGen/AArch64/neon-bsl.ll to fail. llvm-svn: 199248	2014-01-14 19:24:08 +00:00
Matt Arsenault	e55a2c2e6b	Make nocapture analysis work with addrspacecast llvm-svn: 199246	2014-01-14 19:11:52 +00:00
Rafael Espindola	6633d57ae4	Fix a low hanging use of hasRawTextSupport. This also fixes the placement of the function label comment. It was being placed next to the mips16 directive instead of next to the label. llvm-svn: 199245	2014-01-14 18:57:12 +00:00
Duncan P. N. Exon Smith	93be7c4fb3	Reapply "LTO: add API to set strategy for -internalize" Reapply r199191, reverted in r199197 because it carelessly broke Other/link-opts.ll. The problem was that calling createInternalizePass("main") would select createInternalizePass(bool("main")) instead of createInternalizePass(ArrayRef<const char >("main")). This commit fixes the bug. The original commit message follows. Add API to LTOCodeGenerator to specify a strategy for the -internalize pass. This is a new attempt at Bill's change in r185882, which he reverted in r188029 due to problems with the gold linker. This puts the onus on the linker to decide whether (and what) to internalize. In particular, running internalize before outputting an object file may change a 'weak' symbol into an internal one, even though that symbol could be needed by an external object file --- e.g., with arclite. This patch enables three strategies: - LTO_INTERNALIZE_FULL: the default (and the old behaviour). - LTO_INTERNALIZE_NONE: skip -internalize. - LTO_INTERNALIZE_HIDDEN: only -internalize symbols with hidden visibility. LTO_INTERNALIZE_FULL should be used when linking an executable. Outputting an object file (e.g., via ld -r) is more complicated, and depends on whether hidden symbols should be internalized. E.g., for ld -r, LTO_INTERNALIZE_NONE can be used when -keep_private_externs, and LTO_INTERNALIZE_HIDDEN can be used otherwise. However, LTO_INTERNALIZE_FULL is inappropriate, since the output object file will eventually need to link with others. lto_codegen_set_internalize_strategy() sets the strategy for subsequent calls to lto_codegen_write_merged_modules() and lto_codegen_compile(). <rdar://problem/14334895> llvm-svn: 199244	2014-01-14 18:52:17 +00:00
Ana Pazos	787f540daa	[AArch64] Added vselect patterns with float and double types llvm-svn: 199242	2014-01-14 18:45:48 +00:00
Nico Rieck	c60647f0db	Handle dllexport for global aliases llvm-svn: 199219	2014-01-14 15:23:25 +00:00
Nico Rieck	7157bb765e	Decouple dllexport/dllimport from linkage Representing dllexport/dllimport as distinct linkage types prevents using these attributes on templates and inline functions. Instead of introducing further mixed linkage types to include linkonce and weak ODR, the old import/export linkage types are replaced with a new separate visibility-like specifier: define available_externally dllimport void @f() {} @Var = dllexport global i32 1, align 4 Linkage for dllexported globals and functions is now equal to their linkage without dllexport. Imported globals and functions must be either declarations with external linkage, or definitions with AvailableExternallyLinkage. llvm-svn: 199218	2014-01-14 15:22:47 +00:00
Elena Demikhovsky	767fc967b4	AVX-512: optimized scalar compare patterns removed AVX512SI format, since it is similar to AVX512BI. llvm-svn: 199217	2014-01-14 15:10:08 +00:00
Patrik Hagglund	682a10d4cc	Fix valgrind warning for gcc builds. Sorry, I don't understand why the warning is generated (a gcc bug?). Anyhow, the change should improve readablity. No functionality change intended. llvm-svn: 199214	2014-01-14 14:09:00 +00:00
Andrea Di Biagio	5448a3c771	[X86] Fix assertion failure caused by a wrong folding of vector shifts by immediate count. This fixes a regression intruced by r198113. Revision r198113 introduced an algorithm that tries to fold a vector shift by immediate count into a build_vector if the input vector is a known vector of constants. However the algorithm only worked under the assumption that the input vector type and the shift type are exactly the same. This patch disables the folding of vector shift by immediate count if the input vector type and the shift value type are not the same. llvm-svn: 199213	2014-01-14 13:17:12 +00:00
Tim Northover	56cc5c92db	ARM: add constraint that RdLo != Rn != RdHi for v5 MLA insts. llvm-svn: 199212	2014-01-14 13:05:47 +00:00
Tim Northover	c4c34b4f5c	ARM: remove unused UMAALv5 node It was incorrect anyway, since it didn't have accumulator inputs and wasn't even supported on v5. llvm-svn: 199211	2014-01-14 13:05:42 +00:00
Nico Rieck	9d2e0df049	Revert "Decouple dllexport/dllimport from linkage" Revert this for now until I fix an issue in Clang with it. This reverts commit r199204. llvm-svn: 199207	2014-01-14 12:38:32 +00:00
Nico Rieck	1794b62f54	Revert "Handle dllexport for global aliases" This reverts commit r199205. llvm-svn: 199206	2014-01-14 12:36:54 +00:00
Nico Rieck	4192acdbc3	Handle dllexport for global aliases llvm-svn: 199205	2014-01-14 11:55:40 +00:00
Nico Rieck	e43aaf7967	Decouple dllexport/dllimport from linkage Representing dllexport/dllimport as distinct linkage types prevents using these attributes on templates and inline functions. Instead of introducing further mixed linkage types to include linkonce and weak ODR, the old import/export linkage types are replaced with a new separate visibility-like specifier: define available_externally dllimport void @f() {} @Var = dllexport global i32 1, align 4 Linkage for dllexported globals and functions is now equal to their linkage without dllexport. Imported globals and functions must be either declarations with external linkage, or definitions with AvailableExternallyLinkage. llvm-svn: 199204	2014-01-14 11:55:03 +00:00
Nico Rieck	da881a2742	Fix fastcall mangling of dllimported symbols fastcall requires @ as global prefix instead of _ but getNameWithPrefix wrongly assumes the OutName buffer is empty and replaces at index 0. For imported functions this buffer is pre-filled with "__imp_" resulting in broken "@_imp_foo@0" mangling. Instead replace at the proper index. We also never have to prepend the @-prefix because this fastcall mangling is only used on 32-bit Windows targets which have _ has global prefix. llvm-svn: 199203	2014-01-14 11:53:26 +00:00
NAKAMURA Takumi	23c0ab53b2	Revert r199191, "LTO: add API to set strategy for -internalize" Please update also Other/link-opts.ll, in next time. llvm-svn: 199197	2014-01-14 09:40:18 +00:00
Craig Topper	ae11aed9d7	Separate the concept of 16-bit/32-bit operand size controlled by 0x66 prefix and the current mode from the concept of SSE instructions using 0x66 prefix as part of their encoding without being affected by the mode. This should allow SSE instructions to be encoded correctly in 16-bit mode which r198586 probably broke. llvm-svn: 199193	2014-01-14 07:41:20 +00:00
Duncan P. N. Exon Smith	43ea3478bf	LTO: add API to set strategy for -internalize Add API to LTOCodeGenerator to specify a strategy for the -internalize pass. This is a new attempt at Bill's change in r185882, which he reverted in r188029 due to problems with the gold linker. This puts the onus on the linker to decide whether (and what) to internalize. In particular, running internalize before outputting an object file may change a 'weak' symbol into an internal one, even though that symbol could be needed by an external object file --- e.g., with arclite. This patch enables three strategies: - LTO_INTERNALIZE_FULL: the default (and the old behaviour). - LTO_INTERNALIZE_NONE: skip -internalize. - LTO_INTERNALIZE_HIDDEN: only -internalize symbols with hidden visibility. LTO_INTERNALIZE_FULL should be used when linking an executable. Outputting an object file (e.g., via ld -r) is more complicated, and depends on whether hidden symbols should be internalized. E.g., for ld -r, LTO_INTERNALIZE_NONE can be used when -keep_private_externs, and LTO_INTERNALIZE_HIDDEN can be used otherwise. However, LTO_INTERNALIZE_FULL is inappropriate, since the output object file will eventually need to link with others. lto_codegen_set_internalize_strategy() sets the strategy for subsequent calls to lto_codegen_write_merged_modules() and lto_codegen_compile*(). <rdar://problem/14334895> llvm-svn: 199191	2014-01-14 06:37:26 +00:00
Jakob Stoklund Olesen	b6b35a4955	Always let value types influence register classes. When creating a virtual register for a def, the value type should be used to pick the register class. If we only use the register class constraint on the instruction, we might pick a too large register class. Some registers can store values of different sizes. For example, the x86 xmm registers can hold f32, f64, and 128-bit vectors. The three different value sizes are represented by register classes with identical register sets: FR32, FR64, and VR128. These register classes have different spill slot sizes, so it is important to use the right one. The register class constraint on an instruction doesn't necessarily care about the size of the value its defining. The value type determines that. This fixes a problem where InstrEmitter was picking 32-bit register classes for 64-bit values on SPARC. llvm-svn: 199187	2014-01-14 06:18:38 +00:00
Jakob Stoklund Olesen	209120621a	Switch the NEON register class from QPR to DPair. The already allocatable DPair superclass contains odd-even D register pair in addition to the even-odd pairs in the QPR register class. There is no reason to constrain the set of D register pairs that can be used for NEON values. Any NEON instructions that require a Q register will automatically constrain the register class to QPR. The allocation order for DPair begins with the QPR registers, so register allocation is unlikely to change much. llvm-svn: 199186	2014-01-14 06:18:34 +00:00
Rafael Espindola	6d5f7ce348	Replace .mips_hack_stocg with ".set micromips" and ".set nomicromips". This matches what gnu as does and implementing this is easier than arguing about it. llvm-svn: 199181	2014-01-14 04:25:13 +00:00
Mark Seaborn	8271118a65	Fix llc to not reuse spill slots in functions that invoke setjmp() We need to ensure that StackSlotColoring.cpp does not reuse stack spill slots in functions that call "returns_twice" functions such as setjmp(), otherwise this can lead to miscompiled code, because a stack slot would be clobbered when it's still live. This was already handled correctly for functions that call setjmp() (though this wasn't covered by a test), but not for functions that invoke setjmp(). We fix this by changing callsFunctionThatReturnsTwice() to check for invoke instructions. This fixes PR18244. llvm-svn: 199180	2014-01-14 04:20:01 +00:00
Rafael Espindola	4a1a360634	Make getTargetStreamer return a possibly null pointer. This will allow it to be called from target independent parts of the main streamer that don't know if there is a registered target streamer or not. This in turn will allow targets to perform extra actions at specified points in the interface: add extra flags for some labels, extra work during finalization, etc. llvm-svn: 199174	2014-01-14 01:21:46 +00:00
Cameron McInally	f0379fa41a	Fix uninitialized warning in llvm/lib/IR/DataLayout.cpp. llvm-svn: 199147	2014-01-13 22:04:55 +00:00
Juergen Ributzka	6840282c99	[DAG] Refactor ReassociateOps - no functional change intended. llvm-svn: 199146	2014-01-13 21:49:25 +00:00
Juergen Ributzka	7384405f23	[DAG] Teach DAG to also reassociate vector operations This commit teaches DAG to reassociate vector ops, which in turn enables constant folding of vector op chains that appear later on during custom lowering and DAG combine. Reviewed by Andrea Di Biagio llvm-svn: 199135	2014-01-13 20:51:35 +00:00
Andrew Trick	7daf6a45f4	Hide the pre-RA-sched= option. This is a very confusing option for a feature that will go away. -enable-misched is exposed instead to help triage issues with the new scheduler. llvm-svn: 199133	2014-01-13 20:08:27 +00:00
Weiming Zhao	f66be56bf7	Fix PR 18369: [Thumbv8] asserts due to inconsistent CPSR liveness of IT blocks The issue is caused when Post-RA scheduler reorders a bundle instruction (IT block). However, it only flips the CPSR liveness of the bundle instruction, leaves the instructions inside the bundle unchanged, which causes inconstancy and crashes Thumb2SizeReduction.cpp::ReduceMBB(). llvm-svn: 199127	2014-01-13 18:47:54 +00:00
Rafael Espindola	5b6c1e8e59	Update getLazyBitcodeModule to use ErrorOr for error handling. llvm-svn: 199125	2014-01-13 18:31:04 +00:00
Andrea Di Biagio	9bc0415c1f	[AArch64] Fix assertion failure caused by an invalid comparison between APInt values. APInt only knows how to compare values with the same BitWidth and asserts in all other cases. With this fix, function PerformORCombine does not use the APInt equality operator if the APInt values returned by 'isConstantSplat' differ in BitWidth. In that case they are different and no comparison is needed. llvm-svn: 199119	2014-01-13 16:51:00 +00:00
Joerg Sonnenberger	808df6725f	Fix indentation. llvm-svn: 199118	2014-01-13 15:50:36 +00:00
Richard Sandiford	32379b8141	[SystemZ] Optimize (sext (ashr (shl ...), ...)) ...into (ashr (shl (anyext X), ...), ...), which requires one fewer instruction. The (anyext X) can sometimes be simplified too. I didn't do this in DAGCombiner because widening shifts isn't a win on all targets. llvm-svn: 199114	2014-01-13 15:17:53 +00:00
Tim Northover	1328c1ae32	ARM: constrain Thumb LDRLIT pseudo-instructions to r0-r7. Previously we only used GPR for the destination placeholder in "ldr rD, [pc, incorrect codegen under the integrated assembler. This should fix both issues (which probably only affect MachO targets at the moment). rdar://problem/15800156 llvm-svn: 199108	2014-01-13 14:19:17 +00:00
David Woodhouse	4e033b0e92	[x86] Fix retq/retl handling in 64-bit mode This finishes the job started in r198756, and creates separate opcodes for 64-bit vs. 32-bit versions of the rest of the RET instructions too. LRETL/LRETQ are interesting... I can't see any justification for their existence in the SDM. There should be no 'LRETL' in 64-bit mode, and no need for a REX.W prefix for LRETQ. But this is what GAS does, and my Sandybridge CPU and an Opteron 6376 concur when tested as follows: asm __volatile__("pushq $0x1234\nmovq $0x33,%rax\nsalq $32,%rax\norq $1f,%rax\npushq %rax\nlretl $8\n1:"); asm __volatile__("pushq $1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); asm __volatile__("pushq $0x33\npushq $1f\nlretq\n1:"); asm __volatile__("pushq $0x1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); cf. PR8592 and commit r118903, which added LRETQ. I only added LRETIQ to match it. I don't quite understand how the Intel syntax parsing for ret instructions is working, despite r154468 allegedly fixing it. Aren't the explicitly sized 'retw', 'retd' and 'retq' supposed to work? I have at least made the 'lretq' work with (and indeed require) the 'q'. llvm-svn: 199106	2014-01-13 14:05:59 +00:00
Chandler Carruth	73523021d0	[PM] Split DominatorTree into a concrete analysis result object which can be used by both the new pass manager and the old. This removes it from any of the virtual mess of the pass interfaces and lets it derive cleanly from the DominatorTreeBase<> template. In turn, tons of boilerplate interface can be nuked and it turns into a very straightforward extension of the base DominatorTree interface. The old analysis pass is now a simple wrapper. The names and style of this split should match the split between CallGraph and CallGraphWrapperPass. All of the users of DominatorTree have been updated to match using many of the same tricks as with CallGraph. The goal is that the common type remains the resulting DominatorTree rather than the pass. This will make subsequent work toward the new pass manager significantly easier. Also in numerous places things became cleaner because I switched from re-running the pass (!!! mid way through some other passes run!!!) to directly recomputing the domtree. llvm-svn: 199104	2014-01-13 13:07:17 +00:00
Elena Demikhovsky	b19c9dc1a1	AVX-512: Embedded Rounding Control - encoding and printing Changed intrinsics for vrcp14/vrcp28 vrsqrt14/vrsqrt28 - aligned with GCC. llvm-svn: 199102	2014-01-13 12:55:03 +00:00
Chandler Carruth	e509db410a	[PM] Pull the generic graph algorithms and data structures for dominator trees into the Support library. These are all expressed in terms of the generic GraphTraits and CFG, with no reliance on any concrete IR types. Putting them in support clarifies that and makes the fact that the static analyzer in Clang uses them much more sane. When moving the Dominators.h file into the IR library I claimed that this was the right home for it but not something I planned to work on. Oops. So why am I doing this? It happens to be one step toward breaking the requirement that IR verification can only be performed from inside of a pass context, which completely blocks the implementation of verification for the new pass manager infrastructure. Fixing it will also allow removing the concept of the "preverify" step (WTF???) and allow the verifier to cleanly flag functions which fail verification in a way that precludes even computing dominance information. Currently, that results in a fatal error even when you ask the verifier to not fatally error. It's awesome like that. The yak shaving will continue... llvm-svn: 199095	2014-01-13 10:52:56 +00:00
Tim Northover	7fdd4857f7	Revert "ReMat: fix overly cavalier attitude to sub-register indices" Very sorry, this was a premature patch that I still need to investigate and finish off (for some reason beyond me at the moment it doesn't actually fix the issue in all cases). This reverts commit r199091. llvm-svn: 199093	2014-01-13 10:49:11 +00:00
Tim Northover	59f8d4b4ee	ReMat: fix overly cavalier attitude to sub-register indices There are two attempted optimisations in reMaterializeTrivialDef, trying to avoid promoting the size of a register too much when rematerializing. Unfortunately, both appear to be flawed. First, we see if the original register would have worked, but this is inadequate. Consider: v1 = SOMETHING (v1 is QQ) v2:Q0 = COPY v1:Q1 (v1, v2 are QQ) ... uses of v2 In this case even though v2 could be used directly as the output of SOMETHING, this would set the wrong bits of the QQ register involved. The correct rematerialization must be: v2:Q0_Q1 = SOMETHING (v2 promoted to QQQ) ... uses of v2:Q1_Q2 For the second optimisation, if the correct remat is "v2:idx = SOMETHING" then we can't necessarily expect v2 itself to be valid for SOMETHING, but we do try to hunt for a class between v1 and v2 that works. Unfortunately, this is also wrong: v1 = SOMETHING (v1 is QQ) v2:Q0_Q1 = COPY v1 (v1 is QQ, v2 is QQQ) ... uses of v2 as a QQQ The canonical rematerialization here is "v2:Q0_Q1 = SOMETHING". However current logic would decide that v2 could be a QQ (no interest is taken in later uses). This patch, therefore, always accepts the widened register class without trying to be clever. Generally there is no penalty to this (e.g. in the common GR32 < GR64 case, expanding the width doesn't matter because it's not like you were going to do anything else with the high bits of a GR32 register). It can increase register pressure in cases like the ARM VFP regs though (multiple non-overlapping but equivalent subregisters). Hopefully this situation is rare enough that it won't matter. Unfortunately, no in-tree targets actually expose this as far as I can tell (there are so few isAsCheapAsAMove instructions for it to trigger on) so I've been unable to produce a test. It was exposed in our ARM64 SPEC tests though, and I will be adding a test there that we should be able to contribute soon(TM). llvm-svn: 199091	2014-01-13 10:47:01 +00:00
Chandler Carruth	5ad5f15cff	[cleanup] Move the Dominators.h and Verifier.h headers into the IR directory. These passes are already defined in the IR library, and it doesn't make any sense to have the headers in Analysis. Long term, I think there is going to be a much better way to divide these matters. The dominators code should be fully separated into the abstract graph algorithm and have that put in Support where it becomes obvious that evn Clang's CFGBlock's can use it. Then the verifier can manually construct dominance information from the Support-driven interface while the Analysis library can provide a pass which both caches, reconstructs, and supports a nice update API. But those are very long term, and so I don't want to leave the really confusing structure until that day arrives. llvm-svn: 199082	2014-01-13 09:26:24 +00:00
Chandler Carruth	07baed53e8	Re-sort #include lines again, prior to moving headers around. llvm-svn: 199080	2014-01-13 08:04:33 +00:00
Chandler Carruth	b7bdfd65ac	[PM] Wire up support for writing bitcode with new PM. This moves the old pass creation functionality to its own header and updates the callers of that routine. Then it adds a new PM supporting bitcode writer to the header file, and wires that up in the opt tool. A test is added that round-trips code into bitcode and back out using the new pass manager. llvm-svn: 199078	2014-01-13 07:38:24 +00:00
Kevin Qin	cfef55d6d4	[AArch64 NEON] Add missing patterns for bitcast from or to v1f64 llvm-svn: 199070	2014-01-13 01:58:38 +00:00
Kevin Qin	21e8f1c4eb	[AArch64 NEON] Add more scenarios to use perm instructions when lowering shuffle_vector This patch covered 2 more scenarios: 1. Two operands of shuffle_vector are the same, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> %a, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> 2. One of operands is undef, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> undef, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> After this patch, perm instructions will have chance to be emitted instead of lots of INS. llvm-svn: 199069	2014-01-13 01:56:29 +00:00
Saleem Abdulrasool	a6505ca4c2	correct target directive handling error handling The target specific parser should return `false' if the target AsmParser handles the directive, and `true' if the generic parser should handle the directive. Many of the target specific directive handlers would `return Error' which does not follow these semantics. This change simply changes the target specific routines to conform to the semantis of the ParseDirective correctly. Conformance to the semantics improves diagnostics emitted for the invalid directives. X86 is taken as a sample to ensure that multiple diagnostics are not presented for a single error. llvm-svn: 199068	2014-01-13 01:15:39 +00:00
Jakob Stoklund Olesen	1995b9fead	Handle bundled terminators in isBlockOnlyReachableByFallthrough. Targets like SPARC and MIPS have delay slots and normally bundle the delay slot instruction with the corresponding terminator. Teach isBlockOnlyReachableByFallthrough to find any MBB operands on bundled terminators so SPARC doesn't need to specialize this function. llvm-svn: 199061	2014-01-12 19:24:08 +00:00
NAKAMURA Takumi	4961f7a888	raw_fd_ostream: Don't change STDERR to O_BINARY, or w*printf() (in assert()) would barf wide chars after llvm::errs(). llvm-svn: 199057	2014-01-12 16:14:24 +00:00
NAKAMURA Takumi	79addb8d8f	raw_stream formatter: [Win32] Use std::signbit() if available, instead of _fpclass(). FIXME: It should be generic to C++11. For now, it is dedicated to mingw-w64. llvm-svn: 199052	2014-01-12 14:44:46 +00:00
Nico Rieck	b5262d6d8f	Fix non-deterministic SDNodeOrder-dependent codegen Reset SelectionDAGBuilder's SDNodeOrder to ensure deterministic code generation. llvm-svn: 199050	2014-01-12 14:09:17 +00:00
Chandler Carruth	52eef8876e	[PM] Add module and function printing passes for the new pass manager. This implements the legacy passes in terms of the new ones. It adds basic testing using explicit runs of the passes. Next up will be wiring the basic output mechanism of opt up when the new pass manager is engaged unless bitcode writing is requested. llvm-svn: 199049	2014-01-12 12:15:39 +00:00
Chandler Carruth	e0af664cd8	[PM] Simplify the IR printing passes significantly now that a narrower API is exposed. This removes the support for deleting the ostream, switches the member and constructor order arround to be consistent with the creation routines, and switches to using references. llvm-svn: 199047	2014-01-12 11:40:03 +00:00
Chandler Carruth	9d805139bd	[PM] Simplify the interface exposed for IR printing passes. Nothing was using the ability of the pass to delete the raw_ostream it printed to, and nothing was trying to pass it a pointer to the raw_ostream. Also, the function variant had a different order of arguments from all of the others which was just really confusing. Now the interface accepts a reference, doesn't offer to delete it, and uses a consistent order. The implementation of the printing passes haven't been updated with this simplification, this is just the API switch. llvm-svn: 199044	2014-01-12 11:30:46 +00:00
Chandler Carruth	3dd261d0c9	[PM] Run clang-format and remove redundant or obvious comments before the heavy factoring needed to share logic between the new pass manager and the old. llvm-svn: 199043	2014-01-12 11:16:01 +00:00
Chandler Carruth	b8ddc7043c	[PM] Rename the IR printing pass header to a more generic and correct name to match the source file which I got earlier. Update the include sites. Also modernize the comments in the header to use the more recommended doxygen style. llvm-svn: 199041	2014-01-12 11:10:32 +00:00
Saleem Abdulrasool	bdae4b8743	ARM IAS: fix diagnostics of improper qualification An improper qualifier would result in a superfluous error due to the parser not consuming the remainder of the statement. Simply consume the remainder of the statement to avoid the error. llvm-svn: 199035	2014-01-12 05:25:44 +00:00
Venkatraman Govindaraju	cd4d9ac62a	[Sparc] Add support for parsing floating point instructions. llvm-svn: 199033	2014-01-12 04:48:54 +00:00
Saleem Abdulrasool	fb3950ec63	ARM: change implicit immediate forms of {ld,st}r{,b}t to psuedo-instructions The implicit immediate 0 forms are assembly aliases, not distinct instruction encodings. Fix the initial implementation introduced in r198914 to an alias to avoid two separate instruction definitions for the same encoding. An InstAlias is insufficient in this case as the necessary due to the need to add a new additional operand for the implicit zero. By using the AsmPsuedoInst, fall back to the C++ code to transform the instruction to the equivalent _POST_IMM form, inserting the additional implicit immediate 0. llvm-svn: 199032	2014-01-12 04:36:01 +00:00
Venkatraman Govindaraju	0b9debf1f6	[Sparc] Replace (unsigned)-1 with ~OU as suggested by Reid Kleckner. llvm-svn: 199031	2014-01-12 04:34:31 +00:00
Jakob Stoklund Olesen	e7084a1c5c	The SPARCv9 ABI returns a float in %f0. This is different from the argument passing convention which puts the first float argument in %f1. With this patch, all returned floats are treated as if the 'inreg' flag were set. This means multiple float return values get packed in %f0, %f1, %f2, ... Note that when returning a struct in registers, clang will set the 'inreg' flag on the return value, so that behavior is unchanged. This also happens when returning a float _Complex. llvm-svn: 199028	2014-01-12 04:13:17 +00:00
Joerg Sonnenberger	485f00fe0f	Add missing mul aliases for armv4 support. Add checks that armv4 can assemble the various mul instructions. llvm-svn: 199026	2014-01-12 03:35:18 +00:00
Hans Wennborg	ac114a3ce7	Switch-to-lookup tables: Don't require a result for the default case when the lookup table doesn't have any holes. This means we can build a lookup table for switches like this: switch (x) { case 0: return 1; case 1: return 2; case 2: return 3; case 3: return 4; default: exit(1); } The default case doesn't yield a constant result here, but that doesn't matter, since a default result is only necessary for filling holes in the lookup table, and this table doesn't have any holes. This makes us transform 505 more switches in a clang bootstrap, and shaves 164 KB off the resulting clang binary. llvm-svn: 199025	2014-01-12 00:44:41 +00:00
Venkatraman Govindaraju	a66b314c34	[Sparc] Add missing processor types: v7 and niagara llvm-svn: 199024	2014-01-11 23:56:13 +00:00
Saleem Abdulrasool	2d48edeca3	ARM IAS: support emitting constant values in target expressions A 32-bit immediate value can be formed from a constant expression and loaded into a register. Add support to emit this into an object file. Because this value is a constant, a relocation must not be produced for it. llvm-svn: 199023	2014-01-11 23:03:48 +00:00
Arnold Schwaighofer	66c742aeea	LoopVectorizer: Enable strided memory accesses versioning per default I saw no compile or execution time regressions on x86_64 -mavx -O3. radar://13075509 llvm-svn: 199015	2014-01-11 20:40:34 +00:00
Venkatraman Govindaraju	0653218b2b	[Sparc] Bundle instruction with delay slow and its filler. Now, we can use -verify-machineinstrs with SPARC backend. llvm-svn: 199014	2014-01-11 19:38:03 +00:00
Alp Toker	798060e006	Fix 'ned' typo in doc comment Patch by Jasper Neumann! llvm-svn: 199007	2014-01-11 14:01:43 +00:00
Chandler Carruth	a13f27cc34	[PM] Add names to passes under the new pass manager, and a debug output mode that can be used to debug the execution of everything. No support for analyses here, that will come later. This already helps show parts of the opt commandline integration that isn't working. Tests of that will start using it as the bugs are fixed. llvm-svn: 199004	2014-01-11 11:52:05 +00:00

1 2 3 4 5 ...

66493 Commits