llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	43ea3478bf	LTO: add API to set strategy for -internalize Add API to LTOCodeGenerator to specify a strategy for the -internalize pass. This is a new attempt at Bill's change in r185882, which he reverted in r188029 due to problems with the gold linker. This puts the onus on the linker to decide whether (and what) to internalize. In particular, running internalize before outputting an object file may change a 'weak' symbol into an internal one, even though that symbol could be needed by an external object file --- e.g., with arclite. This patch enables three strategies: - LTO_INTERNALIZE_FULL: the default (and the old behaviour). - LTO_INTERNALIZE_NONE: skip -internalize. - LTO_INTERNALIZE_HIDDEN: only -internalize symbols with hidden visibility. LTO_INTERNALIZE_FULL should be used when linking an executable. Outputting an object file (e.g., via ld -r) is more complicated, and depends on whether hidden symbols should be internalized. E.g., for ld -r, LTO_INTERNALIZE_NONE can be used when -keep_private_externs, and LTO_INTERNALIZE_HIDDEN can be used otherwise. However, LTO_INTERNALIZE_FULL is inappropriate, since the output object file will eventually need to link with others. lto_codegen_set_internalize_strategy() sets the strategy for subsequent calls to lto_codegen_write_merged_modules() and lto_codegen_compile*(). <rdar://problem/14334895> llvm-svn: 199191	2014-01-14 06:37:26 +00:00
Jakob Stoklund Olesen	b6b35a4955	Always let value types influence register classes. When creating a virtual register for a def, the value type should be used to pick the register class. If we only use the register class constraint on the instruction, we might pick a too large register class. Some registers can store values of different sizes. For example, the x86 xmm registers can hold f32, f64, and 128-bit vectors. The three different value sizes are represented by register classes with identical register sets: FR32, FR64, and VR128. These register classes have different spill slot sizes, so it is important to use the right one. The register class constraint on an instruction doesn't necessarily care about the size of the value its defining. The value type determines that. This fixes a problem where InstrEmitter was picking 32-bit register classes for 64-bit values on SPARC. llvm-svn: 199187	2014-01-14 06:18:38 +00:00
Jakob Stoklund Olesen	209120621a	Switch the NEON register class from QPR to DPair. The already allocatable DPair superclass contains odd-even D register pair in addition to the even-odd pairs in the QPR register class. There is no reason to constrain the set of D register pairs that can be used for NEON values. Any NEON instructions that require a Q register will automatically constrain the register class to QPR. The allocation order for DPair begins with the QPR registers, so register allocation is unlikely to change much. llvm-svn: 199186	2014-01-14 06:18:34 +00:00
Chandler Carruth	e2d5663b10	[PM] Fix stale header blocker, found by Duncan Smith in code review! llvm-svn: 199185	2014-01-14 05:50:19 +00:00
Chandler Carruth	9384786b14	Remove the last weird subproject, 'privbracket'. llvm-svn: 199183	2014-01-14 05:05:18 +00:00
Chandler Carruth	75a6545d0e	Add checks to configure for sufficiently modern host compilers. This requires Clang 3.1 or GCC 4.7. If the compiler isn't Clang or GCC, we don't try to do any sanity checking, but this give us at least a reasonable baseline of modern compilers. Also, I'm not claiming that this is the best way to do compiler version tests. I'm happy for anyone to suggest better ways of doing this test. llvm-svn: 199182	2014-01-14 05:02:38 +00:00
Rafael Espindola	6d5f7ce348	Replace .mips_hack_stocg with ".set micromips" and ".set nomicromips". This matches what gnu as does and implementing this is easier than arguing about it. llvm-svn: 199181	2014-01-14 04:25:13 +00:00
Mark Seaborn	8271118a65	Fix llc to not reuse spill slots in functions that invoke setjmp() We need to ensure that StackSlotColoring.cpp does not reuse stack spill slots in functions that call "returns_twice" functions such as setjmp(), otherwise this can lead to miscompiled code, because a stack slot would be clobbered when it's still live. This was already handled correctly for functions that call setjmp() (though this wasn't covered by a test), but not for functions that invoke setjmp(). We fix this by changing callsFunctionThatReturnsTwice() to check for invoke instructions. This fixes PR18244. llvm-svn: 199180	2014-01-14 04:20:01 +00:00
Chandler Carruth	af968eda27	Ok, really, for the last time, llvm-gcc is dead Jim. Also, so is stacker, llvm-tv, etc. Wow. But will someone please fess up to what projects/privbracket is and why our autoconf build supports it? llvm-svn: 199179	2014-01-14 04:01:01 +00:00
Chandler Carruth	b4dd3c6840	llvm-gcc is dead. REALLY. IT'S DEAD JIM. llvm-svn: 199178	2014-01-14 03:46:00 +00:00
Rafael Espindola	4a1a360634	Make getTargetStreamer return a possibly null pointer. This will allow it to be called from target independent parts of the main streamer that don't know if there is a registered target streamer or not. This in turn will allow targets to perform extra actions at specified points in the interface: add extra flags for some labels, extra work during finalization, etc. llvm-svn: 199174	2014-01-14 01:21:46 +00:00
Duncan P. N. Exon Smith	f56df6e999	Remove extra } in documentation comment llvm-svn: 199162	2014-01-13 23:11:48 +00:00
Cameron McInally	da3bba445b	Clean up RUN command for Assembler/getInt.ll. llvm-svn: 199158	2014-01-13 22:37:35 +00:00
Chandler Carruth	8388597361	Factor the option and checking of compiler version better. Put the option with the others in the top level CMakeLists, and put the check in HandleLLVMOptions. This will also let it be used from the standalone Clang builds. llvm-svn: 199149	2014-01-13 22:21:34 +00:00
Chandler Carruth	5aad86a940	Raise the minimum CMake version to 2.8.8 -- we have a report that the compiler version checking doesn't work on 2.8.7. This feature was documented in 2.8.10, but existed for an unknown amount of time before that. I'm actually happy to revert this and remove the use of the feature if there is anyone with a specific problem updating CMake. Please just let me know. I don't want to re-implement this CMake functionality unless there is a reason, and this is the only real way to find that out. llvm-svn: 199148	2014-01-13 22:05:20 +00:00
Cameron McInally	f0379fa41a	Fix uninitialized warning in llvm/lib/IR/DataLayout.cpp. llvm-svn: 199147	2014-01-13 22:04:55 +00:00
Juergen Ributzka	6840282c99	[DAG] Refactor ReassociateOps - no functional change intended. llvm-svn: 199146	2014-01-13 21:49:25 +00:00
Chandler Carruth	24b40f59da	Add a check that the host compiler is modern to CMake, take 1. This is likely to be reverted and re-applied a few times. The minimum versions we're aiming at: GCC 4.7 Clang 3.1 MSVC 17.0 (Visual Studio 2012) Let me know if something breaks! llvm-svn: 199145	2014-01-13 21:47:35 +00:00
Juergen Ributzka	7384405f23	[DAG] Teach DAG to also reassociate vector operations This commit teaches DAG to reassociate vector ops, which in turn enables constant folding of vector op chains that appear later on during custom lowering and DAG combine. Reviewed by Andrea Di Biagio llvm-svn: 199135	2014-01-13 20:51:35 +00:00
Andrew Trick	7daf6a45f4	Hide the pre-RA-sched= option. This is a very confusing option for a feature that will go away. -enable-misched is exposed instead to help triage issues with the new scheduler. llvm-svn: 199133	2014-01-13 20:08:27 +00:00
Weiming Zhao	f66be56bf7	Fix PR 18369: [Thumbv8] asserts due to inconsistent CPSR liveness of IT blocks The issue is caused when Post-RA scheduler reorders a bundle instruction (IT block). However, it only flips the CPSR liveness of the bundle instruction, leaves the instructions inside the bundle unchanged, which causes inconstancy and crashes Thumb2SizeReduction.cpp::ReduceMBB(). llvm-svn: 199127	2014-01-13 18:47:54 +00:00
Rafael Espindola	5b6c1e8e59	Update getLazyBitcodeModule to use ErrorOr for error handling. llvm-svn: 199125	2014-01-13 18:31:04 +00:00
Andrea Di Biagio	9bc0415c1f	[AArch64] Fix assertion failure caused by an invalid comparison between APInt values. APInt only knows how to compare values with the same BitWidth and asserts in all other cases. With this fix, function PerformORCombine does not use the APInt equality operator if the APInt values returned by 'isConstantSplat' differ in BitWidth. In that case they are different and no comparison is needed. llvm-svn: 199119	2014-01-13 16:51:00 +00:00
Joerg Sonnenberger	808df6725f	Fix indentation. llvm-svn: 199118	2014-01-13 15:50:36 +00:00
Richard Sandiford	36b376914d	[SystemZ] Flesh out stackrestore test (frame-11.ll) ...so that it does something vaguely sensible. llvm-svn: 199117	2014-01-13 15:44:44 +00:00
Richard Sandiford	9b9e057ced	[SystemZ] Add "volatile" to a dead store in variable-loc.ll llvm-svn: 199116	2014-01-13 15:42:16 +00:00
Richard Sandiford	64c0c4c015	[SystemZ] Improve risbg-01.ll test The old mask in f24 wasn't well chosen because the lshr would always be zero. CodeGen didn't detect this but InstCombine would. The new mask ensures that both shifts are needed. f26 is specifically testing for a wrap-around mask. The AND can be applied to just the shift left, either before or after the shift. Again, CodeGen kept it in the original form but InstCombine would mask after the shift instead. The exact choice of NILF isn't important for the test so I just dropped it and kept the rotate. llvm-svn: 199115	2014-01-13 15:40:25 +00:00
Richard Sandiford	32379b8141	[SystemZ] Optimize (sext (ashr (shl ...), ...)) ...into (ashr (shl (anyext X), ...), ...), which requires one fewer instruction. The (anyext X) can sometimes be simplified too. I didn't do this in DAGCombiner because widening shifts isn't a win on all targets. llvm-svn: 199114	2014-01-13 15:17:53 +00:00
Chris Lattner	bdf5178467	fix a -Wdocumentation warning. llvm-svn: 199113	2014-01-13 15:10:11 +00:00
Tim Northover	7d074a5ad6	ARM: add test for r199108. Oops. rdar://problem/15800156 llvm-svn: 199109	2014-01-13 14:20:25 +00:00
Tim Northover	1328c1ae32	ARM: constrain Thumb LDRLIT pseudo-instructions to r0-r7. Previously we only used GPR for the destination placeholder in "ldr rD, [pc, incorrect codegen under the integrated assembler. This should fix both issues (which probably only affect MachO targets at the moment). rdar://problem/15800156 llvm-svn: 199108	2014-01-13 14:19:17 +00:00
David Woodhouse	4e033b0e92	[x86] Fix retq/retl handling in 64-bit mode This finishes the job started in r198756, and creates separate opcodes for 64-bit vs. 32-bit versions of the rest of the RET instructions too. LRETL/LRETQ are interesting... I can't see any justification for their existence in the SDM. There should be no 'LRETL' in 64-bit mode, and no need for a REX.W prefix for LRETQ. But this is what GAS does, and my Sandybridge CPU and an Opteron 6376 concur when tested as follows: asm __volatile__("pushq $0x1234\nmovq $0x33,%rax\nsalq $32,%rax\norq $1f,%rax\npushq %rax\nlretl $8\n1:"); asm __volatile__("pushq $1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); asm __volatile__("pushq $0x33\npushq $1f\nlretq\n1:"); asm __volatile__("pushq $0x1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); cf. PR8592 and commit r118903, which added LRETQ. I only added LRETIQ to match it. I don't quite understand how the Intel syntax parsing for ret instructions is working, despite r154468 allegedly fixing it. Aren't the explicitly sized 'retw', 'retd' and 'retq' supposed to work? I have at least made the 'lretq' work with (and indeed require) the 'q'. llvm-svn: 199106	2014-01-13 14:05:59 +00:00
Chandler Carruth	73523021d0	[PM] Split DominatorTree into a concrete analysis result object which can be used by both the new pass manager and the old. This removes it from any of the virtual mess of the pass interfaces and lets it derive cleanly from the DominatorTreeBase<> template. In turn, tons of boilerplate interface can be nuked and it turns into a very straightforward extension of the base DominatorTree interface. The old analysis pass is now a simple wrapper. The names and style of this split should match the split between CallGraph and CallGraphWrapperPass. All of the users of DominatorTree have been updated to match using many of the same tricks as with CallGraph. The goal is that the common type remains the resulting DominatorTree rather than the pass. This will make subsequent work toward the new pass manager significantly easier. Also in numerous places things became cleaner because I switched from re-running the pass (!!! mid way through some other passes run!!!) to directly recomputing the domtree. llvm-svn: 199104	2014-01-13 13:07:17 +00:00
Chandler Carruth	ca9af6cad9	[PM][cleanup] Clean up comments and use modern doxygen in this file. This is a precursor to breaking the pass that computes the DominatorTree apart from the concrete DominatorTree. llvm-svn: 199103	2014-01-13 13:06:58 +00:00
Elena Demikhovsky	b19c9dc1a1	AVX-512: Embedded Rounding Control - encoding and printing Changed intrinsics for vrcp14/vrcp28 vrsqrt14/vrsqrt28 - aligned with GCC. llvm-svn: 199102	2014-01-13 12:55:03 +00:00
Chandler Carruth	db9120a037	[PM] Fix the const-correctness of the generic DominatorTreeBase to support notionally const queries even though they may trigger DFS numbering updates. The updating of DFS numbers and tracking of slow queries do not mutate the observable state of the domtree. They should be const to differentiate them from the APIs which mutate the tree directly to do incremental updates. This will make it possible in a world where the DominatorTree is not a pass but merely the result of running a pass to derive DominatorTree from the base class as it was originally designed, removing a huge duplication of API in DominatorTree. llvm-svn: 199101	2014-01-13 11:58:34 +00:00
Chandler Carruth	e509db410a	[PM] Pull the generic graph algorithms and data structures for dominator trees into the Support library. These are all expressed in terms of the generic GraphTraits and CFG, with no reliance on any concrete IR types. Putting them in support clarifies that and makes the fact that the static analyzer in Clang uses them much more sane. When moving the Dominators.h file into the IR library I claimed that this was the right home for it but not something I planned to work on. Oops. So why am I doing this? It happens to be one step toward breaking the requirement that IR verification can only be performed from inside of a pass context, which completely blocks the implementation of verification for the new pass manager infrastructure. Fixing it will also allow removing the concept of the "preverify" step (WTF???) and allow the verifier to cleanly flag functions which fail verification in a way that precludes even computing dominance information. Currently, that results in a fatal error even when you ask the verifier to not fatally error. It's awesome like that. The yak shaving will continue... llvm-svn: 199095	2014-01-13 10:52:56 +00:00
Tim Northover	7fdd4857f7	Revert "ReMat: fix overly cavalier attitude to sub-register indices" Very sorry, this was a premature patch that I still need to investigate and finish off (for some reason beyond me at the moment it doesn't actually fix the issue in all cases). This reverts commit r199091. llvm-svn: 199093	2014-01-13 10:49:11 +00:00
Tim Northover	cdc5395680	Docs: fix sign of division and increase equivocation on code generated. I should have been a politician. llvm-svn: 199092	2014-01-13 10:47:04 +00:00
Tim Northover	59f8d4b4ee	ReMat: fix overly cavalier attitude to sub-register indices There are two attempted optimisations in reMaterializeTrivialDef, trying to avoid promoting the size of a register too much when rematerializing. Unfortunately, both appear to be flawed. First, we see if the original register would have worked, but this is inadequate. Consider: v1 = SOMETHING (v1 is QQ) v2:Q0 = COPY v1:Q1 (v1, v2 are QQ) ... uses of v2 In this case even though v2 could be used directly as the output of SOMETHING, this would set the wrong bits of the QQ register involved. The correct rematerialization must be: v2:Q0_Q1 = SOMETHING (v2 promoted to QQQ) ... uses of v2:Q1_Q2 For the second optimisation, if the correct remat is "v2:idx = SOMETHING" then we can't necessarily expect v2 itself to be valid for SOMETHING, but we do try to hunt for a class between v1 and v2 that works. Unfortunately, this is also wrong: v1 = SOMETHING (v1 is QQ) v2:Q0_Q1 = COPY v1 (v1 is QQ, v2 is QQQ) ... uses of v2 as a QQQ The canonical rematerialization here is "v2:Q0_Q1 = SOMETHING". However current logic would decide that v2 could be a QQ (no interest is taken in later uses). This patch, therefore, always accepts the widened register class without trying to be clever. Generally there is no penalty to this (e.g. in the common GR32 < GR64 case, expanding the width doesn't matter because it's not like you were going to do anything else with the high bits of a GR32 register). It can increase register pressure in cases like the ARM VFP regs though (multiple non-overlapping but equivalent subregisters). Hopefully this situation is rare enough that it won't matter. Unfortunately, no in-tree targets actually expose this as far as I can tell (there are so few isAsCheapAsAMove instructions for it to trigger on) so I've been unable to produce a test. It was exposed in our ARM64 SPEC tests though, and I will be adding a test there that we should be able to contribute soon(TM). llvm-svn: 199091	2014-01-13 10:47:01 +00:00
Chandler Carruth	20d4e6bee4	[cleanup] Re-sort the examples #include lines with my sort_includes script. llvm-svn: 199089	2014-01-13 09:58:03 +00:00
Chandler Carruth	d7cd9ac914	[cleanup] Fix the includes in the examples for r199082. llvm-svn: 199087	2014-01-13 09:53:45 +00:00
Chandler Carruth	634cdb61d2	[cleanup] Switch comments to use '\brief' style instead of '@brief' style, and remove some unnecessary comments (the code is perfectly self-documenting here). Also clang-format the function declarations as they wrap cleanly now. llvm-svn: 199084	2014-01-13 09:31:09 +00:00
Chandler Carruth	5ad5f15cff	[cleanup] Move the Dominators.h and Verifier.h headers into the IR directory. These passes are already defined in the IR library, and it doesn't make any sense to have the headers in Analysis. Long term, I think there is going to be a much better way to divide these matters. The dominators code should be fully separated into the abstract graph algorithm and have that put in Support where it becomes obvious that evn Clang's CFGBlock's can use it. Then the verifier can manually construct dominance information from the Support-driven interface while the Analysis library can provide a pass which both caches, reconstructs, and supports a nice update API. But those are very long term, and so I don't want to leave the really confusing structure until that day arrives. llvm-svn: 199082	2014-01-13 09:26:24 +00:00
Chandler Carruth	01e5037fec	[cleanup] Add a missing include exposed by resorting other includes. Should fix the build. llvm-svn: 199081	2014-01-13 08:09:47 +00:00
Chandler Carruth	07baed53e8	Re-sort #include lines again, prior to moving headers around. llvm-svn: 199080	2014-01-13 08:04:33 +00:00
Chandler Carruth	b7bdfd65ac	[PM] Wire up support for writing bitcode with new PM. This moves the old pass creation functionality to its own header and updates the callers of that routine. Then it adds a new PM supporting bitcode writer to the header file, and wires that up in the opt tool. A test is added that round-trips code into bitcode and back out using the new pass manager. llvm-svn: 199078	2014-01-13 07:38:24 +00:00
NAKAMURA Takumi	eccd28d519	llvm/test/ExecutionEngine/MCJIT/load-object-a.ll: Put together rm(1) and mkdir(1) at the top. llvm-svn: 199077	2014-01-13 05:55:10 +00:00
NAKAMURA Takumi	f0a1ab8f2a	[CMake] Move BUG_REPORT_URL from clang to llvm. It was too late to set BUG_REPORT_URL after configure_file(config.h). BUG_REPORT_URL in config.h.cmake would be updated at 2nd run of cmake. It caused many recompilations. FYI, configure handles BUG_REPORT_URL in llvm side. llvm-svn: 199076	2014-01-13 05:25:13 +00:00
Chandler Carruth	b353c3f7f2	[PM] Wire up support for printing assembly output from the opt command. This lets us round-trip IR in the expected manner with the opt tool. llvm-svn: 199075	2014-01-13 05:16:45 +00:00
Chandler Carruth	949282efec	[PM] Add an enum for describing the desired output strategy, and run that through the interface rather than a simple bool. This should allow starting to wire up real output to round-trip IR through opt with the new pass manager. llvm-svn: 199071	2014-01-13 03:08:40 +00:00
Kevin Qin	cfef55d6d4	[AArch64 NEON] Add missing patterns for bitcast from or to v1f64 llvm-svn: 199070	2014-01-13 01:58:38 +00:00
Kevin Qin	21e8f1c4eb	[AArch64 NEON] Add more scenarios to use perm instructions when lowering shuffle_vector This patch covered 2 more scenarios: 1. Two operands of shuffle_vector are the same, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> %a, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> 2. One of operands is undef, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> undef, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> After this patch, perm instructions will have chance to be emitted instead of lots of INS. llvm-svn: 199069	2014-01-13 01:56:29 +00:00
Saleem Abdulrasool	a6505ca4c2	correct target directive handling error handling The target specific parser should return `false' if the target AsmParser handles the directive, and `true' if the generic parser should handle the directive. Many of the target specific directive handlers would `return Error' which does not follow these semantics. This change simply changes the target specific routines to conform to the semantis of the ParseDirective correctly. Conformance to the semantics improves diagnostics emitted for the invalid directives. X86 is taken as a sample to ensure that multiple diagnostics are not presented for a single error. llvm-svn: 199068	2014-01-13 01:15:39 +00:00
Jakob Stoklund Olesen	1995b9fead	Handle bundled terminators in isBlockOnlyReachableByFallthrough. Targets like SPARC and MIPS have delay slots and normally bundle the delay slot instruction with the corresponding terminator. Teach isBlockOnlyReachableByFallthrough to find any MBB operands on bundled terminators so SPARC doesn't need to specialize this function. llvm-svn: 199061	2014-01-12 19:24:08 +00:00
NAKAMURA Takumi	9668890568	[CMake] Add a comment to tablegen's copy_if_different. Ninja reports every action by default. llvm-svn: 199058	2014-01-12 17:42:43 +00:00
NAKAMURA Takumi	4961f7a888	raw_fd_ostream: Don't change STDERR to O_BINARY, or w*printf() (in assert()) would barf wide chars after llvm::errs(). llvm-svn: 199057	2014-01-12 16:14:24 +00:00
Nico Rieck	f15341c9de	Make test independent of scheduling llvm-svn: 199055	2014-01-12 15:57:38 +00:00
NAKAMURA Takumi	79addb8d8f	raw_stream formatter: [Win32] Use std::signbit() if available, instead of _fpclass(). FIXME: It should be generic to C++11. For now, it is dedicated to mingw-w64. llvm-svn: 199052	2014-01-12 14:44:46 +00:00
NAKAMURA Takumi	d7032ac21e	llvm/test/CodeGen/X86/shl_undef.ll: Tweak to satisfy r199050. Use intel syntax, or "shl" might hit "pushl". llvm-svn: 199051	2014-01-12 14:41:41 +00:00
Nico Rieck	b5262d6d8f	Fix non-deterministic SDNodeOrder-dependent codegen Reset SelectionDAGBuilder's SDNodeOrder to ensure deterministic code generation. llvm-svn: 199050	2014-01-12 14:09:17 +00:00
Chandler Carruth	52eef8876e	[PM] Add module and function printing passes for the new pass manager. This implements the legacy passes in terms of the new ones. It adds basic testing using explicit runs of the passes. Next up will be wiring the basic output mechanism of opt up when the new pass manager is engaged unless bitcode writing is requested. llvm-svn: 199049	2014-01-12 12:15:39 +00:00
Chandler Carruth	4287ad9679	[PM] Revert an accidental commit of total BS code. This was halfway through being editted, and I forgot to delete it before committing. What's more awesome is that it compiles cleanly! llvm-svn: 199048	2014-01-12 11:41:43 +00:00
Chandler Carruth	e0af664cd8	[PM] Simplify the IR printing passes significantly now that a narrower API is exposed. This removes the support for deleting the ostream, switches the member and constructor order arround to be consistent with the creation routines, and switches to using references. llvm-svn: 199047	2014-01-12 11:40:03 +00:00
Chandler Carruth	3bdf043c98	[PM] Update one user of the printing pass API that I missed. llvm-svn: 199046	2014-01-12 11:39:04 +00:00
Chandler Carruth	9d805139bd	[PM] Simplify the interface exposed for IR printing passes. Nothing was using the ability of the pass to delete the raw_ostream it printed to, and nothing was trying to pass it a pointer to the raw_ostream. Also, the function variant had a different order of arguments from all of the others which was just really confusing. Now the interface accepts a reference, doesn't offer to delete it, and uses a consistent order. The implementation of the printing passes haven't been updated with this simplification, this is just the API switch. llvm-svn: 199044	2014-01-12 11:30:46 +00:00
Chandler Carruth	3dd261d0c9	[PM] Run clang-format and remove redundant or obvious comments before the heavy factoring needed to share logic between the new pass manager and the old. llvm-svn: 199043	2014-01-12 11:16:01 +00:00
Chandler Carruth	b8ddc7043c	[PM] Rename the IR printing pass header to a more generic and correct name to match the source file which I got earlier. Update the include sites. Also modernize the comments in the header to use the more recommended doxygen style. llvm-svn: 199041	2014-01-12 11:10:32 +00:00
Chandler Carruth	a54dc82e33	[PM] Un-indent this file-level namespace. It's far more common to not indent the outer-most llvm namespace in header files. llvm-svn: 199040	2014-01-12 10:56:57 +00:00
Chandler Carruth	6546cb6313	[PM] Fix a bunch of bugs I spotted by inspection when working on this code. Copious tests added to cover these cases. llvm-svn: 199039	2014-01-12 10:02:02 +00:00
Chandler Carruth	d833098d17	[PM] Add support for parsing function passes and function pass manager nests to the opt commandline support. This also showcases the implicit-initial-manager support which will be most useful for testing. There are several bugs that I spotted by inspection here that I'll fix with test cases in subsequent commits. llvm-svn: 199038	2014-01-12 09:34:22 +00:00
Saleem Abdulrasool	bdae4b8743	ARM IAS: fix diagnostics of improper qualification An improper qualifier would result in a superfluous error due to the parser not consuming the remainder of the statement. Simply consume the remainder of the statement to avoid the error. llvm-svn: 199035	2014-01-12 05:25:44 +00:00
Venkatraman Govindaraju	cd4d9ac62a	[Sparc] Add support for parsing floating point instructions. llvm-svn: 199033	2014-01-12 04:48:54 +00:00
Saleem Abdulrasool	fb3950ec63	ARM: change implicit immediate forms of {ld,st}r{,b}t to psuedo-instructions The implicit immediate 0 forms are assembly aliases, not distinct instruction encodings. Fix the initial implementation introduced in r198914 to an alias to avoid two separate instruction definitions for the same encoding. An InstAlias is insufficient in this case as the necessary due to the need to add a new additional operand for the implicit zero. By using the AsmPsuedoInst, fall back to the C++ code to transform the instruction to the equivalent _POST_IMM form, inserting the additional implicit immediate 0. llvm-svn: 199032	2014-01-12 04:36:01 +00:00
Venkatraman Govindaraju	0b9debf1f6	[Sparc] Replace (unsigned)-1 with ~OU as suggested by Reid Kleckner. llvm-svn: 199031	2014-01-12 04:34:31 +00:00
Jakob Stoklund Olesen	e7084a1c5c	The SPARCv9 ABI returns a float in %f0. This is different from the argument passing convention which puts the first float argument in %f1. With this patch, all returned floats are treated as if the 'inreg' flag were set. This means multiple float return values get packed in %f0, %f1, %f2, ... Note that when returning a struct in registers, clang will set the 'inreg' flag on the return value, so that behavior is unchanged. This also happens when returning a float _Complex. llvm-svn: 199028	2014-01-12 04:13:17 +00:00
Joerg Sonnenberger	4bde03023b	Typo llvm-svn: 199027	2014-01-12 03:38:30 +00:00
Joerg Sonnenberger	485f00fe0f	Add missing mul aliases for armv4 support. Add checks that armv4 can assemble the various mul instructions. llvm-svn: 199026	2014-01-12 03:35:18 +00:00
Hans Wennborg	ac114a3ce7	Switch-to-lookup tables: Don't require a result for the default case when the lookup table doesn't have any holes. This means we can build a lookup table for switches like this: switch (x) { case 0: return 1; case 1: return 2; case 2: return 3; case 3: return 4; default: exit(1); } The default case doesn't yield a constant result here, but that doesn't matter, since a default result is only necessary for filling holes in the lookup table, and this table doesn't have any holes. This makes us transform 505 more switches in a clang bootstrap, and shaves 164 KB off the resulting clang binary. llvm-svn: 199025	2014-01-12 00:44:41 +00:00
Venkatraman Govindaraju	a66b314c34	[Sparc] Add missing processor types: v7 and niagara llvm-svn: 199024	2014-01-11 23:56:13 +00:00
Saleem Abdulrasool	2d48edeca3	ARM IAS: support emitting constant values in target expressions A 32-bit immediate value can be formed from a constant expression and loaded into a register. Add support to emit this into an object file. Because this value is a constant, a relocation must not be produced for it. llvm-svn: 199023	2014-01-11 23:03:48 +00:00
Benjamin Kramer	c10563d14e	Fix broken CHECK lines. llvm-svn: 199016	2014-01-11 21:06:00 +00:00
Arnold Schwaighofer	66c742aeea	LoopVectorizer: Enable strided memory accesses versioning per default I saw no compile or execution time regressions on x86_64 -mavx -O3. radar://13075509 llvm-svn: 199015	2014-01-11 20:40:34 +00:00
Venkatraman Govindaraju	0653218b2b	[Sparc] Bundle instruction with delay slow and its filler. Now, we can use -verify-machineinstrs with SPARC backend. llvm-svn: 199014	2014-01-11 19:38:03 +00:00
Alp Toker	749971901a	lit: Provide source locations in cfg files with older Python versions This commit prospectively brings the benefits of r198766 to older supported Python versions (2.5+). Tested with Python 2.6, 2.7, 3.1 and 3.3 (!) llvm-svn: 199009	2014-01-11 14:34:18 +00:00
Alp Toker	798060e006	Fix 'ned' typo in doc comment Patch by Jasper Neumann! llvm-svn: 199007	2014-01-11 14:01:43 +00:00
Alp Toker	f0a245944e	lit: execfile() isn't present in Python 3.3 On the other hand, exec(compile()) doesn't work in older Python versions in the 2.x series. This commit introduces exec(compile()) with a fallback to plain exec(). That'll hopefully hit the sweet spot in terms of version support. Followup to r198766 which added enhanced source locations for lit cfg parsing. llvm-svn: 199006	2014-01-11 13:27:28 +00:00
Chandler Carruth	258dbb3b12	[PM] Actually nest pass managers correctly when parsing the pass pipeline string. Add tests that cover this now that we have execution dumping in the pass managers. llvm-svn: 199005	2014-01-11 12:06:47 +00:00
Chandler Carruth	a13f27cc34	[PM] Add names to passes under the new pass manager, and a debug output mode that can be used to debug the execution of everything. No support for analyses here, that will come later. This already helps show parts of the opt commandline integration that isn't working. Tests of that will start using it as the bugs are fixed. llvm-svn: 199004	2014-01-11 11:52:05 +00:00
Chandler Carruth	d7693d8444	[PM] Somehow I missed the header guards on this file. Yikes! llvm-svn: 199003	2014-01-11 10:59:00 +00:00
NAKAMURA Takumi	41c409ce0d	LoopVectorize.cpp: Appease MSC16. Excuse me, I hope msc16 builders would be fine till its end day. Introduce nullptr then. ;) llvm-svn: 199001	2014-01-11 09:59:27 +00:00
NAKAMURA Takumi	a64d0bccc8	llvm/test/Transforms/SampleProfile/syntax.ll: Eliminate locale-sensitive message check. llvm-svn: 199000	2014-01-11 09:23:52 +00:00
NAKAMURA Takumi	80a474c1c3	llvm/test/CodeGen/X86/anyregcc.ll: Add explicit -mtriple=x86_64-unknown-unknown. XMM(s) are really spilling for targeting Win64. llvm-svn: 198999	2014-01-11 09:23:44 +00:00
Chandler Carruth	66445382ff	[PM] Add (very skeletal) support to opt for running the new pass manager. I cannot emphasize enough that this is a WIP. =] I expect it to change a great deal as things stabilize, but I think its really important to get some functionality here so that the infrastructure can be tested more traditionally from the commandline. The current design is looking something like this: ./bin/opt -passes='module(pass_a,pass_b,function(pass_c,pass_d))' So rather than custom-parsed flags, there is a single flag with a string argument that is parsed into the pass pipeline structure. This makes it really easy to have nice structural properties that are very explicit. There is one obvious and important shortcut. You can start off the pipeline with a pass, and the minimal context of pass managers will be built around the entire specified pipeline. This makes the common case for tests super easy: ./bin/opt -passes=instcombine,sroa,gvn But this won't introduce any of the complexity of the fully inferred old system -- we only ever do this for the entire argument, and we only look at the first pass. If the other passes don't fit in the pass manager selected it is a hard error. The other interesting aspect here is that I'm not relying on any registration facilities. Such facilities may be unavoidable for supporting plugins, but I have alternative ideas for plugins that I'd like to try first. My plan is essentially to build everything without registration until we hit an absolute requirement. Instead of registration of pass names, there will be a library dedicated to parsing pass names and the pass pipeline strings described above. Currently, this is directly embedded into opt for simplicity as it is very early, but I plan to eventually pull this into a library that opt, bugpoint, and even Clang can depend on. It should end up as a good home for things like the existing PassManagerBuilder as well. There are a bunch of FIXMEs in the code for the parts of this that are just stubbed out to make the patch more incremental. A quick list of what's coming up directly after this: - Support for function passes and building the structured nesting. - Support for printing the pass structure, and FileCheck tests of all of this code. - The .def-file based pass name parsing. - IR priting passes and the corresponding tests. Some obvious things that I'm not going to do right now, but am definitely planning on as the pass manager work gets a bit further: - Pull the parsing into library, including the builders. - Thread the rest of the target stuff into the new pass manager. - Wire support for the new pass manager up to llc. - Plugin support. Some things that I'd like to have, but are significantly lower on my priority list. I'll get to these eventually, but they may also be places where others want to contribute: - Adding nice error reporting for broken pass pipeline descriptions. - Typo-correction for pass names. llvm-svn: 198998	2014-01-11 08:16:35 +00:00
Juergen Ributzka	976d94b834	[anyregcc] Fix callee-save mask for anyregcc Use separate callee-save masks for XMM and YMM registers for anyregcc on X86 and select the proper mask depending on the target cpu we compile for. llvm-svn: 198985	2014-01-11 01:00:27 +00:00
Eric Christopher	942f22c439	Revert r198979 - accidental commit. llvm-svn: 198981	2014-01-11 00:28:12 +00:00
Eric Christopher	ceec7b02fa	Reformat. llvm-svn: 198980	2014-01-11 00:23:18 +00:00
Eric Christopher	67cde9ac07	Update function name and add some helpful comments. llvm-svn: 198979	2014-01-11 00:23:16 +00:00
Eric Christopher	a052e12c97	Fix odd whitespace. llvm-svn: 198978	2014-01-11 00:23:11 +00:00
Diego Novillo	9518b63bfc	Extend and simplify the sample profile input file. 1- Use the line_iterator class to read profile files. 2- Allow comments in profile file. Lines starting with '#' are completely ignored while reading the profile. 3- Add parsing support for discriminators and indirect call samples. Our external profiler can emit more profile information that we are currently not handling. This patch does not add new functionality to support this information, but it allows profile files to provide it. I will add actual support later on (for at least one of these features, I need support for DWARF discriminators in Clang). A sample line may contain the following additional information: Discriminator. This is used if the sampled program was compiled with DWARF discriminator support (http://wiki.dwarfstd.org/index.php?title=Path_Discriminators). This is currently only emitted by GCC and we just ignore it. Potential call targets and samples. If present, this line contains a call instruction. This models both direct and indirect calls. Each called target is listed together with the number of samples. For example, 130: 7 foo:3 bar:2 baz:7 The above means that at relative line offset 130 there is a call instruction that calls one of foo(), bar() and baz(). With baz() being the relatively more frequent call target. Differential Revision: http://llvm-reviews.chandlerc.com/D2355 4- Simplify format of profile input file. This implements earlier suggestions to simplify the format of the sample profile file. The symbol table is not necessary and function profiles do not need to know the number of samples in advance. Differential Revision: http://llvm-reviews.chandlerc.com/D2419 llvm-svn: 198973	2014-01-10 23:23:51 +00:00
Diego Novillo	0accb3d2bc	Propagation of profile samples through the CFG. This adds a propagation heuristic to convert instruction samples into branch weights. It implements a similar heuristic to the one implemented by Dehao Chen on GCC. The propagation proceeds in 3 phases: 1- Assignment of block weights. All the basic blocks in the function are initial assigned the same weight as their most frequently executed instruction. 2- Creation of equivalence classes. Since samples may be missing from blocks, we can fill in the gaps by setting the weights of all the blocks in the same equivalence class to the same weight. To compute the concept of equivalence, we use dominance and loop information. Two blocks B1 and B2 are in the same equivalence class if B1 dominates B2, B2 post-dominates B1 and both are in the same loop. 3- Propagation of block weights into edges. This uses a simple propagation heuristic. The following rules are applied to every block B in the CFG: - If B has a single predecessor/successor, then the weight of that edge is the weight of the block. - If all the edges are known except one, and the weight of the block is already known, the weight of the unknown edge will be the weight of the block minus the sum of all the known edges. If the sum of all the known edges is larger than B's weight, we set the unknown edge weight to zero. - If there is a self-referential edge, and the weight of the block is known, the weight for that edge is set to the weight of the block minus the weight of the other incoming edges to that block (if known). Since this propagation is not guaranteed to finalize for every CFG, we only allow it to proceed for a limited number of iterations (controlled by -sample-profile-max-propagate-iterations). It currently uses the same GCC default of 100. Before propagation starts, the pass builds (for each block) a list of unique predecessors and successors. This is necessary to handle identical edges in multiway branches. Since we visit all blocks and all edges of the CFG, it is cleaner to build these lists once at the start of the pass. Finally, the patch fixes the computation of relative line locations. The profiler emits lines relative to the function header. To discover it, we traverse the compilation unit looking for the subprogram corresponding to the function. The line number of that subprogram is the line where the function begins. That becomes line zero for all the relative locations. llvm-svn: 198972	2014-01-10 23:23:46 +00:00
Tom Roeder	583a77e09d	Space formatting fix for r198966. llvm-svn: 198971	2014-01-10 23:17:39 +00:00
Roman Divacky	9dc6df5744	Constant propagate MachineInstrClassName. llvm-svn: 198969	2014-01-10 22:59:49 +00:00
Tom Roeder	9b41aa7275	Fixing build break: should be in the if statement, not outside. llvm-svn: 198966	2014-01-10 22:55:25 +00:00
Tom Roeder	50b892e7d5	Restore the library dependency of LLVMgold on LTO; this was removed recently but is needed for LLVMgold to load in ld. llvm-svn: 198965	2014-01-10 22:48:35 +00:00
Rafael Espindola	1840ad4e57	Add a note about the old asm printer being removed. llvm-svn: 198960	2014-01-10 22:06:26 +00:00
Rafael Espindola	f581314932	All backends use MC now. llvm-svn: 198959	2014-01-10 21:49:27 +00:00
Rafael Espindola	81e7fd011f	Use the simpler version of sys::fs::remove when possible. llvm-svn: 198958	2014-01-10 21:40:29 +00:00
Rafael Espindola	78dcc03c37	Remove remove_all. A compiler has no need for recursively deleting a directory. llvm-svn: 198955	2014-01-10 20:36:42 +00:00
Duncan P. N. Exon Smith	bccb4fdd05	LTO: whitespace changes llvm-svn: 198954	2014-01-10 20:24:35 +00:00
Arnold Schwaighofer	c2e9d759f2	LoopVectorizer: Handle strided memory accesses by versioning for (i = 0; i < N; ++i) A[i * Stride1] += B[i * Stride2]; We take loops like this and check that the symbolic strides 'Strided1/2' are one and drop to the scalar loop if they are not. This is currently disabled by default and hidden behind the flag 'enable-mem-access-versioning'. radar://13075509 llvm-svn: 198950	2014-01-10 18:20:32 +00:00
Arnold Schwaighofer	cebfcceec1	SCEVRewriter: Optionally interpret constants in value map as SCEVConstant An upcoming loop vectorizer commit will want to replace a SCEVUnknown(Value*) by a SCEVConstant. This commit modifies the SCEVParameterRewriter to support this. The SCEVParameterRewriter constructor can optionally specify to follow this behavior. llvm-svn: 198949	2014-01-10 18:20:29 +00:00
Artyom Skrobov	4e62c0b2b2	Amending test/MC/ARM/thumb2-mclass.s to match its apparent original purpose (to test the ARMv6M/ARMv7M commonality), and creating a new test case for the differences between ARMv6M and ARMv7M llvm-svn: 198946	2014-01-10 16:49:49 +00:00
Artyom Skrobov	4d91d944ae	Must not produce Tag_CPU_arch_profile for pre-ARMv7 cores (e.g. cortex-m0) llvm-svn: 198945	2014-01-10 16:42:55 +00:00
Saleem Abdulrasool	b16c09f241	ARM: fix regression caused by r198914 The disassembler would no longer be able to disambiguage between the two variants (explicit immediate #0 vs implicit, omitted #0) for the ldrt, strt, ldrbt, strbt mnemonics as both versions indicated the disassembler routine. llvm-svn: 198944	2014-01-10 16:22:47 +00:00
Kristof Beyls	90ff80e329	Silence unused variable warning for non-asserting builds that was introduced in r198937. llvm-svn: 198941	2014-01-10 14:20:45 +00:00
Rafael Espindola	af77e1205a	Use 'w' instead of 'c' to represent the win32 mangling. This change was requested to avoid confusion if we ever support non windows coff systems. llvm-svn: 198938	2014-01-10 13:42:12 +00:00
Kristof Beyls	58306ad903	Make sure -use-init-array has intended effect on all AArch64 ELF targets, not just linux. llvm-svn: 198937	2014-01-10 13:41:49 +00:00
NAKAMURA Takumi	ea1ff6fe33	Whitespace. llvm-svn: 198934	2014-01-10 11:12:01 +00:00
NAKAMURA Takumi	1f5cf85fd4	Sink add_llvm_library(gtest_main) to UnitTestMain/CMakeLists.txt. llvm-svn: 198933	2014-01-10 11:02:26 +00:00
NAKAMURA Takumi	d38ac74662	llvm/test/ExecutionEngine/MCJIT/load-object-a.ll: Remove "REQUIRES:shell". This doesn't depend on shell's behavior. llvm-svn: 198931	2014-01-10 10:38:52 +00:00
NAKAMURA Takumi	566080cc80	llvm/test/ExecutionEngine/MCJIT/lit.local.cfg: Add "AMD64" in the host_arch list. FIXME: We should not take CMake's ${CMAKE_SYSTEM_PROCESSOR}... llvm-svn: 198930	2014-01-10 10:38:46 +00:00
NAKAMURA Takumi	d7fd6d99ec	lli: Tweak CacheName not to contain DOS driveletter. llvm-svn: 198929	2014-01-10 10:38:40 +00:00
NAKAMURA Takumi	390e060916	lli: LLIObjectCache: Use llvm::sys::path to get dirname. llvm-svn: 198928	2014-01-10 10:38:34 +00:00
NAKAMURA Takumi	f462f9c7e0	Whitespace. llvm-svn: 198927	2014-01-10 10:38:28 +00:00
NAKAMURA Takumi	52f9d3818b	llvm/test/ExecutionEngine/MCJIT/load-object-a.ll: Fix not to use %t.cachedir/%p. %p is like X:\foo\bar. llvm-svn: 198926	2014-01-10 10:38:23 +00:00
Kostya Serebryany	a6afef7a51	reapply r198858: Disable LeakSanitizer in TableGen binaries, see PR18325; this time LeakSanitizerIsTurnedOffForTheCurrentProcess is used instead of __lsan_is_turned_off llvm-svn: 198922	2014-01-10 08:05:42 +00:00
Saleem Abdulrasool	435f45653a	ARM IAS: support #:{lower,upper}16: for GNU compatibility The GNU assembler supports prefixing the expression with a '#' to indiciate that the value that is being moved is infact a constant. This improves the compatibility of the integrated assembler's parser for this. llvm-svn: 198916	2014-01-10 04:38:40 +00:00
Saleem Abdulrasool	e6e6d71477	ARM IAS: support GNU extension for ldrd, strd The GNU assembler has an extension that allows for the elision of the paired register (dt2) for the LDRD and STRD mnemonics. Add support for this in the assembly parser. Canonicalise the usage during the instruction parsing from the specified version. llvm-svn: 198915	2014-01-10 04:38:35 +00:00
Saleem Abdulrasool	5bfefb6a8f	ARM IAS: support implicit immediate 0s for {LD,ST}R{B,}T The ARM ARM indicates the mnemonics as follows: ldrbt{<c>}{<q>} <Rt>, [<Rn>], {, #+/-<imm>} ldrt{<c>}{<q>} <Rt>, [<Rn>] {, #+/-<imm>} strbt{<c>}{<q>} <Rt>, [<Rn>] {, #<imm>} strt{<c>}{<q>} <Rt>, [<Rn>] {, #+/-<imm>} This improves the parser to deal with the implicit immediate 0 for the mnemonics as per the specification. Thanks to Joerg Sonnenberger for the tests! llvm-svn: 198914	2014-01-10 04:38:31 +00:00
Venkatraman Govindaraju	ad40dfcb4b	[Sparc] Emit retl/ret instead of jmp instruction. It improves the readability of the assembly generated. llvm-svn: 198910	2014-01-10 02:55:27 +00:00
Venkatraman Govindaraju	0d288d3105	[Sparc] Add support for parsing jmpl instruction and make indirect call and jmp instructions as aliases to jmpl. llvm-svn: 198909	2014-01-10 01:48:17 +00:00
David Blaikie	15ed5ebfc5	Revert "Revert r198851, "Prototype of skeleton type units for fission"" This reverts commit r198865 which reverts r198851. ASan identified a use-of-uninitialized of the DwarfTypeUnit::Ty variable in skeleton type units. llvm-svn: 198908	2014-01-10 01:38:41 +00:00
Kevin Enderby	9bd296ab55	Fix a bug with the ARM thumb2 CBNZ and CBNZ instructions that branch to the next instruction. This can not be encoded but can be turned into a NOP. rdar://15062072 llvm-svn: 198904	2014-01-10 00:43:32 +00:00
Chandler Carruth	85dac69ba1	Update the developer policy to more clearly spell out the steps for contributors to submit patches to the LLVM project. Thanks to Danny, Chris, Alp, and others for reviewing. llvm-svn: 198901	2014-01-10 00:08:34 +00:00
Justin Bogner	a3570186b2	Bitcode: Fix a typo in an assert llvm-svn: 198894	2014-01-09 22:02:05 +00:00
Venkatraman Govindaraju	6ff62cc539	[Sparc] Multiclass for loads/stores. No functionality change intended. llvm-svn: 198893	2014-01-09 21:49:18 +00:00
Evan Cheng	aa37d35d78	Clean up an inconsistency in v7s feature default. llvm-svn: 198889	2014-01-09 20:24:00 +00:00
Rafael Espindola	cd56deb6bf	Add a unit test for the copy constructor. I would not normally add tests like these, but the copy constructor is not used at all in our codebase with c++11, so having this tests might prevent breaking the c++03 build again. llvm-svn: 198886	2014-01-09 19:47:39 +00:00
Alp Toker	2a2a354ee9	Revert "Disable LeakSanitizer in TableGen binaries, see PR18325" To declare or define reserved identifers is undefined behaviour in standard C++. This needs to be addressed in compiler-rt before it can be used in LLVM. See the list discussion for details. This reverts commit r198858. llvm-svn: 198884	2014-01-09 19:40:55 +00:00
Nadav Rotem	032b39e40d	Re-remove dead code. This reverts r198854. llvm-svn: 198879	2014-01-09 19:22:07 +00:00
Rafael Espindola	a24f5cf273	Update example to be more idiomatic. llvm-svn: 198872	2014-01-09 14:40:43 +00:00
NAKAMURA Takumi	c5bf572993	Revert r198851, "Prototype of skeleton type units for fission" It caused undefined behavior. DwarfTypeUnit::Ty might not be initialized properly, I guess. llvm-svn: 198865	2014-01-09 13:08:00 +00:00
Stepan Dyatkovskiy	431993b57b	Fixed old typo in ScalarEvolution, that caused wrong SCEVs zext operation. Detailed description is here: http://llvm.org/bugs/show_bug.cgi?id=18000#c16 For participation in bugfix process special thanks to David Wiberg. llvm-svn: 198863	2014-01-09 12:26:12 +00:00
Richard Sandiford	3875cb60f3	[SystemZ] Fix RNSBG bug introduced by r197802 The zext handling added in r197802 wasn't right for RNSBG. This patch restricts it to ROSBG, RXSBG and RISBG. (The tests for RISBG were added in r197802 since RISBG was the motivating example.) llvm-svn: 198862	2014-01-09 11:28:53 +00:00
Richard Sandiford	15cfc1c33c	Handle masked rotate amounts At the moment we expect rotates to have the form: (or (shl X, Y), (shr X, Z)) where Y == bitsize(X) - Z or Z == bitsize(X) - Y. This form means that the (or ...) is undefined for Y == 0 or Z == 0. This undefinedness can be avoided by using Y == (C * bitsize(X) - Z) & (bitsize(X) - 1) or Z == (C * bitsize(X) - Y) & (bitsize(X) - 1) for any integer C (including 0, the most natural choice). llvm-svn: 198861	2014-01-09 10:56:42 +00:00
Richard Sandiford	0f264db3c6	Match the InstCombine form of rotates by X+C InstCombine converts (sub 32, (add X, C)) into (sub 32-C, X), so a rotate left of a 32-bit Y by X+C could appear as either: (or (shl Y, (add X, C)), (shr Y, (sub 32, (add X, C)))) without InstCombine or: (or (shl Y, (add X, C)), (shr Y, (sub 32-C, X))) with it. We already matched the first form. This patch handles the second too. llvm-svn: 198860	2014-01-09 10:49:40 +00:00
Kostya Serebryany	bc60254543	Disable LeakSanitizer in TableGen binaries, see PR18325 llvm-svn: 198858	2014-01-09 09:26:26 +00:00
Nadav Rotem	d677b310e8	Revert r198819 - "Remove dead code." llvm-svn: 198854	2014-01-09 07:50:34 +00:00
Lang Hames	f9dd8fdc5e	Fix accidental use of the exotic "std::string::back()" method. Turns out it's new in C++11. llvm-svn: 198853	2014-01-09 05:29:59 +00:00
Lang Hames	1ddecc0777	Add an "-object-cache-dir=<string>" option to LLI. This option specifies the root path to which object files managed by the LLIObjectCache instance should be written. This option defaults to "", in which case objects are cached in the same directory as the bitcode they are derived from. The load-object-a.ll test has been rewritten to use this option to support testing in environments where the test directory is not writable. llvm-svn: 198852	2014-01-09 05:24:05 +00:00
David Blaikie	a588365df6	Prototype of skeleton type units for fission llvm-svn: 198851	2014-01-09 05:08:28 +00:00
David Blaikie	92d9d627af	llvm-dwarfdump: type unit dwo support llvm-svn: 198850	2014-01-09 05:08:24 +00:00
Saleem Abdulrasool	5b060a92d6	llvm-readobj: address review comments for ARM EHABI printing Rename bytecode to opcodes to make it more clear. Change an impossible case to llvm_unreachable instead. Avoid allocation of a buffer by modifying the PrintOpcodes iteration. llvm-svn: 198848	2014-01-09 04:31:18 +00:00
Saleem Abdulrasool	b7b8a8f46d	llvm-readobj: fix endianness Explicitly handle endianness to ensure that bytes are read properly on big-endian systems. llvm-svn: 198847	2014-01-09 04:31:14 +00:00
David Blaikie	38fe6342f6	DwarfDebug: Refactor out common skeleton construction code to be reused for type unit skeletons. llvm-svn: 198846	2014-01-09 04:28:46 +00:00
Richard Smith	c198b450cc	Extend llvm::AlignedCharArrayUnion to support up to 10 arguments, as required by Clang's APValue. llvm-svn: 198844	2014-01-09 03:28:55 +00:00
David Blaikie	b334e94492	Reformatting for r198842 llvm-svn: 198843	2014-01-09 03:24:13 +00:00
David Blaikie	f645f963ff	DwarfUnit: Rename "Node" to "CUNode" and propagate it through DwarfTypeUnit as well. Since we'll now also need the split dwarf file name along with the language in DwarfTypeUnits, just use the whole DICompileUnit rather than explicitly handling each field needed. llvm-svn: 198842	2014-01-09 03:23:41 +00:00
David Blaikie	7480ae6e19	Revert "DwarfUnit: Move the DICompileUnit Node to the DwarfCompileUnit only" This reverts commit r198830. Decided to go a different way with this... llvm-svn: 198841	2014-01-09 03:03:27 +00:00
Chandler Carruth	12e9d2b5c1	[PM] Rename this source file to something a bit more generic before I add support for the new pass manager to it. llvm-svn: 198838	2014-01-09 02:39:45 +00:00
Chandler Carruth	d48cdbf0c3	Put the functionality for printing a value to a raw_ostream as an operand into the Value interface just like the core print method is. That gives a more conistent organization to the IR printing interfaces -- they are all attached to the IR objects themselves. Also, update all the users. This removes the 'Writer.h' header which contained only a single function declaration. llvm-svn: 198836	2014-01-09 02:29:41 +00:00
David Blaikie	08badfd2ba	DwarfUnit: Move the DICompileUnit Node to the DwarfCompileUnit only It's unused in DwarfTypeUnit, as is expected. llvm-svn: 198830	2014-01-09 01:20:14 +00:00
Eric Christopher	d7ed36b87c	Remove the test for endianness in configure.ac and regenerate. llvm-svn: 198825	2014-01-09 01:09:57 +00:00
Lang Hames	eecd2dc954	Replace fstream use with raw_fd_ostream. llvm-svn: 198821	2014-01-09 00:47:54 +00:00
Rafael Espindola	5e10aaeb18	Remove dead code. llvm-svn: 198819	2014-01-09 00:32:54 +00:00
Rafael Espindola	d2d23ed04a	Use the existing typedef to avoid forming a reference to a reference. llvm-svn: 198817	2014-01-09 00:25:25 +00:00
Andrew Trick	32e1be7bd0	llvm.experimental.stackmap: fix encoding of large constants. In the stackmap format we advertise the constant field as signed. However, we were determining whether to promote to a 64-bit constant pool based on an unsigned comparison. This fix allows -1 to be encoded as a small constant. llvm-svn: 198816	2014-01-09 00:22:31 +00:00
David Blaikie	66865d6d94	Simplify/collapse/denest a conditions/blocks. llvm-svn: 198813	2014-01-09 00:13:35 +00:00
David Blaikie	622dce4194	llvm-dwarfdump: reorder dwo sections to immediately proceed their non-dwo equivalents This makes it easier to write a test that's mostly shared between fission and non-fission (using FileCheck's multiple prefix support). llvm-svn: 198806	2014-01-08 23:29:59 +00:00
Rafael Espindola	3c426afdc6	Fix the C++03 build. With c++11 we never instantiate the copy constructor. llvm-svn: 198803	2014-01-08 22:27:04 +00:00
Rafael Espindola	1c704b4a2e	Use getError and remove the error_code operator. llvm-svn: 198799	2014-01-08 22:03:39 +00:00
Chandler Carruth	98f3de8880	Remove vestigal bits of MC from the mangler. It no longer uses this, and having the include could cause weird layering problems between the IR and MC libraries. llvm-svn: 198796	2014-01-08 21:59:22 +00:00
Hal Finkel	2150e3a743	Conservatively handle multiple MMOs in MIsNeedChainEdge MIsNeedChainEdge, which is used by -enable-aa-sched-mi (AA in misched), had an llvm_unreachable when -enable-aa-sched-mi is enabled and we reach an instruction with multiple MMOs. Instead, return a conservative answer. This allows testing -enable-aa-sched-mi on x86. Also, this moves the check above the isUnsafeMemoryObject checks. isUnsafeMemoryObject is currently correct only for instructions with one MMO (as noted in the comment in isUnsafeMemoryObject): // We purposefully do no check for hasOneMemOperand() here // in hope to trigger an assert downstream in order to // finish implementation. The problem with this is that, had the candidate edge passed the "!MIa->mayStore() && !MIb->mayStore()" check, the hoped-for assert would never happen (which could, in theory, lead to incorrect behavior if one of these secondary MMOs was volatile, for example). llvm-svn: 198795	2014-01-08 21:52:02 +00:00
Matt Arsenault	a64ee177a0	Move declaration of variables down to first use. llvm-svn: 198794	2014-01-08 21:47:14 +00:00
Matt Arsenault	d13105d793	Add missing definitions of key_type and value_type to DenseSet. This matches std::set and allows using DenseSet with the functions in SetOperations.h llvm-svn: 198793	2014-01-08 21:38:04 +00:00
Rafael Espindola	5d16475b31	Add get and getError methods to ErrorOr. ErrorOr is modeled after boost::optional which has a get method. llvm-svn: 198792	2014-01-08 21:17:09 +00:00
Ana Pazos	cfd2ca5826	[AArch64][NEON] Added UXTL and UXTL2 instruction aliases llvm-svn: 198791	2014-01-08 21:02:13 +00:00
Roman Divacky	fb4d390766	Force emit a relocation for @gnu_indirect_function symbols so that the indirect resolution works. llvm-svn: 198780	2014-01-08 18:50:32 +00:00
David Woodhouse	df1e1960ac	[x86] Remove OpSize16 flag from MOV32r0 It's not a real instruction any more and doesn't need encoding information. llvm-svn: 198778	2014-01-08 18:38:26 +00:00
Andrea Di Biagio	23df4e4a2d	Teach the DAGCombiner how to fold 'vselect' dag nodes according to the following two rules: 1) fold (vselect (build_vector AllOnes), A, B) -> A 2) fold (vselect (build_vector AllZeros), A, B) -> B llvm-svn: 198777	2014-01-08 18:33:04 +00:00
Rafael Espindola	ac8c55222e	Add missing rename from the previous commit. No idea how this was compiling locally. Found by the bots. llvm-svn: 198775	2014-01-08 17:56:46 +00:00
Rafael Espindola	3ccb3f201e	Rename get to getStorage and getError to getErrorStorage. These private functions return pointers to the internal storage. llvm-svn: 198774	2014-01-08 17:43:26 +00:00
Lang Hames	7b6f99ff0d	Add missing test case for r198737. llvm-svn: 198772	2014-01-08 16:31:16 +00:00
Nico Rieck	ea623c6f10	Remove mention of old deleted test scripts from testing guide llvm-svn: 198771	2014-01-08 16:30:03 +00:00
Richard Sandiford	95c864d9bd	[DAGCombiner] Factor duplicated rotate code into a separate function No functional change intended. llvm-svn: 198768	2014-01-08 15:40:47 +00:00
Alp Toker	9e628916f6	lit: Provide file location in cfg error messages Python doesn't do a good job at diagnosing string exec() so use execfile() where available. This should be a timesaver when trying to get to the bottom of build bot failures. Before: File "llvm/utils/lit/lit/TestingConfig.py", line 93, in load_from_path exec("exec data in cfg_globals") File "<string>", line 1, in <module> File "<string>", line 194, in <module> NameError: name 'typo' is not defined After: File "llvm/utils/lit/lit/TestingConfig.py", line 95, in load_from_path execfile(path, cfg_globals) File "clang/test/lit.cfg", line 194, in <module> typo ^~~~ NameError: name 'typo' is not defined llvm-svn: 198766	2014-01-08 14:20:59 +00:00
David Woodhouse	adfc885997	[x86] Support R_386_PC8, R_386_PC16 and R_X86_64_PC8 llvm-svn: 198763	2014-01-08 12:58:40 +00:00
David Woodhouse	9785f512cb	[x86] Add JMP_2 and other 16-bit PC-relative branch instructions Mark them as requiring 16-bit mode for now, since we don't yet have relaxation support for FK_Data_2. llvm-svn: 198762	2014-01-08 12:58:36 +00:00
David Woodhouse	8bceb5d217	[x86] Do not relax PUSHi16 to PUSHi32 (PR18414) They do different things to %esp, so they are not equivalent. Rename PUSHi8 to PUSH32i8 and add the missing PUSH16i8. llvm-svn: 198761	2014-01-08 12:58:32 +00:00
David Woodhouse	6dbda4415a	[x86] Make AsmParser validate registers for memory operands a bit better We can't do a perfect job here. We have to allow (%dx) even in 64-bit mode, for example, because it might be used for an unofficial form of the in/out instructions. We actually want to do a better job of validation later. Perhaps instead of doing it where we are at the moment. But for now, doing what validation we can do in the place that the code already has its validation, is an improvement. llvm-svn: 198760	2014-01-08 12:58:28 +00:00
David Woodhouse	32da3c8f3b	[x86] Fix MOV8ao8 et al for 16-bit mode, fix up disassembler to understand It seems there is no separate instruction class for having AdSize and OpSize bits set, which is required in order to disambiguate between all these instructions. So add that to the disassembler. Hm, perhaps we do need an AdSize16 bit after all? llvm-svn: 198759	2014-01-08 12:58:24 +00:00
David Woodhouse	374243a290	[x86] Use 16-bit addressing where possible in 16-bit mode Where "where possible" means that it's an immediate value and it's below 0x10000. In fact GAS will either truncate or error with larger values, and will insist on using the addr32 prefix to get 32-bit addressing. So perhaps we should do that, in a later patch. llvm-svn: 198758	2014-01-08 12:58:18 +00:00
David Woodhouse	84ed54f91e	[x86] Fix JCXZ,JECXZ_32 for 16-bit mode JCXZ should have the 0x67 prefix only if we're in 32-bit mode, so make that appropriately conditional. And JECXZ needs the prefix instead. llvm-svn: 198757	2014-01-08 12:58:12 +00:00
David Woodhouse	79dd505ce1	[x86] Disambiguate RET[QL] and fix aliases for 16-bit mode I couldn't see how to do this sanely without splitting RETQ from RETL. Eric says: "sad about the inability to roundtrip them now, but...". I have no idea what that means, but perhaps it wants preserving in the commit comment. llvm-svn: 198756	2014-01-08 12:58:07 +00:00
David Woodhouse	c178fbe2a2	[x86] Disambiguate [LS][IG]DT{32,64}m and add 16-bit versions, fix aliases llvm-svn: 198755	2014-01-08 12:57:55 +00:00
David Woodhouse	fd46016e7f	[x86] Add JMP16[rm],CALL16[rm] instructions, and fix up aliases llvm-svn: 198754	2014-01-08 12:57:49 +00:00
David Woodhouse	13574a7517	[x86] Add PUSHA16,POPA16 instructions, and fix aliases for 16-bit mode llvm-svn: 198753	2014-01-08 12:57:45 +00:00
David Woodhouse	956965ca69	[x86] Add OpSize16 to instructions that need it This fixes the bulk of 16-bit output, and the corresponding test case x86-16.s now looks mostly like the x86-32.s test case that it was originally based on. A few irrelevant instructions have been dropped, and there are still some corner cases to be fixed in subsequent patches. llvm-svn: 198752	2014-01-08 12:57:40 +00:00
Rafael Espindola	4daaa8e8f2	Use -std=gnu99 in tools/llvm-c-test/CMakeLists.txt With a current mingw (gcc 4.8.1) it looks like we hit some variation of http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40278 The end result is that off_t is not defined and the build fails without this patch. llvm-svn: 198749	2014-01-08 11:48:19 +00:00

... 2 3 4 5 6 ...

99279 Commits