llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	7cb710d58c	Implemented public interface for modifying registered (not positional or sink options) command line options at runtime. Patch by Dan Liew! llvm-svn: 181254	2013-05-06 21:56:35 +00:00
Andrew Trick	0537a98878	Support command line option categories. Patch by Dan Liew! llvm-svn: 181253	2013-05-06 21:56:23 +00:00
Krzysztof Parzyszek	59df52c585	Cleanup of the HexagonTargetMachine setup. llvm-svn: 181250	2013-05-06 21:25:45 +00:00
David Majnemer	70f286d95f	InstCombine: (X ^ signbit) + C -> X + (signbit ^ C) llvm-svn: 181249	2013-05-06 21:21:31 +00:00
Eric Christopher	0cdce8351a	Hoist boundary condition out of loop header. llvm-svn: 181248	2013-05-06 21:19:44 +00:00
Eric Christopher	34ea33680f	Untabify. llvm-svn: 181247	2013-05-06 21:19:41 +00:00
Jyotsna Verma	84c471029b	Hexagon: Add multiclass/encoding bits for the New-Value Jump instructions. llvm-svn: 181235	2013-05-06 18:49:23 +00:00
Krzysztof Parzyszek	d50074712f	Make references to HexagonTargetMachine "const". llvm-svn: 181233	2013-05-06 18:38:37 +00:00
Andrew Trick	9c72b071fe	Rotate multi-exit loops even if the latch was simplified. Test case by Michele Scandale! Fixes PR10293: Load not hoisted out of loop with multiple exits. There are few regressions with this patch, now tracked by rdar:13817079, and a roughly equal number of improvements. The regressions are almost certainly back luck because LoopRotate has very little idea of whether rotation is profitable. Doing better requires a more comprehensive solution. This checkin is a quick fix that lacks generality (PR10293 has a counter-example). But it trivially fixes the case in PR10293 without interfering with other cases, and it does satify the criteria that LoopRotate is a loop canonicalization pass that should avoid heuristics and special cases. I can think of two approaches that would probably be better in the long run. Ultimately they may both make sense. (1) LoopRotate should check that the current header would make a good loop guard, and that the loop does not already has a sufficient guard. The artifical SimplifiedLoopLatch check would be unnecessary, and the design would be more general and canonical. Two difficulties: - We need a strong guarantee that we won't endlessly rotate, so the analysis would need to be precise in order to avoid the SimplifiedLoopLatch precondition. - Analysis like this are usually based on SCEV, which we don't want to rely on. (2) Rotate on-demand in late loop passes. This could even be done by shoving the loop back on the queue after the optimization that needs it. This could work well when we find LICM opportunities in multi-branch loops. This requires some work, and it doesn't really solve the problem of SCEV wanting a loop guard before the analysis. llvm-svn: 181230	2013-05-06 17:58:18 +00:00
Tom Stellard	d93cede8e4	R600: Remove dead code from the CodeEmitter v2 v2: - Replace switch statement with TSFlags query Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181229	2013-05-06 17:50:57 +00:00
Tom Stellard	043de4c5af	R600: Emit config values in register / value pairs Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181228	2013-05-06 17:50:51 +00:00
Eric Christopher	6c6de847a8	Remove unnecessary instance variable and rework logic accordingly. llvm-svn: 181227	2013-05-06 17:50:50 +00:00
Eric Christopher	f0303324be	Grammar. llvm-svn: 181226	2013-05-06 17:50:46 +00:00
Tom Stellard	cfe2ef8fea	R600: Stop emitting the instruction type byte before each instruction Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181225	2013-05-06 17:50:44 +00:00
Eric Christopher	92f3c0b49c	Don't emit .dwo sections unless they exist. llvm-svn: 181224	2013-05-06 17:50:42 +00:00
Tom Stellard	dbbcaf31b6	R600: Emit ISA for CALL_FS_* instructions Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181223	2013-05-06 17:50:26 +00:00
Ulrich Weigand	e7c6dfeb4b	[SystemZ] Update non-pic DWARF encodings As pointed out by Rafael Espindola, we should match the DWARF encodings produced by GCC in both pic and non-pic modes. This was not the case for the non-pic case. This patch changes all DWARF encodings to DW_EH_PE_absptr for the non-pic case, just like GCC does. The test case is updated to check for both variants. llvm-svn: 181222	2013-05-06 17:28:30 +00:00
Adhemerval Zanella	e8bd03da5c	PowerPC: Fix unimplemented relocation on ppc64 This patch handles the R_PPC64_REL64 relocation type for powerpc64 for mcjit. llvm-svn: 181220	2013-05-06 17:21:23 +00:00
Jean-Luc Duprat	3e4fc3ef24	Provide InstCombines for the following 3 cases: A * (1 - (uitofp i1 C)) -> select C, 0, A B * (uitofp i1 C) -> select C, B, 0 select C, 0, A + select C, B, 0 -> select C, B, A These come up in code that has been hand-optimized from a select to a linear blend, on platforms where that may have mattered. We want to undo such changes with the following transform: A(1 - uitofp i1 C) + B(uitofp i1 C) -> select C, A, B llvm-svn: 181216	2013-05-06 16:55:50 +00:00
Ulrich Weigand	5f613dfd1f	[SystemZ] Add back end This adds the actual lib/Target/SystemZ target files necessary to implement the SystemZ target. Note that at this point, the target cannot yet be built since the configure bits are missing. Those will be provided shortly by a follow-on patch. This version of the patch incorporates feedback from reviews by Chris Lattner and Anton Korobeynikov. Thanks to all reviewers! Patch by Richard Sandiford. llvm-svn: 181203	2013-05-06 16:15:19 +00:00
Ulrich Weigand	0213e7fcb8	[SystemZ] Define DWARF encoding This is another patch in preparation for adding the SystemZ target. It defines the appropriate values for DWARF encodings; the intent is to be compatible with what GCC currently does on the target. Patch by Richard Sandiford. llvm-svn: 181201	2013-05-06 16:11:12 +00:00
Ulrich Weigand	509c240ce5	[PowerPC] Fix memory corruption in AsmParser As pointed out by Evgeniy Stepanov, assigning a std::string temporary to a StringRef is not a good idea. Rework MatchRegisterName to avoid using the .lower routine. llvm-svn: 181192	2013-05-06 11:16:57 +00:00
Michael Kuperstein	ac868757d0	Fix slightly too aggressive conact_vector optimization. (Would sometimes optimize away conacts used to extend a vector with undef values) llvm-svn: 181186	2013-05-06 08:06:13 +00:00
Nadav Rotem	632b25b743	Update the comment to mention that we use TTI. llvm-svn: 181178	2013-05-06 03:06:36 +00:00
Nadav Rotem	c70ef4e93c	Revert r164763 because it introduces new shuffles. Thanks Nick Lewycky for pointing this out. llvm-svn: 181177	2013-05-06 02:39:09 +00:00
Matt Arsenault	c23753a53e	Fix unchecked uses of DominatorTree in MemoryDependenceAnalysis. Use unknown results for places where it would be needed llvm-svn: 181176	2013-05-06 02:07:24 +00:00
Rafael Espindola	c229a4fff4	Fix const merging when an alias of a const is llvm.used. We used to disable constant merging not only if a constant is llvm.used, but also if an alias of a constant is llvm.used. This change fixes that. llvm-svn: 181175	2013-05-06 01:48:55 +00:00
Rafael Espindola	fa5942bc2c	Add EH support to the MCJIT. This gets exception handling working on ELF and Macho (x86-64 at least). Other than the EH frame registration, this patch also implements support for GOT relocations which are used to locate the personality function on MachO. llvm-svn: 181167	2013-05-05 20:43:10 +00:00
Evan Cheng	9fad6352d4	ARM AnalyzeBranch should conservatively return true when it sees a predicated indirect branch at the end of the BB. Otherwise if-converter, branch folding pass may incorrectly update its successor info if it consider BB as fallthrough to the next BB. rdar://13782395 llvm-svn: 181161	2013-05-05 18:06:32 +00:00
Evan Cheng	8b8e8d88ff	Teach if-converter to avoid removing BBs whose addresses are takne. rdar://13782395 llvm-svn: 181160	2013-05-05 18:03:49 +00:00
Benjamin Kramer	3e3f2a4b8d	LoopVectorize: Print values instead of pointers in debug output. llvm-svn: 181157	2013-05-05 14:54:52 +00:00
Richard Osborne	4498bd352f	[XCore] Add LDAPB instructions. With the change the disassembler now supports the XCore ISA in its entirety. llvm-svn: 181155	2013-05-05 13:36:53 +00:00
Richard Osborne	e41cdbd3aa	[XCore] Update LDAP to use pcrel_imm. llvm-svn: 181154	2013-05-05 13:33:10 +00:00
Richard Osborne	8bdfdf717a	[XCore] Rename calltarget -> pcrel_imm. No functionality change. llvm-svn: 181153	2013-05-05 13:29:02 +00:00
Richard Osborne	4d3514ee94	[XCore] Add BLRB instructions. llvm-svn: 181152	2013-05-05 13:24:16 +00:00
Richard Osborne	53a04fe2b4	[XCore] Remove '-' from back branch asm syntax. Instead operands are treated as negative immediates where the sign bit is implicit in the instruction encoding. llvm-svn: 181151	2013-05-05 13:20:22 +00:00
Benjamin Kramer	391f5a6e21	InlineSpiller: Remove quadratic behavior. No functionality change. llvm-svn: 181149	2013-05-05 11:29:14 +00:00
Stepan Dyatkovskiy	8c02c98259	For ARM backend, fixed "byval" attribute support. Now even the small structures could be passed within byval (small enough to be stored in GPRs). In regression tests next function prototypes are checked: PR15293: %artz = type { i32 } define void @foo(%artz* byval %s) define void @foo2(%artz* byval %s, i32 %p, %artz* byval %s2) foo: "s" stored in R0 foo2: "s" stored in R0, "s2" stored in R2. Next AAPCS rules are checked: 5.5 Parameters Passing, C.4 and C.5, "ParamSize" is parameter size in 32bit words: -- NSAA != 0, NCRN < R4 and NCRN+ParamSize > R4. Parameter should be sent to the stack; NCRN := R4. -- NSAA != 0, and NCRN < R4, NCRN+ParamSize < R4. Parameter stored in GPRs; NCRN += ParamSize. llvm-svn: 181148	2013-05-05 07:48:36 +00:00
David Majnemer	66fb70de38	Remove a recently redundant transform from X86ISelLowering. X86ISelLowering has support to treat: (icmp ne (and (xor %flags, -1), (shl 1, flag)), 0) as if it were actually: (icmp eq (and %flags, (shl 1, flag)), 0) However, r179386 has code at the InstCombine level to handle this. llvm-svn: 181145	2013-05-05 02:00:10 +00:00
Arnold Schwaighofer	d96e427eac	LoopVectorize: Add support for floating point min/max reductions Add support for min/max reductions when "no-nans-float-math" is enabled. This allows us to assume we have ordered floating point math and treat ordered and unordered predicates equally. radar://13723044 llvm-svn: 181144	2013-05-05 01:54:48 +00:00
Arnold Schwaighofer	f5183729db	LoopVectorizer: Cleanup of miminimum/maximum pattern match code No need for setting the operands. The pointers are going to be bound by the matcher. radar://13723044 llvm-svn: 181142	2013-05-05 01:54:44 +00:00
Arnold Schwaighofer	a670a0a3aa	LoopVectorize: We don't need an identity element for min/max reductions We can just use the initial element that feeds the reduction. max(max(x, y), z) == max(max(x,y), max(x,z)) radar://13723044 llvm-svn: 181141	2013-05-05 01:54:42 +00:00
Dmitri Gribenko	3238fb7595	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Nadav Rotem	d61dcfc4fd	whitespace llvm-svn: 181137	2013-05-04 23:27:32 +00:00
Nadav Rotem	42932bdcd0	Fix an odd comment. llvm-svn: 181136	2013-05-04 23:24:56 +00:00
Tim Northover	7b55b97dba	AArch64: enable MCJIT and tests now that everything passes. This removes dire warnings about AArch64 being unsupported and enables the tests when appropriate on this platform. llvm-svn: 181135	2013-05-04 20:14:22 +00:00
Tim Northover	b23d8dbbac	AArch64: implement 64-bit absolute relocation in MCJIT This is about the simplest relocation, but surprisingly rare in actual code. It occurs in (for example) the MCJIT test test-ptr-reloc.ll. llvm-svn: 181134	2013-05-04 20:14:14 +00:00
Tim Northover	37cde9755d	AArch64: add stubs to support long function calls on MCJIT As with global accesses, external functions could exist anywhere in memory. Therefore the stub must create a complete 64-bit address. This patch implements the fragment as (roughly): movz x16, #:abs_g3:somefunc movk x16, #:abs_g2_nc:somefunc movk x16, #:abs_g1_nc:somefunc movk x16, #:abs_g0_nc:somefunc br x16 In principle we could save 4 bytes by using a literal-load instead, but it is unclear that would be more efficient and can only be tested when real hardware is readily available. This allows (for example) the MCJIT test 2003-05-07-ArgumentTest to pass on AArch64. llvm-svn: 181133	2013-05-04 20:14:09 +00:00
Tim Northover	4d01c1e0e6	AArch64: implement relocations for global access The large memory model (default and main viable for JIT) emits addresses in need of relocation as movz x0, #:abs_g3:somewhere movk x0, #:abs_g2_nc:somewhere movk x0, #:abs_g1_nc:somewhere movk x0, #:abs_g0_nc:somewhere To support this we must implement those four relocations in the dynamic loader. This allows (for example) the test-global.ll MCJIT test to pass on AArch64. llvm-svn: 181132	2013-05-04 20:14:04 +00:00
Tim Northover	fa1b2f85da	AArch64: implement first relocation required for MCJIT R_AARCH64_PCREL32 is present in even trivial .eh_frame sections and so is required to compile any function without the "nounwind" attribute. This change implements very basic infrastructure in the RuntimeDyldELF file and allows (for example) the test-shift.ll MCJIT test to pass on AArch64. llvm-svn: 181131	2013-05-04 20:13:59 +00:00
Tim Northover	a958a57081	Build system changes to enable MCJIT on AArch64 These changes just allow AArch64 to take part in the MCJIT world when built correctly. llvm-svn: 181130	2013-05-04 20:13:52 +00:00
Tim Northover	6c26b327ef	AArch64: use __clear_cache under GCCish environments AArch64 is going to need some kind of cache-invalidation in order to successfully JIT since it has a weak memory-model. This is provided by a __clear_cache builtin in libgcc, which acts very much like the 32-bit ARM equivalent (on platforms where it exists). llvm-svn: 181129	2013-05-04 18:52:44 +00:00
Richard Osborne	2f75a0c0d8	Fix buildbot failure on 64 bit linux due to std::max() having different operand types. llvm-svn: 181128	2013-05-04 17:41:01 +00:00
Richard Osborne	0a7abb655b	[XCore] Remove unused operand type. llvm-svn: 181127	2013-05-04 17:30:05 +00:00
Richard Osborne	54ff84a8f8	[XCore] Make use of the target independent global address offset folding. This let us to remove some custom code that matched constant offsets from globals at instruction selection time as a special addressing mode. No intended functionality change. llvm-svn: 181126	2013-05-04 17:24:33 +00:00
Richard Osborne	a282fa5b60	[XCore] Simplify code that checks for an aligned base plus a constant. The code now makes use of ComputeMaskedBits, SelectionDAG::isBaseWithConstantOffset and TargetLowering::isGAPlusOffset where appropriate reducing the amount of logic needed in XCoreISelLowering. No intended functionality change. llvm-svn: 181125	2013-05-04 17:17:10 +00:00
Richard Osborne	8bbea9cde7	[XCore] Move lowering of thread local storage to a separate pass. Thread local storage is not supported by the XMOS linker so we handle thread local variables by lowering the variable to an array of n elements (where n is the number of hardware threads per core, currently 8 for all XMOS devices) indexed by the the current thread ID. Previously this lowering was spread across the XCoreISelLowering and the XCoreAsmPrinter classes. Moving this to a separate pass should be much cleaner. llvm-svn: 181124	2013-05-04 17:01:55 +00:00
Tim Northover	85dcbde239	AArch64: assert code model is small for TLS accesses Supporting TLS in the large memory model is rather difficult at the moment, so make sure no-one gets into difficulties by mistake. llvm-svn: 181121	2013-05-04 16:54:11 +00:00
Tim Northover	885698a25c	AArch64: support literal pool access in large memory model. llvm-svn: 181120	2013-05-04 16:54:07 +00:00
Tim Northover	8ff187df5f	AArch64: support large code model for jump-tables llvm-svn: 181119	2013-05-04 16:54:00 +00:00
Tim Northover	9fc1cddb21	AArch64: implement support for blockaddress in large code model llvm-svn: 181118	2013-05-04 16:53:53 +00:00
Tim Northover	2dbef3452c	AArch64: implement large code model access to global variables. The MOVZ/MOVK instruction sequence may not be the most efficient (a literal-pool load could be better) but adding that would require reinstating the ConstantIslands pass. For now the sequence is correct, and that's enough. Beware, as of commit GNU ld does not appear to support the relocations needed for this. Its primary purpose (for now) will be to support JITed code, since in that case there is no guarantee of where your code will end up in memory relative to external symbols it references. llvm-svn: 181117	2013-05-04 16:53:46 +00:00
Richard Osborne	df9e574105	[XCore] Use static relocation model by default. This allows us to get get rid of a hack in XCoreTargetObjectFile where the the DataRel* sections were overridden. llvm-svn: 181116	2013-05-04 16:40:58 +00:00
Tim Northover	fee13d1e11	Allow host triple to be correctly overridden in CMake builds The intended semantics mirror autoconf, where the user is able to specify a host triple, but if it's left to the build system then "config.guess" is invoked for the default. This also renames the LLVM_HOSTTRIPLE define to LLVM_HOST_TRIPLE to fit in with the style of the surrounding defines. llvm-svn: 181112	2013-05-04 07:36:23 +00:00
Rafael Espindola	aa9918aac7	Fix a performance bug in the Linker. Now that we hava a convinient place to keep it, remeber the set of identified structs as we merge modules. This speeds up the linking of all the bitcode files in clang with the gold plugin and -plugin-opt=emit-llvm (i.e., link only, no codegen) from 5:25 minutes to 13.6 seconds! Patch by Xiaofei Wan! llvm-svn: 181104	2013-05-04 05:05:18 +00:00
Rafael Espindola	287f18b4b8	Implement Linker::LinkModules with Linker::linkInModule. Flipping which one is the implementation will let us optimize linkInModule. llvm-svn: 181102	2013-05-04 04:08:02 +00:00
Rafael Espindola	3df61b7bef	Now that Linker.cpp is almost empty, merge it into LinkModules.cpp. Also remove unused includes. llvm-svn: 181100	2013-05-04 03:48:37 +00:00
Rafael Espindola	a8023c1c9f	Last batch of cleanups to Linker.h. Update comments, fix * placement, fix method names that are not used in clang, add a linkInModule that takes a Mode and put it in Linker.cpp. llvm-svn: 181099	2013-05-04 03:06:50 +00:00
Rafael Espindola	0229acaa0f	Don't construct or delete a module on the Linker. The linker is now responsible only for actually linking the modules, it is up to the clients to create and destroy them. llvm-svn: 181098	2013-05-04 02:43:00 +00:00
Rafael Espindola	02a071aca8	Don't store the context in the Linker. llvm-svn: 181097	2013-05-04 02:34:41 +00:00
Rafael Espindola	40bbfa1080	Remove unused members and constructor arguments. llvm-svn: 181096	2013-05-04 02:28:57 +00:00
Rafael Espindola	f1d3a37427	Delete dead code from the linker. llvm-svn: 181094	2013-05-04 02:13:18 +00:00
Krzysztof Parzyszek	cd410d04db	Use consistent function names. llvm-svn: 181090	2013-05-04 01:30:49 +00:00
Nick Lewycky	881e9d62e2	Tabs to spaces. No functionality change. llvm-svn: 181082	2013-05-04 01:08:15 +00:00
Amara Emerson	d9104c0359	Revert r181009. llvm-svn: 181079	2013-05-03 23:57:17 +00:00
Reed Kotler	0f2b10eb0d	Remove some uneeded pseudos in the presence of the naked function attribute. llvm-svn: 181072	2013-05-03 23:17:24 +00:00
Ulrich Weigand	b9d5d073d6	[PowerPC] Avoid using '$' in generated assembler code PowerPC assemblers are supposed to support a stand-alone '$' symbol as an alternative of '.' to refer to the current PC. This does not work in the LLVM assembler parser yet. To avoid bootstrap failures when using the LLVM assembler as system assembler, this patch modifies the assembler source code generated by LLVM to avoid using '$' (and simply use '.' instead). llvm-svn: 181054	2013-05-03 19:53:04 +00:00
Ulrich Weigand	2c3a219b76	[PowerPC] Parse platform-specifc variant kinds in AsmParser This patch adds support for PowerPC platform-specific variant kinds in MCSymbolRefExpr::getVariantKindForName, and also adds a test case to verify they are translated to the appropriate fixup type. llvm-svn: 181053	2013-05-03 19:52:35 +00:00
Ulrich Weigand	300b6875fb	[PowerPC] Add some Book II instructions to AsmParser This patch adds a couple of Book II instructions (isync, icbi) to the PowerPC assembler parser. These are needed when bootstrapping clang with the integrated assembler forced on, because they are used in inline asm statements in the code base. The test case adds the full list of Book II storage control instructions, including associated extended mnemonics. Again, those that are not yet supported as marked as FIXME. llvm-svn: 181052	2013-05-03 19:51:09 +00:00
Ulrich Weigand	d839490f16	[PowerPC] Support extended mnemonics in AsmParser This patch adds infrastructure to support extended mnemonics in the PowerPC assembler parser. It adds support specifically for those extended mnemonics that LLVM will itself generate. The test case lists all extended mnemonics according to the PowerPC ISA v2.06 Book I, but marks those not yet supported as FIXME. llvm-svn: 181051	2013-05-03 19:50:27 +00:00
Ulrich Weigand	640192daa8	[PowerPC] Add assembler parser This adds assembler parser support to the PowerPC back end. The parser will run for any powerpc-- and powerpc64-- triples, but was tested only on 64-bit Linux. The supported syntax is intended to be compatible with the GNU assembler. The parser does not yet support all PowerPC instructions, but it does support anything that is generated by LLVM itself. There is no support for testing restricted instruction sets yet, i.e. the parser will always accept any instructions it knows, no matter what feature flags are given. Instruction operands will be checked for validity and errors generated. (Error handling in general could still be improved.) The patch adds a number of test cases to verify instruction and operand encodings. The tests currently cover all instructions from the following PowerPC ISA v2.06 Book I facilities: Branch, Fixed-point, Floating-Point, and Vector. Note that a number of these instructions are not yet supported by the back end; they are marked with FIXME. A number of follow-on check-ins will add extra features. When they are all included, LLVM passes all tests (including bootstrap) when using clang -cc1as as the system assembler. llvm-svn: 181050	2013-05-03 19:49:39 +00:00
Shuxin Yang	637b9bebd4	Decompose GVN::processNonLocalLoad() (about 400 LOC) into smaller helper functions. No function change. This function consists of following steps: 1. Collect dependent memory accesses. 2. Analyze availability. 3. Perform fully redundancy elimination, or 4. Perform PRE, depending on the availability Step 2, 3 and 4 are now moved to three helper routines. llvm-svn: 181047	2013-05-03 19:17:26 +00:00
Akira Hatanaka	e86bd4f652	[mips] Split the DSP control register and define one register for each field of its fields. This removes false dependencies between DSP instructions which access different fields of the the control register. Implicit register operands are added to instructions RDDSP and WRDSP after instruction selection, depending on the value of the mask operand. llvm-svn: 181041	2013-05-03 18:37:49 +00:00
Nadav Rotem	4ce060b3da	LoopVectorizer: Add support for if-conversion of PHINodes with 3+ incoming values. By supporting the vectorization of PHINodes with more than two incoming values we can increase the complexity of nested if statements. We can now vectorize this loop: int foo(int A, int B, int n) { for (int i=0; i < n; i++) { int x = 9; if (A[i] > B[i]) { if (A[i] > 19) { x = 3; } else if (B[i] < 4 ) { x = 4; } else { x = 5; } } A[i] = x; } } llvm-svn: 181037	2013-05-03 17:42:55 +00:00
Tom Stellard	4489b85f2b	R600: Expand vector or, shl, srl, and xor nodes llvm-svn: 181035	2013-05-03 17:21:31 +00:00
Tom Stellard	6a6ecedcb7	R600: BFI_INT is a vector-only instruction llvm-svn: 181034	2013-05-03 17:21:24 +00:00
Tom Stellard	eac65dde30	R600: Add pattern for SHA-256 Ma function This can be optimized using the BFI_INT instruction. llvm-svn: 181033	2013-05-03 17:21:20 +00:00
Tom Stellard	c2516c6e40	R600: Clean up comments in Processors.td llvm-svn: 181032	2013-05-03 17:21:14 +00:00
Tobias Grosser	a7ddc98206	RegionInfo: Do not crash if unreachable block is found llvm-svn: 181025	2013-05-03 15:48:34 +00:00
Richard Sandiford	ca0440826a	[SystemZ] Add MCJIT support Another step towards reinstating the SystemZ backend. I'll commit the configure changes separately (TARGET_HAS_JIT etc.), then commit a patch to enable the MCJIT tests on SystemZ. llvm-svn: 181015	2013-05-03 14:15:35 +00:00
Ulrich Weigand	90c9abdd27	[SystemZ] Support System Z as host architecture The llvm::sys::AddSignalHandler function (as well as related routines) in lib/Support/Unix/Signals.inc currently registers a signal handler routine via "sigaction". When this handler is called due to a SIGSEGV, SIGILL or similar signal, it will show a stack backtrace, deactivate the handler, and then simply return to the operating system. The intent is that the OS will now retry execution at the same location as before, which ought to again trigger the same error condition and cause the same signal to be delivered again. Since the hander is now deactivated, the OS will take its default action (usually, terminate the program and possibly create a core dump). However, this method doesn't work reliably on System Z: With certain signals (namely SIGILL, SIGFPE, and SIGTRAP), the program counter stored by the kernel on the signal stack frame (which is the location where execution will resume) is not the instruction that triggered the fault, but then instruction after it. When the LLVM signal handler simply returns to the kernel, execution will then resume at that address, which will not trigger the problem again, but simply go on and execute potentially unrelated code leading to random errors afterwards. To fix this, the patch simply goes and re-raises the signal in question directly from the handler instead of returning from it. This is done only on System Z and only for those signals that have this particular problem. llvm-svn: 181010	2013-05-03 12:22:11 +00:00
Amara Emerson	2f54d9fe10	Add support for reading ARM ELF build attributes. Build attribute sections can now be read if they exist via ELFObjectFile, and the llvm-readobj tool has been extended with an option to dump this information if requested. Regression tests are also included which exercise these features. Also update the docs with a fixed ARM ABI link and a new link to the Addenda which provides the build attributes specification. llvm-svn: 181009	2013-05-03 11:36:35 +00:00
Richard Sandiford	a238c5e08f	[SystemZ] Add llvm::Triple::systemz First step towards reinstating the SystemZ backend. Tests will be included in the main backend patch. llvm-svn: 181007	2013-05-03 11:05:17 +00:00
Benjamin Kramer	b44c4275d5	X86: Add target description for btver2; make autodetection logic aware of AVX. llvm-svn: 181005	2013-05-03 10:20:08 +00:00
Aaron Ballman	cc958f0050	Unbreaking the non-x86 build bots by protecting the AVX test code properly. llvm-svn: 180992	2013-05-03 02:52:21 +00:00
Aaron Ballman	63fe014888	Correctly testing for AVX support in x86 based off code from Hosts.cpp. llvm-svn: 180991	2013-05-03 02:39:21 +00:00
Reid Kleckner	1c76f155b1	Fix missing include in Hexagon code for Release+Asserts llvm-svn: 180983	2013-05-03 00:54:56 +00:00
John McCall	f73981b213	In MC asm parsing, account for the possibility of whitespace within the "identifier" parsed by the frontend callback by skipping forward until we've consumed a token that ends at the point dictated by the callback. In addition, inform the callback when it's parsing an unevaluated operand (e.g. mov eax, LENGTH A::x) as opposed to an evaluated one (e.g. mov eax, [A::x]). This commit depends on a clang commit. llvm-svn: 180978	2013-05-03 00:15:41 +00:00
Akira Hatanaka	5705f546e5	[mips] Handle reading, writing or copying of ccond field of DSP control register. - Define pseudo instructions which store or load ccond field of the DSP control register. - Emit the pseudos in MipsSEInstrInfo::storeRegToStack and loadRegFromStack. - Expand the pseudos before callee-scan save. - Emit instructions RDDSP or WRDSP to copy between ccond field and GPRs. llvm-svn: 180969	2013-05-02 23:07:05 +00:00
Jyotsna Verma	a841af7556	reverting r180953 llvm-svn: 180964	2013-05-02 22:10:59 +00:00
Vincent Lejeune	ddd43383ef	R600: Signed literals are 64bits wide llvm-svn: 180960	2013-05-02 21:53:03 +00:00
Vincent Lejeune	2a44ae0053	R600: If previous bundle is dot4, PV valid chan is always X llvm-svn: 180959	2013-05-02 21:52:55 +00:00
Vincent Lejeune	b0422e24a9	R600: Improve asmPrint of ALU clause llvm-svn: 180957	2013-05-02 21:52:40 +00:00
Vincent Lejeune	f97af796a9	R600: Prettier asmPrint of Alu llvm-svn: 180956	2013-05-02 21:52:30 +00:00
Jyotsna Verma	7e7c730c4f	Hexagon: Add multiclass/encoding bits for the New-Value Jump instructions. llvm-svn: 180953	2013-05-02 21:21:57 +00:00
Shuxin Yang	af2c3ddf0d	[GV] Remove dead code which is really difficult to decipher. Actually it took me couple of hours trying to make sense of them and only to find they are dead code. I guess the original author used "allSingleSucc" to indicate if there are any critial edge emanating from some blocks, and tried to perform code motion (actually speculation) in the presence of these critical edges; but later on he/she changed mind and decided to perform edge-splitting first. llvm-svn: 180951	2013-05-02 21:14:31 +00:00
Pranav Bhandarkar	7dda912cd7	Hexagon - Add peephole optimizations for zero extends. * lib/Target/Hexagon/HexagonInstrInfo.td: Add patterns to combine a sequence of a pair of i32->i64 extensions followed by a "bitwise or" into COMBINE_rr. * lib/Target/Hexagon/HexagonPeephole.cpp: Copy propagate Rx in the instruction Rp = COMBINE_Ir_V4(0, Rx) to the uses of Rp:subreg_loreg. * test/CodeGen/Hexagon/union-1.ll: New test. * test/CodeGen/Hexagon/combine_ir.ll: Fix test. llvm-svn: 180946	2013-05-02 20:22:51 +00:00
Richard Sandiford	e93c62e87d	[mips] Fix the head Mips16RegisterInfo.cpp comment ...aka a test commit. llvm-svn: 180936	2013-05-02 18:28:03 +00:00
Jyotsna Verma	1d29750b7d	Hexagon: Honor __builtin_expect by using branch probabilities. * lib/Target/Hexagon/HexagonInstrInfo.cpp (GetDotNewPredOp): Given a jump opcode return the right pred.new jump opcode with a taken vs not-taken hint based on branch probabilities provided by the target independent module. * lib/Target/Hexagon/HexagonVLIWPacketizer.cpp: Use the above function. * lib/Target/Hexagon/HexagonNewValueJump.cpp(getNewvalueJumpOpcode): Enhance existing function use branch probabilities like HexagonInstrInfo::GetDotNewPredOp but for New Value (GPR) Jumps. llvm-svn: 180923	2013-05-02 15:39:30 +00:00
Tom Stellard	40b7f1f6c3	R600: Use new tablegen syntax for patterns All but two patterns have been converted to the new syntax. The remaining two patterns will require COPY_TO_REGCLASS instructions, which the VLIW DAG Scheduler cannot handle. llvm-svn: 180922	2013-05-02 15:30:12 +00:00
Tom Stellard	5447ae20ff	R600/SI: remove nonsense select pattern Fortunately this pattern never matched, otherwise we would have generated incorrect code. Signed-off-by: Christian K??nig <christian.koenig@amd.com> llvm-svn: 180921	2013-05-02 15:30:07 +00:00
Michael Liao	06badde1ac	80-col fixup. llvm-svn: 180915	2013-05-02 09:22:04 +00:00
Michael Liao	afafa98fa8	Avoid duplicating logic on frame register selecting when lowering eh_return No functionality change llvm-svn: 180914	2013-05-02 09:18:38 +00:00
Michael Liao	31d39a4a47	Avoid duplicating logic on frame register selecting when lowering frameaddr No functionality change llvm-svn: 180912	2013-05-02 08:21:56 +00:00
Evan Cheng	f85a76f477	TiedTo flag can now be placed on implicit operands. isTwoAddrUse() should look at all of the operands. Previously it was skipping over implicit operands which cause infinite looping when the two-address pass try to reschedule a two-address instruction below the kill of tied operand. I'm unable to come up with a reasonably sized test case. rdar://13747577 llvm-svn: 180906	2013-05-02 02:07:32 +00:00
Akira Hatanaka	ae4a5567e1	[mips] Rename class and functions. Simplify code. No functionality changes. llvm-svn: 180897	2013-05-01 23:41:31 +00:00
Filip Pizlo	85e0d2731b	This exposes more MCJIT options via the C API: CodeModel: It's now possible to create an MCJIT instance with any CodeModel you like. Previously it was only possible to create an MCJIT that used CodeModel::JITDefault. EnableFastISel: It's now possible to turn on the fast instruction selector. The CodeModel option required some trickery. The problem is that previously, we were ensuring future binary compatibility in the MCJITCompilerOptions by mandating that the user bzero's the options struct and passes the sizeof() that he saw; the bindings then bzero the remaining bits. This works great but assumes that the bitwise zero equivalent of any field is a sensible default value. But this is not the case for LLVMCodeModel, or its internal equivalent, llvm::CodeModel::Model. In both of those, the default for a JIT is CodeModel::JITDefault (or LLVMCodeModelJITDefault), which is not bitwise zero. Hence this change introduces LLVMInitializeMCJITCompilerOptions(), which will initialize the user's options struct with defaults. The user will use this in the same way that they would have previously used memset() or bzero(). MCJITCAPITest.cpp illustrates the change, as does the comment in ExecutionEngine.h. llvm-svn: 180893	2013-05-01 22:58:00 +00:00
Bill Wendling	8f2e6feb8e	Revert r180737. The companion patch was reverted, and this is not relevant right now. llvm-svn: 180889	2013-05-01 22:32:08 +00:00
Jyotsna Verma	5ed5181178	Hexagon: Use multiclass for Jump instructions. llvm-svn: 180885	2013-05-01 21:37:34 +00:00
Jyotsna Verma	cd66c0a270	Hexagon: Clear isKill flag on the predicate register in PredicateInstruction function. llvm-svn: 180884	2013-05-01 21:27:30 +00:00
Filip Pizlo	dec20e43c0	This patch breaks up Wrap.h so that it does not have to include all of the things, and renames it to CBindingWrapping.h. I also moved CBindingWrapping.h into Support/. This new file just contains the macros for defining different wrap/unwrap methods. The calls to those macros, as well as any custom wrap/unwrap definitions (like for array of Values for example), are put into corresponding C++ headers. Doing this required some #include surgery, since some .cpp files relied on the fact that including Wrap.h implicitly caused the inclusion of a bunch of other things. This also now means that the C++ headers will include their corresponding C API headers; for example Value.h must include llvm-c/Core.h. I think this is harmless, since the C API headers contain just external function declarations and some C types, so I don't believe there should be any nasty dependency issues here. llvm-svn: 180881	2013-05-01 20:59:00 +00:00
Nadav Rotem	1e211913b5	SROA: Generate selects instead of shuffles when blending values because this is the cannonical form. Shuffles are more difficult to lower and we usually don't touch them, while we do optimize selects more often. llvm-svn: 180875	2013-05-01 19:53:30 +00:00
Chad Rosier	8e4824f350	[inline asm] Return an undef SDValue of the expected value type, rather than report a fatal error. This allows us to continue processing the translation unit. Test case to come on the clang side because we need an inline asm diagnostics handler in place. rdar://13446483 llvm-svn: 180873	2013-05-01 19:49:26 +00:00
Nadav Rotem	e5a2dda372	Optimize away nop CONCAT_VECTOR nodes. Optimize CONCAT_VECTOR nodes that merge EXTRACT_SUBVECTOR values that extract from the same vector. rdar://13402653 PR15866 llvm-svn: 180871	2013-05-01 19:18:51 +00:00
Rafael Espindola	cbf5a7ad06	Now that the underlying issue is fixed, revert r180750 and r180722. The cause of the windows failures was fixed by r180791. Revert to the state after Sabre's original revert. Original message: revert r179735, it has no testcases, and doesn't really make sense. llvm-svn: 180844	2013-05-01 13:07:03 +00:00
Rafael Espindola	817c1d92b4	Put VMOVPQIto64rr in the VRPDI class. Patch by Joshua Magee. llvm-svn: 180842	2013-05-01 13:00:16 +00:00
Aaron Ballman	fd86e16dbd	Fixes a buffer overrun where the allocated buffer wasn't large enough to accommodate the closing quote escape rules in some instances. llvm-svn: 180836	2013-05-01 02:53:14 +00:00
Jim Grosbach	d11584a7f7	Revert "InstCombine: Fold more shuffles of shuffles." This reverts commit r180802 There's ongoing discussion about whether this is the right place to make this transformation. Reverting for now while we figure it out. llvm-svn: 180834	2013-05-01 00:25:27 +00:00
Akira Hatanaka	4254319ef9	[mips] Fix handling of instructions which copy to/from accumulator registers. Expand copy instructions between two accumulator registers before callee-saved scan is done. Handle copies between integer GPR and hi/lo registers in MipsSEInstrInfo::copyPhysReg. Delete pseudo-copy instructions that are not needed. llvm-svn: 180827	2013-04-30 23:22:09 +00:00
Stephen Lin	699808ceb2	Only pass 'returned' to target-specific lowering code when the value of entire register is guaranteed to be preserved. llvm-svn: 180825	2013-04-30 22:49:28 +00:00
Richard Trieu	624c2ebcbb	Fix a use after free. RI is freed before the call to getDebugLoc(). To prevent this, capture the location before RI is freed. llvm-svn: 180824	2013-04-30 22:45:10 +00:00
Akira Hatanaka	68741cc38d	[mips] Instruction selection patterns for DSP-ASE vector select and compare instructions. llvm-svn: 180820	2013-04-30 22:37:26 +00:00
Adrian Prantl	a2888e71eb	Temporarily revert "Change the informal convention of DBG_VALUE so that we can express a" because it breaks some buildbots. This reverts commit 180816. llvm-svn: 180819	2013-04-30 22:35:14 +00:00
Adrian Prantl	9a576644e4	Change the informal convention of DBG_VALUE so that we can express a register-indirect address with an offset of 0. It used to be that a DBG_VALUE is a register-indirect value if the offset (operand 1) is nonzero. The new convention is that a DBG_VALUE is register-indirect if the first operand is a register and the second operand is an immediate. For plain registers use the combination reg, reg. rdar://problem/13658587 llvm-svn: 180816	2013-04-30 22:16:46 +00:00
Andrew Trick	dd77014acc	MI Sched: revert a minor heuristic that snuck in with -misched-vcopy. I'll fix the heuristic in a general way in a follow-up commit. llvm-svn: 180815	2013-04-30 22:10:59 +00:00
Akira Hatanaka	9da442f506	[mips] Simplify code. No intended functionality changes. llvm-svn: 180807	2013-04-30 21:17:07 +00:00
Nadav Rotem	9feda6071a	Fix a typo llvm-svn: 180806	2013-04-30 21:04:51 +00:00
Jim Grosbach	0b914fe839	InstCombine: Fold more shuffles of shuffles. Always fold a shuffle-of-shuffle into a single shuffle when there's only one input vector in the first place. Continue to be more conservative when there's multiple inputs. rdar://13402653 PR15866 llvm-svn: 180802	2013-04-30 20:43:52 +00:00
Akira Hatanaka	84d6d9bdaa	[mips] Clear isCommutable bit of instructions which are not commutable. llvm-svn: 180801	2013-04-30 20:40:39 +00:00
Hal Finkel	7153251ab5	LocalStackSlotAllocation improvements First, taking advantage of the fact that the virtual base registers are allocated in order of the local frame offsets, remove the quadratic register-searching behavior. Because of the ordering, we only need to check the last virtual base register created. Second, store the frame index in the FrameRef structure, and get the frame index and the local offset from this structure at the top of the loop iteration. This allows us to de-nest the loops in insertFrameReferenceRegisters (and I think makes the code cleaner). I also moved the needsFrameBaseReg check into the first loop over instructions so that we don't bother pushing FrameRefs for instructions that don't want a virtual base register anyway. Lastly, and this is the only functionality change, avoid the creation of single-use virtual base registers. These are currently not useful because, in general, they end up replacing what would be one r+r instruction with an add and a r+i instruction. Committing this removes the XFAIL in CodeGen/PowerPC/2007-09-07-LoadStoreIdxForms.ll Jim has okayed this off-list. llvm-svn: 180799	2013-04-30 20:04:37 +00:00
Rafael Espindola	789a1c8a23	Text files should not be marked executable. Patch by Oliver Pinter. llvm-svn: 180797	2013-04-30 19:06:15 +00:00
Adrian Prantl	8beccf9e6d	Spelling. Thanks, Eric. llvm-svn: 180794	2013-04-30 17:33:32 +00:00
Adrian Prantl	0941638a1b	Set debug locations for branch instructions created during inlining, even the inlined function has multiple returns. rdar://problem/12415623 llvm-svn: 180793	2013-04-30 17:08:16 +00:00
Rafael Espindola	dd27530a44	Change getSlotIndex to return unsigned. The actual storage was already using unsigned, but the interface was using uint64_t. This is wasteful on 32 bits and looks to be the root causes of a miscompilation on Windows where a value was being sign extended to 64bits to compare with the result of getSlotIndex. Patch by Pasi Parviainen! llvm-svn: 180791	2013-04-30 16:53:38 +00:00
Rafael Espindola	52501033d0	Fix Addend computation for non external relocations on Macho. llvm-svn: 180790	2013-04-30 15:40:54 +00:00
David Majnemer	d73f37bb83	Fix a bug in foldSelectICmpAndOr. Differences in bitwidth between X and Y could exist even if C1 and C2 have the same Log2 representation. llvm-svn: 180779	2013-04-30 10:36:33 +00:00
Mihai Popa	af22d91af0	s tightens up the encoding description for ARM post-indexed ldr instructions. All instructions in this class have bit 4 cleared. It turns out that there is a test case for this, but it was marked XFAIL. llvm-svn: 180778	2013-04-30 09:00:12 +00:00
David Majnemer	8d048d0482	Fix "Combine bit test + conditional or into simple math" This fixes the optimization introduced in r179748 and reverted in r179750. While the optimization was sound, it did not properly respect differences in bit-width. llvm-svn: 180777	2013-04-30 08:57:58 +00:00
Stepan Dyatkovskiy	f5aa83dbb0	Refactoring patch. 1. VarArgStyleRegisters: functionality that emits "store" instructions for byval regs moved out into separated method "StoreByValRegs". Before this patch VarArgStyleRegisters had confused use-cases. It was used for both variadic functions and for regular functions with byval parameters. In last case it created new stack-frame and registered it as VarArg frame, that is wrong. This patch replaces VarArgsStyleRegisters usage for byval parameters with StoreByValRegs method. 2. In ARMMachineFunctionInfo, "get/setVarArgsRegSaveSize" was renamed to "get/setArgRegsSaveSize". By the same reason. Sometimes it was used for variadic functions, and sometimes for byval parameters in regular functions. Actually, this property means the size of registers, that keeps arguments, and thats why it was renamed. 3. In ARMISelLowering.cpp, ARMTargetLowering class, in methods computeRegArea and StoreByValRegs, VARegXXXXXX was renamed to ArgRegsXXXXXX still by the same reasons. llvm-svn: 180774	2013-04-30 07:19:58 +00:00
Rafael Espindola	d00c2765aa	Collect the Addend for external relocs. This fixes 2013-04-04-RelocAddend.ll. We don't have a testcase for non external relocs with an Addend. I will try to write one. llvm-svn: 180767	2013-04-30 01:29:57 +00:00
Vincent Lejeune	3a8d78a2c3	R600: Always use texture cache for compute shaders This will improve the performance of memory reads. llvm-svn: 180762	2013-04-30 00:14:44 +00:00
Vincent Lejeune	3abdbf1cad	R600: use native for alu llvm-svn: 180761	2013-04-30 00:14:38 +00:00
Vincent Lejeune	147700b8b4	R600: Packetize instructions llvm-svn: 180760	2013-04-30 00:14:27 +00:00
Vincent Lejeune	076c0b28e3	R600: Rework Scheduling to handle difference between VLIW4 and VLIW5 chips llvm-svn: 180759	2013-04-30 00:14:17 +00:00
Vincent Lejeune	22c4248213	R600: Add a Bank Swizzle operand llvm-svn: 180758	2013-04-30 00:14:08 +00:00
Vincent Lejeune	7c395f77de	R600: Take inner dependency into tex/vtx clauses llvm-svn: 180757	2013-04-30 00:14:00 +00:00
Vincent Lejeune	3f1d136b02	R600: Turn TEX/VTX into native instructions llvm-svn: 180756	2013-04-30 00:13:53 +00:00
Vincent Lejeune	c299164284	R600: Add FetchInst bit to instruction defs to denote vertex/tex instructions v2[Vincent Lejeune]: Split FetchInst into usesTextureCache/usesVertexCache llvm-svn: 180755	2013-04-30 00:13:39 +00:00
Vincent Lejeune	7d820c0bef	R600: Add some new processor variants llvm-svn: 180753	2013-04-30 00:13:27 +00:00
Vincent Lejeune	f501ea298b	R600: Clean up instruction class definitions llvm-svn: 180752	2013-04-30 00:13:20 +00:00
Vincent Lejeune	4a0beb5207	R600: config section now reports use of killgt llvm-svn: 180751	2013-04-30 00:13:13 +00:00
Bill Wendling	0494597566	Revert the command line option patch. However, keep the part that makes this pass on Windows. I.e., we don't emit the target dependent attributes in a comment before the function. llvm-svn: 180750	2013-04-29 23:48:06 +00:00
Bill Wendling	fb7e32ebd6	Emit the TLS initialization function pointers into the correct section. The `llvm.tls_init_funcs' (created by the front-end) holds pointers to the TLS initialization functions. These need to be placed into the correct section so that they are run before `main()'. <rdar://problem/13733006> llvm-svn: 180737	2013-04-29 22:25:40 +00:00
Rafael Espindola	e4dd2e0132	Add getSymbolAlignment to the ObjectFile interface. For regular object files this is only meaningful for common symbols. An object file format with direct support for atoms should be able to provide alignment information for all symbols. This replaces getCommonSymbolAlignment and fixes test-common-symbols-alignment.ll on darwin. This also includes a fix to MachOObjectFile::getSymbolFlags. It was marking undefined symbols as common (already tested by existing mcjit tests now that it is used). llvm-svn: 180736	2013-04-29 22:24:22 +00:00
Tom Stellard	119ad03c67	R600: Use correct CF_END instruction on Northern Island GPUs llvm-svn: 180735	2013-04-29 22:23:58 +00:00
Tom Stellard	8367067e02	R600: Fix encoding of CF_END_{EG, R600} instructions The EOP bit was not being encoded. llvm-svn: 180734	2013-04-29 22:23:54 +00:00
Rafael Espindola	2b06530ed6	Rationalize what is public in RuntimeDyldMachO and RuntimeDyldELF. The implemented RuntimeDyldImpl interface is public. Everything else is private. Since these classes are not inherited from (yet), there is no need to have protected members. llvm-svn: 180733	2013-04-29 22:06:33 +00:00
Arnold Schwaighofer	474df6d3ed	SimplifyCFG: If convert single conditional stores This resurrects r179957, but adds code that makes sure we don't touch atomic/volatile stores: This transformation will transform a conditional store with a preceeding uncondtional store to the same location: a[i] = may-alias with a[i] load if (cond) a[i] = Y into an unconditional store. a[i] = X may-alias with a[i] load tmp = cond ? Y : X; a[i] = tmp We assume that on average the cost of a mispredicted branch is going to be higher than the cost of a second store to the same location, and that the secondary benefits of creating a bigger basic block for other optimizations to work on outway the potential case where the branch would be correctly predicted and the cost of the executing the second store would be noticably reflected in performance. hmmer's execution time improves by 30% on an imac12,2 on ref data sets. With this change we are on par with gcc's performance (gcc also performs this transformation). There was a 1.2 % performance improvement on a ARM swift chip. Other tests in the test-suite+external seem to be mostly uninfluenced in my experiments: This optimization was triggered on 41 tests such that the executable was different before/after the patch. Only 1 out of the 40 tests (dealII) was reproducable below 100% (by about .4%). Given that hmmer benefits so much I believe this to be a fair trade off. llvm-svn: 180731	2013-04-29 21:28:24 +00:00
Rafael Espindola	b39478e8ec	Update the documentation. llvm-svn: 180725	2013-04-29 19:33:51 +00:00
Rafael Espindola	3700894249	Use a RelocationRef instead of a relocation_iterator. No functionality change. llvm-svn: 180723	2013-04-29 19:03:21 +00:00
Reid Kleckner	e02c622baa	Revert "revert r179735, it has no testcases, and doesn't really make sense." This un-reverts r179735 and reverts commit r180574. This fixes assertion failures for me locally and should fix the failures on Windows reported widely on llvm-dev. We should check if the bots caught this and if so why not. llvm-svn: 180722	2013-04-29 18:23:53 +00:00
Andrew Kaylor	31be5eff33	Exposing MCJIT through C API Re-submitting with fix for OCaml dependency problems (removing dependency on SectionMemoryManager when it isn't used). Patch by Fili Pizlo llvm-svn: 180720	2013-04-29 17:49:40 +00:00
Rafael Espindola	f1f1c626e7	Propagate relocation info to resolveRelocation. This gets most of the MCJITs tests passing with MachO. llvm-svn: 180716	2013-04-29 17:24:34 +00:00
Rafael Espindola	4d4a48d91f	Replace ObjRelocationInfo with relocation_iterator. For MachO we need information that is not represented in ObjRelocationInfo. Instead of copying the bits we think are needed from a relocation_iterator, just pass the relocation_iterator down to the format specific functions. No functionality change yet as we still drop the information once processRelocationRef returns. llvm-svn: 180711	2013-04-29 14:44:23 +00:00
Michael Gottesman	03cf3c8966	Add in some conditional compilation in order to silence an unused variable warning. llvm-svn: 180700	2013-04-29 07:29:08 +00:00
Michael Gottesman	214ca90f8e	[objc-arc] Apply the RV optimization to retains next to calls in ObjCARCContract instead of ObjCARCOpts. Turning retains into retainRV calls disrupts the data flow analysis in ObjCARCOpts. Thus we move it as late as we can by moving it into ObjCARCContract. We leave in the conversion from retainRV -> retain in ObjCARCOpt since it enables the dataflow analysis. rdar://10813093 llvm-svn: 180698	2013-04-29 06:53:53 +00:00
Michael Gottesman	9c11815978	Added statistics to count the number of retains/releases before/after optimization. llvm-svn: 180697	2013-04-29 06:16:57 +00:00
Michael Gottesman	8005ad3f3e	Removed trailing whitespace. llvm-svn: 180696	2013-04-29 06:16:55 +00:00
Michael Gottesman	3e3977c49f	Fix for r180693. = /. llvm-svn: 180694	2013-04-29 05:25:39 +00:00
Michael Gottesman	a87bb8f50b	[objc-arc-annotations] Moved the disabling of call movement to ConnectTDBUTraversals so that I can prevent Changed = true from being set. This prevents an infinite loop. llvm-svn: 180693	2013-04-29 05:13:13 +00:00
Benjamin Kramer	83e2a44a13	Inline variable into the #ifdef block where it's used. llvm-svn: 180688	2013-04-28 07:47:04 +00:00
Jia Liu	a5a5c715e1	AArch64 InstrFormats: delete blank. llvm-svn: 180687	2013-04-28 01:45:11 +00:00
Joerg Sonnenberger	447440907e	Fix typo. Stupid me. llvm-svn: 180686	2013-04-27 22:32:54 +00:00
Joerg Sonnenberger	66241831dc	Only use cxxabi.h's demangler, if it is actually available. llvm-svn: 180684	2013-04-27 22:12:32 +00:00
Shuxin Yang	04a4fd43aa	Fix a XOR reassociation bug. When Reassociator optimize "(x \| C1)" ^ "(X & C2)", it may swap the two subexpressions, however, it forgot to swap cached constants (of C1 and C2) accordingly. rdar://13739160 llvm-svn: 180676	2013-04-27 18:02:12 +00:00
Andrew Trick	85058af650	Generalize the MachineTraceMetrics public API. Naturally, we should be able to pass in extra instructions, not just extra blocks. llvm-svn: 180667	2013-04-27 03:54:20 +00:00
Eric Christopher	203e12bf9e	Use the target triple from the target machine rather than the module to determine whether or not we're on a darwin platform for debug code emitting. Solves the problem of a module with no triple on the command line and no triple in the module using non-gdb ok features on darwin. Fix up the member-pointers test to check the correct things for cross platform (DW_FORM_flag is a good prefix). Unfortunately no testcase because I have no ideas how to test something without a triple and without a triple in the module yet check precisely on two platforms. Ideas welcome. llvm-svn: 180660	2013-04-27 01:07:52 +00:00
Rafael Espindola	1357ab74e5	Make all darwin ppc stubs local. This fixes pr15763. Patch by David Fang. llvm-svn: 180657	2013-04-27 00:43:16 +00:00
Manman Ren	5c37106d65	Struct-path aware TBAA: change the format of TBAAStructType node. We switch the order of offset and field type to make TBAAStructType node (name, parent node, offset) similar to scalar TBAA node (name, parent node). TypeIsImmutable is added to TBAAStructTag node. llvm-svn: 180654	2013-04-27 00:26:11 +00:00
Adrian Prantl	d4c0dd4776	Cleanup and document MachineLocation. Clarify documentation and API to make the difference between register and register-indirect addressed locations more explicit. Put in a comment to point out that with the current implementation we cannot specify a register-indirect location with offset 0 (a breg 0 in DWARF). No functionality change intended. rdar://problem/13658587 llvm-svn: 180641	2013-04-26 21:57:17 +00:00
Bill Wendling	55a9c97c9c	Micro-optimization TLVs probably won't be as common as the other types of variables. Check for them last before defaulting to "DATA". llvm-svn: 180631	2013-04-26 21:15:08 +00:00
Nadav Rotem	be0e89d9e8	Teach the interpreter to handle vector compares and additional vector arithmetic operations. Patch by Yuri Veselov. llvm-svn: 180626	2013-04-26 20:19:41 +00:00
Rafael Espindola	6e040c0be2	Use llvm/Object/MachO.h in macho-dumper. Drop the old macho parser. For Mach-O there were 2 implementations for parsing object files. A standalone llvm/Object/MachOObject.h and llvm/Object/MachO.h which implements the generic interface in llvm/Object/ObjectFile.h. This patch adds the missing features to MachO.h, moves macho-dump to use MachO.h and removes ObjectFile.h. In addition to making sure that check-all is clean, I checked that the new version produces exactly the same output in all Mach-O files in a llvm+clang build directory (including executables and shared libraries). To test the performance, I ran macho-dump over all the files in a llvm+clang build directory again, but this time redirecting the output to /dev/null. Both the old and new versions take about 4.6 seconds (2.5 user) to finish. llvm-svn: 180624	2013-04-26 20:07:33 +00:00
Tom Stellard	456adc6c4e	R600: Initialize AMDGPUMachineFunction::ShaderType to ShaderType::COMPUTE We need to intialize this to something and since clang does not set the shader type attribute and clang is used only for compute shaders, initializing it to COMPUTE seems like the best choice. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 180620	2013-04-26 18:32:24 +00:00
Adrian Prantl	d00333a4b2	fix a typo that due to cu&paste quadrupled itself rdar://problem/13056109 llvm-svn: 180618	2013-04-26 18:10:50 +00:00
Quentin Colombet	a83d5e9f91	ARM: Fix encoding of hint instruction for Thumb. "hint" space for Thumb actually overlaps the encoding space of the CPS instruction. In actuality, hints can be defined as CPS instructions where imod and M bits are all nil. Handle decoding of permitted nop-compatible hints (i.e. nop, yield, wfi, wfe, sev) in DecodeT2CPSInstruction. This commit adds a proper diagnostic message for Imm0_4 and updates all tests. Patch by Mihail Popa <Mihail.Popa@arm.com>. llvm-svn: 180617	2013-04-26 17:54:54 +00:00
Adrian Prantl	29b9de7bf1	Bugfix for the debug intrinsic handling in InstCombiner: Since we can't guarantee that the original dbg.declare instrinsic is removed by LowerDbgDeclare(), we need to make sure that we are not inserting the same dbg.value intrinsic over and over. This removes tons of redundant DIEs when compiling optimized code. rdar://problem/13056109 llvm-svn: 180615	2013-04-26 17:48:33 +00:00
Ulrich Weigand	136ac22eaa	PowerPC: Use RegisterOperand instead of RegisterClass operands In the default PowerPC assembler syntax, registers are specified simply by number, so they cannot be distinguished from immediate values (without looking at the opcode). This means that the default operand matching logic for the asm parser does not work, and we need to specify custom matchers. Since those can only be specified with RegisterOperand classes and not directly on the RegisterClass, all instructions patterns used by the asm parser need to use a RegisterOperand (instead of a RegisterClass) for all their register operands. This patch adds one RegisterOperand for each RegisterClass, using the same name as the class, just in lower case, and updates all instruction patterns to use RegisterOperand instead of RegisterClass operands. llvm-svn: 180611	2013-04-26 16:53:15 +00:00
Silviu Baranga	af7e8c367f	Re-write the address propagation code for pre-indexed loads/stores to take into account some previously misssed cases (PRE_DEC addressing mode, the offset and base address are swapped, etc). This should fix PR15581. llvm-svn: 180609	2013-04-26 15:52:24 +00:00
Ulrich Weigand	551b085d55	PowerPC: Fix encoding of vsubcuw and vsum4sbs instructions When testing the asm parser, I noticed wrong encodings for the above instructions (wrong sub-opcodes). Tests will be added together with the asm parser. llvm-svn: 180608	2013-04-26 15:39:57 +00:00
Ulrich Weigand	48b949b650	PowerPC: Fix encoding of stfsu and stfdu instructions When testing the asm parser, I noticed wrong encodings for the above instructions (wrong sub-opcodes). Note that apparently the compiler currently never generates pre-inc instructions for floating point types for some reason ... Tests will be added together with the asm parser. llvm-svn: 180607	2013-04-26 15:39:40 +00:00
Ulrich Weigand	fa451ba1b9	PowerPC: Fix encoding of rldimi and rldcl instructions When testing the asm parser, I noticed wrong encodings for the above instructions (wrong operand name in rldimi, wrong form and sub-opcode for rldcl). Tests will be added together with the asm parser. llvm-svn: 180606	2013-04-26 15:39:12 +00:00
Ulrich Weigand	72a7dc0d7d	PowerPC: Support PC-relative fixup_ppc_brcond14. When testing the asm parser, I ran into an error when using a conditional branch to an external symbol (this doesn't occur in compiler-generated code) due to missing support in PPCELFObjectWriter::getRelocTypeInner. llvm-svn: 180605	2013-04-26 15:38:30 +00:00
Benjamin Kramer	ae81474a38	ARM/NEON: Pattern match vector integer abs to vabs. llvm-svn: 180604	2013-04-26 15:00:57 +00:00
Benjamin Kramer	aec90531f9	X86: Now that we have a canonical form for vector integer abs, match it into pabs. llvm-svn: 180600	2013-04-26 12:05:21 +00:00
Benjamin Kramer	d56ffc709d	DAGCombiner: Canonicalize vector integer abs in the same way we do it for scalars. This already helps SSE2 x86 a lot because it lacks an efficient way to represent a vector select. The long term goal is to enable the backend to match a canonicalized pattern into a single instruction (e.g. vabs or pabs). llvm-svn: 180597	2013-04-26 09:19:19 +00:00
Nadav Rotem	13306816fc	LoopVectorizer: Calculate the number of pointers to disambiguate at runtime based on the numbers of reads and writes. llvm-svn: 180593	2013-04-26 05:08:59 +00:00
Michael Gottesman	47cf8a4c12	Revert "[objc-arc] Added ImpreciseAutoreleaseSet to track autorelease calls that were once autoreleaseRV instructions." This reverts commit r180222. I think this might tie in with a different problem which will require a different approach potentially. I am reverting this in the case I need to go down that second path. My apologies for the noise. = /. llvm-svn: 180590	2013-04-26 01:12:18 +00:00
Jack Carter	c15c1d245b	Mips assembler: .set reorder support Mips have delayslots for certain instructions like jumps and branches. These are instructions that follow the branch or jump and are executed before the jump or branch is completed. Early Mips compilers could not cope with delayslots and left them up to the assembler. The assembler would fill the delayslots with the appropriate instruction, usually just a nop to allow correct runtime behavior. The default behavior for this is set with .set reorder. To tell the assembler that you don't want it to mess with the delayslot one used .set noreorder. For backwards compatibility we need to support .set reorder and have it be the default behavior in the assembler. Our support for it is to insert a NOP directly after an instruction with a delayslot when in .set reorder mode. Contributer: Vladimir Medic llvm-svn: 180584	2013-04-25 23:31:35 +00:00
Preston Gurd	128920d9fa	Make function documentation conform to llvm standards. Expunge all remaining traces and use of live variable information. llvm-svn: 180577	2013-04-25 21:31:33 +00:00
Arnold Schwaighofer	9881dcf2f2	ARM cost model: Integer div and rem is lowered to a function call Reflect this in the cost model. I observed this in MiBench/consumer-lame. radar://13354716 llvm-svn: 180576	2013-04-25 21:16:18 +00:00
Andrew Kaylor	ced4e8ff6e	Re-enabling MCJIT object caching with memory leak fixed llvm-svn: 180575	2013-04-25 21:02:36 +00:00
Chris Lattner	6b2702a6cc	revert r179735, it has no testcases, and doesn't really make sense. llvm-svn: 180574	2013-04-25 20:34:16 +00:00
Preston Gurd	8b7ab4ba2b	This patch adds the X86FixupLEAs pass, which will reduce instruction latency for certain models of the Intel Atom family, by converting instructions into their equivalent LEA instructions, when it is both useful and possible to do so. llvm-svn: 180573	2013-04-25 20:29:37 +00:00
Nadav Rotem	f43cbeee15	LoopVectorizer: No need to generate pointer disambiguation checks between readonly pointers. llvm-svn: 180570	2013-04-25 19:55:03 +00:00
Reid Kleckner	d973ca3c51	[mc-coff] Forward Linker Option flags into the .drectve section Summary: This is modelled on the Mach-O linker options implementation and should support a Clang implementation of #pragma comment(lib/linker). Reviewers: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D724 llvm-svn: 180569	2013-04-25 19:34:41 +00:00
Rafael Espindola	b770f897ee	Fix section relocation for SECTIONREL32 with immediate offset. Patch by Kai Nacke. This matches the gnu as output. llvm-svn: 180568	2013-04-25 19:27:05 +00:00
Rafael Espindola	04d3f49608	Use a pointer as the relocation iterator. Since the relocation iterator walks only the relocations in one section, we can just use a pointer and avoid fetching information about the section at every reference. llvm-svn: 180262	2013-04-25 12:45:46 +00:00
Rafael Espindola	1e48387962	Clarify getRelocationAddress x getRelocationOffset a bit. getRelocationAddress is for dynamic libraries and executables, getRelocationOffset for relocatable objects. Mark the getRelocationAddress of COFF and MachO as not implemented yet. Add a test of ELF's. llvm-readobj -r now prints the same values as readelf -r. llvm-svn: 180259	2013-04-25 12:28:45 +00:00
Silviu Baranga	4ad2bc5963	Fix constant folding for one lane vector types. Constant folding one lane vector types not returns a vector instead of a scalar. llvm-svn: 180254	2013-04-25 09:32:33 +00:00
Rafael Espindola	72780ed996	Revert "Adding object caching support to MCJIT" This reverts commit 07f03923137a91e3cca5d7fc075a22f8c9baf33a. Looks like it broke the valgrind bot: http://lab.llvm.org:8011/builders/llvm-x86_64-linux-vg_leak/builds/649 llvm-svn: 180249	2013-04-25 03:47:41 +00:00
Rafael Espindola	837448bc19	Revert "Exposing MCJIT through C API" This reverts commit 8c31b298149ca3c3f2bbd9e8aa9a01c4d91f3d74. It looks like this commit broke some bots: http://lab.llvm.org:8011/builders/llvm-ppc64-linux2/builds/5209 llvm-svn: 180248	2013-04-25 03:19:12 +00:00
Akira Hatanaka	f0aa6c9101	[mips] Add definitions of micromips load and store instructions. Patch by Zoran Jovanovic. llvm-svn: 180241	2013-04-25 01:21:25 +00:00
Akira Hatanaka	cd9b74a599	[mips] Add definitions of micromips shift instructions. Patch by Zoran Jovanovic. llvm-svn: 180238	2013-04-25 01:11:15 +00:00
Tom Stellard	87047f69ad	R600: Initialize BooleanVectorContents Fixes test/CodeGen/R600/setcc.ll llvm-svn: 180231	2013-04-24 23:56:18 +00:00
Tom Stellard	34e4068d05	R600: Use SHT_PROGBITS for the .AMDGPU.config section The libelf implementation that is distributed here: http://www.mr511.de/software/english.html will not parse sections that are marked SHT_NULL. llvm-svn: 180230	2013-04-24 23:56:14 +00:00
Andrew Kaylor	ee1e45796e	Exposing MCJIT through C API Patch by Filip Pizlo llvm-svn: 180229	2013-04-24 23:33:53 +00:00
Andrew Trick	2e87517144	Fix for r180193 - MI Sched: eliminate local vreg. Fixes PR15838. Need to check for blocks with nothing but dbg.value. I'm not sure how to force this situation with a unit test. I tried to reduce the test case in PR15838 (1k lines of metadata) but gave up. llvm-svn: 180227	2013-04-24 23:19:56 +00:00
Chad Rosier	108d5a61b7	[inline asm] Fix a crasher for an invalid value type/register class. rdar://13731657 llvm-svn: 180226	2013-04-24 22:53:10 +00:00
Andrew Kaylor	f91b5acc99	Making invalidateInstructionCache automatic in SectionMemoryManager llvm-svn: 180225	2013-04-24 22:39:12 +00:00
Michael Gottesman	fdb497a9b2	[objc-arc] Added ImpreciseAutoreleaseSet to track autorelease calls that were once autoreleaseRV instructions. Due to the semantics of ARC, we must be extremely conservative with autorelease calls inserted by the frontend since ARC gaurantees that said object will be in the autorelease pool after that point, an optimization invariant that the optimizer must respect. On the other hand, we are allowed significantly more flexibility with autoreleaseRV instructions. Often times though this flexibility is disrupted by early transformations which transform objc_autoreleaseRV => objc_autorelease if said instruction is no longer being used as part of an RV pair (generally due to inlining). Since we can not tell the difference in between an autorelease put into place by the frontend and one created through said ``strength reduction'' we can not perform these optimizations. The addition of this set gets around said issues by allowing us to differentiate in between said two cases. rdar://problem/13697741. llvm-svn: 180222	2013-04-24 22:18:18 +00:00
Michael Gottesman	cd5b02701c	Fixed comment typo. llvm-svn: 180221	2013-04-24 22:18:15 +00:00
Rafael Espindola	75c3036d4b	Use pointers to iterate over symbols. While here, don't report a dummy symbol for relocations that don't have symbols. We used to says such relocations were for the first defined symbol, but now we return end_symbols(). The llvm-readobj output change agrees with otool. llvm-svn: 180214	2013-04-24 19:47:55 +00:00
Arnold Schwaighofer	3fa801fbc2	LoopVectorizer: Change variable name Stride to ConsecutiveStride This makes it easier to read the code. No functionality change. llvm-svn: 180197	2013-04-24 16:16:03 +00:00
Arnold Schwaighofer	a6578f7056	LoopVectorize: Scalarize padded types This patch disables memory-instruction vectorization for types that need padding bytes, e.g., x86_fp80 has 10 bytes store size with 6 bytes padding in darwin on x86_64. Because the load/store vectorization is performed by the bit casting to a packed vector, which has incompatible memory layout due to the lack of padding bytes, the present vectorizer produces inconsistent result for memory instructions of those types. This patch checks an equality of the AllocSize of a scalar type and allocated size for each vector element, to ensure that there is no padding bytes and the array can be read/written using vector operations. Patch by Daisuke Takahashi! Fixes PR15758. llvm-svn: 180196	2013-04-24 16:16:01 +00:00
Arnold Schwaighofer	23a0589bce	LoopVectorizer: Bail out if we don't have datalayout we need it llvm-svn: 180195	2013-04-24 16:15:58 +00:00
Rafael Espindola	b68c5f6bc3	Revert r180189. This should bring the ppc bots back. I will try to write a test that would have found the problem on a little endian system too. llvm-svn: 180194	2013-04-24 16:10:49 +00:00
Andrew Trick	85a1d4cbc0	MI Sched: eliminate local vreg copies. For now, we just reschedule instructions that use the copied vregs and let regalloc elliminate it. I would really like to eliminate the copies on-the-fly during scheduling, but we need a complete implementation of repairIntervalsInRange() first. The general strategy is for the register coalescer to eliminate as many global copies as possible and shrink live ranges to be extended-basic-block local. The coalescer should not have to worry about resolving local copies (e.g. it shouldn't attemp to reorder instructions). The scheduler is a much better place to deal with local interference. The coalescer side of this equation needs work. llvm-svn: 180193	2013-04-24 15:54:43 +00:00
Andrew Trick	608a698cdf	Register Coalescing: add a flag to disable rescheduling. When MachineScheduler is enabled, this functionality can be removed. Until then, provide a way to disable it for test cases and designing MachineScheduler heuristics. llvm-svn: 180192	2013-04-24 15:54:39 +00:00
Andrew Trick	7c791a3dc4	MI Sched: regpressure tracing. llvm-svn: 180191	2013-04-24 15:54:36 +00:00
Rafael Espindola	137faa05a2	Formatting fixes. llvm-svn: 180190	2013-04-24 15:14:22 +00:00
Rafael Espindola	ec4e350f12	Use a pointer as the relocation iterator. Since the relocation iterator walks only the relocations in one section, we can just use a pointer and avoid fetching information about the section at every reference. llvm-svn: 180189	2013-04-24 15:02:03 +00:00
Eric Christopher	4eb5eb5bc8	Formatting. llvm-svn: 180186	2013-04-24 12:56:18 +00:00
Bill Wendling	4e9fc023c6	Align the __LD,__compact_unwind section. I know what would be cool! We should align the compact unwind section because aligned data access is faster. <rdar://problem/13723271> llvm-svn: 180171	2013-04-24 03:11:14 +00:00
Eric Christopher	9efcc4ae7a	Fix dependency layering issues caused by r180112. Patch by Tom Stellard. (Committed while he's afk per request) llvm-svn: 180157	2013-04-23 22:53:53 +00:00
Andrew Kaylor	1d2d8e0e84	Adding object caching support to MCJIT llvm-svn: 180146	2013-04-23 21:26:38 +00:00
Jyotsna Verma	af2359b98c	Hexagon: Use multiclass for combine and STri[bhwd]_shl_V4 instructions. llvm-svn: 180145	2013-04-23 21:17:40 +00:00
Jyotsna Verma	f00aab98a0	Hexagon: Define relations for GP-relative instructions. No functionality change. llvm-svn: 180144	2013-04-23 21:05:55 +00:00
Adrian Prantl	15db52bf6d	Make sure the instruction right after an inlined function has a debug location. This solves a problem where range of an inlined subroutine is emitted wrongly. Patch by Manman Ren. Fixes rdar://problem/12415623 llvm-svn: 180140	2013-04-23 19:56:03 +00:00
Stephen Lin	8118e0b588	Add more tests for r179925 to verify correct handling of signext/zeroext; strengthen condition check to require actual MVT::i32 virtual register types, just in case (no actual functionality change) llvm-svn: 180138	2013-04-23 19:42:25 +00:00
Stephen Lin	4eedb29b05	Lowercase "is" boolean variable prefix for consistency within function, no functionality change. llvm-svn: 180136	2013-04-23 19:30:12 +00:00
Jyotsna Verma	89c84821ea	Hexagon: Remove assembler mapped instruction definitions. llvm-svn: 180133	2013-04-23 19:15:55 +00:00
Bill Schmidt	a76bf5a6d0	Change commentary for PowerPC Boolean vector contents. No functional change intended. llvm-svn: 180131	2013-04-23 18:49:44 +00:00
Akira Hatanaka	e9d0b318b1	[mips] Compare splat value with element size instead of calling isUIntN. No intended changes in functionality. llvm-svn: 180130	2013-04-23 18:09:42 +00:00
Owen Anderson	2d4cca35c3	DAGCombine should not aggressively fold SEXT(VSETCC(...)) into a wider VSETCC without first checking the target's vector boolean contents. This exposed an issue with PowerPC AltiVec where it appears it was setting the wrong vector boolean contents. The included change fixes the PowerPC tests, and was OK'd by Hal. llvm-svn: 180129	2013-04-23 18:09:28 +00:00
Aaron Ballman	31c0adc68c	Testing for _XCR_XFEATURE_ENABLED_MASK instead of a specific MSVC version because some MSVC 2010 SP1 installations do not have the _xgetbv intrinsic. Patch thanks to Serge Pavlov! llvm-svn: 180125	2013-04-23 17:38:44 +00:00
Vincent Lejeune	117f075f6e	R600: Use .AMDGPU.config section to emit stacksize llvm-svn: 180124	2013-04-23 17:34:12 +00:00
Vincent Lejeune	b6bfe85a07	R600: Add CF_END llvm-svn: 180123	2013-04-23 17:34:00 +00:00
Nadav Rotem	71c9d6d333	LoopVectorizer: Fix 15830. When scalarizing and unrolling stores make sure that the order in which the elements are scalarized is the same as the original order. This fixes a miscompilation in FreeBSD's regex library. llvm-svn: 180121	2013-04-23 17:12:42 +00:00
Jyotsna Verma	a696239bec	Hexagon: Remove duplicate instructions to handle global/immediate values for absolute/absolute-set addressing modes. llvm-svn: 180120	2013-04-23 17:11:46 +00:00
Pekka Jaaskelainen	d3c90e132a	Call the potentially costly isAnnotatedParallel() only once. Made the uniform write test's checks a bit stricter. llvm-svn: 180119	2013-04-23 16:44:43 +00:00
Stephen Lin	6c70dc7842	Add some constraints to use of 'returned': 1) Disallow 'returned' on parameter that is also 'sret' (no sensible semantics, as far as I can tell). 2) Conservatively disallow tail calls through 'returned' parameters that also are 'zext' or 'sext' (for consistency with treatment of other zero-extending and sign-extending operations in tail call position detection...can be revised later to handle situations that can be determined to be safe). This is a new attribute that is not yet used, so there is no impact. llvm-svn: 180118	2013-04-23 16:31:56 +00:00
Tom Stellard	a1fd35a04c	Wrap.h: Define wrap / unwrap function for ExecutionEngine llvm-svn: 180112	2013-04-23 15:13:36 +00:00
Alexey Samsonov	fdcff04ad5	Fixup for r180094: properly use MSan interface functions llvm-svn: 180103	2013-04-23 13:35:32 +00:00
Carlo Kok	8c6719bf07	Expose IRBuilder::CreateAtomicRMW as LLVMBuildAtomicRMW in llvm-c. llvm-svn: 180100	2013-04-23 13:21:19 +00:00
Alexey Samsonov	0c9f1bfae5	Tell MSan that memory initialized by libz is valid llvm-svn: 180094	2013-04-23 12:17:46 +00:00
Alexey Samsonov	068fc8ae6e	Use zlib to uncompress debug sections in DWARF parser. This makes llvm-dwarfdump and llvm-symbolizer understand debug info sections compressed by ld.gold linker. llvm-svn: 180088	2013-04-23 10:17:34 +00:00
Hans Wennborg	63761d4bc4	Add llvm_unreachable after fully covered switch to pacify GCC llvm-svn: 180087	2013-04-23 10:12:16 +00:00
Alexey Samsonov	28acf056e1	Add more guards around zlib-dependent code llvm-svn: 180084	2013-04-23 08:57:30 +00:00
Alexey Samsonov	2fb337e77a	Add basic zlib support to LLVM. This would allow to use compression/uncompression in selected LLVM tools. llvm-svn: 180083	2013-04-23 08:28:39 +00:00
Pekka Jaaskelainen	6f2f66b63f	Refuse to (even try to) vectorize loops which have uniform writes, even if erroneously annotated with the parallel loop metadata. Fixes Bug 15794: "Loop Vectorizer: Crashes with the use of llvm.loop.parallel metadata" llvm-svn: 180081	2013-04-23 08:08:51 +00:00
Tim Northover	2ac2d4c59d	AArch64: remove unnecessary check that RS is valid AArch64 always demands a register-scavenger, so the pointer should never be NULL. However, in the spirit of paranoia, we'll assert it before use just in case. llvm-svn: 180080	2013-04-23 06:55:15 +00:00
Manman Ren	4a4970ec6a	Struct-path aware TBAA: update getMostGenericTBAA The tag is of type TBAANode when flag EnableStructPathTBAA is off. Move implementation of MDNode::getMostGenericTBAA to TypeBasedAliasAnalysis.cpp since it depends on how to interprete the MDNodes for scalar TBAA and struct-path aware TBAA. llvm-svn: 180068	2013-04-22 23:00:44 +00:00
Matt Arsenault	034ca0fe41	Remove unused DwarfSectionOffsetDirective string The value isn't actually used, and setting it emits a COFF specific directive. llvm-svn: 180064	2013-04-22 22:49:11 +00:00
Eric Christopher	04d4e9312c	Move C++ code out of the C headers and into either C++ headers or the C++ files themselves. This enables people to use just a C compiler to interoperate with LLVM. llvm-svn: 180063	2013-04-22 22:47:22 +00:00
Chad Rosier	65dd0399c6	[ms-inline asm] Removed this unnecessary check. In the current implementation, Disp will always be one of MCSymbolRefExpr or MCConstantExpr, and never NULL. llvm-svn: 180059	2013-04-22 22:38:35 +00:00
Chad Rosier	dba3fe557c	[ms-inline asm] Get the OpDecl and remove a redundant lookup. Part of rdar://13663589 llvm-svn: 180057	2013-04-22 22:12:12 +00:00
Chad Rosier	732b837a41	[ms-inline asm] Add the OpDecl to the InlineAsmIdentifierInfo struct and in turn the MCParsedAsmOperand. Part of rdar://13663589 llvm-svn: 180054	2013-04-22 22:04:25 +00:00
Eli Bendersky	58b04b7e2e	Optimize MachineBasicBlock::getSymbol by caching the symbol. Since the symbol name computation is expensive, this helps save about 25% of the time spent in this function. llvm-svn: 180049	2013-04-22 21:21:08 +00:00
Anat Shemer	10260a75e3	Changed back (relative to commit 179786) the operations executed when extract(cast) is transformed to cast(extract). It uses the Builder class as before. In addition the result node is added to the Worklist, so all the previous extract users will become the new scalar cast users. llvm-svn: 180045	2013-04-22 20:51:10 +00:00
Chad Rosier	eeb0034918	Fix unused variable warning. llvm-svn: 180044	2013-04-22 20:42:32 +00:00
Akira Hatanaka	d8fb032cff	80 columns. llvm-svn: 180040	2013-04-22 20:13:37 +00:00
Akira Hatanaka	0d6964cf4a	[mips] In performDSPShiftCombine, check that all elements in the vector are shifted by the same amount and the shift amount is smaller than the element size. llvm-svn: 180039	2013-04-22 19:58:23 +00:00
Chad Rosier	cb78f0d05e	[ms-inline asm] Remove the identifier parsing logic from the AsmParser. This is now taken care of by the frontend, which allows us to parse arbitrary C/C++ variables. Part of rdar://13663589 llvm-svn: 180037	2013-04-22 19:42:15 +00:00
Reid Kleckner	74679a93b2	[Support] Fix argv string escape bug on Windows Summary: This is http://llvm.org/PR15802. Backslashes preceding double quotes in arguments must be escaped. The interesting bit is that all other backslashes should not be escaped, because the un-escaping logic is only triggered by the presence of a double quote character. Reviewers: Bigcheese CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D705 llvm-svn: 180035	2013-04-22 19:03:55 +00:00
Peter Collingbourne	8988687d6b	COFF: Fix weak external aliases. Differential Revision: http://llvm-reviews.chandlerc.com/D700 llvm-svn: 180034	2013-04-22 18:48:56 +00:00
Eli Bendersky	d9806687bc	Fix for PR 14965: Better error message for GEP with partially defined contents llvm-svn: 180030	2013-04-22 17:03:42 +00:00
Chad Rosier	f6675c3d3e	[ms-inline asm] Refactor/clean up the SemaLookup interface. No functional change indended. Part of rdar://13663589 llvm-svn: 180028	2013-04-22 17:01:46 +00:00
Rafael Espindola	8bd2c228f8	Also verify llvm.compiler_used. llvm-svn: 180020	2013-04-22 15:16:51 +00:00
Rafael Espindola	74f2e46eef	Clarify that llvm.used can contain aliases. Also add a check for llvm.used in the verifier and simplify clients now that they can assume they have a ConstantArray. llvm-svn: 180019	2013-04-22 14:58:02 +00:00
Eric Christopher	cc2cfe426d	No really, don't store anything to this since it's unconditionally set below. llvm-svn: 180015	2013-04-22 14:11:25 +00:00
Eric Christopher	6647fb2c60	Remove variable store that is never read. llvm-svn: 180014	2013-04-22 13:51:44 +00:00
Eric Christopher	845c2ca78c	Remove variable store that is never read. llvm-svn: 180013	2013-04-22 13:46:33 +00:00
Stepan Dyatkovskiy	f80f9513ce	Fix for 5.5 Parameter Passing --> Stage C: -- C.4 and C.5 statements, when NSAA is not equal to SP. -- C.1.cp statement for VA functions. Note: There are no VFP CPRCs in a variadic procedure. Before this patch "NSAA != 0" means "don't use GPRs anymore ". But there are some exceptions in AAPCS. 1. For non VA function: allocate all VFP regs for CPRC. When all VFPs are allocated CPRCs would be sent to stack, while non CPRCs may be still allocated in GRPs. 2. Check that for VA functions all params uses GPRs and then stack. No exceptions, no CPRCs here. llvm-svn: 180011	2013-04-22 13:06:52 +00:00
Eric Christopher	44c6aa670f	Tidy. llvm-svn: 180000	2013-04-22 07:51:08 +00:00
Eric Christopher	25e3509c78	Update comment. Whitespace. llvm-svn: 179999	2013-04-22 07:47:40 +00:00
David Blaikie	f55abeaf4c	Revert "Revert "PR14606: debug info imported_module support"" This reverts commit r179840 with a fix to test/DebugInfo/two-cus-from-same-file.ll I'm not sure why that test only failed on ARM & MIPS and not X86 Linux, even though the debug info was clearly invalid on all of them, but this ought to fix it. llvm-svn: 179996	2013-04-22 06:12:31 +00:00
Craig Topper	7af39d7de0	Convert windows line endings to linux/unix line endings. llvm-svn: 179995	2013-04-22 05:38:01 +00:00
Craig Topper	2172ad64f9	Fix indentation. No functional change. llvm-svn: 179994	2013-04-22 04:24:02 +00:00
Craig Topper	f15655b2d9	Put 'else' on same line as preceding curly brace per coding standards. No functional change. llvm-svn: 179993	2013-04-22 04:22:40 +00:00

... 4 5 6 7 8 ...

61349 Commits