llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	7cb710d58c	Implemented public interface for modifying registered (not positional or sink options) command line options at runtime. Patch by Dan Liew! llvm-svn: 181254	2013-05-06 21:56:35 +00:00
Andrew Trick	0537a98878	Support command line option categories. Patch by Dan Liew! llvm-svn: 181253	2013-05-06 21:56:23 +00:00
Krzysztof Parzyszek	59df52c585	Cleanup of the HexagonTargetMachine setup. llvm-svn: 181250	2013-05-06 21:25:45 +00:00
David Majnemer	70f286d95f	InstCombine: (X ^ signbit) + C -> X + (signbit ^ C) llvm-svn: 181249	2013-05-06 21:21:31 +00:00
Eric Christopher	0cdce8351a	Hoist boundary condition out of loop header. llvm-svn: 181248	2013-05-06 21:19:44 +00:00
Eric Christopher	34ea33680f	Untabify. llvm-svn: 181247	2013-05-06 21:19:41 +00:00
Jyotsna Verma	84c471029b	Hexagon: Add multiclass/encoding bits for the New-Value Jump instructions. llvm-svn: 181235	2013-05-06 18:49:23 +00:00
Krzysztof Parzyszek	d50074712f	Make references to HexagonTargetMachine "const". llvm-svn: 181233	2013-05-06 18:38:37 +00:00
Andrew Trick	9c72b071fe	Rotate multi-exit loops even if the latch was simplified. Test case by Michele Scandale! Fixes PR10293: Load not hoisted out of loop with multiple exits. There are a few regressions with this patch, now tracked by rdar:13817079, and a roughly equal number of improvements. The regressions are almost certainly bad luck because LoopRotate has very little idea of whether rotation is profitable. Doing better requires a more comprehensive solution. This checkin is a quick fix that lacks generality (PR10293 has a counter-example). But it trivially fixes the case in PR10293 without interfering with other cases, and it does satisfy the criteria that LoopRotate is a loop canonicalization pass that should avoid heuristics and special cases. I can think of two approaches that would probably be better in the long run. Ultimately they may both make sense. (1) LoopRotate should check that the current header would make a good loop guard, and that the loop does not already have a sufficient guard. The artificial SimplifiedLoopLatch check would be unnecessary, and the design would be more general and canonical. Two difficulties: - We need a strong guarantee that we won't endlessly rotate, so the analysis would need to be precise in order to avoid the SimplifiedLoopLatch precondition. - Analyses like this are usually based on SCEV, which we don't want to rely on. (2) Rotate on-demand in late loop passes. This could even be done by shoving the loop back on the queue after the optimization that needs it. This could work well when we find LICM opportunities in multi-branch loops. This requires some work, and it doesn't really solve the problem of SCEV wanting a loop guard before the analysis. llvm-svn: 181230	2013-05-06 17:58:18 +00:00
Tom Stellard	d93cede8e4	R600: Remove dead code from the CodeEmitter v2 v2: - Replace switch statement with TSFlags query Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181229	2013-05-06 17:50:57 +00:00
Tom Stellard	043de4c5af	R600: Emit config values in register / value pairs Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181228	2013-05-06 17:50:51 +00:00
Eric Christopher	6c6de847a8	Remove unnecessary instance variable and rework logic accordingly. llvm-svn: 181227	2013-05-06 17:50:50 +00:00
Eric Christopher	f0303324be	Grammar. llvm-svn: 181226	2013-05-06 17:50:46 +00:00
Tom Stellard	cfe2ef8fea	R600: Stop emitting the instruction type byte before each instruction Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181225	2013-05-06 17:50:44 +00:00
Eric Christopher	92f3c0b49c	Don't emit .dwo sections unless they exist. llvm-svn: 181224	2013-05-06 17:50:42 +00:00
Tom Stellard	dbbcaf31b6	R600: Emit ISA for CALL_FS_* instructions Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181223	2013-05-06 17:50:26 +00:00
Ulrich Weigand	e7c6dfeb4b	[SystemZ] Update non-pic DWARF encodings As pointed out by Rafael Espindola, we should match the DWARF encodings produced by GCC in both pic and non-pic modes. This was not the case for the non-pic case. This patch changes all DWARF encodings to DW_EH_PE_absptr for the non-pic case, just like GCC does. The test case is updated to check for both variants. llvm-svn: 181222	2013-05-06 17:28:30 +00:00
Adhemerval Zanella	e8bd03da5c	PowerPC: Fix unimplemented relocation on ppc64 This patch handles the R_PPC64_REL64 relocation type for powerpc64 for mcjit. llvm-svn: 181220	2013-05-06 17:21:23 +00:00
Jean-Luc Duprat	3e4fc3ef24	Provide InstCombines for the following 3 cases: A * (1 - (uitofp i1 C)) -> select C, 0, A B * (uitofp i1 C) -> select C, B, 0 select C, 0, A + select C, B, 0 -> select C, B, A These come up in code that has been hand-optimized from a select to a linear blend, on platforms where that may have mattered. We want to undo such changes with the following transform: A(1 - uitofp i1 C) + B(uitofp i1 C) -> select C, A, B llvm-svn: 181216	2013-05-06 16:55:50 +00:00
Ulrich Weigand	5f613dfd1f	[SystemZ] Add back end This adds the actual lib/Target/SystemZ target files necessary to implement the SystemZ target. Note that at this point, the target cannot yet be built since the configure bits are missing. Those will be provided shortly by a follow-on patch. This version of the patch incorporates feedback from reviews by Chris Lattner and Anton Korobeynikov. Thanks to all reviewers! Patch by Richard Sandiford. llvm-svn: 181203	2013-05-06 16:15:19 +00:00
Ulrich Weigand	0213e7fcb8	[SystemZ] Define DWARF encoding This is another patch in preparation for adding the SystemZ target. It defines the appropriate values for DWARF encodings; the intent is to be compatible with what GCC currently does on the target. Patch by Richard Sandiford. llvm-svn: 181201	2013-05-06 16:11:12 +00:00
Ulrich Weigand	509c240ce5	[PowerPC] Fix memory corruption in AsmParser As pointed out by Evgeniy Stepanov, assigning a std::string temporary to a StringRef is not a good idea. Rework MatchRegisterName to avoid using the .lower routine. llvm-svn: 181192	2013-05-06 11:16:57 +00:00
Michael Kuperstein	ac868757d0	Fix slightly too aggressive concat_vector optimization. (Would sometimes optimize away concats used to extend a vector with undef values) llvm-svn: 181186	2013-05-06 08:06:13 +00:00
Nadav Rotem	632b25b743	Update the comment to mention that we use TTI. llvm-svn: 181178	2013-05-06 03:06:36 +00:00
Nadav Rotem	c70ef4e93c	Revert r164763 because it introduces new shuffles. Thanks Nick Lewycky for pointing this out. llvm-svn: 181177	2013-05-06 02:39:09 +00:00
Matt Arsenault	c23753a53e	Fix unchecked uses of DominatorTree in MemoryDependenceAnalysis. Use unknown results for places where it would be needed llvm-svn: 181176	2013-05-06 02:07:24 +00:00
Rafael Espindola	c229a4fff4	Fix const merging when an alias of a const is llvm.used. We used to disable constant merging not only if a constant is llvm.used, but also if an alias of a constant is llvm.used. This change fixes that. llvm-svn: 181175	2013-05-06 01:48:55 +00:00
Rafael Espindola	fa5942bc2c	Add EH support to the MCJIT. This gets exception handling working on ELF and MachO (x86-64 at least). Other than the EH frame registration, this patch also implements support for GOT relocations which are used to locate the personality function on MachO. llvm-svn: 181167	2013-05-05 20:43:10 +00:00
Evan Cheng	9fad6352d4	ARM AnalyzeBranch should conservatively return true when it sees a predicated indirect branch at the end of the BB. Otherwise if-converter, branch folding pass may incorrectly update its successor info if it considers BB as fallthrough to the next BB. rdar://13782395 llvm-svn: 181161	2013-05-05 18:06:32 +00:00
Evan Cheng	8b8e8d88ff	Teach if-converter to avoid removing BBs whose addresses are taken. rdar://13782395 llvm-svn: 181160	2013-05-05 18:03:49 +00:00
Benjamin Kramer	3e3f2a4b8d	LoopVectorize: Print values instead of pointers in debug output. llvm-svn: 181157	2013-05-05 14:54:52 +00:00
Richard Osborne	4498bd352f	[XCore] Add LDAPB instructions. With the change the disassembler now supports the XCore ISA in its entirety. llvm-svn: 181155	2013-05-05 13:36:53 +00:00
Richard Osborne	e41cdbd3aa	[XCore] Update LDAP to use pcrel_imm. llvm-svn: 181154	2013-05-05 13:33:10 +00:00
Richard Osborne	8bdfdf717a	[XCore] Rename calltarget -> pcrel_imm. No functionality change. llvm-svn: 181153	2013-05-05 13:29:02 +00:00
Richard Osborne	4d3514ee94	[XCore] Add BLRB instructions. llvm-svn: 181152	2013-05-05 13:24:16 +00:00
Richard Osborne	53a04fe2b4	[XCore] Remove '-' from back branch asm syntax. Instead operands are treated as negative immediates where the sign bit is implicit in the instruction encoding. llvm-svn: 181151	2013-05-05 13:20:22 +00:00
Benjamin Kramer	391f5a6e21	InlineSpiller: Remove quadratic behavior. No functionality change. llvm-svn: 181149	2013-05-05 11:29:14 +00:00
Stepan Dyatkovskiy	8c02c98259	For ARM backend, fixed "byval" attribute support. Now even the small structures could be passed within byval (small enough to be stored in GPRs). In regression tests next function prototypes are checked: PR15293: %artz = type { i32 } define void @foo(%artz* byval %s) define void @foo2(%artz* byval %s, i32 %p, %artz* byval %s2) foo: "s" stored in R0 foo2: "s" stored in R0, "s2" stored in R2. Next AAPCS rules are checked: 5.5 Parameters Passing, C.4 and C.5, "ParamSize" is parameter size in 32bit words: -- NSAA != 0, NCRN < R4 and NCRN+ParamSize > R4. Parameter should be sent to the stack; NCRN := R4. -- NSAA != 0, and NCRN < R4, NCRN+ParamSize < R4. Parameter stored in GPRs; NCRN += ParamSize. llvm-svn: 181148	2013-05-05 07:48:36 +00:00
David Majnemer	66fb70de38	Remove a recently redundant transform from X86ISelLowering. X86ISelLowering has support to treat: (icmp ne (and (xor %flags, -1), (shl 1, flag)), 0) as if it were actually: (icmp eq (and %flags, (shl 1, flag)), 0) However, r179386 has code at the InstCombine level to handle this. llvm-svn: 181145	2013-05-05 02:00:10 +00:00
Arnold Schwaighofer	d96e427eac	LoopVectorize: Add support for floating point min/max reductions Add support for min/max reductions when "no-nans-float-math" is enabled. This allows us to assume we have ordered floating point math and treat ordered and unordered predicates equally. radar://13723044 llvm-svn: 181144	2013-05-05 01:54:48 +00:00
Arnold Schwaighofer	f5183729db	LoopVectorizer: Cleanup of minimum/maximum pattern match code No need for setting the operands. The pointers are going to be bound by the matcher. radar://13723044 llvm-svn: 181142	2013-05-05 01:54:44 +00:00
Arnold Schwaighofer	a670a0a3aa	LoopVectorize: We don't need an identity element for min/max reductions We can just use the initial element that feeds the reduction. max(max(x, y), z) == max(max(x,y), max(x,z)) radar://13723044 llvm-svn: 181141	2013-05-05 01:54:42 +00:00
Dmitri Gribenko	3238fb7595	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Nadav Rotem	d61dcfc4fd	whitespace llvm-svn: 181137	2013-05-04 23:27:32 +00:00
Nadav Rotem	42932bdcd0	Fix an odd comment. llvm-svn: 181136	2013-05-04 23:24:56 +00:00
Tim Northover	7b55b97dba	AArch64: enable MCJIT and tests now that everything passes. This removes dire warnings about AArch64 being unsupported and enables the tests when appropriate on this platform. llvm-svn: 181135	2013-05-04 20:14:22 +00:00
Tim Northover	b23d8dbbac	AArch64: implement 64-bit absolute relocation in MCJIT This is about the simplest relocation, but surprisingly rare in actual code. It occurs in (for example) the MCJIT test test-ptr-reloc.ll. llvm-svn: 181134	2013-05-04 20:14:14 +00:00
Tim Northover	37cde9755d	AArch64: add stubs to support long function calls on MCJIT As with global accesses, external functions could exist anywhere in memory. Therefore the stub must create a complete 64-bit address. This patch implements the fragment as (roughly): movz x16, #:abs_g3:somefunc movk x16, #:abs_g2_nc:somefunc movk x16, #:abs_g1_nc:somefunc movk x16, #:abs_g0_nc:somefunc br x16 In principle we could save 4 bytes by using a literal-load instead, but it is unclear that would be more efficient and can only be tested when real hardware is readily available. This allows (for example) the MCJIT test 2003-05-07-ArgumentTest to pass on AArch64. llvm-svn: 181133	2013-05-04 20:14:09 +00:00
Tim Northover	4d01c1e0e6	AArch64: implement relocations for global access The large memory model (default and main viable for JIT) emits addresses in need of relocation as movz x0, #:abs_g3:somewhere movk x0, #:abs_g2_nc:somewhere movk x0, #:abs_g1_nc:somewhere movk x0, #:abs_g0_nc:somewhere To support this we must implement those four relocations in the dynamic loader. This allows (for example) the test-global.ll MCJIT test to pass on AArch64. llvm-svn: 181132	2013-05-04 20:14:04 +00:00
Tim Northover	fa1b2f85da	AArch64: implement first relocation required for MCJIT R_AARCH64_PCREL32 is present in even trivial .eh_frame sections and so is required to compile any function without the "nounwind" attribute. This change implements very basic infrastructure in the RuntimeDyldELF file and allows (for example) the test-shift.ll MCJIT test to pass on AArch64. llvm-svn: 181131	2013-05-04 20:13:59 +00:00
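
The InstCombine rewrite listed in commit 70f286d95f, (X ^ signbit) + C -> X + (signbit ^ C), can be checked by brute force. The sketch below is not taken from the LLVM sources: it assumes an 8-bit integer type with wrapping arithmetic, and the names BITS, MASK, SIGNBIT, lhs, rhs and identity_holds are hypothetical helpers chosen for illustration.

```python
# Brute-force check of (X ^ signbit) + C == X + (signbit ^ C)
# for an assumed 8-bit type with wrapping (modulo 2**BITS) arithmetic.
BITS = 8
MASK = (1 << BITS) - 1
SIGNBIT = 1 << (BITS - 1)


def lhs(x, c):
    """Original form: flip the sign bit of x, then add the constant."""
    return ((x ^ SIGNBIT) + c) & MASK


def rhs(x, c):
    """Rewritten form: fold the sign-bit flip into the constant."""
    return (x + (SIGNBIT ^ c)) & MASK


def identity_holds():
    """Compare both forms over every pair of 8-bit values."""
    values = range(1 << BITS)
    return all(lhs(x, c) == rhs(x, c) for x in values for c in values)


if __name__ == "__main__":
    print(identity_holds())
```

The two forms agree because flipping the sign bit is the same as adding it modulo 2**BITS, so both sides reduce to X + C + signbit.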

61099 Commits