llvm-project

Commit Graph

Author	SHA1	Message	Date
Roman Kashitsyn	9cf3625742	Test commit access llvm-svn: 215284	2014-08-09 16:05:23 +00:00
Joerg Sonnenberger	5f6b6cec70	Update disassembler test to check the full dccci/iccci form. llvm-svn: 215283	2014-08-09 14:01:10 +00:00
Joerg Sonnenberger	0d5e068fd5	Use the full form of dccci and iccci from the early PPC 405 documents, since the operands are actually used on those cores. Provide aliases for the only documented case in the newer Power ISA speec. llvm-svn: 215282	2014-08-09 13:58:31 +00:00
Eric Christopher	0ead61c336	Initialize PPC DataLayout based on the Triple only. llvm-svn: 215281	2014-08-09 04:53:17 +00:00
Eric Christopher	3770cf5961	Remove extraneous 64-bit argument to the PPC TargetMachine constructor and update initialization. llvm-svn: 215280	2014-08-09 04:38:56 +00:00
Eric Christopher	e950b6776b	Initialize X86 DataLayout based on the Triple only. llvm-svn: 215279	2014-08-09 04:38:53 +00:00
Matt Arsenault	996a0ef99e	R600: Disable FP exceptions. llvm-svn: 215277	2014-08-09 03:46:58 +00:00
Eric Christopher	4629ed75e4	Move some X86 subtarget configuration onto the subtarget that's being created. llvm-svn: 215271	2014-08-09 01:07:25 +00:00
Tom Stellard	c0503db9e2	R600/SI: Custom lower CONCAT_VECTORS This will lower them using register copies rather than loads and stores to the stack. llvm-svn: 215270	2014-08-09 01:06:56 +00:00
Tom Stellard	4f575f7aaf	R600/SI: Update concat_vectors.ll to check for scratch usage These tests were using SI-NOT: MOVREL to make sure concat vectors weren't being lowered to stack loads and stores, but we are using scratch buffers for the stack now instead of registers, so we need to add an additional SI-NOT check for scratch buffers. With this change I was able to uncover one broken test which will be fixed in a future commit. llvm-svn: 215269	2014-08-09 01:06:53 +00:00
Rafael Espindola	ffbabb7925	Fix expected windows result. llvm-svn: 215267	2014-08-09 00:37:05 +00:00
Eric Christopher	afd122fc87	Fix typo. llvm-svn: 215266	2014-08-09 00:26:27 +00:00
Lang Hames	25d93099dd	[MCJIT] Simplify immediate decoding code in the RuntimeDyldMachO hierarchy. Cleanup only: no functional change. This patch makes RuntimeDyldMachO targets directly responsible for decoding immediates, rather than letting them implement catch a callback from generic code. Since this is a very target specific operation, it makes sense to let the target-specific code drive it. llvm-svn: 215255	2014-08-08 23:12:22 +00:00
Rui Ueyama	4c956fe129	[FastISel][X86] Silence -Wenum-compare warning llvm-svn: 215253	2014-08-08 22:47:49 +00:00
Rafael Espindola	674ef1d7d3	Fix the windows build. Sorry for the noise. llvm-svn: 215249	2014-08-08 22:09:31 +00:00
Eric Christopher	9721bd029a	Reword comment slightly. llvm-svn: 215248	2014-08-08 22:09:00 +00:00
Rafael Espindola	7e774c249f	Remove dead code. Fixes pr20544. llvm-svn: 215243	2014-08-08 21:35:52 +00:00
Rafael Espindola	d649b9d4af	Convert from Windows to Unix paths in sys::path::native. Part of pr20544. Test to follow in a second. llvm-svn: 215241	2014-08-08 21:29:34 +00:00
Joerg Sonnenberger	eb9d13fcd1	Allow large immediates for branch instructions in 32bit mode. llvm-svn: 215240	2014-08-08 20:57:58 +00:00
Joerg Sonnenberger	7ee0f31a8b	Provide an implementation of getNoopForMachoTarget for PPC, otherwise empty functions will assert in the MC object writer. llvm-svn: 215238	2014-08-08 19:13:23 +00:00
Juergen Ributzka	793f28d274	[FastISel][X86] Fix INC/DEC optimization (r215230) I accidentally also used INC/DEC for unsigned arithmetic which doesn't work, because INC/DEC don't set the required flag which is used for the overflow check. llvm-svn: 215237	2014-08-08 18:47:04 +00:00
Tim Northover	e42fac5191	AArch64: avoid deleting the current iterator in a loop. std::map invalidates the iterator to any element that gets deleted, which means we can't increment it correctly afterwards. This was causing Darwin test failures. llvm-svn: 215233	2014-08-08 17:31:52 +00:00
Juergen Ributzka	241fd486eb	[FastISel][AArch64] Attach MachineMemOperands to load and store instructions. llvm-svn: 215231	2014-08-08 17:24:10 +00:00
Juergen Ributzka	4022614899	[FastISel][X86] Use INC/DEC when possible for {sadd\|ssub}.with.overflow intrinsics. This is a small peephole optimization to emit INC/DEC when possible. Fixes <rdar://problem/17952308>. llvm-svn: 215230	2014-08-08 17:21:37 +00:00
David Blaikie	bd56fbb976	DebugInfo: Recommit (reverted in r215217, originally committed in r215157) the assertion that no argument variable is overwritten by subsequent argument variables. This turned up a bug in clang where arguments were emitted with duplicate argument numbers (see r215227). llvm-svn: 215228	2014-08-08 17:12:35 +00:00
NAKAMURA Takumi	08e30fd3d2	AArch64A57FPLoadBalancing.cpp: Define ColorNames in !NDEBUG. llvm-svn: 215226	2014-08-08 17:00:59 +00:00
NAKAMURA Takumi	40df32f3d9	DataTypes.h.cmake: Define PRIx32 &c for !HAVE_INTTYPES_H hosts. I supposed PRIx32 might be unused in the tree. llvm-svn: 215225	2014-08-08 17:00:47 +00:00
Rafael Espindola	8f7d5f29f8	Delete dead code. NFC. llvm-svn: 215224	2014-08-08 16:49:35 +00:00
Pedro Artigas	caa565887d	Added a TLI hook to signal that the target does not have or does not care about floating point exceptions, added use of flag to fold potentially exception raising floating point math in selection DAG. No functionality change, as targets have to explicitly ask for this behavior and none does today. llvm-svn: 215222	2014-08-08 16:46:53 +00:00
Joerg Sonnenberger	eb8655afd3	Add low-level option for avoiding float stores from va_start until soft-float is properly supported. llvm-svn: 215221	2014-08-08 16:46:10 +00:00
Joerg Sonnenberger	0013b9292d	Add support for SPE load/store from memory. llvm-svn: 215220	2014-08-08 16:43:49 +00:00
Rafael Espindola	676223170f	getLoadName is only implemented for ELF, make it ELF only. llvm-svn: 215219	2014-08-08 16:39:22 +00:00
Rafael Espindola	72318b47fc	Use a simpler predicate. NFC. llvm-svn: 215218	2014-08-08 16:30:17 +00:00
David Blaikie	2b07c88668	DebugInfo: Remove assertion (added in r215157) that's firing on a blocks test in the test-suite while I investigate further. llvm-svn: 215217	2014-08-08 16:21:50 +00:00
Rafael Espindola	40f5446d84	pr20589: Fix duplicated arch flag. llvm-svn: 215216	2014-08-08 16:18:29 +00:00
Rafael Espindola	9ccd52f5b5	pr20588: add missing calls to va_end. llvm-svn: 215212	2014-08-08 15:57:37 +00:00
Daniel Sanders	feb613028b	[mips] Invert the abicalls feature bit to be noabicalls so that it's possible for -mno-abicalls to take effect. Also added the testcase that should have been in r215194. This behaviour has surprised me a few times now. The problem is that the generated MipsSubtarget::ParseSubtargetFeatures() contains code like this: if ((Bits & Mips::FeatureABICalls) != 0) IsABICalls = true; so '-abicalls' means 'leave it at the default' and '+abicalls' means 'set it to true'. In this case, (and the similar -modd-spreg case) I'd like the code to be IsABICalls = (Bits & Mips::FeatureABICalls) != 0; or possibly: if ((Bits & Mips::FeatureABICalls) != 0) IsABICalls = true; else IsABICalls = false; and preferably arrange for 'Bits & Mips::FeatureABICalls' to be true by default (on some triples). llvm-svn: 215211	2014-08-08 15:47:17 +00:00
Josh Klontz	ac0d28dfe6	Add missing Interpreter intrinsic lowering for sin, cos and ceil llvm-svn: 215209	2014-08-08 15:00:12 +00:00
Josh Klontz	da295e921b	Fix for #20408 - CMake LLVM_ENABLE_FFI=ON build fails on reconfigure llvm-svn: 215207	2014-08-08 14:32:56 +00:00
Jiangning Liu	dcc651f99f	[AArch64] Fix a type conversion bug for anlyzing compare. The bug can cause spec2006/483.xalancbmk failure. Patched by David Xu. llvm-svn: 215206	2014-08-08 14:19:29 +00:00
Rafael Espindola	a97373f235	Fix bug 20125 - clang-format segfaults on bad config. The problem was in unchecked dyn_cast inside of Input::createHNodes. Patch by Roman Kashitsyn! llvm-svn: 215205	2014-08-08 13:58:00 +00:00
Daniel Sanders	c30f30fe8a	[mips] Remove reason for XFAIL from a test that isn't actually XFAILed. llvm-svn: 215201	2014-08-08 12:58:17 +00:00
James Molloy	65b08f5e46	[LoopVectorizer] Enable support for floating-point subtraction reductions llvm-svn: 215200	2014-08-08 12:41:08 +00:00
James Molloy	3feea9c11a	[AArch64] Add an FP load balancing pass for Cortex-A57 For best-case performance on Cortex-A57, we should try to use a balanced mix of odd and even D-registers when performing a critical sequence of independent, non-quadword FP/ASIMD floating-point multiply or multiply-accumulate operations. This pass attempts to detect situations where the register allocation may adversely affect this load balancing and to change the registers used so as to better utilize the CPU. Ideally we'd just take each multiply or multiply-accumulate in turn and allocate it alternating even or odd registers. However, multiply-accumulates are most efficiently performed in the same functional unit as their accumulation operand. Therefore this pass tries to find maximal sequences ("Chains") of multiply-accumulates linked via their accumulation operand, and assign them all the same "color" (oddness/evenness). This optimization affects S-register and D-register floating point multiplies and FMADD/FMAs, as well as vector (floating point only) muls and FMADD/FMA. Q register instructions (and 128-bit vector instructions) are not affected. llvm-svn: 215199	2014-08-08 12:33:21 +00:00
Tim Northover	06af260b85	llvm-objdump: add missing % in format specifier. llvm-svn: 215198	2014-08-08 12:08:51 +00:00
Tim Northover	b911bf84dc	llvm-objdump: use portable format specifiers for info. ARM bots (& others, I think, now that I look) were failing because we were using incorrect printf-style format specifiers. They were wrong on almost any platform, actually, just mostly harmlessly so. llvm-svn: 215196	2014-08-08 12:00:09 +00:00
Daniel Sanders	35837ac9a9	[mips] Initial implementation of -mabicalls/-mno-abicalls. This patch implements the main rules for -mno-abicalls such as reserving $gp, and emitting the correct .option directive. Patch by Matheus Almeida and Toma Tabacu Differential Revision: http://reviews.llvm.org/D4231 llvm-svn: 215194	2014-08-08 10:01:29 +00:00
Tim Northover	0f18ff9817	AArch64: stop trying to take control of all UnknownArch triples. This short-circuited our error reporting for incorrectly specified target triples (you'd get AArch64 code instead). Should fix PR20567. llvm-svn: 215191	2014-08-08 08:27:44 +00:00
Patrik Hagglund	b0e86ec814	[pr19635] Revert most of r170537, and add new testcase. Patch provided by Andrey Kuharev. Sorry, r170537 was obviously wrong. llvm-svn: 215190	2014-08-08 08:21:19 +00:00
David Majnemer	fe8c7540b0	GlobalOpt: Optimize in the face of insertvalue/extractvalue GlobalOpt didn't know how to simulate InsertValueInst or ExtractValueInst. Optimizing these is pretty straightforward. N.B. This came up when looking at clang's IRGen for MS ABI member pointers; they are represented as aggregates. llvm-svn: 215184	2014-08-08 05:50:43 +00:00
NAKAMURA Takumi	15ac9af4f4	Fix llvm/test/DebugInfo/X86/recursive_inlining.ll to use %llc_dwarf. llvm-svn: 215181	2014-08-08 02:24:05 +00:00
NAKAMURA Takumi	40da267976	AArch64InstrInfo.cpp: Fix \param(s). [-Wdocumentation] llvm-svn: 215180	2014-08-08 02:04:18 +00:00
Anton Yartsev	671dff1e49	[tablegen] - Eliminate memory leaks in TGParser.cpp Ugly solution indicating that a refactoring is necessary to get the ownership under control. llvm-svn: 215176	2014-08-08 00:29:54 +00:00
Adam Nemet	7d498629f1	[AVX512] Add zero-masking variant to AVX512_masking multiclass This completes one item from the todo-list of r215125 "Generate masking instruction variants with tablegen". The AddedComplexity is needed just like for the k variant. Added a codegen test based on valignq. llvm-svn: 215173	2014-08-07 23:53:38 +00:00
Gerolf Hoflehner	ea96a3d336	Fix for multi-line comment warning llvm-svn: 215169	2014-08-07 23:19:55 +00:00
Adam Nemet	fa1f7201fc	[AVX512] Add codegen test for the masking variant of valign The AddedComplexity is needed just like in avx512_perm_3src. There may be a bug in the complexity computation... llvm-svn: 215168	2014-08-07 23:18:18 +00:00
Akira Hatanaka	5acc58fcfb	[stack protector] Look through bitcasts to get global variable __stack_chk_guard. Handle the case where the pointer operand of the load instruction that loads the stack guard is not a global variable but instead a bitcast. %StackGuard = load i8 bitcast (i64 @__stack_chk_guard to i8*) call void @llvm.stackprotector(i8 %StackGuard, i8** %StackGuardSlot) Original test case provided by Ana Pazos. This fixes PR20558. llvm-svn: 215167	2014-08-07 23:08:24 +00:00
Adrian Prantl	80c8b2742f	Make these regexes stricter by disallowing any additional characters in the output. Thanks to dblaikie for pointing this out! llvm-svn: 215166	2014-08-07 23:04:07 +00:00
Arnold Schwaighofer	4fb3c47456	SLPVectorizer: Use the type of the value loaded/stored to get the ABI alignment We were using the pointer type which is incorrect. llvm-svn: 215162	2014-08-07 22:47:27 +00:00
Adrian Prantl	03c67849d1	Add a separate testcase for a DWARF expression describing a value in a subregister. llvm-svn: 215161	2014-08-07 22:44:34 +00:00
Adrian Prantl	26e66b155f	Reflow this comment. llvm-svn: 215160	2014-08-07 22:44:24 +00:00
David Blaikie	09fdfabdda	DebugInfo: Fix overwriting/loss of inlined arguments to recursively inlined functions. Due to an unnecessary special case, inlined arguments that happened to be from the same function as they were inlined into were misclassified as non-inline arguments and would overwrite the non-inlined arguments. Assert that we never overwrite a function's arguments, and stop misclassifying inlined arguments as non-inline arguments to fix this issue. Excuse the rather crappy test case - handcrafted IR might do better, or someone who understands better how to tickle the inliner to create a recursive inlining situation like this (though it may also be necessary to tickle the variable in a particular way to cause it to be recorded in the MMI side table and go down this particular path for location information). llvm-svn: 215157	2014-08-07 22:22:49 +00:00
Reed Kotler	87048a4c9e	fix materialization of one bit constants and global values which are accessed through a base GOT entry. Summary: get tip of tree mips fast-isel to pass test-suite Two bugs were fixed: 1) one bit booleans were treated as 1 bit signed integers and so the literal '1' could become sign extended. 2) mips uses got for pic but in certain cases, as with string constants for example, many items can be referenced from the same got entry and this case was not handled properly. Test Plan: test-suite Reviewers: dsanders Reviewed By: dsanders Subscribers: mcrosier Differential Revision: http://reviews.llvm.org/D4801 llvm-svn: 215155	2014-08-07 22:09:01 +00:00
Eric Christopher	b9fd9ed37e	Temporarily Revert "Nuke the old JIT." as it's not quite ready to be deleted. This will be reapplied as soon as possible and before the 3.6 branch date at any rate. Approved by Jim Grosbach, Lang Hames, Rafael Espindola. This reverts commits r215111, 215115, 215116, 215117, 215136. llvm-svn: 215154	2014-08-07 22:02:54 +00:00
Gerolf Hoflehner	b5220dc779	Debugging Utility - optional ability for dumping critical path length llvm-svn: 215153	2014-08-07 21:49:44 +00:00
Gerolf Hoflehner	97c383bc36	MachineCombiner Pass for selecting faster instruction sequence on AArch64 Re-commit of r214832,r21469 with a work-around that avoids the previous problem with gcc build compilers The work-around is to use SmallVector instead of ArrayRef of basic blocks in preservesResourceLen()/MachineCombiner.cpp llvm-svn: 215151	2014-08-07 21:40:58 +00:00
Kevin Enderby	ae2a9a236f	Add two missing ARM cpusubtypes to the switch statement in MachOObjectFile::getArch(uint32_t CPUType, uint32_t CPUSubType) . Upcoming changes will cause existing test cases to use this but I wanted to check in this obvious change separately. llvm-svn: 215150	2014-08-07 21:30:25 +00:00
Owen Anderson	6c19ab1b5d	Fix a case in SROA where lifetime intrinsics could inhibit alloca promotion. In this case, the code path dealing with vector promotion was missing the explicit checks for lifetime intrinsics that were present on the corresponding integer promotion path. llvm-svn: 215148	2014-08-07 21:07:35 +00:00
Lang Hames	4ea28e2294	[MCJIT] Replace a c-style cast with reinterpret_cast + static_cast. C-style casts (and reinterpret_casts) result in implementation defined values when a pointer is cast to a larger integer type. On some platforms this was leading to bogus address computations in RuntimeDyldMachOAArch64. This should fix http://llvm.org/PR20501. llvm-svn: 215143	2014-08-07 20:41:57 +00:00
Richard Smith	56579b6324	Remove Support/IncludeFile.h and its only user. This is actively harmful, since it breaks the modules builds (where CallGraph.h can be quite reasonably transitively included by an unimported portion of a module, and CallGraph.cpp not linked in), and appears to have been entirely redundant since PR780 was fixed back in 2008. If this breaks anything, please revert; I have only tested this with a single configuration, and it's possible that this is still somehow fixing something (though I doubt it, since no other similar file uses this mechanism any more). llvm-svn: 215142	2014-08-07 20:41:17 +00:00
Rafael Espindola	cc847b6376	Fix test failure on ARM. llvm-svn: 215140	2014-08-07 20:33:06 +00:00
Richard Smith	c54302a1d0	[modules] Update module map workaround to cope with the problematic file having been relocated. llvm-svn: 215139	2014-08-07 20:27:08 +00:00
Frederic Riss	e6bb1871eb	test commit: remove trailing whitespace. llvm-svn: 215138	2014-08-07 20:04:00 +00:00
Rafael Espindola	e20470d399	Remove a few XFAILs. These tests now pass with MCJIT. llvm-svn: 215136	2014-08-07 19:35:22 +00:00
Akira Hatanaka	bbd33f6766	[Branch probability] Recompute branch weights of tail-merged basic blocks. BranchFolderPass was not correctly setting the basic block branch weights when tail-merging created or merged blocks. This patch recomutes the weights of tail-merged blocks using the following formula: branch_weight(merged block to successor j) = sum(block_frequency(bb) * branch_probability(bb -> j)) bb is a block that is in the set of merged blocks. <rdar://problem/16256423> llvm-svn: 215135	2014-08-07 19:30:13 +00:00
Joerg Sonnenberger	54c340b76a	Add the majority of the remaining SPE instructions. llvm-svn: 215131	2014-08-07 18:52:39 +00:00
Justin Bogner	1b9f936f91	FileCheck: Add a flag to allow checking empty input Currently FileCheck errors out on empty input. This is usually the right thing to do, but makes testing things like "this command does not emit some error message" hard to test. This usually leads to people using "command 2>&1 \| count 0" instead, and then the bots that use guard malloc fail a few hours later. By adding a flag to FileCheck that allows empty inputs, we can make tests that consist entirely of "CHECK-NOT" lines feasible. llvm-svn: 215127	2014-08-07 18:40:37 +00:00
Joerg Sonnenberger	f74c74693c	Indent llvm-svn: 215126	2014-08-07 18:05:32 +00:00
Adam Nemet	2e2537f665	[AVX512] Generate masking instruction variants with tablegen After adding the masking variants to several instructions, I have decided to experiment with generating these from the non-masking/unconditional variant. This will hopefully reduce the amount repetition that we currently have in order to define an instruction with all its variants (for a reg/mem instruction this would be 6 instruction defs and 2 Pat<> for the intrinsic). The patch is the first cut that is currently only applied to valignd/q to make the patch small. A few notes on the approach: * In order to stitch together the dag for both the conditional and the unconditional patterns I pass the RHS of the set rather than the full pattern (set dest, RHS). * Rather than subclassing each instruction base class (e.g. AVX512AIi8), with a masking variant which wouldn't scale, I derived the masking instructions from a new base class AVX512 (this is just I<> with Requires<HasAVX512>). The instructions derive from this now, plus a new set of classes that add the format bits and everything else that instruction base class provided (i.e. AVX512AIi8 vs. AVX512AIi8Base). I hope we can go incrementally from here. I expect that: * We will need different variants of the masking class. One example is instructions requiring three vector sources. In this case we tie one of the source operands to dest rather than a new implicit source operand ($src0) * Add the zero-masking variant * Add more AVX512*Base classes as new uses are added I've looked at X86.td.expanded before and after to make sure that nothing got lost for valignd/q. llvm-svn: 215125	2014-08-07 17:53:55 +00:00
NAKAMURA Takumi	17ae2fca4c	llvm/test/tools/llvm-objdump: Reorganize target-dependent some tests. llvm-svn: 215122	2014-08-07 17:17:19 +00:00
Rafael Espindola	a3ddbc9d23	Fix the ocaml bindings. llvm-svn: 215117	2014-08-07 14:48:13 +00:00
Rafael Espindola	ea9c317000	fix configure+make build llvm-svn: 215116	2014-08-07 14:38:49 +00:00
Rafael Espindola	f8b27c41e8	Nuke the old JIT. I am sure we will be finding bits and pieces of dead code for years to come, but this is a good start. Thanks to Lang Hames for making MCJIT a good replacement! llvm-svn: 215111	2014-08-07 14:21:18 +00:00
Joerg Sonnenberger	84d35dfe96	Add mfasr and mtasr llvm-svn: 215110	2014-08-07 13:35:34 +00:00
Joerg Sonnenberger	853feaa808	Add mfrtcu and mfrtcl instructions llvm-svn: 215109	2014-08-07 13:16:58 +00:00
Joerg Sonnenberger	1837a7b4fa	Support mttbl and mttbu mnemonic llvm-svn: 215108	2014-08-07 13:06:23 +00:00
Joerg Sonnenberger	a3d4dc9eb4	Add RFID instruction. llvm-svn: 215105	2014-08-07 12:39:59 +00:00
Joerg Sonnenberger	83ef5c7753	Fix Itineray class of rfi llvm-svn: 215104	2014-08-07 12:35:16 +00:00
Joerg Sonnenberger	6ae087abc6	Spell e500 feature in lower case. llvm-svn: 215103	2014-08-07 12:31:28 +00:00
Joerg Sonnenberger	39f095ae5a	Add first bunch of SPE instructions. As they overlap with Altivec, mark them as parser-only until the disassembler is extended to handle predicates properly. llvm-svn: 215102	2014-08-07 12:18:21 +00:00
Alexander Kornienko	7151ad7762	Insert parens to avoid a warning: suggest parentheses around arithmetic in operand of '^' [-Wparentheses] llvm-svn: 215101	2014-08-07 12:09:34 +00:00
Aaron Ballman	b677f7ac4b	Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)). No functional changes intended. llvm-svn: 215100	2014-08-07 12:07:33 +00:00
Daniel Sanders	449344315f	[mips] Add assembler support for .set msa/nomsa directive. Summary: These directives are used to toggle whether the assembler accepts MSA-specific instructions or not. Patch by Matheus Almeida and Toma Tabacu. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4783 llvm-svn: 215099	2014-08-07 12:03:36 +00:00
Pavel Chupin	124889243a	Fix lld-x86_64-win7 Build #11969 llvm-svn: 215097	2014-08-07 11:09:59 +00:00
Chandler Carruth	4e8fcbd3fd	[x86] Fix another miscompile found through fuzz testing the new vector shuffle lowering. This is closely related to the previous one. Here we failed to use the source offset when swapping in the other case -- where we end up swapping the final shuffle. The cause of this bug is a bit different: I simply wasn't thinking about the fact that this mask is actually a slice of a wide mask and thus has numbers that need SourceOffset applied. Simple fix. Would be even more simple with an algorithm-y thing to use here, but correctness first. =] llvm-svn: 215095	2014-08-07 10:37:35 +00:00
Chandler Carruth	e206385e99	[x86] Fix another miscompile in the new vector shuffle lowering found via the fuzz tester. Here I missed an offset when round-tripping a value through a shuffle mask. I got it right 2 lines below. See a problem? I do. ;] I'll probably be adding a little "swap" algorithm which accepts a range and two values and swaps those values where they occur in the range. Don't really have a name for it, let me know if you do. llvm-svn: 215094	2014-08-07 10:14:27 +00:00
Chandler Carruth	78494364d1	[x86] Fix another miscompile in the new vector shuffle lowering found through the new fuzzer. This one is great: bad operator precedence led the modulus to happen at the wrong point. All the asserts didn't fire because there were usually the right values past the end of the 4 element region we were looking at. Probably could have gotten a crash here with ASan + fuzzing, but the correctness tests pinpointed this really nicely. llvm-svn: 215092	2014-08-07 09:45:02 +00:00
Pavel Chupin	f55eb450e5	[x32] Use ebp/esp as frame and stack pointer Summary: Since pointers are 32-bit on x32 we can use ebp and esp as frame and stack pointer. Some operations like PUSH/POP and CFI_INSTRUCTION still require 64-bit register, so using 64-bit MachineFramePtr where required. X86_64 NaCl uses 64-bit frame/stack pointers, however it's been found that both isTarget64BitLP64 and isTarget64BitILP32 are true for NaCl. Addressing this issue here as well by making isTarget64BitLP64 false. Also mark hasReservedSpillSlot unreachable on X86. See inlined comments. Test Plan: Add one new simple test and upgrade 2 existing with x32 target case. Reviewers: nadav, dschuff Subscribers: llvm-commits, zinovy.nis Differential Revision: http://reviews.llvm.org/D4617 llvm-svn: 215091	2014-08-07 09:41:19 +00:00
Chandler Carruth	27046758de	[x86] Fix a miscompile in the new shuffle lowering found through the new fuzz testing. The function which tested for adjacency did what it said on the tin, but when I called it, I wanted it to do something more thorough: I wanted to know if the pairs of shuffle elements were adjacent and started at 0 mod 2. In one place I had the decency to try to test for this, but in the other it was completely skipped, miscompiling this test case. Fix this by making the helper actually do what I wanted it to do everywhere I called it (and removing the now redundant code in one place). I really dislike the name "canWidenShuffleElements" for this predicate. If anyone can come up with a better name, please let me know. The other name I thought about was "canWidenShuffleMask" but is it really widening the mask to reduce the number of lanes shuffled? I don't know. Naming things is hard. llvm-svn: 215089	2014-08-07 08:11:31 +00:00
Pete Cooper	9b90dc7c6a	Update Tablegen documents given that binary literals are now sized llvm-svn: 215088	2014-08-07 05:47:13 +00:00
Pete Cooper	94891ddf0d	Update BitRecTy::convertValue to allow if expressions with bit values on both sides of the if llvm-svn: 215087	2014-08-07 05:47:10 +00:00
Pete Cooper	0bf1ea72ee	Change the { } expression in tablegen to accept sized binary literals which are not just 0 and 1. It also allows nested { } expressions, as now that they are sized, we can merge pull bits from the nested value. In the current behaviour, everything in { } must have been convertible to a single bit. However, now that binary literals are sized, its useful to be able to initialize a range of bits. So, for example, its now possible to do bits<8> x = { 0, 1, { 0b1001 }, 0, 0b0 } llvm-svn: 215086	2014-08-07 05:47:07 +00:00
Pete Cooper	2cfdfe5882	Change BitsInit to inherit from TypedInit. This is useful in a later patch where binary literals such as 0b000 will become BitsInit values instead of IntInit values. llvm-svn: 215085	2014-08-07 05:47:04 +00:00
Pete Cooper	2597764ad9	Change TableGen so that binary literals such as 0b001 are now sized. Instead of these becoming an integer literal internally, they now become bits<n> values. Prior to this change, 0b001 was 1 bit long. This is confusing as clearly the user gave 3 bits. This new type holds both the literal value and the size, and so can ensure sizes match on initializers. For example, this used to be legal bits<1> x = 0b00; but now it must be written as bits<2> x = 0b00; llvm-svn: 215084	2014-08-07 05:47:00 +00:00
Pete Cooper	99ad2a3b67	TableGen: Change { } to only accept bits<n> entries when n == 1. Prior to this change, it was legal to do something like bits<2> opc = { 0, 1 }; bits<2> opc2 = { 1, 0 }; bits<2> a = { opc, opc2 }; This involved silently dropping bits from opc and opc2 which is very hard to debug. Now the above test would be an error. Having tested with an assert, none of LLVM/clang was relying on this behaviour. Thanks to Adam Nemet for the above test. llvm-svn: 215083	2014-08-07 05:46:57 +00:00
Pete Cooper	c18261d467	Fix a whole bunch of binary literals which were the wrong size. All were being silently zero extended to the correct width. The commit after this changes { } and 0bxx literals to be of type bits<n> and not int. This means we need to write exactly the right number of bits, and not rely on the values being silently zero extended for us. llvm-svn: 215082	2014-08-07 05:46:54 +00:00
Chandler Carruth	ae42d7d020	Add an option to the shuffle fuzzer that lets you fuzz exclusively within a single bit-width of vectors. This is particularly useful for when you know you have bugs in a certain area and want to find simpler test cases than those produced by an open-ended fuzzing that ends up legalizing the vector in addition to shuffling it. llvm-svn: 215056	2014-08-07 04:49:54 +00:00
Bill Wendling	fcb526020a	Use the minor number for the revision numbers. llvm-svn: 215055	2014-08-07 04:21:45 +00:00
Chandler Carruth	7f3facb7e6	Add a vector shuffle fuzzer. This is a python script which for a given seed generates a random sequence of random shuffles of a random vector width. It embeds this into a function and emits a main function which calls the test routine and checks that the results (where defined) match the obvious results. I'll be using this to drive out miscompiles from the new vector shuffle logic now that it is clean of any crashes I can find with llvm-stress. Note, my python skills are very poor. Sorry if this is terrible code, and feel free to tell me how I should write this or just patch it as necessary. The tests generated try to be very portable and use boring C routines. It technically will mis-declare the C routines and pass 32-bit integers to parametrs that expect 64-bit integers. If someone wants to fix this and has less terrible ideas of how to do it, I'm all ears. Fortunately, this "just works" for x86. =] llvm-svn: 215054	2014-08-07 04:13:51 +00:00
Justin Bogner	989cbc9058	DebugInfo: Make a test more portable mach-o doesn't like sections without segments, and elf is perfectly happy with commas in section names, so use a Darwin-like section name. Suggestion by Eric Christopher. llvm-svn: 215052	2014-08-07 03:47:28 +00:00
Saleem Abdulrasool	64a8cc7d0d	MC: split Win64EHUnwindEmitter into a shared streamer This changes Win64EHEmitter into a utility WinEH UnwindEmitter that can be shared across multiple architectures and a target specific bit which is overridden (Win64::UnwindEmitter). This enables sharing the section selection code across X86 and the intended use in ARM for emitting unwind information for Windows on ARM. llvm-svn: 215050	2014-08-07 02:59:41 +00:00
Quentin Colombet	0233d49574	[X86][SchedModel] Fixed missing/wrong scheduling model found by code inspection. Source: Agner Fog's Instruction tables. Related to <rdar://problem/15607571> llvm-svn: 215045	2014-08-07 00:20:44 +00:00
Kevin Enderby	c959562092	Add the -mcpu= option to llvm-objdump for use with the disassemblers. Also make the disassembler created with the Mach-O parser (the -m option) pick up the Target specific attributes specified with -mattr option. llvm-svn: 215032	2014-08-06 23:24:41 +00:00
Reid Kleckner	ce63b791fe	MC X86: Accept ".att_syntax prefix" and diagnose noprefix Fixes PR18916. I don't think we need to implement support for either hybrid syntax. Nobody should write Intel assembly with '%' prefixes on their registers or AT&T assembly without them. llvm-svn: 215031	2014-08-06 23:21:13 +00:00
David Blaikie	ff3dd1701c	Revert "Reapply "DebugInfo: Ensure that all debug location scope chains from instructions within a function, lead to the function itself."" This reverts commit r214761. Revert while Reid investigates & provides a reproduction for an assertion failure for this on Windows. llvm-svn: 214999	2014-08-06 22:30:12 +00:00
Sanjay Patel	b63e43c931	fix typo llvm-svn: 214995	2014-08-06 21:08:38 +00:00
Yaron Keren	f394f83f82	getNewMemBuffer memsets the buffer to zeros, the caller don't have to initialize it. llvm-svn: 214994	2014-08-06 20:59:09 +00:00
Sanjay Patel	cd47959eb6	Fix a test that has no checks. X86 doesn't have fneg, so check for xor. Differential Revision: http://reviews.llvm.org/D4812 llvm-svn: 214992	2014-08-06 20:45:30 +00:00
Matt Arsenault	a6dc6c281c	R600: Cleanup fadd and fsub tests llvm-svn: 214991	2014-08-06 20:27:55 +00:00
Rui Ueyama	c487f7728e	Revert "r214897 - Remove dead zero store to calloc initialized memory" It broke msan. llvm-svn: 214989	2014-08-06 19:30:38 +00:00
Eric Christopher	b5217507c7	Remove the target machine from CCState. Previously it was only used to get the subtarget and that's accessible from the MachineFunction now. This helps clear the way for smaller changes where we getting a subtarget will require passing in a MachineFunction/Function as well. llvm-svn: 214988	2014-08-06 18:45:26 +00:00
Adrian Prantl	364d13170a	Improve performance of calculateDbgValueHistory. In r210492 the logic of calculateDbgValueHistory was changed to end register variable live ranges at the end of MBB conditionally on the fact that the register was or not clobbered by the function body. This requires an initial scan of all the operands of the function to collect all clobbered registers. In a second pass over all instructions, we compare this set with the set of clobbered registers for the current MachineInstruction. This modification incurred a compilation time regression on some benchmarks: the debug info emission phase takes ~10% more time. While a small performance hit is unavoidable due to the initial scan requirement, we can improve the situation by avoiding to create too many temporary sets and just use lambdas to work directly on the result of the initial scan. Fixes <rdar://problem/17884104> Patch by Frederic Riss! llvm-svn: 214987	2014-08-06 18:41:24 +00:00
Adrian Prantl	e2d637597c	Cleanup collectChangingRegs The handling of the epilogue is best expressed as an early exit and there is no reason to look for register defs in DbgValue MIs. Patch by Frederic Riss! llvm-svn: 214986	2014-08-06 18:41:19 +00:00
David Blaikie	6e9477af8c	DebugInfo: Fix ranges+gmlt test case to actually exercise the gmlt situation. Originally this test case tested the specified behavior (that -gmlt would not produce DW_AT_ranges and that when no CU DW_AT_ranges were produced, no debug_ranges section (not even an empty list) would be produced) but then the ranges emission code was improved not to create ranges of a single element (instead favoring high_pc/low_pc) and so this test case no longer exercised the -gmlt portion of the behavior. This caused me some confusion when reading the comments and trying to update this test case for future changes to -gmlt. I've made this test resilient to those changes (by using the {{DW_TAG\|NULL}} pattern to block the end of the attribute search at the end of the CU's attribute list without mandating that it must (or must not) be followed by another tag (the future changes to -gmlt should produce no subprograms in this CU)) Fix the test case to have two functions in distinct sections to force the use of DW_AT_ranges. llvm-svn: 214985	2014-08-06 18:24:19 +00:00
Reid Kleckner	2daa731bab	Add a triple to this test to get the right IR mangling llvm-svn: 214982	2014-08-06 18:09:15 +00:00
Reid Kleckner	61bac93faa	Don't count inreg params when mangling fastcall functions This is consistent with MSVC. llvm-svn: 214981	2014-08-06 18:09:04 +00:00
Reid Kleckner	e41d957028	Round up the size of byval arguments to MinAlign Otherwise we can end up with an argument frame size that is not a multiple of stack slot size, which is very awkward. This fixes PR20547, which was a bug in x86_64 Sys V vararg handling. However, it's much easier to test this with x86 callee-cleanup functions, which previously ended in "retl $6" instead of "retl $8". This does affect behavior of all backends, but it presumably fixes the same bug in all of them. llvm-svn: 214980	2014-08-06 17:57:23 +00:00
Duncan P. N. Exon Smith	04642a4972	UseListOrder: Use std::vector I initially used a `SmallVector<>` for `UseListOrder::Shuffle`, which was a silly choice. When I realized my error I quickly rolled a custom data structure. This commit simplifies it to a `std::vector<>`. Now that I've had a chance to measure performance, this data structure isn't part of a bottleneck, so the additional complexity is unnecessary. This is part of PR5680. llvm-svn: 214979	2014-08-06 17:36:08 +00:00
Chad Rosier	b481bdfec4	[AArch64] Add a few isTarget* API to AArch64 Subtarget. llvm-svn: 214977	2014-08-06 16:56:58 +00:00
Chad Rosier	5b78a642e4	Add test case omitted in r214974. llvm-svn: 214975	2014-08-06 16:06:41 +00:00
Chad Rosier	afe7c93c7f	[AArch64] Fix OS ABI flag for aarch64-linux-gnu target. For triple aarch64-linux-gnu we were incorrectly setting IRIX. For triple aarch64 we are correctly setting SYSV. Patch by Ana Pazos <apazos@codeaurora.org>. llvm-svn: 214974	2014-08-06 16:05:02 +00:00
Sanjay Patel	d26358e12d	use register iterators that include self to reduce code duplication in CriticalAntiDepBreaker This patch addresses 2 FIXME comments that I added to CriticalAntiDepBreaker while fixing PR20020. Initialize an MCSubRegIterator and an MCRegAliasIterator to include the self reg. Assuming that works as advertised, there should be functional difference with this patch, just less code. Also, remove the associated asserts - we're setting those values just before, so the asserts don't do anything meaningful. Differential Revision: http://reviews.llvm.org/D4566 llvm-svn: 214973	2014-08-06 15:58:15 +00:00
Robert Khasanov	3c30c4bdec	[AVX512] Added load/store instructions to Register2Memory opcode tables. Added lowering tests for load/store. Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com> llvm-svn: 214972	2014-08-06 15:40:34 +00:00
James Molloy	99917946da	[AArch64] Add a testcase for r214957. llvm-svn: 214965	2014-08-06 13:31:32 +00:00
James Molloy	568da0990e	Add a new option -run-slp-after-loop-vectorization. This swaps the order of the loop vectorizer and the SLP/BB vectorizers. It is disabled by default so we can do performance testing - ideally we want to change to having the loop vectorizer running first, and the SLP vectorizer using its leftovers instead of the other way around. llvm-svn: 214963	2014-08-06 12:56:19 +00:00
Tim Northover	2a417b96d4	ARM: do not generate BLX instructions on Cortex-M CPUs. Particularly on MachO, we were generating "blx _dest" instructions on M-class CPUs, which don't actually exist. They happen to get fixed up by the linker into valid "bl _dest" instructions (which is why such a massive issue has remained largely undetected), but we shouldn't rely on that. llvm-svn: 214959	2014-08-06 11:13:14 +00:00
Tim Northover	d4d294dd51	ARM-MachO: materialize callee address correctly on v4t. llvm-svn: 214958	2014-08-06 11:13:06 +00:00
James Molloy	f089ab70f4	[AArch64] Conditional selects are expensive on out-of-order cores. Specifically Cortex-A57. This probably applies to Cyclone too but I haven't enabled it for that as I can't test it. This gives ~4% improvement on SPEC 174.vpr, and ~1% in 471.omnetpp. llvm-svn: 214957	2014-08-06 10:42:18 +00:00
Chandler Carruth	c3927cd8c9	[x86] Fix two independent miscompiles in the process of getting the same test case to actually generate correct code. The primary miscompile fixed here is that we weren't correctly handling in-place elements in one half of a single-input v8i16 shuffle when moving a dword of elements from that half to the other half. Some times, we would clobber the in-place elements in forming the dword to move across halves. The fix to this involves forcibly marking the in-place inputs even when there is no need to gather them into a dword, and to much more carefully re-arrange the elements when grouping them into a dword to move across halves. With these two changes we would generate correct shuffles for the test case, but found another miscompile. There are also some random perturbations of the generated shuffle pattern in SSE2. It looks like a wash; more instructions in some cases fewer in others. The second miscompile would corrupt the results into nonsense. This is a buggy pattern in one of the added DAG combines. Mapping elements through a PSHUFD when pairing redundant half-shuffles is much harder than this code makes it out to be -- it requires reasoning about all of where the input is used in the PSHUFD, not just one part of where it is used. Plus, we can't combine a half shuffle into a PSHUFD but the code didn't guard against it. I think this was just a bad idea and I've just removed that aspect of the combine. No tests regress as a consequence so seems OK. llvm-svn: 214954	2014-08-06 10:16:36 +00:00
Chandler Carruth	8f23ba26d2	[x86] Switch to a formulation of a for loop that is much more obviously not corrupting the mask by mutating it more times than intended. No functionality changed (the results were non-overlapping so the old version "worked" but was non-obvious). llvm-svn: 214953	2014-08-06 10:16:33 +00:00
Adam Nemet	5ec912881f	[X86] Fixes commit r214890 to match the posted patch This was another fallout from my local rebase where something went wrong :( llvm-svn: 214951	2014-08-06 07:13:12 +00:00
Matt Arsenault	515c24b7e0	Correct comment llvm-svn: 214945	2014-08-06 00:44:25 +00:00
Peter Collingbourne	df240b252a	[dfsan] Try not to create too many additional basic blocks in functions which already have a large number of blocks. Works around a performance issue with the greedy register allocator. llvm-svn: 214944	2014-08-06 00:33:40 +00:00
Matt Arsenault	d5f4de27b6	R600: Increase nearby load scheduling threshold. This partially fixes weird looking load scheduling in memcpy test. The load clustering doesn't seem particularly smart, but this method seems to be partially deprecated so it might not be worth trying to fix. llvm-svn: 214943	2014-08-06 00:29:49 +00:00
Matt Arsenault	c10853f29f	R600/SI: Implement areLoadsFromSameBasePtr This currently has a noticable effect on the kernel argument loads. LDS and global loads are more problematic, I think because of how copies are currently inserted to ensure that the address is a VGPR. llvm-svn: 214942	2014-08-06 00:29:43 +00:00
Quentin Colombet	33ea1681ce	[X86][SchedModel] Fixed some wrong scheduling model found by code inspection. Source: Agner Fog's Instruction tables. Related to <rdar://problem/15607571> llvm-svn: 214940	2014-08-06 00:22:39 +00:00
David Blaikie	fb0412f039	DebugInfo: Assert that any CU for which debug_loc lists are emitted, has at least one range. This was coming in weird debug info that had variables (and hence debug_locs) but was in GMLT mode (because it was missing the 13th field of the compile_unit metadata) so no ranges were constructed. We should always have at least one range for any CU with a debug_loc in it - because the range should cover the debug_loc. The assertion just ensures that the "!= 1" range case inside the subsequent loop doesn't get entered for the case where there are no ranges at all, which should never reach here in the first place. llvm-svn: 214939	2014-08-06 00:21:25 +00:00
David Blaikie	cabf54a313	DebugInfo: Fix a bunch of tests that, owing to their compile_unit metadata not including a 13th field, had some subtle behavior. Without the 13th field, the "emission kind" field defaults to 0 (which is not equal to either of the values of the emission kind enum (1 == full debug info, 2 == line tables only)). In this particular instance, the comparison with "FullDebugInfo" was done when adding elements to the ranges list - so for these test cases no values were added to the ranges list. This got weirder when emitting debug_loc entries as the addresses should be relative to the range of the CU if the CU has only one range (the reasonable assumption is that if we're emitting debug_loc lists for a CU that CU has at least one range - but due to the above situation, it has zero) so the ranges were emitted relative to the start of the section rather than relative to the start of the CU's singular range. Fix these tests by accounting for the difference in the description of debug_loc entries (in some cases making the test ignorant to these differences, in others adding the extra label difference expression, etc) or the presence/absence of high/low_pc on the CU, and add the 13th field to their CUs to enable proper "full debug info" emission here. In a future commit I'll fix up a bunch of other test cases that are not so rigorously depending on this behavior, but still doing similarly weird things due to the missing 13th field. llvm-svn: 214937	2014-08-05 23:57:31 +00:00
Matt Arsenault	1070511847	R600/SI: Add definitions for ds_read2st64_ / ds_write2st64_ llvm-svn: 214936	2014-08-05 23:53:20 +00:00
JF Bastien	ac8b66b32c	Fix typos in comments and doc Committing http://reviews.llvm.org/D4798 for Robin Morisset (morisset@google.com) llvm-svn: 214934	2014-08-05 23:27:34 +00:00

1 2 3 4 5 ...

106628 Commits