llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	8a102c21e3	There is no portable std::abs overload for int64_t, use the llvm::abs64 which exists for this purpose. llvm-svn: 154199	2012-04-06 20:10:52 +00:00
Sean Callanan	e804b5b762	Fixed two leaks in the MC disassembler. The MC disassembler requires a MCSubtargetInfo and a MCInstrInfo to exist in order to initialize the instruction printer and disassembler; however, although the printer and disassembler keep references to these objects they do not own them. Previously, the MCSubtargetInfo and MCInstrInfo objects were just leaked. I have extended LLVMDisasmContext to own these objects and delete them when it is destroyed. llvm-svn: 154192	2012-04-06 18:21:09 +00:00
Jakob Stoklund Olesen	967b86a0a2	Allow negative immediates in ARM and Thumb2 compares. ARM and Thumb2 mode can use cmn instructions to compare against negative immediates. Thumb1 mode can't. llvm-svn: 154183	2012-04-06 17:45:04 +00:00
David Chisnall	c1c9cdab23	Reintroduce InlineCostAnalyzer::getInlineCost() variant with explicit callee parameter until we have a more sensible API for doing the same thing. Reviewed by Chandler. llvm-svn: 154180	2012-04-06 17:27:41 +00:00
Chandler Carruth	49da93396e	Sink the collection of return instructions until after all simplification has been performed. This is a bit less efficient (requires another ilist walk of the basic blocks) but shouldn't matter in practice. More importantly, it's just too much work to keep track of all the various ways the return instructions can be mutated while simplifying them. This fixes yet another crasher, reported by Daniel Dunbar. llvm-svn: 154179	2012-04-06 17:21:31 +00:00
Chandler Carruth	e547fefcb7	Tweak this test to ensure the inliner did indeed fire. Thanks to Richard Smith for pointing this out in review. llvm-svn: 154178	2012-04-06 17:21:28 +00:00
Duncan Sands	d12b18f820	Make GVN's propagateEquality non-recursive. No intended functionality change. The modifications are a lot more trivial than they appear to be in the diff! llvm-svn: 154174	2012-04-06 15:31:09 +00:00
Craig Topper	bdc9f071a4	Test case for PR12413 llvm-svn: 154172	2012-04-06 14:38:25 +00:00
Benjamin Kramer	3cacabfb04	Fix narrowing conversion. llvm-svn: 154171	2012-04-06 13:33:52 +00:00
Benjamin Kramer	15e21a159e	DenseMap: Perform the pod-like object optimization when the value type is POD-like, not the DenseMapInfo for it. Purge now unused template arguments. This has been broken since r91421. Patch by Lubos Lunak! llvm-svn: 154170	2012-04-06 10:43:44 +00:00
Craig Topper	447417c932	Allow 256-bit shuffles to be split if a 128-bit lane contains elements from a single source. This is a rewrite of the 256-bit shuffle splitting code based on similar code from legalize types. Fixes PR12413. llvm-svn: 154166	2012-04-06 07:45:23 +00:00
Craig Topper	4eb9616b24	Add the tests that were supposed to go with r153935 that I forgot to svn add. llvm-svn: 154165	2012-04-06 07:09:59 +00:00
Chandler Carruth	17e335888c	Actually finish this sentence in the comment the way I intended. Thanks Matt for pointing this out. llvm-svn: 154158	2012-04-06 01:19:38 +00:00
Chandler Carruth	e41f6f4189	Sink the return instruction collection until after we're done deleting dead code, including dead return instructions in some cases. Otherwise, we end up having a bogus pointer to a return instruction that blows up much further down the road. It turns out that this pattern is both simpler to code, easier to update in the face of enhancements to the inliner cleanup, and likely cheaper given that it won't add dead instructions to the list. Thanks to John Regehr's numerous test cases for teasing this out. llvm-svn: 154157	2012-04-06 01:11:52 +00:00
Jakob Stoklund Olesen	6a2e99a46a	Deduplicate ARM call-related instructions. We had special instructions for iOS because r9 is call-clobbered, but that is represented dynamically by the register mask operands now, so there is no need for the pseudo-instructions. llvm-svn: 154144	2012-04-06 00:04:58 +00:00
Jim Grosbach	d6a1a1dc2f	ARM: Don't form a t2LDRi8 or t2STRi8 with an offset of zero. The load/store optimizer splits LDRD/STRD into two instructions when the register pairing doesn't work out. For negative offsets in Thumb2, it uses t2STRi8 to do that. That's fine, except for the case when the offset is in the range [-4,-1]. In that case, we'll also form a second t2STRi8 with the original offset plus 4, resulting in a t2STRi8 with a non-negative offset, which ends up as if it were an STRT, which is completely bogus. Similarly for loads. No testcase, unfortunately, as any I've been able to construct is both large and extremely fragile. rdar://11193937 llvm-svn: 154141	2012-04-05 23:51:24 +00:00
Kaelyn Uhrain	cb5b585cca	Fix the build breakage introduced by r154131. The empty 1-argument operator delete is for the benefit of the destructor. A couple of spot checks of running yaml-bench under valgrind against a few of the files under test/YAMLParser did not reveal any leaks introduced by this change. llvm-svn: 154137	2012-04-05 23:06:17 +00:00
Kaelyn Uhrain	64aa24e13f	Really fix -Wnon-virtual-dtor warnings; gcc needs the dtors to be explicitly marked as virtual. llvm-svn: 154131	2012-04-05 22:11:12 +00:00
Bill Wendling	4f60125dd8	The internalize pass can be dangerous for LTO. Consider the following program: $ cat main.c void foo(void) { } int main(int argc, char *argv[]) { foo(); return 0; } $ cat bundle.c extern void foo(void); void bar(void) { foo(); } $ clang -o main main.c $ clang -o bundle.so bundle.c -bundle -bundle_loader ./main $ nm -m bundle.so 0000000000000f40 (__TEXT,__text) external _bar (undefined) external _foo (from executable) (undefined) external dyld_stub_binder (from libSystem) $ clang -o main main.c -O4 $ clang -o bundle.so bundle.c -bundle -bundle_loader ./main Undefined symbols for architecture x86_64: "_foo", referenced from: _bar in bundle-elQN6d.o ld: symbol(s) not found for architecture x86_64 clang: error: linker command failed with exit code 1 (use -v to see invocation) The linker was told that the 'foo' in 'main' was 'internal' and had no uses, so it was dead stripped. Another situation is something like: define void @foo() { ret void } define void @bar() { call asm volatile "call _foo" ... ret void } The only use of 'foo' is inside of an inline ASM call. Since we don't look inside those for uses of functions, we don't specify this as a "use." Get around this by not invoking the 'internalize' pass by default. This is an admitted hack for LTO correctness. <rdar://problem/11185386> llvm-svn: 154124	2012-04-05 21:26:44 +00:00
Jim Grosbach	930f2f66e7	ARM assembly aliases for add negative immediates using sub. 'add r2, #-1024' should just use 'sub r2, #1024' rather than erroring out. Thumb1 aliases for adding a negative immediate to the stack pointer, also. rdar://11192734 llvm-svn: 154123	2012-04-05 20:57:13 +00:00
Akira Hatanaka	43fb2b2cea	Reapply test case in 154038, this time with triple to prevent the backend from emitting gp_rel relocation. llvm-svn: 154122	2012-04-05 20:44:35 +00:00
Eric Christopher	aec8a82694	Patch to set is_stmt a little better for prologue lines in a function. This enables debuggers to see what are interesting lines for a breakpoint rather than any line that starts a function. rdar://9852092 llvm-svn: 154120	2012-04-05 20:39:05 +00:00
Jakob Stoklund Olesen	37492eac8c	Don't break the IV update in TLI::SimplifySetCC(). LSR always tries to make the ICmp in the loop latch use the incremented induction variable. This allows the induction variable to be kept in a single register. When the induction variable limit is equal to the stride, SimplifySetCC() would break LSR's hard work by transforming: (icmp (add iv, stride), stride) --> (cmp iv, 0) This forced us to use lea for the IC update, preventing the simpler incl+cmp. <rdar://problem/7643606> <rdar://problem/11184260> llvm-svn: 154119	2012-04-05 20:30:20 +00:00
Dan Gohman	cc64bbca81	Fix accidentally inverted logic from r152803, and make the testcase slightly less trivial. This fixes rdar://11171718. llvm-svn: 154118	2012-04-05 20:27:21 +00:00
Sylvestre Ledru	e8235fef31	Fix a problem in the target detection for Debian GNU/HURD llvm-svn: 154117	2012-04-05 19:34:15 +00:00
Sylvestre Ledru	4cf7dae516	Fix a problem in the target detection for Debian GNU/kFreeBSD llvm-svn: 154114	2012-04-05 18:53:09 +00:00
Owen Anderson	a6eebf6013	Treat f16 the same as f80/f128 for the purposes of generating constants during instruction selection. llvm-svn: 154113	2012-04-05 18:50:32 +00:00
Silviu Baranga	af3c79f0ac	Added support for unpredictable ADC/SBC instructions on ARM, and also fixed some corner cases involving the PC register as an operand for these instructions. llvm-svn: 154101	2012-04-05 16:19:29 +00:00
Silviu Baranga	d365397daa	Added support for handling unpredictable arithmetic instructions on ARM. llvm-svn: 154100	2012-04-05 16:13:15 +00:00
Hongbin Zheng	31d33b8318	BBVectorize: Add the const modifier to the VectorizeConfig because we won't modify it. llvm-svn: 154098	2012-04-05 16:07:49 +00:00
Hongbin Zheng	d6825173d3	Introduce the VectorizeConfig class, with which we can control the behavior of the BBVectorizePass without using command-line options. As pointed out by Hal, we can ask the TargetLoweringInfo for the architecture-specific VectorizeConfig to perform vectorizing with architecture-specific information. llvm-svn: 154096	2012-04-05 15:46:55 +00:00
James Molloy	1ea6473688	An oversight when applying the patches for r150956 and r150957 to a vanilla tree meant I forgot to svn add these testcases. Noticed while investigating PR12274! llvm-svn: 154090	2012-04-05 10:01:12 +00:00
Hongbin Zheng	6edbc39bd7	Add the function "vectorizeBasicBlock", which allows users to vectorize a BasicBlock in other passes, e.g. we can call vectorizeBasicBlock in the loop unroll pass right after the loop is unrolled. llvm-svn: 154089	2012-04-05 08:05:16 +00:00
Jim Grosbach	15c6884a4b	ARM assembly aliases for two-operand V[R]SHR instructions. rdar://11189467 llvm-svn: 154087	2012-04-05 07:23:53 +00:00
Argyrios Kyrtzidis	ef909265e8	In MemoryBuffer::getOpenFile() make sure that the buffer is null-terminated if the caller requested a null-terminated one. When mapping the file there could be a racing issue that resulted in the file being larger than the FileSize passed by the caller. We already have an assertion for this in MemoryBuffer::init() but have a runtime guarantee that the buffer will be null-terminated, so do a copy that adds a null-terminator. Protects against crash of rdar://11161822. llvm-svn: 154082	2012-04-05 04:23:56 +00:00
Jim Grosbach	3d00eecc53	ARM assembly parsing for 'msr' plain 'cpsr' operand. Plain 'cpsr' is an alias for 'cpsr_fc'. rdar://11153753 llvm-svn: 154080	2012-04-05 03:17:53 +00:00
Jakob Stoklund Olesen	f2390e8303	Pass the right sign to TLI->isLegalICmpImmediate. LSR can fold three addressing modes into its ICmpZero node: ICmpZero BaseReg + Offset => ICmp BaseReg, -Offset ICmpZero -1ScaleReg + Offset => ICmp ScaleReg, Offset ICmpZero BaseReg + -1ScaleReg => ICmp BaseReg, ScaleReg The first two cases are only used if TLI->isLegalICmpImmediate() likes the offset. Make sure the right Offset sign is passed to this method in the second case. The ARM version is not symmetric. <rdar://problem/11184260> llvm-svn: 154079	2012-04-05 03:10:56 +00:00
Bob Wilson	1864146ab7	Do not include multiple -arch options in CPPFLAGS. llvm-svn: 154070	2012-04-05 00:35:55 +00:00
Michael J. Spencer	b2d30b8699	Fix -Wnon-virtual-dtor warnings. llvm-svn: 154063	2012-04-04 22:34:55 +00:00
Akira Hatanaka	121342fcc2	Reapply 154038 without the failing test. llvm-svn: 154062	2012-04-04 22:16:36 +00:00
Owen Anderson	4743c6e159	Revert r154038. It was causing make check failures. llvm-svn: 154054	2012-04-04 21:18:58 +00:00
Pete Cooper	d7290700e6	REG_SEQUENCE expansion to COPY instructions wasn't taking account of sub register indices on the source registers. No simple test case llvm-svn: 154051	2012-04-04 21:03:25 +00:00
Benjamin Kramer	379018b2da	Fix a C++11 UDL conflict. Still not fixed in the standard ;) llvm-svn: 154044	2012-04-04 20:33:56 +00:00
Pete Cooper	8a3dc0ed8c	f16 FREM can now be legalized by promoting to f32 llvm-svn: 154039	2012-04-04 19:36:31 +00:00
Akira Hatanaka	9705c865d9	Fix LowerGlobalAddress to produce instructions with the correct relocation types for N32 ABI. Add new test case and update existing ones. llvm-svn: 154038	2012-04-04 19:02:38 +00:00
Akira Hatanaka	591ecdd7c1	Fix LowerJumpTable to produce instructions with the correct relocation types for N32 ABI. Test case will be updated after the patch that fixes TargetLowering::getPICJumpTableRelocBase is checked in. llvm-svn: 154036	2012-04-04 18:31:32 +00:00
Akira Hatanaka	b3a2b8c199	Fix LowerConstantPool to produce instructions with the correct relocation types for N32 ABI and update test case. llvm-svn: 154034	2012-04-04 18:26:12 +00:00
Jakob Stoklund Olesen	0a5b72f0e4	Implement ARMBaseInstrInfo::commuteInstruction() for MOVCCr. A MOVCCr instruction can be commuted by inverting the condition. This can help reduce register pressure and remove unnecessary copies in some cases. <rdar://problem/11182914> llvm-svn: 154033	2012-04-04 18:23:42 +00:00
Jakob Stoklund Olesen	92fd79a639	Remove spurious debug output. llvm-svn: 154032	2012-04-04 18:23:38 +00:00
Akira Hatanaka	aeff24e424	Fix LowerBlockAddress to produce instructions with the correct relocation types for N32 ABI and update test case. llvm-svn: 154031	2012-04-04 18:22:53 +00:00
Hongbin Zheng	e1fd20172b	Add testcase for r154007: when a function has the optsize attribute, the loop should be unrolled according to the value of OptSizeUnrollThreshold. llvm-svn: 154014	2012-04-04 13:24:40 +00:00
Rafael Espindola	ba0a6cabb8	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Hongbin Zheng	b21b865fe8	LoopUnrollPass: Use variable "Threshold" instead of "CurrentThreshold" when reducing unroll count, otherwise the reduced unroll count is not taking the "OptimizeForSize" attribute into account. llvm-svn: 154007	2012-04-04 11:44:08 +00:00
Benjamin Kramer	a1355d17ca	Move yaml::Stream's dtor out of line so it can see Scanner's dtor. llvm-svn: 154004	2012-04-04 08:53:34 +00:00
Benjamin Kramer	e43bde73aa	Implement DwarfLLVMRegPair::operator< without violating asymmetry. MSVC8 verifies this. llvm-svn: 154002	2012-04-04 08:24:08 +00:00
Craig Topper	34487838bf	Convert assert(false) followed by a return to llvm_unreachable llvm-svn: 153997	2012-04-04 04:55:46 +00:00
Craig Topper	4c7d995029	Remove default case from switch that was already covering all cases. llvm-svn: 153996	2012-04-04 04:42:42 +00:00
Pete Cooper	e7bff68a5e	Removed useless switch for default case when switch was covering all the enum values llvm-svn: 153984	2012-04-04 00:53:04 +00:00
Bob Wilson	3e66d73259	Fix the install location for the Embedded makefile target. svn r145378 inadvertently changed the destination for the Embedded target in the makefile. Add a "/Developer" suffix to DSTROOT to compensate. llvm-svn: 153980	2012-04-03 23:44:39 +00:00
Michael J. Spencer	afc0d6a36f	Sorry about that. MSVC seems to accept just about any random string you give it ;/ llvm-svn: 153979	2012-04-03 23:36:44 +00:00
Bob Wilson	9d12ffcd71	Remove dead code for installing libLTO when building llvmCore. llvm-svn: 153978	2012-04-03 23:13:26 +00:00
Michael J. Spencer	22120c47a7	Add YAML parser to Support. llvm-svn: 153977	2012-04-03 23:09:22 +00:00
Pete Cooper	9511ec86f9	Add VSELECT to LegalizeVectorTypes::ScalariseVectorResult. Previously it would crash if it encountered a 1 element VSELECT. Solution is slightly more complicated than just creating a SELECT as we have to mask or sign extend the vector condition if it had different boolean contents from the scalar condition. Fixes <rdar://problem/11178095> llvm-svn: 153976	2012-04-03 22:57:55 +00:00
Pete Cooper	b98934cf72	Removed one last bad continue statement meant to be removed in r153914. llvm-svn: 153975	2012-04-03 22:18:49 +00:00
Bob Wilson	8bbd98df00	When building llvmCore, pass the SDKROOT and -arch setting to configure. So far all of the configure tests have been run against the default SDK and architecture, regardless of what is actually being built. We've gotten lucky until now. <rdar://problem/11112479> llvm-svn: 153972	2012-04-03 21:50:26 +00:00
Bob Wilson	5512ec8bae	Remove a reference to the C backend. llvm-svn: 153971	2012-04-03 21:50:24 +00:00
Chad Rosier	2a02fe1bb2	Fix an issue in SimplifySetCC() specific to vector comparisons. When folding X == X we need to check getBooleanContents() to determine if the result is a vector of ones or a vector of negative ones. I tried creating a test case, but the problem seems to only be exposed on a much older version of clang (around r144500). rdar://10923049 llvm-svn: 153966	2012-04-03 20:11:24 +00:00
Anton Korobeynikov	d0b458d694	Set soname for FreeBSD as well. Patch by Bernard Cafarelli! llvm-svn: 153965	2012-04-03 19:48:31 +00:00
Eric Christopher	b81e2b403c	Fix thinko check for number of operands to be the one that actually might have more than 19 operands. Add a testcase to make sure I never screw that up again. Part of rdar://11026482 llvm-svn: 153961	2012-04-03 17:55:42 +00:00
Lang Hames	ffa52d2ae2	Matrix simplification in PBQP may push infinite costs onto register options. The colorability heuristic should count these as denied registers. No test case - this exposed a bug on an out-of-tree target. llvm-svn: 153958	2012-04-03 16:27:16 +00:00
Dylan Noblesmith	7a3973d3e0	ARMDisassembler: drop bogus dependency on ARMCodeGen And indirectly, a dependency on most of the core LLVM optimization libraries. llvm-svn: 153957	2012-04-03 15:48:14 +00:00
Dylan Noblesmith	6338485d59	Object: drop bogus VMCore dependency llvm-svn: 153956	2012-04-03 15:48:10 +00:00
Bill Wendling	e2cf674310	The speedup doesn't appear to have been from this, but was an anomaly of my testing machine. llvm-svn: 153951	2012-04-03 11:19:21 +00:00
Bill Wendling	dd91e73409	Reserve space for the eventual filling of the vector. This gives a small speedup. llvm-svn: 153949	2012-04-03 10:50:09 +00:00
Nadav Rotem	269703f983	Add an additional testcase which checks ops with multiple users. llvm-svn: 153939	2012-04-03 07:39:36 +00:00
Anton Korobeynikov	325e92668b	Make the PPCCompilationCallbackC function static, so there is no need to issue the call via the PLT when LLVM is built as a shared library. This mirrors the approach taken by the X86 backend. llvm-svn: 153938	2012-04-03 06:59:28 +00:00
Craig Topper	9c252ebe4c	Tidy up spacing in some tablegen outputs. llvm-svn: 153937	2012-04-03 06:52:47 +00:00
Craig Topper	7629d63bc4	Add support for AVX enhanced comparison predicates. Patch from Kay Tiong Khoo. llvm-svn: 153935	2012-04-03 05:20:24 +00:00
Bill Wendling	32867652c9	Reformatting. No functionality change. llvm-svn: 153928	2012-04-03 03:56:52 +00:00
Bill Wendling	7d350efddc	As Eric pointed out, even a Debug build should be equal. Leave the flag that can turn off comparisons though. llvm-svn: 153927	2012-04-03 03:27:43 +00:00
Akira Hatanaka	d19f025374	Revert r153924. Delete test/MC/Disassembler/Mips and lib/Target/Mips/Disassembler. llvm-svn: 153926	2012-04-03 03:01:13 +00:00
Akira Hatanaka	55059262aa	Revert r153924. There were buildbot failures. llvm-svn: 153925	2012-04-03 02:51:09 +00:00
Akira Hatanaka	e2498d014b	MIPS disassembler support. Patch by Vladimir Medic. llvm-svn: 153924	2012-04-03 02:20:58 +00:00
Andrew Trick	a890e3c69a	Cleanup set_union usage. The same thing but a bit cleaner now. llvm-svn: 153922	2012-04-03 01:35:52 +00:00
Andrew Trick	c544e7c0a7	Use std::set_union instead of nasty custom code. I just noticed Jakob's examples of the proper application of std::set... routines. llvm-svn: 153918	2012-04-03 00:47:23 +00:00
Eric Christopher	34164196af	Add a line number for the scope of the function (starting at the first brace) so that we get more accurate line number information about the declaration of a given function and the line where the function first starts. Part of rdar://11026482 llvm-svn: 153916	2012-04-03 00:43:49 +00:00
Pete Cooper	4f0dbb27d9	Fixes to r153903. Added missing explanation of behaviour when the VirtRegMap is NULL. Also changed it in this case to just avoid updating the map, but live ranges or intervals will still get updated and created llvm-svn: 153914	2012-04-03 00:28:46 +00:00
Bill Wendling	d70cde134b	Compare the .o files only for release builds. Add an option to bypass the comparison altogether. llvm-svn: 153909	2012-04-02 23:27:43 +00:00
Pete Cooper	3ca96f9950	Moved LiveRangeEdit.h so that it can be called from other parts of the backend, not just libCodeGen llvm-svn: 153906	2012-04-02 22:44:18 +00:00
Rafael Espindola	f76bff0504	Make dominatedBySlowTreeWalk private and assert cases handled by the caller. llvm-svn: 153905	2012-04-02 22:37:54 +00:00
Jakob Stoklund Olesen	291007b055	Allocate virtual registers in ascending order. This is just the fallback tie-breaker ordering, the main allocation order is still descending size. Patch by Shamil Kurmangaleev! llvm-svn: 153904	2012-04-02 22:30:39 +00:00
Pete Cooper	2bde2f42b1	Refactored the LiveRangeEdit interface so that MachineFunction, TargetInstrInfo, MachineRegisterInfo, LiveIntervals, and VirtRegMap are all passed into the constructor and stored as members instead of passed in to each method. llvm-svn: 153903	2012-04-02 22:22:53 +00:00
Bill Wendling	932b992888	Add an option to turn off the expensive GVN load PRE part of GVN. llvm-svn: 153902	2012-04-02 22:16:50 +00:00
Owen Anderson	98f2c0c384	Add predicates for checking whether targets have free FNEG and FABS operations, and prevent the DAGCombiner from turning them into bitwise operations if they do. llvm-svn: 153901	2012-04-02 22:10:29 +00:00
Lang Hames	aaafacd07e	During two-address lowering, rescheduling an instruction does not untie operands. Make TryInstructionTransform return false to reflect this. Fixes PR11861. llvm-svn: 153892	2012-04-02 19:58:43 +00:00
Rafael Espindola	2e5c58e77b	No need to run llvm-as. llvm-svn: 153890	2012-04-02 19:44:20 +00:00
Akira Hatanaka	b1f68f9696	Initial 64 bit direct object support. This patch allows llvm to recognize that a 64 bit object file is being produced and that the subsequently generated ELF header has the correct information. The test case checks for both big and little endian flavors. Patch by Jack Carter. llvm-svn: 153889	2012-04-02 19:25:22 +00:00
Hal Finkel	7591afa235	The binutils for the IBM BG/P are too old to support CFI. llvm-svn: 153886	2012-04-02 19:09:04 +00:00
Hal Finkel	f208af02a4	Add triple support for the IBM BG/P and BG/Q supercomputers. llvm-svn: 153882	2012-04-02 18:31:33 +00:00
Eric Christopher	ad9fe8955a	Turn on the accelerator tables for Darwin. llvm-svn: 153880	2012-04-02 17:58:52 +00:00
Stepan Dyatkovskiy	f62ffeca88	Fast fix for PR12343: http://llvm.org/bugs/show_bug.cgi?id=12343 We have no trivial way to split edges that come from an indirect branch. We can do it with some tricks, but that should be discussed further. It is also still dangerous because indirect branches are difficult to control. The fix forbids this case for unswitching. llvm-svn: 153879	2012-04-02 17:16:45 +00:00
Roman Divacky	b9663ccd6b	Implement the SVR4 byval alignment for aggregates. Fixing a FIXME. llvm-svn: 153876	2012-04-02 15:49:30 +00:00
Silviu Baranga	98144e9e1a	Second part for the 153874 one llvm-svn: 153875	2012-04-02 15:46:46 +00:00
Silviu Baranga	ac37acd31b	Added fix in TableGen instruction decoder generation. The decoder now breaks for every leaf node. llvm-svn: 153874	2012-04-02 15:20:39 +00:00
Rafael Espindola	ebe09ec137	Add missing 'd'. llvm-svn: 153872	2012-04-02 13:02:57 +00:00
Bill Wendling	71b19bbdc8	Hack the hack. If we have a situation where an ASM object is defined but isn't reflected in the LLVM IR (as a declare or something), then treat it like a data object. N.B. This isn't 100% correct. The ASM parser should supply more information so that we know what type of object it is, and what attributes it should have. llvm-svn: 153870	2012-04-02 10:01:21 +00:00
Benjamin Kramer	22d093e4f1	Emit the asm writer's mnemonic table with SequenceToOffsetTable. This way we can get AVX v-prefixed instructions tail merged with the normal insns. llvm-svn: 153869	2012-04-02 09:13:46 +00:00
Benjamin Kramer	1c0541b031	Move getOpcodeName from the various target InstPrinters into the superclass MCInstPrinter. All implementations used the same code. llvm-svn: 153866	2012-04-02 08:32:38 +00:00
Craig Topper	4de7373862	Reorder fields in MatchEntry and OperandMatchEntry to reduce padding. A bit tricky due to the target specific sizes for some of the fields so the ordering is only optimal for the targets in the tree. llvm-svn: 153865	2012-04-02 07:48:39 +00:00
Nadav Rotem	702f080767	Optimizing swizzles of complex shuffles may generate additional complex shuffles. Do not try to optimize swizzles of shuffles if the source shuffle has more than a single user, except when the source shuffle is also a swizzle. llvm-svn: 153864	2012-04-02 07:11:12 +00:00
Craig Topper	dab9e35ad0	Remove getInstructionName from MCInstPrinter implementations in favor of using the instruction name table from MCInstrInfo. Reduces static data in the InstPrinter implementations. llvm-svn: 153863	2012-04-02 07:01:04 +00:00
Eric Christopher	8e52bdce7b	Fix CXXFLAGS for huge_val.m4. Patch by Jeremy Huddleston! llvm-svn: 153862	2012-04-02 06:54:01 +00:00
Craig Topper	54bfde79db	Make MCInstrInfo available to the MCInstPrinter. This will be used to remove getInstructionName and the static data it contains since the same tables are already in MCInstrInfo. llvm-svn: 153860	2012-04-02 06:09:36 +00:00
Bill Wendling	3a0bcf06ef	It could come about that we parse the inline ASM before we get a potential definition for it. In that case, we want to wait for the potential definition before we create a symbol for it. llvm-svn: 153859	2012-04-02 03:33:31 +00:00
Craig Topper	7a2cea1814	Use SequenceToOffsetTable to generate instruction name table for AsmWriter. llvm-svn: 153857	2012-04-02 00:47:39 +00:00
Chandler Carruth	219173a1be	Start cleaning up the InlineCost class. This switches to sentinel values rather than a bitfield, a great suggestion by Chris during code review. There is still quite a bit of cruft in the interface, but that requires sorting out some awkward uses of the cost inside the actual inliner. No functionality change intended here. llvm-svn: 153853	2012-04-01 22:44:09 +00:00
Hal Finkel	3ecfa7b277	Fix some 80-col. violations I introduced with the A2 PPC64 core. llvm-svn: 153852	2012-04-01 21:20:14 +00:00
Hal Finkel	322e41a914	Enable prefetch generation on PPC64. llvm-svn: 153851	2012-04-01 20:08:17 +00:00
Hal Finkel	9032344c15	Add LdStSTD* itin. for the PPC64 A2 core. llvm-svn: 153850	2012-04-01 20:08:08 +00:00
Nadav Rotem	b078350872	This commit contains a few changes that had to go in together. 1. Simplify xor/and/or (bitcast(A), bitcast(B)) -> bitcast(op (A,B)) (and also scalar_to_vector). 2. Xor/and/or are indifferent to the swizzle operation (shuffle of one src). Simplify xor/and/or (shuff(A), shuff(B)) -> shuff(op (A, B)) 3. Optimize swizzles of shuffles: shuff(shuff(x, y), undef) -> shuff(x, y). 4. Fix an X86ISelLowering optimization which was very bitcast-sensitive. Code which was previously compiled to this: movd (%rsi), %xmm0 movdqa .LCPI0_0(%rip), %xmm2 pshufb %xmm2, %xmm0 movd (%rdi), %xmm1 pshufb %xmm2, %xmm1 pxor %xmm0, %xmm1 pshufb .LCPI0_1(%rip), %xmm1 movd %xmm1, (%rdi) ret Now compiles to this: movl (%rsi), %eax xorl %eax, (%rdi) ret llvm-svn: 153848	2012-04-01 19:31:22 +00:00
Lang Hames	652f21274f	Fix typo. llvm-svn: 153846	2012-04-01 19:27:25 +00:00
Hal Finkel	88ed4e3b15	Set the default PPC node scheduling preference to ILP (for the embedded cores). The 440 and A2 cores have detailed itineraries, and this allows them to be fully used to maximize throughput. llvm-svn: 153845	2012-04-01 19:23:08 +00:00
Hal Finkel	b9845f5758	Add ppc440 itin. entries for LdStSTD* llvm-svn: 153844	2012-04-01 19:23:04 +00:00
Hal Finkel	ec5a1e3669	Use full anti-dep. breaking with post-ra sched. on the embedded ppc cores. Post-RA scheduling gives a significant performance improvement on the embedded cores, so turn it on. Using full anti-dep. breaking is important for FP-intensive blocks, so turn it on (just on the embedded cores for now; this should also be good on the 970s because post-ra scheduling is all that we have for now, but that should have more testing first). llvm-svn: 153843	2012-04-01 19:22:57 +00:00
Hal Finkel	9f9f8929ee	Add instruction itinerary for the PPC64 A2 core. This adds a full itinerary for IBM's PPC64 A2 embedded core. These cores form the basis for the CPUs in the new IBM BG/Q supercomputer. llvm-svn: 153842	2012-04-01 19:22:40 +00:00
Craig Topper	91773ab2ca	Use SequenceToOffsetTable to create instruction name table. Saves space particularly on X86 where AVX instructions just add a 'v' to the front of other instructions. llvm-svn: 153841	2012-04-01 18:14:14 +00:00
Benjamin Kramer	12af4285d1	Emit the LLVM<->DWARF register mapping as a sorted table and use binary search to do the lookup. This also avoids emitting the information twice, which led to code bloat. On i386-linux-Release+Asserts with all targets built this change shaves a whopping 1.3 MB off clang. The number is probably exaggerated by recent inliner changes but the methods were already enormous with the old inline cost computation. The DWARF reg -> LLVM reg mapping doesn't seem to have holes in it, so it could be a simple lookup table. I didn't implement that optimization yet to avoid potentially changing functionality. There is still some duplication both in tablegen and the generated code that should be cleaned up eventually. llvm-svn: 153837	2012-04-01 14:23:58 +00:00
Chandler Carruth	45ae88f5fc	Belatedly address some code review from Chris. As a side note, I really dislike array_pod_sort... Do we really still care about any STL implementations that get this so wrong? Does libc++? llvm-svn: 153834	2012-04-01 10:41:24 +00:00
Chandler Carruth	cdb1f8cff1	Add some more testing to cover the remaining two cases where always-inlining is disabled: recursive functions and indirectbr. llvm-svn: 153833	2012-04-01 10:36:17 +00:00
Chandler Carruth	c5bfb3c0f5	Fix a pretty scary bug I introduced into the always inliner with a single missing character. Somehow, this had gone untested. I've added tests for returns-twice logic specifically with the always-inliner that would have caught this, and fixed the bug. Thanks to Matt for the careful review and spotting this!!! =D llvm-svn: 153832	2012-04-01 10:21:05 +00:00
Chandler Carruth	1989bb9c43	Replace four tiny tests with various uses of grep and not with a single test and FileCheck. llvm-svn: 153831	2012-04-01 10:11:17 +00:00
Andrew Trick	779b32a44e	misched: Add finalizeScheduler to complete the target interface. llvm-svn: 153827	2012-04-01 07:24:23 +00:00
Eli Bendersky	f5becf617f	Removing a file that's no longer being used after the recent refactorings llvm-svn: 153825	2012-04-01 06:50:01 +00:00
Hal Finkel	59607e63cb	Split the LdStGeneral PPC itin. class into LdStLoad and LdStStore. Loads and stores can have different pipeline behavior, especially on embedded chips. This change allows those differences to be expressed. Except for the 440 scheduler, there are no functionality changes. On the 440, the latency adjustment is only by one cycle, and so this probably does not affect much. Nevertheless, it will make a larger difference in the future and this removes a FIXME from the 440 itin. llvm-svn: 153821	2012-04-01 04:44:16 +00:00
Rafael Espindola	1eaae50734	Add a workaround for building with old versions of clang. llvm-svn: 153820	2012-03-31 21:54:20 +00:00
Rafael Espindola	77242fa79e	Add a triple to the test. llvm-svn: 153818	2012-03-31 18:59:07 +00:00
Rafael Espindola	80c540e656	Teach CodeGen's version of computeMaskedBits to understand the range metadata. This is the CodeGen equivalent of r153747. I tested that there is no noticeable performance difference with any combination of -O0/-O2/-g when compiling gcc as a single compilation unit. llvm-svn: 153817	2012-03-31 18:14:00 +00:00
Hal Finkel	51861b4855	Fix dynamic linking on PPC64. Dynamic linking on PPC64 has had problems since we had to move the top-down hazard-detection logic post-ra. For dynamic linking to work there needs to be a nop placed after every call. It turns out that it is really hard to guarantee that nothing will be placed in between the call (bl) and the nop during post-ra scheduling. Previous attempts at fixing this by placing logic inside the hazard detector only partially worked. This is now fixed in a different way: call+nop codegen-only instructions. As far as CodeGen is concerned the pair is now a single instruction and cannot be split. This solution works much better than previous attempts. The scoreboard hazard detector is also renamed to be more generic, there is currently no cpu-specific logic in it. llvm-svn: 153816	2012-03-31 14:45:15 +00:00
Chandler Carruth	1a4cc6cc9f	Fix a typo reported in IRC by someone reviewing this code. llvm-svn: 153815	2012-03-31 13:18:09 +00:00
Chandler Carruth	a88a0faaa3	Give the always-inliner its own custom filter. It shouldn't have to pay the very high overhead of the complex inline cost analysis when all it wants to do is detect three patterns which must not be inlined. Comment the code, clean it up, and leave some hints about possible performance improvements if this ever shows up on a profile. Moving this off of the (now more expensive) inline cost analysis is particularly important because we have to run this inliner even at -O0. llvm-svn: 153814	2012-03-31 13:17:18 +00:00
Chandler Carruth	edd2826f3e	Remove a bunch of empty, dead, and no-op methods from all of these interfaces. These methods were used in the old inline cost system where there was a persistent cache that had to be updated, invalidated, and cleared. We're now doing more direct computations that don't require this intricate dance. Even if we resume some level of caching, it would almost certainly have a simpler and more narrow interface than this. llvm-svn: 153813	2012-03-31 12:48:08 +00:00
Chandler Carruth	0539c071ea	Initial commit for the rewrite of the inline cost analysis to operate on a per-callsite walk of the called function's instructions, in breadth-first order over the potentially reachable set of basic blocks. This is a major shift in how inline cost analysis works to improve the accuracy and rationality of inlining decisions. A brief outline of the algorithm this moves to: - Build a simplification mapping based on the callsite arguments to the function arguments. - Push the entry block onto a worklist of potentially-live basic blocks. - Pop the first block off of the front of the worklist (for breadth-first ordering) and walk its instructions using a custom InstVisitor. - For each instruction's operands, re-map them based on the simplification mappings available for the given callsite. - Compute any simplification possible of the instruction after re-mapping, and store that back into the simplification mapping. - Compute any bonuses, costs, or other impacts of the instruction on the cost metric. - When the terminator is reached, replace any conditional value in the terminator with any simplifications from the mapping we have, and add any successors which are not proven to be dead from these simplifications to the worklist. - Pop the next block off of the front of the worklist, and repeat. - As soon as the cost of inlining exceeds the threshold for the callsite, stop analyzing the function in order to bound cost. The primary goal of this algorithm is to perfectly handle dead code paths. We do not want any code in trivially dead code paths to impact inlining decisions. The previous metric was extremely flawed here, and would always subtract the average cost of two successors of a conditional branch when it was proven to become an unconditional branch at the callsite.
There was no handling of wildly different costs between the two successors, which would cause inlining when the path actually taken was too large, and no inlining when the path actually taken was trivially simple. There was also no handling of the code path, only the immediate successors. These problems vanish completely now. See the added regression tests for the shiny new features -- we skip recursive function calls, SROA-killing instructions, and high cost complex CFG structures when dead at the callsite being analyzed. Switching to this algorithm required refactoring the inline cost interface to accept the actual threshold rather than simply returning a single cost. The resulting interface is pretty bad, and I'm planning to do lots of interface cleanup after this patch. Several other refactorings fell out of this, but I've tried to minimize them for this patch. =/ There is still more cleanup that can be done here. Please point out anything that you see in review. I've worked really hard to try to mirror at least the spirit of all of the previous heuristics in the new model. It's not clear that they are all correct any more, but I wanted to minimize the change in this single patch, it's already a bit ridiculous. One heuristic that is not yet mirrored is to allow inlining of functions with a dynamic alloca if the caller has a dynamic alloca. I will add this back, but I think the most reasonable way requires changes to the inliner itself rather than just the cost metric, and so I've deferred this for a subsequent patch. The test case is XFAIL-ed until then. As mentioned in the review mail, this seems to make Clang run about 1% to 2% faster in -O0, but makes its binary size grow by just under 4%. I've looked into the 4% growth, and it can be fixed, but requires changes to other parts of the inliner. llvm-svn: 153812	2012-03-31 12:42:41 +00:00
Chandler Carruth	056b460917	Add support to the InstVisitor for visiting a generic callsite. The visitor will now visit a CallInst and an InvokeInst with instruction-specific visitors, then visit a generic CallSite visitor, then delegate back to the Instruction visitor and the TerminatorInst visitors depending on whether a call or an invoke originally. This will be used in the soon-to-land inline cost rewrite. llvm-svn: 153811	2012-03-31 11:31:24 +00:00
Bill Wendling	62152e7389	Move trivial functions into the class definition. llvm-svn: 153810	2012-03-31 11:25:18 +00:00
Bill Wendling	4f87c4fdff	Trim headers. llvm-svn: 153809	2012-03-31 11:22:30 +00:00
Bill Wendling	0e1824cb9e	Indent according to LLVM's style guide. llvm-svn: 153808	2012-03-31 11:15:43 +00:00
Bill Wendling	dbc02d84ce	Cleanup whitespace and trim some of the #includes. llvm-svn: 153807	2012-03-31 11:10:35 +00:00
Benjamin Kramer	53dc873342	Internalize: Remove reference of @llvm.noinline, it was replaced with the noinline attribute a long time ago. llvm-svn: 153806	2012-03-31 11:03:47 +00:00
Bill Wendling	5c15044f47	These strings aren't 'const char *' but 'char *'. llvm-svn: 153805	2012-03-31 10:51:45 +00:00
Bill Wendling	39d942bf91	Cleanup whitespace. llvm-svn: 153804	2012-03-31 10:50:14 +00:00
Bill Wendling	534a6588f2	Free the codegen options when deleting LTO code generator object. llvm-svn: 153803	2012-03-31 10:49:43 +00:00
Bill Wendling	152e4739a2	Cleanup whitespace and remove unneeded 'extern' keyword on function definitions. llvm-svn: 153802	2012-03-31 10:44:20 +00:00
Chandler Carruth	6f202a7ced	Clean up the naming in this test. Someone pointed this out in review at one point, and I forgot to go back and clean it up. Sorry about that. =/ llvm-svn: 153801	2012-03-31 10:38:48 +00:00
Chandler Carruth	564b4ba704	FileCheck-ize this test, and generally tidy it up prior to changing things around. llvm-svn: 153799	2012-03-31 09:22:33 +00:00
Duncan Sands	26a80f3ddb	I noticed in passing that the Metadata getIfExists method was creating a new node and returning it if one didn't exist. llvm-svn: 153798	2012-03-31 08:20:11 +00:00
Hal Finkel	5cad8742cc	Correctly vectorize powi. The powi intrinsic requires special handling because it always takes a single integer power regardless of the result type. As a result, we can vectorize only if the powers are equal. Fixes PR12364. llvm-svn: 153797	2012-03-31 03:38:40 +00:00
Andrew Trick	cdefdf1f5b	comment typo llvm-svn: 153796	2012-03-31 02:39:17 +00:00
Akira Hatanaka	8f4e3a0088	Select static relocation model if it is jitting. llvm-svn: 153795	2012-03-31 02:38:36 +00:00
Andrew Trick	1a004ca084	Introduce Register Units: Give each leaf register a number. First small step toward modeling multi-register multi-pressure. In the future, register units can also be used to model liveness and aliasing. llvm-svn: 153794	2012-03-31 01:35:59 +00:00
Jakob Stoklund Olesen	d915503486	Add a 2 byte safety margin in offset computations. ARMConstantIslandPass still has bugs where jump table compression can cause constant pool entries to go out of range. Add a safety margin of 2 bytes when placing constant islands, but use the real max displacement for verification. <rdar://problem/11156595> llvm-svn: 153789	2012-03-31 00:06:44 +00:00
Jakob Stoklund Olesen	24bb3d59d7	Add more debugging output to ARMConstantIslandPass. llvm-svn: 153788	2012-03-31 00:06:42 +00:00
Bill Wendling	8f6c8a971a	* Set the scope attributes for the ASM symbol we added to be the value passed into the function. * Reorder some header files. llvm-svn: 153783	2012-03-30 23:26:06 +00:00
Benjamin Kramer	682de39f2d	Rip out emission of the regIsInRegClass function for the asm printer. It's slow, bloated and completely redundant with MCRegisterClass::contains. llvm-svn: 153782	2012-03-30 23:13:40 +00:00
Jim Grosbach	913cc3072d	ARM fix encoding fixup resolution for ldrd and friends. The 8-bit payload is not contiguous in the opcode. Move the upper nibble over 4 bits into the correct place. rdar://11158641 llvm-svn: 153780	2012-03-30 21:54:22 +00:00
Jakob Stoklund Olesen	892f48058b	Use SequenceToOffsetTable in emitRegisterNameString. This allows suffix sharing in register names. (AX is a suffix of EAX). llvm-svn: 153777	2012-03-30 21:12:52 +00:00
Jakob Stoklund Olesen	066aba5fe9	Reapply 153764 and 153761 with a fix. Use an explicit comparator instead of the default. The sets are sorted, but not using the default comparator. Hopefully, this will unbreak the Linux builders. llvm-svn: 153772	2012-03-30 20:24:14 +00:00
Rafael Espindola	fc06055173	Revert 153764 and 153761. They broke a --enable-optimized --enable-assertions --enable-expensive-checks build. llvm-svn: 153771	2012-03-30 20:09:06 +00:00
Jim Grosbach	fdaab531b7	ARM assembler should prefer non-aliases encoding of cmp. When an immediate is both a value [t2_]so_imm and a [t2_]so_imm_neg, we want to use the non-negated form to make sure we prefer the normal encoding, not the aliased encoding via the negation of, e.g., 'cmp.w'. llvm-svn: 153770	2012-03-30 19:59:02 +00:00
Jim Grosbach	daa04130ed	ARM encoding for VSWP got the second operand incorrect. Make the non-tied register operand names line up with what the base class encoding handler expects. rdar://11157236 llvm-svn: 153766	2012-03-30 18:53:01 +00:00
Jim Grosbach	74005ae691	ARM can only use narrow encoding for low regs. llvm-svn: 153765	2012-03-30 18:39:43 +00:00
Jakob Stoklund Olesen	e214c3df40	Compress SimpleValueType lists by sharing. Many register classes have the same value types. Share the table space. llvm-svn: 153764	2012-03-30 17:42:04 +00:00
Jakob Stoklund Olesen	569e116d35	Compress register lists by sharing suffixes. TableGen emits lists of sub-registers, super-registers, and overlaps. Put them all in a single table and use a SequenceToOffsetTable to share suffixes. llvm-svn: 153761	2012-03-30 17:25:43 +00:00
Jakob Stoklund Olesen	a234f2efbd	Add a SequenceToOffsetTable to TableGen. This is similar to the StringToOffsetTable we use to produce string tables, but it can be used for other sequences than strings, and it eliminates entries for suffixes. llvm-svn: 153760	2012-03-30 17:25:40 +00:00
Jim Grosbach	def5e34812	ARM integrated assembler should encoding choice for add/sub imm. For 'adds r2, r2, #56' outside of an IT block, the 16-bit encoding T2 can be used for this syntax. Prefer the narrow encoding when possible. rdar://11156277 llvm-svn: 153759	2012-03-30 17:20:40 +00:00
Rafael Espindola	a53c46aaa3	Handle unreachable code in the dominates functions. This changes users when needed for correctness, but still doesn't clean up code that now unnecessarily checks for reachability. llvm-svn: 153755	2012-03-30 16:46:21 +00:00
Danil Malyshev	70d22ccb22	Re-factored RuntimeDyLd: 1. The main work is now done in RuntimeDyLdImpl, which uses the ObjectFile class. RuntimeDyLdMachO and RuntimeDyLdELF now only parse relocations and resolve them. This makes it easier to improve RuntimeDyLd. In addition, support for COFF can be easily added. 2. Added ARM relocations to RuntimeDyLdELF. 3. Added support for stub functions for ARM, allowing long branches. 4. Added support for external functions that are not loaded from the object files but can be loaded from external libraries. Now MCJIT can correctly execute code containing printf, putc, etc. 5. Sections are emitted instead of functions, thanks Jim Grosbach. MemoryManager.startFunctionBody() and MemoryManager.endFunctionBody() have been removed. 6. MCJITMemoryManager.allocateDataSection() and MCJITMemoryManager.allocateCodeSection() use JMM->allocateSpace() instead of JMM->allocateCodeSection() and JMM->allocateDataSection(), because I got an error: "Cannot allocate an allocated block!" when an object file contains more than one code or data section. llvm-svn: 153754	2012-03-30 16:45:19 +00:00
Jim Grosbach	199ab90946	ARM assembly parsing needs to be paranoid about negative immediates. Make sure to treat immediates as unsigned when doing relative comparisons. rdar://11153621 llvm-svn: 153753	2012-03-30 16:31:31 +00:00
Rafael Espindola	53190539db	Add computeMaskedBitsLoad back, as it was the change to instsimplify that caused the slowdown last time. llvm-svn: 153747	2012-03-30 15:52:11 +00:00
Benjamin Kramer	88d31b3f0c	Add a note about a missed cmov -> sbb opportunity. llvm-svn: 153741	2012-03-30 13:02:58 +00:00
Bill Wendling	36cbf03b9b	Cleanup whitespace. Doxygenize comments. And indent to llvm coding standards. llvm-svn: 153740	2012-03-30 10:29:38 +00:00
James Molloy	fb5cd6085f	Ensure conditional BL instructions for ARM are given the fixup fixup_arm_condbranch. Patch by Tim Northover! llvm-svn: 153737	2012-03-30 09:15:32 +00:00
Evan Cheng	a40d40602c	ARM target should allow codegenprep to duplicate ret instructions to enable tailcall opt. rdar://11140249 llvm-svn: 153717	2012-03-30 01:24:39 +00:00
Bill Wendling	afe7ec7070	Testcase for r153710. llvm-svn: 153711	2012-03-30 00:26:54 +00:00
Bill Wendling	4f2a951275	Add testcase for r153705 llvm-svn: 153706	2012-03-30 00:05:02 +00:00
Bill Wendling	9f829f1cc4	If we have a VLA that has a "use" in a metadata node that's then used here but it has no other uses, then we have a problem. E.g., int foo (const int x) { char a[x]; return 0; } If we assign 'a' a vreg and fast isel later on has to use the selection DAG isel, it will want to copy the value to the vreg. However, there are no uses, which goes counter to what selection DAG isel expects. <rdar://problem/11134152> llvm-svn: 153705	2012-03-30 00:02:55 +00:00
Lang Hames	323a5ced21	Change the constant in this testcase so that it results in a constant pool load. llvm-svn: 153704	2012-03-29 23:52:38 +00:00
Bill Wendling	76fdc4b885	Revert r153694. It was causing failures in the buildbots. llvm-svn: 153701	2012-03-29 23:23:59 +00:00
Jakob Stoklund Olesen	d8af9a5ee1	Invalidate liveness in ARMConstantIslandPass. This pass splits basic blocks to insert constant islands, and it doesn't recompute the live-in lists. No later passes depend on accurate liveness information. This fixes PR12410 where the machine code verifier was complaining. llvm-svn: 153700	2012-03-29 23:14:26 +00:00
Jakob Stoklund Olesen	2f2897372a	Prefer even-odd D-register pairs. We are sometimes allocating from the DPair register class which contains odd-even pairs in addition to the Q registers. Place the Q registers first in the DPair allocation order as they can be copied with a single instruction. The odd-even pairs should only be allocated as a last resort. llvm-svn: 153699	2012-03-29 22:54:32 +00:00
Chandler Carruth	d6735ce57a	Filecheck-ize this test so that it actually tests something reasonable. llvm-svn: 153697	2012-03-29 22:01:41 +00:00
Lang Hames	591cdaf2ee	Try using vmov.i32 to materialize FP32 constants that can't be materialized by vmov.f32. llvm-svn: 153696	2012-03-29 21:56:11 +00:00
Danil Malyshev	3548eaf896	Re-factored RuntimeDyld. Added ExecutionEngine/MCJIT tests. llvm-svn: 153694	2012-03-29 21:46:18 +00:00
Eric Christopher	c13fd6d1e1	Lowercase the tag name to match the rest of dwarf. llvm-svn: 153691	2012-03-29 21:35:05 +00:00
Jim Grosbach	0b0298302c	ARM assembly 'cmp lr, #0' should not encode using 'cmn'. The CMP->CMN alias was matching for an immediate of zero when it should only match for negative values. rdar://11129224 llvm-svn: 153689	2012-03-29 21:19:52 +00:00
Lang Hames	dd1211b4e1	The shuffle scheduler is only available in asserts build - make misched-new.ll testcase require asserts. llvm-svn: 153687	2012-03-29 21:11:47 +00:00
Jakob Stoklund Olesen	caa6bd273f	Handle register copies for the new ARM register classes. ARM recently gained DPair, DTriple, and DQuad register classes. Update copyPhysReg() to handle copies in these register classes. No test case, it is difficult to make the register allocator emit the odd copies reliably. The missing DPair copy caused a failure on partialsums in the nightly test suite. <rdar://problem/11147997> llvm-svn: 153686	2012-03-29 21:10:40 +00:00
Benjamin Kramer	cca02750c8	Drop O4 from the llc manpage, it was removed in r70445. llvm-svn: 153684	2012-03-29 20:40:18 +00:00
Lang Hames	5569ce7d56	Make x86 REP_MOV* and REP_STO instructions use the correct operand sizes in 64-bit mode. llvm-svn: 153680	2012-03-29 19:54:28 +00:00
Danil Malyshev	7a98c9bb9c	Fix missed files in JIT unittests Makefile llvm-svn: 153672	2012-03-29 18:53:15 +00:00
Akira Hatanaka	0603ad8c65	Expand FREM. llvm-svn: 153671	2012-03-29 18:43:11 +00:00
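
Note on a234f2efbd / 892f48058b (suffix sharing): the log describes SequenceToOffsetTable only in prose, so the following is a rough sketch of the idea rather than TableGen's actual code. The class name SuffixSharingTable and all of its members are hypothetical. It packs NUL-terminated strings into one buffer and lets a string that is a suffix of an already stored entry (AX inside EAX) point into that entry instead of taking new space.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical illustration class; this is NOT TableGen's SequenceToOffsetTable.
class SuffixSharingTable {
  std::vector<std::string> Seqs;              // strings registered so far
  std::string Buffer;                         // packed, NUL-terminated storage
  std::map<std::string, std::size_t> Offsets; // string -> offset into Buffer

public:
  void add(const std::string &S) { Seqs.push_back(S); }

  // Lay out all strings, longest first, so that any suffix is placed after
  // the longer entries that could host it.
  void layout() {
    std::vector<std::string> Sorted = Seqs;
    std::stable_sort(Sorted.begin(), Sorted.end(),
                     [](const std::string &A, const std::string &B) {
                       return A.size() > B.size();
                     });
    Buffer.clear();
    Offsets.clear();
    for (const std::string &S : Sorted) {
      if (Offsets.count(S))
        continue; // duplicate entry, already placed
      // Searching for S plus its terminator only matches where S ends an
      // existing entry, i.e. where S is a suffix of something stored.
      std::string Key = S + '\0';
      std::size_t Pos = Buffer.find(Key);
      if (Pos == std::string::npos) {
        Pos = Buffer.size();
        Buffer += Key;
      }
      Offsets[S] = Pos;
    }
  }

  std::size_t offset(const std::string &S) const {
    assert(Offsets.count(S) && "unknown string, or layout() not run yet");
    return Offsets.at(S);
  }

  std::size_t size() const { return Buffer.size(); }
};
```

With AX, EAX and EBX added, AX reuses the tail of EAX, so the buffer holds two entries instead of three. The linear search per string is chosen for brevity only.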


81575 Commits
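
Note on 0539c071ea (inline cost rewrite): the outline in that commit message can be illustrated with a toy model. This is only a sketch under heavy assumptions: ToyBlock, fitsThreshold, and the encoding of known arguments are invented for illustration and are not LLVM types or APIs, per-instruction simplification is reduced to "a branch on a known argument has one live successor", and no bonuses are modeled. What it does show is the breadth-first worklist over potentially live blocks, the skipping of successors proven dead at the call site, and the early exit once the threshold is exceeded.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <vector>

// Toy stand-in for a basic block; not an LLVM type.
struct ToyBlock {
  int Cost;               // summed cost of the block's instructions
  int CondArg;            // argument index the terminator branches on, -1 if none
  std::vector<int> Succs; // successors; for a conditional: {if-true, if-false}
};

// KnownArgs[i] is -1 if argument i is unknown at the call site, else 0 or 1.
// Returns true if the cost of all potentially live blocks stays within
// Threshold; CostOut receives the cost accumulated before returning.
bool fitsThreshold(const std::vector<ToyBlock> &Blocks,
                   const std::vector<int> &KnownArgs, int Threshold,
                   int &CostOut) {
  CostOut = 0;
  if (Blocks.empty())
    return true;
  std::vector<bool> Queued(Blocks.size(), false);
  std::deque<int> Worklist;
  Worklist.push_back(0); // block 0 is the entry block
  Queued[0] = true;
  while (!Worklist.empty()) {
    int B = Worklist.front(); // pop from the front: breadth-first order
    Worklist.pop_front();
    const ToyBlock &BB = Blocks[B];
    CostOut += BB.Cost;
    if (CostOut > Threshold)
      return false; // stop early to bound the analysis cost
    bool Folded = BB.CondArg >= 0 && KnownArgs[BB.CondArg] >= 0;
    for (std::size_t I = 0; I < BB.Succs.size(); ++I) {
      // A branch on a known argument keeps only the successor actually taken.
      if (Folded && I != (KnownArgs[BB.CondArg] ? 0u : 1u))
        continue;
      int S = BB.Succs[I];
      assert(S >= 0 && static_cast<std::size_t>(S) < Blocks.size());
      if (!Queued[S]) {
        Queued[S] = true;
        Worklist.push_back(S);
      }
    }
  }
  return true;
}
```

In this model a callee with a cheap path and an expensive path fits the threshold only when the call site's argument proves the expensive path dead, which is the behavior the commit message says the previous averaged metric could not express.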