llvm-project

Commit Graph

Author	SHA1	Message	Date
Hal Finkel	82656cb200	Implement optimizeCompareInstr for PPC Many PPC instructions have a so-called 'record form' which stores to a specific condition register the result of comparing the result of the instruction with zero (always as a signed comparison). For integer operations on PPC64, this is always a 64-bit comparison. This implementation is derived from the implementation in the ARM backend; there are some differences because PPC condition registers are allocatable virtual registers (although the record forms always use a specific one), and we look for a matching subtraction instruction after the compare (but before the first use) in addition to before it. llvm-svn: 179802	2013-04-18 22:15:08 +00:00
Bill Wendling	6a97e89df6	Make the TargetIndependent flag have the right boolean value. llvm-svn: 179798	2013-04-18 21:45:04 +00:00
Benjamin Kramer	c557828805	X86: Add an SSE2 lowering for 64 bit compares when pcmpgtq (SSE4.2) isn't available. This pattern started popping up in vectorized min/max reductions. llvm-svn: 179797	2013-04-18 21:37:45 +00:00
Eli Bendersky	f6a88ccd73	Fix typo llvm-svn: 179793	2013-04-18 20:49:17 +00:00
Bill Wendling	211316cc54	Cleanup patch: Semantics of parameters named Index and Idx were inconsistent between "include/llvm/IR/Attributes.h", "lib/IR/AttributeImpl.h" and "lib/IR/Attributes.cpp": sometimes these were fixed 1-based indexes of IR parameters (or AttributeSet::ReturnIndex for IR return values or AttributeSet::FunctionIndex for IR functions), other times they were the internal slot for storage in the underlying AttributeSetImpl. I renamed usage of the former to "Index" and usage of the latter to "Slot" ("Slot" was already being used consistently for the latter in a subset of cases) Patch by Stephen Lin! llvm-svn: 179791	2013-04-18 20:17:28 +00:00
Bill Wendling	e3a60a9bc0	This patch addresses two cleanup issues: 1. Verify::VerifyParameterAttrs in "lib/IR/Verifier.cpp" and AttrBuilder::removeFunctionOnlyAttrs in "lib/IR/Attributes.cpp" (only called by Verify::VerifyFunctionAttrs) separately maintained a list of function-only attribute types. I've consolidated the logic into a new function used for both cases in "lib/IR/Verifier.cpp", so this logic is in one place (other than the AsmParser front-end) 2. Various functions in "lib/IR/Verifier.cpp" passed AttributeSet around by reference needlessly, as it's just a handle to an immutable pimpl body. Patch by Stephen Lin! llvm-svn: 179790	2013-04-18 20:15:25 +00:00
Dmitri Gribenko	d29ea04446	Fix a -Wdocumentation warning llvm-svn: 179789	2013-04-18 20:13:04 +00:00
Anat Shemer	5570318f43	In the function InstCombiner::visitExtractElementInst() removed the limitation that extract is promoted over a cast only if the cast has only one use. llvm-svn: 179786	2013-04-18 19:56:44 +00:00
Tom Stellard	62c03207d5	C API: Fix coding style llvm-svn: 179785	2013-04-18 19:50:53 +00:00
Anat Shemer	0c95efad7e	Added a function scalarizePHI() that sclarizes a vector phi instruction if it has only 2 uses: one to promote the vector phi in a loop and the other use is an extract operation of one element at a constant location. llvm-svn: 179783	2013-04-18 19:35:39 +00:00
Bill Wendling	c62789f47e	Fix comment. Patch by Stephen Lin. llvm-svn: 179780	2013-04-18 18:30:16 +00:00
Rafael Espindola	56f976f6bd	At Jim Grosbach's request detemplate Object/MachO.h. We are still able to handle mixed endian objects by swapping one struct at a time. llvm-svn: 179778	2013-04-18 18:08:55 +00:00
Chris Lattner	8cf09416ea	Fix a comment, PR15777. llvm-svn: 179775	2013-04-18 17:42:14 +00:00
Derek Schuff	a403d243d1	Allow misaligned stores in x86 fast-isel. In X86FastISel::X86SelectStore(), improperly aligned stores are rejected and handled by the DAG-based ISel. However, X86FastISel::X86SelectLoad() makes no such requirement. There doesn't appear to be an x86 architectural correctness issue with allowing potentially unaligned store instructions. This patch removes this restriction. Patch by Jim Stichnot. llvm-svn: 179774	2013-04-18 17:41:08 +00:00
Arnold Schwaighofer	4cd6aa110c	LoopVectorizer: Recognize min/max reductions A min/max operation is represented by a select(cmp(lt/le/gt/ge, X, Y), X, Y) sequence in LLVM. If we see such a sequence we can treat it just as any other commutative binary instruction and reduce it. This appears to help bzip2 by about 1.5% on an imac12,2. radar://12960601 llvm-svn: 179773	2013-04-18 17:22:34 +00:00
Eli Bendersky	c0ef3d8514	Fix grammar in LLVMBuild.rst llvm-svn: 179768	2013-04-18 16:39:32 +00:00
Chad Rosier	db003998fb	[ms-inline asm] Simplify some logic and add a FIXME for unhandled unary minus. llvm-svn: 179765	2013-04-18 16:28:19 +00:00
Chad Rosier	c2f055d114	Make this private method. llvm-svn: 179764	2013-04-18 16:13:18 +00:00
Eli Bendersky	97ad9245f1	Fixes to LangRef.rst: incorrect attributes syntax and misplaced 'nobuiltin' Patch by Stephen Lin llvm-svn: 179763	2013-04-18 16:11:44 +00:00
Chad Rosier	2045b01171	Fix comment spacing. llvm-svn: 179761	2013-04-18 15:19:45 +00:00
Benjamin Kramer	8df2cfb858	LoopVectorize: Use a set to avoid longer cycles in the reduction chain too. Fixes PR15748. llvm-svn: 179757	2013-04-18 14:29:13 +00:00
Hao Liu	a2ff69863e	Fix for PR14824, An ARM Load/Store Optimization bug llvm-svn: 179751	2013-04-18 09:11:08 +00:00
David Majnemer	81af06e003	Revert "Combine bit test + conditional or into simple math" It is causing stage2 builds to fail, let's get them running again. llvm-svn: 179750	2013-04-18 08:42:33 +00:00
David Majnemer	bdf0caf6b1	Combine bit test + conditional or into simple math Simplify: (select (icmp eq (and X, C1), 0), Y, (or Y, C2)) Into: (or (shl (and X, C1), C3), y) Where: C3 = Log(C2) - Log(C1) If: C1 and C2 are both powers of two llvm-svn: 179748	2013-04-18 07:30:07 +00:00
Michael Gottesman	323964ca9e	[objc-arc] Do not mismatch up retains inside a for loop with releases outside said for loop in the presense of differing provenance caused by escaping blocks. This occurs due to an alloca representing a separate ownership from the original pointer. Thus consider the following pseudo-IR: objc_retain(%a) for (...) { objc_retain(%a) %block <- %a F(%block) objc_release(%block) } objc_release(%a) From the perspective of the optimizer, the %block is a separate provenance from the original %a. Thus the optimizer pairs up the inner retain for %a and the outer release from %a, resulting in segfaults. This is fixed by noting that the signature of a mismatch of retain/releases inside the for loop is a Use/CanRelease top down with an None bottom up (since bottom up the Retain-CanRelease-Use-Release sequence is completed by the inner objc_retain, but top down due to the differing provenance from the objc_release said sequence is not completed). In said case in CheckForCFGHazards, we now clear the state of %a implying that no pairing will occur. Additionally a test case is included. rdar://12969722 llvm-svn: 179747	2013-04-18 05:39:45 +00:00
Michael Gottesman	9e5181393a	Removed trailing whitespace. llvm-svn: 179746	2013-04-18 04:34:11 +00:00
Michael Gottesman	a15ab25238	Streamline arc-annotation test (removing some cases which do not add any extra coverage) and set it up to use FileCheck variables to make the test more robust. llvm-svn: 179745	2013-04-18 04:34:06 +00:00
Akira Hatanaka	89af58991a	[mips] Rename function. llvm-svn: 179741	2013-04-18 01:00:46 +00:00
Akira Hatanaka	59bfaf774b	[mips] DSP-ASE move from HI/LO register instructions. llvm-svn: 179739	2013-04-18 00:52:44 +00:00
Jack Carter	d0bd642464	Mips assembler: formatting and comment changes. This patch should not have any functional changes. llvm-svn: 179737	2013-04-18 00:41:53 +00:00
Bill Wendling	877cf534ab	Add an option `-enable-old-style-attr-syntax' to print out function attributes in the "old" style. It's sometimes beneficial to emit a testcase with the old style attribute syntax. Allow someone to do this. <rdar://problem/13563209> llvm-svn: 179735	2013-04-17 23:35:59 +00:00
Michael Gottesman	4e88ce68ae	[objc-arc] Added annotation option to only emit annotations for a specific ssa identifier. llvm-svn: 179729	2013-04-17 21:59:41 +00:00
Rafael Espindola	035b41653e	Two small cleanups for ELF's templates. * We only ever specialize these templates with an instantiation of ELFType, so we don't need a template template. * Replace LLVM_ELF_COMMA with just passing the individual parameters to the macro. This requires a second macro for when we only have ELFT, but that is still a small win. llvm-svn: 179726	2013-04-17 21:20:55 +00:00
Peter Collingbourne	2f495b93ee	Add support for subsections to the ELF assembler. Fixes PR8717. Differential Revision: http://llvm-reviews.chandlerc.com/D598 llvm-svn: 179725	2013-04-17 21:18:16 +00:00
Chad Rosier	6241c1a63d	[ms-inline asm] These should be int64_t, not uint64_t. llvm-svn: 179724	2013-04-17 21:14:38 +00:00
Michael Gottesman	adb921affa	Fixed typo. llvm-svn: 179721	2013-04-17 21:03:53 +00:00
Chad Rosier	3124627aa8	[ms-inline asm] Add support for the minus unary operator. Previously, we were unable to handle cases such as __asm mov eax, 8*-8. This patch also attempts to simplify the state machine. Further, the error reporting has been improved. Test cases included, but more will be added to the clang side shortly. rdar://13668445 llvm-svn: 179719	2013-04-17 21:01:45 +00:00
Michael Gottesman	6806b51ad2	[objc-arc] Added descriptions for EnableARCAnnotations, EnableCheckForCFGHazards, EnableARCOptimizations. llvm-svn: 179718	2013-04-17 20:48:03 +00:00
Michael Gottesman	ffef24f964	[objc-arc] Added an option to arc-annotations for turning off CheckForCFGHazard. llvm-svn: 179717	2013-04-17 20:48:01 +00:00
Eli Bendersky	239a78b835	More consistent formatting and tidying-up llvm-svn: 179716	2013-04-17 20:17:08 +00:00
Eli Bendersky	24a36eb331	This patch teaches x86 fast-isel to generate the native div/idiv instructions for the sdiv/srem/udiv/urem bitcode instructions. This is done for the i8, i16, and i32 types, as well as i64 for the x86_64 target. Patch by Jim Stichnoth llvm-svn: 179715	2013-04-17 20:10:13 +00:00
Arnold Schwaighofer	c0c7ff4ac0	X86 cost model: Exit before calling getSimpleVT on non-simple VTs getSimpleVT can only handle simple value types. radar://13676022 llvm-svn: 179714	2013-04-17 20:04:53 +00:00
Bill Wendling	9ca12c137f	A limit of 500 was still a bit too high for some tests. PR15000 has a testcase where the time to compile was bordering on 30s. When I dropped the limit value to 100, it became a much more managable 6s. The compile time seems to increase in a roughly linear fashion based on increasing the limit value. (See the runtimes below.) So, let's lower the limit to 100 so that they can get a more reasonable compile time. Limit Value Time ----------- ---- 10 0.9744s 20 1.8035s 30 2.3618s 40 2.9814s 50 3.6988s 60 4.5486s 70 4.9314s 80 5.8012s 90 6.4246s 100 7.0852s 110 7.6634s 120 8.3553s 130 9.0552s 140 9.6820s 150 9.8804s 160 10.8901s 170 10.9855s 180 12.0114s 190 12.6816s 200 13.2754s 210 13.9942s 220 13.8097s 230 14.3272s 240 15.7753s 250 15.6673s 260 16.0541s 270 16.7625s 280 17.3823s 290 18.8213s 300 18.6120s 310 20.0333s 320 19.5165s 330 20.2505s 340 20.7068s 350 21.1833s 360 22.9216s 370 22.2152s 380 23.9390s 390 23.4609s 400 24.0426s 410 24.6410s 420 26.5208s 430 27.7155s 440 26.4142s 450 28.5646s 460 27.3494s 470 29.7255s 480 29.4646s 490 30.5001s llvm-svn: 179713	2013-04-17 20:02:32 +00:00
Quentin Colombet	6f03f624df	Fix treatment of ARM unallocated hint instructions. The reference manual defines only 5 permitted values for the immediate field of the "hint" instruction: 1. nop (imm == 0) 2. yield (imm == 1) 3. wfe (imm == 2) 4. wfi (imm == 3) 5. sev (imm == 4) Therefore, restrict the permitted values for the "hint" instruction to 0 through 4. Patch by Mihail Popa <Mihail.Popa@arm.com> llvm-svn: 179707	2013-04-17 18:46:12 +00:00
Bill Wendling	b544363d0e	Appease a gcc warning about an overflow in a constant conversion. llvm-svn: 179703	2013-04-17 18:26:02 +00:00
Benjamin Kramer	c7400488b9	Don't store AttributeSet::FunctionIndex as an int. GCC complains: Core.cpp:1449:27: warning: overflow in implicit constant conversion [-Woverflow] I'm not sure if that's really a problem here, but using the enum type is better style anyways. llvm-svn: 179696	2013-04-17 17:51:19 +00:00
Ulrich Weigand	d0585d8686	PowerPC: Mark some more patterns as isCodeGenOnly. A couple of recently introduced conditional branch patterns also need to be marked as isCodeGenOnly since they cannot be handled by the asm parser. No change in generated code. llvm-svn: 179690	2013-04-17 17:19:05 +00:00
Eli Bendersky	ca38084d57	Make formatting more consistent and tidy-up. llvm-svn: 179689	2013-04-17 17:17:20 +00:00
Vincent Lejeune	2d5c341cee	R600: Make Export Instruction not duplicable llvm-svn: 179686	2013-04-17 15:17:39 +00:00
Vincent Lejeune	218093e834	R600: Export is emitted as a CF_NATIVE inst llvm-svn: 179685	2013-04-17 15:17:32 +00:00

1 2 3 4 5 ...

91225 Commits